Purdue STAT 350 Lab 1 Introduction to Statistical Packages Answer (2018)
BUY HERE⬊
STAT 350 Lab 1 Introduction to Statistical Packages Solution (2018) - Purdue University
Lab 1 (100 points): Introduction to Statistical Packages Objectives: Loading files, cleaning and manipulating the data.
A. (10 pts.) Online Prelab
B. (90 points) US Demographic, Crime, and Test-Score Data. This semester, we are going to be exploring some Demographic, Crime, and Test-Score data for counties across the United States. The data we will analyze are in the data set “USData.txt”. The variable names and definitions are listed in the file “US_Data set_Definition.pdf”. In this lab, we are going to explore what is included in the data set, load it into the software package, and do some basic manipulations.
(10 points) How many variables does this data set contain? Which are categorical or
qualitative variables and which are quantitative or numeric variables? Besides looking at the
documentation file provided, you might want to look at the data file itself in a spreadsheet,
notepad or the software package (R only).
(16 pts.) Write two analysis questions that can be answered from the data provided. In the
project due at the end of the semester, your group will have to pose general questions that
can be answered by three different statistical methods. You will be allowed to change the
questions when you start the project, but this will get you thinking of possibilities.
3. (20 points) Load the data into your software package, and provide the programming code used to do so. If you used menu options to load the data, rather than code, please describe the procedure you followed. No output is required.
4.
(19 points) Are there missing values (NA) in the data set? If so, please create a new data set by removing any rows that contain one or more NAs from the original data set. Please save this new data set to your computer and/or ITaP folder; this will be the data set that you will be using for the rest of the semester.
5. (10 points) For readability, we want to transform the values of "UrbanIndicator" from a number to what the number represents. That is, please create a new variable called "UrbanNew" such that::
If UrbanIndicator is "1", UrbanNew is "Urban" and If UrbanIndicator is "1", UrbanNew is ""Rural"
a. (5 pts.) Code. Remember that all code needed to answer part b) needs to be included in this part.
6. (15 points) We are going to show that "PopulationDensity" can be calculated from other variables in the data set.
a. (5 pts.) Write down the equation relating "PopulationDensity" to “Population” and “LandArea.”
Comments
Post a Comment