Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Lab 4: Cleaning/Munging Dataframes # Function 1: Create function called readStates: #Step 1: Create function (named readStates) to read a CSV file into R:within the

Lab 4: Cleaning/Munging Dataframes

# Function 1: Create function called "readStates":

#Step 1: Create function (named readStates) to read a CSV file into R:within the Function 1

#Q1. You need to read a URL, not a local file to your computer.

#Q2. The file is a dataset on state populations (within the United States).

# The URL is:http://www2.census.gov/programs-surveys/popest/tables/2010-2011/state/totals/nst-est2011-01.csvLinks to an external site.

# In case you have trouble using this url, use this csv filenst-est2011-01.csv

andread the file locally using read.csv function.

#Step 2: Clean the dataframe:within Function 1

#Q3. Note the issues that need to be fixed (removing columns, removing rows, changing column names).

#Q4. Within your function, make sure there are 51 rows (one per state + the district of Columbia). Make sure there are only 5 columns with the columns having the following names (stateName, Census, Estimates, Pop2010, Pop2011).

#Q5. Make sure the last four columns are numbers (i.e. not strings).

#Step 3: Store and explore the dataset:outside of Function 1

#Q6. Store the dataset into a dataframe, called dfStates.

# When you run the following, it should print a clean dataframe. Please include theoutputof "dfStates" in the compiled file by running dfStates as below.

dfStates <- readStates(urlToRead)

dfStates

#Q7. Test your dataframe by calculating the mean for the 2011 data, by doing (include your output):

mean(dfStates$Pop2011)

# You should get an answer of6,109,645

#Step 4: Find the state with the highest population:outside the Function 1

#Q8. Based on the 2011 data, what is the population of the state with the highest population? What is the name of that state, and what is the value of the population?

#Q9. Sort the data, in decreasing order, based on the 2011 data.

# Function 2: Create function called "Distribution"

#Step 5: Explore the distribution of the states: You need to create newfunction called "Distribution"

#Q10. You will write function to calculate percentage of states that have population that is lower than the average.The function (function name: "Distribution") takes two parameters. The first is a vector and the second is a number. For example, Distribution <- function(vector, number). This step is just a setup for the following instruction.

# The function will return the percentage of elements within the vector that is less than the number (i.e. cumulative distribution below the value provided). For example,

# (1) Think about this: You only keep the elementswithin the vector that are less than the number, and store the number of eligible elements into the variable "count". Populate XXXX to complete this line of code:

count <- length(vector[XXXX])

# (2) Then, you will calculate the percentage and return the results. Populate XXXX to complete this line of code:

return((count/XXXX) *100)

# ****For example, if the vector had 5 elements (1,2,3,4,5), with 2 being the number passed into the function, the function will return 20 (since 20% of the numbers--that is 1-- is below 2). Again, if you pass number 5, then the function should return 80, since 80% of the numbers--1,2,3, and 4 are below 5. Pass these numbers using your function and see if your function returns the correct values.

# (3) Test the function with the vector "dfStates$Pop2011", and the mean of "dfStates$Pop2011." *** you should get 66.66667 as a result. There are many ways to write this function (described in point 10) - so please try to write multiple versions of this function - which do you think is the best?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Finite Math And Applied Calculus

Authors: Stefan Waner, Steven Costenoble

6th Edition

1285415132, 9781285415130

More Books

Students also viewed these Mathematics questions