Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Part 1 - Cleaning without dplyr Instructions: 1. Remove the Region ID and SizeRank columns 2. Reshape the data appropriately using the gather() function 3.

image text in transcribed
image text in transcribed
image text in transcribed
Part 1 - Cleaning without dplyr Instructions: 1. Remove the Region ID and SizeRank columns 2. Reshape the data appropriately using the gather() function 3. Remove the X from the variable containing the date information 4. Using the separate() function, split the column containing the date information into Year and Month columns 5. Convert Year to a numeric variable 6. Convert Month to a numeric variable 7. Create a new variable, Date, defined as Year+Month/12 (I want this variable for easy plotting) 8. Remove any rows where the median spe price is missing (hint: you can use is na(x) to identify missing values, or lisna(x) to identityhuon-missing value, where is the column you are interested in, bot will return a TRUE/FALSE vector) 9. Order the data by state and date (hint: data-data Corder(variablet, variable2), )) 10. Print out the data structure using the str() function Each of the above should be one line of code 11100 Your code here Part 2 - Cleaning with dplyr Instructions: 1. Remove the Region ID and SizeRank columns 2. Reshape the data appropriately using the gather() function 3. Remove the x from the variable containing the date information 4. Using the separate() function, split the column containing the date information into Year and Month columns 5. Convert Year and Monthto numeric variables, and then create a new variable, Date, defined as Year+Month/12 6. Remove any rows where the median sale price is missing 7. Order the data by state and date When finished, connect your dplyr chain to the original read.cav() statement, so your data reads in "cleaned" and ready to go, like this: zillow % select() %>% mutate()... zillow Your code here Part 2 - Cleaning with dplyr Instructions: 1. Remove the Region ID and SizeRank columns 2. Reshape the data appropriately using the gather() function 3. Remove the x from the variable containing the date information 4. Using the separate() function, split the column containing the date information into Year and Month columns 5. Convert Year and Monthto numeric variables, and then create a new variable, Date, defined as Year+Month/12 6. Remove any rows where the median sale price is missing 7. Order the data by state and date When finished, connect your dplyr chain to the original read.cav() statement, so your data reads in "cleaned" and ready to go, like this: zillow % select() %>% mutate()... zillow

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Managing Your Information How To Design And Create A Textual Database On Your Microcomputer

Authors: Tenopir, Carol, Lundeen, Gerald

1st Edition

1555700233, 9781555700232

More Books

Students also viewed these Databases questions

Question

How was your life influenced by those events?

Answered: 1 week ago

Question

Which of these influenced your life most dramatically?

Answered: 1 week ago