Question
By Using R language Use the Adults.csv from https://archive.ics.uci.edu/ml/datasets/Adult 1 The data file contains few missing entries (denoted by ?). Find out how many rows
By Using R language
Use the Adults.csv from https://archive.ics.uci.edu/ml/datasets/Adult
1\ The data file contains few missing entries (denoted by ?). Find out how many rows have the missing values. Describe your methodology for dealing with missing values.
2\ Derive descriptive statistics regarding this dataset, including measures of central tendency for the following variables.
3\ Represent, graphically, the distribution of the continuous variables and discuss whether the data is skewed, contain outliers, or normally distributed. Explain the effect of outliers, if any, on the distribution of the data.
4\ Find the correlation of the following variables with the income and justify your answer. Represent the correlation graphically. (Education - Work Class - Occupation - Gender )
5\ Among the variables Capital Gain and Capital Loss, find the predictor variable of income, and the direction and strength of the linear relationship. Justify your answer graphically and explain using your own words.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started