Question
Download the file "Data for Assignment" from LMS. Select a simple random sample of size 1000 from the given database. Open the Excel sheet "Dataset for Assignment". Press the function key F9 once or twice and make sure that the numbers in columns A to L change automatically. Copy the entire range A1:L1001 into a new Excel sheet as values. This is your own unique dataset, to be used for both individual assignments.

The data refer to the performance of beneficiaries in the adult education program. The variables are self-explanatory. The variables "WRITE", "READ", "MATH" and "TOTAL" are the scores in the respective components and the total score. The "TOTAL" score is obtained by adding the three scores "WRITE", "READ" and "MATH". Check whether there is any inconsistency in the "TOTAL" score and, if so, replace it with the sum of "WRITE", "READ" and "MATH". You will have to clean up the data by replacing #NULL! with blank or zero.

Details of the other variables are given below:
GENDER: T: Male; F: Female
AGE: Age in completed years
CASTE: OT: Other Caste; SC: Scheduled Caste; ST: Scheduled Tribe
RELIGN: Religion: C: Christian; H: Hindu; M: Muslim
MTOUNGUE: Mother Tongue: D: Urdu; K: Kannada; T: Tamil; U: Telugu
OCCU: Occupation: A: Agriculture; B: Small Business; C: Cooli; H: Housewife; U: Other
INCOME: Income in Rs. per month
AREA: Agricultural land in acres

Draw a histogram for the TOTAL score (you have to define appropriate class intervals) and comment on the distribution of the TOTAL score.

We take a simple random sample of 16 learners. What is the probability that the sample average of this sample (X̄) is more than 60?

Create a new variable called "PERFORMANCE". If a learner's TOTAL score is above the sample average, label the PERFORMANCE as "HIGH"; otherwise, label it "LOW". Using the new variable "PERFORMANCE" and GENDER, comment on the relative performance of Male and Female learners.
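The cleaning, labelling and probability steps above can be sketched in Python instead of Excel. This is only an illustration under stated assumptions: the helper names (clean_score, fix_total, label_performance, prob_mean_exceeds) and the three sample rows are made up, the mean and standard deviation of your own dataset are treated as stand-ins for the population values, and the sample mean of 16 learners is assumed to be approximately normal with standard error sigma / sqrt(n).

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def clean_score(value):
    """Turn a raw cell into a float, treating '#NULL!' and blanks as 0."""
    if value is None:
        return 0.0
    text = str(value).strip()
    if text in ("", "#NULL!"):
        return 0.0
    return float(text)


def fix_total(row):
    """Return a copy of the row with cleaned scores and TOTAL recomputed."""
    fixed = dict(row)
    for col in ("WRITE", "READ", "MATH"):
        fixed[col] = clean_score(row.get(col))
    # Overwrite TOTAL so it always equals WRITE + READ + MATH.
    fixed["TOTAL"] = fixed["WRITE"] + fixed["READ"] + fixed["MATH"]
    return fixed


def label_performance(totals):
    """Label each score HIGH if it is above the sample average, else LOW."""
    avg = mean(totals)
    return ["HIGH" if t > avg else "LOW" for t in totals]


def prob_mean_exceeds(mu, sigma, n, threshold):
    """P(sample mean > threshold), assuming the mean is ~ Normal(mu, sigma/sqrt(n))."""
    standard_error = sigma / sqrt(n)
    return 1.0 - NormalDist(mu, standard_error).cdf(threshold)


# Hypothetical rows, only to show the mechanics; use your own 1000 rows instead.
rows = [
    {"WRITE": "20", "READ": "25", "MATH": "#NULL!", "TOTAL": "60"},
    {"WRITE": "30", "READ": "28", "MATH": "22", "TOTAL": "80"},
    {"WRITE": "10", "READ": "", "MATH": "15", "TOTAL": "25"},
]
cleaned = [fix_total(r) for r in rows]
totals = [r["TOTAL"] for r in cleaned]
labels = label_performance(totals)

# Probability that the average of 16 learners exceeds 60.
p = prob_mean_exceeds(mean(totals), stdev(totals), 16, 60)
print(totals, labels, round(p, 4))
```

With the real data, a cross-tabulation of the labels against GENDER (counts and row percentages) gives the comparison of Male and Female learners that the question asks for.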
Using the sample that you have just selected, calculate a 95% two-sided confidence interval for the following:
A. Mean of TOTAL score
B. Mean of INCOME
C. Proportion of "HIGH" performers among Male learners
D. Proportion of "HIGH" performers among Female learners

If we want to estimate a 95% confidence interval for the population mean of "TOTAL" marks within 5 marks, what is the sample size required?

If we want to estimate a 95% confidence interval for the proportion of "HIGH" performers within 0.05, what is the appropriate sample size?
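The interval and sample-size calculations can be sketched as follows. This is a minimal illustration, not the assignment's prescribed method: it assumes the large-sample normal (z) approximation, which is reasonable for n = 1000, uses the simple Wald interval for proportions, and takes p = 0.5 as the conservative planning value when no prior estimate is available. The function names and the example inputs (including sigma = 20) are hypothetical.

```python
from math import ceil, sqrt
from statistics import NormalDist, mean, stdev


def z_value(confidence=0.95):
    """Two-sided critical value of the standard normal distribution."""
    return NormalDist().inv_cdf(0.5 + confidence / 2.0)


def mean_ci(values, confidence=0.95):
    """Large-sample confidence interval for a mean: xbar +/- z * s / sqrt(n)."""
    margin = z_value(confidence) * stdev(values) / sqrt(len(values))
    centre = mean(values)
    return centre - margin, centre + margin


def proportion_ci(successes, n, confidence=0.95):
    """Wald confidence interval for a proportion: p +/- z * sqrt(p(1-p)/n)."""
    p_hat = successes / n
    margin = z_value(confidence) * sqrt(p_hat * (1.0 - p_hat) / n)
    return p_hat - margin, p_hat + margin


def sample_size_mean(sigma, margin, confidence=0.95):
    """Smallest n with z * sigma / sqrt(n) <= margin."""
    return ceil((z_value(confidence) * sigma / margin) ** 2)


def sample_size_proportion(margin, p=0.5, confidence=0.95):
    """Smallest n with z * sqrt(p(1-p)/n) <= margin; p=0.5 is the worst case."""
    return ceil(z_value(confidence) ** 2 * p * (1.0 - p) / margin ** 2)


# Hypothetical inputs, only to show the calls; substitute your own figures.
example_totals = [45.0, 80.0, 25.0, 62.0, 58.0, 71.0, 39.0, 66.0]
print(mean_ci(example_totals))
print(proportion_ci(50, 100))             # e.g. 50 HIGH performers out of 100
print(sample_size_mean(sigma=20, margin=5))
print(sample_size_proportion(margin=0.05))
```

For the mean of TOTAL, replace sigma with the standard deviation from your own sample. For the proportion, either keep p = 0.5 or plug in the proportion of "HIGH" performers observed in your sample, which gives a smaller required n whenever that proportion is not 0.5.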