Question
Download the Pisa2009 Dataset from the D2L. The Programme for International Student Assessment (PISA) is a test given every three years to 15yearold students from
Download the Pisa2009 Dataset from the D2L. The Programme for International Student Assessment (PISA) is a test given every three years to 15yearold students from around the world to evaluate their performance in mathematics, reading, and science. This test provides a quantitative way to compare the performance of students from different parts of the world. In this homework assignment, we will predict the reading scores of students from the United States of America on the 2009 PISA exam. The dataset contains information about the demographics and schools for American students taking the exam, derived from 2009 PISA PublicUse Data Files distributed by the United States National Center for Education Statistics (NCES). Each row in the dataset represents one student taking the exam. The datasets have the following variables:
grade: The grade in school of the student (most 15yearolds in America are in 10th grade)
male: Whether the student is male (1/0)
raceeth: The race/ethnicity composite of the student
preschool: Whether the student attended preschool (1/0)
expectBachelors: Whether the student expects to obtain a bachelor's degree (1/0)
motherHS: Whether the student's mother completed high school (1/0)
motherBachelors: Whether the student's mother obtained a bachelor's degree (1/0)
motherWork: Whether the student's mother has parttime or fulltime work (1/0)
fatherHS: Whether the student's father completed high school (1/0)
fatherBachelors: Whether the student's father obtained a bachelor's degree (1/0)
fatherWork: Whether the student's father has parttime or fulltime work (1/0)
selfBornUS: Whether the student was born in the United States of America (1/0)
motherBornUS: Whether the student's mother was born in the United States of America (1/0)
fatherBornUS: Whether the student's father was born in the United States of America (1/0)
englishAtHome: Whether the student speaks English at home (1/0)
computerForSchoolwork: Whether the student has access to a computer for schoolwork (1/0)
read30MinsADay: Whether the student reads for pleasure for 30 minutes/day (1/0)
minutesPerWeekEnglish: The number of minutes per week the student spend in English class
studentsInEnglish: The number of students in this student's English class at school
schoolHasLibrary: Whether this student's school has a library (1/0)
publicSchool: Whether this student attends a public school (1/0)
urban: Whether this student's school is in an urban area (1/0)
schoolSize: The number of students in this student's school
readingScore: The student's reading score, on a 1000point scale
Looking at the structure of the data set, can I only consider grade and raceeth as my dummy variables in r when building a model?
Thank you for your help.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started