Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

please help me. 'v'a rious demographic and cardiovascular risk factors were collected as a part of the Prevention of REnal and Vascular END-stage Disease [PREVENDII

please help me.

image text in transcribedimage text in transcribedimage text in transcribedimage text in transcribedimage text in transcribedimage text in transcribedimage text in transcribedimage text in transcribed
'v'a rious demographic and cardiovascular risk factors were collected as a part of the Prevention of REnal and Vascular END-stage Disease [PREVENDII study, which took plaoe in the Netherlands. The initial study population began as 3,592 participants aged 28-?5 years who took a first survey in HST-1998. Participants were followed over time; 6,894 participants took a second survey in 2001-2003, and 5,862 completed the third survey in 2003- 2005. In the third survey, measurement of cognitive function was added to the study protocol. Data from 4,095 individuals who completed cognitive testing were made available upon request. As adults age, cognitive function changes over time, largely due to various oerebrovascular and neu rodegenerative changes. It is thought that oognitive decline is a long-term process that may start as early as 45 years of age. The Ruff Figural Fluency Test [RFFTI is one measure of oognitive function that provides information about cognitive abilities such as planning and the ability to switch between different tasks. The test oonsists of drawing as many unique designs as possible from a pattern of dots, under timed conditions: soores range from 0 to 1?5 points [worst and best score, respectively]. Purpose of this lab: FIFFI' scores for a random sample of 32 individuals from this population are provided in the table below. You will be using your knowledge of statistical techniques and bivariate data [in this case Age in years and FtFl-'l' test see re) to investigate the relations hip, if any, between thie two variables. PART 1 1. Make a scatterplot of your 32 points, remembering to title the two axes. Use age as the explanatory variable. 180 160 140 120 100 80 60 40 20 10 20 30 40 50 60 70 80 90 2. Visually, does there appear to be a correlation between the two variables? What makes you say that? PART 2 3. a) Using a ruler, draw a line on the graph that represents where you think the best fit line would be. Use a pencil (not pen) and ruler. b) Select two convenient points anywhere on your line and put small circles around them. ("convenient" meansPART 2 3. a) Using a ruler, draw a line on the graph that represents where you think the best fit line would be. Use a pencil (not pen) and ruler. b) Select two convenient points anywhere on your line and put small circles around them. ("convenient" means the points have fairly easy coordinates to estimate.) c) Now, find the equation of your line using the two points you circled. Start by finding the slope, and then find the y-intercept. Show all of your algebra below. Write your final linear function in the form y = mx + b. Write the two points to intend to use here: (. _) and (. 4. Enter your data into L1 and L2, and then find the value of the correlation coefficient rounded to 3 decimal places. Write your answer here: r = 5. Using the data you've already entered into L1 and L2, use 2-Var Stats to find the following statistics: X = Sx = y : Sy = 6. Using the values above as well as your value for r, show how to find the slope of the least squares line (linear regression line) without having the calculator compute it. Then show how to find the y-intercept of the least squares line. Show all of your algebra, and write the final equation of the least squares line in the form y = mx + b.7. Using StatCrunch, do the following: a) Enter your 32 data points. Label the first column Age(years) and the second column RFFT Score. b) In rows 33 and 34, enter the x- and y-coordinates of the two points that used to make your estimated line in 3(c). c) In the third column of your StatCrunch spreadsheet, change the label heading var3 to the word Data. For your first 32 data points, type the abbreviation orig signifying these are the original sample data. For the last two points (rows 33 and 34), type mine in the third column to signify that these two data points came from your estimated line. d) At the top of the StatCrunch screen, select Graph + Scatter Plot. Then fill in the screen options as shown below. Once to have set the option, click on the Compute! button. Print a copy of your finalized graphs, and attach it to this project. Scatter Plot X variable: e) Does the least squares graph and your graph look pretty much alike? Age(years) How do they differ? Be specific. Y variable: RFFT Score + Where: optional- f) Based on the least squares model, how much does RFFT score differ, on Group by: Data average, between an individual who is 60 years old versus an individual who is 50 years old? How did you arrive at your answer? Grouping options: Color points by group label Overlay polynomial order: 1 + Best fit g) Is it valid to use the linear model to estimate the RFFT score for an O y=c0 + cl"x individual who is 20 years old? Explain your answer.PART 3 B. Using your sample statistics, conduct a formal hypothesis test of the claim that there is a linear correlation between the variables age and RFFT Score. Use an o = 0.05 significance level, and be sure to state your null and alternative hypotheses as well as show the calculation of the P-value. Show all steps and your conclusion below. 9. Compute the value of the coefficient of determination, and explain exactly what that value means. If you are unsure, read page 466 in your text as a refresher. Use complete sentences. CONCLUSION 10. Write a short paragraph addressing the question "Does the information obtained by doing parts 1 -3 of this investigation basically tell the same story about the relationship between age and cognitive ability? If so, how. If not, explain the discrepancies. Use complete sentences.The vertical distance between a point on a scatterplot and the predicted value on the regression line is called the residual for the point. For an observation in a sample like (I; J1] where f; is the predicted value according to the line 5' = mx + b, the residual is the value c; = 3r; 5i, . 5o suppose your rst data point, {1:131} was [40,911] and your least squares {regression} line was determined to be 5' = 1.4'x + 145, then 331 = 1.4{4{I) + 145 = 39, and the rst residual would be 91 = y1 jil = 9D - 39 = 1 To nd other residuals, you follow the same procedure. The least squares regression line is the line which minimizes the sum of the squares of thie residuals for all the points in the scatterplot. Cine tool that statisticians often use to analyze the correlation between two variables and the effectiveness of the regression line is to test the set of all n residuals II i.e. e1, Ba, Ba, . . ., en} computed from a sample of n data points for normality. That is, do the set of residuals form a Normal distribution? If they do, this indicates that the two variables being investigated are probably linearly related and that using the least squares line as a predictor is safe. Purpose of this lab: This lab uses the same age vs RFFI' scores data for a random sample of 32 individuals that you were given previously. The goal of this lab, in particular, is to investigate the normality of the residuals computed using the least sq uari line you developed already. 1. Enter the data you were given from the previous lab into L1Ilage] and L2 {RFFI' Score} on your calculator. 2. Compute the least squares line using your calculator. Write the equation below: $5: 3. For each data point, you are going to compute the corresponding residual, E; = 3?; - g I will show you a quick way to do this using your calculator. First, enter the x- and y-coo rdinates for your 32 points {from your previous la b.) 3. For each data point, you are going to compute the corresponding residual, et = y - yi I will show you a quick way to do this using your calculator. First, enter the x- and y-coordinates for your 32 points (from your previous lab.) X y e X y e 4. How to calculate the residuals for your 32 sample data points: Right now you have your x-values stored in L1 and your y-values stored in L2. In list 3 (L3), we are going to teach the calculator how to compute the estimated y-values (i.e. >; ) for each one of the x-values in our data set. Here's how: a) I've shown my first 2 data points stored in L1 and L2. Suppose my regression line equation was D = -1.6x + 150. Using the arrow key and move up to the header of L3. Press CLEAR and ENTER to clear any data that might be in L3 already. Now use the arrow key once more to get to the header of L3. While you're positioned in the header box, type your regression equation like this: 1.6(L1) + 150. The formula that you're typing will show in the function bar at the bottom of the screen. (See Figure A below.) b) Now go to the header of L4 using your arrow key. We're going to compute the residual for each point in our data by typing L2 - L3. (See Figure B below.)4. How to calculate the residuals for your 32 sample data points: Right now you have your x-values stored in L1 and your y-values stored in L2. In list 3 (L3), we are going to teach the calculator how to compute the estimated y-values (i.e. >; ) for each one of the x-values in our data set. Here's how: a) I've shown my first 2 data points stored in L1 and L2. Suppose my regression line equation was D= -1.6x + 150. Using the arrow key and move up to the header of L3. Press CLEAR and ENTER to clear any data that might be in L3 already. Now use the arrow key once more to get to the header of L3. While you're positioned in the header box, type your regression equation like this: -1.6(L1) + 150. The formula that you're typing will show in the function bar at the bottom of the screen. (See Figure A below.) b) Now go to the header of L4 using your arrow key. We're going to compute the residual for each point in our data by typing L2 - L3. (See Figure B below.) NORMAL FLOAT AUTO REAL NORMAL FLOAT AUTO REAL RADIAN Li L2 Li Ly 20 81 L2 L3 77 20 20.4 26. Lo= -1. 6 (L1)+150 L4= L 2-L3 Figure A Figure B c) Now copy the 32 residuals, rounded to 3 decimal places, in the table on page 1 using columns labeled "e". 5. Using StatCrunch, enter your 32 residual values in column 1 of the spreadsheet. Change the name of the heading to Residuals. 6. Even though linearity between two variables can be assessed from the scatterplot of x versus 4, it is helpful to consult graphs of residuals for a clearer view. Normality of residuals is best assessed by examining a normal probability plot ( aka quantile-quantile plot or Q-Q plot) as well as examining a histogram of the residuals. a) Using StatCrunch, make and print a copy of a histogram of your residuals. b) Using StatCruch, make and print a copy of a Q-Q plot of your residuals. Attach both graphs to this report. You may put both graphs on one sheet of paper if you wish. 7. Do the residuals appear to be normally distributed? Explain. Overall, does it seem that a least squares regression line is an appropriate model for estimating the relationship between cognitive function (as measured by RFFT score) and age? Give two reasons why or why not (one from your previous project on this topic and one from this one

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

High School Math 2012 Common-core Algebra 2 Grade 10/11

Authors: Savvas Learning Co

Student Edition

9780133186024, 0133186024

More Books

Students also viewed these Mathematics questions