Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Consider the data found in Table B.15 detailing mortality and air pollution on a by city/demographic basis. Following Example 9.3, use principle-components regression in comparison

Consider the data found in Table B.15 detailing mortality and air pollution on a by city/demographic basis. Following Example 9.3, use principle-components regression in comparison with standard ordinary least squares. Highlight what relationships can be elucidated from the principle components.

table B.15

City Mort Precip Educ Nonwhite Nox SO2

Akron-OH 921.87 36 11.4 8.8 15 59

Albany-NY 997.88 35 11 3.5 10 39

Allentown-PA 962.35 44 9.8 0.8 6 33

Atlanta-GA 982.29 47 11.1 27.1 8 24

Baltimore-MD 1071.29 43 9.6 24.4 38 206

Birmingham-AL 1030.38 53 10.2 38.5 32 72

Boston-MA 934.7 43 12.1 3.5 32 62

Bridgeport-CT 899.53 45 10.6 5.3 4 4

Buffalo-NY 1001.9 36 10.5 8.1 12 37

Canton-OH 912.35 36 10.7 6.7 7 20

Chattanooga-TN 1017.61 52 9.6 22.2 8 27

Chicago-IL 1024.89 33 10.9 16.3 63 278

Cincinnati-OH 970.47 40 10.2 13 26 146

Cleveland-OH 985.95 35 11.1 14.7 21 64

Columbus-OH 958.84 37 11.9 13.1 9 15

Dallas-TX 860.1 35 11.8 14.8 1 1

Dayton-OH 936.23 36 11.4 12.4 4 16

Denver-CO 871.77 15 12.2 4.7 8 28

Detroit-MI 959.22 31 10.8 15.8 35 124

Flint-MI 941.18 30 10.8 13.1 4 11

Fort Worth-TX 891.71 31 11.4 11.5 1 1

Grand Rapids-MI 871.34 31 10.9 5.1 3 10

Greensboro-NC 971.12 42 10.4 22.7 3 5

Hartford-CT 887.47 43 11.5 7.2 3 10

Houston-TX 952.53 46 11.4 21 5 1

Indianapolis-IN 968.66 39 11.4 15.6 7 33

Kansas City-MO 919.73 35 12 12.6 4 4

Lancaster-PA 844.05 43 9.5 2.9 7 32

Los Angeles-CA 861.83 11 12.1 7.8 319 130

Louisville-KY 989.27 30 9.9 13.1 37 193

Memphis-TE 1006.49 50 10.4 36.7 18 34

Miami-FL 861.44 60 11.5 11.5 1 1

Milwasukee-WI 929.15 30 11.1 5.8 23 125

Minneapolis-MN 857.62 25 12.1 3 11 26

Nashville-TN 961.01 45 10.1 21 14 78

New Haven-CT 923.23 46 11.3 8.8 3 8

New Orleans-LA 1113.06 54 9.7 31.4 17 1

New York-NY 994.65 42 10.7 11.3 26 108

Philadelphia-PA 1015.02 42 10.5 17.5 32 161

Pittsburgh-PA 991.29 36 10.6 8.1 59 263

Portland-OR 893.99 37 12 3.6 21 44

Providence-RI 938.5 42 10.1 2.2 4 18

Reading-PA 946.18 41 9.6 2.7 11 89

Richmond-VA 1025.5 44 11 28.6 9 48

Rochester-NY 874.28 32 11.1 5 4 18

Saint Louis-MO 953.56 34 9.7 17.2 15 68

San Diego-CA 839.71 10 12.1 5.9 66 20

San Francisco-CA 911.7 18 12.2 13.7 171 86

San Jose-CA 790.73 13 12.2 3 32 3

Seattle-WA 899.26 35 12.2 5.7 7 20

Springfield-MA 904.16 45 11.1 3.4 4 20

Syracuse-NY 950.67 38 11.4 3.8 5 25

Toledo-OH 972.46 31 10.7 9.5 7 25

Utica-NY 912.2 40 10.3 2.5 2 11

Washington-DC 967.8 41 12.3 25.9 28 102

Wichita-KS 823.76 28 12.1 7.5 2 1

Wilmington-DE 1003.5 45 11.3 12.1 11 42

Worcester-MA 895.7 45 11.1 1 3 8

York-PA 911.82 42 9 4.8 8 49

Youngstown-OH 954.44 38 10.7 11.7 13 39

Example 9.3 Principal-Component Regression for the Acetylene Data

We illustrate the use of principal-component regression for the acetylene data. We

begin with the linear transformation Z = XT that transforms the original standardized

regressors into an orthogonal set of variables (the principal components). The

TABLE 9.10 Matrix T of Eigenvectors and Eigenvalnes j for the Acetylene Data

Eigenvectors

Eigenvalues

j

.3387 .1057 .6495 .0073 .1428 .2488 .2077 .5436 .1768 4.20480

.1324 .3391 .0068 .7243 5843 .0205 .0102 .0295 .0035 2.16261

.4137 .0978 .4696 .0718 .0182 .0160 .1468 .7172 .2390 1.13839

.2191 .5403 .0897 .3612 .1661 .3733 .5885 .0909 .0003 1.04130

.4493 .0860 .2863 .1912 .0943 .0333 .0575 .1543 .7969 0.38453

.2524 .5172 .0570 .3447 .2007 .3232 .6209 .1280 .0061 0.04951

.4056 .0742 .4404 .2230 .1443 .5393 .3233 .0565 .4087 0.01363

.0258 .5316 .2240 .3417 .7342 .0705 .0057 .0761 .0050 0.00513

.4667 .0969 .1421 .1337 .0350 .6299 .3089 .3631 .3309 0.00010

eigenvalues j and the T matrix for the acetylene data are shown in Table 9.10. This

matrix indicates that the relationship between z1 (for example) and the standardized

regressors is

z1 0 3387x1 0 1324x2 0 4137x3 0 2191x1x2 0 4493x1x3

0 2524

. . . . .

. x2x3 x1 x x

2

2

2

3

0.4056 0.0258 0.4667 2

The relationships between the remaining principal components z2, z3, . . . , z9 and

the standardized regressors are determined similarly. Table 9.11 shows the elements

of the Z matrix (sometimes called the principal-component scores).

The principal-component estimator reduces the effects of multicollinearity by

using a subset of the principal components in the model. Since there are four small

eigenvalues for the acetylene data, this implies that there are four principal components

that should be deleted. We will exclude z6, z7, z8, and z9 and consider regressions

involving only the first five principal components.

Suppose we consider a regression model involving only the first principal component,

as in

y 1z1

The fitted model is

y 0.35225z1

or PC 0.35225, 0, 0, 0, 0, 0, 0, 0, 0 . The coefficients in terms of the standardized

regressors are found from PC T PC. Panel A of Table 9.11 shows the resulting

standardized regression coefficients as well as the regression coefficients in terms

of the original centered regressors. Note that even though only one principal component

is included, the model produces estimates for all nine standardized regression

coefficients.

The results of adding the other principal components z2, z3, z4, and z5 to the model

one at a time are displayed in panels B, C, D, and E, respectively, of Table 9.12. We

see that using different numbers of principal components in the model produces

TABLE 9.12 Principal Components Regression for the Acetylene Data

Principal Components in Model

Parameter

A B C D E

z1 z1, z2 z1, z2, z3 z1, z2, z3, z4 z1, z2, z3, z4, z5

Standarized

Estimate

Original

Estimate

Standardized

Estimate

Original

Estimate

Standardized

Estimate

Original

Estimate

Standardized

Estimate

Original

Estimate

Standardized

Estimate

Original

Estimate

0 .0000 42.1943 .0000 42.2219 .0000 36.6275 .0000 34.6688 .0000 34.7517

1 .1193 1.4194 .1188 1.4141 .5087 6.0508 .5070 6.0324 .5056 6.0139

2 .0466 .5530 .0450 .5346 .0409 .4885 .2139 2.5438 .2195 2.6129

3 .1457 1.7327 .1453 1.7281 .4272 5.0830 .4100 4.8803 .4099 4.8757

12 .0772 1.0369 .0798 1.0738 .0260 .3502 .1123 1.5115 .1107 1.4885

13 .1583 2.0968 .1578 2.0922 .0143 .1843 .0597 .7926 .0588 .7788

23 .0889 1.2627 .0914 1.2950 .0572 .8111 .1396 1.9816 .1377 1.9493

11 .1429 2.1429 .1425 2.1383 .1219 1.8295 .1751 2.6268 .1738 2.6083

22 .0091 .0968 .0065 .0691 .1280 1.3779 .0460 .4977 .0533 .5760

33 .1644 1.9033 .1639 1.8986 .0786 .9125 .0467 .5392 .0463 .5346

R2 .5217 .5218 .9320 .9914 .9915

MSRes .079713 .079705 .011333 .001427 .00142

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Recent Developments Of Mathematical Fluid Mechanics

Authors: Herbert Amann, Yoshikazu Giga, Hideo Kozono, Hisashi Okamoto, Masao Yamazaki

1st Edition

3034809395, 9783034809399

More Books

Students also viewed these Mathematics questions

Question

2 What participation techniques are used?

Answered: 1 week ago