Question
Regression
As a statistical procedure, simple linear regression has a lot of limitations. The real power of regression comes with multiple regression, and what we mean by multiple regression is when we start adding independent variables to the model. So when we think about the need to add more variables to the model, I'd like to begin with something of an example.
Let's suppose that a marketing analyst wants to test the relationship between income and internet usage and believes that it's positive and pretty strong. It makes sense to think that people with more income would have better internet access. Professionally, they may need more internet access. And so the idea behind it is, this analyst believes, that as your income goes up, we should also expect your internet usage to go up. Or said conversely, the more internet usage you have, chances are you have a higher income. And moreover, this particular analyst believes that's a pretty strong relationship, feels pretty good about his or her conceptual hypothesis about the relationship between income and internet usage, and so, through one means or another, acquires data that tests the relationship with correlation.
So they collect data from a variety of different respondents and make sure they ask two things: what's your income, and how much do you use the internet. And let's say that that correlation comes back, well, looking pretty weak. If we look at a scatterplot of this hypothetical data, perhaps we put internet usage on the vertical axis and income on the horizontal axis. And let's say that it produces a data plot, a scatterplot, that looks like that: not much to it. And let's say that the calculated correlation coefficient is only 0.04, next to zero, basically, which means almost no correlation seems to exist between internet usage and income.
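To make the correlation step concrete, here is a minimal sketch of how a Pearson correlation coefficient is computed. The five income and usage values are made up for illustration (they are not the analyst's data) and were chosen only so that the result comes out weak, in the neighborhood of the 0.04 mentioned above. The function name `pearson_r` is an assumption for this sketch.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Deviations of each observation from its own mean
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    # Sum of cross-products over the geometric mean of the sums of squares
    cross = sum(a * b for a, b in zip(dx, dy))
    return cross / sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))


# Hypothetical survey responses, invented for this example
income = [25, 40, 55, 70, 85]   # annual income, in thousands
usage = [12, 19, 10, 18, 13]    # internet usage, hours per week

r = pearson_r(income, usage)
print(f"r = {r:.2f}")  # weak positive correlation, roughly 0.04
```

A value this close to zero says only that there is little straight-line association between the two variables taken by themselves; it does not say anything about what happens once other variables are taken into account.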
What's going on here? Is the marketing analyst just wrong? Is his or her really strong intuition about the relationship between these variables just that far off target? Or is there something else at play that we might uncover with the data by adding more variables to this simple model that simply includes internet usage and income, but only produces a correlation of 0.04 amidst a strong belief that there's more to it going on? Well, we can take a look at the more to it coming up.
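One way to see how adding a variable could change the picture is a small constructed example. The sketch below is an illustration under stated assumptions, not the analyst's data or a claim about the real cause: it assumes a hypothetical third variable, age, and four made-up respondents built so that income and usage are uncorrelated taken alone, while usage rises with income once age is held fixed. The helper names (`center`, `dot`, `fit_two_predictors`) are likewise assumptions for this sketch.

```python
def center(values):
    """Subtract the mean from every value."""
    m = sum(values) / len(values)
    return [v - m for v in values]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def fit_two_predictors(y, x1, x2):
    """Least-squares fit of y = b0 + b1*x1 + b2*x2 (normal equations, centered data)."""
    yc, c1, c2 = center(y), center(x1), center(x2)
    s11, s22, s12 = dot(c1, c1), dot(c2, c2), dot(c1, c2)
    s1y, s2y = dot(c1, yc), dot(c2, yc)
    det = s11 * s22 - s12 * s12          # zero if x1 and x2 are perfectly collinear
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    n = len(y)
    b0 = sum(y) / n - b1 * sum(x1) / n - b2 * sum(x2) / n
    return b0, b1, b2


# Toy data, constructed for illustration only (age is a hypothetical extra variable)
age = [30.0, 30.0, 50.0, 50.0]      # years
income = [30.0, 50.0, 50.0, 70.0]   # thousands per year
usage = [20.0, 30.0, 10.0, 20.0]    # hours per week

# Simple regression slope of usage on income alone
simple_slope = dot(center(income), center(usage)) / dot(center(income), center(income))

# Multiple regression of usage on income and age together
b0, b_income, b_age = fit_two_predictors(usage, income, age)

print(f"simple slope on income:  {simple_slope:.2f}")
print(f"multiple regression:     usage = {b0:.1f} + {b_income:.2f}*income + {b_age:.2f}*age")
```

In this toy data the older respondents earn more but use the internet less, so the two effects cancel in the simple model while the multiple regression separates them. Whether anything like this holds in real data is exactly what the analyst would have to check.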
Suppose the analyst feels very confident in the relationship between internet usage and income and is quite surprised that the data do not support the hypothesis. What do you think might be the cause of the result? What might the analyst do to pursue this hunch further?