answer asap
Campaign organizers for both the Republican and Democratic parties are interested in identifying individual undecided voters who would consider voting for their party in an upcoming election. The DATAIe BIueOrRed contains data on a sample of voters with tracked variables, Including whether or not they are undecided regarding their candidate preference, age, whether they own a home, gender, marital status, household size, Income, years of education, and whether they attend church. In the Data Mining tab, by selecting Partition named Standard Partition, create a standard partition of the data with all the tracked variables and 50% of observations in the training set, 30% in the validation set, and 20% in the test set. Use logistic regression to classify observations as undecided (or decided) using Age, Homeowner, Female, Married, Householdsize, Income, Education, and Church as input variables and Undecided as the output variable. Perform Feature Selection with the best subsets procedure with the number of best subsets equal to two. In the Scoring tab, generate a Detailed Report and Lift Charts on the validation data. Refer to the Appendix for instructions on how to perform logistic regression using the Analytic Solver Platform. Click on the datale logo to reference the data. (a) From the generated set of logistic regression models, use Mallow's Cp statistic to identify a pair of candidate models. Then evaluate these candidate models based on their classication error and deciIe-wise lift on the validation set. Recommend a nal model. Select your answe V Express the model as a mathematical equation relating the output variable to the input variables. If required, round youranswers to three decimal places. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank. (Example: -300) Log odds of being undecided = l + ' *Age + *HomeOwner + *Female + l ' *HouseholdSize + *Income + ' *Education + ' *Church (b) Increases in which variables increase the chance of a voter being undecided? (I) Age, Education, Homeowner and Income (II) Household members, Education, Homeowners and Female ) Education, Married, Church and Female (iv) Age, Church, HomeOwner and Income - Select your answer - V Increases in which variables decrease the chance of a voter being decided? (I) Age, Education and Income (II) Age, Married and Church ) Age, Income and Church (iv) Age, Income and HomeOwner - Select your answer - V (c) Using the default cutoff value of 0.5 for your logistic regression model, what is the overall error rate on the test set? If required, round your answer to two decimal places. i l %