Question
I am having a hard time understanding which variables are continuous that best predict num_people. I have added the question 1 below. event_ID: used to
I am having a hard time understanding which variables are continuous that best predict num_people. I have added the question 1 below.
- event_ID: used to label each event
- data_type: the type of data breached
- num_people (in millions): the number of people impacted by a data breach, expressed in millions
- num_people_v2: coded version of the variable num_people
- num_records (in millions): the number of records breached, expressed in millions
- per_sensitive: percent of sensitive data breached
- per_sensitive_v2: coded version of the variable per_sensitive
- dys_impact: the length of the negative financial impact from the data breach
- dys_detect: the number of days it takes to detect the breach
- cost_controls (in millions): the amount of money spent on security controls, expressed in millions
Question 1. Use the best subsets approach to determine which variable(s) would best predict num_people. Please be sure to only use the continuous variables in your data to predict your outcome and exclude event_ID. The variables are listed above.
Remember: Continuous variable refers to entities that get a distinct score. It gives us a score for each entity and can take on any value on the measurement scale that we are using.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started