Answered step by step
Verified Expert Solution
Question
1 Approved Answer
1. Titanic Data Training [45 pts] The objective of the provided data set is to classify whether the passengers survived or not. In this assignment,
1. Titanic Data Training [45 pts] The objective of the provided data set is to classify whether the passengers survived or not. In this assignment, students will perform the train data preparation. Using the provided data set, train the dataset by completing the following. 1. (5 pts) Determine the data type of features. 2. (5 pts) Determine the ratio of survival rate. The ideal binary classification situation (e.g., T/F or 1/0) has a 50-to-50 ratio, which we call the balanced set. 3. (5 pts) Determine the missing value features and their number of observations. Explain how you would handle features. Explain whether deleting the feature(s) or observation(s) is the best choice. If not, explain why not. 4. (5 pts) Determine any feature(s) that can be removed. Explain why. Then remove them. 5. (5 pts) Convert text feature(s) into a numerical feature(s). Explain if any features do not require numerical conversion. 6. (10 pts) Write a method "data_split(data, ratio)" that shuffles the observations and spli
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started