Question
Data set working with: https://www.kaggle.com/c/titanic/overview Load the data from data/titanic.csv into R and familiarize yourself with the variables it contains using the codebook (data/titanic_codebook.txt). Notice
Data set working with: https://www.kaggle.com/c/titanic/overview
Load the data from data/titanic.csv into R and familiarize yourself with the variables it contains using the codebook (data/titanic_codebook.txt).
Notice that survived and pclass should be changed to factors. When changing survived to a factor, you may want to reorder the factor so that "Yes" is the first level.
Make sure you load the tidyverse and tidymodels!
Remember that you'll need to set a seed at the beginning of the document to reproduce your results.
Question 2
Using the training data set, explore/describe the distribution of the outcome variable survived.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started