Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 23, 2024

Python code please for the commented steps Handling Missing Values 2 of the columns have a relatively small amount of null values. There are 110

image text in transcribed Python code please for the commented steps

Handling Missing Values 2 of the columns have a relatively small amount of null values. There are 110 null values in the so (Strike Outs) column and 22 in the DP (Double Play) column. Two of the columns have a relatively large amount of them. There are 419 null values in the cs (Caught Stealing) column and 1777 in the P (Hit by Pitch) column. 1 df_Teams.isna( ).sum () [51] 1 \#Identifying the number of null values in the dataframe 2 \# Creating a for loop to display the column names and also their count of missing values 3 4 \#\#\#\# complete the code below 5 \#\#\#\# create an empty list named 'names' for columns names 6 7 \#\#\#\# create an empty list named 'val' for \# of null values in each column 8 9 10 \#\#\#\# create a for loop iterating each 'col' through 'df.columns' 11 for col in df.columns : 12 \#\#\#\# add column name 'col' to 'names' 13 \# Adding the column name to the names list 14 \#\#\#\# add \# of null values to 'val' \#\#\#\# you can get \# of null values for column ' c ' as ' df[c].isnull().sum() \# Adding the count of the missing values \#\#\#\# print out results as (column_name, \# of null values in column_name) \#\#\#\# Note that 'col' is the current column_name in iteration \#\#\#\# and you should retrieve the \# of null values in column_name as the last element in 'val' \#\#\#\# hint: the last element in a list 1 is: 1[1] \$\# Printing the column names and thier missing counts ie : 2 lists We are going to drop two columns ( S and ) with too many missing values. NOTE: even though we said that dropping columns with missing values is the last resort, the reason we are dropping the columns here is that because of the number of missing values, it will be very difficult for us to impute them in these two columns. [ ] 1 \#Dropping the columns with large number of null values 2 3 4 \#\#\#\# drop 'CS' \& 'HBP' from ' df 5 \#\#\#\# and save the remaining as 'df' 6 7 8 \#\#\#\# check the first 5 rows of the new 'df' to see 9 \#\#\#\# if the two columns are successfully dropped 10 \#\#\#\# you should expect to see 27 columns now 11 With the two columns dropped, we can impute the missing values in the other two columns ( so and DP ) since they have much less

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

User Defined Tensor Data Analysis

User Defined Tensor Data Analysis

Authors: Bin Dong ,Kesheng Wu ,Suren Byna

1st Edition

3030707490, 978-3030707491

More Books

Students also viewed these Databases questions

Question

Let 2. (a) Propose a plug-in estimator b for . (b) Calculate E b . (c) Propose an unbiased estimator for .

Answered: 1 week ago

Question

★★★★★

Jim Locke, an undergraduate student in the ESU College of Business, is trying to decide which microcomputer to purchase with the money his parents gave him for Christmas. He has reduced the number of...

Answered: 1 week ago

Question

★★★★★

Python code please for the commented steps Handling Missing Values 2 of the columns have a relatively small amount of null values. There are 110 null values in the so (Strike Outs) column and 22 in...

Answered: 1 week ago

Question

★★★★★

1. Determine the equation of the normal line to f(x)=(x 2 +2x-1)(4-3x) at x=1. 2. The tangents to the cubic function defined by y=x 3 -6x 3 +8x at A(3,-3) intersects the curve at another point B....

Answered: 1 week ago

Question

★★★★★

Consider the design of an engineered wetland amended with limestone to treat acidic drainage from a coal mine. For the purposes of this problem, we will focus on the total dissolved solids (TDS) of...

Answered: 1 week ago

Question

★★★★★

Hudson Jewelers' current layout displays jewelry designs from the CAD system on a screen in the front of the store. However, customers have little privacy when designing jewelry there. What are the...

Answered: 1 week ago

Question

★★★★★

You deposit $A today in a fund paying interest at j4 = 6%. You plan to make quarterly withdrawals for 5 years (for a total of 20 withdrawals), with the first deposit made 1 year from now. The first...

Answered: 1 week ago

Question

★★★★★

2. Titanic Corporation has three investment centre as below: Centre Nike Pine Feira (RM) (RM) (RM) Sales Operating income Average operating assets 4,000,000 6,000,000 3,500,000 4,000,000 6,000,000...

Answered: 1 week ago

Question

★★★★★

The standard cost card for a product indicates that one unit of the product requires 8 kilograms of a raw material at $0.80 per kilogram. The actual production of the product in April was 870 units,...

Answered: 1 week ago

Question

★★★★★

Debate the diffi culties that could emerge with the direct transfer of Western HRM methods into a Gulf country. What approaches should you adopt to avoid these diffi culties?

Answered: 1 week ago

Question

★★★★★

In the Meteor case study, Sarah has put together a comprehensive human resource plan. Think about what could go wrong or what diffi culties could be faced and work out how you could deal with these...

Answered: 1 week ago

Question

★★★★★

Read the article about the staff shortages in Essex County Council. Think of other areas where there are shortages of professional staff (accounting, legal, planning, hospitals) and suggest ideas how...

Answered: 1 week ago

Previous Question Next Question