Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 21, 2024

There are two parts to this problem I need help on: 1) Create dataframe called happy_final and display the shape 2) Drop all nulls and

There are two parts to this problem I need help on:

1) Create dataframe called happy_final and display the shape

2) Drop all nulls and display the dataframe info

Please see the iimage for problem 4 (has two parts)

image text in transcribed

- As you progress in the EDA process, there will end up being many dataframes created. A naming convention is a good plan to put into place. Your naming convention should be one that makes sense to you and to anyone that you will share your code with. - A common mistake made is to use nonmeaningful names like df1, df2, df3, etc. The weakness with this plan is that it is difficult to easily know which is the most current dataframe. - A possible convention is to use df following by a suffix which denotes where you are in the EDA process. - df_raw or dfRaw could be used when you first read in the raw data. - df_clean or dfClean could be used after cleaning the data - dfffinal or dffinal could be used after creating new variables and you are ready to write out your file. - There can be a number of additional suffices that you may end up using. - Another convention is to use dataframe names that relate to your topic; this is especially useful if you have a number of files. - Example: nfl2019, nfl2020, nfl2021 would be good names if you have three years of data that you will later combine. - You can use this example and combine it with the suffix idea: nfl2019Raw, nfl2019Clean, nfl2019Final. Problem 4 (4 pts.): Create a new dataframe called happy_final to include 2015, 2016, 2017, 2018 and 2019 data. - Show the shape of your new dataframe. - Drop all nulls from happy_final and show the info of dataframe after the drop. Problem 4 continued: Drop all nulls and display the dataframe info. Dataframe names continued In the cell below there are two new dataframes created, cross and subset. These names were chosen in such a way that they can be used again and again without the worry that they will interfere with or be overwritten by a dataframe that might be needed later. M what is missing per country cross = pd.crosstab(happy_final['Country'], happy_final['Year'], margins = True) type(cross) subset = cross[cross['Al1']

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Intelligent Information And Database Systems 6th Asian Conference Aciids 2014 Bangkok Thailand April 7 9 2014 Proceedings Part I 9 2014 Proceedings Part 1 Lnai 8397

Intelligent Information And Database Systems 6th Asian Conference Aciids 2014 Bangkok Thailand April 7 9 2014 Proceedings Part I 9 2014 Proceedings Part 1 Lnai 8397

Authors: Ngoc-Thanh Nguyen ,Boonwat Attachoo ,Bogdan Trawinski ,Kulwadee Somboonviwat

2014th Edition

3319054759, 978-3319054759

More Books

Students also viewed these Databases questions

Question

★★★★★

The average number of dropouts for a school district has been 305 per year, with a standard deviation of 50. What is the probability that next year the number of dropouts will be a. less than...

Answered: 1 week ago

Question

★★★★★

Assignment Topic Auditors and Legal Liability Read the following extract from the ACCA (the Association of Chartered Certified Accountants) website, which is the global body for professional...

Answered: 1 week ago

Question

★★★★★

A proposed open-pit mine would require the investment of $2 million at the beginning of the first year and a further investment of $1 million at the end of the first year. Mining operations are...

Answered: 1 week ago

Question

★★★★★

A building with a cost of $900,000 has an estimated residual value of $250,000, has an estimated useful life of 40 years, and is depreciated by the straight-line method. (a) What is the amount of the...

Answered: 1 week ago

Question

★★★★★

There are two parts to this problem I need help on: 1) Create dataframe called happy_final and display the shape 2) Drop all nulls and display the dataframe info Please see the iimage for problem 4...

Answered: 1 week ago

Question

★★★★★

SALES INCREASE Paladin Furnishings generated $4 million in sales during 2016, and its year-end total assets were $2.4 million. Also, at year-end 2016, current liabilities were $500,000, consisting of...

Answered: 1 week ago

Question

★★★★★

Write a C++ program that reads a single line of text from a file, then prints the contents of that line with the first and last characters swapped to an output file calledoutput.txt. The program...

Answered: 1 week ago

Question

★★★★★

Question 2 (1 point) A project requires the purchase of machinery for $40,000. The machinery belongs in a 20% CCA class and will have a salvage value of $4,000 at the end of the 4 year project. It...

Answered: 1 week ago

Question

★★★★★

Consider the effects on fast-food restaurants if the minimum wage is raised to $15 an hour. Is this a change in demand or supply? Which acronym for non-price determinants applies: TRIBE or ROTTEN?...

Answered: 1 week ago

Question

★★★★★

Based on Lab 2-4 Excel Sensitivity Analysis of Profitability under Changing Volume, Sales Price, and Fixed and Variable Cost Assumptions, using the Exhibit below, answer the following question. A...

Answered: 1 week ago

Question

★★★★★

Q4, 1 point for each question) Short answer questions: 1) Give an example about discretionary dependency between two activities 2) Name two inputs that you need to develop a communication plan 3)...

Answered: 1 week ago

Question

★★★★★

(Appendices) Megan, age 24, recently graduated from college. She was insured as a dependent under her fathers group health insurance policy, which provided coverage for her as a student until she...

Answered: 1 week ago

Question

★★★★★

(Appendices) The Social Security trust funds are invested in government securities that are both similar and dissimilar to other government securities available to the Public. LO4 a. Explain how the...

Answered: 1 week ago

Question

★★★★★

(Appendices) Ethics Some celebrities lend their names to products that later turn out to be ineffective or harmful to consumers. The celebrities are paid to endorse the product by claiming they have...

Answered: 1 week ago

Previous Question Next Question