Question: Problem #B 50 marks Consider the following data related to the final scores of the four sections of a data science course Section Scores Clytherin:

Problem #B 50 marks Consider the following data related to the

Problem #B 50 marks Consider the following data related to the final scores of the four sections of a data science course Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75, 75, 75, SO, 65, 95, 75, 65, 75, 50, 65, 95, 75, 65, 95, 75, 65, 80, 65, 70, 70,65 Jriffindor: 70, 75, 90, 75, 55, 55, 50, 60, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 55, 65, 65, 90, 65, 75, 55, 95, 90 Hull buff: 75, 85, 80, 85, 55, 55, 55, 85, 85, 75, 70, 85, 80, 70, 85, 80, 85, 75, 70, 85, 55, 75, 55, 55, 85 Ravenklaw: 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75 Answer the following questions: B-1: Draw the histogruns of scores for each of the above sections. Take bin size is 10 units, starting from score 40. Draw each histogram in one figure. Write the title on each histogram. B-2: Comment on the skewness of the distributions by obeerving the histograms. B-3: Find the mean, median and mode for each of the above sections. B-4: Find the variance and standard deviation for each of the above sections B-5: Find the upper and lower quartile for each of the above sections. B-6: Draw the box-plot for each of the above sections in one figure. Label 3-axis with the name of the section B-7: Do hypothesis testing for each section to check if the mean is equal to 75. Assume p-value less than 0.05 or 5% a very small B-8: Do pair-wise hypothesis testing to compare the means of two distribution. Assume p-value less than 0.05 or 5% a very small Note: Solec all the above questions using Python (not by hand). To read the data in python and repeat for each section, you can use the following code: n (1): 1 Data Jriffindor": [70, 75, 90, 75, 55, 85, 80, 80, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 85, 65. 65, 90, 65, 75, 55, 95, 90). "clytherin": [95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75, 80, 65, 95, 75, 65. 95, 75, 65, 80, 65, 70, 70, 65). "Hufflepuff": [75, B5, 80, E5, E5, B5, B5, B5, B5, T2, T3, B5, B2, 70, 85, 80, B5, 75, 73, B5, 55, 75, 55, 55, 85). "Ravenklas": [75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75. 75, 75, 75, 75, 75, 75, 75] ) for section in Data: dictionary's default iterator is key series-Data[section).copy() # list is mutable, so we copy to avoid chages in actual data #your code for each section # # end of the code for each section Problem #B 50 marks Consider the following data related to the final scores of the four sections of a data science course Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75, 75, 75, SO, 65, 95, 75, 65, 75, 50, 65, 95, 75, 65, 95, 75, 65, 80, 65, 70, 70,65 Jriffindor: 70, 75, 90, 75, 55, 55, 50, 60, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 55, 65, 65, 90, 65, 75, 55, 95, 90 Hull buff: 75, 85, 80, 85, 55, 55, 55, 85, 85, 75, 70, 85, 80, 70, 85, 80, 85, 75, 70, 85, 55, 75, 55, 55, 85 Ravenklaw: 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75 Answer the following questions: B-1: Draw the histogruns of scores for each of the above sections. Take bin size is 10 units, starting from score 40. Draw each histogram in one figure. Write the title on each histogram. B-2: Comment on the skewness of the distributions by obeerving the histograms. B-3: Find the mean, median and mode for each of the above sections. B-4: Find the variance and standard deviation for each of the above sections B-5: Find the upper and lower quartile for each of the above sections. B-6: Draw the box-plot for each of the above sections in one figure. Label 3-axis with the name of the section B-7: Do hypothesis testing for each section to check if the mean is equal to 75. Assume p-value less than 0.05 or 5% a very small B-8: Do pair-wise hypothesis testing to compare the means of two distribution. Assume p-value less than 0.05 or 5% a very small Note: Solec all the above questions using Python (not by hand). To read the data in python and repeat for each section, you can use the following code: n (1): 1 Data Jriffindor": [70, 75, 90, 75, 55, 85, 80, 80, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 85, 65. 65, 90, 65, 75, 55, 95, 90). "clytherin": [95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75, 80, 65, 95, 75, 65. 95, 75, 65, 80, 65, 70, 70, 65). "Hufflepuff": [75, B5, 80, E5, E5, B5, B5, B5, B5, T2, T3, B5, B2, 70, 85, 80, B5, 75, 73, B5, 55, 75, 55, 55, 85). "Ravenklas": [75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75. 75, 75, 75, 75, 75, 75, 75] ) for section in Data: dictionary's default iterator is key series-Data[section).copy() # list is mutable, so we copy to avoid chages in actual data #your code for each section # # end of the code for each

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Consider the following data related to the final scores of the four sections of a data science course: Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75, 80, 65,...

i want to solve from B5 to B8 Consider the following data related to the final scores of the four sections of a data science course: Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75, 75, 75, 80,...

By python Consider the following data related to the final scores of the four sections of a data science course: Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75,...

use this program to solve question B-1 and find the mean , mode and median Consider the following data related to the final scores of the four sections of a data science course: Section Clytherin:...

please solve this question by using python Consider the following data related to the final scores of the four sections of a data science course: Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75,...

Managerial Accounting - 6th Edition by James Jiambalvo Chapter 14 - Problems 13, 14, and 15 (see attachment) Problem 14-13. Common-Size Financial Statements (p.564) Problem 14-14. Horizontal Analysis...

2.2 51 Summarizing Data for a Quantitative Variable Exercises Methods 11. Consider the following data. WEB 14 19 24 19 16 20 24 20 file Frequency a. b. SELF test 21 22 24 18 17 23 26 22 23 25 25 19...

FINAL Exam Spring 2017 MATH 250 Elements of Statistics Multiple Choice. Due to alternate methods of computation, there may be slight differences in answer the problem number. Please do not add the...

A random sample is selected from a normal population with a mean of = 30 and a standard deviation of s = 8. After a treatment is administered to the individuals in the sample, the sample mean is...

The supervisor at the Precision Machine Shop wants to determine the staffing policy that minimizes total operating costs. The average arrival rate at the tool crib, where tools are dispensed to the...

If a company uses a cash method of a account, which of the following statements will be true? A revenue is recognized when cash has received no matter when the cell actually took place expenses for...

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

Assume that the banking system has total reserves of $100 billion. Assume also that required reserves are 10 percent of checking deposits and that banks hold no excess reserves and households hold no...

As shown in Figure 3, the overall labor-force participation rate of men declined between 1970 and 2000. At the same time, the labor-force participation rate of women increased sharply. This overall...

The Bureau of Labor Statistics announced that in February 2008, of all adult Americans, 145,993,000 were employed, 7,381,000 were unemployed, and 79,436,000 were not in the labor force. Use this...