Question

1 Approved Answer

Posted on Sep 25, 2024

i want to solve from B5 to B8 Consider the following data related to the final scores of the four sections of a data science

image text in transcribed i want to solve from B5 to B8

Consider the following data related to the final scores of the four sections of a data science course: Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75, 80, 65, 95, 75, 65, 95, 75, 65, 80, 65, 70, 70, 65 Jriffindor: 70, 75, 90, 75, 55, 85, 80, 80, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 85, 65, 65, 90, 65, 75, 55, 95, 90 Hufflebuff: 75, 85, 80, 85, 55, 85, 55, 85, 85, 75, 70, 85, 80, 70, 85, 80, 85, 75, 70, 85, 55, 75, 55, 55, 85 Ravenklaw: 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75 Answer the following questions: B-1: Draw the histograms of scores for each of the above sections. Take bin size as 10 units, starting from score 40. Draw each histogram in one figure. Write the title on each histogram. B-2: Comment on the skewness of the distributions by observing the histograms. B-3: Find the mean, median and mode for each of the above sections. B-4: Find the variance and standard deviation for each of the above sections. B-5: Find the upper and lower quartile for each of the above sections. B-6: Draw the box-plot for each of the above sections in one figure. Label x-axis with the name of the section. B-7: Do hypothesis testing for each section to check if the mean is equal to 75. Assume p-value less than 0.05 or 5% as very small. B-8: Do pair-wise hypothesis testing to compare the means of two distribution. Assume p-value less than 0.05 or 5% as very small. * Note: Solve all the above questions using Python (not by hand). To read the data in python and repeat for each section, you can use the following code: In [1] : 1 3 Data = { "Jriffindor": [70, 75, 90, 75, 55, 85, 80, 80, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 85, 65, 65, 90, 65, 75, 55, 95, 90], "Clytherin": [95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75, 80, 65, 95, 75, 65, 95, 75, 65, 80, 65, 70, 70, 65), "Hufflebuff": [75, 85, 80, 85, 55, 85, 55, 85, 85, 75, 70, 85, 80, 70, 85, 80, 85, 75, 70, 85, 55, 75, 55, 55, 85), "Ravenklaw": [75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75] } 4 7 9 for section in Data: #dictionary's default iterator is key series=Data [section].copy() # list is mutable, so we copy to avoid chages in actual data # your code for each section 10 11 # 12 13 # # # end of the code for each section Consider the following data related to the final scores of the four sections of a data science course: Section Scores Clytherin: 95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75, 80, 65, 95, 75, 65, 95, 75, 65, 80, 65, 70, 70, 65 Jriffindor: 70, 75, 90, 75, 55, 85, 80, 80, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 85, 65, 65, 90, 65, 75, 55, 95, 90 Hufflebuff: 75, 85, 80, 85, 55, 85, 55, 85, 85, 75, 70, 85, 80, 70, 85, 80, 85, 75, 70, 85, 55, 75, 55, 55, 85 Ravenklaw: 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75 Answer the following questions: B-1: Draw the histograms of scores for each of the above sections. Take bin size as 10 units, starting from score 40. Draw each histogram in one figure. Write the title on each histogram. B-2: Comment on the skewness of the distributions by observing the histograms. B-3: Find the mean, median and mode for each of the above sections. B-4: Find the variance and standard deviation for each of the above sections. B-5: Find the upper and lower quartile for each of the above sections. B-6: Draw the box-plot for each of the above sections in one figure. Label x-axis with the name of the section. B-7: Do hypothesis testing for each section to check if the mean is equal to 75. Assume p-value less than 0.05 or 5% as very small. B-8: Do pair-wise hypothesis testing to compare the means of two distribution. Assume p-value less than 0.05 or 5% as very small. * Note: Solve all the above questions using Python (not by hand). To read the data in python and repeat for each section, you can use the following code: In [1] : 1 3 Data = { "Jriffindor": [70, 75, 90, 75, 55, 85, 80, 80, 70, 60, 75, 75, 80, 70, 85, 95, 60, 60, 85, 65, 65, 90, 65, 75, 55, 95, 90], "Clytherin": [95, 95, 65, 70, 65, 65, 75, 75, 75, 80, 65, 95, 75, 65, 75, 80, 65, 95, 75, 65, 95, 75, 65, 80, 65, 70, 70, 65), "Hufflebuff": [75, 85, 80, 85, 55, 85, 55, 85, 85, 75, 70, 85, 80, 70, 85, 80, 85, 75, 70, 85, 55, 75, 55, 55, 85), "Ravenklaw": [75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75, 75] } 4 7 9 for section in Data: #dictionary's default iterator is key series=Data [section].copy() # list is mutable, so we copy to avoid chages in actual data # your code for each section 10 11 # 12 13 # # # end of the code for each