Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Open an Excel file named Height-and-Weight.sx This file contains the athlete information of certain Olympic athletes from a recent Olympic Games. This file contains the
Open an Excel file named Height-and-Weight.sx This file contains the athlete information of certain Olympic athletes from a recent Olympic Games. This file contains the athlete ID, the height of the athlete in centimeters, the weight of the athlete in kilograms, the country, and the gender of the athlete. We are interested in studying the relationship between the two variables "height and "weight. Remember we plotted the scatter plot in the class, which was a plot for all athletes imespective of the gender. Plot a simlar scatter chart by segregating all the male athletes, the female athletes, and check whether similar patterns are observed across both genders For this exercise, open the data file called "Hw2_Dataset1xlsx.This file contains two main tabs. The first tab in the ile contains the data. The second tab contains a description of the variables in the data set 1 Plot a Column or Bar Chart for math and write scores in one figure. 2 Plot a Histogram of math scores for the students 3. What is the mean, min, max, sum, and count of math scores for students across this set which their math score is above of 310 4. Highlight the grade of students which their math score is above of 330 (Hint: using conditional fitering). Calculate the mean, sum, count, min, and max of math scores for Female students. 5. Open a file named "Car.txt.This dataset was taken from the Statlib ibrary which is maintained at Carnegie Mellon University. The dataset was used in the 1983 American statistical Association Exposition The data contains city-cycle fuel consumption in miles per gallon (Quinlan, 1993). The number of Instances are 398 and the number of Attributes are 9 including the class attribute. The attribute Information (columns) are: 1 mpgcontinuous 2 cylinders: multi-valued discrete s weight: continuous 6. acceleration: continuous 7. model year: multi-valued discrete 8 crigin: multi-valued discrete 9 car name: string (unique for each instance First insert this data into Excel and plot the scatter chart for MPG versus Weight for cars. Please answer to the following questions 1 Is there any relationship between these variables? If yes, please determine whether the relationship i positive or negative? Add a Trendline to this data and an equation on the plot R-squared is one of the main performance metric to evaluate the goodness of a fit. The higher R- squared is, the better fit you will get. Based on R-squared, identify which hyperplane is a best fit to the data, linear or non-dinear curves (Le. Exponential, Logarithmic, etc.? 2. 3. Reference: ailanR. (199 Combiningsstanceksed Conference of Machne Learning 236 24, University of Massaohasetts, Amberst Morgan Kodmon
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started