Question
Use R studio to perform the following: Exercise 1: Find a data set with at least 4 columns of numeric data and a categorical column
Use R studio to perform the following:
Exercise 1:
Find a data set with at least 4 columns of numeric data and a categorical column
Run several scatter plots of the data
Create a kmeans object from the numeric data, you can pick K to be whatever you want
Determine the size of each cluster
Determine the centers of each cluster
Compare the clusters to the categorical data column as we did with the iris$Species column
Excercise 2:
For your chosen data set - airquality
Describe what each row of data represents
Describe each of your columns used give a one sentence description of the column
If you know it, describe how the data was generated
For the clusters
Describe the size and means of clusters
Give a one- or two-word description to each cluster in other words, give each cluster a label or name
This is an exercise in turning your numeric data into something descriptive for non-statisticians
Use R studio to perform the following:
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started