Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Researchers studying chemical properties of wines collected data on a sample of white wines in Northern Portugal. A research goal was to cluster wines based

Researchers studying chemical properties of wines collected data on a sample of white wines in Northern Portugal. A research goal was to cluster wines based on similar chemical properties.
Using pdist(), calculate a distance matrix for wines. The matrix of input features, X, has already been created.
Use the distance matrix to cluster the wines with centroid linkage.
The code provided creates a dataframe with two features (free_sulfur_dioxide and sulphates), normalizes the dataframe, and displays the cluster membership of each data point.
import pandas as pd
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import pdist
from sklearn.preprocessing import StandardScaler
wine = pd.read_csv('wine1.csv')
# Calculate a distance matrix with selected variables
X = wine[['free_sulfur_dioxide', 'sulphates']]
scaler = StandardScaler()
X = pd.DataFrame(scaler.fit_transform(X))
# Your code goes here
print(clusterModel)

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Professional Microsoft SQL Server 2014 Integration Services

Authors: Brian Knight, Devin Knight

1st Edition

1118850904, 9781118850909

More Books

Students also viewed these Databases questions