Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 12, 2024

Homework 2 In this homework you are going to implement PCA from scratch We are going to use the following dataset for our examples *****

Homework 2

In this homework you are going to implement PCA from scratch

We are going to use the following dataset for our examples

*****

Dataset

import numpy as np import matplotlib.pyplot as plt np.random.seed(0) X = np.random.multivariate_normal(mean = np.array([2,3]), cov = np.array([[2,1],[1,1]]),size = 100) plt.plot(X[:,0],X[:,1],"*b") plt.grid()

****

Problem 1

First normalize the each variable (column) by subtracting its mean and dividing by it standart deviation as follows:

image text in transcribed

your function should take input matrix X as input and should return normalized matrix, mean and standart deviation vector as output

def normalize_data(X): pass

# Problem 1 example X_norm, mu, sigma = normalize_data(X)

print(X_norm[0]) print(mu) print(sigma) plt.plot(X_norm[:,0],X_norm[:,1],"*b") plt.grid()

[-1.74865957 -1.32485349] [1.95492653 3.07587784] [1.43707517 1.03112934]

Problem 2

Find the eiegnvalues and eiegnvectors of the covariance matrix. You can use np.cov() function for covariance and numpy.linalg.eig() function to get eigen values.

def eigen(X): pass

# Problem 2 example

eigen_values, eigen_vectors = eigen(X)

print(eigen_vectors) print(eigen_values) plt.plot(X_norm[:,0],X_norm[:,1],"*b") plt.grid() plt.plot([0,eigen_vectors[0,0]],[0,eigen_vectors[1,0]],"r") plt.plot([0,eigen_vectors[0,1]],[0,eigen_vectors[1,1]],"r") plt.axis('square') plt.show()

[[ 0.84500497 -0.53475846] [ 0.53475846 0.84500497]] [2.76215622 0.39785666]

Problem 3

Using the following formulation calculate the transformed values which are coordinates on the new basis as follows

image text in transcribed

where B consists of eigenvectors corresponds the largest mm eigenvalues.

In the formulation above we assumed that columns corresponds to observations and rows corresponds to variables. So if your input matrix has rows as observations and columns as variables you may want to take the transpose of input matrix X as follows:

image text in transcribed

def transform(X, eigen_values, eigen_vectors, m): pass

#Problem 3 example

X_transformed = transform(X_norm, eigen_values, eigen_vectors, 2) print(X_transformed[0])

[-2.18610263 -0.18439728]

Problem 4

Put all the steps together in a function. You will use matrix X and the number of components m as inputs and the transformed matrix as output

def pca(X, m): pass

# Problem 4 example from sklearn.datasets import load_iris

X = load_iris()["data"]

X_transformed = pca(X, 2) print(X_transformed[0])

[-2.26470281 -0.4800266 ]

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

SQL For Data Science Data Cleaning Wrangling And Analytics With Relational Databases

SQL For Data Science Data Cleaning Wrangling And Analytics With Relational Databases

Authors: Antonio Badia

1st Edition

3030575918, 978-3030575915

More Books

Students also viewed these Databases questions

Question

★★★★★

What are the problems faced by researchers when they are conducting research in developing countries?

Answered: 1 week ago

Question

★★★★★

Identify the different types and charac- teristics of investment companies.

Answered: 1 week ago

Question

★★★★★

Suppose that in past years the average price per square foot for warehouses in the United States has been $32.28. A national real estate investor wants to determine whether that figure has changed...

Answered: 1 week ago

Question

★★★★★

Homework 2 In this homework you are going to implement PCA from scratch We are going to use the following dataset for our examples ***** Dataset import numpy as np import matplotlib.pyplot as plt...

Answered: 1 week ago

Question

★★★★★

There is particular legislation, regulations and industry codes of practice that relates to the preparation of activity statements. List at least 10 forms of legislation, regulation or codes that...

Answered: 1 week ago

Question

★★★★★

The slotted arm OA rotates about a horizontal axis through point O. The 0.27-kg slider P moves with negligible friction in the slot and is controlled by the inextensible cable BP. For the instant...

Answered: 1 week ago

Question

★★★★★

Canadian Auditing Standard (CAS) 315 provide guidance on improving the quality of auditing by requiring auditors to gain a deeper understanding of their clients\' businesses and the risks that could...

Answered: 1 week ago

Question

★★★★★

Alex appears upset and challenges the low ratings. The conversation does not go well as a result. Alex later says to Amy "my financial numbers are great" and "I make a lot of money for this company."...

Answered: 1 week ago

Question

★★★★★

Also, What is the domain of f? and what is the domain of f-1? Consider the graph of the one-to-one function shown in the figure below. y 10+ 00 8 6 4 2 Sketch the graph of f-1. -10 5 2 4 y 10 5 5 10...

Answered: 1 week ago

Question

★★★★★

Drew Company produces two products: a high end laptop computer under the label Bunsen Laptops, and an inexpensive desktop computer under the label Beaker Computers. The two products use two overhead...

Answered: 1 week ago

Question

★★★★★

Analyse and critically Evaluate Dorian executives decisions concerning the selection of crew, by examining the cultural compatibility.

Answered: 1 week ago

Question

★★★★★

What expatriate behavior is best for Japanese managers who are sent to Germany?

Answered: 1 week ago

Question

★★★★★

Identify the difficulties in communication between bidder managers and target employees that are apparent in the case study?

Answered: 1 week ago

Previous Question Next Question