Return to the variant of PCA examined in Exercise 11.2. Using a (possibly synthetic) data set of
Question:
Return to the variant of PCA examined in Exercise 11.2. Using a (possibly synthetic) data set of your choice, compare the classical PCA and the variant examined here, especially in terms of its sensitivity to outliers. Make sure to establish an evaluation protocol that is as rigorous as possible. Discuss your results.
Data from exercise 11.2:
Let X = [x1,...., xm] ∈ Rn,m. For p = 1, 2, we consider the problem
If the data is centered, the case p = 1 amounts to finding a direction of largest "deviation" from the origin, where deviation is measured using the ℓi-norm; arguably, this is less sensitive to outliers than the case p = 2, which corresponds to principal component analysis.
Step by Step Answer:
Optimization Models
ISBN: 9781107050877
1st Edition
Authors: Giuseppe C. Calafiore, Laurent El Ghaoui