Question
Data from the World Happiness Report 2019 record composite scores measuring
the level of happiness (happiness), GDP per capita (gdp), healthy life expectancy
(feexp) and the perceived level of corruption (corruption) for a sample of
countries in the world. A team of data scientists wishes to investigate the potential
presence of a clustering structure in these data using kmeans.
(a) The team uses silhouette analysis to guide the selection of the number of
clusters. The following silhouette plots are produced for a range of values of
the number of clusters. How many clusters does the average silhouette analysis
suggest? Justify your answer.

(b) To further aid the selection of the number of clusters, the team computes the
gap statistic over a range of values of the number of clusters, using the function
clusGap of package cluster. The output table from the function and the gap
statistic plot are reported below.
  (i) What are the quantities logW, E.logW and SE.sim in the output? Explain
  briefly.
  (ii) How many clusters does the gap statistic method suggest? Justify your
  answer.

(c) The team selects the number of clusters and uses kmeans to cluster the data.
The output from function kmeans is reported below.
  (i) Compute the values of the total between-cluster sum of squares and of
  the total sum of squares.
  (ii) The USA has the following values for the scores of interest:
  To which cluster would the USA be assigned? Justify your answer
  using calculations.
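
The calculations asked for in part (c) can be sketched as follows. This is a minimal illustration in plain Python rather than R, and every number in it is hypothetical: the within-cluster sums of squares, the between_SS / total_SS ratio, the centroids and the USA scores all stand in for the values in the kmeans output and the question, which are not reproduced here. The helper names sums_of_squares and nearest_centroid are also invented for this sketch. It assumes the printed kmeans output gives the within-cluster sum of squares for each cluster together with the ratio between_SS / total_SS, and that clusters are assigned by smallest Euclidean distance to a centroid. If the data were standardised before clustering, the USA scores would need the same scaling first.

```python
import math


def sums_of_squares(within_by_cluster, between_over_total):
    """Recover (between_SS, total_SS) from the within-cluster SS values
    and the ratio between_SS / total_SS."""
    total_within = sum(within_by_cluster)
    # total_SS = within_SS + between_SS and between_SS = ratio * total_SS,
    # hence within_SS = (1 - ratio) * total_SS.
    total_ss = total_within / (1.0 - between_over_total)
    between_ss = total_ss - total_within
    return between_ss, total_ss


def nearest_centroid(point, centroids):
    """Return the 1-based label of the closest centroid and all distances."""
    distances = [math.dist(point, c) for c in centroids]
    best = min(range(len(centroids)), key=distances.__getitem__)
    return best + 1, distances


# Hypothetical within-cluster sums of squares and ratio (not from the question).
within = [30.0, 50.0, 20.0]
ratio = 0.75
between_ss, total_ss = sums_of_squares(within, ratio)

# Hypothetical centroids, ordered as (happiness, gdp, feexp, corruption).
centroids = [
    (4.5, 0.5, 0.4, 0.10),
    (5.5, 1.0, 0.8, 0.08),
    (7.0, 1.4, 1.0, 0.30),
]
usa = (6.9, 1.43, 0.87, 0.13)  # hypothetical scores
label, distances = nearest_centroid(usa, centroids)

print(between_ss, total_ss)
print(label, [round(d, 3) for d in distances])
```

With the actual output in hand, the same two steps apply: sum the within-cluster values, divide by one minus the reported ratio to get the total sum of squares, subtract to get the between-cluster sum of squares, and then compare the USA's distance to each row of the cluster means.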