Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on May 17, 2024

a. Create and report a scatter plot of the data. Describe the appearance of the points in the scatterplot? How many clusters (clearly separated

image text in transcribed

a. Create and report a scatter plot of the data. Describe the appearance of the points in the scatterplot? How many clusters (clearly separated sets of points) are there? b. Cluster the data using K-Means for the specified number of clusters as indicated in your response to the previous question. Use a random initialization of means. Create and report a scatter plot of the data with the clusters indicated by their color. How well does this method identify the clusters in the plot? (Hint: It may be helpful to use python's sklearn package for K-Means tasks) c. Consider the task of clustering with K-Means using K = 2 means and initialized means located at (-8,0) and (1, -1). Create and report a scatter plot of the data with the clusters identified under this clustering method indicated by their color. How well does this method identify the clusters in the plot? Does choosing this initialization help in identifying the true clusters? d. Consider the task of clustering with K-Means using K = 5 means and initialized means located at (7.5,0), (0,7.5), (7.5,0), (0, -7.5), and (0, 0). Create and report a scatter plot of the data with the clusters identified under this clustering method indicated by their color. How well does this method identify the clusters in the plot? Does choosing this initialization help in identifying the true clusters? e. Now consider the task of grouping the points using spectral clustering. Using the value of K as indicated by your response to part a, create and report a scatter plot of the data with the clusters identified under this clustering method indicated by their color. Use a Gaussian Kernel with a bandwidth parameter value () of 1.5. Use the numpy.linalg.eig() function to identify eigenvectors and use the top K eigenvectors in terms of decreasing magnitude of eigenvalues. How well does this method identify the clusters in the plot? How do the types of clusters found in this method differ from those found using K-Means? = f. Using K 2 means and spectral clustering, create clusters using different values of the bandwidth parameter $\sigma$. Consider the clusters created by = {.1, 1, 2}. For which of these values of do the most appropriate clusters get created? Create and report a scatterplot of the clusters under this method indicated by their color. How well does this method identify the clusters in the plot?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Discovering Advanced Algebra An Investigative Approach

Discovering Advanced Algebra An Investigative Approach

Authors: Jerald Murdock, Ellen Kamischke, Eric Kamischke

1st edition

1559539844, 978-1604400069, 1604400064, 978-1559539845

More Books

Students explore these related Mathematics questions

Question

The CEO of Hilton has been hearing a lot about green building standards lately and feels business might be getting lost to the Niagara Falls Convention Centre, a LEED-certified building. She has...

Answered: 3 weeks ago

Question

can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...

Answered: 3 weeks ago

Question

Write a literature review for your study. See below for an example of a literature review. Your literature review should provide both analysis and synthesis of previous studies as related to the...

Answered: 3 weeks ago

Question

The following data have been extracted from the financial statements of Prentiss, Inc., a calendar-year merchandising corporation: Total sales for 2018 were $1,200,000 and for 2017 were $1,100,000....

Answered: 3 weeks ago

Question

What is audit risk?

Answered: 3 weeks ago

Question

=+7. What is a digital buyer persona? Why is it important in digital marketing?

Answered: 3 weeks ago

Question

What is the difference between statistical significance and practical/scientific significance?

Answered: 3 weeks ago

Question

Go to the books companion website, and use information found there to answer the following questions related to The Coca-Cola Company and PepsiCo, Inc. (a) What are the primary lines of business of...

Answered: 3 weeks ago

Question

11. During 2008, Bakery Company paid out $50,000 of common dividends. It ended the year with $200,000 of retained earnings versus the prior years retained earnings of $150,000. How much net income...

Answered: 3 weeks ago

Question

Resource scheduling is the process of identifying and prioritizing tasks based on their importance. True False

Answered: 3 weeks ago

Question

Is this Human Resource Officer a fiduciary (Answer Yes or No for each question)? The human resource officer directs the investment of plan assets. The human resource officer provides investment...

Answered: 3 weeks ago

Question

Your employer asks your opinion on the following situation. An employee has been on pregnancy/parental leave for the past year and is scheduled to return next month. However, this employee's...

Answered: 3 weeks ago

Question

Imagine that you are an innovation consultant. You are facing the task of helping enhance the STARBUCK Coffe Company innovation activities through open and collaborative innovation.b.Suggest some...

Answered: 3 weeks ago

Question

Case- Code Red- Healthcare.gov SETTING THE STAGE .... What is healthcare.gov and what was the issue? THE DASHBOARD .... What did it look like? Why is it important? How healthy was it? The ROLLOUT...

Answered: 3 weeks ago

Question

You are the manager of a busy insurance office. Last year's abnormal winter gales led to an exceptionally high level of insurance claims for house damage caused by strong winds, and you had...

Answered: 3 weeks ago

Question

5. Suponga que el valor a la par del bono es de $1,000. Pioneer Petroleum Corporation tiene un bono en circulacin con un pago de inters anual de $85, un precio de mercado de $800 y una fecha de...

Answered: 3 weeks ago

Question

Is the modified 5-question approach to ethical decision making superior to the modified moral standards or modified Past in approach?

Answered: 3 weeks ago

Question

19. For Example 4.4, calculate the proportion of days that it rains.

Answered: 3 weeks ago

Question

12. For a Markov chain {Xn,n 0} with transition probabilities Pi,j , consider the conditional probability that Xn = m given that the chain started at time 0 in state i and has not yet entered state r...

Answered: 3 weeks ago

Question

20. A transition probability matrix P is said to be doubly stochastic if the sum over each column equals one; that is, i Pij = 1, for all j If such a chain is irreducible and aperiodic and consists...

Answered: 3 weeks ago

Previous Question Next Question