Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on May 16, 2024

Please provide R Code: Traditional k-means initialization is based on choosing values from a uniform distribution. In this question, you are asked to improve k-means

Please provide R Code:

Traditional k-means initialization is based on choosing values from a uniform distribution. In this question,

you are asked to improve k-means through initialization. k-means ++ is an extended k-means clustering

algorithm and induces non-uniform distributions over the data that serve as the initial centroids. Read the

paper and discuss the idea in a paragraph. Implement this idea to improve your k-means program. Run

your program, Ck++, against the Diabetes and New York Times Comments data sets. Report the total error rates for k = 2,...,5 for 20 runs each for both data sets. Moreover, compare C_k, C_kSSE and C_k++'s run time for k = 2,...,5 for 20 runs using both data sets. Presenting the results that are easily understandable. Plots are generally a good way to convey complex ideas quickly, i.e., box plot. Discuss your results

Paper Link: http://ilpubs.stanford.edu:8090/778/1/2006-13.pdf

Diabetes Dataset: https://archive.ics.uci.edu/ml/datasets/Diabetes+130US+hospitals+for+years+1999-2008

New York Times Comments Data Sets: https://www.kaggle.com/datasets/benjaminawd/new-york-times-articles-comments-2020?select=nyt-comments-2020.csv

R script:

Discussion of Findings:

Plots:

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

An Introduction to Analysis

An Introduction to Analysis

Authors: William R. Wade

4th edition

132296381, 978-0132296380

More Books

Students also viewed these Mathematics questions

Question

can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...

Answered: 1 week ago

Question

Write an alternative definition that is tail-recursive (iterative) and makes use of accumulator variables. [10 marks] Explain why your alternative definition executes more efficiently. [3 marks] 1...

Answered: 1 week ago

Question

Let A, B be sets. Define: (a) the Cartesian product (A B) (b) the set of relations R between A and B (c) the identity relation A on the set A [3 marks] Suppose S, T are relations between A and B, and...

Answered: 1 week ago

Question

★★★★★

The standard cost card for Balsam indicates each unit of product should take 1.5 hours of direct labor at a cost of $10 per hour. During the current period, Balsam produced 3,000 units, used 5,000...

Answered: 1 week ago

Question

★★★★★

Luxor Corporation has a 100% interest in a foreign subsidiary known as Luminaire. The foreign subsidiary was created for the primary purpose of distributing electronic components throughout a number...

Answered: 1 week ago

Question

★★★★★

Find the following. (a) (b) (c) (d) (e) (f) || n ||

Answered: 1 week ago

Question

★★★★★

Use the Body Fat data set to answer the following: (a) Regress Density on Age, Height, Neck, Chest, Abdomen, Hip, Thigh, Knee, Ankle, Biceps, Forearm, Wrist. (b) Check the assumptions associated with...

Answered: 1 week ago

Question

★★★★★

The accountant of Kooks Shoe Co. has compiled the following information from the companys records as a basis for an income statement for the year ended December 31, 2008. Rental revenue...

Answered: 1 week ago

Question

★★★★★

Mullineaux Corporation has a target capital structure of 66 percent common stock, 11 percent preferred stock, and 26 percent debt. Its cost of equity is 12 percent, the cost of preferred stock is 7...

Answered: 1 week ago

Question

★★★★★

A particle of mass 7 kg has position vector r = (2 x - 3 y) m at a particular instant of time when its velocity is v = (3.0 x) m/s with respect to the origin. a. What is the angular momentum of the...

Answered: 1 week ago

Question

★★★★★

Using the cognitive model and the BRUSO model discussed in class, write survey items for the following general questions. It may take more than one question. Responding to a survey item is itself a...

Answered: 1 week ago

Question

★★★★★

Read the following text carefully, then answer the question. Exciting New Gym Opening Soon!!! Northside Gym is opening soon, providing a new level of gym experience. The gym will meet all your...

Answered: 1 week ago

Question

★★★★★

Complete the self-scoring "Followership Questionnaire" in Ch. 13 (p. 487) of Leadership: Theory and Practice . DueThursday respond to the following: Based on your results from the "Followership...

Answered: 1 week ago

Question

★★★★★

Research by Ericsson and other suggests that about 10,000 hours of deliberate practice are required to become an expert. What is required for deliberate practice? Group of answer choices Focused...

Answered: 1 week ago

Question

★★★★★

International clothier Gap, Inc. is proud of its Code of Business Conduct that urges all those associated with the company to participate in "Doing the Right Thing." The business, founded in 1969,...

Answered: 1 week ago

Question

★★★★★

Which of the following is a consequence of the rise of partisanship in the House and Senate with respect to the presidency?Presidents are less likely to use executive ordersCongress is more willing...

Answered: 1 week ago

Question

★★★★★

Use the graphs of f and g to graph h(x) = (f + g) (x). To print an enlarged copy of the graph, go to MathGraphs.com. 1. 2. y 24 8. 2. -2 -2 4 6

Answered: 1 week ago

Question

★★★★★

15. Consider the type of clothes dryer (gas or electric) purchased by each of five different customers at a certain store. a. If the probability that at most one of these purchases an electric dryer...

Answered: 1 week ago

Question

★★★★★

24. Show that if one event A is contained in another event B (i.e., A is a subset of B), then . [Hint: For such A and B, A and are disjoint and , as can be seen from a Venn diagram.] For general A...

Answered: 1 week ago

Question

★★★★★

21. An insurance company offers four different deductible levelsnone, low, medium, and highfor its homeowners policyholders and three different levelslow, medium, and highfor its automobile...

Answered: 1 week ago

Previous Question Next Question