Answered step by step
Verified Expert Solution
Question
1 Approved Answer
You are to create a program using Python that does the following: 1 . Asks the user for the number of clusters. This is the
You are to create a program using Python that does the following:
Asks the user for the number of clusters. This is the parameter k that will be used for kmeans.
Reads the input file proginputdata.txt and stores the points into a list
Applies the kmeans algorithm to find the cluster for each point.
Displays the points that each cluster contains after each iteration of the algorithm
Writes the final cluster assignments to the screen and the output file progoutputdata.txt
YOU CANNOT USE ANY PYTHON PACKAGES FOR THIS PROGRAM NUMPY PANDAS, NO IMPORT STATEMENTS.
Additional Requirements
The name of your source code file should be kMeans.py All your code should be within a single file.
Your code should follow good coding practices, including good use of whitespace and use of both inline
and block comments.
You need to use meaningful identifier names that conform to standard naming conventions.
At the top of each file, you need to put in a block comment with the following information: your name,
date, course name, semester, and assignment name.
The output of your program should exactly match the sample program output given at the end. That is
for same input, it should generate the same output. Note that I may use other test cases for grading your
program and your code needs to work correctly in all cases.
Data File Format
Let N be the number of points and Pi to be the value of point i The input file should be of the following format:
P
P
PN
Example:
The name of the input file is always:
proginputdata.txt
What to Turn In
You will turn in a screenshot of your output and a single kMeans.py file using BlackBoard.
HINTS
Make use of list comprehensions for reading lines from a file and then converting the strings into a list of
floats.
Use pwd to check the directory where you should place your input file.
Use a dict data structures for storing centroids and clusters. The centroids dict will be a mapping from
cluster number to centroids. The clusters dict will be a mapping from cluster number to a list of points in
the cluster.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started