Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Problem Statement ( in natural language ) . You are given a set of 5 6 9 instances, each with information about a patient s
Problem Statement in natural language You are given a set of instances, each with information about a patients cell, that is or is not cancerous. The information includes the patients ID which should not be relevant to hisher diagnosis and a diagnosis malignant or benign as well as a set of attributes that are usually used for diagnosis. You are asked to design, implement, and train a machine learning model, and assess its quality in terms of a accuracy of diagnosis of instances not used for training and b computational efficiency and scalability of your solution. The machine learning model, which an expert recommended, is called knearest neighbour or kNN You are required to use a tree data structure that would optimize both accuracy and efficiency with accuracy taking precedence In answer to this problem, submit Java program, report PDF and signed expectations of originality form PDF The report must include the following IIII items.
I. A pargraph characterization of the problem, in abstract and precise computer science terms, excluding all irrelevant details and superfluous language. Adherence to formatting instructions see Formatting on page counts towards this part of the grade.
II A description of the model you will use to solve the problem: The main data structures as annotated diagrams; The main algorithms operating on the data structures as highlevel flow charts augmented as necessary by lowerlevel pseudocodes of the main methods employed.
III. The results with analysis of the testing of the trained model, presented as figures that show how much running time and testing accuracy change as a function of the number of training instances N T is the number of test instances; each figure should have three series of points for the three values of k see below
Guidelines. Use N The ratio NT must be maintained at An instance used for training, in a given trainandtest run, cannot be used for testing. Also, you must retrain the model prior to each test. Further, the NT instances must be chosen at random from the complete set. Though this is not the case in medical practice implement accuracy as the percentage of test instances T that are correctly predicted by your trained model ie predicted diagnosis actual diagnosis For running time, just use actual execution time of testing, not
Page of
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started