Question:
5. You are working as an assistant biologist to Charles Darwin on the Beagle voyage. You are at the Galápagos Islands, and you have just discovered a new animal that has not yet been classified. Mr. Darwin has asked you to classify the animal using a nearest neighbor approach, and he has supplied you the following dataset of already classified animals:
The descriptive features of the mysterious newly discovered animal are as follows:
a. A good measure of distance between two instances with categorical features is the overlap metric (also known as the Hamming distance), which simply counts the number of descriptive features that have different values. Using this measure of distance, compute the distances between the mystery animal and each of the animals in the animal dataset.
b. If you used a 1-NN model, what class would be assigned to the mystery animal?
c. If you used a 4-NN model, what class would be assigned to the mystery animal? Would this be a good value for k for this dataset?
Step by Step Answer:
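The dataset table and the mystery animal's features are not reproduced on this page, so the sketch below only illustrates the procedure that parts a to c ask for. The rows, feature values, class labels, and the names `animals`, `mystery`, `overlap_distance`, and `knn_predict` are all hypothetical placeholders, not the book's data or its answer.

```python
from collections import Counter

# HYPOTHETICAL placeholder data: the real table is missing from this page.
# Each row is (tuple of categorical feature values, class label).
animals = [
    (("yes", "no", "yes"), "mammal"),
    (("no", "yes", "no"), "reptile"),
    (("yes", "yes", "no"), "amphibian"),
    (("no", "yes", "yes"), "reptile"),
]
mystery = ("no", "yes", "yes")  # hypothetical query instance


def overlap_distance(a, b):
    """Overlap (Hamming) distance: count the features whose values differ."""
    if len(a) != len(b):
        raise ValueError("instances must have the same number of features")
    return sum(x != y for x, y in zip(a, b))


def knn_predict(query, dataset, k):
    """Majority vote over the k rows nearest to the query."""
    # sorted() is stable, so rows at equal distance keep their dataset order.
    ranked = sorted(dataset, key=lambda row: overlap_distance(query, row[0]))
    votes = Counter(label for _, label in ranked[:k])
    # On a tied vote, the label encountered first (the nearer row) is returned.
    return votes.most_common(1)[0][0]


# Part a: distance from the query to every row.
distances = [overlap_distance(mystery, features) for features, _ in animals]

# Parts b and c: predictions for k = 1 and k = 4.
prediction_1nn = knn_predict(mystery, animals, k=1)
prediction_4nn = knn_predict(mystery, animals, k=4)
```

With the real table substituted for `animals` and `mystery`, the same three steps apply. On the question of whether a given k is sensible, note that as k approaches the number of rows in the dataset, the vote tends toward the overall majority class regardless of the query, and that an even k can produce tied votes.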
Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked Examples, and Case Studies
ISBN: 9780262029445
1st Edition
Authors: John D. Kelleher, Brian Mac Namee, Aoife D'Arcy