5. You are working as an assistant biologist to Charles Darwin on the Beagle voyage. You are at the Galápagos Islands, and you have just discovered a new animal that has not yet been classified. Mr. Darwin has asked you to classify the animal using a nearest neighbor approach, and he has supplied you with the following dataset of already classified animals.

The descriptive features of the mysterious newly discovered animal are as follows:

a. A good measure of distance between two instances with categorical features is the overlap metric (also known as the Hamming distance), which simply counts the number of descriptive features that have different values. Using this measure of distance, compute the distances between the mystery animal and each of the animals in the animal dataset.
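
As a minimal sketch of the overlap (Hamming) metric described in part a, the Python below counts the positions at which two feature tuples differ. The four classified animals are taken from the dataset in the question. The query tuple `hypothetical_query` is a placeholder assumed purely for illustration and should be replaced with the mystery animal's actual feature values; the names `dataset`, `hamming`, and `distances` are likewise just illustrative.

```python
# Feature order: BIRTHS LIVE YOUNG, LAYS EGGS, FEEDS OFFSPRING OWN MILK,
# WARM-BLOODED, COLD-BLOODED, LAND AND WATER BASED, HAS HAIR, HAS FEATHERS
dataset = {
    1: ((True, False, True, True, False, False, True, False), "mammal"),
    2: ((False, True, False, False, True, True, False, False), "amphibian"),
    3: ((True, False, True, True, False, False, True, False), "mammal"),
    4: ((False, True, False, True, False, True, False, True), "bird"),
}

# ASSUMPTION: placeholder values for illustration only; substitute the
# mystery animal's real descriptive features here.
hypothetical_query = (False, True, False, False, False, True, False, False)


def hamming(a, b):
    """Count the features on which two instances disagree."""
    if len(a) != len(b):
        raise ValueError("instances must have the same number of features")
    return sum(x != y for x, y in zip(a, b))


# Distance from the query to every classified animal, keyed by ID.
distances = {
    animal_id: hamming(features, hypothetical_query)
    for animal_id, (features, _label) in dataset.items()
}
print(distances)  # {1: 6, 2: 1, 3: 6, 4: 2} for the placeholder query
```

Because every feature is binary, each mismatch contributes exactly 1 to the distance, so the result always lies between 0 and the number of features (8 here).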

b. If you used a 1-NN model, what class would be assigned to the mystery animal?

c. If you used a 4-NN model, what class would be assigned to the mystery animal? Would this be a good value for k for this dataset?
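
Parts b and c both come down to a majority vote among the k closest instances. The sketch below assumes the distances have already been computed and paired with class labels; the list `hypothetical_neighbours` holds illustrative distances assumed for this example, not values given in the question, and `knn_vote` is a name chosen here for convenience.

```python
from collections import Counter


def knn_vote(neighbours, k):
    """Return the majority class among the k nearest (distance, label) pairs.

    Ties in the vote are broken in favour of the class whose first
    instance appears earliest in the distance-sorted list.
    """
    if not 1 <= k <= len(neighbours):
        raise ValueError("k must be between 1 and the number of instances")
    nearest = sorted(neighbours, key=lambda pair: pair[0])[:k]
    votes = Counter(label for _distance, label in nearest)
    return votes.most_common(1)[0][0]


# ASSUMPTION: illustrative (distance, class) pairs, one per classified animal.
hypothetical_neighbours = [
    (6, "mammal"),
    (1, "amphibian"),
    (6, "mammal"),
    (2, "bird"),
]

print(knn_vote(hypothetical_neighbours, 1))  # amphibian
print(knn_vote(hypothetical_neighbours, 4))  # mammal
```

With only four classified animals, k = 4 makes every instance vote, so the model returns the majority class of the whole dataset no matter what the query looks like. That is worth keeping in mind when judging whether 4 is a sensible value of k here.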

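| ID | BIRTHS LIVE YOUNG | LAYS EGGS | FEEDS OFFSPRING OWN MILK | WARM-BLOODED | COLD-BLOODED | LAND AND WATER BASED | HAS HAIR | HAS FEATHERS | CLASS |
|----|-------------------|-----------|--------------------------|--------------|--------------|----------------------|----------|--------------|-------|
| 1 | true | false | true | true | false | false | true | false | mammal |
| 2 | false | true | false | false | true | true | false | false | amphibian |
| 3 | true | false | true | true | false | false | true | false | mammal |
| 4 | false | true | false | true | false | true | false | true | bird |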
