Answered step by step
Verified Expert Solution
Question
1 Approved Answer
A k - nearest neighbour algorithm is used to develop a classifier. The data set contains noise. Which of the following strategies will help to
A knearest neighbour algorithm is used to develop a classifier. The data set contains noise. Which of the
following strategies will help to reduce sensitivity to noise? Simply write down the letters of the correct
approaches Note that incorrect answers will be penalized.
a Reduce the value of
b Use a larger value for
c Find all Tomek links and remove both instances that form the Tomek link
d Nothing can be done to reduce sensitivity to noise
e Use SMOTE to oversample the minority class
Consider a regression problem, where the data set has the following characteristics:
There are instances
There is one categorical and five numeric descriptive features
The categorical feature has three possible values. One of these values occurs for of the instances,
and the other two values occur respectively for and instances
Four of the numeric features have values in the range and the fifth numeric feature has values
in the range
For of the instances there are outliers for the target feature
One of the numeric descriptive features has a few outliers
One of the numeric descriptive features has missing values
A knearest neighbour algorithm is used and Euclidean distance is used as the similarity measure. Which
of the following statements are correct? Simply write down the letters of the correct statements Be
careful; marks will be subtracted for incorrect answers.
a The value for has to be large
b Onehot encoding has to be applied to the categorical feature
c The numerical features have to be scaled to the same range, eg
d The outliers in the numeric descriptive feature have to be removed
e The missing values have to be imputed
f Undersampling or oversampling has to be applied to the categorical feature to balance the distri
bution of possible values
g The predicted value is calculated using the average over the target values of the neighbors
What is the inductive bias of the nearest neighbour algorithm?
Is it necessary to normalize input features when a nearest neighbour algorithm is used? Motivate your
answer.
Explain how nearest neighbours can be used to impute missing values.
Can the nearest neighbour algorithm be applied to problems with categorical descriptive features?
Motivate your answer.
Discuss the consequences of different values for when nearest neigbours is applied to regression prob
lerns.
AML: Describe how nearest neighbours can be used to classify images.
AML: Describe how nearest neighbours can be used to correct misspelt words using an example. How
would you decide how to modify the original word if is equal to three?
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started