Question
1. List three Algorithms or Models that can be used for Classification, and critically evaluate their merits. 2. Since many real-world relationships are not linear,
1. List three Algorithms or Models that can be used for Classification, and critically evaluate their merits. 2. Since many real-world relationships are not linear, should you always consider non-linear models such as high-degree polynomial functions instead of the basic linear regression models? Why? Please form your arguments based on theoretical considerations. However, empirical evidence is also acceptable.
3. What is the 'Curse of Dimensionality'? What problems could high dimensional data cause for the K-NN and the Decision Tree algorithms? In addition, what are the main motivations for reducing a dataset's dimensionality?
4. Prof. Ghahramani (Cambridge University) once said that one of his long-term research projects was to build an "automated statistician". What do you think that means? In addition, what do you think would go into building one and what are the major challenges?
5. How do Fitting graphs and Learning curves address the Overfitting problem? In addition, suppose you are using a multiple high-degree polynomial regression model to learn the data, and the Fitting graph shows that the testing data error is much higher than the training data error, please propose four possible solutions to address this.
6. Why would data-miners sometimes prefer to use a regularised model instead of a plain linear regression model? In addition, among the three regularisation methods you learnt from this module (RR, Lasso, E-N), discuss their data and model specification suitability
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started