Answered step by step
Verified Expert Solution
Question
1 Approved Answer
The classic MNIST Digit RecognizerLinks to an external site. problem is a competition on Kaggle.com, and you will compete in this competition. For this assignment,
The classic MNIST Digit RecognizerLinks to an external site. problem is a competition on Kaggle.com, and you will compete in this competition. For this assignment, you will develop a classifier that may be used to predict which of the digits is being written.
ManagementResearch Question
In laymans terms, what is the managementresearch question of interest, and why would anyone care?
Requirements
Fit a random forest classifier using the full set of explanatory variables and the model training set csv
Record the time it takes to fit the model and then evaluate the model on the csvdata by submitting to Kaggle.com. Provide your Kaggle.com score and user ID
Execute principal components analysis PCA on the combined training and test set data together, generating principal components that represent percent of the variability in the explanatory variables. The number of principal components in the solution should be substantially fewer than the explanatory variables.
Record the time it takes to identify the principal components.
Using the identified principal components from step use thecsvto build another random forest classifier.
Record the time it takes to fit the model and to evaluate the model on the csvdata by submitting to Kaggle.com. Provide your Kaggle.com score and user ID
Use kmeans clustering to group MNIST observations into of categories and then assign labels. Follow the example here if needed: kmeans mnist.pdf Download kmeans mnist.pdfkmeans mnistpdf Download kmeans mnistpdf
Submit the RF Classifier, the PCA RF and kmeans estimations to Kaggle.com, and provide screen snapshots of your scores as well as your Kaggle.com user name.
The experiment we have proposed has a major design flaw. Identify the flaw. Fix it Rerun the experiment in a way that is consistent with a trainingandtest regimen, and submit this to Kaggle.com.
Report total elapsed time measures for the training set analysis. It is sufficient to run a single timeelapsed test for this assignment. In practice, we might consider the possibility of repeated executions of the relevant portions of the programs, much as the Benchmark Example programs do Some code that might help you with reporting elapsed total time follows.
startdatetime.now
rffittrainimageslabels
enddatetime.now
printendstart
Python Programming
All programming will be done in Python.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started