Question

Posted on Jul 29, 2024

1. Data Acquisition and Initial Analysis:

Retrieve the MNIST dataset.

Perform exploratory data analysis to understand the dataset's structure, including:

i. how many images;

ii. how many features and the range of feature values (e.g., a histogram of the data values), relating them to the real world, such as real images;

iii. how many categories/labels (discrete or continuous type) and what they are;

iv. visualize at least three randomly selected samples within each category (feel the variance of the data);

v. visualize more data samples to see whether there are bad data samples that need to be removed. What do you think bad data samples could look like?
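The exploratory questions above can be sketched in a few lines of NumPy. This is a minimal sketch, not a reference solution: the helper names `load_mnist`, `summarize`, and `sample_per_class` are hypothetical, and it assumes the dataset is fetched from OpenML under the name `mnist_784` through scikit-learn's `fetch_openml`.

```python
import numpy as np


def load_mnist():
    """Fetch MNIST from OpenML (assumption: the 'mnist_784' dataset)."""
    from sklearn.datasets import fetch_openml
    mnist = fetch_openml("mnist_784", version=1, as_frame=False)
    # Labels arrive as strings ('0'..'9'); convert them to integers.
    return mnist.data, mnist.target.astype(np.uint8)


def summarize(X, y, bins=16):
    """Collect counts, value range, label set, and a pixel-value histogram."""
    labels, counts = np.unique(y, return_counts=True)
    hist, _ = np.histogram(X, bins=bins, range=(0, 256))
    return {
        "n_images": X.shape[0],
        "n_features": X.shape[1],
        "min": float(X.min()),
        "max": float(X.max()),
        "labels": labels.tolist(),
        "per_label": dict(zip(labels.tolist(), counts.tolist())),
        "hist": hist,
    }


def sample_per_class(y, k=3, seed=0):
    """Pick k random sample indices for every label, e.g. for plotting."""
    rng = np.random.default_rng(seed)
    return {
        int(c): rng.choice(np.flatnonzero(y == c), size=k, replace=False)
        for c in np.unique(y)
    }
```

In a notebook, `X, y = load_mnist()` followed by `summarize(X, y)` answers items i to iii, and each index returned by `sample_per_class(y)` can be shown with matplotlib via `plt.imshow(X[i].reshape(28, 28), cmap="gray")`. The labels are discrete digit classes, so the per-label counts also reveal any class imbalance.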

2. Data Preparation and Manipulation:

Apply dimensionality reduction techniques (PCA and t-SNE) to the MNIST dataset and visualize the results.

Split the dataset into training (60,000 samples) and testing (10,000 samples) sets.
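One way to approach this task is sketched below with scikit-learn's `PCA` and `TSNE`. The function names `split_mnist` and `embed_2d` are hypothetical, and two choices are assumptions rather than requirements of the assignment: t-SNE is run on a random subset because it is slow on all 70,000 images, and the split simply takes the first 60,000 rows, which matches the conventional train/test ordering of the OpenML copy of MNIST.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE


def split_mnist(X, y, n_train=60_000):
    """Use the first n_train rows for training and the rest for testing."""
    return X[:n_train], X[n_train:], y[:n_train], y[n_train:]


def embed_2d(X, n_subset=5_000, seed=42):
    """Project a random subset to 2-D with PCA and with t-SNE."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=min(n_subset, len(X)), replace=False)
    X_sub = X[idx] / 255.0  # scale pixel values to [0, 1]
    pca_2d = PCA(n_components=2, random_state=seed).fit_transform(X_sub)
    tsne_2d = TSNE(n_components=2, init="pca", random_state=seed).fit_transform(X_sub)
    return idx, pca_2d, tsne_2d
```

The returned `idx` lets the plots be coloured by label, for example `plt.scatter(tsne_2d[:, 0], tsne_2d[:, 1], c=y[idx], cmap="tab10", s=3)`. PCA is a linear projection while t-SNE is nonlinear, so comparing the two scatter plots side by side is a natural way to discuss how well the digit classes separate.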

3. Machine Learning Model Implementation:

Train a Random Forest classifier on the original dataset and record its performance.

Use PCA to reduce the dataset's dimensionality to 174. Train a new Random Forest classifier on the reduced dataset and see how long it takes. Was training much faster? Then, evaluate the classifier on the test set. How does it compare to the previous classifier?
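The comparison described above could be sketched as follows. The helper names `fit_and_score` and `compare_raw_vs_pca` are hypothetical, and the hyperparameters (100 trees, a fixed seed) are assumptions, not values given in the assignment. PCA is fitted on the training set only and then applied to the test set, so no test information leaks into the projection.

```python
import time

from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier


def fit_and_score(X_train, y_train, X_test, y_test, seed=42, n_estimators=100):
    """Train a Random Forest; return (test accuracy, training seconds)."""
    clf = RandomForestClassifier(n_estimators=n_estimators, random_state=seed)
    start = time.perf_counter()
    clf.fit(X_train, y_train)
    elapsed = time.perf_counter() - start
    return clf.score(X_test, y_test), elapsed


def compare_raw_vs_pca(X_train, y_train, X_test, y_test, n_components=174, seed=42):
    """Run the same classifier on raw pixels and on PCA-reduced features."""
    raw = fit_and_score(X_train, y_train, X_test, y_test, seed)
    pca = PCA(n_components=n_components, random_state=seed)
    X_train_red = pca.fit_transform(X_train)  # fit on training data only
    X_test_red = pca.transform(X_test)
    reduced = fit_and_score(X_train_red, y_train, X_test_red, y_test, seed)
    return {"raw": raw, "pca": reduced}
```

Each entry of the result is an `(accuracy, seconds)` pair, which maps directly onto the two questions in the task. The sketch deliberately assumes no outcome: whether fewer dimensions actually shorten training depends on the model and the data, so the measured times should be reported as observed.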

4. Critical Evaluation and Conclusion:

Provide a comprehensive evaluation of the performance of the models.

Summarize findings and insights.
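A single accuracy number rarely counts as a comprehensive evaluation. As a minimal sketch, the hypothetical helper `evaluate` below adds a confusion matrix and per-class recall using scikit-learn's metrics; which metrics the assignment expects is an assumption.

```python
import numpy as np
from sklearn.metrics import accuracy_score, confusion_matrix


def evaluate(y_true, y_pred, labels=None):
    """Return overall accuracy, the confusion matrix, and per-class recall."""
    cm = confusion_matrix(y_true, y_pred, labels=labels)
    # Rows are true classes, so the diagonal over the row sum is recall.
    recall = cm.diagonal() / np.maximum(cm.sum(axis=1), 1)
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "confusion": cm,
        "recall": recall,
    }
```

Off-diagonal cells of the confusion matrix show which digit pairs a model mixes up, which gives the written summary something concrete to discuss beyond the headline accuracy.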

5. Research Question: Explore how various image preprocessing methods (e.g., normalization, binarization, noise reduction, and image augmentation) influence the performance of at least two different machine learning models (e.g., Convolutional Neural Networks and Random Forest classifiers) trained on the MNIST dataset. Analyze the models' accuracy, training time, and ability to generalize to test data. Discuss your findings' implications for designing machine learning pipelines in digit recognition tasks.
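The four preprocessing methods named above can each be written as a small function over flattened 28x28 images. This is a sketch under stated assumptions: the function names, the threshold of 128, the 3x3 median filter as the noise-reduction step, and the one-pixel shift as the augmentation are all illustrative choices, not prescribed by the assignment. It uses `median_filter` and `shift` from `scipy.ndimage`.

```python
import numpy as np
from scipy.ndimage import median_filter, shift


def normalize(X):
    """Scale pixel values from [0, 255] to [0, 1]."""
    return X.astype(np.float32) / 255.0


def binarize(X, threshold=128):
    """Map every pixel to 0 or 1 around a fixed threshold."""
    return (X >= threshold).astype(np.uint8)


def denoise(X, size=3):
    """Apply a size x size median filter to each 28x28 image."""
    imgs = X.reshape(-1, 28, 28)
    return median_filter(imgs, size=(1, size, size)).reshape(len(X), -1)


def augment_shift(X, y, dx=1, dy=0):
    """Append copies of every image shifted by (dy, dx) pixels."""
    imgs = X.reshape(-1, 28, 28)
    # order=0 avoids interpolation; vacated pixels are filled with 0.
    shifted = shift(imgs, (0, dy, dx), order=0, cval=0)
    return np.vstack([X, shifted.reshape(len(X), -1)]), np.concatenate([y, y])
```

Each function returns arrays in the same flattened layout, so the output can be passed straight to a Random Forest, or reshaped to `(-1, 28, 28, 1)` for a CNN. Augmentation should be applied to the training set only, otherwise the test accuracy no longer measures generalization to unseen images.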

6. Reflect on the composition and diversity of the MNIST dataset, considering its impact on the training process and model performance. Explore how the inclusion of a more diverse set of handwriting samples (e.g., different handwriting styles, inclusion of characters from non-Latin alphabets, or samples from wider age groups) might affect the accuracy and generalizability of machine learning models trained for digit recognition tasks.

Structure

Prepare a Jupyter notebook for this assignment. The notebook should alternate between text and Python code and cover the topics listed in the specific tasks above. Always refer to the textbook Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow for coding help.

How do I submit?

1. Prepare Your Submission: Ensure your Jupyter notebook (.ipynb) is complete with all required work.
