Question

Posted on Jul 29, 2024

1. Data Acquisition and Initial Analysis:

Retrieve the MNIST dataset.

Perform exploratory data analysis to understand the dataset's structure, including:

i. how many images

ii. how many features and the range of feature values (e.g., histogram of the data values), relating it to the real world, such as real images

iii. how many categories/labels (discrete or continuous type) and what they are

iv. visualize at least three randomly selected samples within each category (feel the variance of the data)

v. visualize more data samples to see whether there are bad data samples that need to be removed. What do you think bad data samples could be?
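The exploratory checks listed above could be sketched roughly as follows. This is only an illustration, assuming the data is already in memory as a NumPy array `X` with one flattened image per row and a label array `y` (for example from scikit-learn's `fetch_openml("mnist_784", as_frame=False)`). The helper names `summarize`, `sample_indices_per_class`, and `plot_samples` are hypothetical, not part of any library.

```python
import numpy as np


def summarize(X, y):
    """Return basic facts about an image dataset stored as flat rows."""
    labels, counts = np.unique(y, return_counts=True)
    return {
        "n_images": X.shape[0],
        "n_features": X.shape[1],
        "value_range": (float(X.min()), float(X.max())),
        "labels": labels.tolist(),
        "per_label_counts": dict(zip(labels.tolist(), counts.tolist())),
    }


def sample_indices_per_class(y, n_per_class=3, seed=0):
    """Pick n_per_class random row indices for every distinct label."""
    rng = np.random.default_rng(seed)
    picks = {}
    for label in np.unique(y).tolist():
        candidates = np.flatnonzero(y == label)
        picks[label] = rng.choice(candidates, size=n_per_class, replace=False)
    return picks


def plot_samples(X, picks, side=28):
    """Show the picked rows as side x side grayscale images, one row per label."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for plotting

    n_cols = len(next(iter(picks.values())))
    fig, axes = plt.subplots(len(picks), n_cols, squeeze=False)
    for row, (label, idx) in enumerate(picks.items()):
        for col, i in enumerate(idx):
            axes[row, col].imshow(X[i].reshape(side, side), cmap="gray")
            axes[row, col].axis("off")
    plt.show()
```

With the OpenML copy of MNIST, `summarize` would be expected to report 70,000 images, 784 features (28 x 28 pixels), pixel values from 0 to 255, and ten discrete labels for the digits 0 to 9. A histogram of `X.ravel()` and a larger grid from `plot_samples` are natural next steps for items ii and v.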

2. Data Preparation and Manipulation:

Apply dimensionality reduction techniques (PCA and t-SNE) to the MNIST dataset and visualize the results.
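One possible way to produce the two 2-D embeddings is sketched below with scikit-learn's `PCA` and `TSNE`. The function name `embed_2d`, the subset size, and the choice to compress with PCA before running t-SNE are assumptions made for speed, not requirements of the assignment.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE


def embed_2d(X, n_subset=2000, seed=42):
    """Project a random subset of X to 2-D with PCA and with t-SNE."""
    rng = np.random.default_rng(seed)
    n = min(n_subset, X.shape[0])
    idx = rng.choice(X.shape[0], size=n, replace=False)
    X_sub = X[idx]

    # Linear projection onto the two directions of largest variance.
    X_pca = PCA(n_components=2, random_state=seed).fit_transform(X_sub)

    # t-SNE is slow on raw pixels, so compress with PCA first (assumed: up to 50 dims).
    n_pre = min(50, X_sub.shape[1], n)
    X_pre = PCA(n_components=n_pre, random_state=seed).fit_transform(X_sub)

    # Perplexity must be smaller than the number of samples.
    perplexity = min(30.0, (n - 1) / 3)
    tsne = TSNE(n_components=2, perplexity=perplexity, init="pca", random_state=seed)
    X_tsne = tsne.fit_transform(X_pre)
    return idx, X_pca, X_tsne
```

The returned `idx` lets the labels be matched to the embedded points, for instance `plt.scatter(X_tsne[:, 0], X_tsne[:, 1], c=y[idx].astype(int), cmap="tab10", s=4)` with matplotlib.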

Split the dataset into training (60,000 samples) and testing (10,000 samples) sets.
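A minimal sketch of the split, assuming the rows are in the conventional MNIST order in which the first 60,000 images form the training set and the last 10,000 the test set. The helper name `split_train_test` is hypothetical.

```python
import numpy as np


def split_train_test(X, y, n_train=60_000):
    """Split by position: first n_train rows for training, the rest for testing."""
    if not 0 < n_train < len(X):
        raise ValueError("n_train must be between 1 and len(X) - 1")
    return X[:n_train], X[n_train:], y[:n_train], y[n_train:]
```

If the row order is unknown or has been shuffled, scikit-learn's `train_test_split(X, y, test_size=10_000, stratify=y, random_state=42)` is an alternative that keeps the class proportions similar in both sets.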

3. Machine Learning Model Implementation:

Train a Random Forest classifier on the original dataset and record its performance.

Use PCA to reduce the dataset's dimensionality to 174. Train a new Random Forest classifier on the reduced dataset and see how long it takes. Was training much faster? Then, evaluate the classifier on the test set. How does it compare to the previous classifier?
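The comparison described above could be measured along these lines. This is a sketch under stated assumptions: the function names `fit_and_score` and `compare_raw_vs_pca` are hypothetical, the forest uses 100 trees, and timing covers only the call to `fit`. It records the numbers without assuming which variant is faster or more accurate.

```python
import time

from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier


def fit_and_score(X_train, y_train, X_test, y_test, seed=42):
    """Train a Random Forest and return (training seconds, test accuracy)."""
    clf = RandomForestClassifier(n_estimators=100, random_state=seed)
    start = time.perf_counter()
    clf.fit(X_train, y_train)
    elapsed = time.perf_counter() - start
    return elapsed, clf.score(X_test, y_test)


def compare_raw_vs_pca(X_train, y_train, X_test, y_test, n_components=174, seed=42):
    """Run the same classifier on raw features and on PCA-reduced features."""
    raw = fit_and_score(X_train, y_train, X_test, y_test, seed)

    # Fit PCA on the training set only, then apply the same projection to the test set.
    pca = PCA(n_components=n_components, random_state=seed)
    X_train_red = pca.fit_transform(X_train)
    X_test_red = pca.transform(X_test)

    reduced = fit_and_score(X_train_red, y_train, X_test_red, y_test, seed)
    return {"raw": raw, "pca": reduced}
```

Fitting PCA on the training rows alone keeps information from the test set out of the projection, so the two accuracy figures remain comparable.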

4. Critical Evaluation and Conclusion:

Provide a comprehensive evaluation of the performance of the models.

Summarize findings and insights.

5. Research Question: Explore how various image preprocessing methods (e.g., normalization, binarization, noise reduction, and image augmentation) influence the performance of at least two different machine learning models (e.g., Convolutional Neural Networks and Random Forest classifiers) trained on the MNIST dataset. Analyze the models' accuracy, training time, and ability to generalize to test data. Discuss your findings' implications for designing machine learning pipelines in digit recognition tasks.
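The four preprocessing methods named in the research question could be prototyped with plain NumPy as below. This is a rough sketch, assuming flattened 28 x 28 images with pixel values from 0 to 255. The function names, the threshold of 128, the 3 x 3 mean filter as the noise-reduction step, and pixel shifting as the augmentation are all illustrative choices, not prescribed by the assignment.

```python
import numpy as np


def normalize(X):
    """Scale 0-255 pixel values to the range [0, 1]."""
    return X.astype(np.float32) / 255.0


def binarize(X, threshold=128):
    """Map each pixel to 0 or 1 depending on a fixed threshold."""
    return (X >= threshold).astype(np.uint8)


def mean_filter(X, side=28):
    """Simple noise reduction: replace each pixel by its 3x3 neighbourhood mean."""
    imgs = X.reshape(-1, side, side).astype(np.float32)
    padded = np.pad(imgs, ((0, 0), (1, 1), (1, 1)), mode="edge")
    total = np.zeros_like(imgs)
    for dy in range(3):
        for dx in range(3):
            total += padded[:, dy:dy + side, dx:dx + side]
    return (total / 9.0).reshape(X.shape[0], -1)


def shift(X, dx, dy, side=28):
    """Augmentation: move each image by (dx, dy) pixels, filling gaps with 0."""
    imgs = X.reshape(-1, side, side)
    out = np.zeros_like(imgs)
    src = imgs[:, max(0, -dy):side - max(0, dy), max(0, -dx):side - max(0, dx)]
    top, left = max(0, dy), max(0, dx)
    out[:, top:top + src.shape[1], left:left + src.shape[2]] = src
    return out.reshape(X.shape[0], -1)
```

Each variant can then be fed to the same training and evaluation routine for every model under study. Augmented copies such as `shift(X_train, 1, 0)` would normally be added to the training set only, so that the test set still measures generalization to unmodified images.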

6. Reflect on the composition and diversity of the MNIST dataset, considering its impact on the training process and model performance. Explore how the inclusion of a more diverse set of handwriting samples (e.g., different handwriting styles, inclusion of characters from non-Latin alphabets, or samples from wider age groups) might affect the accuracy and generalizability of machine learning models trained for digit recognition tasks.

Structure

Prepare a Jupyter notebook for this assignment. The structure of the Jupyter notebook should alternate text and Python code and cover the topics listed in the specific tasks above. Always refer to the textbook Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow for coding help.

How do I submit?

1. Prepare Your Submission: Ensure your Jupyter notebook (.ipynb) is complete with all required work.
