Question

Description: This assignment aims to build an understanding of low-level neural network implementation and high-level neural network application. Specifically, students need to use the CIFAR100 data to train two models: 1) a simple convolutional neural network (CNN) model built from scratch and 2) an existing CNN model.

Task I: Implement a simple CNN and train the model.

1. The basic architecture is shown in Figure 1. However, some parameters are moved on purpose. Please follow the description below to complete the implementation.

o The model has four convolutional (Conv) layers and three fully connected (FC) layers. A ReLU function follows each layer except the output layer. A max-pooling layer with a kernel size of 2x2 and a stride of 2 is applied after each of the first two Conv layers.

o For the Conv layers, the 1st layer has six 5x5 filters, the 2nd layer has 12 5x5 filters, the 3rd layer has 24 5x5 filters, and the 4th layer has 48 3x3 filters.

o For the FC layers, the 1st layer has 120 neurons, the 2nd layer has 84 neurons, and the last one has 100 neurons.
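The layer sizes above do not fix the padding, so it helps to trace the feature-map sizes before writing the model. The sketch below is only a shape calculator, not the assignment's reference implementation. It assumes 32x32 CIFAR100 inputs and a stride of 1 for every Conv layer, and the `same_padding` option (padding of `kernel // 2`) is an assumption that the description does not confirm. The names `conv_out`, `pool_out`, `trace_shapes`, and `flatten_features` are hypothetical helpers.

```python
def conv_out(size, kernel, padding=0, stride=1):
    """Spatial output size of a convolution along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_out(size, kernel=2, stride=2):
    """Spatial output size of a max-pooling layer along one dimension."""
    return (size - kernel) // stride + 1


# (out_channels, kernel_size, followed_by_pool) for the four Conv layers
# described in the task. Pooling follows only the first two layers.
CONV_LAYERS = [(6, 5, True), (12, 5, True), (24, 5, False), (48, 3, False)]


def trace_shapes(input_size=32, same_padding=True):
    """Return the (channels, height, width) after each Conv block.

    same_padding=True uses padding = kernel // 2, which is an assumption,
    not something stated in the task description.
    """
    size = input_size
    shapes = []
    for channels, kernel, pooled in CONV_LAYERS:
        padding = kernel // 2 if same_padding else 0
        size = conv_out(size, kernel, padding)
        if size < 1:
            raise ValueError(
                f"feature map vanished at the {kernel}x{kernel} Conv layer"
            )
        if pooled:
            size = pool_out(size)
        shapes.append((channels, size, size))
    return shapes


def flatten_features(shapes):
    """Number of inputs the first FC layer receives after flattening."""
    channels, height, width = shapes[-1]
    return channels * height * width


if __name__ == "__main__":
    shapes = trace_shapes(same_padding=True)
    print(shapes)
    print(flatten_features(shapes))
```

Under these assumptions the first FC layer would take 48 * 8 * 8 = 3072 inputs. Without any padding the 32x32 input shrinks to 1x1 after the third Conv layer, so the 3x3 fourth layer cannot be applied, which suggests that padding is one of the parameters to be chosen.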

2. After implementing the CNN model, train the model for 20 epochs to see what kind of performance you might get. Please try to get the performance as good as possible. Plot the training/testing loss and F1 score for each epoch and submit those later. You should be able to achieve a performance similar to Figure 2 (e.g., ~25% accuracy or 0.25 F1 score). If your performance is way worse than that, it may indicate something is wrong.
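The task asks for an F1 score per epoch but does not say how to average it over the 100 classes. As one possible reading, the sketch below computes a macro-averaged F1 from integer labels in plain Python. The function name `macro_f1` is hypothetical, and scoring a class that never appears in either the labels or the predictions as 0 is an assumption.

```python
from collections import Counter


def macro_f1(y_true, y_pred, num_classes):
    """Macro-averaged F1: the unweighted mean of the per-class F1 scores."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    scores = []
    for c in range(num_classes):
        denom = 2 * tp[c] + fp[c] + fn[c]
        # Assumption: a class with no true or predicted samples scores 0.
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return sum(scores) / num_classes


if __name__ == "__main__":
    # Tiny 3-class example; real use would pass the epoch's labels
    # and the argmax of the model outputs with num_classes=100.
    print(macro_f1([0, 0, 1, 1, 2, 2], [0, 1, 1, 1, 2, 0], num_classes=3))
```

Collecting this value once per epoch for the training set and the test set, together with the mean loss, gives the curves the task asks to plot.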

Task II: Apply an existing CNN model to CIFAR100 (e.g., AlexNet, ResNet, DenseNet, etc.) and try to get the performance as high as possible. The model needs to be trained for at least 10 epochs. Students may use any techniques to improve the performance, such as data augmentation, pre-training, different optimizers, etc. After the experiments, think about the major factors behind the performance difference between your Task II model and the Task I model.
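Data augmentation is one of the techniques named above. To show what it does to a single image, here is a minimal sketch of two common augmentations, a zero-padded random crop and a horizontal flip, written in plain Python on one image channel stored as a list of rows. The helper names, the padding of 4 pixels, and the flip probability of 0.5 are illustrative assumptions. In an actual training pipeline the equivalent transforms from a deep learning library would normally be used instead.

```python
import random


def hflip(image):
    """Mirror an H x W image (a list of rows) left to right."""
    return [row[::-1] for row in image]


def pad_and_random_crop(image, pad, rng):
    """Zero-pad by `pad` pixels on every side, then crop back to H x W
    at a random offset, which shifts the content by up to `pad` pixels."""
    height, width = len(image), len(image[0])
    blank = [0] * (width + 2 * pad)
    padded = (
        [list(blank) for _ in range(pad)]
        + [[0] * pad + list(row) + [0] * pad for row in image]
        + [list(blank) for _ in range(pad)]
    )
    top = rng.randint(0, 2 * pad)   # inclusive bounds
    left = rng.randint(0, 2 * pad)
    return [row[left:left + width] for row in padded[top:top + height]]


def augment(image, rng, pad=4, flip_prob=0.5):
    """Apply a random crop, then a horizontal flip with probability flip_prob."""
    out = pad_and_random_crop(image, pad, rng)
    if rng.random() < flip_prob:
        out = hflip(out)
    return out


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so that runs are repeatable
    channel = [[r * 32 + c for c in range(32)] for r in range(32)]
    augmented = augment(channel, rng)
    print(len(augmented), len(augmented[0]))
```

The output keeps the 32x32 size, so augmented images can be fed to the same model as the originals. Augmentation is applied to training images only, with the test set left unchanged.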

Note: PLEASE GIVE A FULL ANSWER FOR BOTH TASKS WITH FULL CODE. IN MY LAST POST I RECEIVED AN INCOMPLETE ANSWER.
