Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Description : This aims to understanding of low-level neural network implementation and high-level neural network application. Specifically, students need to use the CIFAR100 data to

Description: This aims to understanding of low-level neural network implementation and high-level neural network application. Specifically, students need to use the CIFAR100 data to train two models: 1) a simple convolutional neural network (CNN) model building from scratch and 2) an existing CNN model.

Tasks I: Implement a simple CNN and train the model. 1.The basic architecture is shown in Figure 1. However, some parameters are moved on purpose. Please follow the description below to complete the implementation. o The model has four convolutional (Conv) layers and three fully connected (FC)layers. A ReLU functions follows each layer except the output layer. A max- pooling layer with a kernel size of 2x2 and a stride of 2 is applied to each of the first two Conv layers.

o For the Conv layers, the 1st layer has six 5x5 filters, the 2nd layer has 12 5x5 filters, the 3rd layer has 24 5x5 filters, and the 4th layer has 48 3x3 filters.

o For the FC layers, the 1st layer has 120 neurons, the 2nd layer has 84 neurons, and the last one has 100 neurons.

2. After implementing the CNN model, train the model for 20 epochs to see what kind of performance you might get. Please try to get the performance as good as possible. Plot the training/testing loss and F1 score for each epoch and submit those later. You should be able to achieve a performance similar to Figure 2 (e.g., ~25% accuracy or 0.25 F1 score). If your performance is way worse than that, it may indicate something is wrong.

Task II: Applying an existing CNN model on CIFAR100 (e.g., AlexNet, ResNet, DenseNet, etc.) and trying to get the best performance as higher as possible. The model needs to be trained for at least 10 epochs. Students may use any techniques to improve the performance, such as data augmentation, pre-training, different optimizers, etc. After the experiments, think about what are the major factors that make the performance difference between your model of Task II and the one in Task I

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Current Trends In Database Technology Edbt 2006 Edbt 2006 Workshops Phd Datax Iidb Iiha Icsnw Qlqp Pim Parma And Reactivity On The Web Munich Germany March 2006 Revised Selected Papers Lncs 4254

Authors: Torsten Grust ,Hagen Hopfner ,Arantza Illarramendi ,Stefan Jablonski ,Marco Mesiti ,Sascha Muller ,Paula-Lavinia Patranjan ,Kai-Uwe Sattler ,Myra Spiliopoulou ,Jef Wijsen

2006th Edition

3540467882, 978-3540467885

More Books

Students also viewed these Databases questions

Question

Explain NULL and how it affects filtering, SQL

Answered: 1 week ago