Question
Description: This assignment aims to build an understanding of low-level neural network implementation and high-level neural network application. Specifically, students need to use the CIFAR100 data to train two models: 1) a simple convolutional neural network (CNN) model built from scratch and 2) an existing CNN model.
Task I: Implement a simple CNN and train the model.
1. The basic architecture is shown in Figure 1. However, some parameters are removed on purpose. Please follow the description below to complete the implementation.
o The model has four convolutional (Conv) layers and three fully connected (FC) layers. A ReLU function follows each layer except the output layer. A max-pooling layer with a kernel size of 2x2 and a stride of 2 is applied after each of the first two Conv layers.
o For the Conv layers, the 1st layer has six 5x5 filters, the 2nd layer has 12 5x5 filters, the 3rd layer has 24 5x5 filters, and the 4th layer has 48 3x3 filters.
o For the FC layers, the 1st layer has 120 neurons, the 2nd layer has 84 neurons, and the last one has 100 neurons.
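The description above does not give the padding or stride of the Conv layers, so the input size of the first FC layer has to be worked out. The sketch below traces the feature-map size through the four Conv layers, assuming 32x32 CIFAR100 images and a Conv stride of 1. The helper names (`conv_out`, `trace_shapes`) and the padding choices are illustrative assumptions, not part of the assignment.

```python
def conv_out(size, kernel, stride=1, padding=0):
    """Side length after a square conv or pooling layer (floor rounding)."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_shapes(conv_padding, input_size=32):
    """Return (channels, height, width) after each of the four Conv layers.

    conv_padding: one padding value per Conv layer (an assumption, since the
    task text leaves padding unspecified).
    """
    # (out_channels, kernel_size, followed_by_2x2_max_pool) as listed in the task
    convs = [(6, 5, True), (12, 5, True), (24, 5, False), (48, 3, False)]
    size = input_size
    shapes = []
    for (channels, kernel, pooled), pad in zip(convs, conv_padding):
        size = conv_out(size, kernel, padding=pad)
        if size < 1:
            raise ValueError(f"{kernel}x{kernel} kernel does not fit the feature map")
        if pooled:
            size = conv_out(size, 2, stride=2)  # 2x2 max pooling, stride 2
        shapes.append((channels, size, size))
    return shapes


def flattened_size(shapes):
    """Number of inputs the first FC layer (120 neurons) would receive."""
    channels, height, width = shapes[-1]
    return channels * height * width


if __name__ == "__main__":
    # Assumed option A: no padding on Conv 1-2, size-preserving padding on Conv 3-4
    print(trace_shapes((0, 0, 2, 1)))
    # Assumed option B: size-preserving padding on every Conv layer
    print(trace_shapes((2, 2, 2, 1)))
```

Under these assumptions, leaving all four Conv layers unpadded does not work: the third 5x5 layer shrinks the 5x5 map to 1x1, and the 3x3 kernel of the fourth layer no longer fits. Some padding on the later layers is therefore one plausible candidate for the parameters that were left out.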
2. After implementing the CNN model, train it for 20 epochs to see what kind of performance you can get. Try to make the performance as good as possible. Plot the training/testing loss and F1 score for each epoch and submit those later. You should be able to achieve a performance similar to Figure 2 (e.g., ~25% accuracy or 0.25 F1 score). If your performance is much worse than that, something may be wrong.
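The task asks for an F1 score per epoch but does not say which averaging to use. As a minimal sketch, the function below computes a macro-averaged F1 from integer class labels. The choice of macro averaging and the name `macro_f1` are assumptions. Classes that never appear in either list contribute a score of 0 here.

```python
def macro_f1(y_true, y_pred, num_classes=100):
    """Macro-averaged F1 over integer labels in the range [0, num_classes)."""
    tp = [0] * num_classes  # true positives per class
    fp = [0] * num_classes  # false positives per class
    fn = [0] * num_classes  # false negatives per class
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1
            fn[t] += 1
    scores = []
    for c in range(num_classes):
        denom = 2 * tp[c] + fp[c] + fn[c]
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return sum(scores) / num_classes


if __name__ == "__main__":
    # Toy labels with two classes, only to show the call
    print(macro_f1([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2))
```

Collecting this value once per epoch for both the training and testing splits, alongside the average loss, gives the series needed for the requested plots. For single-label classification, micro-averaged F1 equals accuracy, which may be why the task quotes the two numbers side by side.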
Task II: Apply an existing CNN model to CIFAR100 (e.g., AlexNet, ResNet, DenseNet, etc.) and try to get the performance as high as possible. The model needs to be trained for at least 10 epochs. Students may use any techniques to improve the performance, such as data augmentation, pre-training, different optimizers, etc. After the experiments, think about the major factors behind the performance difference between your Task II model and the one in Task I.