Question

In this project, we will be developing a basic neural network from the ground up to classify various types of fashion items. The primary objective of this project is to gain a comprehensive understanding of neural network architecture, including its theory and implementation details.
Categories in the dataset:
0: T-shirt/top
1: Trouser
2: Pullover
3: Dress
4: Coat
5: Sandal
6: Shirt
7: Sneaker
8: Bag
9: Ankle boot
# Notice that you don't need any other packages for this mid-term
import numpy as np
import pandas as pd
import random
from matplotlib import pyplot as plt

random.seed(42)  # NEVER change this line; this is for grading

# Reading the dataset
data = pd.read_csv('./fashion_data.csv')

# The data pre-processing is done for you. Please do NOT edit the cell
# However, you should understand what this code is doing
data = np.array(data)
m, n = data.shape
np.random.shuffle(data)  # shuffle before splitting into dev and training sets

data_dev = data[0:400].T
Y_dev = data_dev[-1]
X_dev = data_dev[0:n-1]
X_dev = X_dev / 255.

data_train = data[400:m].T
Y_train = data_train[-1]
X_train = data_train[0:n-1]
X_train = X_train / 255.
_, m_train = X_train.shape
# Define a global variable specifying the number of hidden neurons after the first layer
# Not the best practice, but we will do it for this mid-term project
num_hidden_neurons = 20
# Initialize the parameters in the neural network
# Based on the figure above, we need the weight and bias matrices.
# W1, b1 are the matrices for the first layer
# W2, b2 are the matrices for the second layer
# You should think about the sizes of the matrices,
# then initialize elements in each matrix to random numbers between -0.5 and +0.5
def init_params():
    W1 =  # Your code here
    b1 =  # Your code here
    W2 =  # Your code here
    b2 =  # Your code here
    return W1, b1, W2, b2
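One possible way to fill in `init_params` (a sketch, not the graded answer): assuming the usual 28×28 Fashion-MNIST images, there are 784 input features, `num_hidden_neurons` hidden units, and 10 output classes, which fixes the four shapes. `np.random.rand` samples uniformly from [0, 1), so subtracting 0.5 lands the values in the required range:

```python
import numpy as np

num_hidden_neurons = 20  # mirrors the global defined above
num_features = 784       # assumption: 28x28 Fashion-MNIST images
num_classes = 10

def init_params():
    # np.random.rand draws from [0, 1); subtracting 0.5 shifts the
    # values into the required [-0.5, 0.5) range
    W1 = np.random.rand(num_hidden_neurons, num_features) - 0.5
    b1 = np.random.rand(num_hidden_neurons, 1) - 0.5
    W2 = np.random.rand(num_classes, num_hidden_neurons) - 0.5
    b2 = np.random.rand(num_classes, 1) - 0.5
    return W1, b1, W2, b2
```

The biases are column vectors so that numpy broadcasting adds them to every training example at once.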
# As a starting point, you only need a ReLU function, its derivative, and the softmax function
def ReLU(Z):
    # Your code here

def ReLU_deriv(Z):
    # Your code here

def softmax(Z):
    # Your code here
    return A
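A minimal sketch of the three activation functions. The only subtlety is softmax: subtracting the column-wise maximum before exponentiating avoids overflow without changing the result:

```python
import numpy as np

def ReLU(Z):
    # Element-wise max(0, z)
    return np.maximum(Z, 0)

def ReLU_deriv(Z):
    # Slope is 1 where Z > 0 and 0 elsewhere
    return (Z > 0).astype(float)

def softmax(Z):
    # Subtract the column-wise max for numerical stability;
    # each column of A then sums to 1
    expZ = np.exp(Z - Z.max(axis=0, keepdims=True))
    A = expZ / expZ.sum(axis=0, keepdims=True)
    return A
```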
# In the forward propagation function, X is the input (the images in vector form), and we pass all the weights and biases
def forward_prop(W1, b1, W2, b2, X):
    Z1 =  # Your code here
    A1 =  # Your code here
    Z2 =  # Your code here
    A2 =  # Your code here
    return Z1, A1, Z2, A2
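With the shape conventions above (each column of `X` is one image), the forward pass is four matrix operations. A sketch, reusing the `ReLU` and `softmax` helpers:

```python
import numpy as np

def ReLU(Z):
    return np.maximum(Z, 0)

def softmax(Z):
    expZ = np.exp(Z - Z.max(axis=0, keepdims=True))
    return expZ / expZ.sum(axis=0, keepdims=True)

def forward_prop(W1, b1, W2, b2, X):
    # X has shape (features, m); each column is one image
    Z1 = W1.dot(X) + b1   # pre-activation of the hidden layer
    A1 = ReLU(Z1)         # hidden activations
    Z2 = W2.dot(A1) + b2  # pre-activation of the output layer
    A2 = softmax(Z2)      # class probabilities, one column per image
    return Z1, A1, Z2, A2
```

Returning the intermediate `Z1`, `A1`, `Z2` matters because backpropagation reuses them.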
# This one_hot function converts a numeric label into a one-hot vector
def one_hot(Y):
    # Your code here
    return one_hot_Y
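One common way to write `one_hot` with fancy indexing (a sketch; it assumes the highest label value is present in `Y`, which holds for a full training set with all 10 classes):

```python
import numpy as np

def one_hot(Y):
    # Build an (m, num_classes) matrix of zeros, set one 1 per row,
    # then transpose so each *column* is one label vector
    one_hot_Y = np.zeros((Y.size, int(Y.max()) + 1))
    one_hot_Y[np.arange(Y.size), Y] = 1
    return one_hot_Y.T
```

The transpose keeps the same column-per-example convention as `A2`, so the two can be subtracted directly in backpropagation.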
# Now perform the backward propagation
# Each line is only one expression, but there is a lot of calculus behind it
def backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y):
    one_hot_Y = one_hot(Y)
    dZ2 =  # Your code here
    dW2 =  # Your code here
    db2 =  # Your code here
    dZ1 =  # Your code here
    dW1 =  # Your code here
    db1 =  # Your code here
    return dW1, db1, dW2, db2
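The one-liners fall out of the chain rule: for softmax with cross-entropy loss, the output-layer error is simply `A2 - one_hot_Y`, and each gradient is averaged over the `m` examples in the batch. A self-contained sketch under the same shape conventions:

```python
import numpy as np

def ReLU_deriv(Z):
    return (Z > 0).astype(float)

def one_hot(Y):
    one_hot_Y = np.zeros((Y.size, int(Y.max()) + 1))
    one_hot_Y[np.arange(Y.size), Y] = 1
    return one_hot_Y.T

def backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y):
    m = Y.size                              # number of training examples
    one_hot_Y = one_hot(Y)
    dZ2 = A2 - one_hot_Y                    # softmax + cross-entropy gradient
    dW2 = 1 / m * dZ2.dot(A1.T)
    db2 = 1 / m * np.sum(dZ2, axis=1, keepdims=True)
    dZ1 = W2.T.dot(dZ2) * ReLU_deriv(Z1)    # chain rule back through ReLU
    dW1 = 1 / m * dZ1.dot(X.T)
    db1 = 1 / m * np.sum(dZ1, axis=1, keepdims=True)
    return dW1, db1, dW2, db2
```

A quick sanity check during debugging: every gradient must have exactly the same shape as the parameter it updates.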
# Finally, we are ready to update the parameters
def update_params(W1, b1, W2, b2, dW1, db1, dW2, db2, alpha):
    W1 =  # Your code here
    b1 =  # Your code here
    W2 =  # Your code here
    b2 =  # Your code here
    return W1, b1, W2, b2
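The update is plain gradient descent: step each parameter a distance `alpha` against its gradient. A sketch:

```python
def update_params(W1, b1, W2, b2, dW1, db1, dW2, db2, alpha):
    # Gradient descent: move each parameter against its gradient,
    # scaled by the learning rate alpha
    W1 = W1 - alpha * dW1
    b1 = b1 - alpha * db1
    W2 = W2 - alpha * dW2
    b2 = b2 - alpha * db2
    return W1, b1, W2, b2
```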
# Implement the helper functions. We need to convert the softmax output into a numeric label;
# this is done through the get_predictions function
def get_predictions(A2):
    # Your code here

# We also want a simple function to compute the accuracy. Notice that "predictions" and "Y" have the same shape
def get_accuracy(predictions, Y):
    return  # Your code here
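Since each column of `A2` is a probability distribution over the 10 classes, the predicted label is the row index of the largest entry, and accuracy is the fraction of matching labels. A sketch:

```python
import numpy as np

def get_predictions(A2):
    # Row index of the largest probability in each column
    return np.argmax(A2, axis=0)

def get_accuracy(predictions, Y):
    # Fraction of positions where the predicted label equals the true label
    return np.sum(predictions == Y) / Y.size
```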
# Finally, we are ready to implement gradient descent
def gradient_descent(X, Y, alpha, iterations):
    W1, b1, W2, b2 =  # Your code here - using the function you have implemented
    for i in range(iterations):
        Z1, A1, Z2, A2 =  # Your code here - using the function you have implemented
        dW1, db1, dW2, db2 =  # Your code here - using the function you have implemented
        W1, b1, W2, b2 =  # Your code here - using the function you have implemented
        if i % 10 == 0:
            print("Iteration: ", i)
            predictions = get_predictions(A2)
            print(get_accuracy(predictions, Y))
    return W1, b1, W2, b2
# Validation set performance
# (W1, b1, W2, b2 here are the trained parameters returned by gradient_descent)
def make_predictions(X, W1, b1, W2, b2):
    _, _, _, A2 = forward_prop(W1, b1, W2, b2, X)
    predictions = get_predictions(A2)
    return predictions

dev_predictions = make_predictions(X_dev, W1, b1, W2, b2)
get_accuracy(dev_predictions, Y_dev)
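Putting all the pieces together, here is a hedged end-to-end sketch of how the trained parameters come about. It runs on small synthetic data just to show the wiring; in the real notebook you would call `gradient_descent(X_train, Y_train, ...)`, and the learning rate and iteration count below (0.10 and 50) are illustrative choices, not values from the assignment:

```python
import numpy as np

def init_params(n_in, n_hidden, n_out):
    # Illustrative variant of init_params with explicit sizes
    W1 = np.random.rand(n_hidden, n_in) - 0.5
    b1 = np.random.rand(n_hidden, 1) - 0.5
    W2 = np.random.rand(n_out, n_hidden) - 0.5
    b2 = np.random.rand(n_out, 1) - 0.5
    return W1, b1, W2, b2

def ReLU(Z):
    return np.maximum(Z, 0)

def ReLU_deriv(Z):
    return (Z > 0).astype(float)

def softmax(Z):
    expZ = np.exp(Z - Z.max(axis=0, keepdims=True))
    return expZ / expZ.sum(axis=0, keepdims=True)

def forward_prop(W1, b1, W2, b2, X):
    Z1 = W1.dot(X) + b1
    A1 = ReLU(Z1)
    Z2 = W2.dot(A1) + b2
    A2 = softmax(Z2)
    return Z1, A1, Z2, A2

def one_hot(Y, n_out):
    oh = np.zeros((Y.size, n_out))
    oh[np.arange(Y.size), Y] = 1
    return oh.T

def backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y, n_out):
    m = Y.size
    dZ2 = A2 - one_hot(Y, n_out)
    dW2 = 1 / m * dZ2.dot(A1.T)
    db2 = 1 / m * dZ2.sum(axis=1, keepdims=True)
    dZ1 = W2.T.dot(dZ2) * ReLU_deriv(Z1)
    dW1 = 1 / m * dZ1.dot(X.T)
    db1 = 1 / m * dZ1.sum(axis=1, keepdims=True)
    return dW1, db1, dW2, db2

def gradient_descent(X, Y, alpha, iterations, n_hidden=20, n_out=10):
    W1, b1, W2, b2 = init_params(X.shape[0], n_hidden, n_out)
    for i in range(iterations):
        Z1, A1, Z2, A2 = forward_prop(W1, b1, W2, b2, X)
        dW1, db1, dW2, db2 = backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y, n_out)
        W1 = W1 - alpha * dW1
        b1 = b1 - alpha * db1
        W2 = W2 - alpha * dW2
        b2 = b2 - alpha * db2
    return W1, b1, W2, b2

# Tiny synthetic stand-in for the real data, just to exercise the wiring
np.random.seed(42)
X_toy = np.random.rand(784, 50)
Y_toy = np.random.randint(0, 10, size=50)
W1, b1, W2, b2 = gradient_descent(X_toy, Y_toy, alpha=0.10, iterations=50)
_, _, _, A2 = forward_prop(W1, b1, W2, b2, X_toy)
acc = np.mean(np.argmax(A2, axis=0) == Y_toy)
```

Random noise is not learnable, so `acc` is only a sanity check that the pipeline runs; real accuracy numbers come from `X_train`/`X_dev`.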
Part 2: Error Analysis and Performance Improvements
You will now try to improve the model's performance through, for example, different activation functions, learning-rate changes, expanded network complexity, regularization, or dropout. Implement these ideas and compare each one to the base model you completed in Part 1. An idea may or may not actually improve the model; in either case, explain why you did or did not succeed in building a better model. You must implement at least three different models.
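As one concrete example of a Part 2 experiment (a sketch of a single idea, not a required design): leaky ReLU is a drop-in replacement for ReLU that keeps a small slope for negative inputs, so hidden units cannot "die" with a permanently zero gradient. Only the activation and its derivative change; the rest of the pipeline stays identical:

```python
import numpy as np

def leaky_ReLU(Z, slope=0.01):
    # Unlike ReLU, negative inputs keep a small nonzero output (and gradient)
    return np.where(Z > 0, Z, slope * Z)

def leaky_ReLU_deriv(Z, slope=0.01):
    # Slope 1 for positive inputs, `slope` for non-positive ones
    return np.where(Z > 0, 1.0, slope)
```

To compare against the base model, train both networks with the same seed, learning rate, and iteration count, and report the dev-set accuracy of each.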
