Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Oct 05, 2023

Part III Artificial Neural Networks (60 points) In this problem, we consider a bi-dimensional dataset of random points belonging to two classes. The first

|||||| || | // Figure 6: A 2-6-3-1 model to make the binary classification for our dataset. Finally, the

Model: "sequential" Layer (type) dense (Dense) dense 1 (Dense) dense 2 (Dense) Total params: 43 Trainable

Part III Artificial Neural Networks (60 points) In this problem, we consider a bi-dimensional dataset of random points belonging to two classes. The first class of points is inside the circle centered at 0 with radius 1, whereas the second class is inside a ring located between radius 1.5 and radius 2.5. All points are perturbed with a Gaussian noise of zero mean and standard deviation 0.1. A Bernoulli(p=0.6) process allows us to select 40% of points from the first class (red points) and 60% of points from the second class (blue points). 4000 points of the training dataset are plotted in Figure 5. We also generated a validation dataset of size 800 points. The two datasets can be uploaded to TensorFlow from the files: training_circular_bidim_dataset.csv and test_circular_bidim_dataset.csv 3 2 W = 0 -3 -3 0 X W = Figure 5: A circular bi-dimensional dataset with 2 classes (2 categories/2 clusters). The binary classification is performed via a 2-6-3-1 dense neural network depicted in Figure 6. The input has 2 dimensions, the model has 2 hidden layers with tanh activation functions, and the output has one unit only with a sigmoid. The output sigmoid is well suited to our binary classification problem. (a) Suppose the model input is x = = (x1, x)t = (0.5, 0.5). The 6 x 2 weight matrix of the first hidden layer is set to 0.50 -0.05 -0.25 0.25 0.04 0.75 0.04 -0.06 0.64 0.04 -0.64 0.08 2 The 3 x 6 weight matrix of the second hidden layer is set to 3 6 0.50 1.00 -0.05 0.40 -0.06 1.00 1.00 -0.25 0.25 0.02 0.01 -0.40 0.04 0.75 1.00 -0.50 0.01 0.64 (7) (8) |||||| || | // Figure 6: A 2-6-3-1 model to make the binary classification for our dataset. Finally, the weights of the output layer are set to W3 (2.0, 0.5, -1.0). We denote by h (h11, h12, ..., h16)t the output of the first hidden layer after the tanh activation. We also denote by h (h21, h22, h23) the output of the second hidden layer after the tanh activation. The model output is denoted by the letter o. For simplicity, it is assumed that the model units have no bias in question (a). = = = Using the linear-algebra equations of forward propagation, without a bias, we calcu- late hi (Wx), h (Wh), and the value of the output o (W3h), where (x) and (x) are the tanh and the sigmoid functions respectively. We get h (0.22127, 0.0, 0.37566,-0.00999, 0.32748,-0.27290)t, h = (-0.20188, 0.40317, 0.21473), and o= : 0.39725. = (a) Based on the values computed by the above forward propagation, compute the gra- dient of the output o with respect to the weight parameter w1 between the first input and the first unit on the first hidden layer. Use the rules of backpropagation. dw11 (b) If Xavier initialization is used for the 12 parameters in W with a Gaussian distribu- tion, what should be the distribution variance? (c) Build the 2-6-3-1 neural network of Figure 6 in the Jupyter notebook template re- ceived with the final exam PDF file. Set the activation functions to tanh for the hidden layers. The output activation should be a sigmoid. Write the model construction via TensorFlow/Keras. Save your Jupyter notebook before sending it back to the instructor by email. After model.summary(), the neural network should look like this: Model: "sequential" Layer (type) dense (Dense) dense 1 (Dense) dense 2 (Dense) Total params: 43 Trainable params: 43 Non-trainable params: 0 Output Shape (None, 6) (None, 3) (None, 1) Param # 18 21 4 ====== Set the epochs to 100, the batch size to 32, the learning rate to 0.0002, and the optimizer to Adam. Run model.compile with the correct arguments and then launch the learning by model.fit (train_points, train_labels, epochs epochs). The training accuracy should be above 99%. The validation accuracy is most likely of 100%. (d) Replace the tanh activation of the hidden layers by ReLu. Try also sigmoids instead of tanh. What are the accuracy results after changing the activation functions?

Step by Step Solution

★★★★★

3.46 Rating (159 Votes )

There are 3 Steps involved in it

Step: 1

The skin friction coefficient Cf for a laminar boundary ... blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Managerial Accounting

Authors: Charles E. Davis, Elizabeth Davis

2nd edition

1118548639, 9781118800713, 1118338448, 9781118548639, 1118800710, 978-1118338445

More Books

Students also viewed these Programming questions

Question

1.Choose from among the definitions/notions that you could explain fully and with examples. 2. Be able to explain the details of input and output of governance framework. 3. Be able to research for...

Answered: 1 week ago

Question

★★★★★

:{"foster", "enthusiasm", "wagon", "ally", "lehigh", "programming", "dog", "cat", "Ally", "smile", "pet" }; a. Suppose you perform insertion sort in order to sort A in ascending order. How many...

Answered: 1 week ago

Question

★★★★★

In this problem we consider annual U.S. lumber production over 30 years. The data were obtained from the U.S. Department of Commerce Survey of Current Business and are presented in Table 16.5 a. Plot...

Answered: 1 week ago

Question

★★★★★

k) Assume that one of these portfolio's is the Market Portfolio and all portfolios, except Portfolio G, are fairly priced according to the CAPM. What is the highest utility score that can be achieved...

Answered: 1 week ago

Question

★★★★★

A circular concrete shaft liner with Youngs modulus of 3.4 million psi, Poissons ratio of 0.25, unconfined compressive strength 3,500 psi and tensile strength 350 psi is loaded to the verge of...

Answered: 1 week ago

Question

★★★★★

Infinite Leisure Group owns and operates a number of pubs and clubs across Europe and South East Asia. Since inception the group has made exclusive use of the cost model for the purpose of its annual...

Answered: 1 week ago

Question

★★★★★

Does the normal probability plot given in Figure 7.12 support the use of a ????-test for testing a claim about the difference of two means in a paired comparison study? Explain.

Answered: 1 week ago

Question

★★★★★

Open the Multiplication Solution.sln file contained in the VB2017\Chap05\Multiplication Solution folder. Code the application to display a multiplication table similar to the one shown in Figure...

Answered: 1 week ago

Question

★★★★★

Yusef purchased a $10,000 30-year Treasury bond that promised to pay him 1.3% of the face value every 6 months for the life of the loan. Which of those numbers is the coupon rate of the bond? 1.3 6 3...

Answered: 1 week ago

Question

★★★★★

Volkswagen AG and Daimler AG, both based in Germany, are two of the largest automobile manufacturers in the world. The following information was provided in each companys 2016 annual report. * In...

Answered: 1 week ago

Question

★★★★★

Please show formulas, Q3 ( 2 point +1 bonus point): Use the following table to answer the questions below the table The formula in C14 is -PMT(B14/12, C13, F9). The PMT(interest rate, number of...

Answered: 1 week ago

Question

★★★★★

A. Draw a graph to show equilibrium price. (3 points) Demand Supply Equilibrium price Quantity A Draw a graph to show equilibriurn price. (3 points)

Answered: 1 week ago

Question

★★★★★

Imagine a senior executive has asked you to conduct an organizational self-assessment based on the Four-Frame Model. Choose an organization that you either worked for, are familiar with, or have...

Answered: 1 week ago

Question

★★★★★

A pony, tied to the end of a lead, can walk in a full circle, a distance of 56.55 meters. About how long is its lead? A. 28.3 meters B. 18.0 meters C. 9.0 meters D. 7.5 meters

Answered: 1 week ago

Question

★★★★★

2. For an idealized slipper-pad bearing, the bottom wall moves at a constant velocity U and the upper block is fixed. Both the moving and fixed walls are non- permeable. If the velocity within the...

Answered: 1 week ago

Question

★★★★★

4. The monthly profits of Hi-Mark corporation were organized into a frequency distribution. The mean of monthly profits was computed to be $105 600, the median $104 700, and the mode $104 200. (a)...

Answered: 1 week ago

Question

★★★★★

Find grammars for \ sum = { a , b } that generate the sets of all strings with at least two a s

Answered: 1 week ago

Question

★★★★★

Medi-Exam Health Services, Inc. (MEHS), located in a major metropolitan area, provides annual physical screening examinations, including a routine physical, EKG, and blood and urine tests. MEUS's...

Answered: 1 week ago

Question

★★★★★

Blake field, Inc. has grown significantly over the past decade through innovation and acquisition. Information on several of its divisions follows. The OlliePods division sells children's...

Answered: 1 week ago

Question

★★★★★

Lancaster Orthopedics specializes in hip, knee, and shoulder replacement surgery. In addition to the actual surgery, the company provides its patients with preoperative and postoperative inpatient...

Answered: 1 week ago

Question

★★★★★

Forrest Gump was one of the biggest movie hits of 1994. The movie's fortunes continued to climb in 1995, as it took home Oscars in six of 13 categories in which it was nominated, including best...

Answered: 1 week ago

Question

★★★★★

The Future-Vision Cable TV Company recently surveyed its customers. A total of 548 responses were received. Among other things, the respondents were asked to indicate their household income. The data...

Answered: 1 week ago

Question

★★★★★

An online article (http://beauty.about.com) by Julyne Derrick, Shelf Lives: How Long Can You Keep Makeup, suggests that eye shadow and eyeliner each have a shelf life of up to three years. Suppose...

Answered: 1 week ago

Question

★★★★★

The probability that a value from a normally distributed random variable will exceed the mean is 0.50. The same is true for the uniform distribution. Why is this not necessarily true for the...

Answered: 1 week ago

Previous Question Next Question