Assume we are training a neural network with two inputs and for the OR gate. The...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Assume we are training a neural network with two inputs and for the OR gate. The table below represents the input and expected output for the OR gate: i 1 2 3 4 1 2 0 0 0 1 0 1 output y 0 1 1 1 We will create a neural network with two inputs, one output, and no hidden layer. The activation function will be the sigmoid function S(x) = 1 1+ e-z' x1 X2 W W2 = 0.4 and = -0.2, and initial bias b = 0.1. Begin with the initial weights w = The observed output is calculated as = S(s) where s = wx1 +wx2 + b. If we were to implement this in a practical example, we know that the output is expected as either 0 or 1, so we could simply round to the nearest integer afterwards. (a) For each of the four inputs (1, 2) in the table above, determine the observed output for the initial weights and bias. (b) Using the mean squared error function error = i=1 (i - y) calculate the error observed in part (a). (c) Using the chain rule, calculate the following derivatives derror wi Jerror w derror b After the 100 epochs, what are the values s and for each input? What are the values of w, W2, b? (e) Did the observed output adjust properly to mimic the expected output? Note: For those familiar with neural networks, this training simulation has quite a few issues with it. For instance, we are overfitting the data since our actual data is the training data. This is due to the small data amounts. Another issue is we are not randomizing the order of the inputs when training. All of this (and more!) will be discussed in your future ML & AI courses. Assume we are training a neural network with two inputs and for the OR gate. The table below represents the input and expected output for the OR gate: i 1 2 3 4 1 2 0 0 0 1 0 1 output y 0 1 1 1 We will create a neural network with two inputs, one output, and no hidden layer. The activation function will be the sigmoid function S(x) = 1 1+ e-z' x1 X2 W W2 = 0.4 and = -0.2, and initial bias b = 0.1. Begin with the initial weights w = The observed output is calculated as = S(s) where s = wx1 +wx2 + b. If we were to implement this in a practical example, we know that the output is expected as either 0 or 1, so we could simply round to the nearest integer afterwards. (a) For each of the four inputs (1, 2) in the table above, determine the observed output for the initial weights and bias. (b) Using the mean squared error function error = i=1 (i - y) calculate the error observed in part (a). (c) Using the chain rule, calculate the following derivatives derror wi Jerror w derror b After the 100 epochs, what are the values s and for each input? What are the values of w, W2, b? (e) Did the observed output adjust properly to mimic the expected output? Note: For those familiar with neural networks, this training simulation has quite a few issues with it. For instance, we are overfitting the data since our actual data is the training data. This is due to the small data amounts. Another issue is we are not randomizing the order of the inputs when training. All of this (and more!) will be discussed in your future ML & AI courses.
Expert Answer:
Answer rating: 100% (QA)
Solutions Step 1 This is the complete answer to this with a very detailed explanation This is the complete answer for part A in this we need to calculate the observed output for each of the four input... View the full answer
Related Book For
Business Statistics In Practice
ISBN: 9780073401836
6th Edition
Authors: Bruce Bowerman, Richard O'Connell
Posted Date:
Students also viewed these mechanical engineering questions
-
Give the rightC++ syntax for a function prototype (declaration)of the function named myAge which takes the current year and birth year as arguments and returns the calculated age: and Give theC++...
-
Note: All ML code must be explained clearly (INJAVAXX)and should be free of needless complexity. 2 CST.2016.1.3 2 Foundations of Computer Science Please help. (2c) (a) A prime number sieve is an...
-
I need it in JAVAx Objects: Electronic health records (EHRs) in a nationwide service. Policy: The owner (patient) may read from its own EHR. A qualified and employed doctor may read and write the EHR...
-
The file CigaretteTax contains the state cigarette tax ($) for each state as of January 1, 2013. a. Construct an ordered array. b. Plot a percentage histogram. c. What conclusions can you reach about...
-
Identify the elementary row operation(s) performed to obtain the new row-equivalent matrix. 1. Original Matrix New Row-Equivalent Matrix 2. Original Matrix New Row-Equivalent Matrix 3. Original...
-
International Genetic Technologies (InGen) and The Resources Development Association (RDA) are companies involved in cutting-edge genetics research. Selected financial data are provided below:...
-
The February 2000 issue of the Journal of Accountancy contains an article by Russ Banham entitled "Better Budgets." Instructions Read the article and answer the following questions. (a) Why have some...
-
Using the data in E16-15, assume Royale Mortgage Company uses the FIFO method. Also assume that the applications in process on September 1 were 100% complete as to materials (forms) and 40% complete...
-
QUESTION 2- (15 MARKS) After completion of your course, you start working at an accounting and tax office. Juliana is your first client. She requires to lodge her income tax for 2021/22. She gave her...
-
On January 1, Boston Company completed the following transactions (use a 7% annual interest rate for all transactions): ( EV of $1. PV of $1. EVA of $1. and PVA of $1) (Use the appropriate factor(s)...
-
11- For the flow net shown the head difference is H=6.0 m and permeability k= 8x10-6 m/s. If we count the last flow channel as 0.5 the discharge quantity is; a) b) C) e) 2.9x 10-5 m3/s 5.1x 10-6 m3/s...
-
In a machine shop, six employees repaired 45 engines during a 5-day workweek. How would the productivity of the machine shop best be described?
-
An automobile assembly plant produced 600 cars during a 5-day workweek. It then produced 500 cars the following week, although it only operated 4 days due to a national holiday. During which week was...
-
You have 100 doses of a vaccine against a deadly strain of influenza that is sweeping the country, with no prospect of obtaining more. Standing in line are 100 schoolchildren and 100 elderly people....
-
For Dousmann Company actual sales are \(\$ 1,200,000\) and break-even sales are \(\$ 840,000\). Compute (a) the margin of safety in dollars and (b) the margin of safety ratio.
-
Sally Newton was the only purchasing officer of First Canadian Club, a fitness club with 20 centres scattered around Canada. In February 1988, she was thinking of how to handle the resistance coming...
-
??????? Gulf Shore Lawn and Garden Maintenance provides two general outdoor services: lawn maintenance and garden maintenance. The company charges customers \( \$ 18.0 \) per hour for each type of...
-
The water in tank A is at 270 F with quality of 10% and mass 1 lbm. It is connected to a piston/cylinder holding constant pressure of 40 psia initially with 1 lbm water at 700 F. The valve is opened,...
-
Consider the display panel situation in Exercise 11.3, and let A, B, and C (represent the mean times to stabilize the emergency condition when using display panels A, B, and C, respectively. Figure...
-
On January 4. 2000, the Gallup Organization released the results of a poll dealing with the likelihood of computer-related Y2K problems and the possibility of terrorist attacks during the New Year's...
-
Discuss how we use residual plots to check the regression assumptions for a multiple regression model.
-
What conditions are necessary for a risk-free asset to be free of risk? Provide an example. Is it really risk-free?
-
Will very wide diversification eliminate specific risk? And market risk?
-
Can the risk of a portfolio be greater than the individual risk of each of the securities it contains? Under what circumstances?
Study smarter with the SolutionInn App