Question:

You may use the mlp.py model provided, although this is not mandatory. This exercise deals with the approximation of functions by neural networks. The so-called function approximation (regression) problem is to find a mapping f' satisfying || f'(x) - f(x) || < ε for all x in the domain, where ε > 0 is the required accuracy.

Preparation of data

For this function approximation problem, three kinds of data sets are prepared: the training set, the validation set and the test set. The training set is a set of value pairs comprising information about the target function, used for training the network. The validation set is associated with the early stopping technique described below: during the training phase, the validation error is monitored in order to prevent the network from overfitting the training data. Normally, the test set is used only to evaluate the network performance afterwards, but in this exercise the root mean-square error (RMSE) on the test set is used as the performance goal of the network training. For the current problem, the training and test data are taken from uniform grids (10x10 pairs of values for the training data, 9x9 pairs for the test data). As shown in Fig. 1, the range of the function output is already within the interval [-1, 1], so it is not necessary to scale the target function. The validation data are sampled randomly from the function surface, in order to make them a better representation of the original function.

Network Design

Theoretical results indicate that, given enough hidden (non-linear) units, a feedforward neural network can approximate any non-linear function (with a finite number of discontinuities) to a required degree of accuracy. In other words, any non-linear function can be expressed as a linear combination of non-linear basis functions. Therefore, a two-layer feedforward network, one layer of non-linear hidden neurons followed by a single linear output neuron, seems a reasonable design for a function approximation task. The target function defined above has two inputs (x, y) and one output z = f(x, y). Thus, as shown in Fig. 2, the network solution consists of two inputs, one layer of neurons with a sigmoid transfer (aka activation) function, and one output neuron with a linear transfer function. You may also want to consider using the hyperbolic tangent activation function for the hidden layer.

The number of hidden neurons is an important design issue. On the one hand, having more hidden neurons allows the network to approximate functions of greater complexity; but, as a result of the network's high degree of freedom, it may overfit the training data, so that unseen data are fitted poorly. On the other hand, although a small network will not have enough power to overfit the training data, it may be too small to represent the target function adequately. In order to choose a reasonable number of hidden neurons, three different networks with 2, 8 and 50 hidden neurons are examined. The training result (see Fig. 3) shows that the network with 8 hidden neurons outperforms the other two after all are trained with the same training parameters. Note that the number of epochs to convergence will generally vary between runs.
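As a starting point, here is a minimal sketch of how the three data sets described above could be prepared with NumPy. The helper names (make_grid) and the validation-set size of 100 points are my own choices for illustration; the exercise only specifies the grid sizes and that validation points are drawn randomly:

```python
import numpy as np

def f(x, y):
    """Target function from Fig. 1: f(x,y) = cos(x + 6*0.35*y) + 2*0.35*x*y."""
    return np.cos(x + 6 * 0.35 * y) + 2 * 0.35 * x * y

def make_grid(n):
    """Return (n*n, 2) inputs and (n*n, 1) targets on a uniform grid over [-1, 1]^2."""
    xs = np.linspace(-1.0, 1.0, n)
    X, Y = np.meshgrid(xs, xs)
    inputs = np.column_stack([X.ravel(), Y.ravel()])
    targets = f(X, Y).reshape(-1, 1)
    return inputs, targets

# Training set: 10x10 uniform grid; test set: 9x9 uniform grid.
train_x, train_y = make_grid(10)
test_x, test_y = make_grid(9)

# Validation set: sampled randomly from the function surface.
# (A size of 100 points is an assumption; the exercise does not fix it.)
rng = np.random.default_rng(0)
val_x = rng.uniform(-1.0, 1.0, size=(100, 2))
val_y = f(val_x[:, 0], val_x[:, 1]).reshape(-1, 1)
```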

Part (a). You must:

1. Using the GradientDescentOptimizer, investigate the converged performance of a 2-layer neural network with 2, 8 and 50 hidden-layer neurons (see the training sketch after this list).

2. Produce a contour diagram similar to Fig. 3.

3. Include a table of MSE for the 3 different network sizes and the number of epochs to convergence.

4. Indicate whether sigmoid or hyperbolic tangent activation functions were used in your experiments.
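Since the question names GradientDescentOptimizer, the following is a sketch of the 2-layer network and early-stopping training loop in TensorFlow 1.x-style code. The learning rate, epoch budget, weight initialisation and patience value are all assumptions for illustration, not values taken from the exercise or from mlp.py; the data arrays (train_x, val_x, test_x, ...) come from the data-preparation sketch above:

```python
import numpy as np
import tensorflow.compat.v1 as tf  # TF 1.x-style API; on TF 1.x just `import tensorflow as tf`
tf.disable_v2_behavior()

n_hidden = 8  # try 2, 8 and 50 for the comparison

x = tf.placeholder(tf.float32, [None, 2])
t = tf.placeholder(tf.float32, [None, 1])

# Hidden layer: sigmoid activation (swap in tf.tanh to test hyperbolic tangent).
W1 = tf.Variable(tf.random_normal([2, n_hidden], stddev=0.5))
b1 = tf.Variable(tf.zeros([n_hidden]))
h = tf.sigmoid(tf.matmul(x, W1) + b1)

# Output layer: a single linear neuron.
W2 = tf.Variable(tf.random_normal([n_hidden, 1], stddev=0.5))
b2 = tf.Variable(tf.zeros([1]))
y = tf.matmul(h, W2) + b2

mse = tf.reduce_mean(tf.square(y - t))
train_step = tf.train.GradientDescentOptimizer(learning_rate=0.1).minimize(mse)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    best_val, patience = np.inf, 0
    for epoch in range(20000):  # epoch budget is an assumption
        sess.run(train_step, {x: train_x, t: train_y})
        val_mse = sess.run(mse, {x: val_x, t: val_y})
        # Early stopping: halt when the validation error stops improving.
        if val_mse < best_val:
            best_val, patience = val_mse, 0
        else:
            patience += 1
            if patience > 500:
                break
    test_mse = sess.run(mse, {x: test_x, t: test_y})
    print(f"{n_hidden} hidden neurons: epochs={epoch + 1}, test MSE={test_mse:.5f}")
```

Running this once per network size (2, 8, 50) gives the MSE and epochs-to-convergence entries for the table in task 3.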

I'm very new to neural networks and don't know how to approach the question.

Thank you for your help.

f(x,y) = cos(x + 6*0.35*y) + 2*0.35*x*y,   x, y ∈ [-1, 1]

[Fig. 1: Parametric surface and contour of the target function]
[Fig. 2: Network architecture: two inputs, hidden layer (tansig), linear output neuron (purelin)]
[Fig. 3: Function contours: target vs. networks with 2, 8 and 50 hidden neurons]
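For the contour diagram in task 2, a matplotlib sketch along these lines could reproduce the layout of Fig. 3. Overlaying each trained network's output on the target contour is my reading of the figure; predict() below is a hypothetical helper standing in for the trained network's forward pass:

```python
import numpy as np
import matplotlib.pyplot as plt

def f(x, y):
    return np.cos(x + 6 * 0.35 * y) + 2 * 0.35 * x * y

xs = np.linspace(-1.0, 1.0, 50)
X, Y = np.meshgrid(xs, xs)

fig, ax = plt.subplots()
# Target contour, as in Fig. 1 / Fig. 3.
cs = ax.contour(X, Y, f(X, Y), colors="k")
ax.clabel(cs, inline=True, fontsize=8)

# Overlay the contour of each trained network (predict() is hypothetical:
# it should return the network output evaluated on the grid points).
# for n_hidden in (2, 8, 50):
#     Z = predict(n_hidden, X, Y)
#     ax.contour(X, Y, Z, linestyles="dashed")

ax.set_xlabel("x")
ax.set_ylabel("y")
ax.set_title("Target function contour")
plt.show()
```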
