1
Neural Networks
  • Multi-stage regression/classification model

[Figure: single-hidden-layer network diagram, labeling the output function, hidden layer, bias unit, synaptic weights, and activation function; the activation functions are also known as ridge functions in PPR (projection pursuit regression)]
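A minimal sketch of the model behind the diagram, assuming the single-hidden-layer notation of ESL Ch. 11 (the symbols Z, T, α, β, g come from that text, not the slide):

    Z_m = \sigma(\alpha_{0m} + \alpha_m^T X),  m = 1, ..., M   % hidden layer; \sigma: activation function
    T_k = \beta_{0k} + \beta_k^T Z,            k = 1, ..., K   % \alpha_{0m}, \beta_{0k}: bias units
    f_k(X) = g_k(T)                                            % g_k: output function

The fixed activation σ plays the role of PPR's ridge functions, which is why the two models are compared throughout.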
2
Activation Function
  • Gaussian radial basis → radial basis function network
  • Sigmoid
  • differentiable
  • almost linear around 0

[Figure: sigmoid σ(sv) plotted for a large scale s (nearly a step function) and a small s (nearly linear)]
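A hedged sketch of the sigmoid behind those two curves (the scale parameter s is taken from the plot labels):

    \sigma(v) = 1 / (1 + e^{-v})               % differentiable: \sigma'(v) = \sigma(v)(1 - \sigma(v))
    \sigma(v) \approx 1/2 + v/4 near v = 0     % almost linear around 0
    % \sigma(s v): large s approaches a hard threshold; small s stays in the linear regime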
3
Output Function
  • Regression: identity output
  • Classification: softmax output (Eq. 11.6; see the
    sketch after this list)
  • Weihao: Any other justification to make the output
    layer sum to one using, say, the softmax function in
    Eq. 11.6?
  • Answer: Think of the NN as a function approximator.
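A hedged reconstruction of the two output functions, following ESL Ch. 11 (the softmax there is Eq. 11.6):

    g_k(T) = T_k                                % regression: identity
    g_k(T) = e^{T_k} / \sum_{l=1}^{K} e^{T_l}   % classification: softmax; outputs are positive and sum to one

Sum-to-one outputs can be read as class-posterior estimates, which fits the function-approximator view in the answer above.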

4
Fitting NN (contd)
  • Gradient descent (for regression)
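A minimal sketch of the dropped equations, assuming the sum-of-squares error and update rules of ESL Ch. 11:

    R(\theta) = \sum_{k=1}^{K} \sum_{i=1}^{N} (y_{ik} - f_k(x_i))^2                                % error over all cases and outputs
    \beta_{km}^{(r+1)} = \beta_{km}^{(r)} - \gamma_r \sum_i \partial R_i / \partial \beta_{km}     % \gamma_r: learning rate
    \alpha_{ml}^{(r+1)} = \alpha_{ml}^{(r)} - \gamma_r \sum_i \partial R_i / \partial \alpha_{ml}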

5
Back-propagation (aka delta rule)
  • Forward pass

[Figure: forward-pass diagram marking which values are given and which are computed]
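A hedged sketch of the forward pass (ESL notation assumed): with the current weights fixed, compute in order

    z_{mi} = \sigma(\alpha_{0m} + \alpha_m^T x_i)   % hidden activations from the given inputs
    t_{ki} = \beta_{0k} + \beta_k^T z_i             % output-layer inputs from the computed z_i
    f_k(x_i) = g_k(t_i)                             % predicted outputs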
6
Back-propagation (contd)
  • Backward pass

[Figure: backward-pass diagram marking which values are given and which are computed]
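And a matching sketch of the backward pass for squared error (δ and s follow ESL's back-propagation equations):

    \delta_{ki} = -2 (y_{ik} - f_k(x_i)) g_k'(\beta_k^T z_i)                  % output-layer errors
    s_{mi} = \sigma'(\alpha_m^T x_i) \sum_{k=1}^{K} \beta_{km} \delta_{ki}    % hidden errors, back-propagated from \delta
    \partial R_i / \partial \beta_{km} = \delta_{ki} z_{mi}                   % the gradients used in the updates
    \partial R_i / \partial \alpha_{ml} = s_{mi} x_{il}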
7
Back-propagation (contd)
  • Error surface and learning rate
  • η_opt(θ): the optimal learning rate at weights θ
  • Assuming a quadratic error surface, η_opt is set by
    the local curvature (see the sketch below)
  • a learning rate above 2·η_opt will NOT converge
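A hedged derivation of those two claims, assuming a locally quadratic error surface with curvature λ along the descent direction:

    E(\theta) \approx E(\theta^*) + (\lambda / 2)(\theta - \theta^*)^2
    \theta^{(r+1)} - \theta^* = (1 - \eta\lambda)(\theta^{(r)} - \theta^*)    % one gradient step with rate \eta
    % converges iff |1 - \eta\lambda| < 1, i.e. \eta < 2/\lambda
    % \eta_opt = 1/\lambda jumps to the minimum in one step; \eta > 2\eta_opt diverges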

8
Back-propagation (contd)
  • Batch learning vs. online learning (contrasted in
    the Python sketch after this list)
  • Often too slow
  • Newton method not attractive (2nd derivative too
    costly)
  • Use conjugate gradients, variable metric methods,
    etc. (Ch. 10, Numerical Recipes in C
    http://www.library.cornell.edu/nr/bookcpdf.html)
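To make the batch-vs.-online contrast concrete, a minimal Python sketch on a toy least-squares problem (the data and learning rate are illustrative, not from the slides):

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 3))           # toy inputs
    y = X @ np.array([1.0, -2.0, 0.5])      # toy targets (linear, for simplicity)
    gamma = 0.05                            # fixed learning rate

    # Batch learning: one update per sweep, using the gradient over ALL cases.
    w = np.zeros(3)
    for epoch in range(200):
        grad = -2.0 * X.T @ (y - X @ w) / len(X)
        w -= gamma * grad

    # Online learning: one update per training case, in sequence.
    w = np.zeros(3)
    for epoch in range(200):
        for i in range(len(X)):
            w -= gamma * (-2.0 * X[i] * (y[i] - X[i] @ w))

Online steps are noisier but cheap per update and suit large or streaming data; either way, plain gradient descent can be slow, hence the second-order-flavored methods cited above.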

9
Back-propagation (contd)
regularization!
  • Prevent over-fitting (e.g., by weight decay; see the
    sketch after this list)
  • Start from (random) weights near zero
  • Introducing non-linearity when necessary
  • Early stopping
  • Smaller/adaptive learning rates
  • Convergence guaranteed if γ_r → 0, Σ_r γ_r = ∞, and
    Σ_r γ_r² < ∞ (e.g., γ_r = 1/r)
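One concrete regularizer, weight decay (developed later in the source text; this form follows ESL Ch. 11.5), shrinks all weights toward zero:

    \tilde{R}(\theta) = R(\theta) + \lambda J(\theta),   J(\theta) = \sum_{km} \beta_{km}^2 + \sum_{ml} \alpha_{ml}^2
    % \lambda \ge 0: tuning parameter, typically chosen by cross-validation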

10
Back-propagation (contd)
  • Joy: since all parameters start close to 0, how
    could different starting points end in models that
    differ that much?
  • Answer: non-linearity.
  • Joy: To prevent over-fitting, is there any way to
    train the model to the global minimum point and
    then "prune" it?
  • Answer: The global minimum is elusive. But people
    have tried the idea of pruning, in weight decay
    (later) and optimal brain damage [Y. LeCun, J.
    S. Denker, and S. A. Solla. Optimal brain damage.
    In D. S. Touretzky, editor, Advances in Neural
    Information Processing Systems II, pages
    598-605. Morgan Kaufmann, San Mateo, CA, 1990].
    Less salient connections can be removed
    (pruned); saliency ≠ magnitude!

11
Historical Background
  • McCulloch & Pitts, 1943: behavior of simple
    neural networks.
  • A. Turing, 1948: B-type unorganized machines
    consisting of networks of NAND gates.
  • Rosenblatt, 1958: two-layer perceptrons.
  • Minsky & Papert, 1969 (Perceptrons): showed the XOR
    problem for perceptrons; the connectionism winter
    came.
  • Rumelhart, Hinton & Williams, 1986: first
    well-known introduction of the back-propagation
    algorithm; connectionism revived.

12
Model Complexity of NN
  • Fan: How can we measure the complexity of a NN?
    If one NN has many layers but few nodes, and another
    has many nodes but few layers, which one is more
    complex? Kevyn: how can we derive the
    effective degrees of freedom based on the
    number of iterations we have performed?
  • Answer: remember in Ch. 5 we have \hat{y} = S_\lambda y
    and we define df(\lambda) = trace(S_\lambda); can we
    do the same for NN? For regression, \hat{y} = B\beta
    with B_{im} = \sigma(\alpha_{0m} + \alpha_m^T x_i),
    but we have non-linearity inside B! Maybe use a Taylor
    expansion of the sigmoid and proceed (so we're
    basically approximating the NN using linear models;
    see the sketch below)
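A hedged sketch of that linearization idea (my elaboration, not on the slide): expand the sigmoid to first order around the current weights θ*,

    \sigma(\alpha_m^T x) \approx \sigma(\alpha_m^{*T} x) + \sigma'(\alpha_m^{*T} x)(\alpha_m - \alpha_m^*)^T x

so near θ* the fitted values are approximately linear in the weights, \hat{y} ≈ S y for some smoother matrix S, and df ≈ trace(S) as in Ch. 5.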

13
NN Universal Approximator?
  • Kolmogorov proved any continuous function g(x)
    defined on the unit hypercube I^n can be
    represented as
    g(x) = \sum_{j=1}^{2n+1} \Xi_j( \sum_{i=1}^{n} \psi_{ij}(x_i) )
    for properly chosen \Xi_j and \psi_{ij}.
    (A. N. Kolmogorov. On the representation of
    continuous functions of several variables by
    superposition of continuous functions of one
    variable and addition. Doklady Akademiia Nauk
    SSSR, 114(5):953-956, 1957)

14
NN vs. PPR
  • NN: a parametric version of PPR
  • less complex ridge functions (a fixed sigmoid rather
    than arbitrary smooth functions) imply more terms
    (20-100 hidden units vs. 5-10 PPR terms)