Neural Networks Outline The biological neuron History of

Neural Networks Outline: The biological neuron History of neural networks research The Perceptron Examples Training algorithm Fundamental training theorem Two-level perceptrons CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 1

The Biological Neuron The human brain contains approximately 1011 neurons. Activation process: Inputs are transmitted electrochemically across the input synapses Input potentials are summed. If the potential reaches a threshold, a pulse or action potential moves down the axon. (The neuron has “fired”. ) The pulse is distributed at the axonal arborization to the input synapses of other neurons. After firing, there is a refractory period of inactivity. CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 2

History of Neural Networks Research 1943 Mc. Culloch & Pitts model of neuron. ni(t+1) = ( j wij nj(t) - i), (x) = 1 if x 0; 0, otherwise. 1962 Frank Rosenblatt’s book gives a training algorithm for finding the weights wij from examples. 1969 Marvin Minsky and Seymour Papert publish Perceptrons, and prove that 1 -layer perceptrons are incapable of computing image connectedness. 1974 -89, 1982: Associated content-addressable memory. Backpropagation: Werbos 1974, Parker 1985, Rumelhart, Hinton, & Williams 1986. CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 3

The Perceptron . . . x 1 x 2 xn w 1 w 2 wn y weights inputs summation y= thresholding output 1 if wi xi ; 0, otherwise. CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 4

Perceptron Examples: Boolean AND and OR. y = x 1 x 2 . . . xk x 1 1 x 2 1 xk = 1/2 1 y = x 1 x 2 . . . xk . . . x 1 1 x 2 1 xk = k - 1/2 1 CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks xi {0, 1} 5

Perceptron Examples: Boolean NOT x -1 y = x = - 1/2 xi {0, 1} CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 6

Perceptron Example: Template Matching -1 -1 1 -1 1 1 1 -1 -1 -1 1 weights w 1 through w 25 xi {-1, 1} = 25 - Recognizes the letter A provided the exact pattern is present. CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 7

Perceptron Training Sets Let X = X+ U X- be the set of training examples. SX = X 1, X 2, . . . , Xk, . . . is a training sequence on X, provided: (1) Each Xk is a member of X, and (2) Each element of X occurs infinitely often in SX. An element e occurs infinitely often in a sequence z = z 1, z 2, . . . provided that for any nonzero integer i, there exists a nonnegative integer j such that there is an occurrence of e in zi, zi+1, . . . , zj. CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 8

Perceptron Training Algorithm Let X = X+ U X- be the set of training examples. and let SX = X 1, X 2, . . . , Xk, . . . be a training sequence on X. Let wk be the weight vector at step k. Choose w 0 arbitrarily. For example. w 0 = (0, 0, . . . , 0). Each each step k, k = 0, 1, 2, . . . Classify Xk using wk. If Xk is correctly classified, take wk+1 = wk. If Xk is in X- but misclassified, take wk+1 = wk - ck Xk. If Xk is in X+ but misclassified, take wk+1 = wk + ck Xk. The sequence ck should be chosen according to the data. Overly large constant values can lead to oscillation during training. Values that are too small will increase training time. However, ck = c 0/k will work for any positive c 0. CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 9

Perceptron Limitations Perceptron training always converges if the training data X+ and X- are linearly separable sets. The boolean function XOR (exclusive or) is not linearly separable. (Its positive and negative instances cannot be separated by a line or hyperplane. ) It cannot be computed by a single-layer perceptron. It cannot be learned by a single-layer perceptron. + X = { (0, 1), (1, 0) } X = { (0, 0), (1, 1) } + X= X UX x 2 x 1 CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 10

Two-Layer Perceptrons +1 x 1 -1 +1 = 0. 5 -1 x 2 +1 +1 y = XOR(x 1, x 2) = 0. 5 CSE 415 -- (c) S. Tanimoto, 2004 Neural Networks 11