
Multilayer Nets, Sigmoid Units

Motivation:

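Note the very useful fact that, for the sigmoid $\sigma(y) = 1/(1+e^{-ky})$, the derivative can be written in terms of $\sigma$ itself: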

\begin{displaymath}\frac{d\sigma}{dy} = k\sigma(1-\sigma)
\end{displaymath}
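
As a quick sanity check, the identity can be compared against a finite-difference estimate. This is only a sketch: it assumes the sigmoid has the logistic form 1/(1+exp(-k*y)) with steepness k, and the function names (`sigmoid`, `sigmoid_grad`, `numeric_grad`) are illustrative, not taken from these notes.

```python
import math

def sigmoid(y, k=1.0):
    """Logistic sigmoid with steepness k (assumed form: 1 / (1 + exp(-k*y)))."""
    return 1.0 / (1.0 + math.exp(-k * y))

def sigmoid_grad(y, k=1.0):
    """Derivative obtained from the unit's output alone: k * s * (1 - s)."""
    s = sigmoid(y, k)
    return k * s * (1.0 - s)

def numeric_grad(y, k=1.0, h=1e-6):
    """Central finite-difference estimate of d(sigma)/dy, for comparison."""
    return (sigmoid(y + h, k) - sigmoid(y - h, k)) / (2.0 * h)

# The closed form and the numerical estimate should agree closely.
for y in (-2.0, 0.0, 0.5, 3.0):
    for k in (1.0, 2.5):
        assert abs(sigmoid_grad(y, k) - numeric_grad(y, k)) < 1e-6
```

The practical point is that once a unit's output has been computed, its derivative costs only one extra multiplication; no further exponentials are needed.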

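The sigmoid is a smooth, differentiable approximation to a hard threshold. If we use it in place of the threshold, the output of each unit has a well-defined gradient with respect to its weights, so we can use the gradient descent rule to train cascaded (multilayer) thresholded units. The backpropagation algorithm does just this.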
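
To make the role of the derivative concrete, here is a minimal sketch of batch gradient descent on a single sigmoid unit. It is not the backpropagation algorithm itself, only the one-unit case that it generalizes. The squared-error measure, the learning rate `eta`, the epoch count, the OR training set, and all function names are assumptions chosen for illustration.

```python
import math

def sigmoid(y, k=1.0):
    """Logistic sigmoid with steepness k (assumed form)."""
    return 1.0 / (1.0 + math.exp(-k * y))

def predict(w, x, k=1.0):
    """Output of one sigmoid unit; w[0] is the bias weight."""
    xs = [1.0] + list(x)
    return sigmoid(sum(wi * xi for wi, xi in zip(w, xs)), k)

def train_sigmoid_unit(examples, eta=0.5, epochs=5000, k=1.0):
    """Batch gradient descent on E = 1/2 * sum (t - o)^2 for one unit."""
    n_inputs = len(examples[0][0])
    w = [0.0] * (n_inputs + 1)
    for _ in range(epochs):
        delta = [0.0] * len(w)
        for x, t in examples:
            xs = [1.0] + list(x)
            o = predict(w, x, k)
            # k * o * (1 - o) is d(sigma)/dy, taken from the identity above.
            err_term = (t - o) * k * o * (1.0 - o)
            for i, xi in enumerate(xs):
                delta[i] += eta * err_term * xi
        w = [wi + di for wi, di in zip(w, delta)]
    return w

# Hypothetical training set: the Boolean OR function.
OR_EXAMPLES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
weights = train_sigmoid_unit(OR_EXAMPLES)
```

Thresholding `predict(weights, x)` at 0.5 then recovers a Boolean output. With a hard threshold in place of the sigmoid, the factor `k * o * (1 - o)` would be zero almost everywhere and the update would carry no information.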



Anand Venkataraman
1999-09-16