Sketch of the backpropagation algorithm
on a single training case
First convert the discrepancy
between each output and its
target value into an error
derivative.
Then compute error
derivatives in each hidden
layer from error derivatives in
the layer above.
Then use error derivatives
w.r.t. activities to get error
derivatives w.r.t. the weights.