The momentum method
    Imagine a ball on the error
surface with velocity v.
It starts off by following the
gradient, but once it has
velocity, it no longer does
steepest descent.
It damps oscillations by
combining gradients with
opposite signs.
It builds up speed in
directions with a gentle but
consistent gradient.
On an inclined plane it
reaches a terminal velocity.