An advantage of modeling sequential data
We can get a teaching signal by trying to predict
the next term in a series.
This seems much more natural than trying to
predict each column of pixels in an image
from the columns to the left of it.
But its actually equivalent.