Why the simple system does not work
A two-layer network with a single winner in the top layer
is equivalent to having a rigid template for each shape.
The winner is the template that has the biggest
overlap with the ink.
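
A minimal sketch of this winner-take-all template matching, for illustration only. The 3x3 binary templates, the names TEMPLATES, overlap, and classify, and the test images are all hypothetical and not taken from the original system. Each template plays the role of the weights into one top-layer unit, and overlap is the dot product of template and ink.

```python
# Hypothetical 3x3 binary templates, flattened row by row.
# A 1 means the template expects ink at that pixel.
TEMPLATES = {
    "vertical_bar":   [0, 1, 0,
                       0, 1, 0,
                       0, 1, 0],
    "horizontal_bar": [0, 0, 0,
                       1, 1, 1,
                       0, 0, 0],
}


def overlap(template, image):
    """Count the pixels where the template and the ink are both on."""
    return sum(t * p for t, p in zip(template, image))


def classify(image, templates=TEMPLATES):
    """Return the name of the template with the biggest overlap (the single winner)."""
    return max(templates, key=lambda name: overlap(templates[name], image))


# A vertical bar drawn exactly where the template expects it.
centered = [0, 1, 0,
            0, 1, 0,
            0, 1, 0]

# The same vertical bar shifted one pixel to the left.
shifted = [1, 0, 0,
           1, 0, 0,
           1, 0, 0]

print(classify(centered))
print(classify(shifted))
```

In this toy setup the centered bar matches its own template, but the shifted bar has zero overlap with the vertical template and one pixel of overlap with the horizontal one, so the wrong template wins. A rigid whole-shape template cannot absorb even a one-pixel shift.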
The ways in which shapes vary are much too
complicated to be captured by simple template matches
of whole shapes.
To capture all the allowable variations of a shape we
need to learn the features that it is composed of.