Is preprocessing cheating?
It seems like cheating if the aim to show how
powerful learning is. The really hard bit is done
by the preprocessing.
Its not cheating if we learn the non-linear
preprocessing.
This makes learning much more difficult and
much more interesting..
Its not cheating if we use a very big set of non-
linear features that is task-independent.
Support Vector Machines make it possible to
use a huge number of features without
requiring much computation or data.