 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
| • |
This takes
N-dimensional data and finds the M orthogonal
|
|
directions in
which the data has the most variance
|
|
|
|
– |
These
M principal directions form a subspace.
|
|
|
|
– |
We
can represent an N-dimensional datapoint by its
|
|
|
projections
onto the M principal directions
|
|
|
|
• |
This
loses all information about where the datapoint is located
|
|
|
in the
remaining orthogonal directions.
|
|
|
|
– |
We
reconstruct by using the mean value (over all the
|
|
|
data)
on the N-M directions that are not represented.
|
|
|
|
• |
The
reconstruction error is the sum over all these
|
|
|
unrepresented
directions of the squared differences from the
|
|
|
mean.
|
|