Math of Fisher’s linear discriminants
What linear transformation is best for
discrimination?
The projection onto the vector
separating the class means seems
sensible:
But we also want small variance
within each class:
Fisher’s objective function is:
between
within