Multiple outputs
If there are multiple outputs we can often treat the
learning problem as a set of independent problems, one
per output.
Not true if the output noise is correlated and changes
from case to case.
Even though they are independent problems we can
save work by only multiplying the input vectors by the
inverse covariance of the input components once. For
output k we have:
does not
depend on a