The bias-variance decomposition

\[
\langle (\bar{t}_n - y_n)^2 \rangle = (\bar{t}_n - \langle y_n \rangle)^2 + \langle (y_n - \langle y_n \rangle)^2 \rangle
\]

$y_n$: the model's estimate for test case n when trained on dataset D
$\bar{t}_n$: the average target value for test case n

The “bias” term is the squared error of the average, over all training datasets, of the estimates.
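
The averaging over training datasets can be made concrete with a small simulation. The sketch below is only an illustration under assumed settings: the sine target function, the noise level, the cubic polynomial model, and all variable names are hypothetical choices, not taken from the slide. It draws many training sets, records the model's estimate for one test case each time, and compares the average squared error with the sum of the bias term and the spread of the estimates (the term usually called variance).

```python
import numpy as np

rng = np.random.default_rng(0)

def true_fn(x):
    # Assumed ground-truth function (hypothetical, for illustration only).
    return np.sin(2 * np.pi * x)

x_test = 0.3                # the single test case n
t_bar = true_fn(x_test)     # average target value (the noise has zero mean)

n_datasets = 2000           # number of training datasets D to average over
n_train = 20                # points per training dataset
noise_sd = 0.3              # standard deviation of the target noise
degree = 3                  # polynomial degree of the assumed model

estimates = np.empty(n_datasets)
for d in range(n_datasets):
    # Draw a fresh training dataset D and fit the model to it.
    x = rng.uniform(0.0, 1.0, n_train)
    t = true_fn(x) + rng.normal(0.0, noise_sd, n_train)
    coeffs = np.polyfit(x, t, degree)
    # The model's estimate for the test case when trained on this D.
    estimates[d] = np.polyval(coeffs, x_test)

mean_estimate = estimates.mean()                         # average over datasets
expected_sq_error = np.mean((t_bar - estimates) ** 2)    # left-hand side
bias_term = (t_bar - mean_estimate) ** 2                 # "bias" term
variance_term = np.mean((estimates - mean_estimate) ** 2)

print(f"expected squared error: {expected_sq_error:.6f}")
print(f"bias term + variance:   {bias_term + variance_term:.6f}")
```

Because the same sample of datasets is used for every average, the two printed numbers agree up to floating-point rounding. Changing `degree` or `n_train` shifts the balance between the two terms.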
Angle brackets are physics notation for expectation over D.
See Bishop, page 149, for a derivation using a different notation.
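
For comparison, a short derivation can also be sketched in the angle-bracket notation. The symbol names are assumptions made here for illustration: $y_n$ stands for the model's estimate for test case n when trained on dataset D, $\bar{t}_n$ for the average target value for test case n, and $\langle \cdot \rangle$ for expectation over D. The only facts used are that $\bar{t}_n$ and $\langle y_n \rangle$ do not depend on D.

```latex
\begin{align*}
\langle (\bar{t}_n - y_n)^2 \rangle
  &= \big\langle \big( (\bar{t}_n - \langle y_n \rangle)
        + (\langle y_n \rangle - y_n) \big)^2 \big\rangle \\
  &= (\bar{t}_n - \langle y_n \rangle)^2
     + 2\,(\bar{t}_n - \langle y_n \rangle)\,
       \langle \langle y_n \rangle - y_n \rangle
     + \langle (\langle y_n \rangle - y_n)^2 \rangle \\
  &= (\bar{t}_n - \langle y_n \rangle)^2
     + \langle (y_n - \langle y_n \rangle)^2 \rangle
\end{align*}
```

The cross term vanishes because $\langle \langle y_n \rangle - y_n \rangle = \langle y_n \rangle - \langle y_n \rangle = 0$. The first remaining term is the “bias” term and the second is the variance of the estimates over training datasets.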