Using the posterior distribution
If we can afford the computation, we ought to average
the predictions of all parameter settings using the
posterior distribution to weight the predictions:
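$$
p(t_{\text{test}} \mid x_{\text{test}}, \alpha, \beta, D) = \int p(t_{\text{test}} \mid x_{\text{test}}, \beta, W)\, p(W \mid \alpha, \beta, D)\, dW
$$
where W is a parameter setting, \beta is the precision of the output noise, \alpha is the precision of the prior, and D is the training data.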
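To make the averaging concrete, here is a minimal sketch for a toy case where the computation is affordable. It assumes a one-weight linear model y = w * x, a zero-mean Gaussian prior on w with precision alpha, and Gaussian output noise with precision beta. The model, the data, the weight grid, and the function names are all illustrative assumptions, not part of the original material. The integral over parameter settings is approximated by a sum over an evenly spaced grid of weights, and the result is compared with the closed-form posterior mean that this particular Gaussian model happens to have.

```python
import math

def posterior_predictive_mean(xs, ts, x_test, alpha, beta,
                              grid_size=2001, w_max=10.0):
    """Average the predictions w * x_test over a grid of weights,
    weighting each prediction by the posterior probability of w."""
    # Candidate parameter settings: an evenly spaced grid on [-w_max, w_max].
    ws = [-w_max + 2.0 * w_max * i / (grid_size - 1) for i in range(grid_size)]

    log_posts = []
    for w in ws:
        # Log prior (up to a constant): Gaussian with precision alpha.
        log_prior = -0.5 * alpha * w * w
        # Log likelihood (up to a constant): Gaussian noise with precision beta.
        log_lik = -0.5 * beta * sum((t - w * x) ** 2 for x, t in zip(xs, ts))
        log_posts.append(log_prior + log_lik)

    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_posts)
    weights = [math.exp(lp - top) for lp in log_posts]
    total = sum(weights)

    # Posterior-weighted average of each setting's prediction.
    return sum(wt * w * x_test for wt, w in zip(weights, ws)) / total

def closed_form_mean(xs, ts, x_test, alpha, beta):
    """Exact predictive mean for this conjugate Gaussian toy model."""
    sxx = sum(x * x for x in xs)
    sxt = sum(x * t for x, t in zip(xs, ts))
    return beta * sxt / (alpha + beta * sxx) * x_test

# Hypothetical training data and precisions, chosen only for illustration.
xs = [0.5, 1.0, 1.5, 2.0]
ts = [1.1, 1.9, 3.2, 3.9]
grid_estimate = posterior_predictive_mean(xs, ts, x_test=3.0, alpha=1.0, beta=4.0)
exact = closed_form_mean(xs, ts, x_test=3.0, alpha=1.0, beta=4.0)
print(grid_estimate, exact)
```

A grid is only practical for a handful of parameters, because the number of grid points grows exponentially with the number of weights. That is why the averaging is recommended only when the computation can be afforded.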