CSC321: Neural Networks
Lecture 9: Bayesian learning continued
The Bayesian interpretation of weight decay
Overfitting: A frequentist illusion?
A classic example of overfitting
Approximating full Bayesian learning in a neural network
An example of full Bayesian learning
Computing the likelihood term for a logistic output unit
What can we do if there are too many parameters for a grid to be feasible?
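For orientation, the grid-based approach that the last three titles refer to can be sketched roughly as follows. This is a minimal illustration, not the lecture's own code. It assumes a single logistic output unit, a zero-mean Gaussian prior over the weights (the prior that corresponds to weight decay), and a tiny made-up dataset. The names `grid_posterior` and `predict` and the `prior_std` parameter are hypothetical.

```python
import itertools
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def log_likelihood(weights, inputs, targets):
    """Log probability of the binary targets under one logistic output unit."""
    total = 0.0
    for x, t in zip(inputs, targets):
        y = sigmoid(sum(w * xi for w, xi in zip(weights, x)))
        y = min(max(y, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total += t * math.log(y) + (1 - t) * math.log(1.0 - y)
    return total


def grid_posterior(inputs, targets, grid_values, n_weights, prior_std=2.0):
    """Evaluate prior * likelihood at every grid point, then normalize."""
    points = list(itertools.product(grid_values, repeat=n_weights))
    log_post = []
    for w in points:
        # Assumed prior: zero-mean Gaussian, i.e. a weight-decay penalty.
        log_prior = -sum(wi * wi for wi in w) / (2.0 * prior_std ** 2)
        log_post.append(log_prior + log_likelihood(w, inputs, targets))
    top = max(log_post)  # subtract the max before exponentiating for stability
    unnorm = [math.exp(lp - top) for lp in log_post]
    z = sum(unnorm)
    return points, [u / z for u in unnorm]


def predict(x, points, posterior):
    """Average the predictions of all grid points, weighted by posterior."""
    return sum(
        p * sigmoid(sum(w * xi for w, xi in zip(pt, x)))
        for pt, p in zip(points, posterior)
    )


# Made-up toy data; the second input is a constant 1 acting as a bias.
inputs = [(-2.0, 1.0), (-1.0, 1.0), (1.0, 1.0), (2.0, 1.0)]
targets = [0, 0, 1, 1]
grid_values = [-3.0 + 0.5 * i for i in range(13)]  # 13 values from -3 to 3

points, posterior = grid_posterior(inputs, targets, grid_values, n_weights=2)
p_pos = predict((1.5, 1.0), points, posterior)
p_neg = predict((-1.5, 1.0), points, posterior)
```

With 13 values per weight, two weights already need 13**2 = 169 grid points, and each extra weight multiplies that count by 13, which is why the grid stops being feasible for networks of realistic size.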