lecture 11

A weird measure of model complexity

•

Suppose that we pick n datapoints and assign labels of +

or – to them at random. If our model class (e.g. a neural

net with a certain number of hidden units) is powerful

enough to learn any association of labels with data, its

too powerful!

•

Maybe we can characterize the power of a model class

by asking how many datapoints it can learn perfectly for

all possible assignments of labels.

–

This number of datapoints is called the Vapnik-

Chervonenkis dimension.

–

Creationism has infinite VC dimension.