Lecture 10: Support Vector Machines

Some commonly used kernels

Polynomial: $K(\mathbf{x}, \mathbf{y}) = (\mathbf{x} \cdot \mathbf{y} + 1)^p$

Neural net: $K(\mathbf{x}, \mathbf{y}) = \tanh(\kappa \, \mathbf{x} \cdot \mathbf{y} - \delta)$


For the neural network kernel, there is one “hidden unit”
per support vector, so fitting the maximum-margin hyperplane
also determines how many hidden units to use.
This kernel is not positive semidefinite for every parameter
setting, so it may violate Mercer’s condition.
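
The two points above can be illustrated with a minimal sketch. It assumes the standard forms of the two kernels (polynomial of degree p, and a tanh kernel with slope kappa and offset delta). The support vectors, weights, and bias are hypothetical values chosen for illustration, not the result of an actual fit. The last lines use the fact that a Mercer kernel must satisfy K(x, x) >= 0 for every x, which the tanh kernel fails for a short vector when delta is positive.

```python
import math

def dot(x, y):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(x, y))

def polynomial_kernel(x, y, p=2):
    """Polynomial kernel (x . y + 1)^p; p is the degree."""
    return (dot(x, y) + 1.0) ** p

def neural_net_kernel(x, y, kappa=1.0, delta=1.0):
    """Neural net (tanh) kernel; kappa and delta are assumed parameter names."""
    return math.tanh(kappa * dot(x, y) - delta)

def decision_value(x, support_vectors, weights, bias, kernel):
    """Kernel expansion: one term ("hidden unit") per support vector.

    weights[i] plays the role of alpha_i * y_i from the dual solution.
    """
    return sum(w * kernel(s, x) for s, w in zip(support_vectors, weights)) + bias

# Hypothetical support vectors and weights (not fitted to any data).
support_vectors = [(1.0, 2.0), (-1.0, 0.5), (0.0, -1.5)]
weights = [0.7, -1.2, 0.5]
bias = 0.1

# The number of hidden units equals the number of support vectors.
n_hidden_units = len(support_vectors)
score = decision_value((0.5, 0.5), support_vectors, weights, bias, neural_net_kernel)

# Mercer check on a single point: a positive semidefinite kernel needs K(x, x) >= 0.
x_small = (0.1, 0.0)
k_self = neural_net_kernel(x_small, x_small)  # tanh(0.01 - 1.0), which is negative
```

With these parameter values `k_self` is negative, so the 1x1 Gram matrix on `x_small` already fails to be positive semidefinite. The polynomial kernel has no such problem, since K(x, x) = (|x|^2 + 1)^p is always positive.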