Max Welling's Classnotes in Machine Learning

Statistical Estimation [ps]
- bayesian estimation
- maximum a posteriori (MAP) estimation
- maximum likelihood (ML) estimation
- Bias/Variance tradeoff & minimum description length (MDL)

Expectation Maximization (EM) Algorithm [ps]
-
detailed derivation plus some examples

Supervised Learning (Function Approximation) [ps]
- mixture of experts (MoE)
- cluster weighted modeling (CWM)

Clustering [ps]
- mixture of gaussians (MoG)
- vector quantization (VQ) with k-means.

Linear Models [ps]
- factor analysis (FA)
- probabilistic principal component analysis (PPCA)
- principal component analysis (PCA)

Independent Component Analysis (ICA) [ps]
- noiseless ICA
- noisy ICA
- variational ICA

Mixture of Factor Analysers (MoFA) [ps]
- derivation of learning algorithm

Hidden Markov Models (HMM) [ps]
- viterbi decoding algorithm
- Baum-Welch learning algorithm

Kalman Filters (KF) [ps]
- kalman filter algorithm (very detailed derivation)
- kalman smoother algorithm (very detailed derivation)

Approximate Inference Algorithms [ps]
- variational EM
- laplace approximation
- importance sampling
- rejection sampling
- markov chain monte carlo (MCMC) sampling
- gibbs sampling
- hybrid monte carlo sampling (HMC)

Belief Propagation (BP) [ps]
- converting directed acyclic graphical models (DAG) into junction trees (JT)
- Shafer-Shenoy belief propagation on junction trees
- some examples

Boltzmann Machine (BM) [ps]
- derivation of learning algorithm

Generative Topographic Mapping (GTM) [ps]
- derivation of learning algorithm

 


 


[Back to Max Welling's's home page