Modeling binary data
Given a training set of binary vectors, fit a model that will
assign a probability to other binary vectors.
Useful for deciding if other binary vectors come from
the same distribution.
This can be used for monitoring complex systems to
detect unusual behavior.
If we have models of several different distributions it
can be used to compute the posterior probability that
a particular  distribution produced the observed data.