A mixture of naïve models
Assume that the data was generated by first
picking  a particular naïve model and then
generating a binary vector from this naïve
model.
This is just like the mixture of Gaussians, but
for binary data.