This assignment is about fitting mixtures of axis-aligned Gaussians to two-dimensional data. Download and extract the archive below into your directory: http://www.cs.toronto.edu/~nitish/csc321/mog.tar.gz PART 1 (5 points) Run moginit to create training and validation datasets from 5 random axis aligned Gaussians. Then use the function mogem to fit various numbers of axis aligned Gaussians to the training data. Using performance on the validation data, determine the optimal number of Gaussians to fit to the training data. Present your results as a graph that plots both the negative log validation density and the negative log training density as a function of the number of Gaussians. Include a brief statement of what you think the graph shows. Also include a brief statement about the effects of changing the initial standard deviation used in mogem. Use a standard deviation of 0.1 for making the plots. Please do not change the random seeds in moginit.m (this will produce different data). You can do this later if you want to just play around and get a feel for how the algorithm behaves. PART 2 (2 points) Change moginit.m to use only 10 cases per Gaussian, and 10 axis-aligned gaussians to generate the data and repeat the experiment above (without changing the random seeds). Present your results as a graph and include a brief statement of what you think the graph shows and why it differs from the graph in PART 1. PART 3 (3 points) Change mogem.m so that in addition to fitting the means and axis-aligned variances, it also fits the mixing proportions. Currently, mogem does not mention mixing proportions so it is currently assuming that they are all equal (which makes them all cancel out when computing the posterior probability of each Gaussian for each datapoint.). So the first thing to do is to include mixing proportions when computing the posterior, but keep them fixed (and not all equal). Once you have debugged this, try learning the mixing proportions. Make mogem print out the final mixing proportions and hand in final mixing proportions that you get when fitting 4 Gaussians to the data, when you start with mixing proportions [0.25, 0.25, 0.25, 0.25], [0.3, 0.2, 0.2, 0.3], [0.2, 0.3, 0.3, 0.2] and [0.9, 0.05, 0.025, 0.025]. Use the same settings to generate the data as you did in PART 2.