Description
Problems for Grading
- Problem 1 Note this is a Collaborative Problem
- Points Total
In this problem, develop pseudocode and code for the Expectation Maximization method. This should be done for a generic number of clusters, at a minimum you should be able to handle 3 clusters to build a three class classifiers. Using the following data
x (1)
for 5 iteration show the values for +1) using your code. You can either use a built in EM algorithm or the one you implement to show how well the clusters create the two separations as in slide 15 of the Expectation Maximization.pdf for the 5 iterations. In this example, are the clusters starting to converge? If no, why not? If yes, why?
- Problem 2 Note this is a Collaborative Problem
30 Points Total
Using the EM algorithm from Problem 1 the IRIS data set estimate the the unknown parameters µk,σk,pk.
- Problem 3
- Points Total 15 Points Each
Consider three mean values of µ = [µ1,µ2,µ3] = [4.5,2.2,3.3] with a corresponding covariance matrix as follows:
(2)
The respective minimums are min = [3.5,1.7,2.5] and maximums are max = [5.5,2.7,4.1].
Generate 300 observations.
Using the EM algorithm from Problem 1 and the generated date estimate the the unknown parameters µk,σk,pk.
1
References
- Bishop, Christopher M., Neural Networks for pattern Recognition, Oxford University Press, 1995
- Bishop, Christopher M., Pattern Recognition and Machine Learning, Springer, 2006, https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognitionand-Machine-Learning-2006.pdf
- Duin, Robert P.W., Tax, David and Pekalska, Elzbieta, PRTools, http://prtools.tudelft.nl/
- Dempster, A. P., Laird, N. M. and Rubin, D. B., Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society B, Volume 39, Number 1, pp.1–22, 1977
- Franc, Vojtech and Hlavac, Vaclav, Statistical Pattern Recognition Toolbox, https://cmp.felk.cvut.cz/cmp/software/stprtool/index.html
- Fukunaga, Keinosuke, Introduction to Statistical Pattern Recognition, Academic Press, 1972
- Machine Learning at Waikato University, WEKA, https://www.cs.waikato.ac.nz/ ml/index.html
- Tomasi, C., Estimating Gaussian Mixture Densities with EM – A Tutorial, Duke University
Course Notes, 2006, http://www.cs.duke.edu/courses/spring04/cps196.1/handouts/EM/tomasiEM.pdf, Retrieved Sept 2006
2




