Description
Problem 1 For the K-means clustering problem, when the binary indicators (responsibilities) πππβs are fixed for k=1, 2, β¦, K and n=1, 2, β¦, N, derive for the cluster centers π¦π, k=1, 2, β¦, K, such that the following objective function J is minimized:
Problem 2 Iris.xls contains 150 data samples of three Iris categories, labeled by outcome values 0, 1, and 2. Each data sample has four attributes: sepal length, sepal width, petal length, and petal width.
Implement the K-means clustering algorithm to group the samples into K=3 clusters. Randomly choose three samples as the initial cluster centers. Calculate the objective function value J as defined in Problem 1 after the assignment step in each iteration. Exit the iterations if the following criterion is met: π½(Iterβ1)βπ½(Iter) < Ξ΅, where Ξ΅ = 10β5, and Iter is the iteration number. Plot the objective function value J versus the iteration number Iter. Comment on the result. Attach the code at the end of the homework.
Problem 3 Assume a data sample π± β βπ· comes from one of two classes, πΆ1 and πΆ2. Use logistic regression to do classification.
- Write the math expression of the logistic regression output, and the criterion used for the final classification.
- How many parameters (weights) need to be calculated/trained in this method?
Problem 4 Assume a data sample π± β βπ· comes from one of πΎ classes, πΆ1, πΆ2, β¦, πΆπΎ. Use logistic regression to do classification.
- Write the math expression of the logistic regression output, and the criterion used for the final classification.
- How many parameters (weights) need to be calculated/trained in this method?




