Name: EE5904-Project 2 Solved
SKU: 60303
Availability: InStock

Description

5/5 - (1 vote)

Task 1

Write a MATLAB (M-file) program to compute the discriminant function, if one exists, for the following SVMs, using the training set provided.

Hard margin SVM with the linear kernel:

𝐾𝐾 (𝑥𝑥₁, 𝑥𝑥₂) = 𝑥𝑥₁^𝑇𝑇𝑥𝑥₂

Hard margin SVM with a polynomial kernel:

𝐾𝐾 (𝑥𝑥₁, 𝑥𝑥₂) = (𝑥𝑥₁^𝑇𝑇𝑥𝑥₂+ 1)^𝑝𝑝

Soft margin with a polynomial kernel as given in Equation (2).

The step of computing the discriminate function can be summarized as follows:

Data preparation and pre-preprocess.

Here our training and testing dataset have 2,000 and 1,536 samples respectively. Each sample has 57 features. To ensure the data in internally consistent and enhance our training confidence, I firstly use standardization method to pre-process the data.

Gram matrix

With the given kernel, we need to calculate the gram matrix.

c. Mercer condition

Compute the eigenvalues of (b) gram matrix. We need to make sure the minimal eigenvalue is still non-negative. If not, the kernel candidate is not admissible. In Matlab, the computed eigenvalues are pretty much small. Here I set -1e-6 as the threshold. Only when the min eigenvalue is greater than the threshold, the training can move on.

d. Quadratic programing

Since the Mercer condition is already met, we have a global optimal value for this optimization problem. Following the Quadratic programming documentation in Matlab, we can calculate the optimal values.

e. Support vector determination

Set the threshold as 1e-6 to determine supper vectors in Matlab.

Weights calculation For linear case:

∑𝒊𝒊=𝟏𝟏

𝑵𝑵 𝟏𝟏 𝑻𝑻𝒙𝒙𝒊𝒊 ; 𝒃𝒃𝟎𝟎 = 𝒎𝒎 𝒃𝒃𝟎𝟎,𝒊𝒊

𝒘𝒘𝟎𝟎 = 𝜶𝜶𝟎𝟎,𝒊𝒊𝒅𝒅𝒊𝒊𝒙𝒙𝒊𝒊; 𝒃𝒃𝟎𝟎,𝒊𝒊 = − 𝒘𝒘𝟎𝟎

𝒅𝒅_𝒊𝒊𝒎𝒎

𝒊𝒊=𝟏𝟏

For non-linear case:

𝑵𝑵 ∑𝒎𝒎 𝒃𝒃𝟎𝟎,𝒊𝒊

𝒊𝒊=𝟏𝟏

𝒈𝒈(𝒙𝒙) = 𝜶𝜶_{𝟎𝟎,𝒊𝒊}𝒅𝒅_𝒊𝒊𝑲𝑲(𝒙𝒙, 𝒙𝒙_𝒊𝒊) + 𝒃𝒃_𝟎𝟎; 𝒃𝒃_𝟎𝟎=

𝒎𝒎

𝒊𝒊=𝟏𝟏

Task 2

Write a MATLAB (M-file) program to implement the SVMs with the discriminant functions obtained in Task 1.

Type of SVM	Training accuracy	Test accuracy
Hard margin with Linear kernel	0.938	0.92513

Hard margin with Polynomial kernel	p = 2	p = 3	p = 4	p = 5	p = 2	p = 3	p = 4	p = 5
Hard margin with Polynomial kernel	0.995	0.998	Nonconvex	Nonconvex	0.9014	0.8951 8	Nonconvex	Nonconvex

Soft margin with Polynomial kernel	C = 0.1	C = 0.6	C = 1.1	C = 2.1	C = 0.1	C = 0.6	C = 1.1	C = 2.1
p = 1	0.932	0.940 5	0.938	0.937	0.9205 7	0.9270 8	0.92383	0.92383
p = 2	0.980 5	0.994	0.995	0.9955	0.9095 1	0.8990 9	0.89909	0.89648
p = 3	0.996	0.998	0.998	0.998	0.9095 1	0.9010 4	0.89323	0.89063
p = 4		Non-convex				Non-convex
p = 5		Non-convex				Non-convex

Discussion

In terms of the hard margin with polynomial kernel, when p =4,5, the results are non-convex. The Mercer condition cannot be met in this situation.
In terms of the soft margin with polynomial kernel, when p = 4,5, the results are non-convex. The Mercer condition is not met no matter what c value is.
In terms of soft margin with polynomial kernel when p =2, we can find that with a larger tolerance of mis-classified samples (c value increases), the testing accuracy drops though the training accuracy is still consistent.
By comparing the hard margin with different kernel, although the training accuracy of linear kernel is lower than that of polynomial kernel, the testing accuracy of linear is higher.
Even though the training accuracy can reach almost 100% like p =2&3 with soft margin, the testing accuracy is still around 90%. It might be caused by overfitting.

Task 3

Design a SVM of your own.

In order to determine a better SVM classifier, we can start from two aspects based on previous experiment.

Kernel selection.

Here I tested two popular kernel besides linear and polynomial. They are Gaussian kernel and sigmoid kernel. After experiment, Gaussian kernel as a general-purpose kernel is used in this case. It works well when there is no prior knowledge about the data.

Soft margin parameters and Gaussian kernel parameters.

Type of SVM		Training accuracy				Test accuracy
Soft margin with Gaussian kernel	C = 0.1	C = 1	C = 10	C = 20	C = 0.1	C = 1	C = 10	C = 20
sigma = 1	0.7645	0.992	0.9975	0.9975	0.64909	0.75911	0.76237	0.75911
sigma = 10	0.9105	0.935	0.9555	0.9625	0.9056	0.92383	0.9362	0.93685
sigma = 20	0.876	0.921	0.938	0.942	0.87956	0.91146	0.93034	0.93164
sigma = 50	0.8565	0.8915	0.922	0.927	0.86393	0.88932	0.91602	0.91667

Here we found when sigma = 10 and c = 20, both training and testing accuracy achieve highest values in our experiment. Thus, we chose these two as our own SVM classifier.

[SOLVED] EE5904-Project 2

If Helpful Share:

Description

Task 1

c. Mercer condition

d. Quadratic programing

e. Support vector determination

Task 2

Discussion

Task 3

Related products

EE5904-Homework 2

EE5904-Homework 2 Rosenbrock’s Valley Problem, Function Approximation and Scene Classification

EE5904-Homework 3

Related in this category

More in this category

EE5904-Homework 3

EE5904-Project 1

EE5904-Homework 2 Rosenbrock’s Valley Problem, Function Approximation and Scene Classification

EE5904-Homework 2

EE5904-Homework 1

EE5904-Homework 1