Normal version is Hard-margin SVM
The hyperparameter C controls the effect of slack variables, which allows the classifier to handle:
- Outliers and misclassified data points
- Non-linearly separable datasets
The key difference in optimization compared to hard-margin SVM is that the Lagrange multipliers α are bounded: