Multi-class Support Vector Machine with Maximizing Minimum Margin (2312.06578v2)
Abstract: Support Vector Machine (SVM) is a prominent machine learning technique widely applied in practical pattern recognition tasks. It achieves binary classification by maximizing the "margin", which represents the minimum distance between instances and the decision boundary. Although many efforts have been dedicated to extending SVM to the multi-class case through strategies such as one versus one and one versus the rest, satisfactory solutions remain to be developed. In this paper, we propose a novel method for multi-class SVM that incorporates pairwise class loss considerations and maximizes the minimum margin. Following this concept, we adopt a new formulation that gives multi-class SVM greater flexibility. Furthermore, the relationships between the proposed method and multiple forms of multi-class SVM are analyzed. The proposed regularizer, akin to the concept of "margin", can serve as a seamless enhancement over the softmax in deep learning, providing guidance for network parameter learning. Empirical evaluations demonstrate the effectiveness and superiority of our proposed method over existing multi-classification methods. Code is available at https://github.com/zz-haooo/M3SVM.
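To make the notion of a "minimum margin" over class pairs concrete, the following is a minimal sketch and not the paper's formulation. It assumes a linear multi-class scorer s_k(x) = w_k · x + b_k, so that the boundary between classes j and k is the hyperplane (w_j - w_k) · x + (b_j - b_k) = 0. The function name `min_pairwise_margin` and the toy data are hypothetical, introduced only for illustration.

```python
import math

def min_pairwise_margin(X, y, W, b):
    """Smallest signed distance from any sample to any boundary that
    separates its own class from a rival class (illustrative assumption:
    linear scores s_k(x) = W[k] . x + b[k])."""
    best = math.inf
    for x, label in zip(X, y):
        for k in range(len(W)):
            if k == label:
                continue
            # Normal vector of the boundary between class `label` and class k.
            diff = [wj - wk for wj, wk in zip(W[label], W[k])]
            norm = math.sqrt(sum(d * d for d in diff))
            if norm == 0.0:
                continue  # identical weight vectors define no boundary
            score_gap = sum(d * xi for d, xi in zip(diff, x)) + b[label] - b[k]
            # Positive when x lies on its own class's side of the boundary.
            best = min(best, score_gap / norm)
    return best

# Toy data: three classes in the plane (hypothetical values).
X = [[2.0, 0.0], [3.0, 0.0], [-2.0, 0.0], [0.0, 3.0]]
y = [0, 0, 1, 2]
W = [[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0]]
b = [0.0, 0.0, 0.0]

margin = min_pairwise_margin(X, y, W, b)
print(round(margin, 4))  # about 1.4142 for this toy setup
```

A positive result means every sample sits on the correct side of all of its pairwise boundaries. A method that maximizes the minimum margin would, under this reading, seek the W and b that push this smallest distance as high as possible.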