ConCave-Convex procedure for support vector machines with Huber loss for text classification

Cited by: 0
Authors
Borah, Parashjyoti [1]
Gupta, Deepak [2]
Hazarika, Barenya Bikash [3]
Affiliations
[1] Indian Inst Informat Technol Guwahati Bongora, Dept Comp Sci & Engn, Gauhati 781015, Assam, India
[2] Motilal Nehru Natl Inst Technol Allahabad, Dept Comp Sci & Engn, Prayagraj 211004, Uttar Pradesh, India
[3] Assam Town Univ, Fac Comp Technol, Sankar Madhab Path,Gandhinagar, Gauhati 781026, Assam, India
Keywords
Support vector machine; Hinge loss; ConCave-Convex procedure; Ramp loss function; Huber loss functions; REGRESSION; CLASSIFIERS; ALGORITHM;
DOI
10.1016/j.compeleceng.2024.109925
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The classical support vector machine (SVM) adopts the linear Hinge loss, whereas the least squares SVM (LS-SVM) employs the quadratically growing least squares loss function. The Ramp loss SVM (RSVM) uses the robust Ramp loss function, which truncates the Hinge loss so that it becomes flat beyond a specified point, thereby increasing robustness to outliers. The recently proposed SVM with pinball loss (pin-SVM) uses the pinball loss function, which maximizes the margin between the class hyperplanes based on quantile distance. The Huber loss function generalizes the linear Hinge loss and the quadratic loss, and it addresses the sensitivity of the least squares loss to noise and outliers. In this work, we employ the robust Huber loss function for SVM classification to improve generalization performance. The cost function of the proposed approach consists of one convex part and one non-convex part, so optimization may converge to a local optimum instead of a global optimum. We propose a ConCave-Convex Procedure (CCCP) to address this issue. Additionally, the proximal cost is scaled for each sample according to the size of its class to reduce the effect of the class imbalance problem; in this sense, the proposed approach also incorporates class imbalance learning. Extensive experimental analysis establishes the efficacy of the proposed method. Furthermore, a sequential minimal optimization (SMO) procedure for high-dimensional HSVM is proposed, and its performance is tested on two text classification datasets.
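The loss functions named in the abstract can be compared with a minimal sketch. It uses the textbook definitions of the Hinge, least squares, Ramp, and Huber losses, not necessarily the exact formulation of the paper. The function names, the truncation point `s`, the threshold `delta`, and the inverse-class-frequency weighting are illustrative assumptions; the abstract does not specify how the cost is scaled by class size.

```python
import numpy as np

def hinge(margin, offset=1.0):
    # Linear Hinge loss on the margin y * f(x); zero once margin >= offset.
    return np.maximum(0.0, offset - margin)

def least_squares(margin):
    # Quadratically growing loss used by LS-SVM.
    return (1.0 - margin) ** 2

def ramp(margin, s=-1.0):
    # Ramp loss written as a difference of two convex Hinge terms.
    # This convex-minus-convex form is what makes CCCP applicable.
    # s < 1 is an assumed truncation point; the loss is capped at 1 - s.
    return hinge(margin, 1.0) - hinge(margin, s)

def huber(residual, delta=1.0):
    # Standard Huber loss: quadratic near zero, linear in the tails.
    a = np.abs(residual)
    return np.where(a <= delta, 0.5 * a ** 2, delta * (a - 0.5 * delta))

def class_weights(y):
    # Assumed scheme: weights inversely proportional to class size.
    labels, counts = np.unique(y, return_counts=True)
    w = len(y) / (len(labels) * counts)
    return dict(zip(labels.tolist(), w.tolist()))

margins = np.array([-3.0, 0.0, 2.0])
ramp_values = ramp(margins)          # flat for margins below s
weights = class_weights(np.array([1, 1, 1, -1]))
```

With `s = -1`, the Ramp loss never exceeds `1 - s = 2`, so a badly misclassified point contributes no more than a moderately misclassified one. The Huber loss grows only linearly for large residuals, which is why it is less sensitive to outliers than the least squares loss.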
Pages: 22
Related papers
50 in total
  • [21] Representative sampling for text classification using support vector machines
    Xu, Z
    Yu, K
    Tresp, V
    Xu, XW
    Wang, JZ
    ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 393 - 407
  • [22] Support vector machines for text categorization in Chinese question classification
    Lin, Xu-Dong
    Peng, Hong
    Liu, Bo
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 334 - +
  • [23] Word combination kernel for text classification with support vector machines
    School of Automation Science and Electrical Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China
    Comput. Inf., 2013, 4 (877-896):
  • [24] Transductive inference for text classification using Support Vector Machines
    Joachims, T
    MACHINE LEARNING, PROCEEDINGS, 1999, : 200 - 209
  • [25] Slow cortical potential signal classification using concave-convex feature
    Hou, Huirang
    Sun, Biao
    Meng, Qinghao
    JOURNAL OF NEUROSCIENCE METHODS, 2019, 324
  • [26] A complete classification of bifurcation diagrams of a Dirichlet problem with concave-convex nonlinearities
    Wang, SH
    Yeh, TS
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2004, 291 (01) : 128 - 153
  • [27] A New Convex Loss Function For Multiple Instance Support Vector Machines
    Kim, Sang-Baeg
    Bae, Jung-Man
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9023 - 9029
  • [28] Twin support vector regression with Huber loss
    Niu, Jiayi
    Chen, Jing
    Xu, Yitian
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 32 (06) : 4247 - 4258
  • [29] Margin maximization model of text classification based on support vector machines
    Chen, Peng
    Wen, Tao
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3514 - +
  • [30] Text Message Authorship Classification Using Kernel Support Vector Machines
    Kretchmar, Matt
    Zhao, Yifu
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 2, 2014, : 215 - 218