Application of large-scale L2-SVM for microarray classification

被引:1
|
作者
Li, Baosheng [1 ]
Han, Baole [1 ]
Qin, Chuandong [1 ,2 ]
机构
[1] North Minzu Univ, Sch Math & Informat Sci, Yinchuan 750021, Ningxia, Peoples R China
[2] Ningxia Key Lab Intelligent Informat & Big Data P, Yinchuan 750021, Ningxia, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2022年 / 78卷 / 02期
基金
中国国家自然科学基金;
关键词
Microarray; Large-scale learning; Support vector machine; Stochastic gradient descent; GENE-EXPRESSION; SELECTION;
D O I
10.1007/s11227-021-03962-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional classification algorithms work well on general small-scale microarray datasets, but for large-scale scenarios, general machines are not capable of supporting the operation of these algorithms anymore for the memory and time costs. In this paper, we design a new application framework to perform the computation of at the fastest speed. First, the synthetic minority over-sampling technique is used to sample a few classes of sample for obtaining the balanced data. Then, a large-scale algorithm for L-2-SVM based on the stochastic gradient descent method is proposed and used for microarray classification. Also, We give a simple proof of the convergence of stochastic gradient descent algorithm. Next, various large-scale algorithms for support vector machines are performed on the microarray datasets to identify the most appropriate algorithm. Finally, a comparative analysis of loss functions is done to clearly understand the differences. The experimental results show that the stochastic gradient descent algorithm and the squared hinge loss is an attractive choice, which can achieve high accuracy in seconds.
引用
收藏
页码:2265 / 2286
页数:22
相关论文
共 50 条
  • [1] Application of large-scale L2-SVM for microarray classification
    Baosheng Li
    Baole Han
    Chuandong Qin
    The Journal of Supercomputing, 2022, 78 : 2265 - 2286
  • [2] L2-SVM: Dependence on the regularization parameter
    Doktorski L.
    Pattern Recognition and Image Analysis, 2011, 21 (2) : 254 - 257
  • [3] L2-SVM Training with Distributed Data
    Lodi, Stefano
    Nanculef, Ricardo
    Sartori, Claudio
    MULTI-AGENT SYSTEM TECHNOLOGIES, PROCEEDINGS, 2009, 5774 : 208 - +
  • [4] Fast SVM classifier for large-scale classification problems
    Wang, Huajun
    Li, Genghui
    Wang, Zhenkun
    INFORMATION SCIENCES, 2023, 642
  • [5] Online Nearest Point Algorithm for L2-SVM
    Wang, Guosheng
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 316 - 319
  • [6] A parallel SVM training algorithm on large-scale classification problems
    Zhang, JP
    Li, ZW
    Yang, J
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 1637 - 1641
  • [7] Neighborhood Preprocessing SVM for Large-scale Data Sets Classification
    Chen, Guangxi
    Xu, Jian
    Xiang, Xiaolin
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 245 - +
  • [8] Model Selection for the l2-SVM by Following the Regularization Path
    Bonidal, Remi
    Tindel, Samy
    Guermeur, Yann
    TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE XIII, 2014, 8342 : 83 - 112
  • [9] Large-scale Image Classification: Fast Feature Extraction and SVM Training
    Lin, Yuanqing
    Lv, Fengjun
    Zhu, Shenghuo
    Yang, Ming
    Cour, Timothee
    Yu, Kai
    Cao, Liangliang
    Huang, Thomas
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1689 - 1696
  • [10] Developing Anytime SVM Training Algorithms for Large-Scale Data Classification
    Han, Rui
    Ghanem, Moustafa
    Williams, Andreas
    Guo, Yike
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 360 - 366