Application of large-scale L2-SVM for microarray classification

Cited by: 1

Authors:

Li, Baosheng [1]

Han, Baole [1]

Qin, Chuandong [1,2]

Affiliations:

[1] North Minzu Univ, Sch Math & Informat Sci, Yinchuan 750021, Ningxia, Peoples R China

[2] Ningxia Key Lab Intelligent Informat & Big Data P, Yinchuan 750021, Ningxia, Peoples R China

Source:

JOURNAL OF SUPERCOMPUTING | 2022 / Vol. 78 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

Microarray; Large-scale learning; Support vector machine; Stochastic gradient descent; GENE-EXPRESSION; SELECTION;

DOI:

10.1007/s11227-021-03962-7

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Traditional classification algorithms work well on small-scale microarray datasets, but in large-scale scenarios ordinary machines can no longer run these algorithms because of their memory and time costs. In this paper, we design a new application framework that performs the computation as quickly as possible. First, the synthetic minority over-sampling technique (SMOTE) is used to oversample the minority classes and obtain balanced data. Then, a large-scale algorithm for L2-SVM based on the stochastic gradient descent method is proposed and used for microarray classification. We also give a simple proof of the convergence of the stochastic gradient descent algorithm. Next, various large-scale algorithms for support vector machines are run on the microarray datasets to identify the most appropriate algorithm. Finally, a comparative analysis of loss functions is carried out to clarify the differences between them. The experimental results show that the stochastic gradient descent algorithm combined with the squared hinge loss is an attractive choice, achieving high accuracy in seconds.
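
The core idea named in the abstract, stochastic gradient descent on the squared hinge loss, can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the objective (lam/2)*||w||^2 + mean(max(0, 1 - y*(w.x + b))^2), the step-size schedule, the hyperparameter values, and the names sgd_l2svm, predict and make_toy_data are all hypothetical, and the data is synthetic rather than a microarray dataset. The SMOTE balancing step is omitted.

```python
import numpy as np

def sgd_l2svm(X, y, lam=1e-2, eta0=0.01, epochs=20, seed=0):
    """SGD on an assumed L2-SVM objective with squared hinge loss.

    Labels y must be in {-1, +1}. All hyperparameters are illustrative.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    t = 0
    for _ in range(epochs):
        for i in rng.permutation(n):          # one pass in random order
            t += 1
            eta = eta0 / (1.0 + lam * eta0 * t)   # assumed decaying step size
            slack = max(0.0, 1.0 - y[i] * (X[i] @ w + b))
            # Gradient of lam/2*||w||^2 + slack^2 for the sampled point;
            # the loss term vanishes when the margin is already >= 1.
            grad_w = lam * w - 2.0 * slack * y[i] * X[i]
            grad_b = -2.0 * slack * y[i]
            w -= eta * grad_w
            b -= eta * grad_b
    return w, b

def predict(X, w, b):
    """Sign of the linear decision function, mapped to {-1, +1}."""
    return np.where(X @ w + b >= 0.0, 1, -1)

def make_toy_data(n=200, d=20, seed=1):
    """Synthetic two-class data: Gaussian noise plus a class shift on feature 0."""
    rng = np.random.default_rng(seed)
    y = np.where(rng.random(n) < 0.5, 1, -1)
    X = rng.normal(size=(n, d))
    X[:, 0] += 2.5 * y
    return X, y

if __name__ == "__main__":
    X, y = make_toy_data()
    w, b = sgd_l2svm(X, y)
    print("training accuracy:", np.mean(predict(X, w, b) == y))
```

Each update touches a single sample, so memory use is independent of the number of samples, which is the property that makes this family of methods attractive at large scale. In practice the features would likely need standardizing and the step size tuning.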

Pages: 2265-2286

Page count: 22

50 items in total

[1] Application of large-scale L2-SVM for microarray classification
Baosheng Li
Baole Han
Chuandong Qin
The Journal of Supercomputing, 2022, 78 : 2265 - 2286
[2] L2-SVM: Dependence on the regularization parameter
Doktorski L.
Pattern Recognition and Image Analysis, 2011, 21 (2) : 254 - 257
[3] L2-SVM Training with Distributed Data
Lodi, Stefano
Nanculef, Ricardo
Sartori, Claudio
MULTI-AGENT SYSTEM TECHNOLOGIES, PROCEEDINGS, 2009, 5774 : 208 - +
[4] Fast SVM classifier for large-scale classification problems
Wang, Huajun
Li, Genghui
Wang, Zhenkun
INFORMATION SCIENCES, 2023, 642
[5] Online Nearest Point Algorithm for L2-SVM
Wang, Guosheng
FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 316 - 319
[6] A parallel SVM training algorithm on large-scale classification problems
Zhang, JP
Li, ZW
Yang, J
Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 1637 - 1641
[7] Neighborhood Preprocessing SVM for Large-scale Data Sets Classification
Chen, Guangxi
Xu, Jian
Xiang, Xiaolin
FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 245 - +
[8] Model Selection for the l2-SVM by Following the Regularization Path
Bonidal, Remi
Tindel, Samy
Guermeur, Yann
TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE XIII, 2014, 8342 : 83 - 112
[9] Large-scale Image Classification: Fast Feature Extraction and SVM Training
Lin, Yuanqing
Lv, Fengjun
Zhu, Shenghuo
Yang, Ming
Cour, Timothee
Yu, Kai
Cao, Liangliang
Huang, Thomas
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1689 - 1696
[10] Developing Anytime SVM Training Algorithms for Large-Scale Data Classification
Han, Rui
Ghanem, Moustafa
Williams, Andreas
Guo, Yike
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 360 - 366
