A STUDY ON THE ERROR OF DISTRIBUTED ALGORITHMS FOR BIG DATA CLASSIFICATION WITH SVM

被引:0
|
作者
Wang, Cheng [1 ]
Cao, Feilong [1 ]
机构
[1] China Jiliang Univ, Appl Math Dept, Hangzhou, Zhejiang, Peoples R China
来源
ANZIAM JOURNAL | 2017年 / 58卷 / 3-4期
基金
中国国家自然科学基金;
关键词
distributed algorithm; big data; support vector machine; Tsybakov exponent; geometric noise exponent;
D O I
10.1017/S1446181116000390
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The error of a distributed algorithm for big data classification with a support vector machine (SVM) is analysed in this paper. First, the given big data sets are divided into small subsets, on which the classical SVM with Gaussian kernels is used. Then, the classification error of the SVM for each subset is analysed based on the Tsybakov exponent, geometric noise, and width of the Gaussian kernels. Finally, the whole error of the distributed algorithm is estimated in terms of the error of each subset.
引用
收藏
页码:231 / 237
页数:7
相关论文
共 50 条
  • [1] Comparative Study of Big Data Classification Algorithm Based on SVM
    Zou, Huasheng
    Jin, Zhiyuan
    2018 CROSS STRAIT QUAD-REGIONAL RADIO SCIENCE AND WIRELESS TECHNOLOGY CONFERENCE (CSQRWC), 2018,
  • [2] An overview of recent distributed algorithms for learning fuzzy models in Big Data classification
    Pietro Ducange
    Michela Fazzolari
    Francesco Marcelloni
    Journal of Big Data, 7
  • [3] An overview of recent distributed algorithms for learning fuzzy models in Big Data classification
    Ducange, Pietro
    Fazzolari, Michela
    Marcelloni, Francesco
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [4] Distributed classification for imbalanced big data in distributed environments
    Wang, Huihui
    Xiao, Mingfei
    Wu, Changsheng
    Zhang, Jing
    WIRELESS NETWORKS, 2024, 30 (05) : 3657 - 3668
  • [5] A Performance Evaluation of Classification Algorithms for Big Data
    Hai, Mo
    Zhang, You
    Zhang, Youjin
    5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 : 1100 - 1107
  • [6] Crucial Power Flow Interface Discrimination Based on Distributed Improved-SVM Classification in a Big Data Set
    Huang, Tian-en
    Guo, Qinglai
    Sun, Hongbin
    Niu, Tao
    Guo, Wenxin
    Wang, Bin
    2016 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PESGM), 2016,
  • [7] Classification using ASTER data and SVM algorithms; The case study of Beer Sheva, Israel
    Zhu, GB
    Blumberg, DG
    REMOTE SENSING OF ENVIRONMENT, 2002, 80 (02) : 233 - 240
  • [8] Random Partition Based Adaptive Distributed Kernelized SVM for Big Data
    Pal, Amrit
    Chowdhury, Abishi
    Satakshi
    Narman, Husnu S.
    Chowdhury, Arkabandhu
    Kumar, Manish
    IEEE ACCESS, 2022, 10 : 95623 - 95637
  • [9] Models and algorithms for classifying big data based on distributed data streams
    Mao G.-J.
    Hu D.-J.
    Xie S.-Y.
    1600, Science Press (40): : 161 - 175
  • [10] Imbalanced Big Data Classification: A Distributed Implementation of SMOTE
    Rastogi, Avnish Kumar
    Narang, Nitin
    Siddiqui, Zamir Ahmad
    PROCEEDINGS OF THE WORKSHOP PROGRAM OF THE 19TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING (ICDCN'18), 2018,