Parallel fuzzy rough support vector machine for data classification in cloud environment

被引:0
|
作者
Chaudhuri, Arindam [1 ]
机构
[1] Samsung R and D Institute, Delhi Noida,201304, India
来源
Informatica (Slovenia) | 2015年 / 39卷 / 04期
关键词
Classification (of information) - Rough set theory - Support vector machines;
D O I
暂无
中图分类号
学科分类号
摘要
Data classification has been actively used for most effective means of conveying knowledge and information to users. With emergence of huge datasets existing classification techniques fail to produce desirable results where the challenge lies in analyzing characteristics of massive datasets by retrieving useful geometric and statistical patterns. We propose a supervised parallel fuzzy rough support vector machine (PFRSVM) for in-data classification in cloud environment. The fuzzy rough set model takes care of sensitiveness of noisy samples and handles impreciseness in training samples bringing robustness to results. The algorithm is parallelized with a view to reduce training times. The system is built on support vector machine library using Hadoop implementation of MapReduce. The algorithm is tested on large datasets present at the cloud environment available at University of Technology and Management, India to check its feasibility and convergence. It effectively resolves outliers' effects, imbalance and overlapping class problems, normalizes to unseen data and relaxes dependency between features and labels with better average classification accuracy. The experimental results on both synthetic and real datasets clearly demonstrate the superiority of the proposed technique. PFRSVM is scalable and reliable in nature and is characterized by order independence, computational transaction, failure recovery, atomic transactions, fault tolerant and high availability attributes as exhibited through various experiments.
引用
收藏
页码:397 / 420
相关论文
共 50 条
  • [1] Parallel Fuzzy Rough Support Vector Machine for Data Classification in Cloud Environment
    Chaudhuri, Arindam
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (04): : 397 - 420
  • [2] ROUGH SUPPORT VECTOR MACHINE FOR CLASSIFICATION WITH INTERVAL AND INCOMPLETE DATA
    Nowicki, Robert K.
    Grzanek, Konrad
    Hayashi, Yoichi
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2020, 10 (01) : 47 - 56
  • [3] Support vector machine for classification based on fuzzy training data
    Ji, Ai-bing
    Pang, Jia-hong
    Qiu, Hong-jie
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (04) : 3495 - 3498
  • [4] Support vector machine for classification based on fuzzy training data
    Ji, Ai-Bing
    Pang, Jia-Hong
    Li, Shu-Huan
    Sun, Jian-Ping
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1609 - +
  • [5] Fuzzy Support Vector Machine for Microarray Imbalanced Data Classification
    Ladayya, Faroh
    Purnami, Santi Wulan
    Irhamah
    13TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA2017), 2017, 1905
  • [6] A rough margin based fuzzy support vector machine
    Li, Kai
    Lu, Xiao-Xia
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2013, 41 (06): : 1183 - 1187
  • [7] A rough margin based fuzzy support vector machine
    Li, Kai
    Lu, Xiaoxia
    ADVANCED RESEARCH ON INDUSTRY, INFORMATION SYSTEMS AND MATERIAL ENGINEERING, PTS 1-7, 2011, 204-210 : 879 - 882
  • [8] Dense fuzzy support vector machine to binary classification for imbalanced data
    Wang, Qingling
    Zheng, Jian
    Zhang, Wenjing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9643 - 9653
  • [9] Data Classification with Support Vector Machine and Generalized Support Vector Machine
    Qi, Xiaomin
    Silvestrov, Sergei
    Nazir, Talat
    ICNPAA 2016 WORLD CONGRESS: 11TH INTERNATIONAL CONFERENCE ON MATHEMATICAL PROBLEMS IN ENGINEERING, AEROSPACE AND SCIENCES, 2017, 1798
  • [10] Iterative fuzzy support vector machine classification
    Shilton, Alistair
    Lai, Daniel T. H.
    2007 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-4, 2007, : 1396 - 1401