Parallel fuzzy rough support vector machine for data classification in cloud environment

Cited by: 0
Author
Chaudhuri, Arindam [1]
Affiliation
[1] Samsung R and D Institute, Delhi Noida, 201304, India
Source
Informatica (Slovenia) | 2015 / Vol. 39 / No. 04
Keywords
Classification (of information) - Rough set theory - Support vector machines
DOI
Not available
Abstract
Data classification has been actively used as one of the most effective means of conveying knowledge and information to users. With the emergence of huge datasets, existing classification techniques fail to produce desirable results; the challenge lies in analyzing the characteristics of massive datasets by retrieving useful geometric and statistical patterns. We propose a supervised parallel fuzzy rough support vector machine (PFRSVM) for data classification in a cloud environment. The fuzzy rough set model reduces sensitivity to noisy samples and handles imprecision in the training samples, making the results more robust. The algorithm is parallelized to reduce training time. The system is built on a support vector machine library using the Hadoop implementation of MapReduce. The algorithm is tested on large datasets hosted in the cloud environment at the University of Technology and Management, India, to check its feasibility and convergence. It effectively resolves the effects of outliers and the problems of class imbalance and class overlap, generalizes to unseen data, and relaxes the dependency between features and labels, with better average classification accuracy. Experimental results on both synthetic and real datasets demonstrate the superiority of the proposed technique. As the experiments show, PFRSVM is scalable and reliable, and is characterized by order independence, computational transaction, failure recovery, atomic transactions, fault tolerance, and high availability.
Pages: 397 - 420
Related papers
50 in total
  • [21] A weighted support vector machine for data classification
    Yang, Xulei
    Song, Qing
    Wang, Yue
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (05) : 961 - 976
  • [22] Cloud Environment Service Classification of the Least Square Support Vector Machine Optimized by the Firefly Algorithm
    Lian, Sun
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03) : 3050 - 3055
  • [23] A comparative study of surface EMG classification by fuzzy relevance vector machine and fuzzy support vector machine
    Xie, Hong-Bo
    Huang, Hu
    Wu, Jianhua
    Liu, Lei
    PHYSIOLOGICAL MEASUREMENT, 2015, 36 (02) : 191 - 206
  • [24] A Comparative study of Classification techniques: Support vector Machine, Fuzzy Support vector Machine & Decision Trees
    Pandey, Priyank
    Jain, Amita
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3620 - 3624
  • [25] Weighted support vector machine using fuzzy rough set theory
    Moslemnejad, Somaye
    Hamidzadeh, Javad
    SOFT COMPUTING, 2021, 25 (13) : 8461 - 8481
  • [27] Imbalanced Data Classification using Complementary Fuzzy Support Vector Machine Techniques and SMOTE
    Pruengkarn, Ratchakoon
    Wong, Kok Wai
    Fung, Chun Che
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 978 - 983
  • [28] Support vector machine classification trees based on fuzzy entropy of classification
    Harrington, Peter de Boves
    ANALYTICA CHIMICA ACTA, 2017, 954 : 14 - 21
  • [29] An Email Classification Model Based on Rough Set and Support Vector Machine
    Zhu, Zhiqing
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 236 - 240
  • [30] Data mining with parallel support vector machines for classification
    Eitrich, Tatjana
    Lang, Bruno
    ADVANCES IN INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4243 : 197 - 206