Parallel fuzzy rough support vector machine for data classification in cloud environment

被引:0
|
作者
Chaudhuri, Arindam [1 ]
机构
[1] Samsung R and D Institute, Delhi Noida,201304, India
来源
Informatica (Slovenia) | 2015年 / 39卷 / 04期
关键词
Classification (of information) - Rough set theory - Support vector machines;
D O I
暂无
中图分类号
学科分类号
摘要
Data classification has been actively used for most effective means of conveying knowledge and information to users. With emergence of huge datasets existing classification techniques fail to produce desirable results where the challenge lies in analyzing characteristics of massive datasets by retrieving useful geometric and statistical patterns. We propose a supervised parallel fuzzy rough support vector machine (PFRSVM) for in-data classification in cloud environment. The fuzzy rough set model takes care of sensitiveness of noisy samples and handles impreciseness in training samples bringing robustness to results. The algorithm is parallelized with a view to reduce training times. The system is built on support vector machine library using Hadoop implementation of MapReduce. The algorithm is tested on large datasets present at the cloud environment available at University of Technology and Management, India to check its feasibility and convergence. It effectively resolves outliers' effects, imbalance and overlapping class problems, normalizes to unseen data and relaxes dependency between features and labels with better average classification accuracy. The experimental results on both synthetic and real datasets clearly demonstrate the superiority of the proposed technique. PFRSVM is scalable and reliable in nature and is characterized by order independence, computational transaction, failure recovery, atomic transactions, fault tolerant and high availability attributes as exhibited through various experiments.
引用
收藏
页码:397 / 420
相关论文
共 50 条
  • [41] A fuzzy regression based support vector machine (SVM) approach to fuzzy classification
    Chen, Yu
    Pedrycz, Witold
    Watada, Junzo
    ICIC Express Letters, 2010, 4 (6 B): : 2355 - 2362
  • [42] Quantum Support Vector Machine for Big Data Classification
    Rebentrost, Patrick
    Mohseni, Masoud
    Lloyd, Seth
    PHYSICAL REVIEW LETTERS, 2014, 113 (13)
  • [43] The Research of Support Vector Machine in Agricultural Data Classification
    Shi, Lei
    Duan, Qiguo
    Ma, Xinming
    Weng, Mei
    COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE V, PT III, 2012, 370 : 265 - +
  • [44] Application of support vector machine for classification of multispectral data
    Bahari, Nurul Iman Saiful
    Ahmad, Asmala
    Aboobaider, Burhanuddin Mohd
    7TH IGRSM INTERNATIONAL REMOTE SENSING & GIS CONFERENCE AND EXHIBITION, 2014, 20
  • [45] ν-twin support vector machine with Universum data for classification
    Yitian Xu
    Mei Chen
    Zhiji Yang
    Guohui Li
    Applied Intelligence, 2016, 44 : 956 - 968
  • [46] Classification of lung data by sampling and support vector machine
    Dehmeshki, J
    Chen, J
    Casique, MV
    Karakoy, M
    PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 3194 - 3197
  • [47] Improving Classification with Support Vector Machine for Unbalanced Data
    Muntean, M.
    Valean, H.
    Ileana, I.
    Rotar, C.
    PROCEEDINGS OF 2010 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2010), VOLS. 1-3, 2010,
  • [48] Classification of hyperspectral data using support vector machine
    Zhang, JP
    Zhang, Y
    Zhou, TX
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2001, : 882 - 885
  • [49] Support vector machine classification over encrypted data
    Huang, Hai
    Wang, Yongjian
    Zong, Haoran
    APPLIED INTELLIGENCE, 2022, 52 (06) : 5938 - 5948
  • [50] Incremental support vector machine for unlabeled data classification
    Hong, JH
    Cho, SB
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 1403 - 1407