A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis

Cited by: 252
Authors
Chen, Hui-Ling [1 ,2 ]
Yang, Bo [1 ,2 ]
Liu, Jie [1 ,2 ]
Liu, Da-You [1 ,2 ]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Breast cancer diagnosis; Rough set theory; Support vector machines; Feature selection; SYSTEM; RULES;
DOI
10.1016/j.eswa.2011.01.120
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Breast cancer is becoming a leading cause of death among women worldwide, and it has been confirmed that early detection and accurate diagnosis of the disease can ensure long survival of patients. Expert systems and machine learning techniques are gaining popularity in this field because of their effective classification and high diagnostic capability. In this paper, a rough set (RS) based support vector machine classifier (RS_SVM) is proposed for breast cancer diagnosis. In the proposed method, an RS reduction algorithm is employed as a feature selection tool to remove redundant features and further improve the diagnostic accuracy of the SVM. The effectiveness of RS_SVM is examined on the Wisconsin Breast Cancer Dataset (WBCD) using classification accuracy, sensitivity, specificity, confusion matrix and receiver operating characteristic (ROC) curves. Experimental results demonstrate that the proposed RS_SVM can not only achieve very high classification accuracy but also detect a combination of five informative features, which can give physicians an important clue for breast cancer diagnosis. (C) 2011 Elsevier Ltd. All rights reserved.
Pages: 9014-9022
Number of pages: 9
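The abstract describes two steps that a small example may make more concrete: a rough set reduction that drops redundant features, and an evaluation based on confusion-matrix statistics. The following is a minimal sketch, not the authors' implementation. It assumes a greedy QuickReduct-style search driven by the rough set dependency degree (the record does not say which reduction algorithm the paper uses), and the function names `dependency`, `quick_reduct` and `diagnostic_metrics`, as well as the toy data, are hypothetical. The SVM training step is omitted; in practice a classifier would be fitted on the columns the reduct keeps.

```python
from collections import defaultdict


def dependency(rows, labels, attrs):
    """Rough set dependency degree of the labels on the attribute subset.

    Rows are grouped into equivalence classes by their values on `attrs`;
    the result is the fraction of rows in classes with a single label
    (the positive region).
    """
    classes = defaultdict(list)
    for row, label in zip(rows, labels):
        classes[tuple(row[a] for a in attrs)].append(label)
    positive = sum(len(ls) for ls in classes.values() if len(set(ls)) == 1)
    return positive / len(rows)


def quick_reduct(rows, labels):
    """Greedy search (assumed heuristic): repeatedly add the attribute that
    raises the dependency degree most, until it matches the full set."""
    all_attrs = list(range(len(rows[0])))
    target = dependency(rows, labels, all_attrs)
    reduct = []
    while dependency(rows, labels, reduct) < target:
        candidates = [a for a in all_attrs if a not in reduct]
        best = max(candidates,
                   key=lambda a: dependency(rows, labels, reduct + [a]))
        reduct.append(best)
    return sorted(reduct)


def diagnostic_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity and specificity from binary confusion counts."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
    }


# Hypothetical toy data: three discrete features, label = feature 0 OR feature 2,
# so feature 1 carries no additional information.
toy_rows = [(0, 0, 0), (0, 1, 1), (1, 0, 0), (1, 1, 1), (0, 1, 0), (1, 0, 1)]
toy_labels = [0, 1, 1, 1, 0, 1]
selected = quick_reduct(toy_rows, toy_labels)

# Hypothetical predictions, only to exercise the metric function.
metrics = diagnostic_metrics([1, 1, 1, 0, 0, 0, 0, 1], [1, 1, 0, 0, 0, 1, 0, 1])

if __name__ == "__main__":
    print("selected feature indices:", selected)
    print(metrics)
```

On the toy data the search keeps features 0 and 2 and discards the redundant feature 1. The dependency-degree criterion only applies to discrete attribute values, so continuous features would need discretization first, which is a further assumption of this sketch.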