A novel approach to feature extraction from classification models based on information gene pairs

Cited by: 10
Authors
Li, J. [1 ]
Tang, X. [1 ]
Liu, J. [1 ]
Huang, J. [1 ]
Wang, Y. [1 ]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
Keywords
feature extraction; information gene pair; microarray data; cancer classification; genetic algorithm
DOI
10.1016/j.patcog.2007.11.019
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Various microarray experiments are now carried out in many laboratories, resulting in the rapid accumulation of microarray data in public repositories. One of the major challenges in analyzing microarray data is how to extract and select efficient features from it for accurate cancer classification. Here we introduce a new feature extraction and selection method based on information gene pairs that show significant changes across different tissue samples. Experimental results on five public microarray data sets demonstrate that the feature subset selected by the proposed method performs well and achieves higher classification accuracy with several classifiers. We perform an extensive experimental comparison between the features selected by the proposed method and those selected by other methods, using different evaluation methods and classifiers. The results confirm that the proposed method performs as well as other methods on the acute lymphoblastic-acute myeloid leukemia, adenocarcinoma, and breast cancer data sets while using fewer information genes, and leads to a significant improvement in classification accuracy on the colon and diffuse large B cell lymphoma cancer data sets. (C) 2007 Elsevier Ltd. All rights reserved.
Pages: 1975-1984
Page count: 10