Dimension Reduction of Microarray Data Using Gene Ontology and Correlation Filter

被引:1
|
作者
Banerjee, Ayan [1 ]
Pati, Soumen Kumar [2 ]
Gupta, Manan Kumar [2 ]
机构
[1] Jalpaiguri Govt Engn Coll, Dept Comp Sci, Jalpaiguri, W Bengal, India
[2] Maulana Abul Kalam Azad Univ Technol, Dept Bioinformat, Nadia, W Bengal, India
关键词
Gene ontology; Protein-protein interaction; Pearson's correlation coefficient; Gene selection;
D O I
10.1007/978-981-15-2449-3_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analysis of every variable at a microscopic level is not a feasible way. It might take a long time to perform any meaningful analysis. So, the valuable time and money are wasted for analysis of high dimensional data. In this paper, a better way is given to deal with high dimensional data and proposed a novel dimension reduction technique based on gene ontology and Pearson's correlation coefficient. The names of genes are extracted from microarray data and classify them based on biological processes of gene ontology and find their relation with cellular components. Next, the gene correlation factor is identified on every network of each of the group. Lastly, the independent component (IC) value of the genes related to the gene ontology is calculated and selects only those genes having the highest IC value of their corresponding network and eliminates the rest of the genes. It reduces size of the dataset at least 40% from its original size.
引用
收藏
页码:303 / 313
页数:11
相关论文
共 50 条
  • [1] Microarray data mining using gene ontology
    Li, SH
    Becich, MJ
    Gilbertson, J
    MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2, 2004, 107 : 778 - 782
  • [2] Microarray data mining using gene ontology
    Li, Songhui
    Becich, Michael J.
    Studies in Health Technology and Informatics, 2004, 107 : 778 - 782
  • [3] Dimension reduction for classification with gene expression microarray data
    Dai, Jian J.
    Lieu, Linh
    Rocke, David
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2006, 5
  • [4] Using Gene Ontology to Enhance Effectiveness of Similarity Measures for Microarray Data
    Chen, Zheng
    Tang, Jian
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2008, : 66 - 71
  • [5] Using Gene Ontology to enhance effectiveness of similarity measures for microarray data
    Chen, Zheng
    Tang, Jian
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (05) : 520 - 534
  • [6] Novel method for microarray data dimension reduction
    Wang, Gang
    Zhang, Yu-Xuan
    Li, Ying
    Chen, Hui-Ling
    Hu, Wei-Tong
    Qin, Lei
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2014, 44 (05): : 1429 - 1434
  • [7] Bayesian Dimension Reduction Models for Microarray Data
    Shieh, Albert D.
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2009, 5495 : 498 - 506
  • [8] Using fuzzy patterns for gene selection and data reduction on microarray data
    Diaz, Fernando
    Fdez-Riverola, Florentino
    Glez-Pena, Daniel
    Corchado, Juan M.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS, 2006, 4224 : 1087 - 1094
  • [9] Comparative study of unsupervised dimension reduction techniques for the visualization of microarray gene expression data
    Bartenhagen, Christoph
    Klein, Hans-Ulrich
    Ruckert, Christian
    Jiang, Xiaoyi
    Dugas, Martin
    BMC BIOINFORMATICS, 2010, 11
  • [10] Partial least squares dimension reduction for microarray gene expression data with a censored response
    Nguyen, DV
    MATHEMATICAL BIOSCIENCES, 2005, 193 (01) : 119 - 137