Smoothing Blemished Gene Expression Microarray Data via Missing Value Imputation

Authors
Cai, Zhipeng [1]
Shi, Yi [1]
Song, Meng [1]
Goebel, Randy [1]
Lin, Guohui [1]
Affiliations
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
DOI
10.1109/IEMBS.2008.4650505
Chinese Library Classification (CLC) number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Gene expression microarray technology has enabled advanced biological and medical research, but the data are widely recognized as noisy and must be used with caution, since they are strongly affected by many experimental factors such as RNA concentration, spot typing, hybridization conditions, and image analysis. It is highly desirable that inaccurate data entries ("stains") be identified and subsequently curated. In this paper, we propose a novel computational method, based on feature gene selection and sample classification, to efficiently discover the stains and apply imputation methods to estimate their values. Extensive experimental results on three Affymetrix platforms for human cancer diagnosis showed that, by picking only 1-4% of the data entries as the most likely stains, the smoothed datasets could be used for better downstream data analyses such as robust biomarker identification and disease diagnosis.
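
The imputation step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not say which imputation methods are applied, so a plain k-nearest-neighbour average over genes is swapped in here as an assumption. The stain-detection step (feature gene selection and sample classification) is not reproduced either; the sketch simply takes a boolean mask of flagged entries as input. The names `knn_impute`, `stain_mask`, and `k` are hypothetical.

```python
import numpy as np

def knn_impute(X, stain_mask, k=3):
    """Re-estimate flagged entries of a genes-by-samples matrix.

    Assumption: a flagged ("stain") entry is treated as missing and is
    replaced by the mean of the same sample over the k most similar
    genes that have no flagged entries.
    """
    X = np.asarray(X, dtype=float)
    stain_mask = np.asarray(stain_mask, dtype=bool)
    out = X.copy()

    has_stain = stain_mask.any(axis=1)
    donors = X[~has_stain]              # genes with no flagged entries
    if donors.shape[0] == 0:
        raise ValueError("need at least one gene without flagged entries")

    for g in np.flatnonzero(has_stain):
        ok = ~stain_mask[g]             # samples still trusted for gene g
        if ok.any():
            # Root-mean-square distance over the trusted samples only.
            diff = donors[:, ok] - X[g, ok]
            dist = np.sqrt((diff ** 2).mean(axis=1))
            nearest = np.argsort(dist)[:k]
        else:
            # No trusted sample left for this gene: use every donor.
            nearest = np.arange(donors.shape[0])
        out[g, ~ok] = donors[nearest][:, ~ok].mean(axis=0)
    return out
```

Entries that are not flagged are returned unchanged, so only the small fraction of entries marked as likely stains is smoothed.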
Pages: 5688-5691
Number of pages: 4
Related papers
50 in total
  • [31] Missing Value Imputation for Mixed Data via Gaussian Copula
    Zhao, Yuxuan
    Udell, Madeleine
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 636 - 646
  • [32] A hybrid imputation approach for microarray missing value estimation
    Huihui Li
    Changbo Zhao
    Fengfeng Shao
    Guo-Zheng Li
    Xiao Wang
    BMC Genomics, 16
  • [33] A hybrid imputation approach for microarray missing value estimation
    Li, Huihui
    Zhao, Changbo
    Shao, Fengfeng
    Li, Guo-Zheng
    Wang, Xiao
    BMC GENOMICS, 2015, 16
  • [34] Incorporating Nonlinear Relationships in Microarray Missing Value Imputation
    Yu, Tianwei
    Peng, Hesen
    Sun, Wei
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 723 - 731
  • [35] Missing value imputation in DNA microarray gene expression data: a comparative study of an improved collaborative filtering method with decision tree based approach
    Saha, Sujay
    Ghosh, Anupam
    Bandopadhyay, Saikat
    Dey, Kashi Nath
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 18 (02) : 130 - 139
  • [36] Estimating Missing Value in Microarray Gene Expression Data Using Fuzzy Similarity Measure
    Paul, Amit
    Sil, Jaya
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1890 - 1895
  • [37] A Novel Biclustering Based Missing Value Prediction Method for Microarray Gene Expression Data
    Bose, Shilpi
    Das, Chandra
    Chattopadhyay, Samiran
    PROCEEDINGS 2015 INTERNATIONAL CONFERENCE ON MAN AND MACHINE INTERFACING (MAMI), 2015,
  • [38] The influence of missing value imputation on detection of differentially expressed genes from microarray data
    Scheel, I
    Aldrin, M
    Glad, IK
    Sorum, R
    Lyng, H
    Frigessi, A
    BIOINFORMATICS, 2005, 21 (23) : 4272 - 4279
  • [39] A Bicluster-Based Sequential Interpolation Imputation Method for Estimation of Missing Values in Microarray Gene Expression Data
    Das, Chandra
    Bose, Shilpi
    Chattopadhyay, Samiran
    Chattopadhyay, Matangini
    Hossain, Alamgir
    CURRENT BIOINFORMATICS, 2017, 12 (02) : 118 - 130
  • [40] KNN-DTW Based Missing Value Imputation for Microarray Time Series Data
    Hsu, Hui-Huang
    Yang, Andy C.
    Lu, Ming-Da
    JOURNAL OF COMPUTERS, 2011, 6 (03) : 418 - 425