A Concordance Study of the Preprocessing Orders in Microarray Data

被引:0
|
作者
Kim, Sang-Cheol
Lee, Jae-hwi
Kim, Byung Soo
机构
关键词
Concordance measure; imputation; microarray; normalization; preprocessing; t3; statistic;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Researchers of microarray experiment transpose processed images of raw data to possible data of statistical analysis: it is preprocessing. Preprocessing of microarray has image filtering, imputation and normalization. There have been studied about several different methods of normalization and imputation, but there was not further study on the order of the procedures. We have no further study about which things put first on our procedure between normalization and imputation. This study is about the identification of differentially expressed genes(DEG) on the order of the preprocessing steps using two-dye cDNA microarray in colon cancer and gastric cancer. That is, we check for compare which combination of imputation and normalization steps can detect the DEG. We used imputation methods(K-nearly neighbor, Baysian principle comparison analysis) and normalization methods(global, within-print tip group, variance stabilization). Therefore, preprocessing steps have 12 methods. We identified concordance measure of DEG using the datasets to which the 12 different preprocessing orders were applied. When we applied preprocessing using variance stabilization of normalization method, there was a little variance in a sensitive way for detecting DEG.
引用
收藏
页码:585 / 594
页数:10
相关论文
共 50 条
  • [41] Comparative study of microarray data for cancer research
    Phan, JH
    Quo, CF
    Wang, MD
    PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 2960 - 2963
  • [42] Data Preprocessing for Web Data Mining
    Zhang, Wei
    Chen, Tinggui
    ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 2, 2012, 149 : 303 - +
  • [43] A study on combining dynamic selection and data preprocessing for imbalance learning
    Roy, Anandarup
    Cruz, Rafael M. O.
    Sabourin, Robert
    Cavalcanti, George D. C.
    NEUROCOMPUTING, 2018, 286 : 179 - 192
  • [44] A study of the suitability of autoencoders for preprocessing data in breast cancer experimentation
    Macias-Garcia, Laura
    Maria Luna-Romera, Jose
    Garcia-Gutierrez, Jorge
    Martinez-Ballesteros, Maria
    Riquelme-Santos, Jose C.
    Gonzalez-Campora, Ricardo
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 72 : 33 - 44
  • [45] Preprocessing of Alarm Data for Data Mining
    Mannani, Zahra
    Izadi, Iman
    Ghadiri, Nasser
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2019, 58 (26) : 11261 - 11274
  • [46] Data preprocessing in predictive data mining
    Alexandropoulos, Stamatios-Aggelos N.
    Kotsiantis, Sotiris B.
    Vrahatis, Michael N.
    KNOWLEDGE ENGINEERING REVIEW, 2019, 34
  • [47] Multivariate data approximation with preprocessing of data
    Kosin´ski, Witold
    Kowalczyk, Dorota
    Weigl, Martyna
    Computer Assisted Mechanics and Engineering Sciences, 2007, 14 (04): : 651 - 658
  • [48] Data Preprocessing as a Service - Outsourcing data preprocessing for AI models using a digital platform
    Kureljusic M.
    Karger E.
    Informatik Spektrum, 2022, 45 (1) : 13 - 19
  • [49] Noninferiority tests based on concordance correlation coefficient for assessment of the agreement for gene expression data from microarray experiments
    Liao, Chen-Tuo
    Lin, Chia-Ying
    Liu, Jen-pei
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2007, 17 (02) : 309 - 327
  • [50] Data Preprocessing as a Service - Outsourcing data preprocessing for AI models using a digital platform
    Kureljusic, Marko
    Karger, Erik
    Informatik-Spektrum, 2022, 45 (01) : 13 - 19