CROP: correlation-based reduction of feature multiplicities in untargeted metabolomic data

被引:11
|
作者
Kouril, Stepan [1 ,2 ]
de Sousa, Julie [1 ,3 ]
Vaclavik, Jan [1 ]
Friedecky, David [1 ,2 ]
Adam, Tomas [1 ,2 ]
机构
[1] Palacky Univ Olomouc, Inst Mol & Translat Med, Lab Metabol, Olomouc 77900, Czech Republic
[2] Univ Hosp Olomouc, Dept Clin Biochem, Olomouc 77900, Czech Republic
[3] Palacky Univ Olomouc, Dept Math Anal & Applicat Math, Fac Sci, Olomouc 77900, Czech Republic
关键词
MASS-SPECTROMETRY; ANNOTATION;
D O I
10.1093/bioinformatics/btaa012
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
aSummary: Untargeted liquid chromatography-high-resolution mass spectrometry analysis produces a large number of features which correspond to the potential compounds in the sample that is analyzed. During the data processing, it is necessary to merge features associated with one compound to prevent multiplicities in the data and possible misidentification. The processing tools that are currently employed use complex algorithms to detect abundances, such as adducts or isotopes. However, most of them are not able to deal with unpredictable adducts and in-source fragments. We introduce a simple open-source R-script CROP based on Pearson pairwise correlations and retention time together with a graphical representation of the correlation network to remove these redundant features.
引用
收藏
页码:2941 / 2942
页数:2
相关论文
共 50 条
  • [1] Correlation-Based Feature Mapping of Crowdsourced LTE Data
    Apajalahti, Kasper
    Walelgne, Ermias Andargie
    Manner, Jukka
    Hyvonen, Eero
    2018 IEEE 29TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2018,
  • [2] Monitoring Data Reduction in Data Centers: A Correlation-Based Approach
    Peng, Xuesong
    Pernici, Barbara
    SMART CITIES, GREEN TECHNOLOGIES, AND INTELLIGENT TRANSPORT SYSTEMS, 2017, 738 : 135 - 153
  • [3] Correlation-Based Feature Selection and Regression
    Cui, Yue
    Lin, Jesse S.
    Zhang, Shiliang
    Luo, Suhuai
    Tian, Qi
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 25 - +
  • [4] Enhancing Big Data Feature Selection Using a Hybrid Correlation-Based Feature Selection
    Mohamad, Masurah
    Selamat, Ali
    Krejcar, Ondrej
    Crespo, Ruben Gonzalez
    Herrera-Viedma, Enrique
    Fujita, Hamido
    ELECTRONICS, 2021, 10 (23)
  • [5] Distributed correlation-based feature selection in spark
    Palma-Mendoza, Raul Jose
    de-Marcos, Luis
    Rodriguez, Daniel
    Alonso-Betanzos, Amparo
    INFORMATION SCIENCES, 2019, 496 : 287 - 299
  • [6] An improved correlation-based algorithm with discretization for attribute reduction in data clustering
    Kannan, S. Senthamarai
    Ramaraj, N.
    Data Science Journal, 2009, 8 : 125 - 138
  • [7] Classification via Correlation-based Feature Grouping
    Maleki, Mina
    Rueda, Luis
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2015, : 61 - 66
  • [8] Correlation-based Feature Ranking for Online Classification
    Osman, Hassab Elgawi
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3077 - 3082
  • [9] A Correlation-Based Feature Selection Algorithm for Operating Data of Nuclear Power Plants
    He, Yuxuan
    Yu, Hongxing
    Yu, Ren
    Song, Jian
    Lian, Haibo
    He, Jiangyang
    Yuan, Jiangtao
    SCIENCE AND TECHNOLOGY OF NUCLEAR INSTALLATIONS, 2021, 2021
  • [10] CoDR: Correlation-Based Data Reduction Scheme for Efficient Gathering of Heterogeneous Driving Data
    Park, Junho
    Chung, Yoojin
    Choi, Jongmoo
    SENSORS, 2020, 20 (06)