A Two-Phase Clustering Approach for Peak Alignment in Mining Mass Spectrometry Data

Authors
Chen, Lien-Chin [1]
Liu, Yu-Cheng [1]
Liu, Chi-Wei [1]
Tseng, Vincent S. [1]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
Keywords
Biomarker discovery; Clustering; Mass spectrometry analysis; Peak alignment; Proteomics; Classification; Serum
DOI
Not available
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
In recent years, mass spectrometry technologies have emerged as useful tools for biomarker discovery through the study of protein profiles in various biological specimens. When mining mass spectrometry datasets, peak alignment is a critical preprocessing step that affects the quality of the analysis results. In this paper, we propose a novel algorithm named Two-Phases Clustering for peak Alignment (TPC-Align) to align mass spectrometry peaks across samples in the preprocessing phase. TPC-Align sequentially considers the distribution of peak intensity values and the locations of peak mass-to-charge ratio (m/z) values between samples. Moreover, TPC-Align can report a list of peaks that differ significantly between samples, which serve as candidate biomarkers for further biological study. In experimental evaluations, the proposed method was compared with the existing peak alignment approach based on one-dimensional hierarchical clustering, and the results show that TPC-Align outperforms the traditional method on a real dataset.
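For orientation, the baseline that the abstract compares against (peak alignment by one-dimensional hierarchical clustering of m/z values) can be sketched roughly as follows. This is not TPC-Align, whose details the abstract does not give. The sketch assumes single-linkage clustering cut at a fixed m/z tolerance, which in one dimension reduces to sorting the pooled peaks and splitting wherever the gap between neighbours exceeds the tolerance. The function name `align_peaks_1d`, the `tol` parameter, and the choice to keep the strongest intensity per sample in each cluster are all illustrative assumptions.

```python
from statistics import mean


def align_peaks_1d(samples, tol):
    """Align peaks across samples by 1-D single-linkage clustering on m/z.

    samples: list of samples, each a list of (mz, intensity) tuples.
    tol:     hypothetical cutoff; neighbouring peaks whose m/z values differ
             by at most tol are merged into the same cluster.
    Returns a list of (mean_mz, intensities) pairs, where intensities holds
    one value per sample (0.0 if the sample has no peak in that cluster).
    """
    # Pool all peaks, remembering which sample each one came from.
    pooled = sorted(
        (mz, intensity, idx)
        for idx, peaks in enumerate(samples)
        for mz, intensity in peaks
    )

    # In 1-D, cutting a single-linkage dendrogram at height tol is the same
    # as splitting the sorted values wherever consecutive gaps exceed tol.
    clusters = []
    for peak in pooled:
        if clusters and peak[0] - clusters[-1][-1][0] <= tol:
            clusters[-1].append(peak)
        else:
            clusters.append([peak])

    # Build one aligned row per cluster (assumption: keep the strongest
    # peak when a sample contributes more than one peak to a cluster).
    aligned = []
    for cluster in clusters:
        row = [0.0] * len(samples)
        for mz, intensity, idx in cluster:
            row[idx] = max(row[idx], intensity)
        aligned.append((mean(p[0] for p in cluster), row))
    return aligned
```

Because this baseline looks only at m/z locations, two nearby peaks with very different intensity profiles can be merged; the abstract indicates that TPC-Align additionally takes the distribution of intensity values into account.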
Pages: 223-227
Page count: 5