A Two-Phase Clustering Approach for Peak Alignment in Mining Mass Spectrometry Data

Authors
Chen, Lien-Chin [1]
Liu, Yu-Cheng [1]
Liu, Chi-Wei [1]
Tseng, Vincent S. [1]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
Keywords
Biomarker discovery; Clustering; Mass spectrometry analysis; Peak alignment; Proteomics; Classification; Serum
DOI
Not available
Chinese Library Classification
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
In recent years, mass spectrometry technologies have emerged as useful tools for biomarker discovery through the study of protein profiles in various biological specimens. In mining mass spectrometry datasets, peak alignment is a critical preprocessing step that affects the quality of the analysis results. In this paper, we propose a novel algorithm named Two-Phases Clustering for peak Alignment (TPC-Align) to align mass spectrometry peaks across samples in the preprocessing phase. The TPC-Align algorithm sequentially considers the distribution of peak intensity values and the locations of peak mass-to-charge ratio (m/z) values across samples. Moreover, the TPC-Align algorithm can report a list of peaks that differ significantly between samples, which serve as candidate biomarkers for further biological study. The proposed method was compared experimentally with the current peak alignment approach based on one-dimensional hierarchical clustering, and the results show that TPC-Align outperforms the traditional method on a real dataset.
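The abstract does not give the details of TPC-Align, so the following is only a minimal sketch of the general idea of clustering peaks in two sequential phases, first on intensity and then on m/z. It is not the authors' algorithm. The gap-based grouping rule, the log10 intensity scale, the function names `gap_cluster` and `two_phase_align`, the thresholds, and the sample peaks are all assumptions made for illustration.

```python
import math


def gap_cluster(items, key, max_gap):
    """Simple 1-D clustering (assumed rule, not from the paper):
    sort by key and start a new group whenever the gap to the
    previous item exceeds max_gap."""
    groups = []
    prev = None
    for item in sorted(items, key=key):
        k = key(item)
        if groups and k - prev <= max_gap:
            groups[-1].append(item)
        else:
            groups.append([item])
        prev = k
    return groups


def two_phase_align(peaks, intensity_gap=0.5, mz_gap=0.5):
    """peaks: list of (sample_id, mz, intensity) tuples.
    Phase 1 groups peaks by log10 intensity; phase 2 groups each
    intensity level by m/z. Both thresholds are hypothetical."""
    aligned = []
    levels = gap_cluster(peaks, key=lambda p: math.log10(p[2]),
                         max_gap=intensity_gap)
    for level in levels:
        aligned.extend(gap_cluster(level, key=lambda p: p[1],
                                   max_gap=mz_gap))
    # Order the aligned groups by the m/z of their first peak.
    return sorted(aligned, key=lambda g: g[0][1])


# Made-up peaks from two samples, for illustration only.
peaks = [
    ("s1", 1000.1, 500.0),
    ("s2", 1000.3, 520.0),
    ("s1", 1000.2, 5.0),   # close in m/z, very different intensity
    ("s2", 1500.0, 480.0),
]

groups = two_phase_align(peaks)
one_dim = gap_cluster(peaks, key=lambda p: p[1], max_gap=0.5)

for g in groups:
    print([(s, mz) for s, mz, _ in g])
```

In this toy input, clustering on m/z alone (`one_dim`) merges the weak peak at 1000.2 with the two strong peaks near 1000, while the two-phase sketch keeps it in a separate group because it falls in a different intensity level.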
Pages: 223-227
Number of pages: 5