The non-negative matrix factorization toolbox for biological data mining

被引:133
|
作者
Li, Yifeng [1 ]
Ngom, Alioune [1 ]
机构
[1] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Non-negative matrix factorization; Clustering; Bi-clustering; Feature extraction; Feature selection; Classification; Missing values;
D O I
10.1186/1751-0473-8-10
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Non-negative matrix factorization (NMF) has been introduced as an important method for mining biological data. Though there currently exists packages implemented in R and other programming languages, they either provide only a few optimization algorithms or focus on a specific application field. There does not exist a complete NMF package for the bioinformatics community, and in order to perform various data mining tasks on biological data. Results: We provide a convenient MATLAB toolbox containing both the implementations of various NMF techniques and a variety of NMF-based data mining approaches for analyzing biological data. Data mining approaches implemented within the toolbox include data clustering and bi-clustering, feature extraction and selection, sample classification, missing values imputation, data visualization, and statistical comparison. Conclusions: A series of analysis such as molecular pattern discovery, biological process identification, dimension reduction, disease prediction, visualization, and statistical comparison can be performed using this toolbox.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Kernel Joint Non-Negative Matrix Factorization for Genomic Data
    Salazar, Diego
    Rios, Juan
    Aceros, Sara
    Florez-Vargas, Oscar
    Valencia, Carlos
    IEEE ACCESS, 2021, 9 : 101863 - 101875
  • [32] Sparse non-negative matrix factorization for uncertain data clustering
    Chen, Danyang
    Wang, Xiangyu
    Xu, Xiu
    Zhong, Cheng
    Xu, Jinhui
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 615 - 636
  • [33] Filtering Wind in Infrasound Data by Non-Negative Matrix Factorization
    Carniel, Roberto
    Cabras, Giuseppe
    Ichihara, Mie
    Takeo, Minoru
    SEISMOLOGICAL RESEARCH LETTERS, 2014, 85 (05) : 1056 - 1062
  • [34] Non-negative matrix factorization in bioinformaltics: towards understanding biological processes
    Pascual-Montano, Alberto
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1332 - 1335
  • [35] A zero-inflated non-negative matrix factorization for the deconvolution of mixed signals of biological data
    Kong, Yixin
    Kozik, Ariangela
    Nakatsu, Cindy H.
    Jones-Hall, Yava L.
    Chun, Hyonho
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2022, 18 (01): : 203 - 218
  • [36] Non-negative matrix factorization based text mining: Feature extraction and classification
    Barman, P. C.
    Iqbal, Nadeem
    Lee, Soo-Young
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 703 - 712
  • [37] Traffic Risk Mining Using Partially Ordered Non-negative Matrix Factorization
    Lee, Taito
    Matsushima, Shin
    Yamanishi, Kenji
    PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, : 622 - 631
  • [38] Mining ratio rules via principal sparse non-negative matrix factorization
    Hu, CY
    Zhang, BY
    Yan, SC
    Yang, Q
    Yan, J
    Chen, Z
    Ma, WY
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 407 - 410
  • [39] Non-negative matrix factorization for target recognition
    Long, Hong-Lin
    Pi, Yi-Ming
    Cao, Zong-Jie
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1425 - 1429
  • [40] A framework for intelligent Twitter data analysis with non-negative matrix factorization
    Casalino, Gabriella
    Castiello, Ciro
    Del Buono, Nicoletta
    Mencar, Corrado
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2018, 14 (03) : 334 - 356