Poisson factor models with applications to non-normalized microRNA profiling

Cited by: 16
Authors
Lee, Seonjoo [1 ]
Chugh, Pauline E. [2 ]
Shen, Haipeng [3 ]
Eberle, R. [4 ]
Dittmer, Dirk P. [2 ]
Affiliations
[1] Henry M Jackson Fdn Adv Mil Med, Ctr Neurosci & Regenerat Med, Bethesda, MD 20892 USA
[2] Univ N Carolina, Dept Microbiol & Immunol, Chapel Hill, NC 27599 USA
[3] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
[4] Oklahoma State Univ, Ctr Vet Hlth Sci, Dept Vet Pathobiol, Stillwater, OK 74078 USA
Funding
National Science Foundation (USA);
Keywords
DIFFERENTIAL EXPRESSION ANALYSIS; RNA-SEQ;
DOI
10.1093/bioinformatics/btt091
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: Next-generation (NextGen) sequencing is becoming increasingly popular as an alternative for transcriptional profiling, including microRNA (miRNA) profiling and classification. miRNAs are a new class of molecules that are regulated in response to differentiation, tumorigenesis or infection. Our primary motivating application is to identify different viral infections based on the induced change in the host miRNA profile. Statistical challenges arise from special features of NextGen sequencing data: the data are read counts that are extremely skewed and non-negative, and the total number of reads varies dramatically across samples, which requires appropriate normalization. Statistical tools developed for microarray expression data, such as principal component analysis, are sub-optimal for analyzing NextGen sequencing data.
Results: We propose a family of Poisson factor models that explicitly takes into account the count nature of sequencing data and automatically incorporates sample normalization through the use of offsets. We develop an efficient algorithm for estimating the Poisson factor model, called Poisson Singular Value Decomposition with Offset (PSVDOS). The method is shown to outperform several other normalization and dimension reduction methods in a simulation study. Through analysis of an miRNA profiling experiment, we further illustrate that our model achieves insightful dimension reduction of the miRNA profiles of 18 samples: the extracted factors lead to more accurate and meaningful clustering of the cell lines.
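The idea of a Poisson factor model with an offset can be sketched in a few lines. The sketch below is not the PSVDOS algorithm from the paper, whose details are not given in this record. It is a minimal stand-in that assumes the log-rate of count X[i, j] is log(offset[i]) plus a low-rank term U[i] . V[j], takes the offset to be each sample's total read count, and fits U and V by plain gradient ascent with backtracking. The function names, the simulated data and all parameter values are hypothetical.

```python
import numpy as np


def poisson_loglik(X, log_offset, U, V):
    """Poisson log-likelihood (dropping the log X! constant) for
    log-rate[i, j] = log_offset[i] + U[i] . V[j]."""
    eta = log_offset[:, None] + U @ V.T
    with np.errstate(over="ignore", invalid="ignore"):
        return float(np.sum(X * eta - np.exp(eta)))


def fit_poisson_factor(X, offset, rank=2, n_iter=300, seed=0):
    """Illustrative fit by gradient ascent with backtracking (an assumption,
    not the paper's estimation algorithm)."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    log_offset = np.log(offset)
    U = 0.01 * rng.standard_normal((n, rank))
    V = 0.01 * rng.standard_normal((p, rank))
    ll = poisson_loglik(X, log_offset, U, V)
    history = [ll]
    step = 1e-3
    for _ in range(n_iter):
        # Gradient of the log-likelihood with respect to the log-rate matrix.
        resid = X - np.exp(log_offset[:, None] + U @ V.T)
        grad_U, grad_V = resid @ V, resid.T @ U
        # Halve the step until the likelihood does not decrease.
        while step > 1e-15:
            U_new, V_new = U + step * grad_U, V + step * grad_V
            ll_new = poisson_loglik(X, log_offset, U_new, V_new)
            if ll_new >= ll:
                U, V, ll = U_new, V_new, ll_new
                step *= 2.0
                break
            step *= 0.5
        history.append(ll)
    return U, V, history


# Simulated counts (hypothetical): 18 samples in 3 groups, 50 features,
# with sequencing depth varying tenfold across samples.
rng = np.random.default_rng(1)
n, p = 18, 50
group = np.repeat([0, 1, 2], 6)
base = rng.normal(-4.0, 1.0, size=p)
shift = rng.normal(0.0, 1.0, size=(3, p))
depth = rng.integers(5_000, 50_000, size=n)
X = rng.poisson(depth[:, None] * np.exp(base + shift[group]))

offset = X.sum(axis=1).astype(float)  # assumed offset: total reads per sample
U, V, history = fit_poisson_factor(X, offset, rank=2)
print(U.shape, V.shape)
```

Because the offset enters the log-rate directly, differences in sequencing depth are absorbed there rather than in the factors, so the rows of U can be passed to a clustering method without a separate normalization step.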
Pages: 1105-1111
Page count: 7
Related papers
50 in total
  • [31] A Model of Mixed Lubrication Based on Non-Normalized Discretization and Its Application for Multilayered Materials
    Dong, Qingbing
    Wang, Zhanjiang
    Zhu, Dong
    Meng, Fanming
    Xu, Lixin
    Zhou, Kun
    JOURNAL OF TRIBOLOGY-TRANSACTIONS OF THE ASME, 2019, 141 (04):
  • [32] Geometrical-light-propagation in non-normalized symmetric gradient-index media
    Correa, J. E. Gomez
    OPTICS EXPRESS, 2022, 30 (19) : 33896 - 33910
  • [33] INEQUALITIES OF TURAN TYPE FOR NON-NORMALIZED ULTRASPHERICAL POLYNOMIALS AND JACOBIAN FUNCTIONS OF SECOND KIND
    MAKAROV, VL
    DOPOVIDI AKADEMII NAUK UKRAINSKOI RSR, 1972, (02): : 124 - &
  • [34] Determination of Chromatographic Elution Profiles Using Non-normalized Singular Value Evolving Profiles
    Messick, N. J.
    Kalivas, J. H.
    Microchemical Journal, 55 (02):
  • [35] Performance Improvement Validation of Decision Tree Algorithms with Non-normalized Information Distance in Experiments
    Araki, Takeru
    Luo, Yuan
    Guo, Minyi
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2022, 13629 : 450 - 464
  • [36] New improvements of the regularized phase tracking technique for the processing of non-normalized fringe patterns
    Legarda-Sáenz, R
    Rivera, M
    Eighth International Symposium on Laser Metrology: MACRO-, MICRO-, AND NANO-TECHNOLOGIES APPLIED IN SCIENCE, ENGINEERING, AND INDUSTRY, 2005, 5776 : 692 - 698
  • [37] Uncertainty quantification: a minimum variance unbiased (joint) estimator of the non-normalized Sobol’ indices
    Matieyendou Lamboni
    Statistical Papers, 2020, 61 : 1939 - 1970
  • [38] Performance of intensity-based non-normalized pointwise algorithms in dynamic speckle analysis
    Stoykova, E.
    Nazarova, D.
    Berberova, N.
    Gotchev, A.
    OPTICS EXPRESS, 2015, 23 (19): : 25128 - 25142
  • [39] A FINE-GRAINED ARC-CONSISTENCY ALGORITHM FOR NON-NORMALIZED CONSTRAINT SATISFACTION PROBLEMS
    Arangu, Marlene
    Salido, Miguel A.
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2011, 21 (04) : 733 - 744
  • [40] Use of Non-Normalized, Non-Amplified cDNA for 454-Based RNA Sequencing of Fleshy Melon Fruit
    Portnoy, Vitaly
    Diber, Alex
    Pollock, Sarah
    Karchi, Hagai
    Lev, Shery
    Tzuri, Galil
    Harel-Beja, Rotem
    Forer, Relly
    Portnoy, Vitaly H.
    Lewinsohn, Efraim
    Tadmor, Yaakov
    Burger, Joseph
    Schaffer, Arthur
    Katzir, Nurit
    PLANT GENOME, 2011, 4 (01): : 36 - 46