Optimal Bayesian Transfer Learning for Count Data

Cited by: 2
Authors
Karbalayghareh, Alireza [1]
Qian, Xiaoning [1]
Dougherty, Edward R. [1]
Affiliations
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
Funding
National Science Foundation (USA)
Keywords
Bayes methods; Cancer; Bioinformatics; Shape; Genomics; Data models; Optimal Bayesian transfer learning; optimal Bayesian classification; transfer learning; DIFFERENTIAL EXPRESSION ANALYSIS; MINIMUM EXPECTED ERROR; OPTIMAL CLASSIFIERS; CLASSIFICATION; FRAMEWORK; DISCRETE;
DOI
10.1109/TCBB.2019.2920981
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
There is often a limited amount of omics data to design predictive models in biomedicine. Because these omics data come from underlying processes that may share common pathways and disease mechanisms, leveraging available data in relevant source domains may help in designing a more accurate and reliable predictor in a target domain of interest where labeled data are scarce. Here, we focus on developing Bayesian transfer learning methods for analyzing next-generation sequencing (NGS) data to help improve predictions in the target domain. We formulate transfer learning in a fully Bayesian framework and define the relatedness by a joint prior distribution of the model parameters of the source and target domains. The joint prior acts as a bridge across domains, through which the related knowledge of source data is transferred to the target domain. We focus on RNA-seq discrete count data, which are often overdispersed. To appropriately model them, we consider the Negative Binomial model and propose an Optimal Bayesian Transfer Learning (OBTL) classifier that minimizes the expected classification error in the target domain. We evaluate the performance of the OBTL classifier on both synthetic data and cancer data from The Cancer Genome Atlas (TCGA).
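To make the modeling choice in the abstract concrete, the following is a minimal sketch of why a Negative Binomial model suits overdispersed counts, together with a toy plug-in classifier. It is not the OBTL classifier from the paper: OBTL integrates over a posterior built from a joint prior across source and target domains, whereas this sketch assumes the per-class parameters are already known. The mean/shape parameterization, the function names (`nb_log_pmf`, `nb_variance`, `classify`), and all numeric values are assumptions chosen for illustration.

```python
import math

def nb_log_pmf(x, mu, r):
    """Log-probability of count x under a Negative Binomial
    with mean mu and shape r (an assumed parameterization)."""
    return (math.lgamma(x + r) - math.lgamma(r) - math.lgamma(x + 1)
            + r * math.log(r / (r + mu))
            + x * math.log(mu / (r + mu)))

def nb_variance(mu, r):
    """Variance mu + mu^2 / r; it always exceeds the mean mu,
    which is what 'overdispersed' means relative to a Poisson."""
    return mu + mu * mu / r

def classify(counts, class_params, log_priors):
    """Plug-in rule: pick the class with the highest log-prior plus
    summed per-gene NB log-likelihood. Genes are treated as independent,
    and the parameters are assumed known (no Bayesian averaging)."""
    scores = []
    for params, log_prior in zip(class_params, log_priors):
        score = log_prior + sum(
            nb_log_pmf(x, mu, r) for x, (mu, r) in zip(counts, params)
        )
        scores.append(score)
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical (mean, shape) pairs for two genes in two classes.
CLASS_PARAMS = [
    [(10.0, 2.0), (50.0, 5.0)],  # class 0
    [(40.0, 2.0), (20.0, 5.0)],  # class 1
]
LOG_PRIORS = [math.log(0.5), math.log(0.5)]

if __name__ == "__main__":
    print(nb_variance(10.0, 2.0))
    print(classify([8, 55], CLASS_PARAMS, LOG_PRIORS))
    print(classify([45, 18], CLASS_PARAMS, LOG_PRIORS))
```

With a mean of 10 and a shape of 2, the variance is 60, six times the mean. A Bayesian treatment such as the one the abstract describes would replace the fixed parameters with an average over their posterior distribution.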
Pages: 644-655
Number of pages: 12
Related papers
50 in total
  • [21] Bayesian methods for time series of count data
    Obeidat, Mohammed
    Liu, Juxin
    Osgood, Nathaniel
    Klassen, Geoff
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (02) : 486 - 504
  • [22] Bayesian semiparametric isotonic regression for count data
    Dunson, DB
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 618 - 627
  • [23] Robust importance sampling for error estimation in the context of optimal Bayesian transfer learning
    Maddouri, Omar
    Qian, Xiaoning
    Alexander, Francis J.
    Dougherty, Edward R.
    Yoon, Byung-Jun
    PATTERNS, 2022, 3 (03):
  • [24] Sparse Bayesian modelling of underreported count data
    Dvorzak, Michaela
    Wagner, Helga
    STATISTICAL MODELLING, 2016, 16 (01) : 24 - 46
  • [25] Bayesian Model Selection for Longitudinal Count Data
    Oludare Ariyo
    Emmanuel Lesaffre
    Geert Verbeke
    Adrian Quintero
    Sankhya B, 2022, 84 : 516 - 547
  • [26] Bayesian Correlation Analysis for Sequence Count Data
    Sanchez-Taltavull, Daniel
    Ramachandran, Parameswaran
    Lau, Nelson
    Perkins, Theodore J.
    PLOS ONE, 2016, 11 (10):
  • [27] Bayesian Downscaling Methods for Aggregated Count Data
    Michaud, Clayton P.
    Sproul, Thomas W.
    AGRICULTURAL AND RESOURCE ECONOMICS REVIEW, 2018, 47 (01) : 178 - 194
  • [28] Bayesian quantile regression for longitudinal count data
    Jantre, Sanket
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (01) : 103 - 127
  • [29] Bayesian shrinkage estimation for stratified count data
    Hamura, Yasuyuki
    JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2024, 7 (01) : 431 - 453
  • [30] Bayesian Forecasting for Time Series of Count Data
    Nariswari, Rinda
    Pudjihastuti, Herena
    4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 427 - 435