Optimal Bayesian Transfer Learning for Count Data

被引:2
|
作者
Karbalayghareh, Alireza [1 ]
Qian, Xiaoning [1 ]
Dougherty, Edward R. [1 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
Bayes methods; Cancer; Bioinformatics; Shape; Genomics; Data models; Optimal Bayesian transfer learning; optimal Bayesian classification; transfer learning; DIFFERENTIAL EXPRESSION ANALYSIS; MINIMUM EXPECTED ERROR; OPTIMAL CLASSIFIERS; CLASSIFICATION; FRAMEWORK; DISCRETE;
D O I
10.1109/TCBB.2019.2920981
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
There is often a limited amount of omics data to design predictive models in biomedicine. Knowing that these omics data come from underlying processes that may share common pathways and disease mechanisms, it may be beneficial for designing a more accurate and reliable predictor in a target domain of interest, where there is a lack of labeled data to leverage available data in relevant source domains. Here, we focus on developing Bayesian transfer learning methods for analyzing next-generation sequencing (NGS) data to help improve predictions in the target domain. We formulate transfer learning in a fully Bayesian framework and define the relatedness by a joint prior distribution of the model parameters of the source and target domains. Defining joint priors acts as a bridge across domains, through which the related knowledge of source data is transferred to the target domain. We focus on RNA-seq discrete count data, which are often overdispersed. To appropriately model them, we consider the Negative Binomial model and propose an Optimal Bayesian Transfer Learning (OBTL) classifier that minimizes the expected classification error in the target domain. We evaluate the performance of the OBTL classifier via both synthetic and cancer data from The Cancer Genome Atlas (TCGA).
引用
收藏
页码:644 / 655
页数:12
相关论文
共 50 条
  • [31] Kernelized Bayesian Transfer Learning
    Gonen, Mehmet
    Margolin, Adam A.
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1831 - 1839
  • [32] Transfer Learning for Bayesian Networks
    Luis, Roger
    Sucar, L. Enrique
    Morales, Eduardo F.
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2008, PROCEEDINGS, 2008, 5290 : 93 - 102
  • [33] PeakSeg: constrained optimal segmentation and supervised penalty learning for peak detection in count data
    Hocking, Toby Dylan
    Rigaill, Guillem
    Bourque, Guillaume
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 324 - 332
  • [34] Bayesian inference on quasi-sparse count data
    Datta, Jyotishka
    Dunson, David B.
    BIOMETRIKA, 2016, 103 (04) : 971 - 983
  • [35] Fast Bayesian estimation of spatial count data models
    Bansal, Prateek
    Krueger, Rico
    Graham, Daniel J.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 157
  • [36] Bayesian analysis of econometric models for count data: A survey
    Winkelmann, R
    EXPLORATORY DATA ANALYSIS IN EMPIRICAL RESEARCH, PROCEEDINGS, 2003, : 204 - 215
  • [37] Bayesian Spatial Modelling of Gamma Ray Count Data
    Daniela Leonte
    David J. Nott
    Mathematical Geology, 2006, 38 : 135 - 154
  • [38] A Nonparametric Bayesian Poisson Gamma Model for Count Data
    Gupta, Sunil Kumar
    Dinh Phung
    Venkatesh, Svetha
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1815 - 1818
  • [39] Bayesian predictive inference for time series count data
    Chen, MH
    Ibrahim, JG
    BIOMETRICS, 2000, 56 (03) : 678 - 685
  • [40] Bayesian Correction for Misclassification in Multilevel Count Data Models
    Nelson, Tyler
    Song, Joon Jin
    Chin, Yoo-Mi
    Stamey, James D.
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2018, 2018