Poisson Dependency Networks: Gradient Boosted Models for Multivariate Count Data

被引:15
|
作者
Hadiji, Fabian [1 ]
Molina, Alejandro [1 ]
Natarajan, Sriraam [2 ]
Kersting, Kristian [1 ]
机构
[1] TU Dortmund Univ, LS 8, Dortmund, Germany
[2] Indiana Univ, Sch Informat & Comp, Bloomington, IN USA
关键词
Graphical models; Dependency networks; Poisson distribution; Learning; MAP inference; ALGORITHM; SELECTION; IMAGES; GUIDE;
D O I
10.1007/s10994-015-5506-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although count data are increasingly ubiquitous, surprisingly little work has employed probabilistic graphical models for modeling count data. Indeed the univariate case has been well studied, however, in many situations counts influence each other and should not be considered independently. Standard graphical models such as multinomial or Gaussian ones are also often ill-suited, too, since they disregard either the infinite range over the natural numbers or the potentially asymmetric shape of the distribution of count variables. Existing classes of Poisson graphical models can only model negative conditional dependencies or neglect the prediction of counts or do not scale well. To ease the modeling of multivariate count data, we therefore introduce a novel family of Poisson graphical models, called Poisson Dependency Networks (PDNs). A PDN consists of a set of local conditional Poisson distributions, each representing the probability of a single count variable given the others, that naturally facilitates a simple Gibbs sampling inference. In contrast to existing Poisson graphical models, PDNs are non-parametric and trained using functional gradient ascent, i.e., boosting. The particularly simple form of the Poisson distribution allows us to develop the first multiplicative boosting approach: starting from an initial constant value, alternatively a log-linear Poisson model, or a Poisson regression tree, a PDN is represented as products of regression models grown in a stage-wise optimization. We demonstrate on several real world datasets that PDNs can model positive and negative dependencies and scale well while often outperforming state-of-the-art, in particular when using multiplicative updates.
引用
收藏
页码:477 / 507
页数:31
相关论文
共 50 条
  • [31] Boosted multivariate trees for longitudinal data
    Pande, Amol
    Li, Liang
    Rajeswaran, Jeevanantham
    Ehrlinger, John
    Kogalur, Udaya B.
    Blackstone, Eugene H.
    Ishwaran, Hemant
    MACHINE LEARNING, 2017, 106 (02) : 277 - 305
  • [32] Multivariate random parameters collision count data models with spatial heterogeneity
    Barua, Sudip
    El-Basyouny, Karim
    Islam, Md. Tazul
    ANALYTIC METHODS IN ACCIDENT RESEARCH, 2016, 9 : 1 - 15
  • [33] Finite Mixtures of Multivariate Poisson-Log Normal Factor Analyzers for Clustering Count Data
    Payne, Andrea
    Silva, Anjali
    Rothstein, Steven J.
    McNicholas, Paul D.
    Subedi, Sanjeena
    arXiv, 2023,
  • [34] Zero-Inflated Poisson Regression Models with Right Censored Count Data
    Saffari, Seyed Ehsan
    Adnan, Robiah
    MATEMATIKA, 2011, 27 (01) : 21 - 29
  • [35] On Poisson-exponential-Tweedie models for ultra-overdispersed count data
    Rahma Abid
    Célestin C. Kokonendji
    Afif Masmoudi
    AStA Advances in Statistical Analysis, 2021, 105 : 1 - 23
  • [36] MIXED POISSON LIKELIHOOD REGRESSION-MODELS FOR LONGITUDINAL INTERVAL COUNT DATA
    THALL, PF
    BIOMETRICS, 1988, 44 (01) : 197 - 209
  • [37] ANALYZING HISTORICAL COUNT DATA - POISSON AND NEGATIVE BINOMIAL REGRESSION-MODELS
    BECK, EM
    TOLNAY, SE
    HISTORICAL METHODS, 1995, 28 (03): : 125 - 131
  • [38] On Poisson-exponential-Tweedie models for ultra-overdispersed count data
    Abid, Rahma
    Kokonendji, Celestin C.
    Masmoudi, Afif
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2021, 105 (01) : 1 - 23
  • [39] MULTIVARIATE GAMMA-POISSON MODELS
    NELSON, JF
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1985, 80 (392) : 828 - 834
  • [40] The Curious Case of Stacking Boosted Relational Dependency Networks
    Yan, Siwen
    Dhami, Devendra Singh
    Natarajan, Sriraam
    NEURIPS WORKSHOPS, 2020, 2020, 137 : 33 - 42