Poisson Dependency Networks: Gradient Boosted Models for Multivariate Count Data

被引:15
|
作者
Hadiji, Fabian [1 ]
Molina, Alejandro [1 ]
Natarajan, Sriraam [2 ]
Kersting, Kristian [1 ]
机构
[1] TU Dortmund Univ, LS 8, Dortmund, Germany
[2] Indiana Univ, Sch Informat & Comp, Bloomington, IN USA
关键词
Graphical models; Dependency networks; Poisson distribution; Learning; MAP inference; ALGORITHM; SELECTION; IMAGES; GUIDE;
D O I
10.1007/s10994-015-5506-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although count data are increasingly ubiquitous, surprisingly little work has employed probabilistic graphical models for modeling count data. Indeed the univariate case has been well studied, however, in many situations counts influence each other and should not be considered independently. Standard graphical models such as multinomial or Gaussian ones are also often ill-suited, too, since they disregard either the infinite range over the natural numbers or the potentially asymmetric shape of the distribution of count variables. Existing classes of Poisson graphical models can only model negative conditional dependencies or neglect the prediction of counts or do not scale well. To ease the modeling of multivariate count data, we therefore introduce a novel family of Poisson graphical models, called Poisson Dependency Networks (PDNs). A PDN consists of a set of local conditional Poisson distributions, each representing the probability of a single count variable given the others, that naturally facilitates a simple Gibbs sampling inference. In contrast to existing Poisson graphical models, PDNs are non-parametric and trained using functional gradient ascent, i.e., boosting. The particularly simple form of the Poisson distribution allows us to develop the first multiplicative boosting approach: starting from an initial constant value, alternatively a log-linear Poisson model, or a Poisson regression tree, a PDN is represented as products of regression models grown in a stage-wise optimization. We demonstrate on several real world datasets that PDNs can model positive and negative dependencies and scale well while often outperforming state-of-the-art, in particular when using multiplicative updates.
引用
收藏
页码:477 / 507
页数:31
相关论文
共 50 条
  • [1] Poisson Dependency Networks: Gradient Boosted Models for Multivariate Count Data
    Fabian Hadiji
    Alejandro Molina
    Sriraam Natarajan
    Kristian Kersting
    Machine Learning, 2015, 100 : 477 - 507
  • [2] A multivariate Poisson regression model for count data
    Munoz-Pichardo, J. M.
    Pino-Mejias, R.
    Garcia-Heras, J.
    Ruiz-Munoz, F.
    Luz Gonzalez-Regalado, M.
    JOURNAL OF APPLIED STATISTICS, 2021, 48 (13-15) : 2525 - 2541
  • [3] Sparse estimation of multivariate Poisson log-normal models from count data
    Wu, Hao
    Deng, Xinwei
    Ramakrishnan, Naren
    STATISTICAL ANALYSIS AND DATA MINING, 2018, 11 (02) : 66 - 77
  • [4] Factor models for multivariate count data
    Wedel, M
    Böckenholt, U
    Kamakura, WA
    JOURNAL OF MULTIVARIATE ANALYSIS, 2003, 87 (02) : 356 - 369
  • [5] Multivariate models for correlated count data
    Rodrigues-Motta, Mariana
    Pinheiro, Hildete P.
    Martins, Eduardo G.
    Araujo, Marcio S.
    dos Reis, Sergio F.
    JOURNAL OF APPLIED STATISTICS, 2013, 40 (07) : 1586 - 1596
  • [6] Regression Models for Multivariate Count Data
    Zhang, Yiwen
    Zhou, Hua
    Zhou, Jin
    Sun, Wei
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (01) : 1 - 13
  • [7] Splitting models for multivariate count data
    Peyhardi, Jean
    Fernique, Pierre
    Durand, Jean-Baptiste
    JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 181
  • [8] Bayesian multivariate Poisson regression for models of injury count, by severity
    Ma, Jianming
    Kockelman, Kara M.
    STATISTICAL METHODS AND CRASH PREDICTION MODELING, 2006, (1950): : 24 - 34
  • [9] Hierarchical Poisson models for spatial count data
    De Oliveira, Victor
    JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 122 : 393 - 408
  • [10] The effect of aggregating multivariate count data using Poisson profiles
    Moralesa, Victor Hugo
    Vargas, Jose Alberto
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (05) : 2646 - 2666