Semi-parametric Bayesian Inference for Multi-Season Baseball Data

被引:5
|
作者
Quintana, Fernando A. [1 ]
Mueller, Peter [2 ]
Rosner, Gary L. [2 ]
Munsell, Mark [2 ]
机构
[1] Pontificia Univ Catolica Chile, Dept Estadist, Santiago, Chile
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
来源
BAYESIAN ANALYSIS | 2008年 / 3卷 / 02期
关键词
Dirichlet Process; Partial Exchangeability; Semiparametric Random Effects;
D O I
10.1214/08-BA312
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We analyze complete sequences of successes (hits, walks, and sacrifices) for a group of players from the American and National Leagues, collected over 4 seasons. The goal is to describe how players' performance vary from season to season. In particular, we wish to assess and compare the effect of available occasion-specific covariates over seasons. The data are binary sequences for each player and each season. We model dependence in the binary sequence by an autoregressive logistic model. The model includes lagged terms up to a fixed order. For each player and season we introduce a different set of autologistic regression coefficients, i.e., the regression coefficients are random effects that are specific of each season and player. We use a nonparametric approach to define a random effects distribution. The nonparametric model is defined as a mixture with a Dirichlet process prior for the mixing measure. The described model is justified by a representation theorem for order-k exchangeable sequences. Besides the repeated measurements for each season and player, multiple seasons within a given player define an additional level of repeated measurements. We introduce dependence at this level of repeated measurements by relating the season-specific random effects vectors in an autoregressive fashion. We ultimately conclude that while some covariates like the ERA of the opposing pitcher are always relevant, others like an indicator for the game being into the seventh inning may be significant only for certain season, and some others, like the score of the game, can safely be ignored.
引用
收藏
页码:317 / 338
页数:22
相关论文
共 50 条
  • [31] Bayesian Semi-Parametric Regression for Quantile Residual Lifetime
    Park, Taeyoung
    Bae, Wonho
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2014, 21 (04) : 285 - 296
  • [32] A semi-parametric Bayesian approach to the instrumental variable problem
    Conley, Timothy G.
    Hansen, Christian B.
    McCulloch, Robert E.
    Rossi, Peter E.
    JOURNAL OF ECONOMETRICS, 2008, 144 (01) : 276 - 305
  • [33] A Bayesian semi-parametric model for thermal proteome profiling
    Siqi Fang
    Paul D. W. Kirk
    Marcus Bantscheff
    Kathryn S. Lilley
    Oliver M. Crook
    Communications Biology, 4
  • [34] A Bayesian semi-parametric bivariate failure time model
    Nieto-Barajas, Luis E.
    Walker, Stephen G.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (12) : 6102 - 6113
  • [35] A Bayesian semi-parametric approach to extreme regime identification
    do Nascimento, Fernando Ferraz
    Gamerman, Dani
    Davis, Richard
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2016, 30 (04) : 540 - 561
  • [36] Bayesian analysis of generalized elliptical semi-parametric models
    Rondon, Luz Marina
    Bolfarine, Heleno
    JOURNAL OF APPLIED STATISTICS, 2016, 43 (08) : 1508 - 1524
  • [37] Bayesian semi-parametric G-computation for causal inference in a cohort study with MNAR dropout and death
    Josefsson, Maria
    Daniels, Michael J.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2021, 70 (02) : 398 - 414
  • [38] A Semi-parametric Bayesian Approach for Differential Expression Analysis of RNA-seq Data
    Fangfang Liu
    Chong Wang
    Peng Liu
    Journal of Agricultural, Biological, and Environmental Statistics, 2015, 20 : 555 - 576
  • [39] Fast and efficient Bayesian semi-parametric curve-fitting and clustering in massive data
    Sabyasachi Mukhopadhyay
    Sisir Roy
    Sourabh Bhattacharya
    Sankhya B, 2012, 74 (1) : 77 - 106
  • [40] Fast and efficient Bayesian semi-parametric curve-fitting and clustering in massive data
    Mukhopadhyay, Sabyasachi
    Roy, Sisir
    Bhattacharya, Sourabh
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2012, 74 (01): : 77 - 106