Semi-parametric Bayesian Inference for Multi-Season Baseball Data

被引:5
|
作者
Quintana, Fernando A. [1 ]
Mueller, Peter [2 ]
Rosner, Gary L. [2 ]
Munsell, Mark [2 ]
机构
[1] Pontificia Univ Catolica Chile, Dept Estadist, Santiago, Chile
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
来源
BAYESIAN ANALYSIS | 2008年 / 3卷 / 02期
关键词
Dirichlet Process; Partial Exchangeability; Semiparametric Random Effects;
D O I
10.1214/08-BA312
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We analyze complete sequences of successes (hits, walks, and sacrifices) for a group of players from the American and National Leagues, collected over 4 seasons. The goal is to describe how players' performance vary from season to season. In particular, we wish to assess and compare the effect of available occasion-specific covariates over seasons. The data are binary sequences for each player and each season. We model dependence in the binary sequence by an autoregressive logistic model. The model includes lagged terms up to a fixed order. For each player and season we introduce a different set of autologistic regression coefficients, i.e., the regression coefficients are random effects that are specific of each season and player. We use a nonparametric approach to define a random effects distribution. The nonparametric model is defined as a mixture with a Dirichlet process prior for the mixing measure. The described model is justified by a representation theorem for order-k exchangeable sequences. Besides the repeated measurements for each season and player, multiple seasons within a given player define an additional level of repeated measurements. We introduce dependence at this level of repeated measurements by relating the season-specific random effects vectors in an autoregressive fashion. We ultimately conclude that while some covariates like the ERA of the opposing pitcher are always relevant, others like an indicator for the game being into the seventh inning may be significant only for certain season, and some others, like the score of the game, can safely be ignored.
引用
收藏
页码:317 / 338
页数:22
相关论文
共 50 条
  • [41] A Semi-parametric Bayesian Approach for Differential Expression Analysis of RNA-seq Data
    Liu, Fangfang
    Wang, Chong
    Liu, Peng
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2015, 20 (04) : 555 - 576
  • [42] A semi-parametric Bayesian analysis of survival data based on levy-driven processes
    Nieto-Barajas, LE
    Walker, SG
    LIFETIME DATA ANALYSIS, 2005, 11 (04) : 529 - 543
  • [43] Multivariate binormal mixtures for semi-parametric inference on ROC curves
    Sarat C. Dass
    Seong W. Kim
    Journal of the Korean Statistical Society, 2011, 40 : 397 - 410
  • [44] Multivariate binormal mixtures for semi-parametric inference on ROC curves
    Dass, Sarat C.
    Kim, Seong W.
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2011, 40 (04) : 397 - 410
  • [45] Statistical inference on semi-parametric partial linear additive models
    Wei, Chuan-hua
    Liu, Chunling
    JOURNAL OF NONPARAMETRIC STATISTICS, 2012, 24 (04) : 809 - 823
  • [46] A semi-parametric method for transforming data to normality
    Koekemoer, Gerhard
    Swanepoel, Jan W. H.
    STATISTICS AND COMPUTING, 2008, 18 (03) : 241 - 257
  • [47] A semi-parametric model of the hemodynamic response for multi-subject fMRI data
    Zhang, Tingting
    Li, Fan
    Beckes, Lane
    Coan, James A.
    NEUROIMAGE, 2013, 75 : 136 - 145
  • [48] Robust Bayesian synthetic likelihood via a semi-parametric approach
    Ziwen An
    David J. Nott
    Christopher Drovandi
    Statistics and Computing, 2020, 30 : 543 - 557
  • [49] Semi-parametric optimization for missing data imputation
    Yongsong Qin
    Shichao Zhang
    Xiaofeng Zhu
    Jilian Zhang
    Chengqi Zhang
    Applied Intelligence, 2007, 27 : 79 - 88
  • [50] A Semi-Parametric Mode Regression with Censored Data
    S. Khardani
    Mathematical Methods of Statistics, 2019, 28 : 39 - 56