Text Retrieval Priors for Bayesian Logistic Regression

被引:6
|
作者
Yang, Eugene [1 ]
Lewis, David D. [2 ]
Frieder, Ophir [1 ]
机构
[1] Georgetown Univ, IR Lab, Washington, DC 20057 USA
[2] Cyxtera Technol, Dallas, TX USA
关键词
text classification; regularization; ad hoc retrieval; Bayesian priors; Bayesian logistic regression;
D O I
10.1145/3331184.3331299
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discriminative learning algorithms such as logistic regression excel when training data are plentiful, but falter when it is meager. An extreme case is text retrieval (zero training data), where discriminative learning is impossible and heuristics such as BM25, which combine domain knowledge (a topical keyword query) with generative learning (Naive Bayes), are dominant. Building on past work, we show that BM25-inspired Gaussian priors for Bayesian logistic regression based on topical keywords provide better effectiveness than the usual L2 (zero mode, uniform variance) Gaussian prior. On two high recall retrieval datasets, the resulting models transition smoothly from BM25 level effectiveness to discriminative effectiveness as training data volume increases, dominating L2 regularization even when substantial training data is available.
引用
收藏
页码:1045 / 1048
页数:4
相关论文
共 50 条
  • [1] Mapping the probability of ripened subsoils using Bayesian logistic regression with informative priors
    Steinbuch, Luc
    Brus, Dick J.
    Heuvelink, Gerard B. M.
    GEODERMA, 2018, 316 : 56 - 69
  • [2] Large-scale Bayesian logistic regression for text categorization
    Genkin, Alexander
    Lewis, David D.
    Madigan, David
    TECHNOMETRICS, 2007, 49 (03) : 291 - 304
  • [3] Logistic regression with weight grouping priors
    Korzen, M.
    Jaroszewicz, S.
    Klesk, P.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 64 : 281 - 298
  • [4] Sparse Bayesian variable selection in high-dimensional logistic regression models with correlated priors
    Ma, Zhuanzhuan
    Han, Zifei
    Ghosh, Souparno
    Wu, Liucang
    Wang, Min
    STATISTICAL ANALYSIS AND DATA MINING, 2024, 17 (01)
  • [5] Informative g-Priors for Logistic Regression
    Hanson, Timothy E.
    Branscum, Adam J.
    Johnson, Wesley O.
    BAYESIAN ANALYSIS, 2014, 9 (03): : 597 - 611
  • [6] Shrinkage priors for Bayesian penalized regression
    van Erp, Sara
    Oberski, Daniel L.
    Mulder, Joris
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2019, 89 : 31 - 50
  • [7] Bayesian regression filters and the issue of priors
    Zhu, HY
    Rohwer, R
    NEURAL COMPUTING & APPLICATIONS, 1996, 4 (03): : 130 - 142
  • [8] BAYESIAN LINEAR REGRESSION WITH SPARSE PRIORS
    Castillo, Ismael
    Schmidt-Hieber, Johannes
    Van der Vaart, Aad
    ANNALS OF STATISTICS, 2015, 43 (05): : 1986 - 2018
  • [9] Contraction properties of shrinkage priors in logistic regression
    Wei, Ran
    Ghosal, Subhashis
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2020, 207 : 215 - 229
  • [10] BAYESIAN LOGISTIC REGRESSION ANALYSIS
    van Erp, N.
    van Gelder, P.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2013, 1553 : 147 - 154