We present tight convergence rate bounds for gradient descent and MM algorithms for maximum likelihood (ML) estimation and maximum a posteriori probability (MAP) estimation of a popular Bayesian inference method, for Bradley-Terry models of ranking data. Our results show that MM algorithms have the same convergence rate, up to a constant factor, as gradient descent algorithms with optimal constant step size. For the ML estimation objective, the convergence is linear with the rate crucially determined by the algebraic connectivity of the matrix of item pair co-occurrences in observed comparison data. For the MAP estimation objective, we show that the convergence rate is also linear, with the rate determined by a parameter of the prior distribution in a way that can make convergence arbitrarily slow for small values of this parameter. The limit of small values of this parameter corresponds to a flat, non-informative prior distribution.
机构:
Aoyama Gakuin Univ, Dept Integrated Informat Technol, Chuo Ku, Kanagawa 2525258, JapanAoyama Gakuin Univ, Dept Integrated Informat Technol, Chuo Ku, Kanagawa 2525258, Japan
Fujimoto, Yu
论文数: 引用数:
h-index:
机构:
Hino, Hideitsu
Murata, Noboru
论文数: 0引用数: 0
h-index: 0
机构:
Waseda Univ, Sch Sci & Engn, Shinjuku Ku, Tokyo 1698555, JapanAoyama Gakuin Univ, Dept Integrated Informat Technol, Chuo Ku, Kanagawa 2525258, Japan