Improving Training Stability for Multitask Ranking Models in Recommender Systems

被引:4
|
作者
Tang, Jiaxi [1 ]
Drori, Yoel [2 ]
Chang, Daryl [3 ]
Sathiamoorthy, Maheswaran [1 ]
Gilmer, Justin [1 ]
Wei, Li [3 ]
Yi, Xinyang [1 ]
Hong, Lichan [1 ]
Chi, Ed H. [1 ]
机构
[1] Google Deepmind, Mountain View, CA 94043 USA
[2] Google Res, Tel Aviv, Israel
[3] Google Inc, Mountain View, CA USA
关键词
Recommender System; Optimization; Training Stability;
D O I
10.1145/3580305.3599846
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recommender systems play an important role in many content platforms. While most recommendation research is dedicated to designing better models to improve user experience, we found that research on stabilizing the training for such models is severely under-explored. As recommendation models become larger and more sophisticated, they are more susceptible to training instability issues, i.e., loss divergence, which can make the model unusable, waste significant resources and block model developments. In this paper, we share our findings and best practices we learned for improving the training stability of a real-world multitask ranking model for YouTube recommendations. We show some properties of the model that lead to unstable training and conjecture on the causes. Furthermore, based on our observations of training dynamics near the point of training instability, we hypothesize why existing solutions would fail, and propose a new algorithm to mitigate the limitations of existing solutions. Our experiments on YouTube production dataset show the proposed algorithm can significantly improve training stability while not compromising convergence, comparing with several commonly used baseline methods. We open source our implementation at https://github.com/tensorflow/recommenders/ tree/main/tensorflow_recommenders/experimental/optimizers/clippy_adagrad.py.
引用
收藏
页码:4882 / 4893
页数:12
相关论文
共 50 条
  • [1] Improving Personalized Ranking in Recommender Systems with Multimodal Interactions
    da Costa, Arthur F.
    Domingues, Marcos A.
    Rezende, Solange O.
    Manzato, Marcelo G.
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2014, : 198 - 204
  • [2] Improving Recommender Systems with Large Language Models
    Lubos, Sebastian
    ADJUNCT PROCEEDINGS OF THE 32ND ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2024, 2024, : 40 - 44
  • [3] Ranking of signals in multitask agency models
    Xie, Jia
    ECONOMICS LETTERS, 2011, 112 (01) : 82 - 84
  • [4] Improving Personalized Ranking in Recommender Systems with Topic Hierarchies and Implicit Feedback
    Manzato, Marcelo G.
    Domingues, Marcos A.
    Marcacini, Ricardo M.
    Rezende, Solange O.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3696 - 3701
  • [5] Analyzing and improving stability of matrix factorization for recommender systems
    D'Amico, Edoardo
    Gabbolini, Giovanni
    Bernardis, Cesare
    Cremonesi, Paolo
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 58 (02) : 255 - 285
  • [6] Analyzing and improving stability of matrix factorization for recommender systems
    Edoardo D’Amico
    Giovanni Gabbolini
    Cesare Bernardis
    Paolo Cremonesi
    Journal of Intelligent Information Systems, 2022, 58 : 255 - 285
  • [7] Improving Stability of Recommender Systems: A Meta-Algorithmic Approach
    Adomavicius, Gediminas
    Zhang, Jingjing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1573 - 1587
  • [8] Streaming Ranking Based Recommender Systems
    Wang, Weiqing
    Yin, Hongzhi
    Huang, Zi
    Wang, Qinyong
    Du, Xingzhong
    Quoc Viet Hung Nguyen
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 525 - 534
  • [9] Recommender systems based on ranking performance optimization
    Richong ZHANG
    Han BAO
    Hailong SUN
    Yanghao WANG
    Xudong LIU
    Frontiers of Computer Science, 2016, 10 (02) : 270 - 280
  • [10] Recommender systems based on ranking performance optimization
    Richong Zhang
    Han Bao
    Hailong Sun
    Yanghao Wang
    Xudong Liu
    Frontiers of Computer Science, 2016, 10 : 270 - 280