Improving Training Stability for Multitask Ranking Models in Recommender Systems

被引:4
|
作者
Tang, Jiaxi [1 ]
Drori, Yoel [2 ]
Chang, Daryl [3 ]
Sathiamoorthy, Maheswaran [1 ]
Gilmer, Justin [1 ]
Wei, Li [3 ]
Yi, Xinyang [1 ]
Hong, Lichan [1 ]
Chi, Ed H. [1 ]
机构
[1] Google Deepmind, Mountain View, CA 94043 USA
[2] Google Res, Tel Aviv, Israel
[3] Google Inc, Mountain View, CA USA
关键词
Recommender System; Optimization; Training Stability;
D O I
10.1145/3580305.3599846
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recommender systems play an important role in many content platforms. While most recommendation research is dedicated to designing better models to improve user experience, we found that research on stabilizing the training for such models is severely under-explored. As recommendation models become larger and more sophisticated, they are more susceptible to training instability issues, i.e., loss divergence, which can make the model unusable, waste significant resources and block model developments. In this paper, we share our findings and best practices we learned for improving the training stability of a real-world multitask ranking model for YouTube recommendations. We show some properties of the model that lead to unstable training and conjecture on the causes. Furthermore, based on our observations of training dynamics near the point of training instability, we hypothesize why existing solutions would fail, and propose a new algorithm to mitigate the limitations of existing solutions. Our experiments on YouTube production dataset show the proposed algorithm can significantly improve training stability while not compromising convergence, comparing with several commonly used baseline methods. We open source our implementation at https://github.com/tensorflow/recommenders/ tree/main/tensorflow_recommenders/experimental/optimizers/clippy_adagrad.py.
引用
收藏
页码:4882 / 4893
页数:12
相关论文
共 50 条
  • [31] Compound classification models for recommender systems
    Schmidt-Thieme, L
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 378 - 385
  • [32] Bridging Cognitive Models and Recommender Systems
    Cecilio Angulo
    Ing. Zoe Falomir
    Davide Anguita
    Núria Agell
    Erik Cambria
    Cognitive Computation, 2020, 12 : 426 - 427
  • [33] Neural Click Models for Recommender Systems
    Shirokikh, Mikhail
    Shenbin, Ilya
    Alekseev, Anton
    Volodkevich, Anna
    Vasilev, Alexey
    Savchenko, Andrey V.
    Nikolenko, Sergey
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2553 - 2558
  • [34] Improving Implicit Recommender Systems with View Data
    Ding, Jingtao
    Yu, Guanghui
    He, Xiangnan
    Quan, Yuhan
    Li, Yong
    Chua, Tat-Seng
    Jin, Depeng
    Yu, Jiajie
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3343 - 3349
  • [35] Recommender Systems: Improving Collaborative Filtering Results
    Bobadilla, Jesus
    Serradilla, Francisco
    Gutierrez, Abraham
    2009 7TH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING, 2009, : 93 - 99
  • [36] Improving the performance of recommender systems that use critiquing
    McGinty, L
    Smyth, B
    INTELLIGENT TECHNIQUES FOR WEB PERSONALIZATION, 2005, 3169 : 114 - 132
  • [37] Improving Recommender Systems with Human-in-the-Loop
    Ustalov, Dmitry
    Fedorova, Natalia
    Pavlichenko, Nikita
    PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 708 - 709
  • [38] Bridging Cognitive Models and Recommender Systems
    Angulo, Cecilio
    Falomir, Ing. Zoe
    Anguita, Davide
    Agell, Nuria
    Cambria, Erik
    COGNITIVE COMPUTATION, 2020, 12 (02) : 426 - 427
  • [39] Adversarial Training-Based Mean Bayesian Personalized Ranking for Recommender System
    Wang, Jianfang
    Han, Pengfei
    IEEE ACCESS, 2020, 8 : 7958 - 7968
  • [40] Neural Re-ranking for Multi-stage Recommender Systems
    Liu, Weiwen
    Qin, Jiarui
    Tang, Ruiming
    Chen, Bo
    PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 698 - 699