Improving Training Stability for Multitask Ranking Models in Recommender Systems

被引:4
|
作者
Tang, Jiaxi [1 ]
Drori, Yoel [2 ]
Chang, Daryl [3 ]
Sathiamoorthy, Maheswaran [1 ]
Gilmer, Justin [1 ]
Wei, Li [3 ]
Yi, Xinyang [1 ]
Hong, Lichan [1 ]
Chi, Ed H. [1 ]
机构
[1] Google Deepmind, Mountain View, CA 94043 USA
[2] Google Res, Tel Aviv, Israel
[3] Google Inc, Mountain View, CA USA
关键词
Recommender System; Optimization; Training Stability;
D O I
10.1145/3580305.3599846
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recommender systems play an important role in many content platforms. While most recommendation research is dedicated to designing better models to improve user experience, we found that research on stabilizing the training for such models is severely under-explored. As recommendation models become larger and more sophisticated, they are more susceptible to training instability issues, i.e., loss divergence, which can make the model unusable, waste significant resources and block model developments. In this paper, we share our findings and best practices we learned for improving the training stability of a real-world multitask ranking model for YouTube recommendations. We show some properties of the model that lead to unstable training and conjecture on the causes. Furthermore, based on our observations of training dynamics near the point of training instability, we hypothesize why existing solutions would fail, and propose a new algorithm to mitigate the limitations of existing solutions. Our experiments on YouTube production dataset show the proposed algorithm can significantly improve training stability while not compromising convergence, comparing with several commonly used baseline methods. We open source our implementation at https://github.com/tensorflow/recommenders/ tree/main/tensorflow_recommenders/experimental/optimizers/clippy_adagrad.py.
引用
收藏
页码:4882 / 4893
页数:12
相关论文
共 50 条
  • [21] Multi-feedback Pairwise Ranking via Adversarial Training for Recommender
    WANG Jianfang
    FU Zhiyuan
    NIU Mingxin
    ZHANG Pengbo
    ZHANG Qiuling
    ChineseJournalofElectronics, 2020, 29 (04) : 615 - 622
  • [22] Multi-feedback Pairwise Ranking via Adversarial Training for Recommender
    Wang, Jianfang
    Fu, Zhiyuan
    Niu, Mingxin
    Zhang, Pengbo
    Zhang, Qiuling
    CHINESE JOURNAL OF ELECTRONICS, 2020, 29 (04) : 615 - 622
  • [23] Directional Adversarial Training for Recommender Systems
    Xu, Yangjun
    Chen, Liang
    Xie, Fenfang
    Hu, Weibo
    Zhu, Jieming
    Chen, Chuan
    Zheng, Zibin
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 553 - 560
  • [24] Sustainable transparency on recommender systems: Bayesian ranking of images for explainability
    Paz-Ruza, Jorge
    Alonso-Betanzos, Amparo
    Guijarro-Berdinas, Bertha
    Cancela, Brais
    Eiras-Franco, Carlos
    INFORMATION FUSION, 2024, 111
  • [25] TPR: Text-aware Preference Ranking for Recommender Systems
    Chuang, Yu-Neng
    Chen, Chih-Ming
    Wang, Chuan-Ju
    Tsai, Ming-Feng
    Fang, Yuan
    Lim, Ee-Peng
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 215 - 224
  • [26] Improving Implicit Recommender Systems with Auxiliary Data
    Ding, Jingtao
    Yu, Guanghui
    Li, Yong
    He, Xiangnan
    Jin, Depeng
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 38 (01)
  • [27] Improving expertise recommender systems by odds ratio
    Ru, Zhao
    Guo, Jun
    Xu, Weiran
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 1 - 9
  • [28] Improving Recommender Systems: User Roles and Lifecycles
    Nguyen, Tien T.
    PROCEEDINGS OF THE 8TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'14), 2014, : 417 - 420
  • [29] Improving Recommender Systems with Adaptive Conversational Strategies
    Mahmood, Tariq
    Ricci, Francesco
    20TH ACM CONFERENCE ON HYPERTEXT AND HYPERMEDIA (HYPERTEXT 2009), 2009, : 73 - 82
  • [30] Comparing Preference Models in Recommender Systems
    Liu, Juntao
    Deng, Dewei
    Wu, Hanbao
    Wu, Caihua
    2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL I, 2015, : 210 - 213