Uncertainty-Aware Reinforcement Learning for Risk-Sensitive Player Evaluation in Sports Game

Cited by: 0
Authors
Liu, Guiliang [1, 2, 3]
Luo, Yudong [1, 3]
Schulte, Oliver [4]
Poupart, Pascal [1, 3]
Affiliations
[1] Univ Waterloo, Waterloo, ON, Canada
[2] Chinese Univ Hong Kong, Shenzhen, Peoples R China
[3] Vector Inst, Toronto, ON, Canada
[4] Simon Fraser Univ, Burnaby, BC, Canada
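Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract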
A major task of sports analytics is player evaluation. Previous methods commonly measured the impact of players' actions on desirable outcomes (e.g., goals or winning) without considering the risk induced by stochastic game dynamics. In this paper, we design an uncertainty-aware Reinforcement Learning (RL) framework to learn a risk-sensitive player evaluation metric from stochastic game dynamics. To embed the risk of a player's movements into the distribution of action-values, we model their 1) aleatoric uncertainty, which represents the intrinsic stochasticity in a sports game, and 2) epistemic uncertainty, which is due to a model's insufficient knowledge regarding Out-of-Distribution (OoD) samples. We demonstrate how a distributional Bellman operator and a feature-space density model can capture these uncertainties. Based on such uncertainty estimation, we propose a Risk-sensitive Game Impact Metric (RiGIM) that measures players' performance over a season by conditioning on a specific confidence level. Empirical evaluation, based on over 9M play-by-play ice hockey and soccer events, shows that RiGIM correlates highly with standard success measures and has a consistent risk sensitivity.
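The idea of reading a value off an action-value distribution at a chosen confidence level can be illustrated with a minimal sketch. This is not the paper's definition of RiGIM: the function names `value_at_confidence` and `risk_sensitive_impact` are hypothetical, the return samples are made-up inputs, and the impact of an action is assumed here to be the change in the quantile value before and after the action.

```python
from typing import Sequence


def value_at_confidence(samples: Sequence[float], confidence: float) -> float:
    """Empirical quantile of return samples at a confidence level in [0, 1].

    Hypothetical helper: a low confidence level gives a pessimistic
    (risk-averse) value, a high one gives an optimistic value.
    """
    if not samples:
        raise ValueError("samples must be non-empty")
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must lie in [0, 1]")
    ordered = sorted(samples)
    # Linear interpolation between the two nearest order statistics.
    position = confidence * (len(ordered) - 1)
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    weight = position - lower
    return ordered[lower] * (1.0 - weight) + ordered[upper] * weight


def risk_sensitive_impact(
    before: Sequence[float], after: Sequence[float], confidence: float
) -> float:
    """Assumed impact: change in the quantile value caused by an action."""
    return value_at_confidence(after, confidence) - value_at_confidence(
        before, confidence
    )


if __name__ == "__main__":
    # Made-up samples of future goals before and after a player's action.
    before = [0.0, 0.0, 1.0, 1.0, 2.0]
    after = [0.0, 1.0, 2.0, 3.0, 4.0]
    for level in (0.25, 0.5, 1.0):
        print(level, risk_sensitive_impact(before, after, level))
```

Evaluating the same action at several confidence levels shows how one event can look modest under a risk-averse reading and valuable under a risk-seeking one, which is the kind of sensitivity the abstract refers to.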
Pages: 14