Budgeted Recommendation with Delayed Feedback

被引:0
|
作者
Liu, Kweiguu [1 ]
Maghsudi, Setareh [2 ]
Yokoo, Makoto [1 ]
机构
[1] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka 8190395, Japan
[2] Ruhr Univ Bochum, Fac Elect Engn & Informat Technol, D-44801 Bochum, Germany
来源
GOOD PRACTICES AND NEW PERSPECTIVES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 3, WORLDCIST 2024 | 2024年 / 987卷
关键词
Budget Constraints; Delayed Feedback; Online Learning; Resource Allocation;
D O I
10.1007/978-3-031-60221-4_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a conventional contextual multi-armed bandit problem, the feedback (or reward) is immediately observable after an action. Nevertheless, delayed feedback arises in numerous real-life situations and is particularly crucial in time-sensitive applications. The exploration-exploitation dilemma becomes particularly challenging under such conditions, as it couples with the interplay between delays and limited resources. Besides, a limited budget often aggravates the problem by restricting the exploration potential. A motivating example is the distribution of medical supplies at the early stage of COVID-19. The delayed feedback of testing results, thus insufficient information for learning, degraded the efficiency of resource allocation. Motivated by such applications, we study the effect of delayed feedback on constrained contextual bandits. We develop a decision-making policy, delay-oriented resource allocation with learning (DORAL), to optimize the resource expenditure in a contextual multi-armed bandit problem with arm-dependent delayed feedback.
引用
收藏
页码:202 / 213
页数:12
相关论文
共 50 条
  • [41] FLUCTUATIONS IN OSCILLATORS WITH DELAYED FEEDBACK
    DEMIN, NV
    YAKIMOV, AV
    RADIOTEKHNIKA I ELEKTRONIKA, 1993, 38 (09): : 1615 - 1618
  • [42] CHAOS INDUCED BY DELAYED FEEDBACK
    ROESKY, PW
    DOUMBOUYA, SI
    SCHNEIDER, FW
    JOURNAL OF PHYSICAL CHEMISTRY, 1993, 97 (02): : 398 - 402
  • [43] DELAYED AUDITORY FEEDBACK WITH DYSLEXICS
    JACK, WH
    HEBERT, BH
    JOURNAL OF EDUCATIONAL RESEARCH, 1975, 68 (09): : 338 - 340
  • [44] DELAYED AUDITORY FEEDBACK AND STUTTERING
    AOKI, T
    JAPANESE JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1974, 22 (03): : 186 - 191
  • [45] DELAYED AUDITORY FEEDBACK AND BREATHING
    DANCE, FEX
    CENTRAL STATES SPEECH JOURNAL, 1968, 19 (01): : 52 - 54
  • [46] Fluctuations in oscillators with delayed feedback
    Demin, N.V.
    Yakimov, A.V.
    Journal of Communications Technology and Electronics, 1994, 39 (01)
  • [47] DELAYED SPEECH FEEDBACK AND AGE
    SMITH, KU
    TIERNEY, D
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1971, 14 (01): : 214 - &
  • [48] On budgeted optimization problems
    Juttner, Alpar
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2006, 20 (04) : 880 - 892
  • [49] Budgeted matching and budgeted matroid intersection via the gasoline puzzle
    André Berger
    Vincenzo Bonifaci
    Fabrizio Grandoni
    Guido Schäfer
    Mathematical Programming, 2011, 128 : 355 - 372
  • [50] Budgeted matching and budgeted matroid intersection via the gasoline puzzle
    Department of Quantitative Economics, Maastricht University, Maastricht, Netherlands
    不详
    不详
    不详
    不详
    不详
    Math. Program., 1-2 (355-372):