共 50 条
- [1] On Logarithmic Regret for Bandits with Knapsacks 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
- [2] Logarithmic Regret from Sublinear Hints ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [3] Neural Contextual Bandits without Regret INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 240 - 278
- [4] Breaking the √T Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [5] Sublinear Optimal Policy Value Estimation in Contextual Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4377 - 4386
- [6] Constant or Logarithmic Regret in Asynchronous Multiplayer Bandits with Limited Communication INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [7] Bandits with Side Observations: Bounded vs. Logarithmic Regret UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 467 - 476
- [8] Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
- [9] Constrained Online Learning in Networks with Sublinear Regret and Fit 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5486 - 5493
- [10] Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,