共 22 条
- [1] On Logarithmic Regret for Bandits with Knapsacks 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
- [2] On Regret-Optimal Learning in Decentralized Multiplayer Multiarmed Bandits IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2018, 5 (01): : 597 - 606
- [3] Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [4] Bandits with Side Observations: Bounded vs. Logarithmic Regret UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 467 - 476
- [5] Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
- [6] Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [8] Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [9] Constant regret for sequence prediction with limited advice INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 1343 - 1386
- [10] SIC - MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32