共 50 条
- [41] Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [42] Distributed No-Regret Learning in Aggregative Games With Residual Bandit Feedback IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2024, 11 (04): : 1734 - 1745
- [43] Meta-Scheduling for the Wireless Downlink through Learning with Bandit Feedback 2020 18TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT), 2020,
- [44] Targeting Optimization for Internet Advertising by Learning from Logged Bandit Feedback 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
- [48] Nearest Neighbour with Bandit Feedback ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [49] Adaptive estimation of random vectors with bandit feedback: A mean-squared error viewpoint 2023 NINTH INDIAN CONTROL CONFERENCE, ICC, 2023, : 180 - 181