共 13 条
- [1] Online Continuous Submodular Maximization: From Full-Information to Bandit Feedback ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [2] Online Learning for Non-monotone DR-Submodular Maximization: From Full Information to Bandit Feedback INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
- [3] Counterfactual Risk Minimization: Learning from Logged Bandit Feedback INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 814 - 823
- [7] Learning-augmented Online Minimization of Age of Information and Transmission Costs IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
- [8] Learning from Delayed Semi-Bandit Feedback under Strong Fairness Guarantees IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022), 2022, : 1379 - 1388
- [9] A Markov Game of Age of Information From Strategic Sources With Full Online Information ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 76 - 81