50 items in total
- [41] Near-Optimal Regret Bounds for Contextual Combinatorial Semi-Bandits with Linear Payoff Functions THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9791 - 9798
- [42] Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [43] Comparative Synthesis: Learning Near-Optimal Network Designs by Query PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2023, 7 (POPL) : 91 - 120
- [47] Neural-Network-based Near-optimal Control for a Class of Nonlinear Descriptor Systems with Control Constraint 2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008 : 2521 - 2526