50 results in total
- [21] Learning to Rank in the Position Based Model with Bandit Feedback. CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, 2020: 2405-2412
- [23] Learning Multiclass Classifier Under Noisy Bandit Feedback. Advances in Knowledge Discovery and Data Mining, PAKDD 2021, Pt II, 2021, 12713: 448-460
- [24] Multi-Feedback Bandit Learning with Probabilistic Contexts. Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020: 3087-3093
- [25] Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback. Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence, 2021, 35: 10210-10217
- [26] Bandit Online Learning on Graphs via Adaptive Optimization. Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018: 2991-2997
- [27] Adaptive Quantized Online Distributed Mirror Descent Algorithm with Bandit Feedback. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40(10): 1774-1782
- [29] Learning Structured Predictors from Bandit Feedback for Interactive NLP. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Vol 1, 2016: 1610-1620
- [30] Counterfactual Risk Minimization: Learning from Logged Bandit Feedback. International Conference on Machine Learning, Vol 37, 2015, 37: 814-823