共 50 条
- [22] Learning solution similarity in preference-based CBR Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8765 : 17 - 31
- [23] Versatile Dueling Bandits: Best-of-both World Analyses for Online Learning from Relative Preferences INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19011 - 19026
- [24] Inverse Preference Learning: Preference-based RL without a Reward Function ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [25] Online Certification of Preference-Based Fairness for Personalized Recommender Systems THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6532 - 6540
- [26] Online Rank Elicitation for Plackett-Luce: A Dueling Bandits Approach ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [27] A Generalized Acquisition Function for Preference-based Reward Learning 2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2814 - 2821
- [28] Model-Free Preference-Based Reinforcement Learning THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2222 - 2228
- [29] Embedding Learning for Preference-based Speech Quality Assessment INTERSPEECH 2024, 2024, : 2685 - 2689
- [30] Learning to Identify Top Elo Ratings: A Dueling Bandits Approach THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8797 - 8805