50 entries in total
- [31] On Evaluating Adversarial Robustness of Large Vision-Language Models. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [33] Evaluating the Ability of Large Language Models to Generate Motivational Feedback. GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024, 2024, 14798 : 188 - 201
- [34] Establishing vocabulary tests as a benchmark for evaluating large language models. PLOS ONE, 2024, 19 (12)
- [35] Evaluating Attribute Comprehension in Large Vision-Language Models. PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 98 - 113
- [38] Evaluating Large Language Models in Cybersecurity Knowledge with Cisco Certificates. SECURE IT SYSTEMS, NORDSEC 2024, 2025, 15396 : 219 - 238
- [39] Towards evaluating and building versatile large language models for medicine. NPJ DIGITAL MEDICINE, 2025, 8 (01)