共 50 条
- [13] CLAIR: Evaluating Image Captions with Large Language Models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13638 - 13646
- [15] Baby steps in evaluating the capacities of large language models NATURE REVIEWS PSYCHOLOGY, 2023, 2 (08): : 451 - 452
- [16] Evaluating the ability of large language models to emulate personality SCIENTIFIC REPORTS, 2025, 15 (01):
- [17] Evaluating Large Language Models on Controlled Generation Tasks 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3155 - 3168
- [18] Baby steps in evaluating the capacities of large language models Nature Reviews Psychology, 2023, 2 : 451 - 452
- [19] EconNLI: Evaluating Large Language Models on Economics Reasoning FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 982 - 994
- [20] Evaluating Large Language Models for Tax Law Reasoning INTELLIGENT SYSTEMS, BRACIS 2024, PT I, 2025, 15412 : 460 - 474