共 50 条
- [31] Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking Findings of the Association for Computational Linguistics: NAACL 2024 - Findings, 2024, : 3526 - 3548
- [33] Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [34] Large Language Models Are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18417 - 18425
- [35] Large Language Models in Orthopaedic Publications: The Good, the Bad and the Ugly AMERICAN JOURNAL OF SPORTS MEDICINE, 2024, 52 (09): : 2193 - 2195
- [39] A Comprehensive Survey of Datasets for Large Language Model Evaluation 2024 5TH INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE, ICTC 2024, 2024, : 330 - 336
- [40] Large language models and rheumatology: a comparative evaluation LANCET RHEUMATOLOGY, 2023, 5 (10): : E574 - E578