共 50 条
- [21] TRAM: Benchmarking Temporal Reasoning for Large Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 6389 - 6415
- [22] EconNLI: Evaluating Large Language Models on Economics Reasoning FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 982 - 994
- [23] Evaluating Large Language Models for Tax Law Reasoning INTELLIGENT SYSTEMS, BRACIS 2024, PT I, 2025, 15412 : 460 - 474
- [25] Automatic Model Selection with Large Language Models for Reasoning FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 758 - 783
- [26] NEWTON: Are Large Language Models Capable of Physical Reasoning? FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 9743 - 9758
- [27] Dynamic Voting for Efficient Reasoning in Large Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 3085 - 3104
- [29] Rationality of Thought Improves Reasoning in Large Language Models KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2024, 2024, 14887 : 343 - 358
- [30] Isabelle/SACM: Computer-Assisted Assurance Cases with Integrated Formal Methods INTEGRATED FORMAL METHODS, IFM 2019, 2019, 11918 : 379 - 398