Mitigating Exaggerated Safety in Large Language Models

被引：0

作者：

Ray, Ruchira ^{[1
]}

Bhalani, Ruchi ^{[1
]}

机构：

[1] University of Texas at Austin, Department of Computer Science, United States

来源：

arXiv |

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

引用

共 50 条

[41] Large Language Models in der WissenschaftLarge language models in science
Karl-Friedrich Kowalewski
Severin Rodler
Die Urologie, 2024, 63 (9) : 860 - 866
[42] Mitigating Hallucinations in Large Language Models via Semantic Enrichment of Prompts: Insights from BioBERT and Ontological Integration
Penkov, Stanislav
PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA, CLIB 2024, 2024, : 272 - 276
[43] The Importance of Understanding Language in Large Language Models
Youssef, Alaa
Stein, Samantha
Clapp, Justin
Magnus, David
AMERICAN JOURNAL OF BIOETHICS, 2023, 23 (10): : 6 - 7
[44] Dissociating language and thought in large language models
Mahowald, Kyle
Ivanova, Anna A.
Blank, Idan A.
Kanwisher, Nancy
Tenenbaum, Joshua B.
Fedorenko, Evelina
TRENDS IN COGNITIVE SCIENCES, 2024, 28 (06) : 517 - 540
[45] On the creativity of large language models
Franceschelli, Giorgio
Musolesi, Mirco
AI & SOCIETY, 2024,
[46] Large language models and psychiatry
Orru, Graziella
Melis, Giulia
Sartori, Giuseppe
INTERNATIONAL JOURNAL OF LAW AND PSYCHIATRY, 2025, 101
[47] Large Language Models in Cyberattacks
S. V. Lebed
D. E. Namiot
E. V. Zubareva
P. V. Khenkin
A. A. Vorobeva
D. A. Svichkar
Doklady Mathematics, 2024, 110 (Suppl 2) : S510 - S520
[48] Autoformalization with Large Language Models
Wu, Yuhuai
Jiang, Albert Q.
Li, Wenda
Rabe, Markus N.
Staats, Charles
Jamnik, Mateja
Szegedy, Christian
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[49] Imitation and Large Language Models
Boisseau, Eloise
MINDS AND MACHINES, 2024, 34 (04)
[50] The Smallness of Large Language Models
Denning, Peter J.
COMMUNICATIONS OF THE ACM, 2023, 66 (09) : 24 - 27

← 1 2 3 4 5 →