Mitigating Exaggerated Safety in Large Language Models

Cited by: 0
Authors
Ray, Ruchira [1]
Bhalani, Ruchi [1]
Affiliations
[1] University of Texas at Austin, Department of Computer Science, United States
Source
Keywords
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
DOI
Not available
Chinese Library Classification (CLC) number
Subject classification number
Abstract
Related papers
50 items in total
  • [21] Leveraging Large Language Models for Enhancing Safety in Maritime Operations
    Miller, Tymoteusz
    Durlik, Irmina
    Kostecka, Ewelina
    Lobodzinska, Adrianna
    Lazuga, Kinga
    Kozlovska, Polina
    APPLIED SCIENCES-BASEL, 2025, 15 (03)
  • [22] Mitigating Grand Challenges in Life Cycle Inventory Modeling through the Applications of Large Language Models
    Tu, Qingshi
    Guo, Jing
    Li, Nan
    Qi, Jianchuan
    Xu, Ming
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2024, 58 (44) : 19595 - 19603
  • [23] Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
    Guo, Qingyan
    Wang, Rui
    Guo, Junliang
    Tan, Xu
    Bian, Jiang
    Yang, Yujiu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024 : 11453 - 11464
  • [24] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
    Leng, Sicong
    Zhang, Hang
    Chen, Guanzheng
    Li, Xin
    Lu, Shijian
    Miao, Chunyan
    Bing, Lidong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024 : 13872 - 13882
  • [25] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
    Zhou, Zhanhui
    Liu, Jie
    Dong, Zhichen
    Liu, Jiaheng
    Yang, Chao
    Ouyang, Wanli
    Qiao, Yu
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024 : 15810 - 15830
  • [26] Text Summarization in Aviation Safety: A Comparative Study of Large Language Models
    Emmons, Jonathan
    Sharma, Taneesha
    Salloum, Mariam
    Matthews, Bryan
    AIAA AVIATION FORUM AND ASCEND 2024, 2024
  • [27] LEVERAGING LARGE LANGUAGE MODELS FOR ENHANCED CONSTRUCTION SAFETY REGULATION EXTRACTION
    Tran, Si Van-Tien
    Yang, Jaehun
    Hussain, Rahat
    Khan, Nasrullah
    Kimito, Emmanuel Charles
    Pedro, Akeem
    Soltani, Mehrtash
    Lee, Ung-Kyun
    Park, Chansik
    JOURNAL OF INFORMATION TECHNOLOGY IN CONSTRUCTION, 2024, 29 : 1026 - 1038
  • [28] Revolutionizing patient safety with artificial intelligence: the potential of natural language processing and large language models
    Klang, Eyal
    Garcia-Elorrio, Ezequiel
    Zimlichman, Eyal
    INTERNATIONAL JOURNAL FOR QUALITY IN HEALTH CARE, 2023, 35 (03)
  • [29] Large Language Models are Not Models of Natural Language: They are Corpus Models
    Veres, Csaba
    IEEE ACCESS, 2022, 10 : 61970 - 61979
  • [30] Large Language Models
    Vargas, Diego Collarana
    Katsamanis, Nassos
    ERCIM NEWS, 2024, (136) : 12 - 13