Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology

被引:0
|
作者
National University of Defense Technology, China [1 ]
不详 [2 ]
机构
来源
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
'current - Black boxes - Cognitive psychology - Consistency theory - Decision-making mechanisms - Language model - Model security - Multisteps - Psychological explanation - Security protection
引用
收藏
相关论文
共 50 条
  • [21] SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding
    Yu, Tianyu
    Jiang, Chengyue
    Lou, Chao
    Huang, Shen
    Wang, Xiaobin
    Liu, Wei
    Cai, Jiong
    Li, Yangning
    Li, Yinghui
    Tu, Kewei
    Zheng, Hai-Tao
    Zhang, Ningyu
    Xie, Pengjun
    Huang, Fei
    Jiang, Yong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19458 - 19467
  • [22] Understanding the Potential of FPGA-based Spatial Acceleration for Large Language Model Inference
    Chen, Hongzheng
    Du, Y.
    Xiang, S.
    Yue, Z.
    Zhang, N.
    Cai, Y.
    Zhang, Z.
    Zhang, J.
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2025, 18 (01)
  • [23] Understanding Large-Language Model (LLM)-powered Human-Robot Interaction
    Kim, Callie Y.
    Lee, Christine P.
    Mutlu, Bilge
    PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024, 2024, : 371 - 380
  • [24] Large Language Model-based Tools in Language Teaching to Develop Critical Thinking and Sustainable Cognitive Structures
    Joseph, Sindhu
    RUPKATHA JOURNAL ON INTERDISCIPLINARY STUDIES IN HUMANITIES, 2023, 15 (04):
  • [25] SEMI-SUPERVISED SPOKEN LANGUAGE UNDERSTANDING VIA SELF-SUPERVISED SPEECH AND LANGUAGE MODEL PRETRAINING
    Lai, Cheng-, I
    Chuang, Yung-Sung
    Lee, Hung-Yi
    Li, Shang-Wen
    Glass, James
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7468 - 7472
  • [26] LLM-CDM: A Large Language Model Enhanced Cognitive Diagnosis for Intelligent Education
    Chen, Xin
    Zhang, Jin
    Zhou, Tong
    Zhang, Feng
    IEEE ACCESS, 2025, 13 : 47165 - 47180
  • [27] Cognitive Hearing Science: Three Memory Systems, Two Approaches, and the Ease of Language Understanding Model
    Ronnberg, Jerker
    Holmer, Emil
    Rudner, Mary
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2021, 64 (02): : 359 - 370
  • [28] Fake-GPT: Detecting Fake Image via Large Language Model
    Fan, Yuming
    Yang, Dongming
    Zhang, Jiguang
    Yan, Bang
    Zou, Yuexian
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 122 - 136
  • [29] LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model
    Wang, Dongkai
    Xuan, Shiyu
    Zhang, Shiliang
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 614 - 623
  • [30] Unsupervised Large Language Model Alignment for Information Retrieval via Contrastive Feedback
    Dong, Qian
    Liu, Yiding
    Ai, Qingyao
    Wu, Zhijing
    Li, Haitao
    Liu, Yiqun
    Wang, Shuaiqiang
    Yin, Dawei
    Ma, Shaoping
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 48 - 58