Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology

被引:0
|
作者
National University of Defense Technology, China [1 ]
不详 [2 ]
机构
来源
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
'current - Black boxes - Cognitive psychology - Consistency theory - Decision-making mechanisms - Language model - Model security - Multisteps - Psychological explanation - Security protection
引用
收藏
相关论文
共 50 条
  • [31] GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree
    Shao, Zirui
    Gao, Feiyu
    Qi, Zhongda
    Xing, Hangdi
    Bu, Jiajun
    Yu, Zhi
    Zheng, Qi
    Liu, Xiaozhong
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 6132 - 6145
  • [32] Understanding Before Recommendation: Semantic Aspect-Aware Review Exploitation via Large Language Models
    Liu, Fan
    Liu, Yaqi
    Chen, Huilin
    Cheng, Zhiyong
    Nie, Liqiang
    Kankanhalli, Mohan
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2025, 43 (02)
  • [33] LARR: Large Language Model Aided Real-time Scene Recommendation with Semantic Understanding
    Wan, Zhizhong
    Yin, Bin
    Xie, Junjie
    Jiang, Fei
    Li, Xiang
    Lin, Wei
    PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 23 - 32
  • [34] SkyEyeGPT: Unifying remote sensing vision-language tasks via instruction tuning with large language model
    Zhan, Yang
    Xiong, Zhitong
    Yuan, Yuan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 221 : 64 - 77
  • [35] From jargon to clarity: Improving the readability of foot and ankle radiology reports with an artificial intelligence large language model
    Butler, James J.
    Harrington, Michael C.
    Tong, Yixuan
    Rosenbaum, Andrew J.
    Samsonov, Alan P.
    Walls, Raymond J.
    Kennedy, John G.
    FOOT AND ANKLE SURGERY, 2024, 30 (04) : 331 - 337
  • [36] Multi-Intent Inline Code Comment Generation via Large Language Model
    Zhang, Xiaowei
    Chen, Zhifei
    Cao, Yulu
    Chen, Lin
    Zhou, Yuming
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2024, 34 (06) : 845 - 868
  • [37] GreenLLM: Towards Efficient Large Language Model via Energy-aware Pruning
    Tian, Chunlin
    Qin, Xinpeng
    Li, Li
    2024 IEEE/ACM 32ND INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE, IWQOS, 2024,
  • [38] Chinese Text Open Domain Tag Generation Method via Large Language Model
    He, Chunhui
    Ge, Bin
    Zhang, Chong
    2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 183 - 188
  • [39] Explainable automated debugging via large language model-driven scientific debugging
    Kang, Sungmin
    Chen, Bei
    Yoo, Shin
    Lou, Jian-Guang
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (02)
  • [40] VulLibGen: Generating Names of Vulnerability-Affected Packages via a Large Language Model
    Chen, Tianyu
    Li, Lin
    Zhu, Liuchuan
    Li, Zongyang
    Liu, Xueqing
    Liang, Guangtai
    Wang, Qianxiang
    Xie, Tao
    arXiv, 2023,