Efficient Detection of Toxic Prompts in Large Language Models

Cited by: 0
Authors
Liu, Yi [1]
Yu, Junzhe [2]
Sun, Huijia [2]
Shi, Ling [1]
Deng, Gelei [1]
Chen, Yuqi [2]
Liu, Yang [1]
Affiliations
[1] Nanyang Technological University, Singapore, Singapore
[2] ShanghaiTech University, Shanghai, China
Source
arXiv
DOI
Not available
Related papers
50 in total
  • [31] Software Vulnerability Detection using Large Language Models
    Das Purba, Moumita
    Ghosh, Arpita
    Radford, Benjamin J.
    Chu, Bill
    2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW, 2023, : 112 - 119
  • [32] A survey of large language models for cyber threat detection
    Chen, Yiren
    Cui, Mengjiao
    Wang, Ding
    Cao, Yiyang
    Yang, Peian
    Jiang, Bo
    Lu, Zhigang
    Liu, Baoxu
    COMPUTERS & SECURITY, 2024, 145
  • [33] Leveraging Large Language Models for Efficient Alert Aggregation in AIOPs
    Zha, Junjie
    Shan, Xinwen
    Lu, Jiaxin
    Zhu, Jiajia
    Liu, Zihan
    ELECTRONICS, 2024, 13 (22)
  • [34] A Method for Efficient Structured Data Generation with Large Language Models
    Hou, Zongzhi
    Zhao, Ruohan
    Li, Zhongyang
    Wang, Zheng
    Wu, Yizhen
    Gou, Junwei
    Zhu, Zhifeng
    PROCEEDINGS OF THE 2ND WORKSHOP ON LARGE GENERATIVE MODELS MEET MULTIMODAL APPLICATIONS, LGM3A 2024, 2024, : 36 - 44
  • [35] Towards efficient and effective unlearning of large language models for recommendation
    Wang, Hangyu
    Lin, Jianghao
    Chen, Bo
    Yang, Yang
    Tang, Ruiming
    Zhang, Weinan
    Yu, Yong
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (03)
  • [36] Efficient Tuning and Inference for Large Language Models on Textual Graphs
    Zhu, Yun
    Wang, Yaoke
    Shi, Haizhou
    Tang, Siliang
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 5734 - 5742
  • [37] Extensible Prompts for Language Models on Zero-shot Language Style Customization
    Ge, Tao
    Hu, Jing
    Dong, Li
    Mao, Shaoguang
    Xia, Yan
    Wang, Xun
    Chen, Si-Qing
    Wei, Furu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [38] Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models
    Cheng, Myra
    Durmus, Esin
    Jurafsky, Dan
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1504 - 1532
  • [39] Assertion Detection in Clinical Natural Language Processing using Large Language Models
    Ji, Yuelyu
    Yu, Zeshui
    Wang, Yanshan
    2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 242 - 247
  • [40] Exploring Lottery Prompts for Pre-trained Language Models
    Chen, Yulin
    Ding, Ning
    Wang, Xiaobin
    Hu, Shengding
    Zheng, Hai-Tao
    Liu, Zhiyuan
    Xie, Pengjun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15428 - 15444