Efficient Detection of Toxic Prompts in Large Language Models

Cited by: 0
Authors
Liu, Yi [1]
Yu, Junzhe [2]
Sun, Huijia [2]
Shi, Ling [1]
Deng, Gelei [1]
Chen, Yuqi [2]
Liu, Yang [1]
Affiliations
[1] Nanyang Technological University, Singapore, Singapore
[2] ShanghaiTech University, Shanghai, China
Source
arXiv | 1600
DOI
Not available
Related papers
50 in total
  • [1] Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models
    Zhang, Jiang
    Wu, Qiong
    Xu, Yiming
    Cao, Cheng
    Du, Zheng
    Psounis, Konstantinos
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024 : 21779 - 21787
  • [2] How to write effective prompts for large language models
    Lin, Zhicheng
    NATURE HUMAN BEHAVIOUR, 2024, 8 (4) : 611 - 615
  • [3] How to write effective prompts for large language models
    Zhicheng Lin
    Nature Human Behaviour, 2024, 8 : 611 - 615
  • [4] Can Large Language Models Truly Understand Prompts? A Case Study with Negated Prompts
    Jang, Joel
    Ye, Seonghyeon
    Seo, Minjoon
    TRANSFER LEARNING FOR NATURAL LANGUAGE PROCESSING WORKSHOP, VOL 203, 2022, 203 : 52 - 62
  • [5] LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
    Jiang, Huiqiang
    Wu, Qianhui
    Lin, Chin-Yew
    Yang, Yuqing
    Qiu, Lili
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023 : 13358 - 13376
  • [6] GenKP: generative knowledge prompts for enhancing large language models
    Li, Xinbai
    Peng, Shaowen
    Yada, Shuntaro
    Wakamiya, Shoko
    Aramaki, Eiji
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [7] On the reliability of Large Language Models to misinformed and demographically informed prompts
    Aremu, Toluwani
    Akinwehinmi, Oluwakemi
    Nwagu, Chukwuemeka
    Ahmed, Syed Ishtiaque
    Orji, Rita
    Del Amo, Pedro Arnau
    El Saddik, Abdulmotaleb
    AI MAGAZINE, 2025, 46 (01)
  • [8] RelayAttention for Efficient Large Language Model Serving with Long System Prompts
    Zhu, Lei
    Wang, Xinjiang
    Zhang, Wayne
    Lau, Rynson
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024 : 4945 - 4957
  • [9] Predictive Prompts with Joint Training of Large Language Models for Explainable Recommendation
    Lin, Ching-Sheng
    Tsai, Chung-Nan
    Su, Shao-Tang
    Jwo, Jung-Sing
    Lee, Cheng-Hsiung
    Wang, Xin
    MATHEMATICS, 2023, 11 (20)
  • [10] Systematic synthesis of design prompts for large language models in conceptual design
    Tian, Yu
    Liu, Ang
    Dai, Yun
    Nagato, Keisuke
    Nakao, Masayuki
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2024, 73 (01) : 85 - 88