共 50 条
- [1] Defending ChatGPT against jailbreak attack via self-reminders Nature Machine Intelligence, 2023, 5 : 1486 - 1496
- [2] SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 5587 - 5605
- [3] Defending Large Language Models Against Jailbreak Attacks via Layer-specific Editing EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024, 2024, : 5094 - 5109
- [5] Defending Against Wormhole Attack in MANET 2015 FIFTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT2015), 2015, : 674 - 678
- [10] Defending against the Pirate Evolution Attack INFORMATION SECURITY PRACTICE AND EXPERIENCE, PROCEEDINGS: 5TH INTERNATIONAL CONFERENCE, ISPEC 2009, 2009, 5451 : 147 - 158