共 50 条
- [1] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 8865 - 8887
- [6] DiffDefense: Defending Against Adversarial Attacks via Diffusion Models IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT II, 2023, 14234 : 430 - 442
- [7] Defending Against Adversarial Attacks via Neural Dynamic System ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [9] Defending against Whitebox Adversarial Attacks via Randomized Discretization 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 684 - 693