50 items in total
- [1] Against The Achilles' Heel: A Survey on Red Teaming for Generative Models JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2025, 82 : 687 - 775
- [3] Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models COMPUTER VISION - ECCV 2024, PT LXXIII, 2025, 15131 : 174 - 189
- [4] Audio in Multimodal Applications JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2010, 58 (03) : 191 - 195
- [5] Audio in multimodal applications AES: Journal of the Audio Engineering Society, 2010, 58 (03) : 191 - 195
- [6] Audio-LLM: Activating the Capabilities of Large Language Models to Comprehend Audio Data ADVANCES IN NEURAL NETWORKS-ISNN 2024, 2024, 14827 : 133 - 142
- [7] Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding INTERSPEECH 2024, 2024 : 1135 - 1139
- [8] Training Audio Captioning Models Without Audio 2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024 : 371 - 375