Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models

被引:0
|
作者
Hao, Shuyang [1 ]
Hooi, Bryan [2 ]
Liu, Jun [3 ]
Chang, Kai-Wei [4 ]
Huang, Zi [5 ]
Cai, Yujun [5 ]
机构
[1] Southeast University, China
[2] National University of Singapore, Singapore
[3] Lancaster University, United Kingdom
[4] University of California, Los Angeles, United States
[5] University of Queensland, Australia
来源
关键词
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Adversarial machine learning - Semantics
引用
收藏
相关论文
共 50 条
  • [31] VLUE: A Multi-Task Benchmark for Evaluating Vision-Language Models
    Zhou, Wangchunshu
    Zeng, Yan
    Diao, Shizhe
    Zhang, Xinsong
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [32] Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
    Ye, Shuquan
    Xie, Yujia
    Chen, Dongdong
    Xu, Yichong
    Yuan, Lu
    Zhu, Chenguang
    Liao, Jing
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2634 - 2645
  • [33] Multimodal Search on Iconclass using Vision-Language Pre-Trained Models
    Santini, Cristian
    Posthumus, Etienne
    Tietz, Tabea
    Tan, Mary Ann
    Bruns, Oleksandra
    Sack, Harald
    2023 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, JCDL, 2023, : 285 - 287
  • [34] Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models
    Hu, Yushi
    Stretcu, Otilia
    Lu, Chun-Ta
    Viswanathan, Krishnamurthy
    Hata, Kenji
    Luo, Enming
    Krishna, Ranjay
    Fuxman, Ariel
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 9590 - 9601
  • [35] Vision-language models for medical report generation and visual question answering: a review
    Hartsock, Iryna
    Rasool, Ghulam
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [36] Generating Robot Action Sequences: An Efficient Vision-Language Models with Visual Prompts
    Cai, Weihao
    Mori, Yoshiki
    Shimada, Nobutaka
    2024 INTERNATIONAL WORKSHOP ON INTELLIGENT SYSTEMS, IWIS 2024, 2024,
  • [37] Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?
    Zhang, Yichi
    Pan, Jiayi
    Zhou, Yuchen
    Pan, Rui
    Chai, Joyce
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 5718 - 5728
  • [38] Patch is enough: naturalistic adversarial patch against vision-language pre-training models
    Dehong Kong
    Siyuan Liang
    Xiaopeng Zhu
    Yuansheng Zhong
    Wenqi Ren
    Visual Intelligence, 2 (1):
  • [39] Boosting Transferability in Vision-Language Attacks via Diversification Along the Intersection Region of Adversarial Trajectory
    Gao, Sensen
    Jia, Xiaojun
    Rene, Xuhong
    Tsang, Ivor
    Guo, Qing
    COMPUTER VISION-ECCV 2024, PT LVII, 2025, 15115 : 442 - 460
  • [40] Concept-Based Analysis of Neural Networks via Vision-Language Models
    Mangal, Ravi
    Narodytska, Nina
    Gopinath, Divya
    Hu, Boyue Caroline
    Roy, Anirban
    Jha, Susmit
    Pasareanu, Corina S.
    AI VERIFICATION, SAIV 2024, 2024, 14846 : 49 - 77