MMAD: THE FIRST-EVER COMPREHENSIVE BENCHMARK FOR MULTIMODAL LARGE LANGUAGE MODELS IN INDUSTRIAL ANOMALY DETECTION

Cited by: 0
Authors
Southern University of Science and Technology, China [1]
Unknown [2]
Unknown [3]
Unknown [4]
Affiliations
Source
Keywords
1106.6 - 913.3 Quality Assurance and Control;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Anomaly detection
Related papers
35 in total
  • [1] A comprehensive survey of large language models and multimodal large models in medicine
    Xiao, Hanguang
    Zhou, Feizhong
    Liu, Xingyue
    Liu, Tianqi
    Li, Zhipeng
    Liu, Xin
    Huang, Xiaoxuan
    INFORMATION FUSION, 2025, 117
  • [2] Semantic anomaly detection with large language models
    Amine Elhafsi
    Rohan Sinha
    Christopher Agia
    Edward Schmerling
    Issa A. D. Nesnas
    Marco Pavone
    Autonomous Robots, 2023, 47 : 1035 - 1055
  • [3] Semantic anomaly detection with large language models
    Elhafsi, Amine
    Sinha, Rohan
    Agia, Christopher
    Schmerling, Edward
    Nesnas, Issa A. D.
    Pavone, Marco
    AUTONOMOUS ROBOTS, 2023, 47 (08) : 1035 - 1055
  • [4] MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models
    Liu, Xin
    Zhu, Yichen
    Gu, Jindong
    Lan, Yunshi
    Yang, Chao
    Qiao, Yu
    COMPUTER VISION - ECCV 2024, PT LVI, 2025, 15114 : 386 - 403
  • [5] Contextual Object Detection with Multimodal Large Language Models
    Zang, Yuhang
    Li, Wei
    Han, Jun
    Zhou, Kaiyang
    Loy, Chen Change
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (02) : 825 - 843
  • [6] STAR: A First-Ever Dataset and a Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery
    Li, Yansheng
    Wang, Linlin
    Wang, Tingzhu
    Yang, Xue
    Luo, Junwei
    Wang, Qi
    Deng, Youming
    Wang, Wenbin
    Sun, Xian
    Li, Haifeng
    Dang, Bo
    Zhang, Yongjun
    Yu, Yi
    Yan, Junchi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1832 - 1849
  • [7] DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation
    Doris, Anna C.
    Grandi, Daniele
    Tomich, Ryan
    Alam, Md Ferdous
    Ataei, Mohammadmehdi
    Cheong, Hyunmin
    Ahmed, Faez
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2025, 25 (02)
  • [8] Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
    Wang, Xiyao
    Zhou, Yuhang
    Liu, Xiaoyu
    Lu, Hongjin
    Xu, Yuancheng
    He, Feihong
    Yoon, Jaehong
    Lu, Taixi
    Liu, Fuxiao
    Bertasius, Gedas
    Bansal, Mohit
    Yao, Huaxiu
    Huang, Furong
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024 : 416 - 442
  • [9] A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
    Wu, Tianhe
    Ma, Kede
    Liang, Jie
    Yang, Yujiu
    Zhang, Lei
    COMPUTER VISION - ECCV 2024, PT LXXIV, 2025, 15132 : 143 - 160
  • [10] ADAGENT: Anomaly Detection Agent With Multimodal Large Models in Adverse Environments
    Zhang, Miao
    Shen, Yiqing
    Yin, Jun
    Lu, Shuai
    Wang, Xueqian
    IEEE ACCESS, 2024, 12 : 172061 - 172074