MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

被引:0
|
作者
Yang, Zhiyu [4 ]
Zhou, Zihan [5 ]
Wang, Shuo [1 ]
ConG, Xin [1 ,2 ,3 ]
Han, Xu [1 ,2 ,3 ]
Yan, Yukun [1 ]
Liu, Zhenghao [6 ]
Tan, Zhixing [7 ]
Liu, Pengyuan [4 ]
Yu, Dong [4 ]
Liu, Zhiyuan [1 ,2 ,3 ]
Shi, Xiaodong [5 ]
Sun, Maosong [1 ,2 ,3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Tech, Beijing, Peoples R China
[2] Tsinghua Univ, Inst AI, Beijing, Peoples R China
[3] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[4] Beijing Language & Culture Univ, Beijing, Peoples R China
[5] Xiamen Univ, Xiamen, Peoples R China
[6] Northeastern Univ, Shenyang, Peoples R China
[7] Zhongguancun Lab, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Scientific data visualization plays a crucial role in research by enabling the direct display of complex information and assisting researchers in identifying implicit patterns. Despite its importance, the use of Large Language Models (LLMs) for scientific data visualization remains rather unexplored. In this study, we introduce MatPlotAgent, an efficient modelagnostic LLM agent framework designed to automate scientific data visualization tasks. Leveraging the capabilities of both code LLMs and multi-modal LLMs, MatPlotAgent consists of three core modules: query understanding, code generation with iterative debugging, and a visual feedback mechanism for error correction. To address the lack of benchmarks in this field, we present MatPlotBench, a high-quality benchmark consisting of 100 human-verified test cases. Additionally, we introduce a scoring approach that utilizes GPT-4V for automatic evaluation. Experimental results demonstrate that MatPlotAgent can improve the performance of various LLMs, including both commercial and open-source models. Furthermore, the proposed evaluation method shows a strong correlation with human-annotated scores.
引用
收藏
页码:11789 / 11804
页数:16
相关论文
共 50 条
  • [31] An Evaluation Method of Visualization Using Visual Momentum Based on Eye-Tracking Data
    Zhou, Xiaozhou
    Xue, Chengqi
    Zhou, Lei
    Niu, Yafeng
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (05)
  • [32] Evaluation of the potential of LLM-based generative AIs in nutrition education: a comparative study of ChatGPT and Bing for Japanese registered dietitian licensure exam preparation
    Kosai, M.
    Nagamori, Y.
    Kawai, Y.
    Marumo, H.
    Shibuya, M.
    Negishi, T.
    Sawai, A.
    Miyamoto, L.
    DIABETOLOGIA, 2024, 67 : S396 - S396
  • [33] Video formatting method of near-space data for Web scientific visualization
    Tan, Jian
    Wang, Shenghua
    Guo, Changshun
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2020, 46 (04): : 712 - 723
  • [34] Two Level Parallel Data Read Acceleration Method for Visualization in Scientific Computing
    Shi L.
    Xiao L.
    Cao L.
    Mo Z.
    1600, Science Press (54): : 844 - 854
  • [35] SCIENTIFIC PROCESSING METHOD FOR COURSE EVALUATION DATA IN UlNIVERSITIES
    Ai, Dongmei
    Ning, Xiaojun
    Yang, Bo
    Li, Jianing
    2011 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS, 2011, : 574 - 576
  • [36] VisUAM: A web-based tool for data visualization in scientific research
    Perez-Espinosa, Adriana
    Aguilar-Cornejo, Manuel
    Dagdug, Leonardo
    Quiroz-Fabian, Jose Luis
    Roman-Alonso, Graciela
    Castro-Garcia, Miguel A.
    SOFTWAREX, 2024, 27
  • [37] A cognitive data visualization method based on hyper surface
    He, Qing
    Zhao, Xiurong
    Shi, Zhongzhi
    PROCEEDINGS OF THE SIXTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, 2007, : 85 - +
  • [38] DEA based Hierarchical Structure Evaluation and Visualization Method
    Inoue, Kazushige
    Ichinotsubo, Takeo
    Aoki, Shingo
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1701 - 1704
  • [39] Application of Big Data Tourism Management Based on Scientific Computing Visualization Algorithms
    Shi, Qingbo
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (09) : 603 - 610
  • [40] Scientific Data Visualization via Hybrid Model based on Fractal Spline Interpolation
    Arooj, Tayba
    Hussain, Farsia
    Hussain, Malik Zawwar
    PUNJAB UNIVERSITY JOURNAL OF MATHEMATICS, 2019, 51 (09): : 123 - 136