MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

被引：0

作者：

Yang, Zhiyu ^{[4
]}

Zhou, Zihan ^{[5
]}

Wang, Shuo ^{[1
]}

ConG, Xin ^{[1
,2
,3
]}

Han, Xu ^{[1
,2
,3
]}

Yan, Yukun ^{[1
]}

Liu, Zhenghao ^{[6
]}

Tan, Zhixing ^{[7
]}

Liu, Pengyuan ^{[4
]}

Yu, Dong ^{[4
]}

Liu, Zhiyuan ^{[1
,2
,3
]}

Shi, Xiaodong ^{[5
]}

Sun, Maosong ^{[1
,2
,3
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Tech, Beijing, Peoples R China

[2] Tsinghua Univ, Inst AI, Beijing, Peoples R China

[3] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China

[4] Beijing Language & Culture Univ, Beijing, Peoples R China

[5] Xiamen Univ, Xiamen, Peoples R China

[6] Northeastern Univ, Shenyang, Peoples R China

[7] Zhongguancun Lab, Beijing, Peoples R China

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024 | 2024年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Scientific data visualization plays a crucial role in research by enabling the direct display of complex information and assisting researchers in identifying implicit patterns. Despite its importance, the use of Large Language Models (LLMs) for scientific data visualization remains rather unexplored. In this study, we introduce MatPlotAgent, an efficient modelagnostic LLM agent framework designed to automate scientific data visualization tasks. Leveraging the capabilities of both code LLMs and multi-modal LLMs, MatPlotAgent consists of three core modules: query understanding, code generation with iterative debugging, and a visual feedback mechanism for error correction. To address the lack of benchmarks in this field, we present MatPlotBench, a high-quality benchmark consisting of 100 human-verified test cases. Additionally, we introduce a scoring approach that utilizes GPT-4V for automatic evaluation. Experimental results demonstrate that MatPlotAgent can improve the performance of various LLMs, including both commercial and open-source models. Furthermore, the proposed evaluation method shows a strong correlation with human-annotated scores.

引用

页码：11789 / 11804

页数：16

共 50 条

[21] Identifying Citizen-Related Issues from Social Media Using LLM-Based Data Augmentation
dos Santos, Vitor Gaboardi
Santos, Guto Leoni
Lynn, Theo
Benatallah, Boualem
ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2024, 2024, 14663 : 531 - 546
[22] ConDefects: A New Dataset to Address the Data Leakage Concern for LLM-based Fault Localization and Program Repair
Wu, Yonghao
Zhang, Jie M.
Li, Zheng
Liu, Yong
arXiv, 2023,
[23] Speak From Heart: An Emotion-Guided LLM-Based Multimodal Method for Emotional Dialogue Generation
Liu, Chenxiao
Xie, Zheyong
Zhao, Sirui
Zhou, Jin
Xu, Tong
Li, Minglei
Chen, Enhong
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 533 - 542
[24] UniDE: A multi-level and low-resource framework for automatic dialogue evaluation via LLM-based data augmentation and multitask learning
Ye, Guanghui
Zhao, Huan
Zhang, Zixing
Jiang, Zhihua
INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)
[25] Usability Heuristic Evaluation of Scientific Data Analysis and Visualization Tools
Swaid, Samar
Maat, Mnsa
Krishnan, Hari
Ghoshal, Devarshi
Ramakrishnan, Lavanya
ADVANCES IN USABILITY AND USER EXPERIENCE, AHFE 2017, 2018, 607 : 471 - 482
[26] Some theoretical issues of scientific visualization as a method of data analysis
Pilyugin, Victor
Malikova, Eugeniya
Adzhiev, Valery
Pasko, Alexander
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2013, 7870 : 131 - 142
[27] GenG: An LLM-based Generic Time Series Data Generation Approach for Edge Intelligence via Cross-domain Collaboration
Zhou, Xiaomao
Jia, Qingmin
Hu, Yujiao
Xie, Renchao
Huang, Tao
Yu, E. Richard
IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
[28] A Visualization Method for Scientific Research Data of University Teachers Based on Temporal Hierarchical Layout Strategy
Yang, Qihang
Xiao, Bin
Fu, Lijun
Li, Jin
Liu, Xiaojuan
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3979 - 3985
[29] Evaluation Study for an ISO 13606 Archetype Based Medical Data Visualization Method
Kopanitsa, Georgy
JOURNAL OF MEDICAL SYSTEMS, 2015, 39 (08)
[30] Evaluation Study for an ISO 13606 Archetype Based Medical Data Visualization Method
Georgy Kopanitsa
Journal of Medical Systems, 2015, 39

← 1 2 3 4 5 →