A Study Case of Automatic Archival Research and Compilation using Large Language Models

被引：0

作者：

Guo, Dongsheng ^{[1
]}

Yue, Aizhen ^{[1
]}

Ning, Fanggang ^{[2
]}

Huang, Dengrong ^{[1
]}

Chang, Bingxin ^{[1
]}

Duan, Qiang ^{[1
]}

Zhang, Lianchao ^{[2
]}

Chen, Zhaoliang ^{[2
]}

Zhang, Zheng ^{[1
]}

Zhan, Enhao ^{[1
]}

Zhang, Qilai ^{[1
]}

Jiang, Kai ^{[1
]}

Li, Rui ^{[1
]}

Zhao, Shaoxiang ^{[2
]}

Wei, Zizhong ^{[1
]}

机构：

[1] Inspur Acad Sci & Technol, Jinan, Shandong, Peoples R China

[2] Inspur Software Co Ltd, Jinan, Shandong, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH, ICKG | 2023年

关键词：

Archival research and compilation; Automatic method; Large language models; Fine-tuning;

D O I：

10.1109/ICKG59574.2023.00012

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Archival research and compilation is a specialized task that focuses on exploration, selection and processing of vast quantities of archival documents pertaining to specific subjects. Traditionally, this task has been characterized by its labor-intensive and time-consuming requirements. In recent years, the advancement of artificial intelligence has made automatic archival research and compilation tasks feasible. However, the limited availability of relevant samples imposes significant constraints on the application of deep learning models, given their high demand for sufficient data and knowledge. In this paper, we present a study case and propose an innovative method for automatic archival research and compilation, leveraging the robust knowledge base and text generation ability offered by large language models. Specifically, our method comprises three essential components: document retrieval, document summarization, and rule-based compilation. In the document summarization component, we leverage fine-tuned large language models to enhance the performance by simulation data generation and summary generation. Experimental results substantiate the effectiveness of our method. Furthermore, our method provides a general idea in using large language models, as well as a solution for addressing similar challenges in different domains.

引用

页码：52 / 59

页数：8

共 50 条

[41] Lexical Semantics with Large Language Models: A Case Study of English break
Petersen, Erika
Potts, Christopher
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 490 - 511
[42] Using large language models in psychology
Demszky, Dorottya
Yang, Diyi
Yeager, David
Bryan, Christopher
Clapper, Margarett
Chandhok, Susannah
Eichstaedt, Johannes
Hecht, Cameron
Jamieson, Jeremy
Johnson, Meghann
Jones, Michaela
Krettek-Cobb, Danielle
Lai, Leslie
Jonesmitchell, Nirel
Ong, Desmond
Dweck, Carol
Gross, James
Pennebaker, James
NATURE REVIEWS PSYCHOLOGY, 2023, 2 (11): : 688 - 701
[43] Using large language models in psychology
Dorottya Demszky
Diyi Yang
David S. Yeager
Christopher J. Bryan
Margarett Clapper
Susannah Chandhok
Johannes C. Eichstaedt
Cameron Hecht
Jeremy Jamieson
Meghann Johnson
Michaela Jones
Danielle Krettek-Cobb
Leslie Lai
Nirel JonesMitchell
Desmond C. Ong
Carol S. Dweck
James J. Gross
James W. Pennebaker
Nature Reviews Psychology, 2023, 2 : 688 - 701
[44] Using large language models wisely
不详
NATURE ASTRONOMY, 2025, 9 (03): : 315 - 315
[45] Generating Automatic Feedback on UI Mockups with Large Language Models
Duan, Peitong
Warner, Jeremy
Li, Yang
Hartmann, Bjoern
PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
[46] Using Large Language Models to Support Content Analysis: A Case Study of ChatGPT for Adverse Event Detection
Leas, Eric C.
Ayers, John W.
Desai, Nimit
Dredze, Mark
Hogarth, Michael
Smith, Davey M.
JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
[47] From statistics to deep learning: Using large language models in psychiatric research
Hua, Yining
Beam, Andrew
Chibnik, Lori B.
Torous, John
INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2025, 34 (01)
[48] Characterizing Spin in Psychiatric Clinical Research Literature Using Large Language Models
Perlis, Roy H.
JAMA NETWORK OPEN, 2025, 8 (02)
[49] Tastle: Distract Large Language Models for Automatic Jailbreak Attack
Xiao, Zeguan
Yang, Yan
Chen, Guanhua
Chen, Yun
arXiv, 1600,
[50] Leveraging Large Language Models for Automatic Smart Contract Generation
Napoli, Emanuele Antonio
Barbara, Fadi
Gatteschi, Valentina
Schifanella, Claudio
2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 701 - 710

← 1 2 3 4 5 →