Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering

Cited by: 0
Authors
Addlesee, Angus [1 ]
Sieinska, Weronika [1 ]
Gunson, Nancie [1 ]
Garcia, Daniel Hernandez [1 ]
Dondrup, Christian [1 ]
Lemon, Oliver [1 ,2 ,3 ]
Affiliations
[1] Heriot-Watt Univ, Edinburgh, Midlothian, Scotland
[2] Alana AI, London, England
[3] Edinburgh Ctr Robot, Edinburgh, Midlothian, Scotland
Keywords
DOI
Not available
CLC classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper evaluates the extent to which current Large Language Models (LLMs) can capture task-oriented multi-party conversations (MPCs). We recorded and transcribed 29 MPCs between patients, their companions, and a social robot in a hospital, and annotated this corpus for multi-party goal tracking and intent-slot recognition. In MPCs, people share goals, answer each other's goals, and provide other people's goals - none of which occurs in dyadic interactions. To understand user goals in MPCs, we compared three methods in zero-shot and few-shot settings, to determine which approach can complete this novel task with limited data: we fine-tuned T5, created pre-training tasks to train DialogLM using LED, and employed prompt engineering techniques with GPT-3.5-turbo. GPT-3.5-turbo significantly outperformed the others in the few-shot setting. The 'reasoning' style prompt, given 7% of the corpus as example annotated conversations, was the best-performing method: it correctly annotated 62.32% of the goal-tracking MPCs and 69.57% of the intent-slot recognition MPCs. A 'story' style prompt increased model hallucination, which could be detrimental if deployed in safety-critical settings. We conclude that multi-party conversations still challenge state-of-the-art LLMs.
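The record does not reproduce the paper's prompts, so the snippet below is a minimal, non-authoritative sketch of how a few-shot 'reasoning'-style annotation prompt of the kind described in the abstract might be sent to GPT-3.5-turbo via the OpenAI chat API. The transcript, speaker labels, annotation schema, and the annotate_mpc helper are all invented for illustration and are not taken from the paper's corpus or prompt set.

```python
# Illustrative sketch only: the transcript, speakers, and annotation schema are
# invented; the paper's actual prompts, corpus, and label set are not reproduced here.
from openai import OpenAI  # requires the openai Python package (>=1.0) and an API key

client = OpenAI()

# One hand-written few-shot example. The paper reports using roughly 7% of the
# corpus as annotated example conversations; a real prompt would include several.
FEW_SHOT_EXAMPLE = """\
Conversation:
  PATIENT: Where do I go for my X-ray?
  COMPANION: And she will need a wheelchair to get there.
Reasoning: The patient states a navigation goal (find the X-ray department).
The companion provides an additional goal on the patient's behalf (obtain a wheelchair).
Annotation:
  goal(PATIENT, navigate, destination=x_ray_department)
  goal(COMPANION for PATIENT, request_equipment, item=wheelchair)
"""

def annotate_mpc(transcript: str) -> str:
    """Ask GPT-3.5-turbo to annotate goals and intent slots in a multi-party transcript."""
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        temperature=0,  # keep the annotation output as deterministic as possible
        messages=[
            {"role": "system",
             "content": "You annotate goals and intent slots in multi-party "
                        "patient-companion-robot conversations. First write your "
                        "reasoning, then the annotation lines."},
            {"role": "user", "content": FEW_SHOT_EXAMPLE},
            {"role": "user", "content": f"Conversation:\n{transcript}\nReasoning:"},
        ],
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    print(annotate_mpc("PATIENT: I need to find the pharmacy.\n"
                       "COMPANION: He also wants to know when visiting hours end."))
```

In such a setup, the few-shot examples would be drawn from the annotated corpus, and the model's free-text reasoning is kept alongside the structured annotation lines so that hallucinated goals can be spotted before any downstream use.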
Pages: 229-241
Page count: 13