Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

Authors:

Song, Xiaoshuai [1]
He, Keqing [2]
Wang, Pei [1]
Dong, Guanting [1]
Mou, Yutao [1]
Wang, Jingang [2]
Xiang, Yunsen [2]
Cai, Xunliang [2]
Xu, Weiran [1]

Affiliations:

[1] Beijing University of Posts and Telecommunications, Beijing, People's Republic of China

[2] Meituan, Beijing, People's Republic of China

Source:

2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) | 2023

DOI: not available

Chinese Library Classification number:

TP18 [Theory of Artificial Intelligence]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Although recent studies have explored applying large language models (LLMs), represented by ChatGPT, to various downstream tasks, it remains unclear how well ChatGPT can discover and incrementally extend OOD intents. In this paper, we comprehensively evaluate ChatGPT on OOD intent discovery and GID, and then outline the strengths and weaknesses of ChatGPT. Overall, ChatGPT exhibits consistent advantages under zero-shot settings, but is still at a disadvantage compared to fine-tuned models. Going further, through a series of analytical experiments, we summarize and discuss the challenges faced by LLMs, including clustering, domain-specific understanding, and cross-domain in-context learning scenarios. Finally, we provide empirical guidance for future directions to address these challenges.

Pages: 10291-10304

Page count: 14
