Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

Cited by: 0
Authors
Song, Xiaoshuai [1]
He, Keqing [2]
Wang, Pei [1]
Dong, Guanting [1]
Mou, Yutao [1]
Wang, Jingang [2]
Xiang, Yunsen [2]
Cai, Xunliang [2]
Xu, Weiran [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Meituan, Beijing, Peoples R China
Source
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023) | 2023
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Although recent studies have explored applying large language models (LLMs), represented by ChatGPT, to various downstream tasks, the ability of ChatGPT to discover and incrementally extend OOD intents remains unclear. In this paper, we comprehensively evaluate ChatGPT on OOD intent discovery and GID, and then outline its strengths and weaknesses. Overall, ChatGPT exhibits consistent advantages under zero-shot settings, but is still at a disadvantage compared to fine-tuned models. Going further, through a series of analytical experiments, we summarize and discuss the challenges faced by LLMs, including clustering, domain-specific understanding, and cross-domain in-context learning scenarios. Finally, we provide empirical guidance for future directions to address these challenges.
Pages: 10291-10304
Page count: 14
Related papers
50 in total
  • [21] An Open-World Extension to Knowledge Graph Completion Models
    Shah, Haseeb
    Villmow, Johannes
    Ulges, Adrian
    Schwanecke, Ulrich
    Shafait, Faisal
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3044 - 3051
  • [22] Open-World Object Manipulation using Pre-Trained Vision-Language Models
    Stone, Austin
    Xiao, Ted
    Lu, Yao
    Gopalakrishnan, Keerthana
    Lee, Kuang-Huei
    Vuong, Quan
    Wohlhart, Paul
    Kirmani, Sean
    Zitkovich, Brianna
    Xia, Fei
    Finn, Chelsea
    Hausman, Karol
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [23] Artificial intelligence enabled ChatGPT and large language models in drug target discovery, drug discovery, and development
    Chakraborty, Chiranjib
    Bhattacharya, Manojit
    Lee, Sang-Soo
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2023, 33 : 866 - 868
  • [24] Guest Editorial: Special Issue on Open-World Visual Recognition
    Zhong, Zhun
    Liu, Hong
    Cui, Yin
    Satoh, Shin'ichi
    Sebe, Nicu
    Yang, Ming-Hsuan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (02) : 985 - 988
  • [25] OWNER - Toward Unsupervised Open-World Named Entity Recognition
    Genest, Pierre-Yves
    Portier, Pierre-Edouard
    Egyed-Zsigmond, Elod
    Lovisetto, Martino
    IEEE ACCESS, 2025, 13 : 50077 - 50105
  • [26] Towards Unsupervised Domain-Specific Open-World Recognition
    Alfarisy, Gusti Ahmad Fanshuri
    Malik, Owais Ahmed
    Hong, Ong Wee
    NEUROCOMPUTING, 2025, 619
  • [27] Open-World Recognition in Remote Sensing: Concepts, challenges, and opportunities
    Fang, Leyuan
    Yang, Zhen
    Ma, Tianlei
    Yue, Jun
    Xie, Weiying
    Ghamisi, Pedram
    Li, Jun
    IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2024, 12 (02) : 8 - 31
  • [28] Beyond the Known: Novel Class Discovery for Open-World Graph Learning
    Jin, Yucheng
    Xiong, Yun
    Fang, Juncheng
    Wu, Xixi
    He, Dongxiao
    Jia, Xing
    Zhao, Bingchen
    Yu, Philip S.
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT VI, DASFAA 2024, 2024, 14855 : 117 - 133
  • [29] LARGE LANGUAGE MODELS (LLMS) AND CHATGPT FOR BIOMEDICINE
    Arighi, Cecilia
    Brenner, Steven
    Lu, Zhiyong
    BIOCOMPUTING 2024, PSB 2024, 2024, : 641 - 644
  • [30] Language Models Meet World Models: Embodied Experiences Enhance Language Models
    Xiang, Jiannan
    Tao, Tianhua
    Gu, Yi
    Shu, Tianmin
    Wang, Zirui
    Yang, Zichao
    Hu, Zhiting
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,