Model-Agnostic Zero-Shot Intent Detection via Contrastive Transfer Learning

Cited by: 2
Authors
Maqbool, M. H. [1]
Fereidouni, Moghis [2]
Siddique, A. B. [2]
Foroosh, Hassan [1]
Affiliations
[1] Univ Cent Florida, Orlando, FL USA
[2] Univ Kentucky, Lexington, KY 40506 USA
Keywords
Intent detection; zero-shot learning; dialog systems
DOI
10.1142/S1793351X24410010
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An intent detector is a central component of any task-oriented conversational system: it identifies the user's goal by classifying natural language utterances. In recent years, research has focused on supervised intent detection models. Supervised approaches, however, cannot accommodate unseen intents that emerge after the system has been deployed, which is the more practically relevant setting known as zero-shot intent detection. Existing zero-shot learning approaches split a single dataset into seen and unseen intents for training and evaluation without taking the sensitivity of the data collection process into account; that is, humans tend to reuse vocabulary and compose sentences with similar structures. We argue that, under the typical data split procedure, the source-to-target relationship learning objective of zero-shot approaches renders the models prone to misclassification when target intents diverge from source intents. To this end, we propose INTEND, a zero-shot INTENt Detection methodology that leverages contrastive transfer learning and employs the zero-shot learning paradigm in its true sense. First, rather than partitioning the training and testing sets from the same dataset, we demonstrate that selecting them from two different datasets allows for rigorous zero-shot intent detection evaluation. Second, our contrastive learning objective is model-agnostic and encourages the system to learn a generic similarity function rather than patterns commonly encountered in the training set. We conduct extensive experimental evaluations using a range of transformer models on four public intent detection datasets for up to 150 unseen classes. Our experimental results show that INTEND consistently outperforms state-of-the-art zero-shot techniques by a substantial margin. Furthermore, our approach achieves significantly better performance than few-shot intent detection models.
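The abstract does not spell out the loss or the encoder, so the following is only a minimal sketch of the general idea of scoring an utterance against intent labels it never saw in training via a similarity function. Everything in it is an assumption made for illustration: the bag-of-words `embed` function stands in for a transformer encoder, the margin-based `contrastive_loss` is one common pairwise form and not necessarily the objective used by INTEND, and the intent names and descriptions are hypothetical.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for a transformer encoder (assumption): bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def contrastive_loss(sim, is_match, margin=0.5):
    """Pairwise contrastive loss on a similarity score (assumed form).

    Matching (utterance, intent) pairs are pulled toward similarity 1;
    non-matching pairs are penalized only when their similarity exceeds
    the margin.
    """
    if is_match:
        return (1.0 - sim) ** 2
    return max(0.0, sim - margin) ** 2


def predict_intent(utterance, intent_descriptions):
    """Zero-shot prediction: return the intent whose description is most similar."""
    u = embed(utterance)
    scores = {
        name: cosine(u, embed(desc))
        for name, desc in intent_descriptions.items()
    }
    return max(scores, key=scores.get)


# Hypothetical unseen intents, e.g. taken from a dataset other than the
# one used for training, in the spirit of the cross-dataset evaluation.
unseen_intents = {
    "book_flight": "book a flight ticket",
    "play_music": "play a song or music",
    "check_weather": "check the weather forecast",
}
predicted = predict_intent("please play some music", unseen_intents)
print(predicted)
```

Because prediction only compares an utterance with label descriptions, new intents can be added at inference time without retraining, which is what makes a learned similarity function a natural fit for the zero-shot setting.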
Pages: 5-24
Page count: 20