Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems

被引:32
|
作者
Ryu, Seonghan [1 ]
Kim, Seokhwan [2 ]
Choi, Junhwi [1 ]
Yu, Hwanjo [1 ]
Lee, Gary Geunbae [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, 77 Cheongam Ro, Pohang 37673, South Korea
[2] I2R, 1 Fusionopolis Way,21-01 Connexis South Tower, Singapore 138632, Singapore
关键词
Natural language processing; Dialog systems; Out-of-domain sentence detection; Neural sentence embedding; Artificial neural networks; Distributional semantics; CLASSIFICATION;
D O I
10.1016/j.patrec.2017.01.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To ensure satisfactory user experience, dialog systems must be able to determine whether an input sentence is in-domain (ID) or out-of-domain (OOD). We assume that only ID sentences are available as training data because collecting enough OOD sentences in an unbiased way is a laborious and time-consuming job. This paper proposes a novel neural sentence embedding method that represents sentences in a low dimensional continuous vector space that emphasizes aspects that distinguish ID cases from OOD cases. We first used a large set of unlabeled text to pre-train word representations that are used to initialize neural sentence embedding. Then we used domain-category analysis as an auxiliary task to train neural sentence embedding for OOD sentence detection. After the sentence representations were learned, we used them to train an autoencoder aimed at OOD sentence detection. We evaluated our method by experimentally comparing it to the state-of-the-art methods in an eight-domain dialog system; our proposed method achieved the highest accuracy in all tests. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:26 / 32
页数:7
相关论文
共 17 条
  • [1] Out-of-Domain Detection Method Based on Sentence Distance for Dialogue Systems
    Oh, Kyo-Joong
    Lee, DongKun
    Park, Chanyong
    Choi, Ho-Jin
    Jeong, Young-Seob
    Hong, Sawook
    Kwon, Sungtae
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 673 - 676
  • [2] GAN-BASED OUT-OF-DOMAIN DETECTION USING BOTH IN-DOMAIN AND OUT-OF-DOMAIN SAMPLES
    Liang, Chaojie
    Huang, Peijie
    Lai, Wenbin
    Ruan, Ziheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7663 - 7667
  • [3] Towards Textual Out-of-Domain Detection Without In-Domain Labels
    Jin, Di
    Gao, Shuyang
    Kim, Seokhwan
    Liu, Yang
    Hakkani-Tur, Dilek
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1386 - 1395
  • [4] Accounting for Sentence Position and Legal Domain Sentence Embedding in Learning to Classify Case Sentences
    Xu, Huihui
    Savelka, Jaromir
    Ashley, Kevin D.
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 346 : 33 - 42
  • [5] Out-of-Domain Detection for Natural Language Understanding in Dialog Systems
    Zheng, Yinhe
    Chen, Guanyi
    Huang, Minlie
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1198 - 1209
  • [6] Exploiting Out-of-Vocabulary Words for Out-of-Domain Detection in Dialog Systems
    Ryu, Seonghan
    Lee, Donghyeon
    Lee, Gary Geunbae
    Kim, Kyungduk
    Noh, Hyungjong
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 165 - +
  • [7] Sentence Embedding for Neural Machine Translation Domain Adaptation
    Wang, Rui
    Finch, Andrew
    Utiyama, Masao
    Sumita, Eiichiro
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 560 - 566
  • [8] Neural sentence embedding models for semantic similarity estimation in the biomedical domain
    Blagec, Kathrin
    Xu, Hong
    Agibetov, Asan
    Samwald, Matthias
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [9] Neural sentence embedding models for semantic similarity estimation in the biomedical domain
    Kathrin Blagec
    Hong Xu
    Asan Agibetov
    Matthias Samwald
    BMC Bioinformatics, 20
  • [10] Glioma subtype classification from histopathological images using in-domain and out-of-domain transfer learning: An experimental study
    Despotovic, Vladimir
    Kim, Sang-Yoon
    Hau, Ann-Christin
    Kakoichankava, Aliaksandra
    Klamminger, Gilbert Georg
    Borgmann, Felix Bruno Kleine
    Frauenknecht, Katrin B. M.
    Mittelbronn, Michel
    Nazarov, Petr, V
    HELIYON, 2024, 10 (05)