Pointing the Unknown Words

被引:152
|
作者
Gulcehre, Caglar [1 ]
Ahn, Sungjin [1 ]
Nallapati, Ramesh [2 ]
Zhou, Bowen [2 ]
Bengio, Yoshua [1 ]
机构
[1] Univ Montreal, Montreal, PQ, Canada
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.18653/v1/p16-1014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of rare and unknown words is an important issue that can potentially effect the performance of many NLP systems, including traditional count-based and deep learning models. We propose a novel way to deal with the rare and unseen words for the neural network models using attention. Our model uses two softmax layers in order to predict the next word in conditional language models: one predicts the location of a word in the source sentence, and the other predicts a word in the shortlist vocabulary. At each timestep, the decision of which softmax layer to use is adaptively made by an MLP which is conditioned on the context. We motivate this work from a psychological evidence that humans naturally have a tendency to point towards objects in the context or the environment when the name of an object is not known. Using our proposed model, we observe improvements on two tasks, neural machine translation on the Europarl English to French parallel corpora and text summarization on the Gigaword dataset.
引用
收藏
页码:140 / 149
页数:10
相关论文
共 50 条
  • [1] UNDERSTANDING OF UNKNOWN WORDS
    DOBLAEV, LP
    VOPROSY PSIKHOLOGII, 1969, (04) : 85 - 95
  • [2] COMPREHENSION OF UNKNOWN WORDS
    DOBLAEV, LP
    SOVIET PSYCHOLOGY-USSR, 1971, 9 (03): : 250 - &
  • [3] IMPLICIT KNOWLEDGE OF UNKNOWN WORDS
    DURSO, FT
    SHORE, WJ
    BAILEY, KA
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1989, 27 (06) : 488 - 488
  • [4] Modelling unknown words in spontaneous speech
    Kemp, T
    Jusek, A
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 530 - 533
  • [5] "Pointing at Shadows": Wallace, Wittgenstein, and the Problem of Putting Pain into Words
    Chase, Greg
    CRITIQUE-STUDIES IN CONTEMPORARY FICTION, 2023, 64 (02) : 182 - 194
  • [6] The unknown gospel: A source for the words of Jesus ("Q")
    Cazelais, S
    STUDIES IN RELIGION-SCIENCES RELIGIEUSES, 2002, 31 (3-4) : 402 - 404
  • [7] Dealing with unknown words in statistical machine translation
    Silva, Joao
    Coheur, Luisa
    Costa, Angela
    Trancoso, Isabel
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3977 - 3981
  • [8] Chinese Unknown Words Extraction for Incomplete Sentences
    Chen, Yi-Hui
    Lu, Eric Jui-Lin
    Huang, Jeng-Jie
    International Journal of Network Security, 2022, 24 (04) : 755 - 764
  • [9] The Devil is in the Details: Parsing Unknown German Words
    Dakota, Daniel
    LANGUAGE TECHNOLOGIES FOR THE CHALLENGES OF THE DIGITAL AGE, GSCL 2017, 2018, 10713 : 23 - 39
  • [10] An iterative method for extracting Chinese unknown words
    He, S
    Zhu, J
    CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (04): : 461 - 464