Sentence Rephrasing for Parsing Sentences with OOV Words

Cited by: 0
Authors
Huang, Hen-Hsen [1]
Chen, Huan-Yuan [1]
Yu, Chang-Sheng [1]
Chen, Hsin-Hsi [1]
Lee, Po-Ching [2]
Chen, Chun-Hsun [2]
Affiliations
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10764, Taiwan
[2] Chunghwa Telecom Co Ltd, Telecommun Labs, Taipei, Taiwan
Keywords
Sentence Rephrasing; Named Entity; Dependency Parsing
DOI
Not available
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
This paper addresses the problem of out-of-vocabulary (OOV) words, named entities in particular, in dependency parsing. OOV words in a sentence, whose word forms are unknown to the learning-based parser, may degrade parsing performance. To deal with this problem, we propose a sentence rephrasing approach that replaces each OOV word in a sentence with a popular word of the same named entity type from the training set, so that knowledge of the word forms can be used for parsing. Two strategies for selecting the replacement word are explored, a highest-frequency-based strategy and an information-retrieval-based strategy, and the Chinese Treebank 6.0 (CTB6) corpus is used to evaluate the feasibility of the proposed strategies. Experimental results show that rephrasing certain types of OOV words, such as Corporation, Organization, and Competition, improves parsing performance. This methodology can be applied to domain adaptation to deal with OOV problems.
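The highest-frequency-based strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`build_replacements`, `rephrase`), the data layout, and the English toy data are all assumptions made for illustration, and the sketch presumes that named entity types have already been assigned to the tokens by some upstream tagger.

```python
from collections import Counter, defaultdict


def build_replacements(training_entities):
    """Map each named entity type to its most frequent word form.

    training_entities: iterable of (word, ne_type) pairs taken from the
    training set (hypothetical layout, not specified in the abstract).
    """
    counts = defaultdict(Counter)
    for word, ne_type in training_entities:
        counts[ne_type][word] += 1
    return {ne_type: c.most_common(1)[0][0] for ne_type, c in counts.items()}


def rephrase(tokens, vocabulary, replacements):
    """Replace each OOV named entity with a frequent word of the same type.

    tokens: list of (word, ne_type) pairs; ne_type is None for non-entities.
    Returns the rephrased word list plus a map from position to the original
    word, so the original words can be restored after parsing.
    """
    rephrased, restored = [], {}
    for i, (word, ne_type) in enumerate(tokens):
        if word not in vocabulary and ne_type in replacements:
            rephrased.append(replacements[ne_type])
            restored[i] = word
        else:
            rephrased.append(word)
    return rephrased, restored


# Toy data, invented for illustration only.
training = [
    ("Microsoft", "Corporation"),
    ("Microsoft", "Corporation"),
    ("IBM", "Corporation"),
    ("UN", "Organization"),
]
vocab = {"Microsoft", "IBM", "UN", "acquired", "a", "startup"}
sentence = [("Zyxcorp", "Corporation"), ("acquired", None),
            ("a", None), ("startup", None)]

replacements = build_replacements(training)
words, restored = rephrase(sentence, vocab, replacements)
print(words)     # ['Microsoft', 'acquired', 'a', 'startup']
print(restored)  # {0: 'Zyxcorp'}
```

The rephrased sentence would then be passed to the parser, and the recorded positions allow the original OOV words to be put back into the resulting dependency tree. The information-retrieval-based strategy mentioned in the abstract selects the replacement differently and is not covered by this sketch.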
Pages: 2859-2862
Number of pages: 4