Sentence Rephrasing for Parsing Sentences with OOV Words

Authors
Huang, Hen-Hsen [1]
Chen, Huan-Yuan [1]
Yu, Chang-Sheng [1]
Chen, Hsin-Hsi [1]
Lee, Po-Ching [2]
Chen, Chun-Hsun [2]
Affiliations
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10764, Taiwan
[2] Chunghwa Telecom Co Ltd, Telecommun Labs, Taipei, Taiwan
Keywords
Sentence Rephrasing; Named Entity; Dependency Parsing
DOI
Not available
Chinese Library Classification number
H0 [Linguistics]
Subject classification codes
030303; 0501; 050102
Abstract
This paper addresses the problem of out-of-vocabulary (OOV) words, named entities in particular, in dependency parsing. OOV words in a sentence, whose word forms are unknown to the learning-based parser, may degrade parsing performance. To deal with this problem, we propose a sentence rephrasing approach that replaces each OOV word in a sentence with a popular word of the same named entity type from the training set, so that knowledge of the word form can be used for parsing. Two strategies for selecting the replacement word are explored, a highest-frequency-based strategy and an information-retrieval-based strategy, and the Chinese Treebank 6.0 (CTB6) corpus is adopted to evaluate the feasibility of the proposed sentence rephrasing strategies. Experimental results show that rephrasing certain types of OOV words, such as Corporation, Organization, and Competition, improves parsing performance. This methodology can be applied to domain adaptation to deal with OOV problems.
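The highest-frequency-based strategy named in the abstract can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything here is an assumption made for illustration: the function names `build_replacements` and `rephrase` are hypothetical, tokens are assumed to arrive already tagged with a named entity type (or None), and the English sample words stand in for the Chinese CTB6 data used in the paper. The information-retrieval-based strategy is not sketched because the abstract does not describe how it selects the replacement.

```python
from collections import Counter, defaultdict


def build_replacements(training_entities):
    """Map each named entity type to its most frequent word form.

    training_entities: iterable of (word, ne_type) pairs observed in the
    training set (hypothetical input format).
    """
    counts = defaultdict(Counter)
    for word, ne_type in training_entities:
        counts[ne_type][word] += 1
    # most_common(1) returns a one-element list of (word, count)
    return {ne_type: c.most_common(1)[0][0] for ne_type, c in counts.items()}


def rephrase(tokens, vocabulary, replacements):
    """Replace each OOV named entity with a frequent in-vocabulary word.

    tokens: list of (word, ne_type) pairs; ne_type is None for words that
    are not named entities.
    vocabulary: set of word forms seen in the training set.
    replacements: mapping produced by build_replacements.
    """
    rephrased = []
    for word, ne_type in tokens:
        if word not in vocabulary and ne_type in replacements:
            # OOV word with a known entity type: swap in the popular word
            rephrased.append(replacements[ne_type])
        else:
            # In-vocabulary word, or OOV word with no usable type: keep it
            rephrased.append(word)
    return rephrased


if __name__ == "__main__":
    training = [
        ("IBM", "Corporation"),
        ("IBM", "Corporation"),
        ("Intel", "Corporation"),
        ("UN", "Organization"),
    ]
    vocab = {"IBM", "Intel", "UN", "hired", "staff"}
    reps = build_replacements(training)
    sentence = [("Acme", "Corporation"), ("hired", None), ("staff", None)]
    print(rephrase(sentence, vocab, reps))
```

In this sketch the rephrased sentence, rather than the original, would be handed to the dependency parser, so every word form the parser sees is one it encountered during training.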
Pages: 2859-2862
Number of pages: 4