An unsupervised & statistical word sense tagging using bilingual sources

Cited by: 0
Authors
Oliveira, F [1]
Wong, F [1]
Li, YP [1]
Affiliations
[1] Univ Macau, Fac Sci & Technol, Macao, Peoples R China
Keywords
word sense tagging; machine translation
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents an approach for choosing the correct translation of an ambiguous word in a given sentence. Unsupervised learning is applied, and a non-aligned Portuguese-Chinese bilingual corpus is used to disambiguate word senses. Relationships between words are identified by considering each word's surrounding words and their relative distances, which serves to capture syntactic relationships. All related words are then translated into the target language to determine the correct senses of ambiguous words. Selection is based on a statistical and mathematical model that assigns a score to each of the previously identified senses. Once all senses have been discovered, their semantic and syntactic information is converted into a set of rules and stored in a database for later use in the disambiguation process. Preliminary experimental results show that the proposed method improves on the baseline method by 6% in correctly assigning the corresponding translation.
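The abstract does not give the scoring model itself, so the following is only a rough sketch of the general idea it describes: context words near the ambiguous word are translated, and each candidate sense is scored by how strongly it co-occurs with those translations, with closer words counting more. The function name `score_senses`, the `1 / distance` weighting, the window size, and the toy lexicon and co-occurrence counts are all illustrative assumptions, not the authors' method or data.

```python
from collections import defaultdict


def score_senses(tokens, target_index, candidates, cooccurrence, lexicon, window=3):
    """Score candidate translations of tokens[target_index].

    Hypothetical model: each context word within `window` positions is
    translated via `lexicon`, and every candidate sense receives the
    target-language co-occurrence count weighted by 1 / distance.
    """
    scores = defaultdict(float)
    for i, word in enumerate(tokens):
        distance = abs(i - target_index)
        if distance == 0 or distance > window:
            continue  # skip the ambiguous word itself and far-away words
        weight = 1.0 / distance  # assumed weighting: nearer words count more
        for translated in lexicon.get(word, []):
            for sense in candidates:
                scores[sense] += weight * cooccurrence.get((sense, translated), 0)
    return dict(scores)


# Toy data (invented for illustration): Portuguese "banco" can mean
# "bank" (银行) or "bench" (长凳).
tokens = ["o", "banco", "emprestou", "dinheiro"]
lexicon = {"emprestou": ["借"], "dinheiro": ["钱"]}
cooccurrence = {("银行", "借"): 5, ("银行", "钱"): 8, ("长凳", "钱"): 1}

scores = score_senses(tokens, 1, ["银行", "长凳"], cooccurrence, lexicon)
best = max(scores, key=scores.get)
print(best, scores)
```

In this toy case the "bank" sense wins because both translated context words co-occur with it. A real system would derive the counts from the target-language side of the corpus and would need the rule-storage step the abstract mentions, which is not sketched here.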
Pages: 3749-3754
Number of pages: 6
Related papers
50 in total
  • [21] Unsupervised Bilingual Word Embedding Agreement for Unsupervised Neural Machine Translation
    Sun, Haipeng
    Wang, Rui
    Chen, Kehai
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Tiejun
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1235 - 1245
  • [22] Unsupervised word sense induction using rival penalized competitive learning
    Huang, Yanzhou
    Shi, Xiaodong
    Su, Jinsong
    Chen, Yidong
    Huang, Guimin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 41 : 166 - 174
  • [23] Word Sense Disambiguation in Bengali language using unsupervised methodology with modifications
    Pal, Alok Ranjan
    Saha, Diganta
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2019, 44 (07):
  • [24] Word Sense Disambiguation in Bengali language using unsupervised methodology with modifications
    Alok Ranjan Pal
    Diganta Saha
    Sādhanā, 2019, 44
  • [25] Improving Subjectivity Detection using Unsupervised Subjectivity Word Sense Disambiguation
    Ortega, Reynier
    Fonseca, Adrian
    Gutierrez, Yoan
    Montoyo, Andres
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2013, (51): : 179 - 186
  • [26] Improved Unsupervised Statistical Machine Translation via Unsupervised Word Sense Disambiguation for a Low-Resource and Indic Languages
    Saxena, Shefali
    Chaurasia, Uttkarsh
    Bansal, Nitin
    Daniel, Philemon
    IETE JOURNAL OF RESEARCH, 2023, 69 (12) : 8848 - 8858
  • [27] Unsupervised Word Sense Disambiguation based on Word Embedding and Collocation
    Han, Shangzhuang
    Shirai, Kiyoaki
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 1218 - 1225
  • [28] Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities
    Remus, Steffen
    Biemann, Chris
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1035 - 1041
  • [29] Unsupervised graph-based word sense disambiguation using measures of word semantic similarity
    Sinha, Ravi
    Mihalcea, Rada
    ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 363 - +
  • [30] Using a Bilingual Context in Word-Based Statistical Machine Translation
    Schmidt, Christoph
    Vilar, David
    Ney, Hermann
    PATTERN RECOGNITION IN INFORMATION SYSTEMS, PROCEEDINGS, 2008, : 144 - 153