Multi-engine collaborative bootstrapping for word sense disambiguation

被引:1
|
作者
Duan, Jianyong [1 ]
Lu, Ruzhan
Li, Xuening
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[2] So Yangtze Univ, Wuxi 214036, Peoples R China
基金
中国国家自然科学基金;
关键词
bootstrapping algorithms; machine learning; word sense disambiguation;
D O I
10.1142/S0218213007003369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a new word sense disambiguation method called Multi- engine Collaborative Bootstrapping ( MCB) that combines different types of corpora and also uses two languages for bootstrapping. MCB uses the bilingual bootstrapping as its core algorithm that leading to incremental knowledge acquisition. The EM model is applied to train parameters in a base learner. The feature translation model is improved by semantic correlation estimation. In addition we use multi- engine selection to produce qualified starting seeds from parallel corpora and monolingual corpora. Those seeds that are generated through unsupervised machine learning approaches can also ensure bootstrapping effectiveness in contrast with manually selected seeds in spite of their different selection mechanisms. Experimental results prove the effectiveness of MCB. Some factors including feature space and starting seed number are concerned involved in our experiments because the EM algorithm is sensitive to starting values. Limitation of resources is also a concern.
引用
收藏
页码:465 / 482
页数:18
相关论文
共 50 条
  • [1] Word sense disambiguation using multi-engine collaborative bootstrapping
    Duan, JY
    Wu, WL
    Hu, Y
    Chen, YQ
    Lu, RZ
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 20 - 25
  • [2] Bootstrapping for Chinese word sense disambiguation based on grouping strategy
    Li, Lishuang
    Shang, Min
    Huang, Degen
    Wang, Ke
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 73 - 78
  • [3] Bootstrapping Word Sense Disambiguation using dynamic Web knowledge
    Wang, Yuanyong
    Hoffmann, Achim
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 1150 - 1154
  • [4] Multi-sense embeddings through a word sense disambiguation process
    Ruas, Terry
    Grosky, William
    Aizawa, Akiko
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 136 : 288 - 303
  • [5] Sense Space for Word Sense Disambiguation
    Kang, Myung Yun
    Min, Tae Hong
    Lee, Jae Sung
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 669 - 672
  • [6] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [7] Word translation disambiguation using bilingual bootstrapping
    Li, C
    Li, H
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 343 - 351
  • [8] Word translation disambiguation using bilingual bootstrapping
    Li, H
    Li, C
    COMPUTATIONAL LINGUISTICS, 2004, 30 (01) : 1 - 22
  • [9] Word sense disambiguation methods
    Turdakov, D. Yu.
    PROGRAMMING AND COMPUTER SOFTWARE, 2010, 36 (06) : 309 - 326
  • [10] Practice of Word Sense Disambiguation
    Sieminski, Andrzej
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2018, PT I, 2018, 10751 : 159 - 169