Multi-engine collaborative bootstrapping for word sense disambiguation

Cited by: 1
Authors
Duan, Jianyong [1]
Lu, Ruzhan
Li, Xuening
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[2] So Yangtze Univ, Wuxi 214036, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
bootstrapping algorithms; machine learning; word sense disambiguation;
DOI
10.1142/S0218213007003369
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we propose a new word sense disambiguation method called Multi-engine Collaborative Bootstrapping (MCB), which combines different types of corpora and uses two languages for bootstrapping. MCB uses bilingual bootstrapping as its core algorithm, which leads to incremental knowledge acquisition. The EM model is applied to train the parameters of a base learner. The feature translation model is improved by semantic correlation estimation. In addition, we use multi-engine selection to produce qualified starting seeds from parallel corpora and monolingual corpora. Seeds generated through unsupervised machine learning approaches can also ensure bootstrapping effectiveness, compared with manually selected seeds, despite the different selection mechanisms. Experimental results demonstrate the effectiveness of MCB. Because the EM algorithm is sensitive to starting values, our experiments also consider factors such as the feature space and the number of starting seeds. Limitation of resources is also a concern.
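The idea of incremental knowledge acquisition from a small set of starting seeds can be illustrated with a minimal sketch. This is not the MCB algorithm from the paper: it is a generic monolingual self-training loop, with a naive Bayes base learner swapped in for the EM-trained learner and no bilingual or feature translation component. All names (`train`, `posterior`, `bootstrap`), the confidence threshold, and the toy data are assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def train(labeled):
    """Count senses and per-sense context words from (words, sense) pairs."""
    sense_counts = Counter()
    word_counts = defaultdict(Counter)
    for words, sense in labeled:
        sense_counts[sense] += 1
        word_counts[sense].update(words)
    return sense_counts, word_counts


def posterior(model, words, vocab_size):
    """Return P(sense | words) under naive Bayes with add-one smoothing."""
    sense_counts, word_counts = model
    total = sum(sense_counts.values())
    log_scores = {}
    for sense, n in sense_counts.items():
        denom = sum(word_counts[sense].values()) + vocab_size
        score = math.log(n / total)
        for w in words:
            score += math.log((word_counts[sense][w] + 1) / denom)
        log_scores[sense] = score
    # Normalize in a numerically stable way.
    top = max(log_scores.values())
    exp = {s: math.exp(v - top) for s, v in log_scores.items()}
    z = sum(exp.values())
    return {s: v / z for s, v in exp.items()}


def bootstrap(seeds, unlabeled, threshold=0.6, max_rounds=10):
    """Self-training: label confident contexts each round, then retrain."""
    labeled = list(seeds)
    pool = list(unlabeled)
    vocab = {w for words, _ in labeled for w in words}
    vocab |= {w for words in pool for w in words}
    for _ in range(max_rounds):
        model = train(labeled)
        remaining, added = [], 0
        for words in pool:
            probs = posterior(model, words, len(vocab))
            sense, p = max(probs.items(), key=lambda kv: kv[1])
            if p >= threshold:
                labeled.append((words, sense))  # becomes training data next round
                added += 1
            else:
                remaining.append(words)
        pool = remaining
        if added == 0:  # nothing confident enough: stop early
            break
    return labeled, pool


# Toy data (hypothetical): two seeds for the ambiguous word "bank".
seeds = [(["river", "water"], "river"), (["money", "loan"], "finance")]
unlabeled = [["river", "shore"], ["loan", "interest"], ["shore", "mud"]]
labeled, pool = bootstrap(seeds, unlabeled)
```

In this toy run the context `["shore", "mud"]` shares no word with either seed, so it cannot be labeled in the first round; it becomes labelable only after `["river", "shore"]` has been absorbed into the training set, which is the incremental effect that bootstrapping relies on. The sketch also hints at why results depend on the starting seeds: every later label is derived from them.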
Pages: 465-482
Page count: 18