Selecting Training Data for Unsupervised Domain Adaptation in Word Sense Disambiguation

被引:0
|
作者
Komiya, Kanako [1 ]
Sasaki, Minoru [1 ]
Shinnou, Hiroyuki [1 ]
Kotani, Yoshiyuki [2 ]
Okumura, Manabu [3 ]
机构
[1] Ibaraki Univ, 4-12-1 Nakanarusawa, Hitachi, Ibaraki 3168511, Japan
[2] Tokyo Univ Agr & Thechnol, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
[3] Tokyo Inst Technol, Midori Ku, 4259 Nagatuta, Yokohama, Kanagawa 2268503, Japan
关键词
Domain adaptation; Word sense disambiguation; Data selection;
D O I
10.1007/978-3-319-42911-3_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a method of domain adaptation, which involves adapting a classifier developed from source to target data. We automatically select the training data set that is suitable for the target data from the whole source data of multiple domains. This is unsupervised domain adaptation for Japanese word sense disambiguation (WSD). Experiments revealed that the accuracies of WSD improved when we automatically selected the training data set using two criteria, the degree of confidence and the leave-one-out (LOO)-bound score, compared with when the classifier was trained with all the data.
引用
收藏
页码:220 / 232
页数:13
相关论文
共 50 条
  • [21] Unsupervised Hindi Word Sense Disambiguation based on Network Agglomeration
    Jain, Amita
    Lobiyal, D. K.
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 195 - 200
  • [22] Unsupervised word sense disambiguation with N-gram features
    Preotiuc-Pietro, Daniel
    Hristea, Florentina
    ARTIFICIAL INTELLIGENCE REVIEW, 2014, 41 (02) : 241 - 260
  • [23] Unsupervised bilingual word sense disambiguation using Web statistics
    Wang, Y
    Hoffmann, A
    AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3809 : 1167 - 1172
  • [24] The optimization of gibbs sampling model in unsupervised word sense disambiguation
    Li, Xu
    Shen, Lan
    Yao, Chunlong
    Yu, Xiaoqiang
    ICIC Express Letters, Part B: Applications, 2012, 3 (04): : 861 - 868
  • [25] A clustering-based Approach for Unsupervised Word Sense Disambiguation
    Martin-Wanton, Tamara
    Berlanga-Llavori, Rafael
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 49 - 56
  • [26] Research on dual pattern of unsupervised and supervised Word Sense Disambiguation
    Wang, Yao-Feng
    Zhang, Yue-Jie
    Xu, Zhi-Ting
    Zhang, Tao
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2665 - +
  • [27] AN IMPROVED UNSUPERVISED LEARNING PROBABILISTIC MODEL OF WORD SENSE DISAMBIGUATION
    Li, Xu
    Zhao, Xiuyan
    Ban, Fenglong
    Liu, Bai
    PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 1071 - 1075
  • [28] Combining unsupervised lexical knowledge methods for word sense disambiguation
    Rigau, G
    Atserias, J
    Agirre, E
    35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 1997, : 48 - 55
  • [29] Graph Connectivity for Unsupervised Word Sense Disambiguation for HINDI Language
    Nandanwar, Lokesh
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [30] Unsupervised word sense disambiguation with N-gram features
    Daniel Preotiuc-Pietro
    Florentina Hristea
    Artificial Intelligence Review, 2014, 41 : 241 - 260