Selecting Training Data for Unsupervised Domain Adaptation in Word Sense Disambiguation

被引:0
|
作者
Komiya, Kanako [1 ]
Sasaki, Minoru [1 ]
Shinnou, Hiroyuki [1 ]
Kotani, Yoshiyuki [2 ]
Okumura, Manabu [3 ]
机构
[1] Ibaraki Univ, 4-12-1 Nakanarusawa, Hitachi, Ibaraki 3168511, Japan
[2] Tokyo Univ Agr & Thechnol, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
[3] Tokyo Inst Technol, Midori Ku, 4259 Nagatuta, Yokohama, Kanagawa 2268503, Japan
关键词
Domain adaptation; Word sense disambiguation; Data selection;
D O I
10.1007/978-3-319-42911-3_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a method of domain adaptation, which involves adapting a classifier developed from source to target data. We automatically select the training data set that is suitable for the target data from the whole source data of multiple domains. This is unsupervised domain adaptation for Japanese word sense disambiguation (WSD). Experiments revealed that the accuracies of WSD improved when we automatically selected the training data set using two criteria, the degree of confidence and the leave-one-out (LOO)-bound score, compared with when the classifier was trained with all the data.
引用
收藏
页码:220 / 232
页数:13
相关论文
共 50 条
  • [1] Domain Adaptation for Word Sense Disambiguation Using Word Embeddings
    Komiya, Kanako
    Suzuki, Shota
    Sasaki, Minoru
    Shinnou, Hiroyuki
    Okumura, Manabu
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 : 195 - 206
  • [2] An unsupervised method for word sense disambiguation
    Rahman, Nazreena
    Borah, Bhogeswar
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 6643 - 6651
  • [3] Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation
    Chan, Yee Seng
    Ng, Hwee Tou
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 89 - 96
  • [4] Unsupervised Word Sense Disambiguation Using Word Embeddings
    Moradi, Behzad
    Ansari, Ebrahim
    Zabokrtsky, Zdenek
    PROCEEDINGS OF THE 2019 25TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 228 - 233
  • [5] ADOPTING DOMAIN KNOWLEDGE TO ENHANCE LEXICAL CHAIN FOR UNSUPERVISED WORD SENSE DISAMBIGUATION
    Lee, Wei Jan
    Mit, Edwin
    PROCEEDINGS OF THE 2011 3RD INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGY AND ENGINEERING (ICSTE 2011), 2011, : 13 - 18
  • [6] Unsupervised Word Sense Disambiguation with Multilingual Representations
    Fernandez-Ordonez, Erwin
    Mihalcea, Rada
    Hassan, Samer
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 847 - 851
  • [7] Mnogoznal: an Unsupervised System for Word Sense Disambiguation
    Ustalov, Dmitry
    Teslenko, Denis
    Panchenko, Alexander
    Chernoskutov, Mikhail
    2017 INTERNATIONAL MULTI-CONFERENCE ON ENGINEERING, COMPUTER AND INFORMATION SCIENCES (SIBIRCON), 2017, : 147 - 150
  • [8] Unsupervised Word Sense Disambiguation Using The WWW
    Klapaftis, Ioannis P.
    Manandhar, Suresh
    STAIRS 2006, 2006, 142 : 174 - 183
  • [9] Unsupervised Approach to Word Sense Disambiguation in Malayalam
    Sankar, Sruthi K. P.
    Raj, P. C. Reghu
    Jayan, V
    INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING, SCIENCE AND TECHNOLOGY (ICETEST - 2015), 2016, 24 : 1507 - 1513
  • [10] Word Sense Disambiguation in Bengali: an Unsupervised Approach
    Pal, Alok Ranjan
    Saha, Diganta
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,