Feature Distance-based Framework for Classification of Low-Frequency Semantic Relations

被引:0
|
作者
Horie, Andre Kenji [1 ]
Ishizuka, Mitsuru [1 ]
机构
[1] Univ Tokyo, Sch Informat Sci & Technol, Tokyo, Japan
关键词
Semantic Computing; Concept Description; Natural Language Text;
D O I
10.1109/ICSC.2011.9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the relation extraction of semantic relations, it is not uncommon to face settings in which the training data provides very few instances of some relation classes. This is mostly due to the high cost of producing such data and to the class imbalance problem, which may result in some classes presenting small frequencies even with a large annotated corpus. This work thus presents a semi-supervised bootstrapped method to expand this initial training dataset, using pattern matching to extract new candidate instances from the Web. The core of this process uses a multiview feature distance-based framework, which allows quantitative and qualitative analysis of intermediate steps of the process. Experimental results show that this framework provides better results in the relation classification task than the baseline, and the bootstrapped architecture improves the relation classification task as a whole for these low-frequency semantic relations settings.
引用
收藏
页码:59 / 66
页数:8
相关论文
共 50 条
  • [31] A distance-based separator representation for pattern classification
    Shih, Frank Y.
    Zhang, Kai
    IMAGE AND VISION COMPUTING, 2008, 26 (05) : 667 - 672
  • [32] A Geometric Theory of Feature Selection and Distance-Based Measures
    Shin, Kilho
    Angulo, Adrian Pino
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3812 - 3819
  • [33] Distance-based test feature classifiers and its applications
    Lashkia, V.
    Kaneko, S.
    Aleshin, S.
    IEICE Transactions on Information and Systems, 2000, E83-D (04) : 904 - 913
  • [35] Classifying imbalanced data in distance-based feature space
    Shin Ando
    Knowledge and Information Systems, 2016, 46 : 707 - 730
  • [36] Feature-rich distance-based terrain synthesis
    Rusnell, Brennan
    Mould, David
    Eramian, Mark
    VISUAL COMPUTER, 2009, 25 (5-7): : 573 - 579
  • [37] Distance-Based High-Frequency Trading
    Felker, Travis
    Mazalov, Vadim
    Watt, Stephen M.
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 2055 - 2064
  • [38] Distance-Based Network Recovery Under Feature Correlation
    Adametz, David
    Roth, Volker
    SIMILARITY-BASED PATTERN RECOGNITION, SIMBAD 2015, 2015, 9370 : 209 - 210
  • [39] Fast Fingertips Positioning Based on Distance-based Feature Pixels
    Le Dung
    Mizukawa, Makoto
    2010 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2010, : 184 - 189
  • [40] Feature-rich distance-based terrain synthesis
    Brennan Rusnell
    David Mould
    Mark Eramian
    The Visual Computer, 2009, 25 : 573 - 579