Feature Distance-based Framework for Classification of Low-Frequency Semantic Relations

被引:0
|
作者
Horie, Andre Kenji [1 ]
Ishizuka, Mitsuru [1 ]
机构
[1] Univ Tokyo, Sch Informat Sci & Technol, Tokyo, Japan
关键词
Semantic Computing; Concept Description; Natural Language Text;
D O I
10.1109/ICSC.2011.9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the relation extraction of semantic relations, it is not uncommon to face settings in which the training data provides very few instances of some relation classes. This is mostly due to the high cost of producing such data and to the class imbalance problem, which may result in some classes presenting small frequencies even with a large annotated corpus. This work thus presents a semi-supervised bootstrapped method to expand this initial training dataset, using pattern matching to extract new candidate instances from the Web. The core of this process uses a multiview feature distance-based framework, which allows quantitative and qualitative analysis of intermediate steps of the process. Experimental results show that this framework provides better results in the relation classification task than the baseline, and the bootstrapped architecture improves the relation classification task as a whole for these low-frequency semantic relations settings.
引用
收藏
页码:59 / 66
页数:8
相关论文
共 50 条
  • [21] A Wasserstein Distance-Based Cost-Sensitive Framework for Imbalanced Data Classification
    Feng, Rui
    Ji, Hongbing
    Zhu, Zhigang
    Wang, Lei
    RADIOENGINEERING, 2023, 32 (03) : 451 - 466
  • [22] Classification Similarity Learning Using Feature-Based and Distance-Based Representations: A Comparative Study
    Lopez-Inesta, Emilia
    Grimaldo, Francisco
    Arevalillo-Herraez, Miguel
    APPLIED ARTIFICIAL INTELLIGENCE, 2015, 29 (05) : 445 - 458
  • [23] On Relations between Distance-Based Topological Indices
    Alizadeh, Yaser
    Gutman, Ivan
    Nosratollahi, Shima
    Hussain, Zaryab
    MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2025, 93 (02)
  • [24] Sparse distance-based learning for simultaneous multiclass classification and feature selection of metagenomic data
    Liu, Zhenqiu
    Hsiao, William
    Cantarel, Brandi L.
    Drabek, Elliott Franco
    Fraser-Liggett, Claire
    BIOINFORMATICS, 2011, 27 (23) : 3242 - 3249
  • [25] An efficient orientation distance-based discriminative feature extraction method for multi-classification
    Liu, Bo
    Xiao, Yanshan
    Yu, Philip S.
    Hao, Zhifeng
    Cao, Longbing
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 39 (02) : 409 - 433
  • [26] Ensemble feature selection using distance-based supervised and unsupervised methods in binary classification
    Hallajian, Bita
    Motameni, Homayun
    Akbari, Ebrahim
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
  • [27] A distance-based framework to deal with ordinal and additive inconsistencies for fuzzy reciprocal preference relations
    Xu, Yejun
    Herrera, Francisco
    Wang, Huimin
    INFORMATION SCIENCES, 2016, 328 : 189 - 205
  • [28] A Distance-based Approach for Semantic Dissimilarity in Knowledge Graphs
    De Nies, Tom
    Beecks, Christian
    Godin, Frederic
    De Neve, Wesley
    Stepien, Grzegorz
    Arndt, Dorthe
    De Vocht, Laurens
    Verborgh, Ruben
    Seidl, Thomas
    Mannens, Erik
    Van de Walle, Rik
    2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2016, : 253 - 256
  • [29] A Canberra distance-based complex network classification framework using lumped catchment characteristics
    Istalkar, Prashant
    Unnithan, S. L. Kesav
    Biswal, Basudev
    Sivakumar, Bellie
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2021, 35 (06) : 1293 - 1300
  • [30] A Canberra distance-based complex network classification framework using lumped catchment characteristics
    Prashant Istalkar
    S. L. Kesav Unnithan
    Basudev Biswal
    Bellie Sivakumar
    Stochastic Environmental Research and Risk Assessment, 2021, 35 : 1293 - 1300