Research on Weakly-Supervised Entity Relation Extraction of Specific Domain Based on Entropy Minimization

被引:0
|
作者
Zhao, Jun [1 ]
Guo, Jianyi [1 ]
Yu, Zhengtao [1 ]
Chen, Peng [1 ]
Mao, Cunli [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650051, Yunnan, Peoples R China
关键词
Entity relation extraction; Entropy minimization; Weakly-supervised; Specific domain;
D O I
10.1007/978-3-642-38466-0_30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There are two major issues of automatic entity relation extraction: human intervention and difficulty in labeling corpus. For these two problems, combined with the characteristics of the tourism domain, this paper adopts a weakly-supervised extraction method of entity relation based on entropy minimization. This method firstly extracts the characteristic words by the idea of scalar clustering with small-scale stratified marked instances, and constructs the initial classifier with maximum entropy machine learning algorithm. Then use the initial classifier of certain accuracy to classify the unlabeled instances, and add the instances of the minimum information entropy to the training corpus set to continually expand the scale of training data set. Finally, repeat the above iterative process until the performance of classifier is to be stabilized, and then a final extraction classifier of entity relation in specific domain will be constructed. Experiments performed on the corpus of tourism domain show that, not only can this method reduce the dependence of entity relation extraction on manual intervention, but it could effectively improve the performance of entity relation extraction, the F value of which is up to 63.69 %.
引用
收藏
页码:265 / 273
页数:9
相关论文
共 50 条
  • [31] Weakly-supervised butterfly detection based on saliency map
    Zhang, Ting
    Waqas, Muhammad
    Fang, Yu
    Liu, Zhaoying
    Halim, Zahid
    Li, Yujian
    Chen, Sheng
    PATTERN RECOGNITION, 2023, 138
  • [32] Weakly-supervised Text Classification Based on Keyword Graph
    Zhang, Lu
    Ding, Jiandong
    Xu, Yi
    Liu, Yingyao
    Zhou, Shuigeng
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2803 - 2813
  • [33] Weakly-Supervised Semantic Segmentation Based on Improved CAM
    Yan, Xingya
    Gao, Ying
    Wang, Gaihua
    Lecture Notes on Data Engineering and Communications Technologies, 2022, 89 : 584 - 594
  • [34] Research on Pattern Representation and Reliability in Semi-Supervised Entity Relation Extraction
    Ye, Feiyue
    Tang, Nan
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2016, PT II, 2016, 9713 : 289 - 297
  • [35] Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation
    Inoue, Naoto
    Furuta, Ryosuke
    Yamasaki, Toshihiko
    Aizawa, Kiyoharu
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5001 - 5009
  • [36] Semi-supervised Entity Relation Extraction Based on Trigger Word
    Tai, Liting
    Guo, Fenzhuo
    Qin, Sujuan
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 497 - 501
  • [37] Research on entity semantic relation extraction in Fusion Domain Knowledge Tree
    Li Zhen
    Zhang Youmin
    Bai Sen
    2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND BUSINESS ANALYTICS (ICDSBA 2018), 2018, : 84 - 91
  • [38] Entity relation extraction in the medical domain: based on data augmentation
    Wang, Anli
    Li, Linyi
    Wu, Xuehong
    Zhu, Jianping
    Yu, Shanshan
    Chen, Xi
    Li, Jianhua
    Zhu, Hongtao
    ANNALS OF TRANSLATIONAL MEDICINE, 2022, 10 (19)
  • [39] A Semi-automated Entity Relation Extraction Mechanism with Weakly Supervised Learning for Chinese Medical Webpages
    Liu, Zhao
    Tong, Jian
    Gu, Jinguang
    Liu, Kai
    Hu, Bo
    SMART HEALTH, ICSH 2016, 2017, 10219 : 44 - 56
  • [40] Infrared Ship Segmentation Based on Weakly-Supervised and Semi-Supervised Learning
    Ali Ibrahim, Isa
    Namoun, Abdallah
    Ullah, Sami
    Alasmary, Hisham
    Waqas, Muhammad
    Ahmad, Iftekhar
    IEEE ACCESS, 2024, 12 : 117908 - 117920