A distributed event extraction framework for large-scale unstructured text

被引:0
|
作者
Kan, Zhigang [1 ]
Mi, Haibo [1 ]
Yang, Sen [1 ]
Qiao, Linbo [1 ]
Feng, Dawei [1 ]
Li, Dongsheng [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
event extraction; massive data; inter-cloud;
D O I
10.1109/JCC49151.2020.00024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Event extraction is an important subtask of information extraction. The goal of event extraction is to quickly extract events of a specified type from a large amount of textual information. Many excellent models and algorithms have been proposed since ACE released the event extraction task in 2005. Most of them are based on the dataset published by ACE and have contributed to the accuracy of event extraction to a certain extent. In real-world applications, the processing object of the event extraction task is large-scale text data. However, as far as we know, there is currently no adequate model for using multiple computers for event extraction. In this paper, we propose a framework for event extraction based on inter-cloud computing technology, which aims to extract events from huge-amount of unstructured text data in the wild. The experimental results demonstrate that our method could improve the throughput, reduce time consumption of the event extraction process, and further gets better accuracy than advanced models.
引用
收藏
页码:102 / 108
页数:7
相关论文
共 50 条
  • [1] Simple Large-scale Relation Extraction from Unstructured Text
    Christodoulopoulos, Christos
    Mittal, Arpit
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 215 - 222
  • [2] A Word Distributed Representation Based Framework for Large-scale Short Text Classification
    Yao, Di
    Bi, Jingping
    Huang, Jianhui
    Zhu, Jin
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [3] Feature Extraction for Large-Scale Text Collections
    Gallagher, Luke
    Mallia, Antonio
    Culpepper, J. Shane
    Suel, Torsten
    Cambazoglu, B. Barla
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3015 - 3022
  • [4] Event Extraction from Unstructured Amharic Text
    Tadesse, Ephrem
    Aga, Rosa Tsegaye
    Qaqqabaa, Kuulaa
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2103 - 2109
  • [5] A scalable framework for large-scale distributed collaboration
    Yang, Shengwen
    Jiang, Jinlei
    Shi, Meilin
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 171 - 176
  • [6] UIMA GRID: Distributed large-scale text analysis
    Egner, Michael Thomas
    Lorch, Markus
    Biddle, Edd
    CCGRID 2007: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, 2007, : 317 - +
  • [7] A Distributed Topic Model for Large-Scale Streaming Text
    Li, Yicong
    Feng, Dawei
    Lu, Menglong
    Li, Dongsheng
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 37 - 48
  • [8] Large-Scale Extraction and Use of Knowledge from Text
    Clark, Peter
    Harrison, Phil
    K-CAP'09: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON KNOWLEDGE CAPTURE, 2009, : 153 - 160
  • [9] REES: A large-scale relation and event extraction system
    Aone, C
    Ramos-Santacruz, M
    6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : 76 - 83
  • [10] Mining Large-scale Event Knowledge from Web Text
    Cao, Ya-nan
    Zhang, Peng
    Guo, Jing
    Guo, Li
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 478 - 487