A distributed event extraction framework for large-scale unstructured text

Cited by: 0
Authors
Kan, Zhigang [1]
Mi, Haibo [1]
Yang, Sen [1]
Qiao, Linbo [1]
Feng, Dawei [1]
Li, Dongsheng [1]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
event extraction; massive data; inter-cloud
DOI
10.1109/JCC49151.2020.00024
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Event extraction is an important subtask of information extraction. Its goal is to quickly extract events of a specified type from a large amount of text. Many excellent models and algorithms have been proposed since ACE released the event extraction task in 2005. Most of them are based on the dataset published by ACE and have improved the accuracy of event extraction to a certain extent. In real-world applications, however, event extraction must process large-scale text data, and as far as we know, there is currently no adequate model for performing event extraction across multiple computers. In this paper, we propose a framework for event extraction based on inter-cloud computing technology, which aims to extract events from huge amounts of unstructured text data in the wild. The experimental results demonstrate that our method can improve throughput, reduce the time consumption of the event extraction process, and achieve better accuracy than advanced models.
Pages: 102-108
Number of pages: 7