ZoomNet for Topic-Oriented Fragment Recognition in Long Documents

被引:0
|
作者
Yan, Yukun [1 ,2 ]
Zheng, Daqi [3 ]
Lu, Zhengdong [3 ]
Song, Sen [1 ,2 ]
机构
[1] Tsinghua Univ, Lab Brain & Intelligence, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Biomed Engn, Beijing 100084, Peoples R China
[3] Deeplycurious AI, Res Dept, Beijing 100085, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Labeling; Decoding; Context modeling; Encoding; Information retrieval; Computational modeling; Information extraction; neural network; long documents; reinforcement learning; TERM DEPENDENCIES;
D O I
10.1109/ACCESS.2022.3166235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work introduces a new information extraction task called Topic-Oriented Fragment Recognition (TOFR), whose goal is to recognize information related to a specific topic in long documents from professional fields. In this paper, we introduce two TOFR datasets to study the problems of processing long documents. We propose a novel neural framework named Zooming Network (ZoomNet), which overcomes the challenge of combining information over long distances with limited computing resources by flexibly switching between skimming and intensive reading in processing long documents. In general, ZoomNet first establishes a hierarchical representation aligned to the text structure, which relieves the conflict between local information and extensive contextual information. Then, it synthesizes different levels of information to assign tags via multi-scale actions. We combine supervised and reinforcement learning methods to train our model. Experiments show that the proposed model outperforms several state-of-the-art sequence labeling models, including BiLSTM-CRF, BERT, XLNET, RoBERTa, and ELECTRA, on both TOFR datasets with big margins.
引用
收藏
页码:39545 / 39554
页数:10
相关论文
共 50 条
  • [21] Beyond audio and video retrieval: topic-oriented multimedia summarization
    Metze, Florian
    Ding, Duo
    Younessian, Ehsan
    Hauptmann, Alexander
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2013, 2 (02) : 131 - 144
  • [22] Detecting Topic-oriented Overlapping Community Using Hybrid a Hypergraph Model
    Shen, G. L.
    Yang, X. P.
    Sun, J.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2016, 11 (04) : 538 - 552
  • [23] Clustering and visualizing audiovisual dataset on mobile devices in a topic-oriented manner
    Wang, Lei
    Tjondrongoro, Dian
    Liu, Yuee
    ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 310 - 321
  • [24] Topic-oriented community detection of rating-based social networks
    Reihanian, Ali
    Minaei-Bidgoli, Behrouz
    Alizadeh, Hosein
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2016, 28 (03) : 303 - 310
  • [25] Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling
    Zou, Yicheng
    Zhao, Lujun
    Kang, Yangyang
    Lin, Jun
    Peng, Minlong
    Jiang, Zhuoren
    Sun, Changlong
    Zhang, Qi
    Huang, Xuanjing
    Liu, Xiaozhong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14665 - 14673
  • [26] TOSOM: A topic-oriented self-organizing map for text organization
    Yang, Hsin-Chang
    Lee, Chung-Hong
    Ke, Kuo-Lung
    World Academy of Science, Engineering and Technology, 2010, 65 : 1100 - 1104
  • [27] On the Usability of Clustering for Topic-oriented Multi-level Security Models
    Engelstad, Paal E.
    UKSIM-AMSS NINTH IEEE EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2015), 2015, : 14 - 20
  • [28] TOSOM: A topic-oriented self-organizing map for text organization
    Yang, Hsin-Chang
    Lee, Chung-Hong
    Ke, Kuo-Lung
    World Academy of Science, Engineering and Technology, 2010, 41 : 1100 - 1104
  • [29] Incorporating User Constraints into Topic-Oriented Self-Organizing Maps
    Yang, Hsin-Chang
    Lee, Chung-Hong
    Wu, Chun-Yen
    2013 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE (FOCI), 2013, : 91 - 97
  • [30] Path prediction of information diffusion based on a topic-oriented relationship strength network
    Zhu, Hengmin
    Yang, Xinyi
    Wei, Jing
    INFORMATION SCIENCES, 2023, 631 : 108 - 119