Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

Cited by: 0
Authors
Pasunuru, Ramakanth [1]
Celikyilmaz, Asli [2]
Galley, Michel [2]
Xiong, Chenyan [2]
Zhang, Yizhe [2]
Bansal, Mohit [1]
Gao, Jianfeng [2]
Affiliations
[1] Univ N Carolina, Chapel Hill, NC 27599 USA
[2] Microsoft Res, Redmond, WA USA
Funding
National Science Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient large-scale high-quality training datasets. We present two QMDS training datasets, which we construct using two data augmentation methods: (1) transferring the commonly used single-document CNN/Daily Mail summarization dataset to create the QMDSCNN dataset, and (2) mining search-query logs to create the QMDSIR dataset. These two datasets have complementary properties, i.e., QMDSCNN has real summaries but queries are simulated, while QMDSIR has real queries but simulated summaries. To cover both these real summary and query aspects, we build abstractive end-to-end neural network models on the combined datasets that yield new state-of-the-art transfer results on DUC datasets. We also introduce new hierarchical encoders that enable a more efficient encoding of the query together with multiple documents. Empirical results demonstrate that our data augmentation and encoding methods outperform baseline models on automatic metrics, as well as on human evaluations along multiple attributes.
Pages: 13666 - 13674
Page count: 9
Related papers
  • [1] Query-Focused Multi-document Summarization Survey
    Alanzi, Entesar
    Alballaa, Safa
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (06) : 822 - 833
  • [2] Query-focused Multi-document Summarization Using Cloud Model
    Chen, Jinguang
    He, Tingting
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (03) : 951 - 956
  • [3] Query-Focused Multi-document Summarization Based on Concept Importance
    Zheng, Hai-Tao
    Guo, Ji-Min
    Jiang, Yong
    Xia, Shu-Tao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT II, 2016, 9652 : 443 - 453
  • [4] Applying regression models to query-focused multi-document summarization
    Ouyang, You
    Li, Wenjie
    Li, Sujian
    Lu, Qin
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (02) : 227 - 237
  • [5] Exploring actor–object relationships for query-focused multi-document summarization
    Valizadeh, Mohammadreza
    Brazdil, Pavel
    Soft Computing, 2015, 19 : 3109 - 3121
  • [6] A Novel Contextual Topic Model for Query-focused Multi-document Summarization
    Yang, Guangbing
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014 : 576 - 583
  • [7] Query-focused multi-document summarization: automatic data annotations and supervised learning approaches
    Chali, Yllias
    Hasan, Sadid A.
    NATURAL LANGUAGE ENGINEERING, 2012, 18 : 109 - 145
  • [8] Query-focused multi-document text summarization using fuzzy inference
    Agarwal, Raksha
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4641 - 4652
  • [9] Review on Query-focused Multi-document Summarization (QMDS) with Comparative Analysis
    Roy, Prasenjeet
    Kundu, Suman
    ACM COMPUTING SURVEYS, 2024, 56 (01)
  • [10] Exploiting relevance, coverage, and novelty for query-focused multi-document summarization
    Luo, Wenjuan
    Zhuang, Fuzhen
    He, Qing
    Shi, Zhongzhi
    KNOWLEDGE-BASED SYSTEMS, 2013, 46 : 33 - 42