An Abstractive Approach to Sentence Compression

被引:17
|
作者
Cohn, Trevor [1 ]
Lapata, Mirella [2 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, S Yorkshire, England
[2] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Experimentation; Language generation; language models; machine translation; sentence compression; paraphrases; transduction; synchronous grammars; EXTRACTION;
D O I
10.1145/2483669.2483674
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we generalize the sentence compression task. Rather than simply shorten a sentence by deleting words or constituents, as in previous work, we rewrite it using additional operations such as substitution, reordering, and insertion. We present an experimental study showing that humans can naturally create abstractive sentences using a variety of rewrite operations, not just deletion. We next create a new corpus that is suited to the abstractive compression task and formulate a discriminative tree-to-tree transduction model that can account for structural and lexical mismatches. The model incorporates a grammar extraction method, uses a language model for coherent output, and can be easily tuned to a wide range of compression-specific loss functions.
引用
收藏
页数:35
相关论文
共 50 条
  • [1] An abstractive approach to sentence compression
    Cohn, T. (t.cohn@dcs.shef.ac.uk), 1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (43):
  • [2] Abstractive Sentence Compression with Event Attention
    Choi, Su Jeong
    Jung, Ian
    Park, Seyoung
    Park, Seong-Bae
    APPLIED SCIENCES-BASEL, 2019, 9 (19):
  • [3] Paraphrastic Fusion for Abstractive Multi-Sentence Compression Generation
    Nayeem, Mir Tafseer
    Chali, Yllias
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2223 - 2226
  • [4] Unsupervised Abstractive Meeting Summarization with Multi-Sentence Compression and Budgeted Submodular Maximization
    Shang, Guokan
    Ding, Wensi
    Zhang, Zekun
    Tixier, Antoine J. -P.
    Meladianos, Polykarpos
    Vazirgiannis, Michalis
    Lorre, Jean-Pierre
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 664 - 674
  • [5] Abstractive Summarization: A Hybrid Approach for the Compression of Semantic Graphs
    Balaji, J.
    Geetha, T. V.
    Parthasarathi, Ranjani
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2016, 12 (02) : 76 - 99
  • [6] Selective Encoding for Abstractive Sentence Summarization
    Zhou, Qingyu
    Yang, Nan
    Wei, Furu
    Zhou, Ming
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1095 - 1104
  • [7] Summarization beyond sentence extraction: A probabilistic approach to sentence compression
    Knight, K
    Marcu, D
    ARTIFICIAL INTELLIGENCE, 2002, 139 (01) : 91 - 107
  • [8] Multi-Document Abstractive Summarization Using ILP Based Multi-Sentence Compression
    Banerjee, Siddhartha
    Mitra, Prasenjit
    Sugiyama, Kazunari
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1208 - 1214
  • [9] SEQ3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression
    Baziotis, Christos
    Androutsopoulos, Ion
    Konstas, Ioannis
    Potamianos, Alexandros
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 673 - 681
  • [10] Unsupervised abstractive summarization via sentence rewriting
    Zhang, Zhihao
    Liang, Xinnian
    Zuo, Yuan
    Li, Zhoujun
    COMPUTER SPEECH AND LANGUAGE, 2023, 78