An Abstractive Approach to Sentence Compression

被引:17
|
作者
Cohn, Trevor [1 ]
Lapata, Mirella [2 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, S Yorkshire, England
[2] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Experimentation; Language generation; language models; machine translation; sentence compression; paraphrases; transduction; synchronous grammars; EXTRACTION;
D O I
10.1145/2483669.2483674
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we generalize the sentence compression task. Rather than simply shorten a sentence by deleting words or constituents, as in previous work, we rewrite it using additional operations such as substitution, reordering, and insertion. We present an experimental study showing that humans can naturally create abstractive sentences using a variety of rewrite operations, not just deletion. We next create a new corpus that is suited to the abstractive compression task and formulate a discriminative tree-to-tree transduction model that can account for structural and lexical mismatches. The model incorporates a grammar extraction method, uses a language model for coherent output, and can be easily tuned to a wide range of compression-specific loss functions.
引用
收藏
页数:35
相关论文
共 50 条
  • [41] FAR-ASS: Fact-aware reinforced abstractive sentence summarization
    Zhang, Mengli
    Zhou, Gang
    Yu, Wanting
    Liu, Wenfen
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (03)
  • [42] Probabilistic Unsupervised Chinese Sentence Compression
    Chen, Jinguang
    He, Tingting
    Gui, Zhuoming
    Li, Fang
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 61 - +
  • [43] Chinese Sentence Compression: Corpus and Evaluation
    Zhang, Chunliang
    Hu, Minghan
    Xiao, Tong
    Jiang, Xue
    Shi, Lixin
    Zhu, Jingbo
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, 2013, 8208 : 257 - 267
  • [44] Sentence Compression with Natural Language Generation
    Li, Peng
    Wang, Yinglin
    KNOWLEDGE ENGINEERING AND MANAGEMENT, 2011, 123 : 357 - 363
  • [45] Sentence compression with a Markov logic network
    Jin, Feng
    Huang, Minlie
    Zhu, Xiaoyan
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2011, 51 (11): : 1596 - 1600
  • [46] Towards a New Hybrid Approach for Abstractive Summarization
    Jaafar, Younes
    Bouzoubaa, Karim
    ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 286 - 293
  • [47] CoMod: An Abstractive Approach to Discourse Context Identification
    Guetari, Ramzi
    Kraiem, Naoufel
    IEEE ACCESS, 2023, 11 : 82744 - 82770
  • [48] A New Approach to Overgenerating and Scoring Abstractive Summaries
    Song, Kaiqiang
    Wang, Bingqing
    Feng, Zhe
    Liu, Fei
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 1392 - 1404
  • [49] An Effective Approach of Sentence Compression Based on "Re-read" Mechanism and Bayesian Combination Model
    Lu, Zhonglei
    Liu, Wenfen
    Zhou, Yanfang
    Hu, Xuexian
    Wang, Binyu
    SOCIAL MEDIA PROCESSING, SMP 2017, 2017, 774 : 129 - 140
  • [50] A Language Model based Evaluator for Sentence Compression
    Zhao, Yang
    Luo, Zhiyuan
    Aizawa, Akiko
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 170 - 175