Table to text generation with accurate content copying

被引:0
|
作者
Yang Yang
Juan Cao
Yujun Wen
Pengzhou Zhang
机构
[1] Communication University of China,State Key Laboratory of Media Convergence and Communication
来源
Scientific Reports | / 11卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Generating fluent, coherent, and informative text from structured data is called table-to-text generation. Copying words from the table is a common method to solve the “out-of-vocabulary” problem, but it’s difficult to achieve accurate copying. In order to overcome this problem, we invent an auto-regressive framework based on the transformer that combines a copying mechanism and language modeling to generate target texts. Firstly, to make the model better learn the semantic relevance between table and text, we apply a word transformation method, which incorporates the field and position information into the target text to acquire the position of where to copy. Then we propose two auxiliary learning objectives, namely table-text constraint loss and copy loss. Table-text constraint loss is used to effectively model table inputs, whereas copy loss is exploited to precisely copy word fragments from a table. Furthermore, we improve the text search strategy to reduce the probability of generating incoherent and repetitive sentences. The model is verified by experiments on two datasets and better results are obtained than the baseline model. On WIKIBIO, the result is improved from 45.47 to 46.87 on BLEU and from 41.54 to 42.28 on ROUGE. On ROTOWIRE, the result is increased by 4.29% on CO metric, and 1.93 points higher on BLEU.
引用
收藏
相关论文
共 50 条
  • [1] Table to text generation with accurate content copying
    Yang, Yang
    Cao, Juan
    Wen, Yujun
    Zhang, Pengzhou
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [2] Enhancing Content Planning for Table-to-Text Generation with Data Understanding and Verification
    Gong, Heng
    Bi, Wei
    Feng, Xiaocheng
    Qin, Bing
    Liu, Xiaojiang
    Liu, Ting
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [3] F-SQL: Fuse Table Schema and Table Content for Single-Table Text2SQL Generation
    Zhang, Xiaoyu
    Yin, Fengjing
    Ma, Guojie
    Ge, Bin
    Xiao, Weidong
    IEEE ACCESS, 2020, 8 : 136409 - 136420
  • [4] Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints
    Wang, Zhenyi
    Wang, Xiaoyang
    An, Bang
    Yu, Dong
    Chen, Changyou
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1072 - 1086
  • [5] TABGENIE: A Toolkit for Table -to -Text Generation
    Kasner, Zdenek
    Garanina, Ekaterina
    Platek, Ondrej
    Dusek, Ondrej
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-DEMO 2023, VOL 3, 2023, : 444 - 455
  • [6] Retrieval augmentation for text-to-table generation
    Zhang, Jiahua
    Tan, Meijuan
    Zhang, Jing
    Zhang, Xiaolu
    Zhou, Jun
    Li, Chenliang
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (04)
  • [7] EVOL REPLICATOR COPYING TABLE
    不详
    MACHINERY AND PRODUCTION ENGINEERING, 1970, 117 (3021): : 592 - &
  • [8] Text, Table and Graph - which is faster & more accurate to understand?
    Prasad, Gollapudi V. R. J. Sai
    Ojha, Amitash
    2012 IEEE FOURTH INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR EDUCATION (T4E), 2012, : 126 - 131
  • [9] TWT: Table with Written Text for Controlled Data-to-Text Generation
    Li, Tongliang
    Fang, Lei
    Lou, Jian-Guang
    Li, Zhoujun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1244 - 1254
  • [10] ToTTo: A Controlled Table-To-Text Generation Dataset
    Parikh, Ankur P.
    Wang, Xuezhi
    Gehrmann, Sebastian
    Faruqui, Manaal
    Dhingra, Bhuwan
    Yang, Diyi
    Das, Dipanjan
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1173 - 1186