Neural Pipeline for Zero-Shot Data-to-Text Generation

Cited by: 0
Authors
Kasner, Zdenek [1]
Dusek, Ondrej [1]
Affiliations
[1] Charles Univ Prague, Fac Math & Phys, Inst Formal & Appl Linguist, Prague, Czech Republic
Funding
European Research Council;
Keywords
NATURAL-LANGUAGE GENERATION; OF-THE-ART;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In data-to-text (D2T) generation, training on in-domain data leads to overfitting to the data representation and to repeating noise from the training data. We examine how to avoid finetuning pretrained language models (PLMs) on D2T generation datasets while still taking advantage of the surface realization capabilities of PLMs. Inspired by pipeline approaches, we propose to generate text by transforming single-item descriptions with a sequence of modules trained on general-domain text-based operations: ordering, aggregation, and paragraph compression. We train PLMs to perform these operations on a synthetic corpus, WIKIFLUENT, which we build from English Wikipedia. Our experiments on two major triple-to-text datasets, WebNLG and E2E, show that our approach enables D2T generation from RDF triples in zero-shot settings.
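The pipeline described in the abstract can be pictured as a chain of text-to-text steps. The following is only a minimal sketch of that data flow, not the authors' implementation: the templates, the function names, and the three toy stand-in modules are all hypothetical, whereas in the paper each module is a trained PLM.

```python
from typing import Callable, Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical hand-written templates, one per predicate (an assumption
# for illustration; they produce the "single-item descriptions").
TEMPLATES: Dict[str, str] = {
    "birthPlace": "{s} was born in {o}.",
    "occupation": "{s} worked as a {o}.",
}


def verbalize(triples: List[Triple]) -> List[str]:
    """Turn each triple into one simple sentence using its template."""
    return [TEMPLATES[p].format(s=s, o=o) for s, p, o in triples]


def toy_order(facts: List[str]) -> List[str]:
    """Stand-in for the ordering module: keeps the input order."""
    return list(facts)


def toy_aggregate(facts: List[str]) -> List[List[str]]:
    """Stand-in for the aggregation module: groups facts in pairs."""
    return [facts[i:i + 2] for i in range(0, len(facts), 2)]


def toy_compress(groups: List[List[str]]) -> str:
    """Stand-in for paragraph compression: concatenates the sentences."""
    return " ".join(" ".join(group) for group in groups)


def run_pipeline(
    triples: List[Triple],
    order_fn: Callable[[List[str]], List[str]] = toy_order,
    aggregate_fn: Callable[[List[str]], List[List[str]]] = toy_aggregate,
    compress_fn: Callable[[List[List[str]]], str] = toy_compress,
) -> str:
    """Apply the steps in sequence: verbalize, order, aggregate, compress."""
    facts = verbalize(triples)
    ordered = order_fn(facts)
    groups = aggregate_fn(ordered)
    return compress_fn(groups)


example = [
    ("Ada Lovelace", "birthPlace", "London"),
    ("Ada Lovelace", "occupation", "mathematician"),
]
result = run_pipeline(example)
```

Because each step only consumes and produces plain text, any of the toy functions could be swapped for a learned model without changing the surrounding code, which is the property that lets the modules be trained on general-domain text rather than on D2T datasets.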
Pages: 3914-3932
Page count: 19
Related papers
50 in total
  • [41] Compositional Zero-Shot Domain Transfer with Text-to-Text Models
    Liu, Fangyu
    Liu, Qianchu
    Bannur, Shruthi
    Perez-Garcia, Fernando
    Usuyama, Naoto
    Zhang, Sheng
    Naumann, Tristan
    Nori, Aditya
    Poon, Hoifung
    Alvarez-Valle, Javier
    Oktay, Ozan
    Hyland, Stephanie L.
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 1097 - 1113
  • [42] Data-to-Text Generation with Attention Recurrent Unit
    Wang, Hechong
    Zhang, Wei
    Zhu, Yuesheng
    Bai, Zhiqiang
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [43] A benchmark dataset for Turkish data-to-text generation
    Demir, Seniz
    Oktem, Seza
    COMPUTER SPEECH AND LANGUAGE, 2023, 77
  • [44] Building RDF content for data-to-text generation
    1600, Association for Computational Linguistics, ACL Anthology
  • [45] Data-to-Text Generation with Content Selection and Planning
    Puduppully, Ratish
    Dong, Li
    Lapata, Mirella
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6908 - 6915
  • [46] TWT: Table with Written Text for Controlled Data-to-Text Generation
    Li, Tongliang
    Fang, Lei
    Lou, Jian-Guang
    Li, Zhoujun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1244 - 1254
  • [47] Zero-Shot Text Classification via Knowledge Graph Embedding for Social Media Data
    Chen, Qi
    Wang, Wei
    Huang, Kaizhu
    Coenen, Frans
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12): : 9205 - 9213
  • [48] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
    Hong, Fangzhou
    Zhang, Mingyuan
    Pan, Liang
    Cai, Zhongang
    Yang, Lei
    Liu, Ziwei
    ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (04):
  • [49] Zero-Shot Pipeline Detection for Sub-Bottom Profiler Data Based on Imaging Principles
    Zheng, Gen
    Zhao, Jianhu
    Li, Shaobo
    Feng, Jie
    REMOTE SENSING, 2021, 13 (21)
  • [50] Dual Projective Zero-Shot Learning Using Text Descriptions
    Rao, Yunbo
    Yang, Ziqiang
    Zeng, Shaoning
    Wang, Qifeng
    Pu, Jiansu
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)