UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models

Cited by: 0
Authors
Li, Xiaoxi [1]
Zhou, Yujia
Dou, Zhicheng
Affiliations
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
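Funding
National Natural Science Foundation of China;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract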
Generative information retrieval, encompassing two major tasks of Generative Document Retrieval (GDR) and Grounded Answer Generation (GAR), has gained significant attention in the area of information retrieval and natural language processing. Existing methods for GDR and GAR rely on separate retrieval and reader modules, which hinder simultaneous optimization. To overcome this, we present UniGen, a Unified Generative framework for retrieval and question answering that integrates both tasks into a single generative model leveraging the capabilities of large language models. UniGen employs a shared encoder and two distinct decoders for generative retrieval and question answering. To facilitate the learning of both tasks, we introduce connectors, generated by large language models, to bridge the gaps between query inputs and generation targets, as well as between document identifiers and answers. Furthermore, we propose an iterative enhancement strategy that leverages generated answers and retrieved documents to iteratively improve both tasks. Through extensive experiments on the MS MARCO and NQ datasets, we demonstrate the effectiveness of UniGen, showcasing its superior performance in both the retrieval and the question answering tasks.
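The abstract describes one shared encoder feeding two decoders, plus an iterative loop that feeds generated answers and retrieved documents back into the input. The sketch below illustrates only that control flow and is not the authors' implementation. Everything in it is an assumption made for illustration: the bag-of-words `encode`, the lookup-style `retrieval_decoder` and `answer_decoder`, the toy `CORPUS`, and the function name `unigen_sketch` are all hypothetical, and the LLM-generated connectors are not modeled.

```python
from collections import Counter

# Hypothetical toy corpus: document identifier -> document text.
CORPUS = {
    "doc-paris": "paris is the capital of france",
    "doc-berlin": "berlin is the capital of germany",
}


def encode(text):
    """Shared 'encoder': a bag-of-words stand-in for a neural encoder."""
    return Counter(text.lower().split())


def overlap(a, b):
    """Count tokens shared by two bag-of-words representations."""
    return sum((a & b).values())


def best_doc(state):
    """Return the identifier of the document that best matches the state."""
    return max(sorted(CORPUS), key=lambda d: overlap(state, encode(CORPUS[d])))


def retrieval_decoder(state):
    """'Decoder' 1: emit a document identifier from the shared state."""
    return best_doc(state)


def answer_decoder(state, query):
    """'Decoder' 2: emit the first token of the best-matching document
    that does not already appear in the query (toy answer generation)."""
    query_tokens = set(query.lower().split())
    doc_tokens = CORPUS[best_doc(state)].split()
    return next((t for t in doc_tokens if t not in query_tokens), "")


def unigen_sketch(query, rounds=2):
    """Run both decoders on one shared encoding per round, then feed the
    generated answer and retrieved document back into the next input."""
    context = query
    doc_id, answer = None, None
    for _ in range(rounds):
        state = encode(context)              # shared by both decoders
        doc_id = retrieval_decoder(state)
        answer = answer_decoder(state, query)
        context = f"{query} {answer} {CORPUS[doc_id]}"  # iterative enhancement
    return doc_id, answer


print(unigen_sketch("what is the capital of france"))
```

Both decoders read the same encoded state in each round, which is the property that allows the two tasks to be optimized together in a learned model; here it is only mimicked with lookups.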
Pages: 8688-8696
Page count: 9