CoMod: An Abstractive Approach to Discourse Context Identification

被引:0
|
作者
Guetari, Ramzi [1 ]
Kraiem, Naoufel [2 ]
机构
[1] Univ Carthage, Polytech Sch Tunisia, SERCOM Lab, La Marsa 2078, Tunisia
[2] King Khalid Univ, Coll Comp Sci, Abha 61421, Saudi Arabia
关键词
Context identification; generative models; machine learning; natural language processing; text summarizing; topic identification;
D O I
10.1109/ACCESS.2023.3302179
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generative text summarization can condense large volumes of information into a concise summary. It helps users quickly grasp the main points of a text without having to read the entire document. Machine learning (ML) plays a pivotal role in this domain, offering significant advantages in information processing and comprehension. In this paper we present CoMod, an abstractive method for generating the context of a document, from its content and that of the referenced documents, if any. CoMod analyzes the intricate patterns and relationships within a document's content, thereby extracting and inferring the underlying context. The context generation process involves using a word linearization process as well as a Markov model, specifically a Bigram model, to predict the likelihood of word sequences. The Markov model is trained on a corpus of text and used to generate coherent sentences based on the probabilities of transitioning from one word to another. Markov tables allows to adapt the generated context to a specific domain and can be built on the fly in CoMod. The approach was compared to other methods and demonstrated very encouraging capabilities by outperforming other approaches tested on the same datasets. It thus confirms the potential of generative methods in the field of automatic text summarization and their ability of leveraging the power of machine learning for context generation to revolutionize information management, boosting productivity, scalability and knowledge discovery in various domains.
引用
收藏
页码:82744 / 82770
页数:27
相关论文
共 50 条
  • [21] Examining online public discourse in context: A mixed method approach
    Witschge, Tamara
    JAVNOST-THE PUBLIC, 2008, 15 (02) : 75 - 91
  • [22] A Context based Coverage Model for Abstractive Document Summarization
    Kim, Heechan
    Lee, Soowon
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1129 - 1132
  • [23] A Study of Abstractive Summarization Using Semantic Representations and Discourse Level Information
    Valderrama Vilca, Gregory Cesar
    Sobrevilla Cabezudo, Marco Antonio
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 482 - 490
  • [24] Improving Transformer with Sequential Context Representations for Abstractive Text Summarization
    Cai, Tian
    Shen, Mengjun
    Peng, Huailiang
    Jiang, Lei
    Dai, Qiong
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 512 - 524
  • [25] Discourse abilities in Spanish in pre-universitary context: A SFL approach
    Moyano, Estela
    REVISTA SIGNOS, 2007, 40 (65): : 573 - 608
  • [26] Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs
    Chen, Jiaao
    Yang, Diyi
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 1380 - 1391
  • [27] Towards a New Hybrid Approach for Abstractive Summarization
    Jaafar, Younes
    Bouzoubaa, Karim
    ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 286 - 293
  • [28] Discourse, context and cognition
    van Dijk, TA
    DISCOURSE STUDIES, 2006, 8 (01) : 159 - 177
  • [29] DISCOURSE AND CONTEXT - INTRODUCTION
    COOKGUMPEREZ, J
    DISCOURSE PROCESSES, 1981, 4 (02) : 89 - 91
  • [30] DISCOURSE IN CONTEXT.
    Pang, Jixian
    APPLIED LINGUISTICS, 2016, 37 (01) : 146 - 149