CoMod: An Abstractive Approach to Discourse Context Identification

被引：0

作者：

Guetari, Ramzi ^{[1
]}

Kraiem, Naoufel ^{[2
]}

机构：

[1] Univ Carthage, Polytech Sch Tunisia, SERCOM Lab, La Marsa 2078, Tunisia

[2] King Khalid Univ, Coll Comp Sci, Abha 61421, Saudi Arabia

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Context identification; generative models; machine learning; natural language processing; text summarizing; topic identification;

D O I：

10.1109/ACCESS.2023.3302179

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Generative text summarization can condense large volumes of information into a concise summary. It helps users quickly grasp the main points of a text without having to read the entire document. Machine learning (ML) plays a pivotal role in this domain, offering significant advantages in information processing and comprehension. In this paper we present CoMod, an abstractive method for generating the context of a document, from its content and that of the referenced documents, if any. CoMod analyzes the intricate patterns and relationships within a document's content, thereby extracting and inferring the underlying context. The context generation process involves using a word linearization process as well as a Markov model, specifically a Bigram model, to predict the likelihood of word sequences. The Markov model is trained on a corpus of text and used to generate coherent sentences based on the probabilities of transitioning from one word to another. Markov tables allows to adapt the generated context to a specific domain and can be built on the fly in CoMod. The approach was compared to other methods and demonstrated very encouraging capabilities by outperforming other approaches tested on the same datasets. It thus confirms the potential of generative methods in the field of automatic text summarization and their ability of leveraging the power of machine learning for context generation to revolutionize information management, boosting productivity, scalability and knowledge discovery in various domains.

引用

页码：82744 / 82770

页数：27

共 50 条

[21] Examining online public discourse in context: A mixed method approach
Witschge, Tamara
JAVNOST-THE PUBLIC, 2008, 15 (02) : 75 - 91
[22] A Context based Coverage Model for Abstractive Document Summarization
Kim, Heechan
Lee, Soowon
2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1129 - 1132
[23] A Study of Abstractive Summarization Using Semantic Representations and Discourse Level Information
Valderrama Vilca, Gregory Cesar
Sobrevilla Cabezudo, Marco Antonio
TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 482 - 490
[24] Improving Transformer with Sequential Context Representations for Abstractive Text Summarization
Cai, Tian
Shen, Mengjun
Peng, Huailiang
Jiang, Lei
Dai, Qiong
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 512 - 524
[25] Discourse abilities in Spanish in pre-universitary context: A SFL approach
Moyano, Estela
REVISTA SIGNOS, 2007, 40 (65): : 573 - 608
[26] Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs
Chen, Jiaao
Yang, Diyi
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 1380 - 1391
[27] Towards a New Hybrid Approach for Abstractive Summarization
Jaafar, Younes
Bouzoubaa, Karim
ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 286 - 293
[28] Discourse, context and cognition
van Dijk, TA
DISCOURSE STUDIES, 2006, 8 (01) : 159 - 177
[29] DISCOURSE AND CONTEXT - INTRODUCTION
COOKGUMPEREZ, J
DISCOURSE PROCESSES, 1981, 4 (02) : 89 - 91
[30] DISCOURSE IN CONTEXT.
Pang, Jixian
APPLIED LINGUISTICS, 2016, 37 (01) : 146 - 149

← 1 2 3 4 5 →