Extractive text summarization of arabic multi-document using fuzzy C-means and Latent Dirichlet Allocation

被引:4
|
作者
Al-Taani, Ahmad T. T. [1 ]
Al-Sayadi, Sami H. H. [1 ]
机构
[1] Yarmouk Univ, Dept Comp Sci, Irbid, Jordan
关键词
Multi-document text summarization; Arabic Language; Extractive-based summarization; Singular value decomposition (SVD); Fuzzy C-Means algorithm; Latent Dirichlet allocation (LDA) algorithm; RANKING;
D O I
10.1007/s13198-022-01783-2
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this research, we investigated the performance of the combination of fuzzy c-means and latent Dirichlet allocation algorithms for Arabic multi-document summarization. The summary should include the most essential sentences from multi-documents with the same topic. The TAC-2011 corpus is used for experiments, first, the documents in the corpus are clustered using fuzzy c-means algorithm. The aim of the clustering process here is to classify the documents according to their topics, e.g., economic, politic, sport, etc. The results are compared against some recent Arabic summarization approaches that used ant colony and discriminant analysis algorithms. The proposed approach has obtained competitive results compared to those recent approaches.
引用
收藏
页码:713 / 726
页数:14
相关论文
共 50 条
  • [21] Using Proximity in Query Focused Multi-document Extractive Summarization
    Li, Sujian
    Zhang, Yu
    Wang, Wei
    Wang, Chen
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 179 - 188
  • [22] Query-focused multi-document text summarization using fuzzy inference
    Agarwal, Raksha
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4641 - 4652
  • [23] Parallelizing a multi-objective optimization approach for extractive multi-document text summarization
    Sanchez-Gomez, Jesus M.
    Vega-Rodriguez, Miguel A.
    Perez, Carlos J.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 134 : 166 - 179
  • [24] Extractive Multi-document Text Summarization Leveraging Hybrid Semantic Similarity Measures
    Bandaru, Rajesh
    Radhika, Dr. Y.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 844 - 852
  • [25] Topic modeling combined with classification technique for extractive multi-document text summarization
    Rajendra Kumar Roul
    Soft Computing, 2021, 25 : 1113 - 1127
  • [26] Topic modeling combined with classification technique for extractive multi-document text summarization
    Roul, Rajendra Kumar
    SOFT COMPUTING, 2021, 25 (02) : 1113 - 1127
  • [27] Multi-document Text Summarization Using Sentence Extraction
    Ahuja, Ravinder
    Anand, Willson
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2016, 2017, 517 : 235 - 242
  • [28] Extractive summarization of Malayalam documents using latent Dirichlet allocation: An experience
    Kondath, Manju
    Suseelan, David Peter
    Idicula, Sumam Mary
    JOURNAL OF INTELLIGENT SYSTEMS, 2022, 31 (01) : 393 - 406
  • [29] Minimum redundancy and maximum relevance for single and multi-document Arabic text summarization
    Oufaida, Houda
    Nouali, Omar
    Blache, Philippe
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (04) : 450 - 461
  • [30] Text Summarization as a Multi-objective Optimization Task: Applying Harmony Search to Extractive Multi-Document Summarization
    Bidoki, M.
    Fakhrahmad, M.
    Moosavi, M. R.
    COMPUTER JOURNAL, 2022, 65 (05): : 1053 - 1072