Thematic Clustering Methods Applied to News Texts Analysis

Cited by: 0
Authors
Soloshenko, Anastasia N. [1]
Orlova, Yulia A. [1]
Rozaliev, Vladimir L. [1]
Zaboleeva-Zotova, Alla V. [1]
Affiliations
[1] Volgograd State Tech Univ, Volgograd, Russia
Keywords
thematic clustering; clustering algorithms; news articles; document representation
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the problem of partitioning documents from a news flow into groups such that the documents within each group are similar to one another. We use thematic clustering to solve this problem. Existing clustering algorithms, such as k-means and minimum spanning tree, among others, are considered and analyzed, and we show which of them give the best results on news texts. Clustering is a powerful tool for text processing, but it cannot give a complete picture of the semantics of a news article. This paper also presents a method for comprehensive analysis of news texts that combines statistical keyword-extraction algorithms with algorithms that establish the semantic coherence of text blocks. Particular attention is paid to the structural features of news texts.
Pages: 294 - 310
Number of pages: 17
Related papers
50 in total
  • [1] Thematic choice and progression in English and Chinese radio news texts: a systemic functional analysis
    Liu, Lijin
    Tucker, Gordon
    TEXT & TALK, 2015, 35 (04) : 481 - 504
  • [2] Applied Thematic Analysis
    Nazari, Maryam
    ONLINE INFORMATION REVIEW, 2012, 36 (02) : 330 - 335
  • [3] Applied Thematic Analysis
    Tarsilla, Michele
    CANADIAN JOURNAL OF PROGRAM EVALUATION, 2014, 29 (01) : 141 - 143
  • [4] Thematic Analysis on VOA News
    孙彦彤
    校园英语, 2017, (02) : 205 - 205
  • [5] Thematic and textual analysis methods for developing social validity questionnaires in applied behavior analysis
    Anderson, Rachel
    Taylor, Sarah
    Taylor, Tessa
    Virues-Ortega, Javier
    BEHAVIORAL INTERVENTIONS, 2022, 37 (03) : 732 - 753
  • [6] TEXTS AND DATA MINING AND THEIR POSSIBILITIES APPLIED TO THE PROCESS OF NEWS PRODUCTION
    Lima Junior, Walter Teixeira
    BRAZILIAN JOURNALISM RESEARCH, 2008, 4 (01) : 104 - 120
  • [7] Comparative Analysis of Accuracy of Fuzzy Clustering Methods Applied for Image Processing
    Yazdani, Hossein
    Choros, Kazimierz
    MULTIMEDIA AND NETWORK INFORMATION SYSTEMS, 2019, 833 : 89 - 98
  • [8] Analysis of the thematic concentration of texts: A comparison of journalistic texts by Ladislav Jehlicka and Karel Capek
    Glogarova, Jana Davidova
    David, Jaroslav
    Cech, Radek
    SLOVO A SLOVESNOST, 2013, 74 (01) : 41 - 54
  • [9] A Thematic Analysis of Fake News in India During the Pandemic
    Borgohain P.
    Bhatt A.
    Borgohain T.
    Gamit R.M.
    Science and Technology Libraries, 2023, 42 (03) : 297 - 307
  • [10] A Thematic Analysis of Unpaid School Meals in the News Media
    Spruance, Lori Andersen
    McConkie, McKayla
    Patten, Emily
    Goates, Michael C.
    JOURNAL OF HUNGER & ENVIRONMENTAL NUTRITION, 2022, 17 (06) : 850 - 859