Event Graph-Based News Clustering: The Role of Named Entity-Centered Subgraphs

Authors
Komecoglu, Basak Buluz [1]
Yilmaz, Burcu [1]
Affiliations
[1] Gebze Tech Univ, Inst Informat Technol, TR-41400 Gebze, Kocaeli, Turkiye
Source
IEEE ACCESS | 2024 | Vol. 12
Keywords
Task analysis; Clustering algorithms; Vectors; Context modeling; Computational modeling; Analytical models; Semantics; Natural language processing; Text processing; Frequent subgraph mining; Low-resource language; Text clustering; Topic detection; Similarity
DOI
10.1109/ACCESS.2024.3435343
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In an era of exponential growth in online news sources, the need for intelligent digital solutions capable of efficiently analyzing and organizing large amounts of news content has become crucial. This paper presents a graph-based methodology designed to enhance Topic Detection and Tracking (TDT) tasks in natural language processing by efficiently clustering news events into coherent stories. The proposed approach leverages a novel event graph model that captures not only the characteristics of individual news events but also their collective narrative context. Using Named Entity Centred Frequent Subgraphs, the model excels in identifying recurring patterns of events and thus provides a framework for learning a robust, language-independent, and structured representation for structuring news stories, which represents a significant advance in the refinement of traditional clustering algorithms. Empirical experiments using a multilingual benchmark dataset, the News Clustering Dataset, highlight the superior clustering performance of our approach compared to state-of-the-art monolingual document clustering techniques, particularly in English and the competitive results in Spanish. To underline the adaptability of the methodology to low-resource languages, the Turkish 'Story-Based News Dataset' developed specifically for this study also promises to serve as an important resource for a wide range of natural language processing tasks.
Pages: 105613-105632
Page count: 20