Event Graph-Based News Clustering: The Role of Named Entity-Centered Subgraphs

被引:0
|
作者
Komecoglu, Basak Buluz [1 ]
Yilmaz, Burcu [1 ]
机构
[1] Gebze Tech Univ, Inst Informat Technol, TR-41400 Gebze, Kocaeli, Turkiye
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Task analysis; Clustering algorithms; Vectors; Context modeling; Computational modeling; Analytical models; Semantics; Natural language processing; Text processing; Frequent subgraph mining; low-resource language; natural language processing; text clustering; TOPIC DETECTION; SIMILARITY;
D O I
10.1109/ACCESS.2024.3435343
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In an era of exponential growth in online news sources, the need for intelligent digital solutions capable of efficiently analyzing and organizing large amounts of news content has become crucial. This paper presents a graph-based methodology designed to enhance Topic Detection and Tracking (TDT) tasks in natural language processing by efficiently clustering news events into coherent stories. The proposed approach leverages a novel event graph model that captures not only the characteristics of individual news events but also their collective narrative context. Using Named Entity Centred Frequent Subgraphs, the model excels in identifying recurring patterns of events and thus provides a framework for learning a robust, language-independent, and structured representation for structuring news stories, which represents a significant advance in the refinement of traditional clustering algorithms. Empirical experiments using a multilingual benchmark dataset, the News Clustering Dataset, highlight the superior clustering performance of our approach compared to state-of-the-art monolingual document clustering techniques, particularly in English and the competitive results in Spanish. To underline the adaptability of the methodology to low-resource languages, the Turkish 'Story-Based News Dataset' developed specifically for this study also promises to serve as an important resource for a wide range of natural language processing tasks.
引用
收藏
页码:105613 / 105632
页数:20
相关论文
共 50 条
  • [31] Graph-Based Opinion Entity Ranking in Customer Reviews
    Chutmongkolporn, Kunuch
    Manaskasemsak, Bundit
    Rungsawang, Arnon
    2015 15TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2015, : 161 - 164
  • [32] Graph-based collective chinese entity linking algorithm
    Liu Q.
    Zhong Y.
    Li Y.
    Liu Y.
    Qin Z.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2016, 53 (02): : 270 - 283
  • [33] Unsupervised Graph-Based Entity Resolution for Complex Entities
    Kirielle, Nishadi
    Christen, Peter
    Ranbaduge, Thilina
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (01)
  • [34] Graph-based Clustering for Time Series Data
    Li, Peiyu
    Boubrahimi, Soukaina Filali
    Hamdi, Shah Muhammad
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4464 - 4467
  • [35] Graph-based multimodal clustering for social multimedia
    Georgios Petkos
    Manos Schinas
    Symeon Papadopoulos
    Yiannis Kompatsiaris
    Multimedia Tools and Applications, 2017, 76 : 7897 - 7919
  • [36] Graph-based clustering of random point set
    Imiya, A
    Tatara, K
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 948 - 956
  • [37] Anchor graph-based multiview spectral clustering
    Lei, Yu
    Niu, Zuoyuan
    Wang, Qianqian
    Gao, Quanxue
    Yang, Ming
    NEUROCOMPUTING, 2024, 583
  • [38] Fuzzy Clustering Method with Graph-based Regularization
    Chen, Long
    Guo, Li
    Lu, Xiliang
    Chen, C. L. Philip
    2016 INTERNATIONAL CONFERENCE ON FUZZY THEORY AND ITS APPLICATIONS (IFUZZY), 2016,
  • [39] Graph-based multimodal clustering for social multimedia
    Petkos, Georgios
    Schinas, Manos
    Papadopoulos, Symeon
    Kompatsiaris, Yiannis
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (06) : 7897 - 7919
  • [40] A GENETIC GRAPH-BASED APPROACH FOR PARTITIONAL CLUSTERING
    Menendez, Hector D.
    Barrero, David F.
    Camacho, David
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2014, 24 (03)