News Hotspots Detection and Tracking Based on LDA Topic Model

被引:0
|
作者
Hu, Xiao [1 ]
机构
[1] Capital Normal Univ, Beijing 100048, Peoples R China
基金
美国国家科学基金会; 北京市自然科学基金;
关键词
LDA topic model; news reports; hotspots; detection; traking;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid spread of Internet and the mobile web, the number of news pages is increasing quickly as well as the content of news becomes highly dynamic. It's difficult for normal users to obtain specific information contained in a mass of news streams. So it's of great research significance to study how to analyze massive news, detect and track news hotspots automatically. This research proposes to apply LDA (Latent Dirichlet Allocation) model to the application of topic detection and tracking. The news articles collected by crawlers are modeled by the LDA model in a form of document-topic-word distribution. We propose a method to compute the heat of topics based on the distribution and to detect the news hotspots. In addition, we track the evolution of the topic trends in different time-slices. Jenson-Shannon distance is used to measure the similarity between topics to identify topic inheritance and topic mutation. We conducted experiments on a dataset consisting of 3462 news texts from news portals. The result revealed that the proposed model has a good effect both in detecting hotspots and discovering meaningful topical evolution trends.
引用
收藏
页码:248 / 252
页数:5
相关论文
共 50 条
  • [1] A LDA model based topic detection method
    1600, Northwestern Polytechnical University (34):
  • [2] News Recommender System Based on Topic Detection and Tracking
    Qiu, Jing
    Liao, Lejian
    Li, Peng
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2009, 5589 : 690 - 697
  • [3] Research on Hotspots of Educational Application of Natural Language Processing Based on LDA Topic Model
    Wang, Meng
    Xie, Yuyang
    Tian, Yu
    CHINESE LEXICAL SEMANTICS, CLSW 2022, PT II, 2023, 13496 : 315 - 325
  • [4] Topic Detection and Tracking in News Articles
    Patel, Sagar
    Suthar, Sanket
    Patel, Sandip
    Patel, Nehal
    Patel, Arpita
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 2, 2018, 84 : 420 - 426
  • [5] A Relevance-based Topic Model for News Event Tracking
    Ha-Thuc, Viet
    Mejova, Yelena
    Harris, Christopher
    Srinivasan, Padmini
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 764 - 765
  • [6] Topic analysis based on LDA model
    College of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
    不详
    不详
    Zidonghua Xuebao Acta Auto. Sin., 2009, 12 (1586-1592):
  • [7] Topic detection and tracking for news web pages
    Mori, Masaki
    Miura, Takao
    Shioya, Isamu
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 338 - +
  • [8] Research on Topic Detection and Tracking for Online News Texts
    Xu, Guixian
    Meng, Yueting
    Chen, Zhan
    Qiu, Xiaoyu
    Wang, Changzhi
    Yao, Haishen
    IEEE ACCESS, 2019, 7 : 58407 - 58418
  • [9] Topic Detection and Tracking for Chinese News Web Pages
    Jing Qiu
    Liao, LeJian
    Dong, XiuJie
    ALPIT 2008: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 114 - 120
  • [10] Bilingual COVID-19 Fake News Detection Based on LDA Topic Modeling and BERT Transformer
    Omrani, Pouria
    Ebrahimian, Zahra
    Toosi, Ramin
    Akhaee, Mohammad Ali
    2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,