A Unified Model for Stable and Temporal Topic Detection from Social Media Data

Authors
Yin, Hongzhi [1]
Cui, Bin [1]
Lu, Hua [2]
Huang, Yuxin [1]
Yao, Junjie [1]
Affiliations
[1] Peking Univ, Key Lab High Confidence Software Technol, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Aalborg Univ, Dept Comp Sci, Aalborg, Denmark
Funding
National Natural Science Foundation of China
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Web 2.0 users generate and spread huge numbers of messages in online social media. Such user-generated content is a mixture of temporal topics (e.g., breaking events) and stable topics (e.g., user interests). Because the two kinds of topic differ in nature, it is important and useful to distinguish temporal topics from stable topics in social media. However, making this distinction is very challenging because user-generated texts in social media are very short and thus lack the linguistic features that traditional approaches need for precise analysis. In this paper, we propose a novel solution that detects both stable and temporal topics simultaneously from social media data. Specifically, a unified user-temporal mixture model is proposed to distinguish temporal topics from stable topics. To improve this model's performance, we design a regularization framework that exploits prior spatial information in a social network, as well as a burst-weighted smoothing scheme that exploits prior temporal information in the time dimension. We conduct extensive experiments to evaluate our proposal on two real data sets obtained from Del.icio.us and Twitter. The experimental results verify that our mixture model is able to distinguish temporal topics from stable topics in a single detection process. Our mixture model, enhanced with the spatial regularization and the burst-weighted smoothing scheme, significantly outperforms competitor approaches in terms of topic detection accuracy and discrimination between stable and temporal topics.
Pages: 661-672
Page count: 12