A Unified Model for Stable and Temporal Topic Detection from Social Media Data

被引:0
|
作者
Yin, Hongzhi [1 ]
Cui, Bin [1 ]
Lu, Hua [2 ]
Huang, Yuxin [1 ]
Yao, Junjie [1 ]
机构
[1] Peking Univ, Key Lab High Confidence Software Technol, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Aalborg Univ, Dept Comp Sci, Aalborg, Denmark
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web 2.0 users generate and spread huge amounts of messages in online social media. Such user-generated contents are mixture of temporal topics (e.g., breaking events) and stable topics (e.g., user interests). Due to their different natures, it is important and useful to distinguish temporal topics from stable topics in social media. However, such a discrimination is very challenging because the user-generated texts in social media are very short in length and thus lack useful linguistic features for precise analysis using traditional approaches. In this paper, we propose a novel solution to detect both stable and temporal topics simultaneously from social media data. Specifically, a unified user-temporal mixture model is proposed to distinguish temporal topics from stable topics. To improve this model's performance, we design a regularization framework that exploits prior spatial information in a social network, as well as a burst-weighted smoothing scheme that exploits temporal prior information in the time dimension. We conduct extensive experiments to evaluate our proposal on two real data sets obtained from Del.icio.us and Twitter. The experimental results verify that our mixture model is able to distinguish temporal topics from stable topics in a single detection process. Our mixture model enhanced with the spatial regularization and the burst-weighted smoothing scheme significantly outperforms competitor approaches, in terms of topic detection accuracy and discrimination in stable and temporal topics.
引用
收藏
页码:661 / 672
页数:12
相关论文
共 50 条
  • [41] Tracking geographical locations using a geo-aware topic model for analyzing social media data
    Lozano, Marianela Garcia
    Schreiber, Jonah
    Brynielsson, Joel
    DECISION SUPPORT SYSTEMS, 2017, 99 : 18 - 29
  • [42] Mining multiple spatial-temporal paths from social media data
    Yao, Hong
    Xiong, Muzhou
    Zeng, Deze
    Gong, Junfang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 782 - 791
  • [43] textPrep: A Text Preprocessing Toolkit for Topic Modeling on Social Media Data
    Churchill, Rob
    Singh, Lisa
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2021, : 60 - 70
  • [44] Analyzing Topic-Sentiment and Topic Evolution over Time from Social Media
    Hu, Yan
    Xu, Xiaofei
    Li, Li
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2016, 2016, 9983 : 97 - 109
  • [45] Characterizing the interests of social media users: Refinement of a topic model for incorporating heterogeneous media
    Han, Jonghyun
    Lee, Hyunju
    INFORMATION SCIENCES, 2016, 358 : 112 - 128
  • [46] EXTRACTING ACTIONABLE INSIGHTS FROM TEXT DATA : A STABLE TOPIC MODEL APPROACH1
    Yang, Yi
    Subramanyam, Ramanath
    MIS QUARTERLY, 2023, 47 (03) : 923 - 954
  • [47] A Unified Model for the Use and Acceptance of Stickers in Social Media Messaging
    Al-Maroof, Rana Saeed
    Salloum, Said A.
    AlHamadand, Ahmad Qasim Mohammad
    Shaalan, Khaled
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2019, 2020, 1058 : 370 - 381
  • [48] A Data Model for Content Modelling of Temporal Media
    Qasemizadeh, Behrang
    O'Neill, Ian
    Hanna, Philip
    Stewart, Darryl
    FUTURE MULTIMEDIA NETWORKING, PROCEEDINGS, 2009, 5630 : 194 - 199
  • [49] A Social Sensing Model for Event Detection and User Influence Discovering in Social Media Data Streams
    Shi, Lei-Lei
    Liu, Lu
    Wu, Yan
    Jiang, Liang
    Panneerselvam, John
    Crole, Roy
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (01) : 141 - 150
  • [50] Topic generation for Chinese stocks: a cognitively motivated topic modeling method using social media data
    Chen, Wenhao
    Lai, Kinkeung
    Cai, Yi
    QUANTITATIVE FINANCE AND ECONOMICS, 2018, 2 (02): : 279 - 293