Topic representation model based on microblogging behavior analysis

被引:10
|
作者
Han, Weihong [1 ]
Tian, Zhihong [1 ]
Huang, Zizhong [2 ]
Li, Shudong [1 ]
Jia, Yan [3 ]
机构
[1] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Peoples R China
[2] Natl Univ Def Technol, Comp Sch, Changsha 410073, Peoples R China
[3] Cyberspace Secur Res Ctr, Peng Cheng Lab, Shenzhen 518000, Peoples R China
关键词
Topic representation model; Behavior analysis; Word distribution; LDA model; Topic detection; INTERNET;
D O I
10.1007/s11280-020-00822-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of microblogging, it has become an important way for people to obtain information, express opinions, and make suggestions. Identifying new topics quickly and accurately from the massive microblogging data plays a crucial role for recommending information and controlling public opinion. The topic representation model provides a basis for topic detection. In this paper, we propose a topic representation model based on user behavior analysis, i.e., microblogging behavior analysis-latent Dirichlet allocation (MBA-LDA) model, for microblogging datasets. Topic-word distribution is acquired by the LDA model which considers information on user behaviors (such as posting, forwarding and commenting) and word distribution among documents within one topic and among different topics. The model also re-assesses the importance of words in topic representation. The basic idea is that the distribution of words within a topic or among different topics has a great influence on the selection of topic expression words. If a word is evenly distributed among all documents of a certain topic, it indicates that the word is the common word of all documents in the topic, and it is more suitable to represent this topic. If a word is more evenly distributed among various topics, it indicates that the word is the common word of all topics, and it can't achieve the purpose of distinguishing topics, so it is less suitable to represent any topic. By experiments with Sina Microblogging's actual data set, the topic model based on the MBA-LDA algorithm makes the representative words more important and increases the differentiation of topic words, which effectively improves the accuracy of subsequent topic detection and evolutionary analysis.
引用
收藏
页码:3083 / 3097
页数:15
相关论文
共 50 条
  • [41] Word Polarity Analysis Method Based on Topic Model
    Fan, Xiao-Nan
    Wang, Shi-Min
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND MANAGEMENT INNOVATION, 2014, : 107 - 112
  • [42] Library Microblogging Based on Sina Microblogging Platform
    Yang Shuangqi
    Xu Jing
    Shao Shengchun
    INFORMATION COMPUTING AND APPLICATIONS, PT 1, 2012, 307 : 702 - 707
  • [43] Topic Evolution Analysis Based on Optimized Combined Topic Model: Illustrated as CRISPR Technology
    Zhang, Yuanda
    Xu, Shihuan
    Yang, Yan
    Huang, Ying
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 13972 LNCS : 47 - 64
  • [44] A More Effective Method For Image Representation: Topic Model Based on Latent Dirichlet Allocation
    Li, Zongmin
    Tian, Weiwei
    Li, Yante
    Kuang, Zhenzhong
    Liu, Yujie
    2015 14TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS (CAD/GRAPHICS), 2015, : 143 - 148
  • [45] A Novel Hybrid Clustering Algorithm for Topic Detection on Chinese Microblogging
    Geng, Xiao
    Zhang, Yanmei
    Jiao, Yuhang
    Mei, Yinan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (02): : 289 - 300
  • [46] An Automatic Topic Ranking Approach for Event Detection on Microblogging Messages
    Lee, Chung-Hong
    Chien, Tzan-Feng
    Yang, Hsin-Chang
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 1358 - 1363
  • [47] Topic evolution analysis based on improved online LDA model
    He, Jianyun
    Chen, Xingshu
    Du, Min
    Jiang, Hao
    Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2015, 46 (02): : 547 - 553
  • [48] A Topic-Rank Recommendation Model Based on Microblog Topic Relevance & User Preference Analysis
    Bao, Fuguan
    Xu, Wenqian
    Feng, Yao
    Xu, Chonghuan
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2022, 12
  • [49] Social Network Analysis Based on Topic Model with Temporal Factor
    Thanh Ho
    Phuc Do
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2018, 9 (01) : 82 - 97
  • [50] Software crowdsourcing task pricing based on topic model analysis
    Shen, YuSong
    Yang, Ye
    Wang, Yong
    Chang, DeLin
    IET SOFTWARE, 2020, 14 (07) : 759 - 767