Mining the key predictors for event outbreaks in social networks

被引:11
|
作者
Yi, Chengqi [1 ]
Bao, Yuanyuan [2 ,3 ]
Xue, Yibo [2 ,3 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Tsinghua Univ, Res Inst Informat Technol, FIT Bldg 3-418, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
Social network; Outbreak prediction; Information dissemination; Predictors; Data-driven; INFORMATION PROPAGATION; DIFFUSION;
D O I
10.1016/j.physa.2015.12.019
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
It will be beneficial to devise a method to predict a so-called event outbreak. Existing works mainly focus on exploring effective methods for improving the accuracy of predictions, while ignoring the underlying causes: What makes event go viral? What factors that significantly influence the prediction of an event outbreak in social networks? In this paper, we proposed a novel definition for an event outbreak, taking into account the structural changes to a network during the propagation of content. In addition, we investigated features that were sensitive to predicting an event outbreak. In order to investigate the universality of these features at different stages of an event, we split the entire lifecycle of an event into 20 equal segments according to the proportion of the propagation time. We extracted 44 features, including features related to content, users, structure, and time, from each segment of the event. Based on these features, we proposed a prediction method using supervised classification algorithms to predict event outbreaks. Experimental results indicate that, as time goes by, our method is highly accurate, with a precision rate ranging from 79% to 97% and a recall rate ranging from 74% to 97%. In addition, after applying a feature-selection algorithm, the top five selected features can considerably improve the accuracy of the prediction. Data-driven experimental results show that the entropy of the eigenvector centrality, the entropy of the PageRank, the standard deviation of the betweenness centrality, the proportion of re-shares without content, and the average path length are the key predictors for an event outbreak. Our findings are especially useful for further exploring the intrinsic characteristics of outbreak prediction. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:247 / 260
页数:14
相关论文
共 50 条
  • [1] Social Networks Event Mining: A Systematic Literature Review
    Shaikh, Muniba
    Salleh, Norsaremah
    Marziana, Lili
    PATTERN ANALYSIS, INTELLIGENT SECURITY AND THE INTERNET OF THINGS, 2015, 355 : 169 - 177
  • [2] Social Networks Peacebuilding Event Mining: A Systematic Literature Review
    Shaikh, Muniba
    Salleh, Norsaremah
    Marziana, Lili
    3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES ACSAT 2014, 2014, : 119 - 124
  • [3] Using X Social Networks and web news mining to predict Marburg virus disease outbreaks
    Jokar, Mohammad
    Jahanbin, Kia
    Rahmanian, Vahid
    ASIAN PACIFIC JOURNAL OF TROPICAL MEDICINE, 2025, 18 (02) : 96 - 98
  • [4] Inferring Social Networks from Outbreaks
    Angluin, Dana
    Aspnes, James
    Reyzin, Lev
    ALGORITHMIC LEARNING THEORY, ALT 2010, 2010, 6331 : 104 - 118
  • [5] social networks and social Web mining
    Xu, Guandong
    Yu, Jeffrey
    Lee, Wookey
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2013, 16 (5-6): : 541 - 544
  • [6] Detecting rumor outbreaks in online social networks
    Fraszczak, Damian
    SOCIAL NETWORK ANALYSIS AND MINING, 2023, 13 (01)
  • [7] Detecting rumor outbreaks in online social networks
    Damian Frąszczak
    Social Network Analysis and Mining, 13
  • [8] Data mining in social networks
    Jensen, D
    Neville, J
    DYNAMIC SOCIAL NETWORK MODELING AND ANALYSIS, 2003, : 289 - 302
  • [9] Semantic Mining of Social Networks
    Tang, Jie
    Li, Juanzi
    Synthesis Lectures on the Semantic Web: Theory and Technology, 2015, 5 (02): : 1 - 207
  • [10] Long-term predictors of dengue outbreaks in Bangladesh: A data mining approach
    Muurlink, Olav Titus
    Stephenson, Peter
    Islam, Mohammad Zahirul
    Taylor-Robinson, Andrew W.
    INFECTIOUS DISEASE MODELLING, 2018, 3 : 322 - 330