Real-Time Novel Event Detection from Social Media

Cited by: 37
Authors
Li, Quanzhi [1]
Nourbakhsh, Armineh [1]
Shah, Sameena [1]
Liu, Xiaomo [1]
Affiliations
[1] Thomson Reuters, Res & Dev, 3 Times Sq, New York, NY 10036 USA
Keywords
event detection; event novelty; novel event; temporal identification; temporal information; semantic class; social media
DOI
10.1109/ICDE.2017.157
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present a new approach for detecting novel events from social media, specifically Twitter, in real time. An event is usually defined by who, what, where, and when, and an event tweet usually contains terms corresponding to these aspects. To exploit this information, we propose a method that incorporates simple semantics by splitting the tweet term space into groups of terms of the same semantic type. These groups are called semantic categories (classes), and each reflects one or more event aspects. The semantic classes include named entity, mention, location, hashtag, verb, noun, and embedded link. To group tweets about the same event into the same cluster, similarity is measured by computing class-wise similarities and then aggregating them. Users of a real-time event detection system are usually interested only in novel (new) events, which are happening now or happened a short time ago. To meet this requirement, a temporal identification module filters out event clusters that are about old stories. The clustering module also computes a novelty score for each event cluster, which reflects how novel the event is compared to previous events. We evaluated our event detection method using multiple quality metrics and a large-scale event corpus containing millions of tweets. The experimental results show that the proposed online event detection method achieves state-of-the-art performance. Our experiments also show that the temporal identification module can effectively detect old events.
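The class-wise similarity idea described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the per-class similarity function, the class weights, or the aggregation rule, so everything below is an assumption made for illustration: cosine similarity over term counts within each class, a weighted average across classes, and the hypothetical names `CLASSES`, `WEIGHTS`, `cosine`, and `classwise_similarity`. It should not be read as the authors' implementation.

```python
from collections import Counter
from math import sqrt

# Semantic classes listed in the abstract.
CLASSES = ("named_entity", "mention", "location", "hashtag", "verb", "noun", "link")

# Hypothetical per-class weights; the abstract does not give any values.
WEIGHTS = {
    "named_entity": 3.0, "mention": 1.0, "location": 2.0, "hashtag": 2.0,
    "verb": 1.0, "noun": 1.0, "link": 1.0,
}


def cosine(a, b):
    """Cosine similarity between two term-count mappings."""
    if not a or not b:
        return 0.0
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b)


def classwise_similarity(x, y, weights=WEIGHTS):
    """Weighted average of per-class cosine similarities.

    x and y map a class name to a list of terms. Classes that are
    empty in both items are skipped (an assumption of this sketch).
    """
    total = 0.0
    weight_sum = 0.0
    for cls in CLASSES:
        terms_x = Counter(x.get(cls, ()))
        terms_y = Counter(y.get(cls, ()))
        if not terms_x and not terms_y:
            continue
        total += weights[cls] * cosine(terms_x, terms_y)
        weight_sum += weights[cls]
    return total / weight_sum if weight_sum else 0.0


# Two tweets that share a named entity but differ in their verb.
tweet_a = {"named_entity": ["obama"], "verb": ["wins"]}
tweet_b = {"named_entity": ["obama"], "verb": ["loses"]}
score = classwise_similarity(tweet_a, tweet_b)
```

In a clustering setting, a score like this would be compared against a threshold to decide whether a tweet joins an existing event cluster or starts a new one; the threshold value is likewise not given in the abstract.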
Pages: 1129 - 1139
Number of pages: 11
Related papers
50 in total
  • [21] From Twitter to detector: Real-time traffic incident detection using social media data
    Gu, Yiming
    Qian, Zhen
    Chen, Feng
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 67 : 321 - 342
  • [22] Identifying Relevant Event Content for Real-time Event Detection
    Wang, Xinyue
    Tokarchuk, Laurissa
    Poslad, Stefan
    2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), 2014, : 395 - 398
  • [23] Real-Time Power System Event Detection: A Novel Instance Selection Approach
    Intriago, Gabriel
    Zhang, Yu
    IEEE ACCESS, 2023, 11 : 46765 - 46781
  • [24] A survey on real-time event detection from the Twitter data stream
    Hasan, Mahmud
    Orgun, Mehmet A.
    Schwitter, Rolf
    JOURNAL OF INFORMATION SCIENCE, 2018, 44 (04) : 443 - 463
  • [25] The Power of Real-time Social Media Marketing
    South, Jeff
    JOURNALISM STUDIES, 2011, 12 (05) : 705 - 707
  • [26] Real-Time Detection, Tracking, and Monitoring of Automatically Discovered Events in Social Media
    Osborne, Miles
    Moran, Sean
    McCreadie, Richard
    Von Lunen, Alexander
    Sykora, Martin
    Cano, Elizabeth
    Ireson, Neil
    Macdonald, Craig
    Ounis, Iadh
    He, Yulan
    Jackson, Tom
    Ciravegna, Fabio
    O'Brien, Ann
    PROCEEDINGS OF 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, 2014, : 37 - 42
  • [27] TrafficWatch: Real-Time Traffic Incident Detection and Monitoring Using Social Media
    Nguyen, Hoang
    Liu, Wei
    Rivera, Paul
    Chen, Fang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 540 - 551
  • [28] Real-time disease detection and analysis system using social media contents
    Yoo, SoYeop
    Kim, DaeHo
    Yang, SungMin
    Jeong, OkRan
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2020, 16 (01) : 22 - 38
  • [29] Real-Time Logo Detection in Brand-Related Social Media Images
    Orti, Oscar
    Tous, Ruben
    Gomez, Mauro
    Poveda, Jonatan
    Cruz, Leonel
    Wust, Otto
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 : 125 - 136
  • [30] Detection of Zero Day Exploits Using Real-Time Social Media Streams
    Kergl, Dennis
    Roedler, Robert
    Rodosek, Gabi Dreo
    ADVANCES IN NATURE AND BIOLOGICALLY INSPIRED COMPUTING, 2016, 419 : 405 - 416