Analysis of Twitter Data Using a Multiple-level Clustering Strategy

被引:0
|
作者
Baralis, Elena [1 ]
Cerquitelli, Tania [1 ]
Chiusano, Silvia [1 ]
Grimaudo, Luigi [1 ]
Xiao, Xin [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
来源
关键词
clustering algorithms; association rules; social networks; tweets;
D O I
10.3280/MC2013-001003
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Twitter, currently the leading microblogging social network, has attracted a great body of research works. This paper proposes a data analysis framework to discover groups of similar twitter messages posted on a given event. By analyzing these groups, user emotions or thoughts that seem to be associated with specific events can be extracted, as well as aspects characterizing events according to user perception. To deal with the inherent sparseness of micro-messages, the proposed approach relies on a multiple-level strategy that allows clustering text data with a variable distribution. Clusters are then characterized through the most representative words appearing in their messages, and association rules are used to highlight correlations among these words. To measure the relevance of specific words for a given event, text data has been represented in the Vector Space Model using the TF-IDF weighting score. As a case study, two real Twitter datasets have been analysed.
引用
收藏
页码:13 / 24
页数:12
相关论文
共 50 条
  • [1] MULTIPLE-LEVEL REGRESSION ANALYSIS OF SURVEY AND ECOLOGICAL DATA
    HARDER, T
    PAPPI, FU
    SOCIAL SCIENCE INFORMATION SUR LES SCIENCES SOCIALES, 1969, 8 (05): : 43 - 67
  • [2] Clustering basal users based recommendation algorithm using multiple-level similarity
    Li, Tao
    Wang, Jiandong
    Nanjing Hangkong Hangtian Daxue Xuebao/Journal of Nanjing University of Aeronautics and Astronautics, 2006, 38 (06): : 717 - 721
  • [3] A multiple-level trajectory analysis of vortex filaments
    Schoeberl, MR
    Newman, PA
    JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 1995, 100 (D12) : 25801 - 25815
  • [4] Clustering and Sentiment Analysis on Twitter Data
    Ahuja, Shreya
    Dubey, Gaurav
    2017 2ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATION AND NETWORKS (TEL-NET), 2017, : 420 - 424
  • [5] Design of multiple-level hybrid classifier for intrusion detection system using Bayesian clustering and decision trees
    Xiang, Cheng
    Yong, Png Chin
    Meng, Lim Swee
    PATTERN RECOGNITION LETTERS, 2008, 29 (07) : 918 - 924
  • [6] HiCComp: Multiple-level Comparative Analysis of Hi-C Data by Triplet Network
    Zhang, Yan
    Zhang, Bo
    Zheng, W. Jim
    Tang, Jijun
    Yue, Feng
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 92 - 96
  • [7] Mining multiple-level fuzzy blocks from multidimensional data
    HELP University College, Kuala Lumpur, Malaysia
    不详
    不详
    Fuzzy Sets Syst, 1600, 12 (1535-1553):
  • [8] Mining multiple-level fuzzy blocks from multidimensional data
    Choong, Yeow Wei
    Laurent, Anne
    Laurent, Dominique
    FUZZY SETS AND SYSTEMS, 2008, 159 (12) : 1535 - 1553
  • [9] Multiple-level concatenated coding in ernbryonics: A dependability analysis
    Prodan, Lucian
    Udrescu, Mihai
    Vladutiu, Mircea
    GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 941 - 948
  • [10] Spondyloptosis and multiple-level spondylolysis
    Al-Sebai, MW
    Al-Khawashki, H
    EUROPEAN SPINE JOURNAL, 1999, 8 (01) : 75 - 77