A Methodological Framework for Statistical Analysis of Social Text Streams

Cited by: 0
Authors
Kleisarchaki, Sophia
Kotzinos, Dimitris
Tsamardinos, Ioannis
Christophides, Vassilis
Keywords
Twitter; clustering algorithm; centroid; shape; density
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Social media are one of the main contributors of user-generated content, providing vast amounts of data on a daily basis and covering a wide range of topics, interests and events. To identify and link meaningful and relevant information, clustering algorithms have been used to partition user-generated content. We have found, however, that these algorithms exhibit various shortcomings when dealing with social media text, which is dynamic and streaming in nature. We therefore explore the idea of estimating the algorithms' parameters from observations of how the clusters' properties (such as centroid, shape and density) evolve. By experimenting with these properties, we propose a methodological framework that detects the evolution of the clusters' centroid, shape and density and explores their role in parameter estimation.
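The abstract does not say how centroid, shape and density are measured, so the following is only an illustrative sketch under stated assumptions, not the authors' framework. It assumes documents are already embedded as numeric vectors, uses per-dimension variance as a crude stand-in for shape, and defines density as the number of points divided by the mean distance to the centroid. The names `ClusterProperties`, `summarize` and `drift` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence

Vector = Sequence[float]


@dataclass
class ClusterProperties:
    centroid: List[float]   # component-wise mean of the cluster's points
    variances: List[float]  # per-dimension variance (assumed proxy for shape)
    density: float          # points per unit of mean radius (assumed definition)


def summarize(points: Sequence[Vector]) -> ClusterProperties:
    """Compute the three properties for one cluster in one time window."""
    n = len(points)
    if n == 0:
        raise ValueError("cluster must contain at least one point")
    dims = len(points[0])
    centroid = [sum(p[d] for p in points) / n for d in range(dims)]
    variances = [
        sum((p[d] - centroid[d]) ** 2 for p in points) / n for d in range(dims)
    ]
    mean_radius = sum(math.dist(p, centroid) for p in points) / n
    # A cluster whose points all coincide has zero radius; treat it as infinitely dense.
    density = n / mean_radius if mean_radius > 0 else float("inf")
    return ClusterProperties(centroid, variances, density)


def drift(prev: ClusterProperties, curr: ClusterProperties) -> Dict[str, float]:
    """Compare the same cluster across two consecutive windows."""
    return {
        "centroid_shift": math.dist(prev.centroid, curr.centroid),
        "shape_change": math.dist(prev.variances, curr.variances),
        "density_ratio": curr.density / prev.density,
    }


# Two windows of the same cluster: the second is the first shifted by 1 along x.
window_1 = summarize([(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)])
window_2 = summarize([(1.0, 0.0), (3.0, 0.0), (1.0, 2.0), (3.0, 2.0)])
print(drift(window_1, window_2))
```

In this toy case only the centroid moves, while the shape and density measures stay the same. A parameter-estimation step could, for example, react to a large centroid shift or density ratio, but how such signals map to algorithm parameters is not specified in the abstract.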
Pages: 101-110
Number of pages: 10