Mining Social Media Data Using Topological Data Analysis

被引:8
|
作者
Almgren, Khaled [1 ]
Kim, Minkyu [2 ]
Lee, Jeongkyu [1 ]
机构
[1] Univ Bridgeport, Comp Sci & Engn Dept, Bridgeport, CT 06614 USA
[2] ASML, Wilton, CT 06897 USA
关键词
topological data analysis; social network analysis and mining; machine learning; clustering;
D O I
10.1109/IRI.2017.41
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topological data analysis is a noble method to analyze high-dimensional qualitative data using a set of properties from topology. In this paper, we explore the feasibility of topological data analysis for mining social media data by investigating the problem of image popularity. We randomly crawl images from Instagram, convert their captions to 300 dimensional numerical vectors using Word2vec, calculate cosine distances to evaluate the similarities of the caption vectors, and then apply the distances to a topological data analysis algorithm called mapper. With caption vectors, the results show that topological data analysis is able to cluster the images related to the images' popularity. Moreover, the results show relationships between the clusters that are represented as a monotonic increase of popularity. This approach is compared with traditional clustering algorithms, including k-means and hierarchical clustering, and the results show that topological data analysis outperforms the others.
引用
收藏
页码:144 / 153
页数:10
相关论文
共 50 条
  • [21] Use of Social Media for Data Mining in Pharmacovigilance
    Dasgupta, N.
    Pierce, C.
    DRUG SAFETY, 2015, 38 (10) : 947 - 948
  • [22] Utilization of social media in floods assessment using data mining techniques
    Khan, Qasim
    Kalbus, Edda
    Zaki, Nazar
    Mohamed, Mohamed Mostafa
    PLOS ONE, 2022, 17 (04):
  • [23] Social Media Fake Profile Detection Using Data Mining Technique
    Kadam, Nitika
    Sharma, Sanjeev Kumar
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2022, 13 (05) : 518 - 523
  • [24] A Text Mining Analysis on Big Data Extracted from Social Media
    Schoier, Gabriella
    Borruso, Giuseppe
    Tossut, Pietro
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2020, PART IV, 2020, 12252 : 351 - 364
  • [25] Opinion Mining on Social Media Data: Sentiment Analysis of User Preferences
    Pavaloaia, Vasile-Daniel
    Teodor, Elena-Madalina
    Fotache, Doina
    Danilet, Magdalena
    SUSTAINABILITY, 2019, 11 (16)
  • [26] Emerging trends in social media marketing: a retrospective review using data mining and bibliometric analysis
    Bashar, Abu
    Wasiq, Mohammad
    Nyagadza, Brighton
    Maziriri, Eugine Tafadzwa
    FUTURE BUSINESS JOURNAL, 2024, 10 (01)
  • [27] Emerging trends in social media marketing: a retrospective review using data mining and bibliometric analysis
    Abu Bashar
    Mohammad Wasiq
    Brighton Nyagadza
    Eugine Tafadzwa Maziriri
    Future Business Journal, 10
  • [28] Analyzing Social Media Data Using Sentiment Mining and Bigram Analysis for the Recommendation of YouTube Videos
    McGarry, Ken
    INFORMATION, 2023, 14 (07)
  • [29] A Heuristic Data Mining Framework Towards Dynamic Data of Social Media
    Kee, Estelle Xin Ying
    Hong, Jer Lang
    NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 403 - 409
  • [30] Big Data vs. Data Mining for Social Media Analytics
    Danubianu, M.
    Barila, A.
    SMART 2014 - SOCIAL MEDIA IN ACADEMIA: RESEARCH AND TEACHING, 2015, : 261 - 269