Mining Social Media Data Using Topological Data Analysis

Cited by: 8
Authors
Almgren, Khaled [1]
Kim, Minkyu [2]
Lee, Jeongkyu [1]
Affiliations
[1] Univ Bridgeport, Comp Sci & Engn Dept, Bridgeport, CT 06614 USA
[2] ASML, Wilton, CT 06897 USA
Keywords
topological data analysis; social network analysis and mining; machine learning; clustering
DOI
10.1109/IRI.2017.41
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Topological data analysis is a novel method for analyzing high-dimensional qualitative data using a set of properties from topology. In this paper, we explore the feasibility of topological data analysis for mining social media data by investigating the problem of image popularity. We randomly crawl images from Instagram, convert their captions to 300-dimensional numerical vectors using Word2vec, calculate cosine distances to measure the similarity of the caption vectors, and then feed the distances to a topological data analysis algorithm called Mapper. Using the caption vectors, the results show that topological data analysis is able to cluster the images according to their popularity. Moreover, the results reveal relationships between the clusters, which appear as a monotonic increase in popularity from one cluster to the next. This approach is compared with traditional clustering algorithms, including k-means and hierarchical clustering, and the results show that topological data analysis outperforms both.
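The pipeline described in the abstract (caption vectors, cosine distances, Mapper) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the synthetic vectors stand in for Word2vec caption embeddings, and the filter function (mean distance to all points), the number of intervals, the overlap fraction, the threshold `eps`, and the helper names `cosine_distance_matrix`, `threshold_clusters`, and `mapper_graph` are all assumptions made for this example.

```python
import numpy as np

def cosine_distance_matrix(X):
    """Pairwise cosine distance (1 - cosine similarity) between rows of X."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    unit = X / np.clip(norms, 1e-12, None)
    D = 1.0 - unit @ unit.T
    np.fill_diagonal(D, 0.0)
    return np.clip(D, 0.0, 2.0)

def threshold_clusters(D, idx, eps):
    """Connected components of the graph linking points of idx closer than eps."""
    unseen = set(idx)
    clusters = []
    while unseen:
        start = unseen.pop()
        comp, stack = {start}, [start]
        while stack:
            p = stack.pop()
            near = [q for q in unseen if D[p, q] < eps]
            for q in near:
                unseen.remove(q)
                comp.add(q)
                stack.append(q)
        clusters.append(frozenset(comp))
    return clusters

def mapper_graph(D, n_intervals=5, overlap=0.3, eps=0.5):
    """Simplified Mapper: filter, overlapping cover, local clustering, nerve."""
    # Assumed filter: mean distance from each point to all points.
    filt = D.mean(axis=1)
    lo, hi = filt.min(), filt.max()
    width = (hi - lo) / n_intervals or 1.0
    nodes = []
    for i in range(n_intervals):
        # Each interval is widened on both sides so neighbors overlap.
        a = lo + i * width - overlap * width
        b = lo + (i + 1) * width + overlap * width
        members = np.where((filt >= a) & (filt <= b))[0]
        nodes.extend(threshold_clusters(D, members, eps))
    # Two nodes are linked when their clusters share at least one point.
    edges = {(i, j) for i in range(len(nodes)) for j in range(i + 1, len(nodes))
             if nodes[i] & nodes[j]}
    return nodes, edges

# Synthetic stand-in for 300-dimensional caption vectors: three noisy groups.
rng = np.random.default_rng(0)
centers = rng.normal(size=(3, 300))
labels = rng.integers(0, 3, size=60)
captions = centers[labels] + 0.3 * rng.normal(size=(60, 300))

D = cosine_distance_matrix(captions)
nodes, edges = mapper_graph(D)
print(len(nodes), "nodes,", len(edges), "edges")
```

In a real study the nodes would then be colored by a popularity measure to look for trends along the graph; that step is omitted here because the abstract does not specify how popularity is measured.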
Pages: 144-153
Page count: 10