Demographics and Personality Discovery on Social Media: A Machine Learning Approach

被引:3
|
作者
Tuomchomtam, Sarach [1 ]
Soonthornphisaj, Nuanwan [1 ]
机构
[1] Kasetsart Univ, Dept Comp Sci, Artificial Intelligence & Knowledge Discovery Lab, Fac Sci, Bangkok 10900, Thailand
关键词
demographic attributes; personality prediction; social media; machine learning; BRIGGS TYPE INDICATOR; MODEL;
D O I
10.3390/info12090353
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This research proposes a new feature extraction algorithm using aggregated user engagements on social media in order to achieve demographics and personality discovery tasks. Our proposed framework can discover seven essential attributes, including gender identity, age group, residential area, education level, political affiliation, religious belief, and personality type. Multiple feature sets are developed, including comment text, community activity, and hybrid features. Various machine learning algorithms are explored, such as support vector machines, random forest, multi-layer perceptron, and naive Bayes. An empirical analysis is performed on various aspects, including correctness, robustness, training time, and the class imbalance problem. We obtained the highest prediction performance by using our proposed feature extraction algorithm. The result on personality type prediction was 87.18%. For the demographic attribute prediction task, our feature sets also outperformed the baseline at 98.1% for residential area, 94.7% for education level, 92.1% for gender identity, 91.5% for political affiliation, 60.6% for religious belief, and 52.0% for the age group. Moreover, this paper provides the guideline for the choice of classifiers with appropriate feature sets.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Personality Recognition on Social Media With Label Distribution Learning
    Xue, Di
    Hong, Zheng
    Guo, Shize
    Gao, Liang
    Wu, Lifa
    Zheng, Jinghua
    Zhao, Nan
    IEEE ACCESS, 2017, 5 : 13478 - 13488
  • [22] A Brand-New Look at You: Predicting Brand Personality in Social Media Networks with Machine Learning
    Pamuksuz, Utku
    Yun, Joseph T.
    Humphreys, Ashlee
    JOURNAL OF INTERACTIVE MARKETING, 2021, 56 : 55 - 69
  • [23] Psychometric and Validity Issues in Machine Learning Approaches to Personality Assessment: A Focus on Social Media Text Mining
    Tay, Louis
    Woo, Sang Eun
    Hickman, Louis
    Saef, Rachel M.
    EUROPEAN JOURNAL OF PERSONALITY, 2020, 34 (05) : 826 - 844
  • [24] Machine Learning Based Social Media Recommendation
    Lai, Taiping
    Zheng, Xianghan
    PROCEEDINGS 2015 SECOND IEEE INTERNATIONAL CONFERENCE ON SPATIAL DATA MINING AND GEOGRAPHICAL KNOWLEDGE SERVICES (ICSDM 2015), 2015, : 28 - 32
  • [25] Classifying Social Media Users with Machine Learning
    Li G.
    Zhou H.
    Mao J.
    Chen S.
    Data Analysis and Knowledge Discovery, 2019, 3 (08) : 1 - 9
  • [26] Machine Learning Approach for Personality Recognition in Spanish Texts
    Hernandez, Yasmin
    Martinez, Alicia
    Estrada, Hugo
    Ortiz, Javier
    Acevedo, Carlos
    APPLIED SCIENCES-BASEL, 2022, 12 (06):
  • [27] Machine learning approach for threat detection on social media posts containing Arabic text
    AlAjlan, Shatha AbdulAziz
    Saudagar, Abdul Khader Jilani
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 811 - 822
  • [28] User Stress Detection Using Social Media Text: A Novel Machine Learning Approach
    Wan, X. X.
    Tian, L.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2024, 19 (05)
  • [29] Social Media Markers to Identify Fathers at Risk of Postpartum Depression: A Machine Learning Approach
    Shatte, Adrian B. R.
    Hutchinson, Delyse M.
    Fuller-Tyszkiewicz, Matthew
    Teague, Samantha J.
    CYBERPSYCHOLOGY BEHAVIOR AND SOCIAL NETWORKING, 2020, 23 (09) : 611 - 618
  • [30] Examining the Gateway Hypothesis and Mapping Substance UsePathways on Social Media:Machine Learning Approach
    Yuan, Yunhao
    Kasson, Erin
    Taylor, Jordan
    Cavazos-Rehg, Patricia
    De Choudhury, Munmun
    Aledavood, Talayeh
    JMIR FORMATIVE RESEARCH, 2024, 8