Semi-supervised learning for big social data analysis

被引:157
|
作者
Hussain, Amir [1 ]
Cambria, Erik [2 ]
机构
[1] Univ Stirling, Sch Nat Sci, Stirling FK9 4LA, Scotland
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, 50 Nanyang Ave, Singapore 639798, Singapore
基金
英国工程与自然科学研究理事会;
关键词
Semi-supervised learning; Big social data analysis; Sentiment analysis;
D O I
10.1016/j.neucom.2017.10.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an era of social media and connectivity, web users are becoming increasingly enthusiastic about interacting, sharing, and working together through online collaborative media. More recently, this collective intelligence has spread to many different areas, with a growing impact on everyday life, such as in education, health, commerce and tourism, leading to an exponential growth in the size of the social Web. However, the distillation of knowledge from such unstructured Big data is, an extremely challenging task. Consequently, the semantic and multimodal contents of the Web in this present day are, whilst being well suited for human use, still barely accessible to machines. In this work, we explore the potential of a novel semi-supervised learning model based on the combined use of random projection scaling as part of a vector space model, and support vector machines to perform reasoning on a knowledge base. The latter is developed by merging a graph representation of commonsense with a linguistic resource for the lexical representation of affect. Comparative simulation results show a significant improvement in tasks such as emotion recognition and polarity detection, and pave the way for development of future semi-supervised learning approaches to big social data analytics. (c) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1662 / 1673
页数:12
相关论文
共 50 条
  • [31] A constrained semi-supervised learning approach to data association
    Kück, H
    Carbonetto, P
    de Freitas, N
    COMPUTER VISION - ECCV 2004, PT 3, 2004, 3023 : 1 - 12
  • [32] Semi-supervised Cooperative Learning for Multiomics Data Fusion
    Ding, Daisy Yi
    Shen, Xiaotao
    Snyder, Michael
    Tibshirani, Robert
    MACHINE LEARNING FOR MULTIMODAL HEALTHCARE DATA, ML4MHD 2023, 2024, 14315 : 54 - 63
  • [33] Semi-supervised Learning from General Unlabeled Data
    Huang, Kaizhu
    Xu, Zenglin
    King, Irwin
    Lyu, Michael R.
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 273 - +
  • [34] A collective learning approach for semi-supervised data classification
    Uylas Sati, Nur
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2018, 24 (05): : 864 - 869
  • [35] Data Privacy Examination against Semi-Supervised Learning
    Lou, Jiadong
    Yuan, Xu
    Pan, Miao
    Wang, Hao
    Tzeng, Nian-Feng
    PROCEEDINGS OF THE 2023 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ASIA CCS 2023, 2023, : 136 - 148
  • [36] Semi-supervised learning for classification of protein sequence data
    King, Brian R.
    Guda, Chittibabu
    SCIENTIFIC PROGRAMMING, 2008, 16 (01) : 5 - 29
  • [37] Uncertainty Aware Semi-Supervised Learning on Graph Data
    Zhao, Xujiang
    Chen, Feng
    Hu, Shu
    Cho, Jin-Hee
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [38] Maximum margin semi-supervised learning with irrelevant data
    Yang, Haiqin
    Huang, Kaizhu
    King, Irwin
    Lyu, Michael R.
    NEURAL NETWORKS, 2015, 70 : 90 - 102
  • [39] COMBINED UNSUPERVISED AND SEMI-SUPERVISED LEARNING FOR DATA CLASSIFICATION
    Breve, Fabricio Aparecido
    Guimaraes Pedronette, Daniel Carlos
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [40] AuxMix: Semi-Supervised Learning with Unconstrained Unlabeled Data
    Banitalebi-Dehkordi, Amin
    Gujjar, Pratik
    Zhang, Yong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3998 - 4005