Semi-supervised learning for big social data analysis

被引:157
|
作者
Hussain, Amir [1 ]
Cambria, Erik [2 ]
机构
[1] Univ Stirling, Sch Nat Sci, Stirling FK9 4LA, Scotland
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, 50 Nanyang Ave, Singapore 639798, Singapore
基金
英国工程与自然科学研究理事会;
关键词
Semi-supervised learning; Big social data analysis; Sentiment analysis;
D O I
10.1016/j.neucom.2017.10.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In an era of social media and connectivity, web users are becoming increasingly enthusiastic about interacting, sharing, and working together through online collaborative media. More recently, this collective intelligence has spread to many different areas, with a growing impact on everyday life, such as in education, health, commerce and tourism, leading to an exponential growth in the size of the social Web. However, the distillation of knowledge from such unstructured Big data is, an extremely challenging task. Consequently, the semantic and multimodal contents of the Web in this present day are, whilst being well suited for human use, still barely accessible to machines. In this work, we explore the potential of a novel semi-supervised learning model based on the combined use of random projection scaling as part of a vector space model, and support vector machines to perform reasoning on a knowledge base. The latter is developed by merging a graph representation of commonsense with a linguistic resource for the lexical representation of affect. Comparative simulation results show a significant improvement in tasks such as emotion recognition and polarity detection, and pave the way for development of future semi-supervised learning approaches to big social data analytics. (c) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1662 / 1673
页数:12
相关论文
共 50 条
  • [1] Big data analytics using semi-supervised learning methods
    Frumosu, Flavia D.
    Kulahci, Murat
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2018, 34 (07) : 1413 - 1423
  • [2] Fraud Detection in Big Data using Supervised and Semi-supervised Learning Techniques
    Melo-Acosta, German E.
    Duitama-Munoz, Freddy
    Arias-Londono, Julian D.
    2017 IEEE COLOMBIAN CONFERENCE ON COMMUNICATIONS AND COMPUTING (COLCOM), 2017,
  • [3] Semi-Supervised Learning-Based Image Denoising for Big Data
    Zhang, Kun
    Chen, Kai
    IEEE ACCESS, 2020, 8 : 172678 - 172691
  • [4] A unified distributed ELM framework with supervised, semi-supervised and unsupervised big data learning
    Zhiqiong Wang
    Luxuan Qu
    Junchang Xin
    Hongxu Yang
    Xiaosong Gao
    Memetic Computing, 2019, 11 : 305 - 315
  • [5] A unified distributed ELM framework with supervised, semi-supervised and unsupervised big data learning
    Wang, Zhiqiong
    Qu, Luxuan
    Xin, Junchang
    Yang, Hongxu
    Gao, Xiaosong
    MEMETIC COMPUTING, 2019, 11 (03) : 305 - 315
  • [6] Data driven semi-supervised learning
    Balcan, Maria-Florina
    Sharma, Dravyansh
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Classification Model on Big Data in Medical Diagnosis Based on Semi-Supervised Learning
    Wang, Lei
    Qian, Qing
    Zhang, Qiang
    Wang, Jishuai
    Cheng, Wenbo
    Yan, Wei
    COMPUTER JOURNAL, 2022, 65 (02): : 177 - 191
  • [8] Image Noise Removing Using Semi-supervised Learning on Big Image Data
    Chen, Bo-Hao
    Yin, Jia-Li
    Li, Ying
    2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 338 - 345
  • [9] Deep Semi-supervised Learning for Virtual Screening Based on Big Data Analytics
    Bahi, Meriem
    Batouche, Mohamed
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 173 - 184
  • [10] A Theoretical Analysis of Semi-supervised Learning
    Fujii, Takashi
    Ito, Hidetaka
    Miyoshi, Seiji
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 28 - 36