Real-Time Social Media Analytics with Deep Transformer Language Models: A Big Data Approach

Cited by: 6
Authors
Ahmet, Ahmed [1]
Abdullah, Tariq [1]
Affiliation
[1] Univ Derby, Dept Comp Sci, Derby, England
Keywords
Real-time analytics; social media; deep learning; machine learning; transfer learning; big data
DOI
10.1109/BigDataSE50710.2020.00014
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The use of transfer learning with deep language models is regarded as one of the most important developments in deep learning. Their application to real-time, high-velocity, high-volume user-generated data has been elusive because the unprecedented size and complexity of the models result in substantial computational overhead. Recent iterations of these architectures have produced significantly distilled models with state-of-the-art performance and reduced resource requirements. We apply deep transformer language models to user-generated data alongside a robust text normalization pipeline to address what is considered the Achilles heel of deep learning on user-generated text, namely data normalization. In this paper, we propose a framework for the ingestion, analysis and storage of real-time data streams. A case study in sentiment analysis and offensive/hateful language detection is used to evaluate the framework. We demonstrate inference on a large Twitter dataset using CPU and GPU clusters, highlighting the viability of the fine-tuned distilled language model for high-volume data. The fine-tuned model significantly outperforms the previous state of the art on several benchmark datasets, providing a powerful model that can be used for a variety of downstream tasks. To our knowledge, this is the only study demonstrating powerful transformer language models for real-time social media stream analytics in a distributed setting.
Pages: 41-48
Page count: 8