Deep Learning-Based Sentimental Analysis for Large-Scale Imbalanced Twitter Data

被引:8
|
作者
Jamal, Nasir [1 ]
Chen, Xianqiao [1 ]
Aldabbas, Hamza [2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 430070, Hubei, Peoples R China
[2] Al Balqa Appl Univ, Prince Abdullah bin Ghazi Fac Informat & Technol, Al Salt 19117, Jordan
关键词
data mining; deep learning; principal component analysis; emotions detection; sentimental analysis; text classification; CLASSIFICATION; FACES;
D O I
10.3390/fi11090190
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emotions detection in social media is very effective to measure the mood of people about a specific topic, news, or product. It has a wide range of applications, including identifying psychological conditions such as anxiety or depression in users. However, it is a challenging task to distinguish useful emotions' features from a large corpus of text because emotions are subjective, with limited fuzzy boundaries that may be expressed in different terminologies and perceptions. To tackle this issue, this paper presents a hybrid approach of deep learning based on TensorFlow with Keras for emotions detection on a large scale of imbalanced tweets' data. First, preprocessing steps are used to get useful features from raw tweets without noisy data. Second, the entropy weighting method is used to compute the importance of each feature. Third, class balancer is applied to balance each class. Fourth, Principal Component Analysis (PCA) is applied to transform high correlated features into normalized forms. Finally, the TensorFlow based deep learning with Keras algorithm is proposed to predict high-quality features for emotions classification. The proposed methodology is analyzed on a dataset of 1,600,000 tweets collected from the website 'kaggle'. Comparison is made of the proposed approach with other state of the art techniques on different training ratios. It is proved that the proposed approach outperformed among other techniques.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Deep learning-based extraction of building contours for large-scale 3D urban reconstruction
    Tripodi, S.
    Duan, L.
    Trastour, F.
    Poujad, V.
    Laurore, L.
    Tarabalka, Y.
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXV, 2019, 11155
  • [42] Deep learning-based cargo recognition and classification method for automated loading process in large-scale logistics
    Kim S.-M.
    Lee S.-D.
    Choi J.A.
    Lee K.-B.
    Transactions of the Korean Institute of Electrical Engineers, 2024, 73 (01): : 192 - 200
  • [43] THE INFLUENCE OF CHANGING FEATURES ON THE ACCURACY OF DEEP LEARNING-BASED LARGE-SCALE OUTDOOR LIDAR SEMANTIC SEGMENTATION
    Liu, Chang
    Zhang, Qi
    Shirowzhan, Sara
    Bai, Ting
    Sheng, Ziheng
    Wu, Yunhao
    Kuang, Jianming
    Ge, Linlin
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 4443 - 4446
  • [44] Deep learning-based pulmonary tuberculosis automated detection on chest radiography: large-scale independent testing
    Zhou, Wen
    Cheng, Guanxun
    Zhang, Ziqi
    Zhu, Litong
    Jaeger, Stefan
    Lure, Fleming Y. M.
    Guo, Lin
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2022, 12 (04) : 2344 - 2355
  • [45] Active Balancing Mechanism for Imbalanced Medical Data in Deep Learning-Based Classification Models
    Zhang, Hongyi
    Zhang, Haoke
    Pirbhulal, Sandeep
    Wu, Wanqing
    De Albuquerque, Victor Hugo C.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
  • [46] A novel evaluation metric for deep learning-based side channel analysis and its extended application to imbalanced data
    Zhang J.
    Zheng M.
    Nan J.
    Hu H.
    Yu N.
    1600, Ruhr-University of Bochum (2020): : 73 - 96
  • [47] Deep-Autoencoder-Based Radar Source Recognition: Addressing Large-Scale Imbalanced Data and Edge Computing Constraints
    Liu, Yuehua
    Li, Xiaoyu
    Fang, Jifei
    ELECTRONICS, 2024, 13 (15)
  • [48] Analysis and mining of an election-based network using large-scale twitter data: a retrospective study
    Amartya Chakraborty
    Nandini Mukherjee
    Social Network Analysis and Mining, 13
  • [49] Analysis and mining of an election-based network using large-scale twitter data: a retrospective study
    Chakraborty, Amartya
    Mukherjee, Nandini
    SOCIAL NETWORK ANALYSIS AND MINING, 2023, 13 (01)
  • [50] A Fast Distributed Classification Algorithm for Large-scale Imbalanced Data
    Wang, Huihui
    Gao, Yang
    Shi, Yinghuan
    Wang, Hao
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1251 - 1256