共 50 条
Deep Learning-Based Sentimental Analysis for Large-Scale Imbalanced Twitter Data
被引:8
|作者:
Jamal, Nasir
[1
]
Chen, Xianqiao
[1
]
Aldabbas, Hamza
[2
]
机构:
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 430070, Hubei, Peoples R China
[2] Al Balqa Appl Univ, Prince Abdullah bin Ghazi Fac Informat & Technol, Al Salt 19117, Jordan
关键词:
data mining;
deep learning;
principal component analysis;
emotions detection;
sentimental analysis;
text classification;
CLASSIFICATION;
FACES;
D O I:
10.3390/fi11090190
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Emotions detection in social media is very effective to measure the mood of people about a specific topic, news, or product. It has a wide range of applications, including identifying psychological conditions such as anxiety or depression in users. However, it is a challenging task to distinguish useful emotions' features from a large corpus of text because emotions are subjective, with limited fuzzy boundaries that may be expressed in different terminologies and perceptions. To tackle this issue, this paper presents a hybrid approach of deep learning based on TensorFlow with Keras for emotions detection on a large scale of imbalanced tweets' data. First, preprocessing steps are used to get useful features from raw tweets without noisy data. Second, the entropy weighting method is used to compute the importance of each feature. Third, class balancer is applied to balance each class. Fourth, Principal Component Analysis (PCA) is applied to transform high correlated features into normalized forms. Finally, the TensorFlow based deep learning with Keras algorithm is proposed to predict high-quality features for emotions classification. The proposed methodology is analyzed on a dataset of 1,600,000 tweets collected from the website 'kaggle'. Comparison is made of the proposed approach with other state of the art techniques on different training ratios. It is proved that the proposed approach outperformed among other techniques.
引用
收藏
页数:13
相关论文