Synthetic minority oversampling in addressing imbalanced sarcasm detection in social media

被引:0
|
作者
Arghasree Banerjee
Mayukh Bhattacharjee
Kushankur Ghosh
Sankhadeep Chatterjee
机构
[1] University of Engineering & Management,Department of Computer Science & Engineering
来源
Multimedia Tools and Applications | 2020年 / 79卷
关键词
Sarcasm detection; SMOTE; Social media; Imbalanced class; Social emotion; Affective computing;
D O I
暂无
中图分类号
学科分类号
摘要
Recent developments in sarcasm detection have been emerged as extremely successful tools in Social media opinion mining. With the advent of machine learning tools, accurate detection has been made possible. However, the social media data used to train the machine learning models is often ill suited due to the presence of highly imbalanced classes. In absence of any thorough study on the effect of imbalanced classes in sarcasm detection for social media opinion mining, the current article proposed synthetic minority oversampling based methods to mitigate the issue of imbalanced classes which can severely effect the classifier performance in social media sarcasm detection. In the current study, five different variants of synthetic minority oversampling technique have been used on two different datasets of varying sizes. The trustworthiness is judged by training and testing of six well known classifiers and measuring their performance in terms of test phase confusion matrix based performance measuring metrics. The experimental results indicated that SMOTE and BorderlineSMOTE – 1 are extremely successful in improving the classifier performance. A thorough analysis has been performed to better understand the effect of imbalanced classes in social media sarcasm detection.
引用
收藏
页码:35995 / 36031
页数:36
相关论文
共 50 条
  • [1] Synthetic minority oversampling in addressing imbalanced sarcasm detection in social media
    Banerjee, Arghasree
    Bhattacharjee, Mayukh
    Ghosh, Kushankur
    Chatterjee, Sankhadeep
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35995 - 36031
  • [2] Sarcasm Detection in Social Media Based on Imbalanced Classification
    Liu, Peng
    Chen, Wei
    Ou, Gaoyan
    Wang, Tengjiao
    Yang, Dongqing
    Lei, Kai
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 459 - 471
  • [3] Handling Class Imbalanced Data in Sarcasm Detection with Ensemble Oversampling Techniques
    Hu, Ya-Han
    Liu, Ting-Hsuan
    Tsai, Chih-Fong
    Lin, Yu-Jung
    APPLIED ARTIFICIAL INTELLIGENCE, 2025, 39 (01)
  • [4] An improved and random synthetic minority oversampling technique for imbalanced data
    Wei, Guoliang
    Mu, Weimeng
    Song, Yan
    Dou, Jun
    KNOWLEDGE-BASED SYSTEMS, 2022, 248
  • [5] Imbalanced Classification Based on Minority Clustering Synthetic Minority Oversampling Technique With Wind Turbine Fault Detection Application
    Yi, Huaikuan
    Jiang, Qingchao
    Yan, Xuefeng
    Wang, Bei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) : 5867 - 5875
  • [6] A No Parameter Synthetic Minority Oversampling Technique Based on Finch for Imbalanced Data
    Xu, Shoukun
    Li, Zhibang
    Yuan, Baohua
    Yang, Gaochao
    Wang, Xueyuan
    Li, Ning
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 367 - 378
  • [7] A Novel Synthetic Minority Oversampling Technique for Imbalanced Data Set Learning
    Barua, Sukarna
    Islam, Md. Monirul
    Murase, Kazuyuki
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 735 - +
  • [8] A Synthetic Minority Based on Probabilistic Distribution (SyMProD) Oversampling for Imbalanced Datasets
    Kunakorntum, Intouch
    Hinthong, Woranich
    Phunchongharn, Phond
    IEEE ACCESS, 2020, 8 : 114692 - 114704
  • [9] Performance of Synthetic Minority Oversampling Technique on Imbalanced Breast Cancer Data
    Rani, K. Usha
    Ramadevi, G. Naga
    Lavanya, D.
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 1623 - 1627
  • [10] A minority oversampling approach for fault detection with heterogeneous imbalanced data
    Liu, Jie
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184