Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection

Cited by: 48
Authors
Chia, Zheng Lin [1]
Ptaszynski, Michal [1]
Masui, Fumito [1]
Leliwa, Gniewosz [2]
Wroczynski, Michal [2]
Affiliations
[1] Kitami Inst Technol, Dept Comp Sci, Kitami, Hokkaido, Japan
[2] Samurailabs, Gdansk, Poland
Keywords
Irony detection; Sarcasm detection; Machine Learning
DOI
10.1016/j.ipm.2021.102600
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Irony and sarcasm detection is considered a complex task in Natural Language Processing. This paper explores sarcasm and irony on Twitter using Machine Learning and Feature Engineering techniques. First, we review and clarify the definitions of irony and sarcasm by discussing various studies that focus on the two terms. Next, the first experiment compares various classification methods, including some classifiers popular for text classification tasks. In the second experiment, different data preprocessing methods are compared and analyzed. Finally, the relationship between irony, sarcasm, and cyberbullying is discussed. Notably, we observed a high degree of similarity among them.
Pages: 12
Related papers
50 in total
  • [31] A study of machine learning-based models for detection, control, and mitigation of cyberbullying in online social media
    Kumar, Raju
    Bhat, Aruna
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2022, 21 (06) : 1409 - 1431
  • [33] A Deep Analysis of Textual Features Based Cyberbullying Detection Using Machine Learning
    Mahmud, Md Ishtyaq
    Mamun, Muntasir
    Abdelgawad, Ahmed
    2022 IEEE GLOBAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (GCAIOT), 2022, : 166 - 170
  • [34] Machine learning based feature engineering for thermoelectric materials by design
    Vaitesswar, U. S.
    Bash, Daniil
    Huang, Tan
    Recatala-Gomez, Jose
    Deng, Tianqi
    Yang, Shuo-Wang
    Wang, Xiaonan
    Hippalgaonkar, Kedar
    DIGITAL DISCOVERY, 2024, 3 (01): : 210 - 220
  • [35] Feature Extraction, Feature Selection and Machine Learning for Image Classification: A Case Study
    Popescu, Madalina Cosmina
    Sasu, Lucian Mircea
    2014 INTERNATIONAL CONFERENCE ON OPTIMIZATION OF ELECTRICAL AND ELECTRONIC EQUIPMENT (OPTIM), 2014, : 968 - 973
  • [36] A Study on Machine Learning-Based Feature Classification for the Early Diagnosis of Blade Rubbing
    Park, Dong-hee
    Choi, Byeong-keun
    SENSORS, 2024, 24 (18)
  • [37] INTRUSION DETECTION BASED ON MACHINE LEARNING AND FEATURE SELECTION
    Alaoui, Souad
    El Gonnouni, Amina
    Lyhyaoui, Abdelouahid
    MENDEL 2011 - 17TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, 2011, : 199 - 206
  • [38] Multi-feature fusion framework for sarcasm identification on twitter data: A machine learning based approach
    Eke, Christopher Ifeanyi
    Norman, Azah Anir
    Shuib, Liyana
    PLOS ONE, 2021, 16 (06):
  • [39] Evolutionary feature selection for machine learning based malware classification
    Kale, Gulsade
    Bostanci, Gazi Erkan
    Celebi, Fatih Vehbi
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2024, 56
  • [40] An intelligent machine learning-based sarcasm detection and classification model on social networks (vol 78, pg 10575, year 2022)
    Vinoth, D.
    Prabhavathy, P.
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (09): : 10506 - 10506