Multi-label Classification for Hate Speech and Abusive Language in Indonesian-Local Languages

Authors
Asti, Ajeng Dwi [1]
Budi, Indra [1]
Ibrohim, Muhammad Okky [1]
Affiliation
[1] Univ Indonesia, Fac Comp Sci, Depok, Indonesia
Keywords
hate speech; multi-label classification; Indonesian local language; Twitter
DOI
10.1109/ICACSIS53237.2021.9631316
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Every instance of hate speech has a target, a category, and a level, and detecting these helps the authorities prioritize which hate speech cases to resolve first. Various studies in Indonesia have addressed abusive language and hate speech together with their targets, categories, and levels, but only for Indonesian and English. Meanwhile, Indonesia's many local languages open up opportunities for hate speech to be expressed in those languages. This study compares some of the best machine learning algorithms, transformation methods, and feature extraction techniques for classifying abusive language and hate speech, along with their targets, categories, and levels, using Twitter data in Indonesian and local languages. It covers the five local languages in Indonesia with the most speakers: Javanese, Sundanese, Madurese, Minangkabau, and Musi (Palembang). The algorithms are Support Vector Machine (SVM), Multinomial Naive Bayes (MNB), and Random Forest Decision Tree (RFDT), each combined with Binary Relevance (BR), Classifier Chains (CC), or Label Powerset (LP) as the transformation method. Term weighting is TF-IDF over word n-gram and character n-gram features. SVM with the CC transformation method and unigram features gave the highest F1-scores for Javanese (66.33%) and Sundanese (65.68%). For the Madurese, Minangkabau, and Musi data, the best F1-scores were obtained by RFDT with the CC transformation method and unigram features: 76.37%, 80.75%, and 77.34%, respectively.
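The three transformation methods named in the abstract differ in how they turn one multi-label problem into single-label problems. The sketch below illustrates that reshaping with the standard library only. It is not the authors' implementation: the label names, the toy annotations in Y, the one-dimensional features in X, and all function names are hypothetical and chosen purely for illustration.

```python
# Hypothetical label names and toy annotations (not taken from the paper's dataset).
LABELS = ["hate_speech", "abusive", "target_individual"]
Y = [
    [1, 1, 1],
    [0, 1, 0],
    [1, 1, 1],
    [0, 0, 0],
]
# Hypothetical one-dimensional features standing in for TF-IDF vectors.
X = [[0.2], [0.5], [0.1], [0.9]]


def binary_relevance(Y, labels):
    """Binary Relevance: one independent binary target vector per label."""
    return {name: [row[j] for row in Y] for j, name in enumerate(labels)}


def label_powerset(Y):
    """Label Powerset: each distinct label combination becomes one class id."""
    class_ids = {}
    y_single = []
    for row in Y:
        key = tuple(row)
        if key not in class_ids:
            class_ids[key] = len(class_ids)
        y_single.append(class_ids[key])
    return y_single, class_ids


def classifier_chain_problems(X, Y, labels):
    """Classifier Chains (training-time view): the j-th binary problem sees
    the original features plus the true values of the previous j labels."""
    problems = []
    for j, name in enumerate(labels):
        X_aug = [list(x) + row[:j] for x, row in zip(X, Y)]
        y = [row[j] for row in Y]
        problems.append((name, X_aug, y))
    return problems


br = binary_relevance(Y, LABELS)
lp_y, lp_classes = label_powerset(Y)
chain = classifier_chain_problems(X, Y, LABELS)
```

Binary Relevance ignores label correlations, Label Powerset captures them at the cost of one class per observed combination, and Classifier Chains passes earlier labels forward as extra features. At prediction time a chain would feed in predicted labels rather than true ones, a step this sketch leaves out.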
Pages: 325-330
Page count: 6