Hate Special Detection In Indonesian Language Instagram

被引:0
|
作者
Putra, I. Gede Manggala [1 ]
Nurjanah, Dade [1 ]
机构
[1] Telkom Univ, Informat Engn, Bandung, Indonesia
关键词
hate speech comments; instagram; word2vec; TextCNN; imbalance dataset;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hate speech is a form of communication which contains hatred by doing things, such as inciting, insulting, disparaging, or demeaning a person or group. Hate speech issues in Indonesia often have linkages to politics. In 2018 and 2019, for example, the hate speech relates to the local leader and presidential elections. The hate speech actors commonly use social networks, such as Instagram, to spread their hatred words. About 60% of hate speech is found in the comments of the posts and it will be a real threat if not quickly detected. Our study aims to detect hate speech in Instagram comments. We propose the use of a word2vec method with skip-gram models and a modified TextCNN to learn and detect hate speech texts. Furthermore, random oversampling, random under sampling, and class weight was used to solve imbalanced dataset problems. The results show that the best accuracy, in term of F-score, is 93.70%, gained from a combination of word2vec skip-gram with window size 15, a modified TextCNN, and random oversampling methods.
引用
收藏
页码:413 / 419
页数:7
相关论文
共 50 条
  • [21] Multi-label Classification for Hate Speech and Abusive Language in Indonesian-Local Languages
    Asti, Ajeng Dwi
    Budi, Indra
    Ibrohim, Muhammad Okky
    13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2021), 2021, : 325 - 330
  • [22] Hate Speech Detection on Indonesian Long Text Documents Using Machine Learning Approach
    Aulia, Nofa
    Budi, Indra
    ICCAI '19 - PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, : 164 - 169
  • [23] Fertility education on Instagram: advertisements vs. educational content analysis for posts in Bahasa (Indonesian language)
    Harzif, Achmad K.
    Kusuma, Berli
    Ummah, Nafi'atul
    Puspawardani, Aisyah R.
    Nurbaeti, Putri
    Wiweko, Budi
    ANNALS OF MEDICINE AND SURGERY, 2024, 86 (05): : 2639 - 2643
  • [24] Racial Bias in Hate Speech and Abusive Language Detection Datasets
    Davidson, Thomas
    Bhattacharya, Debasmita
    Weber, Ingmar
    THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE, 2019, : 25 - 35
  • [25] Hate Speech Detection in Indonesian Twitter Texts using Bidirectional Gated Recurrent Unit
    Marpaung, Angela
    Rismala, Rita
    Nurrahmi, Hani
    2021 13TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST-2021), 2021, : 186 - 190
  • [26] Offensive Language and Hate Speech Detection Based on Transfer Learning
    Touahri, Ibtissam
    Mazroui, Azzeddine
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2020), VOL 2, 2022, 1418 : 300 - 311
  • [27] Hate-Speech and Offensive Language Detection in Roman Urdu
    Rizwan, Hammad
    Shakeel, Muhammad Haroon
    Karim, Asim
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2512 - 2522
  • [28] Towards Automatic Detection and Explanation of Hate Speech and Offensive Language
    Dorris, Wyatt
    Hu, Ruijia
    Vishwamitra, Nishant
    Luo, Feng
    Costello, Matthew
    PROCEEDINGS OF THE SIXTH INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS (IWSPA'20), 2020, : 23 - 29
  • [29] Ceasing hate with MoH: Hate Speech Detection in Hindi-English code-switched language
    Sharma, Arushi
    Kabra, Anubha
    Jain, Minni
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
  • [30] Hate speech and the harm in Indonesian judicial decisions
    Putri, Devita Kartika
    COGENT SOCIAL SCIENCES, 2023, 9 (02):