Hate Speech Detection In Indonesian Language Instagram

Cited by: 0

Authors:

Putra, I. Gede Manggala [1]

Nurjanah, Dade [1]

Affiliations:

[1] Telkom Univ, Informat Engn, Bandung, Indonesia

Source:

ICACSIS 2020: 2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS) | 2020

Keywords:

hate speech comments; instagram; word2vec; TextCNN; imbalance dataset;

DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Hate speech is a form of communication that expresses hatred through acts such as inciting, insulting, disparaging, or demeaning a person or group. Hate speech in Indonesia is often linked to politics. In 2018 and 2019, for example, hate speech was related to the local leader and presidential elections. Perpetrators commonly use social networks, such as Instagram, to spread hateful words. About 60% of hate speech is found in the comments on posts, and it becomes a real threat if it is not detected quickly. Our study aims to detect hate speech in Instagram comments. We propose the use of word2vec with the skip-gram model and a modified TextCNN to learn and detect hate speech texts. Furthermore, random oversampling, random undersampling, and class weighting were used to address the imbalanced dataset problem. The results show that the best performance, in terms of F-score, is 93.70%, obtained from a combination of word2vec skip-gram with window size 15, a modified TextCNN, and random oversampling.
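The abstract names random oversampling and class weighting as remedies for class imbalance but gives no implementation details. The following is a minimal Python sketch of the general idea, not the authors' code. The function names `random_oversample` and `balanced_class_weights`, the toy comments, and the labels are all hypothetical, and the weighting formula is the common "balanced" heuristic (total samples divided by the number of classes times the class count), which is an assumption rather than something the abstract specifies.

```python
import random
from collections import Counter


def random_oversample(texts, labels, seed=0):
    """Duplicate randomly chosen minority-class samples until every
    class has as many samples as the largest class."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_texts, out_labels = list(texts), list(labels)
    for label, count in counts.items():
        # All original samples that belong to this class.
        pool = [t for t, y in zip(texts, labels) if y == label]
        # Draw with replacement to fill the gap to the majority size.
        extra = [rng.choice(pool) for _ in range(target - count)]
        out_texts.extend(extra)
        out_labels.extend([label] * len(extra))
    return out_texts, out_labels


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count),
    so rarer classes contribute more to the training loss."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {label: n_samples / (n_classes * count)
            for label, count in counts.items()}


# Hypothetical toy data: three non-hate comments (0), one hate comment (1).
toy_texts = ["comment_a", "comment_b", "comment_c", "comment_d"]
toy_labels = [0, 0, 0, 1]

balanced_texts, balanced_labels = random_oversample(toy_texts, toy_labels)
weights = balanced_class_weights(toy_labels)
```

In practice, oversampling would normally be applied to the training split only, so that duplicated comments cannot leak into the evaluation data. Class weights are an alternative that leaves the data untouched and rescales the loss instead.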


Pages: 413-419

Page count: 7

