Hate Special Detection In Indonesian Language Instagram

被引:0
|
作者
Putra, I. Gede Manggala [1 ]
Nurjanah, Dade [1 ]
机构
[1] Telkom Univ, Informat Engn, Bandung, Indonesia
关键词
hate speech comments; instagram; word2vec; TextCNN; imbalance dataset;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hate speech is a form of communication which contains hatred by doing things, such as inciting, insulting, disparaging, or demeaning a person or group. Hate speech issues in Indonesia often have linkages to politics. In 2018 and 2019, for example, the hate speech relates to the local leader and presidential elections. The hate speech actors commonly use social networks, such as Instagram, to spread their hatred words. About 60% of hate speech is found in the comments of the posts and it will be a real threat if not quickly detected. Our study aims to detect hate speech in Instagram comments. We propose the use of a word2vec method with skip-gram models and a modified TextCNN to learn and detect hate speech texts. Furthermore, random oversampling, random under sampling, and class weight was used to solve imbalanced dataset problems. The results show that the best accuracy, in term of F-score, is 93.70%, gained from a combination of word2vec skip-gram with window size 15, a modified TextCNN, and random oversampling methods.
引用
收藏
页码:413 / 419
页数:7
相关论文
共 50 条
  • [31] The (moral) language of hate
    Kennedy, Brendan
    Golazizian, Preni
    Trager, Jackson
    Atari, Mohammad
    Hoover, Joe
    Davani, Aida Mostafazadeh
    Dehghani, Morteza
    PNAS NEXUS, 2023, 2 (07):
  • [32] Hate speech on Instagram during 2019 General Election in Spain
    Losada-Diaz, Jose-Carlos
    Zamora-Medina, Rocio
    Martinez-Martinez, Helena
    REVISTA MEDITERRANEA COMUNICACION-JOURNAL OF COMMUNICATION, 2021, 12 (02): : 195 - 208
  • [33] Hate and Aggression Detection in Social Media Over Hindi English Language
    Pareek, Kapil
    Choudhary, Arjun
    Tripathi, Ashish
    Mishra, K. K.
    Mittal, Namita
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2022, 14 (01):
  • [34] UHated: hate speech detection in Urdu language using transfer learning
    Muhammad Umair Arshad
    Raza Ali
    Mirza Omer Beg
    Waseem Shahzad
    Language Resources and Evaluation, 2023, 57 : 713 - 732
  • [35] Application-specific word embeddings for hate and offensive language detection
    Claver P. Soto
    Gustavo M. S. Nunes
    José Gabriel R. C. Gomes
    Nadia Nedjah
    Multimedia Tools and Applications, 2022, 81 : 27111 - 27136
  • [36] On the Impact ofWord Representation in Hate Speech and Offensive Language Detection and Explanation
    Hu, Ruijia
    Dorris, Wyatt
    Vishwamitra, Nishant
    Luo, Feng
    Costello, Matthew
    PROCEEDINGS OF THE TENTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, CODASPY 2020, 2020, : 171 - 173
  • [37] Hijrah and the Articulation of Islamic Identity of Indonesian Millenials on Instagram
    Rahman, Taufiqur
    Nurnisya, Frizki Yulianti
    Nurjanah, Adhianty
    Hifziati, Lailia
    JURNAL KOMUNIKASI-MALAYSIAN JOURNAL OF COMMUNICATION, 2021, 37 (02) : 154 - 170
  • [38] IdSarcasm: Benchmarking and Evaluating Language Models for Indonesian Sarcasm Detection
    Suhartono, Derwin
    Wongso, Wilson
    Tri Handoyo, Alif
    IEEE ACCESS, 2024, 12 : 87323 - 87332
  • [39] Hate speech detection in the Arabic language: corpus design, construction, and evaluation
    Ahmad, Ashraf
    Azzeh, Mohammad
    Alnagi, Eman
    Abu Al-Haija, Qasem
    Halabi, Dana
    Aref, Abdullah
    AbuHour, Yousef
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [40] UHated: hate speech detection in Urdu language using transfer learning
    Arshad, Muhammad Umair
    Ali, Raza
    Beg, Mirza Omer
    Shahzad, Waseem
    LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (02) : 713 - 732