Cyberbullying Detection using BERT for Telugu Language

被引:0
|
作者
Talasila, Sri Lakshmi [1 ]
Kothuri, Dharani Priya [1 ]
Manchiraju, Savithri Jahnavi [1 ]
Mallavalli, Mutyala Sai Sasank [1 ]
Dande, Lourdu Gnana Harshith [1 ]
机构
[1] Prasad V Potluri Siddhartha Inst Technol, Comp Sci & Engn, Vijayawada, India
关键词
Cyberbullying; Telugu; Bidirectional Encoder Representations from Transformers (BERT); Bullying Preprocessing; Harassment; Language; Social Media;
D O I
10.1109/ICPCSN62568.2024.00077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid proliferation of online communication has introduced cyberbullying as a significant concern affecting individuals' well-being. Existing research employs various techniques like Tf-Idf, XLM-RoBERTa, and machine learning algorithms such as Logistic Regression, Random Forest, and Naive Bayes to detect cyberbullying across mixed and bilingual languages. However, these approaches often struggle with accuracy and fail to effectively discern cyberbullying instances due to language nuances and context misinterpretation. Key challenges faced by previous systems include limited linguistic coverage, contextual understanding, and nuanced interpretation of cyberbullying. The new advancement to address these challenges is the implementation of BERT (Bidirectional Encoder Representations from Transformers) architecture by leveraging bidirectional context understanding, allowing it to capture subtle linguistic nuances and contextual cues, thereby improving accuracy and contextual understanding. The proposed model is advancing further by integrating specialized models like IndicBERT, specifically tailored for languages like Telugu. By focusing on contextual nuances, our model aims to improve precision and accuracy of cyberbullying detection for a local language, Telugu content. This study has developed a local language, Telugu dataset comprising 27,000 sentences and achieve an accuracy rate of 90%, highlighting the efficacy of our approach in overcoming these challenges and contributing to online safety.
引用
收藏
页码:454 / 461
页数:8
相关论文
共 50 条
  • [1] Telugu named entity recognition using bert
    Gorla, SaiKiranmai
    Tangeda, Sai Sharan
    Neti, Lalita Bhanu Murthy
    Malapati, Aruna
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (02) : 127 - 140
  • [2] Telugu named entity recognition using bert
    SaiKiranmai Gorla
    Sai Sharan Tangeda
    Lalita Bhanu Murthy Neti
    Aruna Malapati
    International Journal of Data Science and Analytics, 2022, 14 : 127 - 140
  • [3] Cyberbullying Detection Using Bidirectional Encoder Representations from Transformers (BERT)
    Sujud, Razan
    Fahs, Walid
    Khatoun, Rida
    Chbib, Fadlallah
    2024 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, MEDITCOM 2024, 2024, : 257 - 262
  • [4] CyberBERT: BERT for cyberbullying identification BERT for cyberbullying identification
    Paul, Sayanta
    Saha, Sriparna
    MULTIMEDIA SYSTEMS, 2022, 28 (06) : 1897 - 1904
  • [5] Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
    Muneer, Amgad
    Alwadain, Ayed
    Ragab, Mohammed Gamal
    Alqushaibi, Alawi
    INFORMATION, 2023, 14 (08)
  • [6] Fake News Detection in Telugu Language using Transformers Models
    Hariharan, R. L.
    Jinkathoti, Mahendranath
    Kumar, P. Sai Prasanna
    Kumar, M. Anand
    2024 5TH INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY, ICITIIT 2024, 2024,
  • [7] Cyberbullying Detection for Urdu Language Using Machine Learning
    Mustafa, Hamza
    Zafar, Kashif
    FORTHCOMING NETWORKS AND SUSTAINABILITY IN THE AIOT ERA, VOL 1, FONES-AIOT 2024, 2024, 1035 : 244 - 257
  • [8] CyberBERT: BERT for cyberbullying identificationBERT for cyberbullying identification
    Sayanta Paul
    Sriparna Saha
    Multimedia Systems, 2022, 28 : 1897 - 1904
  • [9] Cyberbullying detection system focusing on the isiXhosa language
    Matomela, Vuyokazi
    Henney, Andre J.
    2022 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2022, : 93 - 98
  • [10] Does BERT Pay Attention to Cyberbullying?
    Elsafoury, Fatma
    Katsigiannis, Stamos
    Wilson, Steven R.
    Ramzan, Naeem
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1900 - 1904