Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts

被引:35
|
作者
Lu, Nijia [1 ]
Wu, Guohua [1 ]
Zhang, Zhen [1 ,4 ]
Zheng, Yitao [1 ]
Ren, Yizhi [1 ]
Choo, Kim-Kwang Raymond [2 ,3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou, Zhejiang, Peoples R China
[2] Univ Texas San Antonio, Dept Informat Syst & Cyber Secur, San Antonio, TX USA
[3] Univ Texas San Antonio, Dept Elect & Comp Engn, San Antonio, TX USA
[4] 1158,2 St,Baiyang St, Hangzhou 310018, Zhejiang, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
convolutional neural networks; cyberbullying detection; social network; text classification;
D O I
10.1002/cpe.5627
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As people spend increasingly more time on social networks, cyberbullying has become a social problem that needs to be solved by machine learning methods. Our research focuses on textual cyberbullying detection because text is the most common form of social media. However, the content information in social media is short, noisy, and unstructured with incorrect spellings and symbols, and this impacts the performance of some traditional machine learning methods based on vocabulary knowledge. For this reason, we propose a Char-CNNS (Character-level Convolutional Neural Network with Shortcuts) model to identify whether the text in social media contains cyberbullying. We use characters as the smallest unit of learning, enabling the model to overcome spelling errors and intentional obfuscation in real-world corpora. Shortcuts are utilized to stitch different levels of features to learn more granular bullying signals, and a focal loss function is adopted to overcome the class imbalance problem. We also provide a new Chinese Weibo comment dataset specifically for cyberbullying detection, and experiments are performed on both the Chinese Weibo dataset and the English Tweet dataset. The experimental results show that our approach is competitive with state-of-the-art techniques on cyberbullying detection task.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] A Metaverse text recognition model based on character-level contrastive learning
    Sun, Le
    Li, Huiyun
    Muhammad, Ghulam
    APPLIED SOFT COMPUTING, 2023, 149
  • [42] Automatically Classifying Chinese Judgment Documents Using Character-Level Convolutional Neural Networks
    Zhou, Xiaosong
    Li, Chuanyi
    Ge, Jidong
    Li, Zhongjin
    Zhou, Xiaoyu
    Luo, Bin
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 430 - 437
  • [43] MOJI: Character-level convolutional neural networks for Malicious Obfuscated Java']JavaScript Inspection
    Ishida, Minato
    Kaneko, Naoshi
    Sumi, Kazuhiko
    APPLIED SOFT COMPUTING, 2023, 137
  • [44] Character-Level Street View Text Spotting Based on Deep Multisegmentation Network for Smarter Autonomous Driving
    Zhang C.
    Tao Y.
    Du K.
    Ding W.
    Wang B.
    Liu J.
    Wang W.
    IEEE Transactions on Artificial Intelligence, 2022, 3 (02): : 297 - 308
  • [45] Character Segmentation in Text Line via Convolutional Neural Network
    Li, Xiaohe
    Zhang, Xingming
    Yang, Bin
    Xia, Siyu
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1175 - 1180
  • [46] A Novel Steganography Method for Character-Level Text Image Based on Adversarial Attacks
    Ding, Kangyi
    Hu, Teng
    Niu, Weina
    Liu, Xiaolei
    He, Junpeng
    Yin, Mingyong
    Zhang, Xiaosong
    SENSORS, 2022, 22 (17)
  • [47] RETRACTED: Automatic analysis of public health service text based on character level convolutional neural network (Retracted Article)
    Feng, Rui
    Weng, Lie'en
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 7185 - 7197
  • [48] Social Media Text Generation Based on Neural Network Model
    Cao, Jiarun
    Wang, Chongwen
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 58 - 61
  • [49] Video Text Detection with Text Edges and Convolutional Neural Network
    Hu, Ping
    Wang, Weiqiang
    Lu, Ke
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 675 - 679
  • [50] A Novel Scene Text Detection Algorithm Based On Convolutional Neural Network
    Ren, Xiaohang
    Chen, Kai
    Yang, Xiaokang
    Zhou, Yi
    He, Jianhua
    Sun, Jun
    2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,