Cyberbullying detection in social media text based on character-level convolutional neural network with shortcuts

被引:35
|
作者
Lu, Nijia [1 ]
Wu, Guohua [1 ]
Zhang, Zhen [1 ,4 ]
Zheng, Yitao [1 ]
Ren, Yizhi [1 ]
Choo, Kim-Kwang Raymond [2 ,3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou, Zhejiang, Peoples R China
[2] Univ Texas San Antonio, Dept Informat Syst & Cyber Secur, San Antonio, TX USA
[3] Univ Texas San Antonio, Dept Elect & Comp Engn, San Antonio, TX USA
[4] 1158,2 St,Baiyang St, Hangzhou 310018, Zhejiang, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
convolutional neural networks; cyberbullying detection; social network; text classification;
D O I
10.1002/cpe.5627
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As people spend increasingly more time on social networks, cyberbullying has become a social problem that needs to be solved by machine learning methods. Our research focuses on textual cyberbullying detection because text is the most common form of social media. However, the content information in social media is short, noisy, and unstructured with incorrect spellings and symbols, and this impacts the performance of some traditional machine learning methods based on vocabulary knowledge. For this reason, we propose a Char-CNNS (Character-level Convolutional Neural Network with Shortcuts) model to identify whether the text in social media contains cyberbullying. We use characters as the smallest unit of learning, enabling the model to overcome spelling errors and intentional obfuscation in real-world corpora. Shortcuts are utilized to stitch different levels of features to learn more granular bullying signals, and a focal loss function is adopted to overcome the class imbalance problem. We also provide a new Chinese Weibo comment dataset specifically for cyberbullying detection, and experiments are performed on both the Chinese Weibo dataset and the English Tweet dataset. The experimental results show that our approach is competitive with state-of-the-art techniques on cyberbullying detection task.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Enhanced character-level deep convolutional neural networks for cardiovascular disease prediction
    Zhichang Zhang
    Yanlong Qiu
    Xiaoli Yang
    Minyu Zhang
    BMC Medical Informatics and Decision Making, 20
  • [32] Named-Entity Recognition in Sports Field Based on a Character-Level Graph Convolutional Network
    Seti, Xieraili
    Wumaier, Aishan
    Yibulayin, Turgen
    Paerhati, Diliyaer
    Wang, Lulu
    Saimaiti, Alimu
    INFORMATION, 2020, 11 (01)
  • [33] Disambiguation of biomedical acronyms based on a bidirectional recurrent neural network of character-level features
    Kai R.
    Na L.
    Wei X.
    Shi-Wen W.
    Journal of Engineering Science and Technology Review, 2019, 12 (06) : 105 - 112
  • [34] Calligraphy Character Detection Based on Deep Convolutional Neural Network
    Peng, Xianlin
    Kang, Jian
    Wu, Yinjie
    Feng, Xiaoyi
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [35] An Effective Phishing Detection Model Based on Character Level Convolutional Neural Network from URL
    Aljofey, Ali
    Jiang, Qingshan
    Qu, Qiang
    Huang, Mingqing
    Niyigena, Jean-Pierre
    ELECTRONICS, 2020, 9 (09) : 1 - 24
  • [36] Sentiment Analysis For Short Chinese Text Based On Character-level Methods
    An, Yanxin
    Tang, Xinhuai
    Xie, Bin
    2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST), 2017, : 78 - 82
  • [37] Deep Character-Level Anomaly Detection Based on a Convolutional Autoencoder for Zero-Day Phishing URL Detection
    Bu, Seok-Jun
    Cho, Sung-Bae
    ELECTRONICS, 2021, 10 (12)
  • [38] Character-Level Transformer-Based Neural Machine Translation
    Banar, Nikolay
    Daelemans, Walter
    Kestemont, Mike
    2020 4TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2020, 2020, : 149 - 156
  • [39] A Convolutional Neural Network for Traffic Information Sensing from Social Media Text
    Chen, Yuanyuan
    Lv, Yisheng
    Wang, Xiao
    Wang, Fei-Yue
    2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [40] Modified Convolutional Neural Network Filter Gate for Social Media Text Classification
    Suhaimi, Nur Suhailayani
    Othman, Zalinda
    Yaakub, Mohd Ridzwan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 617 - 627