Audio-Based Hate Speech Classification from Online Short-Form Videos

被引:4
|
作者
Ibanez, Michael [1 ]
Sapinit, Ranz [1 ]
Reyes, Lloyd Antonie [1 ]
Hussien, Mohammed [1 ]
Imperial, Joseph Marvin [1 ]
Rodriguez, Ramon [1 ]
机构
[1] Natl Univ, Manila, Philippines
关键词
hate speech; tiktok; audio classification; machine learning; speech processing;
D O I
10.1109/IALP54817.2021.9675250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we pioneer the development of an audio-based hate speech classifier from online, short-form TikTok videos using traditional machine learning algorithms such as Logistic Regression, Random Forest, and Support Vector Machines. We scraped over 4746 videos using the TikTok API tool and extracted audio-based features such as MFCCs, Spectral Centroid, Rolloff, Bandwidth, Zero-Crossing Rate, and Chroma values as primary feature sets. Results show that using the extracted predictors for hate speech detection can obtain up to 78.5% accuracy on an optimized Random Forest model, crossing the 50% benchmark for models in this task. In addition, comparing the Information Gain scores and globally learned model weights identified that Spectral Rolloff and MFCCs are top predictors in discriminating hate speech for the Filipino language.
引用
收藏
页码:72 / 77
页数:6
相关论文
共 50 条
  • [41] Driving Factors and Moderating Effects Behind Citizen Engagement With Mobile Short-Form Videos
    Zhang, Cevin
    Zheng, Hemingxi
    Wang, Qing
    IEEE ACCESS, 2022, 10 : 40999 - 41009
  • [42] VisTellAR: Embedding Data Visualization to Short-Form Videos Using Mobile Augmented Reality
    Tong, Wai
    Shigyo, Kento
    Yuan, Lin-Ping
    Fan, Mingming
    Pong, Ting-Chuen
    Qu, Huamin
    Xia, Meng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (03) : 1862 - 1874
  • [43] The Relationship Between Parental Phubbing and Short-Form Videos Addiction Among Chinese Adolescents
    Wang, Hongxia
    Lei, Li
    JOURNAL OF RESEARCH ON ADOLESCENCE, 2022, 32 (04) : 1580 - 1591
  • [44] Content Quality of Web-Based Short-Form Videos for Fire and Burn Prevention in China: Content Analysis
    Qin, Lang
    Zheng, Ming
    Schwebel, David C.
    Li, Li
    Cheng, Peixia
    Rao, Zhenzhen
    Peng, Ruisha
    Ning, Peishan
    Hu, Guoqing
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [45] A Large-Scale UAV Audio Dataset and Audio-Based UAV Classification Using CNN
    Wang, Yaqin
    Chu, Zhiwei
    Ku, Ilmun
    Smith, E. Cho
    Matson, Eric T.
    2022 SIXTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC, 2022, : 186 - 189
  • [46] From hate speech to HateLess. The effectiveness of a prevention program on adolescents' online hate speech involvement
    Wachs, Sebastian
    Wright, Michelle F.
    Gamez-Guadix, Manuel
    COMPUTERS IN HUMAN BEHAVIOR, 2024, 157
  • [47] Short-form knowledge-based economy scorecards
    Chen, Chih-Kai
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2010, 31 (04): : 789 - 807
  • [48] Exploiting Speech Recognition Transcripts for Narrative Peak Detection in Short-Form Documentaries
    Larson, Martha
    Jochems, Bart
    Smits, Ewine
    Ordelman, Roeland
    MULTILINGUAL INFORMATION ACCESS EVALUATION II: MULTIMEDIA EXPERIMENTS, PT II, 2010, 6242 : 385 - +
  • [49] A 15-Category Audio Dataset for Drones and an Audio-Based UAV Classification Using Machine Learning
    Wang, Mia Yaqin
    Chu, Zhiwei
    Ku, Ilmun
    Smith, E. Cho
    Matson, Eric T.
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2024, 18 (02) : 257 - 272
  • [50] POSTING SHORT-FORM VIDEOS PROMOTES OLDER ADULTS' PSYCHOLOGICAL HEALTH: A MODEL BASED ON SELF-DETERMINATION THEORY
    Wu, Jingxuan
    Peng, Huamao
    INNOVATION IN AGING, 2024, 8 : 1102 - 1102