Audio-Based Hate Speech Classification from Online Short-Form Videos

Cited by: 4
Authors
Ibanez, Michael [1]
Sapinit, Ranz [1]
Reyes, Lloyd Antonie [1]
Hussien, Mohammed [1]
Imperial, Joseph Marvin [1]
Rodriguez, Ramon [1]
Affiliations
[1] National University, Manila, Philippines
Keywords
hate speech; TikTok; audio classification; machine learning; speech processing
DOI
10.1109/IALP54817.2021.9675250
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this study, we pioneer the development of an audio-based hate speech classifier from online, short-form TikTok videos using traditional machine learning algorithms such as Logistic Regression, Random Forest, and Support Vector Machines. We scraped over 4746 videos using the TikTok API tool and extracted audio-based features such as MFCCs, Spectral Centroid, Rolloff, Bandwidth, Zero-Crossing Rate, and Chroma values as primary feature sets. Results show that using the extracted predictors for hate speech detection can obtain up to 78.5% accuracy on an optimized Random Forest model, crossing the 50% benchmark for models in this task. In addition, a comparison of the Information Gain scores and globally learned model weights indicates that Spectral Rolloff and MFCCs are the top predictors in discriminating hate speech for the Filipino language.
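As a rough illustration of three of the features named in the abstract (Zero-Crossing Rate, Spectral Centroid, and Spectral Rolloff), the sketch below computes them for a single synthetic frame using only the Python standard library. It is not the authors' pipeline: the abstract does not state which tooling, frame size, or rolloff threshold was used, so the function names, the 8000 Hz sample rate, the 256-sample frame, and the 0.85 rolloff fraction are all assumptions chosen for illustration. MFCCs, Bandwidth, and Chroma are omitted for brevity.

```python
import cmath
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def magnitude_spectrum(frame):
    """Magnitudes of the non-negative-frequency bins of a naive O(n^2) DFT."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectral_centroid(mags, sample_rate, n):
    """Magnitude-weighted mean frequency, in Hz."""
    total = sum(mags)
    if total == 0:
        return 0.0
    return sum(k * sample_rate / n * m for k, m in enumerate(mags)) / total


def spectral_rolloff(mags, sample_rate, n, fraction=0.85):
    """Lowest bin frequency (Hz) at which the cumulative magnitude
    reaches `fraction` of the total magnitude."""
    threshold = fraction * sum(mags)
    running = 0.0
    for k, m in enumerate(mags):
        running += m
        if running >= threshold:
            return k * sample_rate / n
    return (len(mags) - 1) * sample_rate / n


# Assumed illustration values, not taken from the paper.
SAMPLE_RATE = 8000   # Hz
FRAME_LEN = 256      # samples
TONE_HZ = 1000       # falls exactly on DFT bin 32 for this frame length

# Synthetic frame: a pure tone with a small phase offset so that no
# sample sits exactly on zero.
frame = [
    math.sin(2 * math.pi * TONE_HZ * t / SAMPLE_RATE + 0.5)
    for t in range(FRAME_LEN)
]

mags = magnitude_spectrum(frame)
features = {
    "zero_crossing_rate": zero_crossing_rate(frame),
    "spectral_centroid_hz": spectral_centroid(mags, SAMPLE_RATE, FRAME_LEN),
    "spectral_rolloff_hz": spectral_rolloff(mags, SAMPLE_RATE, FRAME_LEN),
}
print(features)
```

For a pure tone, the centroid and the rolloff both land at the tone's frequency, which makes the frame a convenient sanity check. In a setup like the one the abstract describes, per-frame values such as these would typically be summarized over each clip and passed to a classifier as a feature vector.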
Pages: 72 - 77
Number of pages: 6