Content moderation assistance through image caption generation

Author
Kearns, Liam [1]
Affiliation
[1] AuraQ, 33 Graham Rd, Malvern WR14 2HU, Worcestershire, England
Keywords
Content moderation; Caption generation; Computer vision; Machine learning
DOI
10.1016/j.iswa.2025.200489
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The rapid growth in digital media creation has made content moderation increasingly challenging. Manual moderation is susceptible to slower response times, while automated moderation is susceptible to false positives arising from unpredictable user inputs. Image caption generation has been suggested as a viable content moderation tool, but real-world deployments in this context are lacking. This work takes a collaborative approach in which a machine learning model assists human moderators in approving and rejecting media within a scavenger hunt game. The proposed model is trained on the Flickr30k and MS COCO datasets to generate captions for images. The results demonstrate a 13% reduction in review times, indicating that human-machine collaboration helps mitigate the risk of unsustainable review backlog growth. Furthermore, fine-tuning the model led to a 28% reduction in review times compared with the untuned model. Notably, this paper contributes to knowledge by demonstrating that caption generation is a viable content moderation tool and that its usefulness is sensitive to caption accuracy, whereby false positives risk a deterioration in moderator response time.
Pages: 10