Content moderation assistance through image caption generation

Cited by: 0
Author
Kearns, Liam [1]
Affiliation
[1] AuraQ, 33 Graham Rd, Malvern WR14 2HU, Worcestershire, England
Source
Keywords
Content moderation; Caption generation; Computer vision; Machine learning
DOI
10.1016/j.iswa.2025.200489
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The rapid growth in digital media creation has made content moderation increasingly challenging. Manual moderation is susceptible to slower response times, while automated moderation is susceptible to false positives arising from unpredictable user inputs. Image caption generation has been suggested as a viable content moderation tool, but real-world deployments in this context are lacking. In this work, a collaborative approach is taken, in which a machine learning model assists human moderators in approving and rejecting media within a scavenger hunt game. The proposed model is trained on the Flickr30k and MS COCO datasets to generate captions for images. The results demonstrate a 13% reduction in review times, indicating that human-machine collaboration helps mitigate the risk of unsustainable review backlog growth. Furthermore, fine-tuning the model led to a 28% reduction in review times compared with the untuned model. Notably, this paper contributes to knowledge by demonstrating that caption generation is a viable content moderation tool and that its usefulness is sensitive to caption accuracy: inaccurate captions that produce false positives risk a deterioration in moderator response time.
Pages: 10