Semantic segmentation using tag label and transformer

被引:0
|
作者
Jeong S.-W. [1 ]
Kim E.-C. [2 ]
Yoo J. [3 ]
机构
[1] Department of IT Convergence Engineering, Daegu University
[2] Department of Psychology, Daegu University
[3] School of Artificial Intelligence, Daegu University
关键词
Child Abuse Protection System; Deep Learning; Face Detection; Mosaic Generation;
D O I
10.5302/J.ICROS.2021.21.0134
中图分类号
学科分类号
摘要
Though the mandatory policy of installing CCTV in the childhood care facilities of public institutions such as kindergarten and daycare center, the criminal of child abuse cases is gradually increasing due to the lack of awareness of violent acts and the difficulty in understanding the reporting processes. This paper proposes a novel Child Abuse Protection System (CAPS) to solve the above social problem. The proposed CAPS is composed of three functional software modules to implement a deep-learning-based system that autonomously detects violent acts against children. First, the clip creator module divides long CCTV videos into several pieces of short video clips. Second, the violence detector module classifies the abuse behaviors from the generated clips. Finally, the face detector module automatically processes the witnessed suspect’s face being blurred out by mosaic. Experimental evaluation results show that the most suitable feature extractor for detecting the child abuse behaviors is the MobileNetV2+LSTM model among several candidates of the proposed CNN+LSTM violence detection module, which has the best at 92.51% accuracy. Furthermore, the recall rate can be increased up to 6% by exploiting the proposed data augmentation technique. Codes are available at https://github.com/learningsteady0J0/ CAPSChild-Abuse-Protection-System. © ICROS 2021.
引用
收藏
页码:1029 / 1037
页数:8
相关论文
共 50 条
  • [31] SARFormer: Segmenting Anything Guided Transformer for semantic segmentation
    Zhang, Lixin
    Huang, Wenteng
    Fan, Bin
    NEUROCOMPUTING, 2025, 635
  • [32] Full-Scale Selective Transformer for Semantic Segmentation
    Lin, Fangjian
    Wu, Sitong
    Ma, Yizhe
    Tian, Shengwei
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 310 - 326
  • [33] Indoor semantic segmentation based on Swin-Transformer
    Zheng, Yunping
    Xu, Yuan
    Shu, Shiqiang
    Sarem, Mudar
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [34] TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation
    Liu, Ruiping
    Yang, Kailun
    Roitberg, Alina
    Zhang, Jiaming
    Peng, Kunyu
    Liu, Huayao
    Wang, Yaonan
    Stiefelhagen, Rainer
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 20933 - 20949
  • [35] Efficient and adaptive semantic segmentation network based on Transformer
    Zhang H.-B.
    Cai L.
    Ren J.-P.
    Wang R.-Y.
    Liu F.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (06): : 1205 - 1214
  • [36] A Patch Diversity Transformer for Domain Generalized Semantic Segmentation
    He, Pei
    Jiao, Licheng
    Shang, Ronghua
    Liu, Xu
    Liu, Fang
    Yang, Shuyuan
    Zhang, Xiangrong
    Wang, Shuang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14138 - 14150
  • [37] SEMANTIC SEGMENTATION OF HIGH-RESOLUTION REMOTE SENSING IMAGES USING AN IMPROVED TRANSFORMER
    Liu, Yuheng
    Mei, Shaohui
    Zhang, Shun
    Wang, Ye
    He, Mingyi
    Du, Qian
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3496 - 3499
  • [38] ODFormer: Semantic fundus image segmentation using Transformer for optic nerve head detection
    Wang, Jiayi
    Mao, Yi-An
    Ma, Xiaoyu
    Guo, Sicen
    Shao, Yuting
    Lv, Xiao
    Han, Wenting
    Christopher, Mark
    Zangwill, Linda M.
    Bi, Yanlong
    Fan, Rui
    INFORMATION FUSION, 2024, 112
  • [39] Image Tag Refinement Using Tag Semantic and Visual Similarity
    Cheng, Wengang
    Wang, Xiaolei
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2146 - 2149
  • [40] COMPUTATIONALLY-EFFICIENT VISION TRANSFORMER FOR MEDICAL IMAGE SEMANTIC SEGMENTATION VIA DUAL PSEUDO-LABEL SUPERVISION
    Wang, Ziyang
    Dong, Nanqing
    Voiculescu, Irina
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1961 - 1965