FCGNet: Foreground and Class Guided Network for human parsing

Cited by: 0
Authors
Jang, Jaehyuk [1]
Wang, Yooseung [1]
Kim, Changick [1]
Affiliations
[1] Korea Advanced Institute of Science and Technology (KAIST), School of Electrical Engineering, Daejeon 34141, South Korea
Keywords
Human parsing; Semantic segmentation; Graph convolutional network
DOI
10.1016/j.patcog.2024.110879
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Understanding the inherent hierarchical structure of the human body is key to human parsing. To capture human-specific characteristics, it is necessary to focus on the spatial and class information corresponding to the foreground (i.e., the human) in an image. Inspired by these insights, we introduce two supervision signals: spatial foreground information and information about which classes exist in the image. With foreground information as guidance, the network generates a human-focused feature map and captures pixel-wise hierarchical characteristics by computing correlations between pixels. Furthermore, we guide the network to consider the class information in the image at the feature level and to capture class-wise relationships by calculating correlations between channels. Moreover, during the training phase, we prevent the network from misclassifying pixels into confusing classes by providing the existent class information in the image to the network at the prediction level. Our model achieves state-of-the-art performance on three public benchmarks with significantly fewer parameters and Multiply-Accumulate Operations (MACs).
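The abstract names three ingredients: foreground-guided features, correlations between pixels and between channels, and the use of existent-class information at the prediction level. The toy sketch below illustrates one possible reading of each and is not the authors' implementation. Every function name is hypothetical. Using plain dot products as the "correlation" and hard-masking absent classes before a softmax are both assumptions, since the abstract does not specify the actual operations.

```python
import math

def matmul(a, b):
    """Plain-Python matrix product of two list-of-lists matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(col) for col in zip(*m)]

def apply_foreground(features, fg_mask):
    """Zero out background pixels. features is C x N, fg_mask has N entries of 0/1."""
    return [[v * m for v, m in zip(channel, fg_mask)] for channel in features]

def pixel_correlation(features):
    """N x N matrix of dot products between per-pixel feature vectors (assumed form)."""
    return matmul(transpose(features), features)

def channel_correlation(features):
    """C x C matrix of dot products between per-channel response maps (assumed form)."""
    return matmul(features, transpose(features))

def masked_softmax(logits, present):
    """Softmax restricted to classes flagged as present; absent classes get probability 0.

    Assumes at least one class is flagged as present.
    """
    top = max(z for z, p in zip(logits, present) if p)
    exps = [math.exp(z - top) if p else 0.0 for z, p in zip(logits, present)]
    total = sum(exps)
    return [e / total for e in exps]

# Toy input: 2 channels, 3 pixels; the last pixel is background.
features = [[1.0, 2.0, 5.0],
            [0.0, 1.0, 5.0]]
fg_mask = [1, 1, 0]

focused = apply_foreground(features, fg_mask)
pix_corr = pixel_correlation(focused)     # relations between pixels
chan_corr = channel_correlation(focused)  # relations between channels

# Per-pixel class logits for 3 classes, where class 1 does not occur in the image.
probs = masked_softmax([2.0, 3.0, 1.0], present=[True, False, True])
```

In this toy case the background pixel contributes nothing to either correlation matrix, and the absent class receives zero probability even though it has the largest raw logit.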
Pages: 12