LCFFNet: A Lightweight Cross-scale Feature Fusion Network for human pose estimation

被引:0
|
作者
Zou, Xuelian [1 ]
Bi, Xiaojun [2 ,3 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin, Heilongjiang, Peoples R China
[2] Minzu Univ China, Key Lab Ethn Language Intelligent Anal & Secur Gov, Beijing, Peoples R China
[3] Minzu Univ China, Sch Informat Engn, Beijing, Peoples R China
关键词
Human pose estimation; 2d dynamic multi-scale convolution; Contextual semantic information; Adaptive feature fusion;
D O I
10.1016/j.neunet.2024.106959
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human pose estimation is one of the most critical and challenging problems in computer vision. It is applied in many computer vision fields and has important research significance. However, it is still a difficult challenge to strike a balance between the number of parameters and computing load of the model and the accuracy of human pose estimation. In this study, we suggest a Lightweight Cross-scale Feature Fusion Network (LCFFNet) to strike a balance between accuracy and computational load and parameter volume. The Lightweight HRNet-Like (LHRNet) network, Cross-Resolution-Aware Semantics Module (CRASM), and Adapt Feature Fusion Module (AFFM) makeup LCFFNet. To be more precise, first, we suggest a lightweight LHRNet network that includes Dynamic Multi-scale Convolution Basic (DMSC-Basic block) block, Basic block, and DMSC-Basic block submodules in the network's three high-resolution subnetwork stages. The proposed dynamic multi-scale convolution in DMSC-Basic block can reduces the amount of model parameters and complexity of the LHRNet network, and has the ability to extract variable pose features. In order to maintain the model's ability to express features, the Basic block is introduced. Asa result, the LHRNet network not only makes the model more lightweight but also enhances its feature expression capabilities. Second, we propose a CRASM module to enhance contextual semantic information while reducing the semantic gap between different scales by fusing features from different scales. Finally, the augmented semantic feature map's spatial resolution is finally restored from bottom to top using our suggested AFFM, and adaptive feature fusion is used to increase the positioning accuracy of important sites. Our method successfully predicts keypoints with 74.2 % AP, 89.9 % PCKh@0.5 and 66.9 % AP on the MSCOCO 2017, MPII and Crowdpose datasets, respectively. Our model reduces the number of parameters by 89.0 % and the computational complexity by 87.5 % compared with HRNet. The proposed network performs as well as current large-model human pose estimation networks while outperforming state-of the-art lightweight networks.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Lightweight Multi-Resolution Network for Human Pose Estimation
    Li, Pengxin
    Wang, Rong
    Zhang, Wenjing
    Liu, Yinuo
    Xu, Chenyue
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 138 (03): : 2239 - 2255
  • [42] Lightweight densely connected residual network for human pose estimation
    Lianping Yang
    Yu Qin
    Xiangde Zhang
    Journal of Real-Time Image Processing, 2021, 18 : 825 - 837
  • [43] LDNet: Lightweight dynamic convolution network for human pose estimation
    Xu, Dingning
    Zhang, Rong
    Guo, Lijun
    Feng, Cun
    Gao, Shangce
    ADVANCED ENGINEERING INFORMATICS, 2022, 54
  • [44] A lightweight pose estimation network with multi-scale receptive field
    Li, Shuo
    Dai, Ju
    Chen, Zhangmeng
    Pan, Junjun
    VISUAL COMPUTER, 2023, 39 (08): : 3429 - 3440
  • [45] A lightweight pose estimation network with multi-scale receptive field
    Shuo Li
    Ju Dai
    Zhangmeng Chen
    Junjun Pan
    The Visual Computer, 2023, 39 : 3429 - 3440
  • [46] Cross-scale global attention feature pyramid network for person search
    Li, Yang
    Xu, Huahu
    Bian, Minjie
    Xiao, Junsheng
    IMAGE AND VISION COMPUTING, 2021, 116
  • [47] Cross-Scale Feature Fusion for Object Detection in Optical Remote Sensing Images
    Cheng, Gong
    Si, Yongjie
    Hong, Hailong
    Yao, Xiwen
    Guo, Lei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (03) : 431 - 435
  • [48] CSINet: A Cross-Scale Interaction Network for Lightweight Image Super-Resolution
    Ke, Gang
    Lo, Sio-Long
    Zou, Hua
    Liu, Yi-Feng
    Chen, Zhen-Qiang
    Wang, Jing-Kai
    SENSORS, 2024, 24 (04)
  • [49] LiteHandNet: A Lightweight Hand Pose Estimation Network via Structural Feature Enhancement
    Huang, Zhi-Yong
    Chen, Song-Lu
    Liu, Qi
    Zhang, Chong-Jian
    Chen, Feng
    Yin, Xu-Cheng
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 321 - 333
  • [50] Development of a cross-scale weighted feature fusion network for hot-rolled steel surface defect detection
    Zhang, Yuzhong
    Wang, Wenjing
    Li, Zhaoming
    Shu, Shuangbao
    Lang, Xianli
    Zhang, Tengda
    Dong, Jingtao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117