Lightweight facial landmark detection network based on improved MobileViT

Cited by: 5
Authors
Song, Limei [1]
Hong, Chuanfei [1]
Gao, Tian [1]
Yu, Jiali [1]
Affiliation
[1] Tiangong Univ, Tianjin Key Lab Intelligent Control Elect Equipmen, Tianjin, Peoples R China
Keywords
Facial landmark; MobileViT; Lightweight; Transformer
DOI
10.1007/s11760-023-02975-4
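Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract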
Current CNN-based facial landmark detection networks cannot model the long-range dependencies between facial landmarks, and they typically have many parameters and consume substantial computational resources. This paper proposes a multi-scale lightweight facial landmark detection network that runs CNN and Transformer branches in parallel. Built on MobileViT, the network incorporates the MobileOne Block and a simplified Ghost BottleNeck as lightweight structures. Compared with MobileViT on the WFLW dataset, the number of network parameters is reduced by 49.18%, the failure rate is reduced by 3.20%, the detection speed is improved by 41.73%, the FLOPs are reduced by 64.83%, and the NME is improved by 0.45% and 1.31% on the test and pose subsets, respectively. These results indicate that adding the Transformer structure makes the extraction of global information about facial landmarks more accurate. Comparisons with other networks show that the improved MobileViT achieves more accurate detection with fewer model parameters.
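The abstract reports NME and failure rate without defining them. The sketch below shows one common way these two metrics are computed for landmark detection. It is not taken from the paper: the function names, the choice of two reference landmarks for normalization, and the 10% failure threshold are all assumptions made for illustration.

```python
import math

def nme(pred, gt, ref_a, ref_b):
    """Normalized mean error for one face.

    pred, gt: lists of (x, y) landmark coordinates of equal length.
    ref_a, ref_b: indices of two ground-truth landmarks whose distance
    is used as the normalizer (assumed here, e.g. the outer eye corners).
    """
    norm = math.dist(gt[ref_a], gt[ref_b])
    errors = [math.dist(p, g) for p, g in zip(pred, gt)]
    return sum(errors) / (len(errors) * norm)

def failure_rate(nme_values, threshold=0.10):
    """Fraction of faces whose NME exceeds the threshold (0.10 is an assumed value)."""
    return sum(1 for v in nme_values if v > threshold) / len(nme_values)

# Toy example with three landmarks; only the first prediction is off by one pixel.
gt = [(0.0, 0.0), (10.0, 0.0), (5.0, 5.0)]
pred = [(0.0, 1.0), (10.0, 0.0), (5.0, 5.0)]
single_nme = nme(pred, gt, ref_a=0, ref_b=1)
rate = failure_rate([0.05, 0.20, 0.08, 0.12])
```

With these definitions a lower NME means more accurate landmarks, so an "improved" NME in the abstract corresponds to a smaller value.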
Pages: 3123-3131
Number of pages: 9