Body landmark detection with an extremely small dataset using transfer learning

被引:0
|
作者
Liao, Iman Yi [1 ]
Hermawan, Eric Savero [1 ]
Zaman, Munir [2 ]
机构
[1] Univ Nottingham Malaysia, Sch Comp Sci, Semenyih 43500, Malaysia
[2] Zaman Educ Res & Informat Consultancy Sdn Bhd, Semenyih, Malaysia
关键词
Body landmark detection; Transfer learning; Attention unit; Fashion landmarks; Convolutional neural network; NETWORK; REGRESSION; SCALE;
D O I
10.1007/s10044-022-01098-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new landmark detection problem on the upper body of a clothed person for tailoring purposes. This is a landmark detection problem unknown in the literature, which is in the same domain as, but different to the 'fashion' landmark detection problem where the landmarks are for classifying clothing. An existing 'attentive fashion network' (AFN) was trained using 800,000 annotated images of the DeepFashion dataset, with a base network of VGG16 pre-trained on the ImageNet dataset, to provide initial weights. To train a network for 'body' landmark detection would require a similar sized dataset. We propose a deep neural network for body landmark detection where the knowledge from an existing network was transferred and trained with an extremely small dataset of just 99 images, annotated with body landmarks. A baseline model was tested where only the fashion landmark branch was used, but retrained for body landmarks. This produced a testing error of 0.068 (normalised mean distance between the predicted landmarks and ground-truth). The error was significantly reduced by adopting the fashion landmark branch and the attention unit of AFN, but substituting the classification branch with a new body landmark detection branch for the proposed Attention-based Fashion-to-Body landmark Network (AFBN). We tested 6 variants of the proposed AFBN model with different convolutional block designs and auto-encoders for enforcing landmark relations. The trained model had a low testing error ranging from 0.022 to 0.028 over these variants. The variant with an increased number of channels and inception units with residual connections, had the best overall performance. Although AFBN and its variants were trained with a limited dataset, the performance exceeds the state-of-the-art attentive fashion network AFN (0.0534). The principle of transfer learning demonstrated here is relevant where labelled domain data are scarce providing a low solution cost of faster training of a deep neural network with a significantly small dataset. [GRAPHICS] .
引用
收藏
页码:163 / 199
页数:37
相关论文
共 50 条
  • [41] 1-Stage Face Landmark Detection Using Deep Learning
    Kim, Taehyung
    Mok, Ji Won
    Lee, Eui Chul
    INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2020, PT II, 2021, 12616 : 239 - 247
  • [42] Multiple Landmark Detection Using Multi-agent Reinforcement Learning
    Vlontzos, Athanasios
    Alansary, Amir
    Kamnitsas, Konstantinos
    Rueckert, Daniel
    Kainz, Bernhard
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV, 2019, 11767 : 262 - 270
  • [43] Retinal Vessel Landmark Detection Using Deep Learning and Hessian Matrix
    Fang, TingBing
    Su, Rou
    Xie, LinPei
    Gu, QiWei
    Li, QiaoLiang
    Liang, Ping
    Wang, TianFu
    2015 8TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2015, : 387 - 392
  • [44] Facial landmark detection transfer learning for a specific user in driver status monitoring systems
    Kim, Jaechul
    Taguchi, Kensuke
    Hayashi, Yusuke
    Miyazaki, Jungo
    Fujiyoshi, Hironobu
    PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [45] Region Detection in Medical Images Using HOG Classifiers and a Body Landmark Network
    Erdt, Marius
    Knapp, Oliver
    Drechsler, Klaus
    Wesarg, Stefan
    MEDICAL IMAGING 2013: COMPUTER-AIDED DIAGNOSIS, 2013, 8670
  • [46] Classification of Intrusion Detection Dataset using machine learning Approaches
    Subramanyam, Doodipalli
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 280 - 283
  • [47] Android Spyware Detection Using Machine Learning: A Novel Dataset
    Qabalin, Majdi K.
    Naser, Muawya
    Alkasassbeh, Mouhammd
    SENSORS, 2022, 22 (15)
  • [48] Breast Cancer Dataset, Classification and Detection Using Deep Learning
    Iqbal, Muhammad Shahid
    Ahmad, Waqas
    Alizadehsani, Roohallah
    Hussain, Sadiq
    Rehman, Rizwan
    HEALTHCARE, 2022, 10 (12)
  • [49] DeepFire: A Novel Dataset and Deep Transfer Learning Benchmark for Forest Fire Detection
    Khan, Ali
    Hassan, Bilal
    Khan, Somaiya
    Ahmed, Ramsha
    Abuassba, Adnan
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [50] Transfer Learning for Diabetic Retinopathy Detection: A Study of Dataset Combination and Model Performance
    Mutawa, A. M.
    Alnajdi, Shahad
    Sruthi, Sai
    APPLIED SCIENCES-BASEL, 2023, 13 (09):