Instance Segmentation, Body Part Parsing, and Pose Estimation of Human Figures in Pictorial Maps

被引:2
|
作者
Schnuerer, Raimund [1 ]
Oztireli, A. Cengiz [2 ]
Heitzler, Magnus [1 ]
Sieber, Rene [1 ]
Hurni, Lorenz [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Civil Environm & Geomat Engn, Zurich, Switzerland
[2] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England
关键词
Machine learning; convolutional neural networks; map digitisation;
D O I
10.1080/23729333.2021.1949087
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, convolutional neural networks (CNNs) have been applied successfully to recognise persons, their body parts and pose keypoints in photos and videos. The transfer of these techniques to artificially created images is rather unexplored, though challenging since these images are drawn in different styles, body proportions, and levels of abstraction. In this work, we study these problems on the basis of pictorial maps where we identify included human figures with two consecutive CNNs: We first segment individual figures with Mask R-CNN, and then parse their body parts and estimate their poses simultaneously with four different UNet++ versions. We train the CNNs with a mixture of real persons and synthetic figures and compare the results with manually annotated test datasets consisting of pictorial figures. By varying the training datasets and the CNN configurations, we were able to improve the original Mask R-CNN model and we achieved moderately satisfying results with the UNet++ versions. The extracted figures may be used for animation and storytelling and may be relevant for the analysis of historic and contemporary maps.
引用
收藏
页码:291 / 307
页数:17
相关论文
共 50 条
  • [41] Recognizing Human Actions as the Evolution of Pose Estimation Maps
    Liu, Mengyuan
    Yuan, Junsong
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1159 - 1168
  • [42] Human Pose Estimation by Exploiting Spatial and Temporal Constraints in Body-Part Configurations
    Li, Qingwu
    He, Feijia
    Wang, Tian
    Zhou, Liangji
    Xi, Shuya
    IEEE ACCESS, 2017, 5 : 443 - 454
  • [43] POSE ESTIMATION AND BODY SEGMENTATION BASED ON HIERARCHICAL SEARCHING TREE
    Li, Shifeng
    Lu, Huchuan
    Ruan, Xiang
    Chen, Yen-wei
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1289 - 1292
  • [44] 3D Pictorial Structures for Human Pose Estimation with Supervoxels
    Schick, Alexander
    Stiefelhagen, Rainer
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 140 - 147
  • [45] 3D Pictorial Structures for Multiple Human Pose Estimation
    Belagiannis, Vasileios
    Amin, Sikandar
    Andriluka, Mykhaylo
    Schiele, Bernt
    Navab, Nassir
    Ilic, Slobodan
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1669 - 1676
  • [46] Latent variable pictorial structure for human pose estimation on depth images
    He, Li
    Wang, Guijin
    Liao, Qingmin
    Xue, Jing-Hao
    NEUROCOMPUTING, 2016, 203 : 52 - 61
  • [47] AMIL: Adversarial Multi-instance Learning for Human Pose Estimation
    Shamsolmoali, Pourya
    Zareapoor, Masoumeh
    Zhou, Huiyu
    Yang, Jie
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
  • [48] Efficient Human Pose Estimation via Parsing a Tree Structure Based Human Model
    Zhang, Xiaoqin
    Li, Changcheng
    Tong, Xiaofeng
    Hu, Weiming
    Maybank, Steve
    Zhang, Yimin
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1349 - 1356
  • [49] Segmentation Guided Attention Networks for Human Pose Estimation
    Tang, Jingfan
    Lu, Jipeng
    Zhang, Xuefeng
    Zhao, Fang
    TRAITEMENT DU SIGNAL, 2024, 41 (05) : 2485 - 2493
  • [50] Human Pose Estimation and Tracking via Parsing a Tree Structure Based Human Model
    Zhang, Xiaoqin
    Li, Changcheng
    Hu, Weiming
    Tong, Xiaofeng
    Maybank, Steve
    Zhang, Yimin
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 44 (05): : 580 - 592