Multi-level graph convolutional recurrent neural network for semantic image segmentation

被引:7
|
作者
Jiang, Dingchao [1 ]
Qu, Hua [1 ]
Zhao, Jihong [2 ]
Zhao, Jianlong [3 ]
Liang, Wei [4 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Telecommun & Informat Engn, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
[4] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
关键词
Deep learning; Semantic image segmentation; Graph convolutional recurrent neural network; Multi-level features;
D O I
10.1007/s11235-021-00769-y
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With the advent of the Internet of Things (IoT) era, many devices have surfaced that capture and generate various visual data. To recognize and extract a meaningful pattern from these visual data, powerful methods are required for different IoT applications. Fortunately, deep convolutional neural networks (CNNs) significantly improve the performance of almost all tasks in computer vision, including semantic image segmentation. However, the feature extraction of CNNs may cause the loss of contextual and spatial information. Moreover, the standard convolutional and pooling layers adopted by most CNN architectures lead to a fixed receptive field, which makes it challenging to deal with multi-scale objects in the image. To remedy these issues of CNNs for semantic image segmentation, this paper proposes a multi-level graph convolutional recurrent neural network (MGCRNN) to combine CNNs and graph neural networks (GNNs) for fusing multi-level features. By applying graph convolutional recurrent neural network (GCRNN), the proposed model acquires a global view of the image and aggregates multi-level contextual and structural information. The experiments verify the ability of GCRNN to obtain a flexible receptive field and learn structure features without losing spatial information. Results of these experiments conducted on the Pascal VOC 2012 and Cityscapes datasets show that the proposed model outperforms baseline approaches and can be competitive with state-of-the-art methods
引用
收藏
页码:563 / 576
页数:14
相关论文
共 50 条
  • [31] Multi-Level Generative Chaotic Recurrent Network for Image Inpainting
    Chen, Cong
    Abbott, Amos
    Stilwell, Daniel
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3625 - 3634
  • [32] A 3D Convolutional Neural Network for Volumetric Image Semantic Segmentation
    Lu, Hongya
    Wang, Haifeng
    Zhang, Qianqian
    Yoon, Sang Won
    Won, Daehan
    25TH INTERNATIONAL CONFERENCE ON PRODUCTION RESEARCH MANUFACTURING INNOVATION: CYBER PHYSICAL MANUFACTURING, 2019, 39 : 422 - 428
  • [33] Semantic Fire Segmentation Model Based on Convolutional Neural Network for Outdoor Image
    Choi, Han-Soo
    Jeon, Myeongho
    Song, Kyungmin
    Kang, Myungjoo
    FIRE TECHNOLOGY, 2021, 57 (06) : 3005 - 3019
  • [34] On the contextual aspects of using deep convolutional neural network for semantic image segmentation
    Wang, Chunlai
    Mauch, Lukas
    Saxena, Mehul Manoj
    Yang, Bin
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [35] Image Semantic Segmentation Based on Convolutional Neural Network and Conditional Random Field
    Tao, Hu
    Li, Weihua
    Qin, Xianxiang
    Jia, Dan
    PROCEEDINGS OF 2018 TENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2018, : 568 - 572
  • [36] Semantic Fire Segmentation Model Based on Convolutional Neural Network for Outdoor Image
    Han-Soo Choi
    Myeongho Jeon
    Kyungmin Song
    Myungjoo Kang
    Fire Technology, 2021, 57 : 3005 - 3019
  • [37] DenseGCN: A multi-level and multi-temporal graph convolutional network for action recognition
    Yu, Chengzhang
    Bao, Wenxia
    IET IMAGE PROCESSING, 2023, 17 (12) : 3401 - 3410
  • [38] Knowledge Graph Embedding Model Incorporating Multi-Level Convolutional Neural Networks
    Li, Min
    Li, Xuejun
    Liao, Jing
    Computer Engineering and Applications, 2025, 61 (06) : 192 - 198
  • [39] Graph convolutional networks with multi-level coarsening for graph classification
    Xie, Yu
    Yao, Chuanyu
    Gong, Maoguo
    Chen, Cheng
    Qin, A. K.
    KNOWLEDGE-BASED SYSTEMS, 2020, 194
  • [40] Facial Expression Recognition Using a Multi-level Convolutional Neural Network
    Hai-Duong Nguyen
    Yeom, Soonja
    Oh, Il-Seok
    Kim, Kyoung-Min
    Kim, Soo-Hyung
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 217 - 221