Semantic segmentation network stacking with genetic programming

被引:1
|
作者
Bakurov, Illya [1 ,2 ,3 ]
Buzzelli, Marco [4 ]
Schettini, Raimondo [4 ]
Castelli, Mauro [1 ]
Vanneschi, Leonardo [1 ]
机构
[1] Univ Nova Lisboa, Informat Management Sch, Campus Campolide, P-1070312 Lisbon, Lisboa, Portugal
[2] Michigan State Univ, BEACON Ctr Evolut Act, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
[4] Univ Milano Bicocca, Dept Informat Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy
关键词
Genetic programming; Stacking; Semantic segmentation; Ensemble learning; Deep learning; CLASSIFICATION; ARCHITECTURE; ENSEMBLE;
D O I
10.1007/s10710-023-09464-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation consists of classifying each pixel of an image and constitutes an essential step towards scene recognition and understanding. Deep convolutional encoder-decoder neural networks now constitute state-of-the-art methods in the field of semantic segmentation. The problem of street scenes' segmentation for automotive applications constitutes an important application field of such networks and introduces a set of imperative exigencies. Since the models need to be executed on self-driving vehicles to make fast decisions in response to a constantly changing environment, they are not only expected to operate reliably but also to process the input images rapidly. In this paper, we explore genetic programming (GP) as a meta-model that combines four different efficiency-oriented networks for the analysis of urban scenes. Notably, we present and examine two approaches. In the first approach, we represent solutions as GP trees that combine networks' outputs such that each output class's prediction is obtained through the same meta-model. In the second approach, we propose representing solutions as lists of GP trees, each designed to provide a unique meta-model for a given target class. The main objective is to develop efficient and accurate combination models that could be easily interpreted, therefore allowing gathering some hints on how to improve the existing networks. The experiments performed on the Cityscapes dataset of urban scene images with semantic pixel-wise annotations confirm the effectiveness of the proposed approach. Specifically, our best-performing models improve systems' generalization ability by approximately 5% compared to traditional ensembles, 30% for the less performing state-of-the-art CNN and show competitive results with respect to state-of-the-art ensembles. Additionally, they are small in size, allow interpretability, and use fewer features due to GP's automatic feature selection.
引用
收藏
页数:37
相关论文
共 50 条
  • [21] Genetic programming with semantic equivalence classes
    Ruberto, Stefano
    Vanneschi, Leonardo
    Castelli, Mauro
    SWARM AND EVOLUTIONARY COMPUTATION, 2019, 44 : 453 - 469
  • [22] An Introduction to Geometric Semantic Genetic Programming
    Vanneschi, Leonardo
    NEO 2015, 2017, 663 : 3 - 42
  • [23] Cartesian Genetic Programming as an Optimizer of Programs Evolved with Geometric Semantic Genetic Programming
    Koncal, Ondrej
    Sekanina, Lukas
    GENETIC PROGRAMMING, EUROGP 2019, 2019, 11451 : 98 - 113
  • [24] Deconvolutional network for facial semantic segmentation
    Yang, Heekyung
    Min, Kyungha
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 124 : 10 - 11
  • [25] CROSS ATTENTION NETWORK FOR SEMANTIC SEGMENTATION
    Liu, Mengyu
    Yin, Hujun
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2434 - 2438
  • [26] Embedded Attention Network for Semantic Segmentation
    Lv, Qingxuan
    Feng, Mingzhe
    Sun, Xin
    Dong, Junyu
    Chen, Changrui
    Zhang, Yu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 326 - 333
  • [27] Fully Attentional Network for Semantic Segmentation
    Song, Qi
    Li, Jie
    Li, Chenghong
    Guo, Hao
    Huang, Rui
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2280 - 2288
  • [28] Bilateral attention network for semantic segmentation
    Wang, Dongli
    Li, Nanjun
    Zhou, Yan
    Mu, Jinzhen
    IET IMAGE PROCESSING, 2021, 15 (08) : 1607 - 1616
  • [29] Neighborhood Encoding Network for Semantic Segmentation
    Lou, Xiaotian
    Chen, Xiaoyu
    Bai, Lianfa
    Han, Jing
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 568 - 578
  • [30] Dynamic attention network for semantic segmentation
    Wu, Fei
    Chen, Feng
    Jing, Xiao-Yuan
    Hu, Chang-Hui
    Ge, Qi
    Ji, Yimu
    NEUROCOMPUTING, 2020, 384 (384) : 182 - 191