Semantic segmentation network stacking with genetic programming

被引：1

作者：

Bakurov, Illya ^{[1
,2
,3
]}

Buzzelli, Marco ^{[4
]}

Schettini, Raimondo ^{[4
]}

Castelli, Mauro ^{[1
]}

Vanneschi, Leonardo ^{[1
]}

机构：

[1] Univ Nova Lisboa, Informat Management Sch, Campus Campolide, P-1070312 Lisbon, Lisboa, Portugal

[2] Michigan State Univ, BEACON Ctr Evolut Act, E Lansing, MI 48824 USA

[3] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA

[4] Univ Milano Bicocca, Dept Informat Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy

来源：

GENETIC PROGRAMMING AND EVOLVABLE MACHINES | 2023年 / 24卷 / 02期

关键词：

Genetic programming; Stacking; Semantic segmentation; Ensemble learning; Deep learning; CLASSIFICATION; ARCHITECTURE; ENSEMBLE;

D O I：

10.1007/s10710-023-09464-0

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation consists of classifying each pixel of an image and constitutes an essential step towards scene recognition and understanding. Deep convolutional encoder-decoder neural networks now constitute state-of-the-art methods in the field of semantic segmentation. The problem of street scenes' segmentation for automotive applications constitutes an important application field of such networks and introduces a set of imperative exigencies. Since the models need to be executed on self-driving vehicles to make fast decisions in response to a constantly changing environment, they are not only expected to operate reliably but also to process the input images rapidly. In this paper, we explore genetic programming (GP) as a meta-model that combines four different efficiency-oriented networks for the analysis of urban scenes. Notably, we present and examine two approaches. In the first approach, we represent solutions as GP trees that combine networks' outputs such that each output class's prediction is obtained through the same meta-model. In the second approach, we propose representing solutions as lists of GP trees, each designed to provide a unique meta-model for a given target class. The main objective is to develop efficient and accurate combination models that could be easily interpreted, therefore allowing gathering some hints on how to improve the existing networks. The experiments performed on the Cityscapes dataset of urban scene images with semantic pixel-wise annotations confirm the effectiveness of the proposed approach. Specifically, our best-performing models improve systems' generalization ability by approximately 5% compared to traditional ensembles, 30% for the less performing state-of-the-art CNN and show competitive results with respect to state-of-the-art ensembles. Additionally, they are small in size, allow interpretability, and use fewer features due to GP's automatic feature selection.

引用

页数：37

共 50 条

[21] Genetic programming with semantic equivalence classes
Ruberto, Stefano
Vanneschi, Leonardo
Castelli, Mauro
SWARM AND EVOLUTIONARY COMPUTATION, 2019, 44 : 453 - 469
[22] An Introduction to Geometric Semantic Genetic Programming
Vanneschi, Leonardo
NEO 2015, 2017, 663 : 3 - 42
[23] Cartesian Genetic Programming as an Optimizer of Programs Evolved with Geometric Semantic Genetic Programming
Koncal, Ondrej
Sekanina, Lukas
GENETIC PROGRAMMING, EUROGP 2019, 2019, 11451 : 98 - 113
[24] Deconvolutional network for facial semantic segmentation
Yang, Heekyung
Min, Kyungha
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 124 : 10 - 11
[25] CROSS ATTENTION NETWORK FOR SEMANTIC SEGMENTATION
Liu, Mengyu
Yin, Hujun
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2434 - 2438
[26] Embedded Attention Network for Semantic Segmentation
Lv, Qingxuan
Feng, Mingzhe
Sun, Xin
Dong, Junyu
Chen, Changrui
Zhang, Yu
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 326 - 333
[27] Fully Attentional Network for Semantic Segmentation
Song, Qi
Li, Jie
Li, Chenghong
Guo, Hao
Huang, Rui
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2280 - 2288
[28] Bilateral attention network for semantic segmentation
Wang, Dongli
Li, Nanjun
Zhou, Yan
Mu, Jinzhen
IET IMAGE PROCESSING, 2021, 15 (08) : 1607 - 1616
[29] Neighborhood Encoding Network for Semantic Segmentation
Lou, Xiaotian
Chen, Xiaoyu
Bai, Lianfa
Han, Jing
IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 568 - 578
[30] Dynamic attention network for semantic segmentation
Wu, Fei
Chen, Feng
Jing, Xiao-Yuan
Hu, Chang-Hui
Ge, Qi
Ji, Yimu
NEUROCOMPUTING, 2020, 384 (384) : 182 - 191

← 1 2 3 4 5 →