Relevancy between Objects Based on Common Sense for Semantic Segmentation

被引:1
|
作者
Zhou, Jun [1 ]
Bai, Xing [1 ]
Zhang, Qin [2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Natl Sci Lib, Beijing 100190, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
semantic segmentation; scene parsing; computer vision;
D O I
10.3390/app122412711
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Research on image classification sparked the latest deep-learning boom. Many downstream tasks, including semantic segmentation, benefit from it. The state-of-the-art semantic segmentation models are all based on deep learning, and they sometimes make some semantic mistakes. In a semantic segmentation dataset with a small number of categories, images are often collected from a single scene, and there is a close semantic connection between any two categories. However, in the semantic segmentation dataset collected from multiple scenes, two categories may be irrelevant. The probability of objects in one category appearing next to objects in other categories is different, which is the basis of the paper. Semantic segmentation methods need to solve two problems of positioning and classification. This paper is dedicated to correcting those clearly wrong classifications that are contrary to reality. Specifically, we first calculate the relevancy between different class pairs. Then, based on this knowledge, we infer the category of a connected component according to the relationships of this connected component with its surrounding connected components and correct the obviously wrong classifications made by a deep learning semantic segmentation model. Several well-performing deep learning models are experimented on two challenging public datasets in the field of semantic image segmentation. Our proposed method improves the performance of UPerNet, OCRNet and SETR from 40.7%, 43% and 48.64% to 42.07%, 44.09% and 49.09% mean IoU on the ADE20K validation set, and the performance of PSPNet, DeepLabV3 and OCRNet from 37.26%, 37.3% and 39.5% to 38.93%, 38.95% and 40.63% mean IoU on the COCO-Stuff dataset, which shows the effectiveness of the method.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] DenseU-Net-Based Semantic Segmentation of Objects in Urban Remote Sensing Images
    Dong, Rongsheng
    Pan, Xiaoquan
    Li, Fengying
    IEEE ACCESS, 2019, 7 : 65347 - 65356
  • [22] Seeing the Unseen: Visual Common Sense for Semantic Placement
    Ramrakhya, Ram
    Kembhavi, Aniruddha
    Batra, Dhruv
    Kira, Zsolt
    Zeng, Kuo-Hao
    Weihs, Luca
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16273 - 16283
  • [23] Between Communicability and Common Sense
    Friedlander, Eli
    BRITISH JOURNAL OF AESTHETICS, 2016, 56 (04): : 401 - 404
  • [24] Bridging the Gap Between Semantic Segmentation and Instance Segmentation
    Yin, Chengxiang
    Tang, Jian
    Yuan, Tongtong
    Xu, Zhiyuan
    Wang, Yanzhi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4183 - 4196
  • [25] Detection and Retrieval of Out-of-Distribution Objects in Semantic Segmentation
    Oberdiek, Philipp
    Rottmann, Matthias
    Fink, Gernot A.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1331 - 1340
  • [26] Semantic spatio-temporal segmentation for extracting video objects
    Mao, JH
    Ma, KK
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 738 - 743
  • [27] Active learning based semantic segmentation for extraction of minute objects from multispectral satellite images
    Abhishek, R.
    Chakravorty, Anisha
    Chakraborty, Shounak
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 7274 - 7277
  • [28] Research on Topic Relevancy of Sentences Based on How Net Semantic Computation
    Xu, Jinzhong
    Liu, Jie
    Liu, Xiaoming
    HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 2, PROCEEDINGS, 2009, : 195 - +
  • [29] Human scanpath estimation based on semantic segmentation guided by common eye fixation behaviors
    Han, Yiyuan
    Han, Bing
    Gao, Xinbo
    NEUROCOMPUTING, 2021, 453 : 705 - 717
  • [30] DOCK: Detecting Objects by Transferring Common-Sense Knowledge
    Singh, Krishna Kumar
    Divvala, Santosh
    Farhadi, Ali
    Lee, Yong Jae
    COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 506 - 522