Relevancy between Objects Based on Common Sense for Semantic Segmentation

被引:1
|
作者
Zhou, Jun [1 ]
Bai, Xing [1 ]
Zhang, Qin [2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Natl Sci Lib, Beijing 100190, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
semantic segmentation; scene parsing; computer vision;
D O I
10.3390/app122412711
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Research on image classification sparked the latest deep-learning boom. Many downstream tasks, including semantic segmentation, benefit from it. The state-of-the-art semantic segmentation models are all based on deep learning, and they sometimes make some semantic mistakes. In a semantic segmentation dataset with a small number of categories, images are often collected from a single scene, and there is a close semantic connection between any two categories. However, in the semantic segmentation dataset collected from multiple scenes, two categories may be irrelevant. The probability of objects in one category appearing next to objects in other categories is different, which is the basis of the paper. Semantic segmentation methods need to solve two problems of positioning and classification. This paper is dedicated to correcting those clearly wrong classifications that are contrary to reality. Specifically, we first calculate the relevancy between different class pairs. Then, based on this knowledge, we infer the category of a connected component according to the relationships of this connected component with its surrounding connected components and correct the obviously wrong classifications made by a deep learning semantic segmentation model. Several well-performing deep learning models are experimented on two challenging public datasets in the field of semantic image segmentation. Our proposed method improves the performance of UPerNet, OCRNet and SETR from 40.7%, 43% and 48.64% to 42.07%, 44.09% and 49.09% mean IoU on the ADE20K validation set, and the performance of PSPNet, DeepLabV3 and OCRNet from 37.26%, 37.3% and 39.5% to 38.93%, 38.95% and 40.63% mean IoU on the COCO-Stuff dataset, which shows the effectiveness of the method.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A word sense disambiguation algorithm for the simple semantic units based on semantic relevancy
    Liu, Y. (Liuyt_liuyt@126.com), 1600, Binary Information Press (10):
  • [2] Natural language, relevancy ranking, and common sense
    Clark, D
    IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (04): : 17 - 19
  • [3] Knowledge-based computational modeling on semantic relevancy between words
    Wang, Hongling
    Lv, Qiang
    Zhou, Guodong
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 63 - 67
  • [4] A semantic approach to segmentation of overlapping objects
    Wittenberg, T
    Grobe, M
    Münzenmayer, C
    Kuziela, H
    Spinnler, K
    METHODS OF INFORMATION IN MEDICINE, 2004, 43 (04) : 343 - 353
  • [5] RESEARCH AND IMPLEMENTATION ON SEMANTIC RELEVANCY COMPUTING BETWEEN WORDS
    Li, Min
    Yin, Shao-Hong
    FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2012), 2012, : 333 - 338
  • [6] Plants Don't Walk on the Street: Common-Sense Reasoning for Reliable Semantic Segmentation
    Adilova, Linara
    Schulz, Elena
    Akila, Maram
    Houben, Sebastian
    Schneider, Jan David
    Huger, Fabian
    Wirtz, Tim
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 85 - 92
  • [7] Towards automatic segmentation of semantic video objects
    Feng, Y
    Jiang, J
    Lpsm, SS
    EUROCON 2005: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOL 1 AND 2 , PROCEEDINGS, 2005, : 987 - 990
  • [8] Dense Semantic Image Segmentation with Objects and Attributes
    Zheng, Shuai
    Cheng, Ming-Ming
    Warrell, Jonathan
    Sturgess, Paul
    Vineet, Vibhav
    Rother, Carsten
    Torr, Philip H. S.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3214 - 3221
  • [9] Semantic Segmentation of Objects from Airborne Imagery
    Thuy Thi Nguyen
    Sang Viet Dinh
    Nguyen Tien Quang
    Huynh Thi Thanh Binh
    2017 FOURTH ASIAN CONFERENCE ON DEFENCE TECHNOLOGY - JAPAN (ACDT), 2017, : 140 - 145
  • [10] Semiautomatic segmentation and tracking of semantic video objects
    Gu, C
    Lee, MC
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) : 572 - 584