Relevancy between Objects Based on Common Sense for Semantic Segmentation

被引：1

作者：

Zhou, Jun ^{[1
]}

Bai, Xing ^{[1
]}

Zhang, Qin ^{[2
]}

机构：

[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China

[2] Chinese Acad Sci, Natl Sci Lib, Beijing 100190, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期

关键词：

semantic segmentation; scene parsing; computer vision;

D O I：

10.3390/app122412711

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Research on image classification sparked the latest deep-learning boom. Many downstream tasks, including semantic segmentation, benefit from it. The state-of-the-art semantic segmentation models are all based on deep learning, and they sometimes make some semantic mistakes. In a semantic segmentation dataset with a small number of categories, images are often collected from a single scene, and there is a close semantic connection between any two categories. However, in the semantic segmentation dataset collected from multiple scenes, two categories may be irrelevant. The probability of objects in one category appearing next to objects in other categories is different, which is the basis of the paper. Semantic segmentation methods need to solve two problems of positioning and classification. This paper is dedicated to correcting those clearly wrong classifications that are contrary to reality. Specifically, we first calculate the relevancy between different class pairs. Then, based on this knowledge, we infer the category of a connected component according to the relationships of this connected component with its surrounding connected components and correct the obviously wrong classifications made by a deep learning semantic segmentation model. Several well-performing deep learning models are experimented on two challenging public datasets in the field of semantic image segmentation. Our proposed method improves the performance of UPerNet, OCRNet and SETR from 40.7%, 43% and 48.64% to 42.07%, 44.09% and 49.09% mean IoU on the ADE20K validation set, and the performance of PSPNet, DeepLabV3 and OCRNet from 37.26%, 37.3% and 39.5% to 38.93%, 38.95% and 40.63% mean IoU on the COCO-Stuff dataset, which shows the effectiveness of the method.

引用

页数：10

共 50 条

[1] A word sense disambiguation algorithm for the simple semantic units based on semantic relevancy
Liu, Y. (Liuyt_liuyt@126.com), 1600, Binary Information Press (10):
[2] Natural language, relevancy ranking, and common sense
Clark, D
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (04): : 17 - 19
[3] Knowledge-based computational modeling on semantic relevancy between words
Wang, Hongling
Lv, Qiang
Zhou, Guodong
RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 63 - 67
[4] A semantic approach to segmentation of overlapping objects
Wittenberg, T
Grobe, M
Münzenmayer, C
Kuziela, H
Spinnler, K
METHODS OF INFORMATION IN MEDICINE, 2004, 43 (04) : 343 - 353
[5] RESEARCH AND IMPLEMENTATION ON SEMANTIC RELEVANCY COMPUTING BETWEEN WORDS
Li, Min
Yin, Shao-Hong
FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2012), 2012, : 333 - 338
[6] Plants Don't Walk on the Street: Common-Sense Reasoning for Reliable Semantic Segmentation
Adilova, Linara
Schulz, Elena
Akila, Maram
Houben, Sebastian
Schneider, Jan David
Huger, Fabian
Wirtz, Tim
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 85 - 92
[7] Towards automatic segmentation of semantic video objects
Feng, Y
Jiang, J
Lpsm, SS
EUROCON 2005: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOL 1 AND 2 , PROCEEDINGS, 2005, : 987 - 990
[8] Dense Semantic Image Segmentation with Objects and Attributes
Zheng, Shuai
Cheng, Ming-Ming
Warrell, Jonathan
Sturgess, Paul
Vineet, Vibhav
Rother, Carsten
Torr, Philip H. S.
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3214 - 3221
[9] Semantic Segmentation of Objects from Airborne Imagery
Thuy Thi Nguyen
Sang Viet Dinh
Nguyen Tien Quang
Huynh Thi Thanh Binh
2017 FOURTH ASIAN CONFERENCE ON DEFENCE TECHNOLOGY - JAPAN (ACDT), 2017, : 140 - 145
[10] Semiautomatic segmentation and tracking of semantic video objects
Gu, C
Lee, MC
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) : 572 - 584

← 1 2 3 4 5 →