Bilateral Knowledge Interaction Network for Referring Image Segmentation

被引:7
|
作者
Ding, Haixin [1 ]
Zhang, Shengchuan [1 ]
Wu, Qiong [1 ]
Yu, Songlin [1 ]
Hu, Jie [1 ]
Cao, Liujuan [1 ]
Ji, Rongrong [1 ]
机构
[1] Xiamen Univ, Key Lab Multimedia Trusted Percept & Efficient Com, Minist Educ China, Xiamen 361005, Peoples R China
关键词
Image segmentation; Visualization; Kernel; Knowledge engineering; Feature extraction; Semantics; Convolution; Referring image segmentation; vision-language; AGGREGATION;
D O I
10.1109/TMM.2023.3305869
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Referring image segmentation aims to segment objects that are described by natural language expressions. Although remarkable advancements have been made to align natural language expressions with visual representations for better performance, the interaction between image-level and text-level information is still not formulated properly. Most of the previous works focus on building correlations between vision and language, ignoring the variety of objects. The target objects with unique appearances may not be correctly located or completely segmented. In this article, we propose a novel Bilateral Knowledge Interaction Network, termed BKINet, which reformulates the image-text interaction in a bilateral manner to adapt concrete knowledge of the target object in the image. BKINet contains two key components: a knowledge learning module (KLM) and a knowledge applying module (KAM). In the KLM, the abstract knowledge from text features is replenished with concrete knowledge from visual features to adapt to the target objects in the input images, which generates the knowledge interaction kernels (KI kernels) containing abundant referring information. With the referring information of KI kernels, the KAM is designed to highlight the most relevant visual features for predicting the accurate segmentation mask. Extensive experiments on three widely-used datasets, i.e. RefCOCO, RefCOCO+, and G-ref, demonstrate the superiority of BKINet over the state-of-the-art.
引用
收藏
页码:2966 / 2977
页数:12
相关论文
共 50 条
  • [41] Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network
    Huang, Ziling
    Satoh, Shin'ichi
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7753 - 7762
  • [42] Cross-modal fusion encoder via graph neural network for referring image segmentation
    Zhang, Yuqing
    Zhang, Yong
    Piao, Xinglin
    Yuan, Peng
    Hu, Yongli
    Yin, Baocai
    IET IMAGE PROCESSING, 2024, 18 (04) : 1083 - 1095
  • [43] Bilateral Supervision Network for Semi-Supervised Medical Image Segmentation
    He, Along
    Li, Tao
    Yan, Juncheng
    Wang, Kai
    Fu, Huazhu
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (05) : 1715 - 1726
  • [44] PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
    Liu, Jiang
    Ding, Hui
    Cai, Zhaowei
    Zhang, Yuting
    Satzoda, Ravi Kumar
    Mahadevan, Vijay
    Manmatha, R.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18653 - 18663
  • [45] CRIS: CLIP-Driven Referring Image Segmentation
    Wang, Zhaoqing
    Lu, Yu
    Li, Qiang
    Tao, Xunqiang
    Guo, Yandong
    Gong, Mingming
    Liu, Tongliang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11676 - 11685
  • [46] BPCN: bilateral progressive compensation network for lung infection image segmentation
    Wang, Xiaoyan
    Yang, Baoqi
    Pan, Xiang
    Liu, Fuchang
    Zhang, Sanyuan
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (03):
  • [47] Semantic segmentation of remote sensing image based on bilateral branch network
    Zhongyu Li
    Huajun Wang
    Yang Liu
    The Visual Computer, 2024, 40 : 3069 - 3090
  • [48] Attentive Excitation and Aggregation for Bilingual Referring Image Segmentation
    Zhou, Qianli
    Hui, Tianrui
    Wang, Rong
    Hu, Haimiao
    Liu, Si
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (02)
  • [49] A survey of methods for addressing the challenges of referring image segmentation
    Ji, Lixia
    Du, Yunlong
    Dang, Yiping
    Gao, Wenzhao
    Zhang, Han
    NEUROCOMPUTING, 2024, 583
  • [50] Locate then Segment: A Strong Pipeline for Referring Image Segmentation
    Jing, Ya
    Kong, Tao
    Wang, Wei
    Wang, Liang
    Li, Lei
    Tan, Tieniu
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9853 - 9862