Coarse Mask Guided Interactive Object Segmentation

Cited by: 3
Authors
Li, Jing [1, 2]
Fan, Junsong [3, 4]
Wang, Yuxi [3, 4]
Yang, Yuran [5]
Zhang, Zhaoxiang [4, 6, 7, 8]
Affiliations
[1] Chinese Acad Sci CASIA, Inst Automat, Ctr Res Intelligent Percept & Comp CRIPAC, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 100190, Peoples R China
[3] Chinese Acad Sci CASIA, Inst Automat, Ctr Res Intelligent Percept & Comp CRIPAC, Beijing 100190, Peoples R China
[4] HKISI CAS, Ctr Artificial Intelligence & Robot, Hong Kong, Peoples R China
[5] Tencent Maps, Beijing 100101, Peoples R China
[6] Chinese Acad Sci CASIA, Inst Automat, Beijing 100190, Peoples R China
[7] Univ Chinese Acad Sci UCAS, Sch Future Technol, Beijing 100049, Peoples R China
[8] State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Segmentation; interactive; transformer; annotation tool; RANDOM-WALKS; IMAGE; CUT;
DOI
10.1109/TIP.2023.3322564
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Interactive object segmentation aims to produce object masks from user interactions such as clicks, bounding boxes, and scribbles. Clicks are the most popular interactive cue because of their efficiency, and related deep learning methods have attracted considerable interest in recent years. Most works encode click points as Gaussian maps and concatenate them with the image as the model's input. However, the spatial and semantic information in the Gaussian maps becomes noisy as it passes through multiple convolution layers and is not fully exploited by the top layers for mask prediction. To pass click information to the top layers accurately and efficiently, we propose a coarse mask guided model (CMG), which predicts coarse masks with a coarse module to guide the object mask prediction. Specifically, the coarse module encodes user clicks as query features and enriches their semantic information with backbone features through transformer layers; coarse masks are then generated from the enriched query features and fed into CMG's decoder. Benefiting from the efficiency of the transformer, CMG's coarse module and decoder module are lightweight and computationally efficient, making the interaction process smoother. Experiments on several segmentation benchmarks demonstrate the effectiveness of our method, which achieves new state-of-the-art results compared with previous works.
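For illustration, the baseline input encoding that the abstract attributes to most prior works (click points rendered as Gaussian maps and concatenated with the image) might be sketched as follows. This is not the CMG model itself, and the abstract does not specify these details: the function names, the `sigma` value, the use of a per-pixel maximum over clicks, and the split into separate positive-click and negative-click channels are all assumptions made for this example.

```python
import math


def gaussian_click_map(height, width, clicks, sigma=10.0):
    """Return an H x W map with the max Gaussian response over all clicks.

    `clicks` is a list of (row, col) pixel coordinates. The map is all zeros
    when there are no clicks. (Hypothetical helper, not from the paper.)
    """
    heat = [[0.0] * width for _ in range(height)]
    for cy, cx in clicks:
        for y in range(height):
            for x in range(width):
                d2 = (y - cy) ** 2 + (x - cx) ** 2
                value = math.exp(-d2 / (2.0 * sigma ** 2))
                if value > heat[y][x]:
                    heat[y][x] = value
    return heat


def build_model_input(image_channels, positive_clicks, negative_clicks, sigma=10.0):
    """Concatenate image channels with two click maps along the channel axis.

    `image_channels` is a list of H x W nested lists (e.g. 3 for RGB).
    The result has len(image_channels) + 2 channels. Using one channel for
    positive clicks and one for negative clicks is an assumption.
    """
    height = len(image_channels[0])
    width = len(image_channels[0][0])
    pos = gaussian_click_map(height, width, positive_clicks, sigma)
    neg = gaussian_click_map(height, width, negative_clicks, sigma)
    return list(image_channels) + [pos, neg]


# Usage: a 6 x 8 blank RGB image with one positive click at row 2, column 3.
image = [[[0.0] * 8 for _ in range(6)] for _ in range(3)]
model_input = build_model_input(image, [(2, 3)], [], sigma=2.0)
print(len(model_input))  # number of input channels
```

In this encoding the click information exists only at the input layer, which is the limitation the abstract describes: deeper convolution layers see it only indirectly.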
Pages: 5808-5822
Page count: 15
Related papers
50 in total
  • [21] Mask Selection and Propagation for Unsupervised Video Object Segmentation
    Garg, Shubhika
    Goel, Vidit
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1679 - 1689
  • [22] MEM: Mask Enhancement Model for Video Object Segmentation
    Abdelfattah, Islam
    Shehata, Mohamed S.
    ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT I, 2025, 15046 : 262 - 274
  • [23] Improvement of Mask-RCNN Object Segmentation Algorithm
    Wu, Xin
    Wen, Shiguang
    Xie, Yuan-ai
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT I, 2019, 11740 : 582 - 591
  • [24] Segmentation mask and feature similarity loss guided GAN for object-oriented image-to-image translation
    Qin, Zhen
    Chen, Qingya
    Ding, Yi
    Zhuang, Tianming
    Qin, Zhiguang
    Choo, Kim-Kwang Raymond
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (03)
  • [25] Segmentation mask-guided person image generation
    Meichen Liu
    Xin Yan
    Chenhui Wang
    Kejun Wang
    Applied Intelligence, 2021, 51 : 1161 - 1176
  • [26] Segmentation mask-guided person image generation
    Liu, Meichen
    Yan, Xin
    Wang, Chenhui
    Wang, Kejun
    APPLIED INTELLIGENCE, 2021, 51 (02) : 1161 - 1176
  • [27] Uncertainty-Guided Segmentation Network for Geospatial Object Segmentation
    Jia, Hongyu
    Yang, Wenwu
    Wang, Lin
    Li, Haolin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5824 - 5833
  • [28] Interactive Segmentation for Manga using Lossless Thinning and Coarse Labeling
    Aramaki, Yuji
    Matsui, Yusuke
    Yamasaki, Toshihiko
    Aizawa, Kiyoharu
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 293 - 296
  • [29] Research on Interactive Image Segmentation Algorithm Based on Coarse to Fine
    Yang, Kai
    Long, Jianwu
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 331 - 334
  • [30] Visual Attention Guided Video Object Segmentation
    Liang, Hao
    Tan, Yihua
    PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, : 345 - 349