Semantic R-CNN for Natural Language Object Detection

被引:0
|
作者
Ye, Shuxiong [1 ]
Qin, Zheng [1 ]
Xu, Kaiping [1 ]
Huang, Kai [1 ]
Wang, Guolong [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
关键词
Object detection; Natural language; RPN;
D O I
10.1007/978-3-319-77383-4_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a simple and effective framework for natural language object detection, to localize a target within an image based on description of the target. The method, called semantic R-CNN, extends RPN (Region Proposal Network) [1] by adding LSTM [20] module for processing natural language query text. LSTM [20] module take encoded query text and image descriptors as input and output the probability of the query text conditioned on visual features of candidate box and whole image. Those candidate boxes are generated by RPN and their local features are extracted by ROI pooling. RPN can be initialized from pre-trained Faster R-CNN model [1], transfers object visual knowledge from traditional object detection domain to our task. Experimental results demonstrate that our method significantly outperform previous baseline SCRC (Spatial Context Recurrent ConvNet) [7] model on Referit dataset [8], moreover, our model is simple to train similar to Faster R-CNN.
引用
收藏
页码:98 / 107
页数:10
相关论文
共 50 条
  • [21] A Page Object Detection Method Based on Mask R-CNN
    Xu, Canhui
    Shi, Cao
    Bi, Hengyue
    Liu, Chuanqi
    Yuan, Yongfeng
    Guo, Haoyan
    Chen, Yinong
    IEEE ACCESS, 2021, 9 : 143448 - 143457
  • [22] Libra R-CNN: Towards Balanced Learning for Object Detection
    Pang, Jiangmiao
    Chen, Kai
    Shi, Jianping
    Feng, Huajun
    Ouyang, Wanli
    Lin, Dahua
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 821 - 830
  • [23] A CLOSER LOOK: SMALL OBJECT DETECTION IN FASTER R-CNN
    Eggert, Christian
    Brehm, Stephan
    Winschel, Anton
    Zecha, Dan
    Lienhart, Rainer
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 421 - 426
  • [24] Mask R-CNN for Object Detection in Multitemporal SAR Images
    Qian, Yu
    Liu, Qin
    Zhu, Hongming
    Fan, Hongfei
    Du, Bowen
    Liu, Sicong
    2019 10TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2019,
  • [25] R-CNN Object Detection Inference With Deep Learning Accelerator
    Qian, Yuxin
    Zheng, Hongli
    He, Dazhi
    Zhang, Zhexi
    Zhang, Zongpu
    2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS), 2018, : 297 - 302
  • [26] Object Detection Algorithm Based on Improved Faster R-CNN
    Zhou Bing
    Li Runxin
    Shang Zhenhong
    Li Xiaowu
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (10)
  • [27] Domain Adaptive Faster R-CNN for Object Detection in the Wild
    Chen, Yuhua
    Li, Wen
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
  • [28] Irregular Target Object Detection Based on Faster R-CNN
    Zhang, Bin
    Zhang, Yubo
    Pan, Qinghui
    2018 4TH INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION, 2019, 252
  • [29] A Novel Keypoint Supplemented R-CNN for UAV Object Detection
    Butler, Justin
    Leung, Henry
    IEEE SENSORS JOURNAL, 2023, 23 (24) : 30883 - 30892
  • [30] A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
    Liu, Hou-, I
    Tseng, Yu-Wen
    Chang, Kai-Cheng
    Wang, Pin-Jyun
    Shuai, Hong-Han
    Cheng, Wen-Huang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15