Semantic R-CNN for Natural Language Object Detection

被引:0
|
作者
Ye, Shuxiong [1 ]
Qin, Zheng [1 ]
Xu, Kaiping [1 ]
Huang, Kai [1 ]
Wang, Guolong [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
关键词
Object detection; Natural language; RPN;
D O I
10.1007/978-3-319-77383-4_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a simple and effective framework for natural language object detection, to localize a target within an image based on description of the target. The method, called semantic R-CNN, extends RPN (Region Proposal Network) [1] by adding LSTM [20] module for processing natural language query text. LSTM [20] module take encoded query text and image descriptors as input and output the probability of the query text conditioned on visual features of candidate box and whole image. Those candidate boxes are generated by RPN and their local features are extracted by ROI pooling. RPN can be initialized from pre-trained Faster R-CNN model [1], transfers object visual knowledge from traditional object detection domain to our task. Experimental results demonstrate that our method significantly outperform previous baseline SCRC (Spatial Context Recurrent ConvNet) [7] model on Referit dataset [8], moreover, our model is simple to train similar to Faster R-CNN.
引用
收藏
页码:98 / 107
页数:10
相关论文
共 50 条
  • [31] Atrous Faster R-CNN for Small Scale Object Detection
    Guan, Tongfan
    Zhu, Hao
    2017 2ND INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP), 2017, : 16 - 21
  • [32] Improvement of Object Detection Based on Faster R-CNN and YOLO
    Fan, Jiayi
    Lee, JangHyeon
    Jung, InSu
    Lee, YongKeun
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [33] Application of Mask R-CNN Algorithm for Apple Detection and Semantic Segmentation
    Jurewicz, Maciej
    Swiderski, Bartosz
    Kurek, Jarostaw
    PRZEGLAD ELEKTROTECHNICZNY, 2024, 100 (05): : 286 - 289
  • [34] Sparse R-CNN: An End-to-End Framework for Object Detection
    Sun, Peize
    Zhang, Rufeng
    Jiang, Yi
    Kong, Tao
    Xu, Chenfeng
    Zhan, Wei
    Tomizuka, Masayoshi
    Yuan, Zehuan
    Luo, Ping
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15650 - 15664
  • [35] DECONV R-CNN FOR SMALL OBJECT DETECTION ON REMOTE SENSING IMAGES
    Zhang, Wei
    Wang, Shihao
    Thachan, Sophanyouly
    Chen, Jingzhou
    Qian, Yuntao
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2483 - 2486
  • [36] Improved Faster R-CNN for Multi-Scale Object Detection
    Li X.
    Fu C.
    Li X.
    Wang Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (07): : 1095 - 1101
  • [37] Relief R-CNN: Utilizing Convolutional Features for Fast Object Detection
    Li, Guiying
    Liu, Junlong
    Jiang, Chunhui
    Zhang, Liangpeng
    Lin, Minlong
    Tang, Ke
    ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 386 - 394
  • [38] Foreign Object Detection of Transmission Lines Based on Faster R-CNN
    Guo, Shuqiang
    Bai, Qianlong
    Zhou, Xinxin
    INFORMATION SCIENCE AND APPLICATIONS, 2020, 621 : 269 - 275
  • [39] Faster R-CNN: an Approach to Real-Time Object Detection
    Gavrilescu, Raducu
    Fosalau, Cristian
    Zet, Cristian
    Skoczylas, Marcin
    Cotovanu, David
    2018 INTERNATIONAL CONFERENCE AND EXPOSITION ON ELECTRICAL AND POWER ENGINEERING (EPE), 2018, : 165 - 168
  • [40] Distributed Edge Cloud R-CNN for Real Time Object Detection
    Herrera, Joshua
    Demir, Mevlut A.
    Yousefi, Parsa
    Prevost, John J.
    Rad, Paul
    2018 WORLD AUTOMATION CONGRESS (WAC), 2018, : 146 - 151