Learning Multi-view Deep Features for Small Object Retrieval in Surveillance Scenarios

被引:13
|
作者
Guo, Haiyun [1 ]
Wang, Jinqiao [1 ]
Xu, Min [2 ]
Zha, Zheng-Jun [3 ]
Lu, Hanqing [1 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Technol Sydney, Sch Comp & Commun, GBDTC, Sydney, NSW, Australia
[3] Chinese Acad Sci, Inst Intelligent Machines, Hefei, Peoples R China
关键词
Object retrieval; Object representation;
D O I
10.1145/2733373.2806349
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the explosive growth of surveillance videos, object retrieval has become a significant task for security monitoring. However, visual objects in surveillance videos are usually of small size with complex light conditions, view changes and partial occlusions, which increases the difficulty level of efficiently retrieving objects of interest in a large-scale dataset. Although deep features have achieved promising results on object classification and retrieval and have been verified to contain rich semantic structure property, they lack of adequate color information, which is as crucial as structure information for effective object representation. In this paper, we propose to leverage discriminative Convolutional Neural Network (CNN) to learn deep structure and color feature to form an efficient multi-view object representation. Specifically, we utilize CNN trained on ImageNet to abstract rich semantic structure information. Meanwhile, we propose a CNN model supervised by 11 color names to extract deep color features. Compared with traditional color descriptors, deep color features can capture the common color property across different illumination conditions. Then, the complementary multi-view deep features are encoded into short binary codes by Locality-Sensitive Hash (LSH) and fused to retrieve objects. Retrieval experiments are performed on a dataset of 100k objects extracted from multi-camera surveillance videos. Comparison results with several popular visual descriptors show the effectiveness of the proposed approach.
引用
收藏
页码:859 / 862
页数:4
相关论文
共 50 条
  • [1] Multi-View Object Detection Based on Deep Learning
    Tang, Cong
    Ling, Yongshun
    Yang, Xing
    Jin, Wei
    Zheng, Chao
    APPLIED SCIENCES-BASEL, 2018, 8 (09):
  • [2] Multiple deep features learning for object retrieval in surveillance videos
    Guo, Haiyun
    Wang, Jinqiao
    Lu, Hanqing
    IET COMPUTER VISION, 2016, 10 (04) : 268 - 272
  • [3] Multi-View 3D Object Retrieval With Deep Embedding Network
    Guo, Haiyun
    Wang, Jinqiao
    Gao, Yue
    Li, Jianqiang
    Lu, Hanqing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (12) : 5526 - 5537
  • [4] Multi-View Learning Based on Common and Special Features in Multi-Task Scenarios
    Lu, Runkun
    Zuo, Xin
    Liu, Jianwei
    Lian, Siming
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9410 - 9415
  • [5] Object detection method of multi-view SSD based on deep learning
    Tang C.
    Ling Y.
    Zheng K.
    Yang X.
    Zheng C.
    Yang H.
    Jin W.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2018, 47 (01):
  • [6] Multi-view 3D object retrieval leveraging the aggregation of view and instance attentive features
    Lin, Dongyun
    Li, Yiqun
    Cheng, Yi
    Prasad, Shitala
    Nwe, Tin Lay
    Dong, Sheng
    Guo, Aiyuan
    KNOWLEDGE-BASED SYSTEMS, 2022, 247
  • [7] Unbalanced Multi-view Deep Learning
    Xu, Cai
    Li, Zehui
    Guan, Ziyu
    Zhao, Wei
    Song, Xiangyu
    Wu, Yue
    Li, Jianxin
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3051 - 3059
  • [8] Deep Multi-View Concept Learning
    Xu, Cai
    Guan, Ziyu
    Zhao, Wei
    Niu, Yunfei
    Wang, Quan
    Wang, Zhiheng
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2898 - 2904
  • [9] Deep Multi-View Learning to Rank
    Cao, Guanqun
    Iosifidis, Alexandros
    Gabbouj, Moncef
    Raghavan, Vijay
    Gottumukkala, Raju
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1426 - 1438
  • [10] On Deep Multi-View Representation Learning
    Wang, Weiran
    Arora, Raman
    Livescu, Karen
    Bilmes, Jeff
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1083 - 1092