MINet: Meta-Learning Instance Identifiers for Video Object Detection

被引:13
|
作者
Deng, Jiajun [1 ]
Pan, Yingwei [2 ]
Yao, Ting [2 ]
Zhou, Wengang [1 ]
Li, Houqiang [1 ]
Mei, Tao [2 ]
机构
[1] Univ Sci & Technol China USTC, Dept Elect Engn & Informat Sci, Hefei 230026, Peoples R China
[2] JD AI Res, Beijing 100105, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Feature extraction; Detectors; Proposals; Optical imaging; Robustness; History; Video object detection; meta learning; memory network; box association; NETWORKS;
D O I
10.1109/TIP.2021.3099409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in video object detection have characterized the exploration of temporal coherence across frames to enhance object detector. Nevertheless, previous solutions either rely on additional inputs (e.g., optical flow) to guide feature aggregation, or complex post-processing to associate bounding boxes. In this paper, we introduce a simple but effective design that learns instance identifiers for instance association in a meta-learning paradigm, which requires no auxiliary inputs or post-processing. Specifically, we present Meta-Learnt Instance Identifier Networks (namely MINet) that novelly meta-learns instance identifiers to recognize identical instances across frames in a single forward-pass, leading to the robust online linking of instances. Technically, depending on the detection results of previous frames, we teach MINet to learn the weights of an instance identifier on the fly, which can be well applied to up-coming frames. Such meta-learning paradigm enables instance identifiers to be flexibly adapted to novel frames at inference. Furthermore, MINet writes/updates the detection results of previous instances into memory and reads from memory when performing inference to encourage temporal consistency for video object detection. Our MINet is appealing in the sense that it is pluggable to any object detection model. Extensive experiments on ImageNet VID dataset demonstrate the superiority of MINet. More remarkably, by integrating MINet into Faster R-CNN, we achieve 80.2% mAP on ImageNet VID dataset.
引用
收藏
页码:6879 / 6891
页数:13
相关论文
共 50 条
  • [1] Tracking by Instance Detection: A Meta-Learning Approach
    Wang, Guangting
    Luo, Chong
    Sun, Xiaoyan
    Xiong, Zhiwei
    Zeng, Wenjun
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6287 - 6296
  • [2] Video Object Verification via Meta-learning
    Onur, Irem Beyza
    Gurkan, Filiz
    Gunsel, Bilge
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [3] Incremental Object Detection via Meta-Learning
    Joseph, K. J.
    Rajasegaran, Jathushan
    Khan, Salman
    Khan, Fahad Shahbaz
    Balasubramanian, Vineeth N.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9209 - 9216
  • [4] Meta-Learning Deep Visual Words for Fast Video Object Segmentation
    Behl, Harkirat Singh
    Najafi, Mohammad
    Arnab, Anurag
    Torr, Philip H. S.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 8484 - 8491
  • [5] Choosing instance selection method using meta-learning
    Moura, Shayane de Oliveira
    de Freitas, Marcelo Bassani
    Cardoso, Halisson A. C.
    Cavalcanti, George D. C.
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 2003 - 2007
  • [6] Few-Shot Cross-Domain Object Detection With Instance-Level Prototype-Based Meta-Learning
    Zhang, Lin
    Zhang, Bo
    Shi, Botian
    Fan, Jiayuan
    Chen, Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9078 - 9089
  • [7] MDPruner: Meta-Learning Driven Dynamic Filter Pruning for Efficient Object Detection
    Zhou, Lingyun
    Liu, Xiaoyong
    IEEE ACCESS, 2024, 12 : 136925 - 136935
  • [8] Meta-Learning for Data Summarization Based on Instance Selection Method
    Smith-Miles, Kate
    Islam, Rafiqul
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [9] On the use of meta-learning for instance selection: An architecture and an experimental study
    Leyva, Enrique
    Caises, Yoel
    Gonzalez, Antonio
    Perez, Raul
    INFORMATION SCIENCES, 2014, 266 : 16 - 30
  • [10] Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection using Meta-Learning
    Vs, Vibashan
    Poster, Domenick
    You, Suya
    Hu, Shuowen
    Patel, Vishal M.
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3697 - 3706