Adaptive skew-sensitive ensembles for face recognition in video surveillance

被引:28
|
作者
De-la-Torre, Miguel [1 ,2 ]
Granger, Eric [1 ]
Sabourin, Robert [1 ]
Gorodnichy, Dmitry O. [3 ]
机构
[1] Univ Quebec, Ecole Technol Super, Lab Imagerie Vis & Intelligence Artificielle, Montreal, PQ H3C 3P8, Canada
[2] Univ Guadalajara, Ctr Univ Los Valles, Ameca, Mexico
[3] Canada Border Serv Agcy, Sci & Engn Directorate, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Adaptive classifier ensembles; Boolean combination; Imbalance estimation; Video-to-video face recognition; Video surveillance; Adaptive multiple classifier systems; CLASSIFIERS; CURVES;
D O I
10.1016/j.patcog.2015.05.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decision support systems for surveillance rely more and more on face recognition (FR) to detect target individuals of interest captured with video cameras. FR is a challenging problem in video surveillance due to variations in capture conditions, to camera interoperability, and to the limited representativeness of target facial models used for matching. Although adaptive classifier ensembles have been applied for robust face matching, it is often assumed that the proportions of faces captured for target and non-target individuals are balanced, known a priori, and do not change over time. Recently, some techniques have been proposed to adapt the fusion function of an ensemble according to class imbalance of the input data stream. For instance, Skew-Sensitive Boolean combination (SSEC) is a active approach that estimates target vs. non-target proportions periodically during operations using Hellinger distance, and adapts its ensemble fusion function to operational class imbalance. Beyond the challenges of estimating class imbalance, such techniques commonly generate diverse pools of classifiers by selecting balanced training data, limiting the potential diversity produced using the abundant non-target data. In this paper, adaptive skew-sensitive ensembles are proposed to combine classifiers trained by selecting data with varying levels of imbalance and complexity, to sustain a high level the performance for video-to-video FR. Faces captured for each person in the scene are tracked and regrouped into trajectories. During enrollment, captures in a reference trajectory are combined with selected non-target captures to generate a pool of 2-class classifiers using data with various levels of imbalance and complexity. During operations, the level of imbalance is periodically estimated from the input trajectories using the HDx quantification method, and pre-computed histogram representations of imbalanced data distributions. This approach allows one to adapt pre-computed histograms and ensemble fusion functions based on the imbalance and complexity of operational data. Finally, the ensemble scores are accumulated of trajectories for robust spatio-temporal recognition. Results on synthetic data show that adapting the fusion function of ensemble trained with different complexities and levels of imbalance can significantly improve performance. Results on the Face in Action video data show that the proposed method can outperform reference techniques (including SSBC and meta-classification) in imbalanced video surveillance environments. Transaction-based analysis shows that performance is consistently higher across operational imbalances. Individual-specific analysis indicates that goat- and lamb-like individuals can benefit the most from adaptation to the operational imbalance. Finally, trajectory-based analysis shows that a video-to-video FR system based on the proposed approach can maintain, and even improve overall system discrimination. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3385 / 3406
页数:22
相关论文
共 50 条
  • [21] Three-view surveillance video based face modeling for recognition
    Von Duhn, Scott
    Ko, Myung Jin
    Yin, Lijun
    Hung, Terry
    Wei, Xiaozhou
    2007 BIOMETRICS SYMPOSIUM, 2007, : 1 - 6
  • [22] On Video Based Face Recognition Through Adaptive Sparse Dictionary
    Khan, Naimul Mefraz
    Nan, Xiaoming
    Quddus, Azhar
    Rosales, Edward
    Guan, Ling
    2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 1, 2015,
  • [23] An adaptive classification system for video-based face recognition
    Connolly, Jean-Francois
    Granger, Eric
    Sabourin, Robert
    INFORMATION SCIENCES, 2012, 192 : 50 - 70
  • [24] Adaptive fusion of human visual sensitive features for surveillance video summarization
    Salehin, Md. Musfequs
    Paul, Manoranjan
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2017, 34 (05) : 814 - 826
  • [25] Face Deduplication in Video Surveillance
    Chen, Qi
    Yang, Li
    Zhang, Dongping
    Shen, Ye
    Huang, Shuying
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (03)
  • [26] Improved Likelihood Ratios for Surveillance Video Face Recognition with Multimodal Feature Pairing
    Rodriguez, Andrea Macarulla
    Geradts, Zeno
    Worring, Marcel
    Unzueta, Luis
    2023 11TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS, IWBF, 2023,
  • [27] Face recognition by the LDA-based algorithm for a video surveillance system on DSP
    Kim, JO
    Kim, JS
    Chung, CH
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 1, 2005, 3480 : 638 - 646
  • [28] Suspicious Face Detection Based on Key Frame Recognition Under Surveillance Video
    Zheng, Xiaohui
    Ning, Yi
    Chen, Xianjun
    Zhan, Yongsong
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2016, PT I, 2016, 9712 : 645 - 652
  • [29] Multimodal Low Resolution Face and Frontal Gait Recognition from Surveillance Video
    Maity, Sayan
    Abdel-Mottaleb, Mohamed
    Asfour, Shihab S.
    ELECTRONICS, 2021, 10 (09)
  • [30] Face recognition in poor-quality video: Evidence from security surveillance
    Burton, AM
    Wilson, S
    Cowan, M
    Bruce, V
    PSYCHOLOGICAL SCIENCE, 1999, 10 (03) : 243 - 248