A survey on visual surveillance of object motion and behaviors

被引:1298
作者
Hu, WM [1 ]
Tan, TN
Wang, L
Maybank, S
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China
[2] Univ London Birkbeck Coll, Sch Comp Sci & Informat Syst, London WC1E 7HX, England
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS | 2004年 / 34卷 / 03期
基金
中国国家自然科学基金;
关键词
behavior understanding and description; fusion of data from multiple cameras; motion detection; personal identification; tracking; visual surveillance;
D O I
10.1109/TSMCC.2004.829274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual surveillance in dynamic scenes, especially for humans and vehicles, is currently one of the most active research topics in computer vision. It has a wide spectrum of promising applications, including access control in special areas, human identification at a distance, crowd flux statistics And congestion analysis, detection of anomalous behaviors, and interactive surveillance using multiple cameras, etc. In general, the processing framework of visual surveillance in dynamic scenes includes the following stages: modeling of environments, detection of motion, classification of moving objects, tracking, understanding and description of behaviors, human identification, and fusion of data from multiple cameras. We review recent developments and general strategies of all these stages. Finally, we analyze possible research directions, e.g., occlusion handling, a combination of two-and three-dimensional tracking, a. combination of motion analysis and biometrics, anomaly detection and behavior prediction, content-based retrieval of surveillance videos, behavior understanding and natural language description, fusion of information from multiple sensors, and remote surveillance.
引用
收藏
页码:334 / 352
页数:19
相关论文
共 182 条
[21]   A state-based approach to the representation and recognition of gesture [J].
Bobick, AF ;
Wilson, AD .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (12) :1325-1337
[22]  
BOULT T, 1998, P DARPA IM UND WORKS, P305
[23]   Discovery and segmentation of activities in video [J].
Brand, M ;
Kettnaker, V .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :844-851
[24]   Coupled hidden Markov models for complex action recognition [J].
Brand, M ;
Oliver, N ;
Pentland, A .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :994-999
[25]   Understanding manipulation in video [J].
Brand, M .
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, :94-99
[26]   Learning and recognizing human dynamics in video sequences [J].
Bregler, C .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :568-574
[27]  
BREMOND F, 1998, P INT WORKSH INT VIS, P57
[28]   Tracking human motion in structured environments using a distributed-camera system [J].
Cai, Q ;
Aggarwal, JK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (11) :1241-1247
[29]  
CAI W, 1996, P INT C PATT REC, P68
[30]   Spatio-temporal alignment of sequences [J].
Caspi, Y ;
Irani, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (11) :1409-1424