Automatic Annotation of Human Actions in Video

被引：105

作者：

Duchenne, Olivier ^{[1
]}

Laptev, Ivan ^{[1
]}

Sivic, Josef ^{[1
]}

Bach, Francis ^{[1
]}

Ponce, Jean ^{[1
]}

机构：

[1] INRIA, Ecole Normale Super, Paris, France

来源：

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2009年

关键词：

D O I：

10.1109/ICCV.2009.5459279

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper addresses the problem of automatic temporal annotation of realistic human actions in video using minimal manual supervision. To this end we consider two associated problems: (a) weakly-supervised learning of action models from readily available annotations, and (b) temporal localization of human actions in test videos. To avoid the prohibitive cost of manual annotation for training, we use movie scripts as a means of weak supervision. Scripts, however, provide only implicit, noisy, and imprecise information about the type and location of actions in video. We address this problem with a kernel-based discriminative clustering algorithm that locates actions in the weakly-labeled training data. Using the obtained action samples, we train temporal action detectors and apply them to locate actions in the raw video data. Our experiments demonstrate that the proposed method for weakly-supervised learning of action models leads to significant improvement in action detection. We present detection results for three action classes in four feature length movies with challenging and realistic video data.

引用

页码：1491 / 1498

页数：8

共 50 条

[21] Automatic Detection of Repetitive Actions in a Video
Wehbe, Hassan
Joly, Philippe
Haidar, Bassem
2015 13TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2015,
[22] An interactive tool for manual, semi-automatic and automatic video annotation
Bianco, Simone
Ciocca, Gianluigi
Napoletano, Paolo
Schettini, Raimondo
COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 131 : 88 - 99
[23] Automatic Video Annotation with Adaptive Number of Key Words
Wang, Fangshi
Lu, Wei
Liu, Jingen
Shah, Mubarak
Xu, De
19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1247 - +
[24] Automatic video annotation and retrieval based on Bayesian inference
Wang, Fangshi
Xu, De
Lu, Wei
Wu, Weixin
ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 279 - 288
[25] Automatic dominant camera motion annotation for video retrieval
Xiong, W
Lee, JCM
STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES VI, 1997, 3312 : 50 - 59
[26] An efficient automatic video shot size annotation scheme
Wang, Meng
Hua, Xian-Sheng
Song, Yan
Lai, Wei
Dai, Li-Rong
Wang, Ren-Hua
ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 649 - 658
[27] A Semi-automatic Annotation Tool For Cooking Video
Bianco, Simone
Ciocca, Gianluigi
Napoletano, Paolo
Schettini, Raimondo
Margherita, Roberto
Marini, Gianluca
Gianforme, Giorgio
Pantaleo, Giuseppe
IMAGE PROCESSING: MACHINE VISION APPLICATIONS VI, 2013, 8661
[28] Expressive semantics for automatic annotation and retrieval of video streams
Del Bimbo, A
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 671 - 674
[29] Automatic Generation of Interactive Cooking Video with Semantic Annotation
Oh, Kyeong-Jin
Hong, Myung-Duk
Yoon, Ui-Nyoung
Jo, Geun-Sik
JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2016, 22 (06) : 742 - 759
[30] A methodology for image annotation of human actions in videos
Waheed, Moomina
Hussain, Shahid
Khan, Arif Ali
Ahmed, Mansoor
Ahmad, Bashir
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (33-34) : 24347 - 24365

← 1 2 3 4 5 →