Automatic Annotation of Human Actions in Video

被引:105
|
作者
Duchenne, Olivier [1 ]
Laptev, Ivan [1 ]
Sivic, Josef [1 ]
Bach, Francis [1 ]
Ponce, Jean [1 ]
机构
[1] INRIA, Ecole Normale Super, Paris, France
关键词
D O I
10.1109/ICCV.2009.5459279
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the problem of automatic temporal annotation of realistic human actions in video using minimal manual supervision. To this end we consider two associated problems: (a) weakly-supervised learning of action models from readily available annotations, and (b) temporal localization of human actions in test videos. To avoid the prohibitive cost of manual annotation for training, we use movie scripts as a means of weak supervision. Scripts, however, provide only implicit, noisy, and imprecise information about the type and location of actions in video. We address this problem with a kernel-based discriminative clustering algorithm that locates actions in the weakly-labeled training data. Using the obtained action samples, we train temporal action detectors and apply them to locate actions in the raw video data. Our experiments demonstrate that the proposed method for weakly-supervised learning of action models leads to significant improvement in action detection. We present detection results for three action classes in four feature length movies with challenging and realistic video data.
引用
收藏
页码:1491 / 1498
页数:8
相关论文
共 50 条
  • [21] Automatic Detection of Repetitive Actions in a Video
    Wehbe, Hassan
    Joly, Philippe
    Haidar, Bassem
    2015 13TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2015,
  • [22] An interactive tool for manual, semi-automatic and automatic video annotation
    Bianco, Simone
    Ciocca, Gianluigi
    Napoletano, Paolo
    Schettini, Raimondo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 131 : 88 - 99
  • [23] Automatic Video Annotation with Adaptive Number of Key Words
    Wang, Fangshi
    Lu, Wei
    Liu, Jingen
    Shah, Mubarak
    Xu, De
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1247 - +
  • [24] Automatic video annotation and retrieval based on Bayesian inference
    Wang, Fangshi
    Xu, De
    Lu, Wei
    Wu, Weixin
    ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 279 - 288
  • [25] Automatic dominant camera motion annotation for video retrieval
    Xiong, W
    Lee, JCM
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES VI, 1997, 3312 : 50 - 59
  • [26] An efficient automatic video shot size annotation scheme
    Wang, Meng
    Hua, Xian-Sheng
    Song, Yan
    Lai, Wei
    Dai, Li-Rong
    Wang, Ren-Hua
    ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 649 - 658
  • [27] A Semi-automatic Annotation Tool For Cooking Video
    Bianco, Simone
    Ciocca, Gianluigi
    Napoletano, Paolo
    Schettini, Raimondo
    Margherita, Roberto
    Marini, Gianluca
    Gianforme, Giorgio
    Pantaleo, Giuseppe
    IMAGE PROCESSING: MACHINE VISION APPLICATIONS VI, 2013, 8661
  • [28] Expressive semantics for automatic annotation and retrieval of video streams
    Del Bimbo, A
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 671 - 674
  • [29] Automatic Generation of Interactive Cooking Video with Semantic Annotation
    Oh, Kyeong-Jin
    Hong, Myung-Duk
    Yoon, Ui-Nyoung
    Jo, Geun-Sik
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2016, 22 (06) : 742 - 759
  • [30] A methodology for image annotation of human actions in videos
    Waheed, Moomina
    Hussain, Shahid
    Khan, Arif Ali
    Ahmed, Mansoor
    Ahmad, Bashir
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (33-34) : 24347 - 24365