Combining the Right Features for Complex Event Recognition

被引:48
|
作者
Tang, Kevin [1 ]
Yao, Bangpeng [1 ]
Li Fei-Fei [1 ]
Koller, Daphne [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
来源
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013年
关键词
D O I
10.1109/ICCV.2013.335
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle the problem of combining features extracted from video for complex event recognition. Feature combination is an especially relevant task in video data, as there are many features we can extract, ranging from image features computed from individual frames to video features that take temporal information into account. To combine features effectively, we propose a method that is able to be selective of different subsets of features, as some features or feature combinations may be uninformative for certain classes. We introduce a hierarchical method for combining features based on the AND/OR graph structure, where nodes in the graph represent combinations of different sets of features. Our method automatically learns the structure of the AND/OR graph using score-based structure learning, and we introduce an inference procedure that is able to efficiently compute structure scores. We present promising results and analysis on the difficult and large-scale 2011 TRECVID Multimedia Event Detection dataset [17].
引用
收藏
页码:2696 / 2703
页数:8
相关论文
共 50 条
  • [41] Spatiotemporal Features for Action Recognition and Salient Event Detection
    Rapantzikos, Konstantinos
    Avrithis, Yannis
    Kollias, Stefanos
    COGNITIVE COMPUTATION, 2011, 3 (01) : 167 - 184
  • [42] Spatiotemporal Features for Action Recognition and Salient Event Detection
    Konstantinos Rapantzikos
    Yannis Avrithis
    Stefanos Kollias
    Cognitive Computation, 2011, 3 : 167 - 184
  • [43] Combining SURF Descriptor and Complex Networks for Face Recognition
    Piotto, Joao Gilberto S.
    Lopes, Fabricio Martins
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 275 - 279
  • [44] AT-A-DISTANCE PERSON RECOGNITION VIA COMBINING OCULAR FEATURES
    Verma, Shalini
    Mittal, Paritosh
    Vatsa, Mayank
    Singh, Richa
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3131 - 3135
  • [45] Named Entity Recognition Architecture Combining Contextual and Global Features
    Tran Thi Hong Hanh
    Doucet, Antoine
    Sidere, Nicolas
    Moreno, Jose G.
    Pollak, Senja
    TOWARDS OPEN AND TRUSTWORTHY DIGITAL SOCIETIES, ICADL 2021, 2021, 13133 : 264 - 276
  • [46] Combining acoustic features for improved emotion recognition in Mandarin speech
    Pao, TL
    Chen, YT
    Yeh, JH
    Liao, WY
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 279 - 285
  • [47] Efficient fall activity recognition by combining shape and motion features
    Iazzi, Abderrazak
    Rziza, Mohammed
    Thami, Rachid Oulad Haj
    COMPUTATIONAL VISUAL MEDIA, 2020, 6 (03) : 247 - 263
  • [48] Combining spectral and fractal features for emotion recognition on Electroencephalographic signals
    1600, World Scientific and Engineering Academy and Society, Ag. Ioannou Theologou 17-23, Zographou, Athens, 15773, Greece (10):
  • [49] The methods for combining the information of various kinds of features in speech recognition
    WANG Chengyou
    TANG Shuqi
    LIANG Diannong
    CHEN Huihuang and TANG Zhaojing(National University of Defence Technology Changsha 410073)Received
    ChineseJournalofAcoustics, 1997, (02) : 115 - 120
  • [50] Adaptive gesture recognition combining HMM models and geometrical features
    Cheng, Pu
    Zhou, Jie
    MIPPR 2011: PATTERN RECOGNITION AND COMPUTER VISION, 2011, 8004