Classification Matters: Improving Video Action Detection with Class-Specific Attention

被引:0
|
作者
Lee, Jinsung [1 ,2 ]
Kim, Taeoh [2 ]
Lee, Inwoong [2 ]
Shim, Minho [2 ]
Wee, Dongyoon [2 ]
Cho, Minsu [1 ]
Kwak, Suha [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, Pohang Si, South Korea
[2] NAVER Cloud, Seongnam Si, South Korea
来源
关键词
Video action detection; Video transformer;
D O I
10.1007/978-3-031-72661-3_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video action detection (VAD) aims to detect actors and classify their actions in a video. We figure that VAD suffers more from classification rather than localization of actors. Hence, we analyze how prevailing methods form features for classification and find that they prioritize actor regions, yet often overlooking the essential contextual information necessary for accurate classification. Accordingly, we propose to reduce the bias toward actor and encourage paying attention to the context that is relevant to each action class. By assigning a class-dedicated query to each action class, our model can dynamically determine where to focus for effective classification. The proposed model demonstrates superior performance on three challenging benchmarks with significantly fewer parameters and less computation.
引用
收藏
页码:450 / 467
页数:18
相关论文
共 50 条
  • [21] Class-specific correction and classification of NIR spectra of edible oils
    Alagappan, Lakshmi
    Chu, Jia En
    Chua, Joanna Huixin
    Ding, Jia Wen
    Xiao, Ronghui
    Yu, Zhe
    Pan, Kun
    Elejalde, Untzizu
    Lim, Kevin Junliang
    Wong, Limsoon
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2023, 241
  • [22] CLASS-SPECIFIC MODEL MIXTURES FOR THE CLASSIFICATION OF TIME-SERIES
    Baggenstoss, Paul M.
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2341 - 2345
  • [23] Class-Specific Guided Local Feature Selection for Data Classification
    Qian, Youcheng
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 645 - 649
  • [24] Class-Specific Sparse Principal Component Analysis for Visual Classification
    Pan, Fei
    Zhang, Zai-Xu
    Liu, Bao-Di
    Xie, Ji-Jun
    IEEE ACCESS, 2020, 8 : 110033 - 110047
  • [25] EEF: Exponentially Embedded Families With Class-Specific Features for Classification
    Tang, Bo
    Kay, Steven
    He, Haibo
    Baggenstoss, Paul M.
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (07) : 969 - 973
  • [26] Minimum class variance class-specific extreme learning machine for imbalanced classification
    Raghuwanshi, Bhagat Singh
    Shukla, Sanyam
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 178
  • [27] Learning class-specific edges for object detection and segmentation
    Prasad, Mukta
    Zisserman, Andrew
    Fitzgibbon, Andrew
    Kumar, M. Pawan
    Torr, P. H. S.
    COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2006, 4338 : 94 - +
  • [28] Time series classification by class-specific Mahalanobis distance measures
    Prekopcsak, Zoltan
    Lemire, Daniel
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2012, 6 (03) : 185 - 200
  • [29] CSMMI: Class-Specific Maximization of Mutual Information for Action and Gesture Recognition
    Wan, Jun
    Athitsos, Vassilis
    Jangyodsuk, Pat
    Jair Escalante, Hugo
    Ruan, Qiuqi
    Guyon, Isabelle
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (07) : 3152 - 3165
  • [30] Class-Specific Mahalanobis Distance Metric Learning for Biological Image Classification
    Mohan, B. S. Shajee
    Sekhar, C. Chandra
    IMAGE ANALYSIS AND RECOGNITION, PT II, 2012, 7325 : 240 - 248