Semi-supervised Active Learning for Video Action Detection

被引:0
|
作者
Singh, Ayush [1 ]
Rana, Aayush J. [2 ]
Kumar, Akash [2 ]
Vyas, Shruti [2 ]
Rawat, Yogesh Singh [2 ]
机构
[1] IIT ISM Dhanbad, Dhanbad, Bihar, India
[2] Univ Cent Florida, Orlando, FL USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we focus on label efficient learning for video action detection. We develop a novel semi-supervised active learning approach which utilizes both labeled as well as unlabeled data along with informative sample selection for action detection. Video action detection requires spatio-temporal localization along with classification, which poses several challenges for both active learning (informative sample selection) as well as semi-supervised learning (pseudo label generation). First, we propose NoiseAug, a simple augmentation strategy which effectively selects informative samples for video action detection. Next, we propose fft-attention, a novel technique based on high-pass filtering which enables effective utilization of pseudo label for SSL in video action detection by emphasizing on relevant activity region within a video. We evaluate the proposed approach on three different benchmark datasets, UCF-101-24, JHMDB-21, and Youtube-VOS. First, we demonstrate its effectiveness on video action detection where the proposed approach outperforms prior works in semi-supervised and weakly-supervised learning along with several baseline approaches in both UCF101-24 and JHMDB21. Next, we also show its effectiveness on Youtube-VOS for video object segmentation demonstrating its generalization capability for other dense prediction tasks in videos.
引用
收藏
页码:4891 / 4899
页数:9
相关论文
共 50 条
  • [1] Semi-supervised Learning for Multi-label Video Action Detection
    Zhang, Hongcheng
    Zhao, Xu
    Wang, Dongqi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2124 - 2134
  • [2] End-to-End Semi-Supervised Learning for Video Action Detection
    Kumar, Akash
    Rawat, Yogesh Singh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14680 - 14690
  • [3] Video annotation by active learning and semi-supervised ensembling
    Song, Yan
    Qi, Guo-Jun
    Hua, Xian-Sheng
    Dai, Li-Rong
    Wang, Ren-Hua
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 933 - 936
  • [4] Semi-Supervised Active Learning for Object Detection
    Chen, Sijin
    Yang, Yingyun
    Hua, Yan
    ELECTRONICS, 2023, 12 (02)
  • [5] Semi-Supervised Active Learning for Anomaly Detection in Aviation
    Memarzadeh, Milad
    Matthews, Bryan
    Templin, Thomas
    Sharif Rohani, Aida
    Weckler, Daniel
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2023, 20 (04): : 181 - 194
  • [6] Semi-Supervised Learning for Video Captioning
    Lin, Ke
    Gan, Zhuoxin
    Wang, Liwei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1096 - 1106
  • [7] Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video
    Yang, Yang
    Shu, Guang
    Shah, Mubarak
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1650 - 1657
  • [8] Semi-supervised Change Point Detection Using Active Learning
    De Brabandere, Arne
    Cao, Zhenxiang
    De Vos, Maarten
    Bertrand, Alexander
    Davis, Jesse
    DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 74 - 88
  • [9] Active and semi-supervised learning for object detection with imperfect data
    Rhee, Phill Kyu
    Erdenee, Enkhbayar
    Kyun, Shin Dong
    Ahmed, Minhaz Uddin
    Jin, Songguo
    COGNITIVE SYSTEMS RESEARCH, 2017, 45 : 109 - 123
  • [10] Network Intrusion Detection Based on Active Semi-supervised Learning
    Zhang, Yong
    Niu, Jie
    He, Guojian
    Zhu, Lin
    Guo, Da
    51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN-W 2021), 2021, : 129 - 135