Egocentric action anticipation from untrimmed videos

被引:0
|
作者
Rodin, Ivan [1 ]
Furnari, Antonino [1 ,2 ]
Farinella, Giovanni Maria [1 ,2 ]
机构
[1] Univ Catania, Catania, Italy
[2] Univ Catania, Next Vis srl Spinoff, Catania, Italy
关键词
computer vision; pattern recognition;
D O I
10.1049/cvi2.12342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric action anticipation involves predicting future actions performed by the camera wearer from egocentric video. Although the task has recently gained attention in the research community, current approaches often assume that input videos are 'trimmed', meaning that a short video sequence is sampled a fixed time before the beginning of the action. However, trimmed action anticipation has limited applicability in real-world scenarios, where it is crucial to deal with 'untrimmed' video inputs and the exact moment of action initiation cannot be assumed at test time. To address these limitations, an untrimmed action anticipation task is proposed, which, akin to temporal action detection, assumes that the input video is untrimmed at test time, while still requiring predictions to be made before actions take place. The authors introduce a benchmark evaluation procedure for methods designed to address this novel task and compare several baselines on the EPIC-KITCHENS-100 dataset. Through our experimental evaluation, testing a variety of models, the authors aim to better understand their performance in untrimmed action anticipation. Our results reveal that the performance of current models designed for trimmed action anticipation is limited, emphasising the need for further research in this area.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Untrimmed Action Anticipation
    Rodin, Ivan
    Furnari, Antonino
    Mavroeidis, Dimitrios
    Farinella, Giovanni Maria
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 337 - 348
  • [2] SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
    Osman, Nada
    Camporese, Guglielmo
    Coscia, Pasquale
    Ballan, Lamberto
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3430 - 3438
  • [3] Active Learning of an Action Detector from Untrimmed Videos
    Bandla, Sunil
    Grauman, Kristen
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1833 - 1840
  • [4] Action Recognition from Single Timestamp Supervision in Untrimmed Videos
    Moltisanti, Davide
    Fidler, Sanja
    Damen, Dima
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9907 - 9916
  • [5] Generic Action Recognition from Egocentric Videos
    Singh, Suriya
    Arora, Chetan
    Jawahar, C. V.
    2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,
  • [6] Continuous Action Recognition and Segmentation in Untrimmed Videos
    Bai, Ruibin
    Zhao, Qing
    Zhou, Sanping
    Li, Yubing
    Zhao, Xueji
    Wang, Jinjun
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2534 - 2539
  • [7] Towards Streaming Egocentric Action Anticipation
    Furnari, Antonino
    Farinella, Giovanni Maria
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1250 - 1257
  • [8] Temporal Action Detection in Untrimmed Videos from Fine to Coarse Granularity
    Yao, Guangle
    Lei, Tao
    Liu, Xianyuan
    Jiang, Ping
    APPLIED SCIENCES-BASEL, 2018, 8 (10):
  • [9] Skimming and Scanning for Efficient Action Recognition in Untrimmed Videos
    Hong, Yunyan
    Zeng, Ailing
    Li, Min
    Lu, Cewu
    Jian, Li
    Xu, Qiang
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021,
  • [10] StartNet: Online Detection of Action Start in Untrimmed Videos
    Gao, Mingfei
    Xu, Mingze
    Davis, Larry S.
    Socher, Richard
    Xiong, Caiming
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5541 - 5550