Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking

被引:17
|
作者
Yan, Yan [1 ]
Xu, Chenliang [2 ]
Cai, Dawen [3 ]
Corso, Jason J. [1 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
[3] Univ Michigan, Dept Cell & Dev Biol, Biophys, Ann Arbor, MI 48109 USA
关键词
D O I
10.1109/CVPR.2017.115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained activity understanding in videos has attracted considerable recent attention with a shift from action classification to detailed actor and action understanding that provides compelling results for perceptual needs of cutting-edge autonomous systems. However, current methods for detailed understanding of actor and action have significant limitations: they require large amounts of finely labeled data, and they fail to capture any internal relationship among actors and actions. To address these issues, in this paper, we propose a novel, robust multi-task ranking model for weakly-supervised actor-action segmentation where only video-level tags are given for training samples. Our model is able to share useful information among different actors and actions while learning a ranking matrix to select representative supervoxels for actors and actions respectively. Final segmentation results are generated by a conditional random field that considers various ranking scores for video parts. Extensive experimental results on the Actor-Action Dataset (A2D) demonstrate that the proposed approach outperforms the state-of-the-art weakly supervised methods and performs as well as the topperforming fully supervised method.
引用
收藏
页码:1022 / 1031
页数:10
相关论文
共 50 条
  • [1] A Weakly Supervised Multi-task Ranking Framework for Actor-Action Semantic Segmentation
    Yan, Yan
    Xu, Chenliang
    Cai, Dawen
    Corso, Jason J.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (05) : 1414 - 1432
  • [2] A Weakly Supervised Multi-task Ranking Framework for Actor–Action Semantic Segmentation
    Yan Yan
    Chenliang Xu
    Dawen Cai
    Jason J. Corso
    International Journal of Computer Vision, 2020, 128 : 1414 - 1432
  • [3] Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation
    Duan, Bin
    Tang, Hao
    Sun, Changchang
    Zhu, Ye
    Yan, Yan
    2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, : 483 - 492
  • [4] Weakly Supervised Multi-Task Learning for Cell Detection and Segmentation
    Chamanzar, Alireza
    Nie, Yao
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 513 - 516
  • [5] Weakly Supervised Text-based Actor-Action Video Segmentation by Clip-level Multi-instance Learning
    Chen, Weidong
    Li, Guorong
    Zhang, Xinfeng
    Wang, Shuhui
    Li, Liang
    Huang, Qingming
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [6] Improving Weakly Supervised Lesion Segmentation using Multi-Task Learning
    Chu, Tianshu
    Li, Xinmeng
    Vo, Huy V.
    Summers, Ronald M.
    Sizikova, Elena
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 143, 2021, 143 : 60 - 73
  • [7] Optimizing multi-task network with learned prototypes for weakly supervised semantic segmentation
    Zhou, Lei
    Wang, Jiasong
    Luo, Jing
    Guo, Yuheng
    Li, Xiaoxiao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 134
  • [8] Weakly-Supervised Medical Image Segmentation Based on Multi-task Learning
    Xie, Xuanhua
    Fan, Huijie
    Yu, Zhencheng
    Bai, Haijun
    Tang, Yandong
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT II, 2022, 13456 : 395 - 404
  • [9] Robust action recognition and segmentation with multi-task conditional random fields
    Shimosaka, Masamichi
    Mori, Taketoshi
    Sato, Tomomasa
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 3780 - +
  • [10] Actor-Action Semantic Segmentation with Grouping Process Models
    Xu, Chenliang
    Corso, Jason J.
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3083 - 3092