Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking

被引：17

作者：

Yan, Yan ^{[1
]}

Xu, Chenliang ^{[2
]}

Cai, Dawen ^{[3
]}

Corso, Jason J. ^{[1
]}

机构：

[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA

[2] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA

[3] Univ Michigan, Dept Cell & Dev Biol, Biophys, Ann Arbor, MI 48109 USA

来源：

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年

关键词：

D O I：

10.1109/CVPR.2017.115

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Fine-grained activity understanding in videos has attracted considerable recent attention with a shift from action classification to detailed actor and action understanding that provides compelling results for perceptual needs of cutting-edge autonomous systems. However, current methods for detailed understanding of actor and action have significant limitations: they require large amounts of finely labeled data, and they fail to capture any internal relationship among actors and actions. To address these issues, in this paper, we propose a novel, robust multi-task ranking model for weakly-supervised actor-action segmentation where only video-level tags are given for training samples. Our model is able to share useful information among different actors and actions while learning a ranking matrix to select representative supervoxels for actors and actions respectively. Final segmentation results are generated by a conditional random field that considers various ranking scores for video parts. Extensive experimental results on the Actor-Action Dataset (A2D) demonstrate that the proposed approach outperforms the state-of-the-art weakly supervised methods and performs as well as the topperforming fully supervised method.

引用

页码：1022 / 1031

页数：10

共 50 条

[31] Robust Visual Tracking via Multi-Task Sparse Learning
Zhang, Tianzhu
Ghanem, Bernard
Liu, Si
Ahuja, Narendra
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2042 - 2049
[32] Multi-proposal collaboration and multi-task training for weakly-supervised video moment retrieval
Zhang, Bolin
Yang, Chao
Jiang, Bin
Komamizu, Takahiro
Ide, Ichiro
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
[33] Prototypical Transformer for Weakly Supervised Action Segmentation
Lin, Tao
Chang, Xiaobin
Sun, Wei
Zheng, Weishi
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 195 - 206
[34] Ranking Performance Measures in Multi-Task Agencies
Christensen, Peter O.
Sabac, Florin
Tian, Jie
ACCOUNTING REVIEW, 2010, 85 (05): : 1545 - 1575
[35] Multi-task ranking SVM for image cosegmentation
Liang, Xianpeng
Zhu, Lin
Huang, De-Shuang
NEUROCOMPUTING, 2017, 247 : 126 - 136
[36] A Multi-Task Dense Network with Self-Supervised Learning for Retinal Vessel Segmentation
Tu, Zhonghao
Zhou, Qian
Zou, Hua
Zhang, Xuedong
ELECTRONICS, 2022, 11 (21)
[37] Multi-task learning for gland segmentation
Rezazadeh, Iman
Duygulu, Pinar
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (01) : 1 - 9
[38] MULTI-TASK SELF-SUPERVISED VISUAL REPRESENTATION LEARNING FOR MONOCULAR ROAD SEGMENTATION
Cho, Jaehoon
Kim, Youngjung
Jung, Hyungjoo
Oh, Changjae
Youn, Jaesung
Sohn, Kwanghoon
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
[39] Multi-Task Learning for Subspace Segmentation
Wang, Yu
Wipf, David
Ling, Qing
Chen, Wei
Wassell, Ian
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1209 - 1217
[40] Multi-task learning for gland segmentation
Iman Rezazadeh
Pinar Duygulu
Signal, Image and Video Processing, 2023, 17 : 1 - 9

← 1 2 3 4 5 →