TVENet: Temporal variance embedding network for fine-grained action representation

被引：9

作者：

Han, Tingting ^{[1
,2
]}

Yao, Hongxun ^{[2
]}

Xie, Wenlong ^{[2
]}

Sun, Xiaoshuai ^{[2
]}

Zhao, Sicheng ^{[3
]}

Yu, Jun ^{[1
]}

机构：

[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China

[2] Harbin Inst Technol, Sch Comp Sci & Technol, 612 Zonghe Bldg, Harbin, Peoples R China

[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

来源：

PATTERN RECOGNITION | 2020年 / 103卷 / 103期

基金：

中国国家自然科学基金;

关键词：

Fine-grained action representation; temporal variance embedding network (TVENet); joint optimization; temporal triplet loss; action search; DEEP; MODEL;

D O I：

10.1016/j.patcog.2020.107267

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the breakthroughs in general action understanding, it has become an inevitable trend to analyze the actions in finer granularity. However, related researches have been largely hindered by the lack of fine-grained datasets and the difficulty of capturing subtle differences between fine-grained actions that are highly similar overall. In this paper, we address the above challenges by constructing a fine-grained action dataset, i.e., Figure Skating, which can be used for end-to-end network training and presenting a framework for the joint optimization of classification and similarity constraints. We propose to incorporate the triplet loss into the training of Convolutional Neural Network, which learns a mapping from fine-grained actions to a compact Euclidean space where distances directly correspond to a measure of action similarity. Triplet loss compels actions of distinct classes to have larger distances than actions of the same class. Besides, to boost the discrimination of the fine-grained actions, we further propose a temporal variance embedding network (TVENet) embedding temporal context variances into the feature embeddings during the joint network training. The experimental results on Figure Skating dataset, HMDB51 dataset as well as UCF101 dataset demonstrate the effectiveness of TVENet representation for fine-grained action search. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：16

共 50 条

[21] Fine-Grained Complexity of Temporal Problems
Dabrowski, Konrad K.
Jonsson, Peter
Ordyniak, Sebastian
Osipov, George
KR2020: PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PRINCIPLES OF KNOWLEDGE REPRESENTATION AND REASONING, 2020, : 284 - 293
[22] TEMPORAL STABILITY OF A FINE-GRAINED MAGNETITE
MURAD, E
SCHWERTMANN, U
CLAYS AND CLAY MINERALS, 1993, 41 (01) : 111 - 113
[23] Fine-Grained Temporal Relation Extraction
Vashishtha, Siddharth
Van Durme, Benjamin
White, Aaron Steven
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2906 - 2919
[24] Fine-Grained Fashion Similarity Learning by Attribute-Specific Embedding Network
Ma, Zhe
Dong, Jianfeng
Long, Zhongzi
Zhang, Yao
He, Yuan
Xue, Hui
Ji, Shouling
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11741 - 11748
[25] ACTION AND CRIME - A FINE-GRAINED APPROACH
GOLDMAN, AI
UNIVERSITY OF PENNSYLVANIA LAW REVIEW, 1994, 142 (05) : 1563 - 1586
[26] Fine-grained action plausibility rating
Lueddecke, Timo
Woergoetter, Florentin
ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, 129 (129)
[27] Fine-grained Iterative Attention Network for Temporal Language Localization in Videos
Qu, Xiaoye
Tang, Pengwei
Zou, Zhikang
Cheng, Yu
Dong, Jianfeng
Zhou, Pan
Xu, Zichuan
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4280 - 4288
[28] Conditional Video Diffusion Network for Fine-Grained Temporal Sentence Grounding
Liu, Daizong
Zhu, Jiahao
Fang, Xiang
Xiong, Zeyu
Wang, Huan
Li, Renfu
Zhou, Pan
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5461 - 5476
[29] Discriminative Segment Focus Network for Fine-grained Video Action Recognition
Sun, Baoli
Ye, Xinchen
Yan, Tiantian
Wang, Zhihui
Li, Haojie
Wang, Zhiyong
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
[30] Coupled Generative Adversarial Network for Continuous Fine-grained Action Segmentation
Gammulle, Harshala
Fernando, Tharindu
Denman, Simon
Sridharan, Sridha
Fookes, Clinton
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 200 - 209

← 1 2 3 4 5 →