TVENet: Temporal variance embedding network for fine-grained action representation

Cited by: 9
Authors
Han, Tingting [1, 2]
Yao, Hongxun [2]
Xie, Wenlong [2]
Sun, Xiaoshuai [2]
Zhao, Sicheng [3]
Yu, Jun [1]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, 612 Zonghe Bldg, Harbin, Peoples R China
[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Funding
National Natural Science Foundation of China
Keywords
Fine-grained action representation; temporal variance embedding network (TVENet); joint optimization; temporal triplet loss; action search; DEEP; MODEL;
DOI
10.1016/j.patcog.2020.107267
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the breakthroughs in general action understanding, analyzing actions at a finer granularity has become a natural next step. However, related research has been largely hindered by the lack of fine-grained datasets and by the difficulty of capturing the subtle differences between fine-grained actions that are highly similar overall. In this paper, we address these challenges by constructing a fine-grained action dataset, Figure Skating, that can be used for end-to-end network training, and by presenting a framework for the joint optimization of classification and similarity constraints. We propose incorporating the triplet loss into the training of a convolutional neural network, which learns a mapping from fine-grained actions to a compact Euclidean space where distances directly correspond to a measure of action similarity. The triplet loss compels actions of distinct classes to lie farther apart than actions of the same class. In addition, to make fine-grained actions more discriminable, we propose a temporal variance embedding network (TVENet) that embeds temporal context variances into the feature embeddings during joint network training. Experimental results on the Figure Skating, HMDB51, and UCF101 datasets demonstrate the effectiveness of the TVENet representation for fine-grained action search. © 2020 Elsevier Ltd. All rights reserved.
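The joint optimization described in the abstract can be illustrated with a minimal sketch that adds a standard triplet hinge loss to a softmax cross-entropy classification loss. This is not the authors' implementation: the function names, the squared Euclidean distance, the margin of 0.2, and the weighting factor `weight` are all assumptions chosen for illustration, and the temporal variance embedding itself is not reproduced here.

```python
import math


def squared_distance(u, v):
    """Squared Euclidean distance between two embedding vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard triplet hinge loss (margin value is an assumption).

    The loss is zero once the negative is farther from the anchor
    than the positive by at least `margin`.
    """
    d_pos = squared_distance(anchor, positive)
    d_neg = squared_distance(anchor, negative)
    return max(d_pos - d_neg + margin, 0.0)


def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, computed stably."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def joint_loss(logits, label, anchor, positive, negative,
               weight=1.0, margin=0.2):
    """Classification loss plus a weighted similarity constraint.

    `weight` is a hypothetical balancing factor, not a value from the paper.
    """
    return (cross_entropy(logits, label)
            + weight * triplet_loss(anchor, positive, negative, margin))
```

In this sketch, a triplet whose same-class sample is already much closer to the anchor than the different-class sample contributes no similarity loss, so only the classification term drives the update for that triplet.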
Pages: 16