Matching Video Net: Memory-based embedding for video action recognition

Cited by: 0
Authors
Kim, Daesik [1]
Lee, Myunggi [1]
Kwak, Nojun [1]
Affiliation
[1] Seoul Natl Univ, Grad Sch Convergence Sci & Technol, Seoul, South Korea
Source
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2017
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most recent successful research on action recognition is based on deep learning architectures. However, training deep neural networks notoriously requires a huge amount of data, and training on too little data can lead to an overfitted model. In this work, we propose a novel model, the matching video net (MVN), which can be trained with a small amount of data. To avoid overfitting, we use a non-parametric setup on top of parametric networks with external memories. An input video clip is transformed into an embedding space and matched against the memorized samples in that space. The similarities between the input and the memorized data are then measured to determine the nearest neighbors. We perform experiments in a supervised manner on action recognition datasets, achieving state-of-the-art results. Moreover, we apply our model to one-shot learning problems with a novel training strategy. Our model achieves surprisingly good results in predicting unseen action classes from only a few examples.
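The matching step described in the abstract (embed a clip, compare it with memorized samples, pick the nearest neighbors) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not state which similarity measure MVN uses or how neighbor labels are combined, so cosine similarity and a softmax-weighted vote are assumptions here. The embedding network is omitted entirely, and the helper names `cosine_similarity` and `match_to_memory` and the toy 3-dimensional vectors are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b + 1e-12)


def match_to_memory(query, memory_embeddings, memory_labels):
    """Classify a query embedding against labeled memory embeddings.

    Hypothetical helper: returns (predicted_label, per_class_scores),
    where the scores are softmax-weighted similarities summed per class.
    """
    sims = [cosine_similarity(query, m) for m in memory_embeddings]
    # Softmax over similarities; subtract the max for numerical stability.
    peak = max(sims)
    weights = [math.exp(s - peak) for s in sims]
    total = sum(weights)
    scores = {}
    for w, label in zip(weights, memory_labels):
        scores[label] = scores.get(label, 0.0) + w / total
    predicted = max(scores, key=scores.get)
    return predicted, scores


# Toy embeddings standing in for encoded video clips (made-up values).
memory_embeddings = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 1.0, 0.0]]
memory_labels = ["run", "run", "jump"]
query = [0.8, 0.2, 0.0]

predicted, scores = match_to_memory(query, memory_embeddings, memory_labels)
print(predicted, scores)
```

Because the classifier is a lookup over stored examples, adding a new action class in this sketch only requires appending its embeddings and labels to the memory, which is the property that makes such setups attractive for few-shot settings.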
Pages: 432-438
Page count: 7