Matching Video Net: Memory-based embedding for video action recognition

Cited by: 0

Authors:

Kim, Daesik [1]

Lee, Myunggi [1]

Kwak, Nojun [1]

Affiliation:

[1] Seoul Natl Univ, Grad Sch Convergence Sci & Technol, Seoul, South Korea

Source:

2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2017

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most recent successful work on action recognition is based on deep learning architectures. However, training deep neural networks is notorious for requiring huge amounts of data, and too little data can lead to an overfitted model. In this work, we propose a novel model, the matching video net (MVN), which can be trained with a small amount of data. To avoid overfitting, we use a non-parametric setup on top of parametric networks with external memories. An input video clip is transformed into an embedding space and matched against the memorized samples in that space. The similarities between the input and the memorized data are then measured to determine the nearest neighbors. We perform experiments in a supervised manner on action recognition datasets, achieving state-of-the-art results. Moreover, we apply our model to one-shot learning problems with a novel training strategy. Our model achieves surprisingly good results in predicting unseen action classes from only a few examples.
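The matching step described in the abstract (embed a clip, compare it with memorized samples, pick the nearest neighbors) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the similarity measure or the voting rule, so cosine similarity and softmax-weighted label voting are assumptions here, the function names are hypothetical, and hand-written 2-D vectors stand in for the output of a learned embedding network.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def match_to_memory(query, memory):
    """Classify `query` by comparing it with memorized (embedding, label) pairs.

    Similarities are turned into softmax weights, and the weights of
    samples sharing a label are summed (assumed voting rule).
    Returns the best label and the per-label scores.
    """
    sims = [cosine_similarity(query, emb) for emb, _ in memory]
    # Numerically stable softmax over the similarity values.
    peak = max(sims)
    exps = [math.exp(s - peak) for s in sims]
    total = sum(exps)
    weights = [e / total for e in exps]

    scores = {}
    for weight, (_, label) in zip(weights, memory):
        scores[label] = scores.get(label, 0.0) + weight
    best = max(scores, key=scores.get)
    return best, scores


# Toy external memory: in a real system these vectors would come from
# an embedding network applied to labeled video clips.
memory = [
    ([1.0, 0.0], "run"),
    ([0.9, 0.1], "run"),
    ([0.0, 1.0], "jump"),
]

label, scores = match_to_memory([0.8, 0.2], memory)
print(label, scores)
```

Because classification only reads from the memory, new classes can in principle be handled by adding a few labeled embeddings to `memory` without retraining, which is the property the one-shot setting relies on.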


Pages: 432-438

Page count: 7

