共 31 条
- [1] Herath S, Harandi M, Porikli F., Going deeper into action recognition: a survey, Image and Vision Computing, 60, pp. 4-21, (2017)
- [2] Wang X L, Girshick R, Gupta A, Et al., Non-local neural networks, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7794-7803, (2018)
- [3] Schroff F, Kalenichenko D, Philbin J., Facenet: a unified embedding for face recognition and clustering, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815-823, (2015)
- [4] Wang L M, Xiong Y J, Wang Z, Et al., Temporal segment networks: towards good practices for deep action recognition, Proceedings of the European Conference on Computer Vision, pp. 20-36, (2016)
- [5] Laptev I, Marszalek M, Schmid C, Et al., Learning realistic human actions from movies, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, (2008)
- [6] Wang H, Schmid C., Action recognition with improved trajectories, Proceedings of the IEEE International Conference on Computer Vision, pp. 3551-3558, (2013)
- [7] Donahue J, Hendricks L A, Guadarrama S, Et al., Long-term recurrent convolutional networks for visual recognition and description, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2625-2634, (2015)
- [8] Ji S W, Xu W, Yang M, Et al., 3D convolutional neural networks for human action recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 1, pp. 221-231, (2013)
- [9] Tran D, Bourdev L, Fergus R, Et al., Learning spatiotemporal features with 3D convolutional networks, Proceedings of the IEEE International Conference on Computer Vision, pp. 4489-4497, (2015)
- [10] Tran D, Wang H, Torresani L, Et al., A closer look at spatiotemporal convolutions for action recognition, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6450-6459, (2018)