共 37 条
- [21] Han S, Pool J, Tran J, Dally WJ., Learning both weights and connections for efficient neural network, Proc. of the Advances in Neural Information Processing Systems (NIPS), pp. 1135-1143, (2015)
- [22] Dong X, Chen S, Pan SJ., Learning to prune deep neural networks via layer-wise optimal brain surgeon, Proc. of the Advances in Neural Information Processing Systems (NIPS), pp. 4857-4867, (2017)
- [23] Naumov M, Chien L, Vandermersch P, Kapasi U., Cusparse library, Proc. of the GPU Technology Conf, (2010)
- [24] Chen X., Escoin: Efficient sparse convolutional neural network inference on GPUs, (2018)
- [25] Mao H, Han S, Pool J, Li W, Liu X, Wang Y, Dally WJ., Exploring the granularity of sparsity in convolutional neural networks, Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), pp. 1927-1934, (2017)
- [26] Park J, Li SR, Wen W, Tang PTP, Li H, Chen Y, Dubey P., Faster cnns with direct sparse convolutions and guided pruning, Proc. of the Int'l Conf. on Learning Representations (ICLR), (2017)
- [27] Lei J, Gao X, Song J, Wang XL, Song ML., Survey of deep neural network model compression, Ruan Jian Xue Bao/Journal of Software, 29, 2, pp. 251-266, (2018)
- [28] Zhang X, Tan G, Xue S, Li J, Zhou K, Chen M., Understanding the GPU microarchitecture to achieve bare-metal performance tuning, Proc. of the 22nd ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (PPoPP), pp. 31-43, (2017)
- [29] Williams S, Waterman A, Patterson DA., Roofline: An insightful visual performance model for multicore architectures, Communications of the ACM, 52, 4, pp. 65-76, (2009)
- [30] Liu B, Wang M, Foroosh H, Tappen MF, Pensky M., Sparse convolutional neural networks, Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 806-814, (2015)