Research on CNN Parallel Computing and Learning Architecture Based on Real-Time Streaming Architecture

被引:1
|
作者
Zhu, Yuting [1 ]
Qian, Liang [1 ]
Wang, Chuyan [1 ]
Ding, Lianghui [1 ]
Yang, Feng [1 ]
Wang, Hao [2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] Air Force Mil Representat Off Shanghai Nanjing, Nanjing, Peoples R China
关键词
CNN; Parallel computing; Apache storm; Real time;
D O I
10.1007/978-3-030-05366-6_12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural network (CNN) is a deep feed-forward artificial neural network, which is widely used in image recognition. However, this mode highlights the problems that the training time is too long and memory is insufficient. Traditional acceleration methods are mainly limited to optimizing for an algorithm. In this paper, we propose a method, namely CNN-S, to improve training efficiency and cost based on Storm and is suitable for every algorithm. This model divides data into several sub sets and processes data on several machine in parallel flexibly. The experimental results show that in the case of achieving a recognition accuracy rate of 95%, the training time of single serial model is around 913 s, and in CNN-S model only needs 248 s. The acceleration ratio can reach 3.681. This shows that the CNN-S parallel model has better performance than single serial mode on training efficiency and cost of system resource.
引用
收藏
页码:150 / 158
页数:9
相关论文
共 50 条
  • [41] Quantized CNN-based efficient hardware architecture for real-time hand gesture recognition
    Jaiswal, Mohita
    Sharma, Vaidehi
    Sharma, Abhishek
    Saini, Sandeep
    Tomar, Raghuvir
    MICROELECTRONICS JOURNAL, 2024, 151
  • [42] CNN-based architecture for real-time object-oriented video coding applications
    Grassi, G
    Grieco, LA
    INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2005, 33 (01) : 53 - 64
  • [43] Real-time CNN-based Segmentation Architecture for Ball Detection in a Single View Setup
    Van Zandycke, Gabriel
    De Vleeschouwer, Christophe
    PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2019, 2019, : 51 - 58
  • [44] ISTTOK real-time architecture
    Carvalho, Ivo S.
    Duarte, Paulo
    Fernandes, Horacio
    Valcarcel, Daniel F.
    Carvalho, Pedro J.
    Silva, Carlos
    Duarte, Andre S.
    Neto, Andre
    Sousa, Jorge
    Batista, Antonio J. N.
    Hekkert, Tiago
    Carvalho, Bernardo B.
    FUSION ENGINEERING AND DESIGN, 2014, 89 (03) : 195 - 203
  • [45] ISTTOK real-time architecture
    Carvalho, I.S. (ivoc@ipfn.ist.utl.pt), 1600, Elsevier Ltd (89):
  • [46] A new FPGA/DSP-based parallel architecture for real-time image processing
    Batlle, J
    Martí, J
    Ridao, P
    Amat, J
    REAL-TIME IMAGING, 2002, 8 (05) : 345 - 356
  • [47] A hardware-based architecture to support flexible real-time parallel intrusion detection
    Mott, Stephen
    Hart, Samuel
    Montminy, David
    Williams, Paul
    Baldwin, Rusty
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING, VOLS 1 AND 2, 2007, : 614 - 619
  • [48] EHW architecture for real-time GPS attitude determination based on parallel genetic algorithm
    Xu, JN
    Arslan, T
    Wang, Q
    Wan, DJ
    2002 NASA/DOD CONFERENCE ON EVOLABLE HARDWARE, PROCEEDINGS, 2002, : 133 - 141
  • [49] A Flexible Parallel Hardware Architecture for AdaBoost-Based Real-Time Object Detection
    Kyrkou, Christos
    Theocharides, Theocharis
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2011, 19 (06) : 1034 - 1047
  • [50] A flexible real-time architecture
    Wickstrom, GL
    FIFTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING, PROCEEDINGS, 2000, : 99 - 106