Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs

Cited by: 0
Authors
Jing, Li [1]
Shen, Yichen [1]
Dubcek, Tena [1]
Peurifoy, John [1]
Skirlo, Scott [1]
LeCun, Yann [2]
Tegmark, Max [1]
Soljacic, Marin [1]
Affiliations
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] New York Univ, Facebook AI Res, New York, NY 10003 USA
Funding
National Science Foundation (USA);
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Using unitary (instead of general) matrices in artificial neural networks (ANNs) is a promising way to solve the gradient explosion/vanishing problem, as well as to enable ANNs to learn long-term correlations in the data. This approach appears particularly promising for Recurrent Neural Networks (RNNs). In this work, we present a new architecture for implementing Efficient Unitary Neural Networks (EUNNs); its main advantages can be summarized as follows. Firstly, the representation capacity of the unitary space in an EUNN is fully tunable, ranging from a subspace of SU(N) to the entire unitary space. Secondly, the computational complexity for training an EUNN is merely O(1) per parameter. Finally, we test the performance of EUNNs on the standard copying task, the pixel-permuted MNIST digit recognition benchmark, and the Speech Prediction Test (TIMIT). We find that our architecture significantly outperforms both other state-of-the-art unitary RNNs and the LSTM architecture in terms of final performance and/or wall-clock training speed. EUNNs are thus promising alternatives to RNNs and LSTMs for a wide variety of applications.
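The norm-preservation argument in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the helper `rotation_layer`, the particular 2x2 parametrization, and the alternating-offset stacking are assumptions chosen for illustration. Each layer mixes neighbouring pairs of coordinates with a 2x2 unitary block, so the work per layer is proportional to the number of parameters, and the norm of the hidden vector is unchanged however many layers are applied.

```python
import numpy as np


def rotation_layer(h, theta, phi, offset=0):
    """Mix neighbouring pairs of h with independent 2x2 unitary blocks.

    Hypothetical helper for illustration only. Each pair (a, b) is mapped by
        [[exp(i*phi) * cos(theta), -exp(i*phi) * sin(theta)],
         [sin(theta),               cos(theta)]]
    which is unitary, so the layer preserves the Euclidean norm of h.
    """
    out = h.astype(complex)  # astype returns a copy, so h is not modified
    n_pairs = (len(h) - offset) // 2
    stop = offset + 2 * n_pairs
    a = out[offset:stop:2].copy()
    b = out[offset + 1:stop:2].copy()
    t, p = theta[:n_pairs], phi[:n_pairs]
    out[offset:stop:2] = np.exp(1j * p) * (np.cos(t) * a - np.sin(t) * b)
    out[offset + 1:stop:2] = np.sin(t) * a + np.cos(t) * b
    return out


rng = np.random.default_rng(0)
N, depth = 8, 100  # assumed sizes, chosen only for the demonstration

h0 = rng.normal(size=N) + 1j * rng.normal(size=N)
h = h0.copy()
for layer in range(depth):
    theta = rng.uniform(0.0, 2.0 * np.pi, size=N // 2)
    phi = rng.uniform(0.0, 2.0 * np.pi, size=N // 2)
    # Alternate the pairing offset so that all coordinates get mixed.
    h = rotation_layer(h, theta, phi, offset=layer % 2)

norm_before = np.linalg.norm(h0)
norm_after = np.linalg.norm(h)
print(norm_before, norm_after)
```

In this sketch the number of stacked layers stands in for the tunable capacity described in the abstract: fewer layers reach only part of the unitary space, more layers reach more of it. That correspondence is an assumption of the example, not a statement of the paper's exact construction.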
Pages: 9
Related papers
50 in total
  • [21] Tunable neural networks for CT image formation
    Tivnan, Matthew
    Gang, Grace J.
    Wang, Wenying
    Noel, Peter
    Sulam, Jeremias
    Webster Stayman, J.
    JOURNAL OF MEDICAL IMAGING, 2023, 10 (03)
  • [22] An Efficient Hybrid Mechanism with LSTM Neural Networks in Application to Stock Price Forecasting
    Ngoc-An Nguyen-Pham
    Trung T Nguyen
    KNOWLEDGE INNOVATION THROUGH INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_20), 2020, 327 : 447 - 458
  • [23] Advancing Asthma Management: Recurrent Neural Networks (RNNS) and Nanosensors for Precision Forecasting of Indoor Air Pollution
    Higgs, V.
    Ahmed, H.
    Flavier, J.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2024, 209
  • [24] A global-local hybrid evolutionary strategy (ES) for recurrent neural networks (RNNs) in system identification
    Teoh, E. J.
    Xiang, C.
    2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 1628 - 1635
  • [25] Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks
    Chen, Minmin
    Pennington, Jeffrey
    Schoenholz, Samuel S.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [26] The unitary modification rules for neural networks with excitatory and inhibitory synaptic plasticity
    Silkis, IG
    BIOSYSTEMS, 1998, 48 (1-3) : 205 - 213
  • [27] Complex Unitary Recurrent Neural Networks Using Scaled Cayley Transform
    Maduranga, Kehelwala D. G.
    Helfrich, Kyle E.
    Ye, Qiang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4528 - 4535
  • [28] Sequence Modeling with Recurrent Neural Networks (RNNs) for Student Learning Behavior Pattern Recognition in a Flipped Classroom
    Tang, Guangheng
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 401 - 418
  • [29] A Neural Networks Application in Ergonomics
    Ene, Alexandru
    Anghel, Daniel-Constantin
    2013 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2013,
  • [30] APPLICATION OF NEURAL NETWORKS TO PHARMACODYNAMICS
    VENGPEDERSEN, P
    MODI, NB
    JOURNAL OF PHARMACEUTICAL SCIENCES, 1993, 82 (09) : 918 - 926