State-Regularized Recurrent Neural Networks

被引:0
|
作者
Wang, Cheng [1 ]
Niepert, Mathias [1 ]
机构
[1] NEC Labs Europe, Heidelberg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent neural networks are a widely used class of neural architectures with two shortcomings. First, it is difficult to understand what exactly they learn. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in principle. We aim to address both shortcomings with a class of recurrent networks that use a stochastic state transition mechanism between cell applications. This mechanism, which we term state-regularization, makes RNNs transition between a finite set of learnable states. We evaluate state-regularized RNNs on (1) regular languages for the purpose of automata extraction; (2) nonregular languages such as balanced parentheses, palindromes, and the copy task where external memory is required; and (3) real-word sequence learning tasks for sentiment analysis, visual object recognition, and language modeling. We show that state-regularization simplifies the extraction of finite state automata from the RNN's state transition dynamics; forces RNNs to operate more like automata with external memory and less like finite state machines; and makes RNNs more interpretable.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] POWER SYSTEM STATE FORECASTING VIA DEEP RECURRENT NEURAL NETWORKS
    Zhang, Liang
    Wang, Gang
    Giannakis, Georgios B.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8092 - 8096
  • [42] An algorithmic approach to adaptive state filtering using recurrent neural networks
    Parlos, AG
    Menon, SK
    Atiya, AF
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (06): : 1411 - 1432
  • [43] Comparative Study of State-of-Charge Estimation with Recurrent Neural Networks
    Wassiliadis, Nikolaos
    Herrmann, Thomas
    Wildfeuer, Leo
    Reiter, Christoph
    Lienkamp, Markus
    2019 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE AND EXPO (ITEC), 2019,
  • [44] Recognition of Converter Steelmaking State Based on Convolutional Recurrent Neural Networks
    Huang, Chengyong
    Dai, Zhangjie
    Sun, Ye
    Wang, Zijiao
    Liu, Wei
    Yang, Shufeng
    Li, Jingshe
    METALLURGICAL AND MATERIALS TRANSACTIONS B-PROCESS METALLURGY AND MATERIALS PROCESSING SCIENCE, 2024, 55 (03): : 1856 - 1868
  • [45] Exponentially convergent state estimation for delayed switched recurrent neural networks
    Ahn, Choon Ki
    EUROPEAN PHYSICAL JOURNAL E, 2011, 34 (11):
  • [46] Benchmarking Reservoir and Recurrent Neural Networks for Human State and Activity Recognition
    Bacciu, Davide
    Di Sarli, Daniele
    Gallicchio, Claudio
    Micheli, Alessio
    Puccinelli, Niccolo
    ADVANCES IN COMPUTATIONAL INTELLIGENCE (IWANN 2021), PT II, 2021, 12862 : 168 - 179
  • [47] A Digital Circuit Design of State-Space Recurrent Neural Networks
    Lin, Che-Wei
    Wang, Jeen-Shing
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1837 - 1841
  • [48] Training and extraction of fuzzy finite state automata in recurrent neural networks
    Chandra, Rohitash
    Omlin, Christian W.
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2006, : 271 - 275
  • [49] Exponentially convergent state estimation for delayed switched recurrent neural networks
    Choon Ki Ahn
    The European Physical Journal E, 2011, 34
  • [50] State estimation for jumping recurrent neural networks with discrete and distributed delays
    Wang, Zidong
    Liu, Yurong
    Liu, Xiaohui
    NEURAL NETWORKS, 2009, 22 (01) : 41 - 48