State-Regularized Recurrent Neural Networks

被引:0
|
作者
Wang, Cheng [1 ]
Niepert, Mathias [1 ]
机构
[1] NEC Labs Europe, Heidelberg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent neural networks are a widely used class of neural architectures with two shortcomings. First, it is difficult to understand what exactly they learn. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in principle. We aim to address both shortcomings with a class of recurrent networks that use a stochastic state transition mechanism between cell applications. This mechanism, which we term state-regularization, makes RNNs transition between a finite set of learnable states. We evaluate state-regularized RNNs on (1) regular languages for the purpose of automata extraction; (2) nonregular languages such as balanced parentheses, palindromes, and the copy task where external memory is required; and (3) real-word sequence learning tasks for sentiment analysis, visual object recognition, and language modeling. We show that state-regularization simplifies the extraction of finite state automata from the RNN's state transition dynamics; forces RNNs to operate more like automata with external memory and less like finite state machines; and makes RNNs more interpretable.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
    Wang, Cheng
    Lawrence, Carolin
    Niepert, Mathias
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7739 - 7750
  • [2] State-Regularized Policy Search for Linearized Dynamical Systems
    Abdulsamad, Hany
    Arenz, Oleg
    Peters, Jan
    Neumann, Gerhard
    TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 419 - 424
  • [3] A New State-Regularized QRRLS Algorithm With a Variable Forgetting Factor
    Chan, S. C.
    Chu, Y. J.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2012, 59 (03) : 183 - 187
  • [4] STATE OBSERVABILITY IN RECURRENT NEURAL NETWORKS
    ALBERTINI, F
    SONTAG, ED
    SYSTEMS & CONTROL LETTERS, 1994, 22 (04) : 235 - 244
  • [5] Predictive State Recurrent Neural Networks
    Downey, Carlton
    Hefny, Ahmed
    Li, Boyue
    Boots, Byron
    Gordon, Geoff
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [6] Day ahead ocean swell forecasting with recursively regularized recurrent neural networks
    Mirikitani, Deffick Takeshi
    OCEANS 2007 - EUROPE, VOLS 1-3, 2007, : 1376 - 1379
  • [7] State Estimation for Recurrent Neural Networks With Intermittent Transmission
    Liu, Chang
    Rao, Hongxia
    Yu, Xinxin
    Xu, Yong
    Su, Chun-Yi
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (05) : 2891 - 2900
  • [8] State-Frequency Memory Recurrent Neural Networks
    Hu, Hao
    Qi, Guo-Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [9] On the Interpretation of Recurrent Neural Networks as Finite State Machines
    Oliva, Christian
    Lago-Fernandez, Luis F.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 312 - 323
  • [10] Learning State Space Trajectories in Recurrent Neural Networks
    Pearlmutter, Barak A.
    NEURAL COMPUTATION, 1989, 1 (02) : 263 - 269