State-Regularized Recurrent Neural Networks

被引：0

作者：

Wang, Cheng ^{[1
]}

Niepert, Mathias ^{[1
]}

机构：

[1] NEC Labs Europe, Heidelberg, Germany

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97 | 2019年 / 97卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recurrent neural networks are a widely used class of neural architectures with two shortcomings. First, it is difficult to understand what exactly they learn. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in principle. We aim to address both shortcomings with a class of recurrent networks that use a stochastic state transition mechanism between cell applications. This mechanism, which we term state-regularization, makes RNNs transition between a finite set of learnable states. We evaluate state-regularized RNNs on (1) regular languages for the purpose of automata extraction; (2) nonregular languages such as balanced parentheses, palindromes, and the copy task where external memory is required; and (3) real-word sequence learning tasks for sentiment analysis, visual object recognition, and language modeling. We show that state-regularization simplifies the extraction of finite state automata from the RNN's state transition dynamics; forces RNNs to operate more like automata with external memory and less like finite state machines; and makes RNNs more interpretable.

引用

页数：11

共 50 条

[1] State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
Wang, Cheng
Lawrence, Carolin
Niepert, Mathias
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7739 - 7750
[2] State-Regularized Policy Search for Linearized Dynamical Systems
Abdulsamad, Hany
Arenz, Oleg
Peters, Jan
Neumann, Gerhard
TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 419 - 424
[3] A New State-Regularized QRRLS Algorithm With a Variable Forgetting Factor
Chan, S. C.
Chu, Y. J.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2012, 59 (03) : 183 - 187
[4] STATE OBSERVABILITY IN RECURRENT NEURAL NETWORKS
ALBERTINI, F
SONTAG, ED
SYSTEMS & CONTROL LETTERS, 1994, 22 (04) : 235 - 244
[5] Predictive State Recurrent Neural Networks
Downey, Carlton
Hefny, Ahmed
Li, Boyue
Boots, Byron
Gordon, Geoff
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[6] Day ahead ocean swell forecasting with recursively regularized recurrent neural networks
Mirikitani, Deffick Takeshi
OCEANS 2007 - EUROPE, VOLS 1-3, 2007, : 1376 - 1379
[7] State Estimation for Recurrent Neural Networks With Intermittent Transmission
Liu, Chang
Rao, Hongxia
Yu, Xinxin
Xu, Yong
Su, Chun-Yi
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (05) : 2891 - 2900
[8] State-Frequency Memory Recurrent Neural Networks
Hu, Hao
Qi, Guo-Jun
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[9] On the Interpretation of Recurrent Neural Networks as Finite State Machines
Oliva, Christian
Lago-Fernandez, Luis F.
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 312 - 323
[10] Learning State Space Trajectories in Recurrent Neural Networks
Pearlmutter, Barak A.
NEURAL COMPUTATION, 1989, 1 (02) : 263 - 269

← 1 2 3 4 5 →