State-Regularized Recurrent Neural Networks

Cited by: 0
Authors
Wang, Cheng [1]
Niepert, Mathias [1]
Affiliations
[1] NEC Labs Europe, Heidelberg, Germany
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recurrent neural networks are a widely used class of neural architectures with two shortcomings. First, it is difficult to understand what exactly they learn. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in principle. We aim to address both shortcomings with a class of recurrent networks that use a stochastic state transition mechanism between cell applications. This mechanism, which we term state-regularization, makes RNNs transition between a finite set of learnable states. We evaluate state-regularized RNNs on (1) regular languages for the purpose of automata extraction; (2) nonregular languages such as balanced parentheses, palindromes, and the copy task where external memory is required; and (3) real-world sequence learning tasks for sentiment analysis, visual object recognition, and language modeling. We show that state-regularization simplifies the extraction of finite state automata from the RNN's state transition dynamics; forces RNNs to operate more like automata with external memory and less like finite state machines; and makes RNNs more interpretable.
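The abstract describes the mechanism only at a high level: between cell applications, the hidden state moves to one of a finite set of learnable states. As a rough illustration, and not the authors' implementation, the sketch below assumes that each learnable state is a vector (a "centroid"), that a softmax over dot-product scores gives a probability for each state, and that the next hidden state is either sampled from those probabilities or taken as their weighted mixture. The function name `state_regularize`, the `temperature` parameter, and the dot-product scoring are all assumptions made for this example.

```python
import math
import random


def state_regularize(u, centroids, temperature=1.0, rng=None):
    """Map a hidden vector u onto a finite set of centroid states.

    Hypothetical sketch: returns (new_state, probs). If rng is given,
    one centroid is sampled; otherwise the probability-weighted
    mixture of all centroids is returned.
    """
    # Score each centroid by its dot product with u (an assumed choice).
    scores = [sum(a * b for a, b in zip(u, s)) / temperature
              for s in centroids]
    # Numerically stable softmax over the scores.
    top = max(scores)
    exps = [math.exp(x - top) for x in scores]
    total = sum(exps)
    probs = [e / total for e in exps]

    if rng is not None:
        # Stochastic transition: pick one centroid according to probs.
        idx = rng.choices(range(len(centroids)), weights=probs, k=1)[0]
        return list(centroids[idx]), probs

    # Soft transition: convex combination of the centroids.
    dim = len(centroids[0])
    mixed = [sum(p * s[d] for p, s in zip(probs, centroids))
             for d in range(dim)]
    return mixed, probs


if __name__ == "__main__":
    centroids = [[1.0, 0.0], [0.0, 1.0]]
    state, probs = state_regularize([5.0, 0.0], centroids, temperature=0.1)
    print(state, probs)
```

With a low temperature the probabilities concentrate on a single centroid, so the network behaves almost like a machine with discrete states; this is the property that would make reading off an automaton easier under the assumptions above.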
Pages: 11