Dynamic Feed-Forward LSTM

被引:0
|
作者
Piao, Chengkai [1 ]
Wang, Yuchen [1 ]
Wei, Jinmao [1 ]
机构
[1] Nankai Univ, 38 Tongyan Rd, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Dynamic Process; Feed Forward; LSTM; Full Context; ATTENTION MECHANISM; BIDIRECTIONAL LSTM; MODEL;
D O I
10.1007/978-3-031-40283-8_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the insufficient hidden states capabilities and single-direction feeding flaws of existing LSTM caused by its horizontal recurrent steps. To this end, we propose the Dynamic Feed-Forward LSTM (D-LSTM). Specifically, our D-LSTM first expands the capabilities of hidden states by assigning an exclusive state vector to each word. Then, the Dynamic Additive Attention (DAA) method is utilized to adaptively compress local context words into a fixed size vector. Last, a vertical feed-forward process is proposed to search context relations by filtering informative features in the compressed context vector and updating hidden states. With the help of exclusive hidden states, each word can preserve its most correlated context features and hidden states do not interfere with each other. By setting an appropriate context window size for DAA and stacking multiple such layers, the context scope can be gradually expanded from a central word to both sides and achieve the whole sentence at the top layer. Furthermore, the D-LSTM module is compatible with parallel computing and amenable to training via back-propagation for its vertical prorogation. Experimental results on both classification and sequence tagging datasets insist that our models achieve competitive performance compared to existing LSTMs.
引用
收藏
页码:191 / 202
页数:12
相关论文
共 50 条
  • [41] Application of a feed-forward control structure
    University of Cape Town, Department of Electrical Engineering, University of Cape Town, Cape Town, Western Cape
    7700, South Africa
    Lect. Notes Electr. Eng., (91-100):
  • [42] Improvements in broadband feed-forward amplifiers
    Coimbra, MD
    Souza, RF
    MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 1996, 12 (04) : 213 - 215
  • [43] Propagating synchrony in feed-forward networks
    Jahnke, Sven
    Memmesheimer, Raoul-Martin
    Timme, Marc
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2013, 7
  • [44] Feed-forward inhibition in the visual thalamus
    Kosmidis, EK
    Vibert, JF
    NEUROCOMPUTING, 2002, 44 : 479 - 487
  • [45] A Novel Feed-Forward Controller for PMSMs
    Altun, Yusuf
    Gulez, Kayhan
    Mumcu, Tarik Veli
    Kizilkaya, M. Ozgur
    2013 3RD INTERNATIONAL CONFERENCE ON ELECTRIC POWER AND ENERGY CONVERSION SYSTEMS (EPECS), 2013,
  • [46] Quantum Feed-Forward Control of Light
    Andersen, Ulrik L.
    Filip, Radim
    PROGRESS IN OPTICS, VOL 53, 2009, 53 : 365 - 414
  • [47] LARGE DEVIATIONS FOR A FEED-FORWARD NETWORK
    Setayeshgar, Leila
    Wang, Hui
    ADVANCES IN APPLIED PROBABILITY, 2011, 43 (02) : 545 - 571
  • [48] FEED-FORWARD POSTURAL ADJUSTMENTS TO ACTION
    Latash, Mark L.
    Aruin, Alexander S.
    Klous, Miriam
    Krishnan, Vennila
    Mikulic, Pavle
    6TH INTERNATIONAL SCIENTIFIC CONFERENCE ON KINESIOLOGY: INTEGRATIVE POWER OF KINESIOLOGY, 2011, : 163 - 163
  • [49] Feed-forward Support of Human Walking
    van Dijk, Wietse
    Koopman, Bram
    Ronsse, Renaud
    van der Kooij, Herman
    2012 4TH IEEE RAS & EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL ROBOTICS AND BIOMECHATRONICS (BIOROB), 2012, : 1955 - 1960
  • [50] Joint Synthesis of Dynamic Feed-Forward and Static State Feedback for Platoon Control
    Koroglu, Hakan
    Falcone, Paolo
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 4503 - 4508