Training Recurrent Neural Networks Online by Learning Explicit State Variables

Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White

Keywords: incremental learning, partial observability, rnn

Wed Session 4 (17:00-19:00 GMT)
Wed Session 5 (20:00-22:00 GMT)

Abstract: Recurrent neural networks (RNNs) allow an agent to construct a state-representation from a stream of experience, which is essential in partially observable problems. However, there are two primary issues one must overcome when training an RNN: the sensitivity of the learning algorithm's performance to truncation length and long training times. There are a variety of strategies to improve training in RNNs, most notably Backpropagation Through Time (BPTT) and Real-Time Recurrent Learning (RTRL). These strategies, however, are typically computationally expensive and focus computation on propagating gradients back in time. In this work, we reformulate the RNN training objective to explicitly learn state vectors; this breaks the dependence across time and so avoids the need to estimate gradients far back in time. We show that for a fixed buffer of data, our algorithm---called Fixed Point Propagation (FPP)---is sound: it converges to a stationary point of the new objective. We investigate the empirical performance of our online FPP algorithm, particularly in terms of computation compared to truncated BPTT with varying truncation levels.
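The reformulation described in the abstract can be made concrete with a short sketch: each hidden state stored in the buffer becomes an explicit learnable variable, and the objective combines a prediction loss with a consistency (fixed-point) penalty tying consecutive states together through the RNN transition. The code below is a minimal, hypothetical PyTorch rendering of that idea, not the authors' implementation; all names (`cell`, `readout`, `fpp_update`) and the squared-error losses are assumptions for illustration.

```python
# Hypothetical sketch of the Fixed Point Propagation (FPP) idea from the
# abstract: learn explicit state variables alongside the RNN parameters.
import torch
import torch.nn as nn

obs_dim, state_dim, out_dim, buffer_size = 8, 16, 1, 100

# RNN transition f_theta and readout g_theta (illustrative choices).
cell = nn.RNNCell(obs_dim, state_dim)
readout = nn.Linear(state_dim, out_dim)

# Explicit state variables: one learnable vector per buffered time step.
# Treating these as parameters breaks the dependence across time, so no
# gradients need to flow back through earlier steps (no BPTT unrolling).
states = nn.Parameter(torch.zeros(buffer_size, state_dim))

opt = torch.optim.SGD(
    list(cell.parameters()) + list(readout.parameters()) + [states], lr=1e-2)

def fpp_update(obs, targets, idx):
    """One stochastic update on a sampled transition (t-1, t), with t = idx >= 1."""
    opt.zero_grad()
    # Consistency term: the stored state s_t should match the transition
    # applied to the stored previous state s_{t-1} and observation x_t.
    pred_state = cell(obs[idx].unsqueeze(0), states[idx - 1].unsqueeze(0))
    consistency = ((pred_state - states[idx]) ** 2).sum()
    # Prediction term: the stored state should also explain the target y_t.
    prediction = ((readout(states[idx]) - targets[idx]) ** 2).sum()
    (consistency + prediction).backward()
    opt.step()

# Usage: sample indices from a filled buffer and update online.
obs = torch.randn(buffer_size, obs_dim)
targets = torch.randn(buffer_size, out_dim)
for _ in range(10):
    fpp_update(obs, targets, torch.randint(1, buffer_size, (1,)).item())
```

Because each update touches only a single sampled transition, gradients never flow back further than one step, which is what removes the sensitivity to truncation length that the abstract highlights.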
