Empirical Evaluation of A New Approach to Simplifying Long Short-term Memory (LSTM) (1612.03707v1)

Published 12 Dec 2016 in cs.NE

Abstract: The standard LSTM, although it succeeds in modeling long-range dependencies, suffers from a highly complex structure that can be simplified through modifications to its gate units. This paper performs an empirical comparison between the standard LSTM and three new simplified variants, obtained by eliminating the input signal, the bias, or the hidden-unit signal from individual gates, on the task of modeling two sequence datasets. The experiments show that the three variants, with fewer parameters, can achieve performance comparable to that of the standard LSTM. Due attention should be paid to tuning the learning rate to achieve high accuracies.
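To make the gate simplifications concrete, here is a minimal NumPy sketch of one LSTM step where a single term is dropped from the gate computations. The variant names (`no_input`, `no_bias`, `no_hidden`), the parameter-dict layout, and the exact per-variant equations are assumptions inferred only from the abstract's wording, not the paper's own notation.

```python
# Minimal sketch of an LSTM step with simplified gate variants.
# ASSUMPTIONS: variant names, parameter layout, and the exact
# per-variant equations are hypothetical, inferred from the abstract.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gate(Wx, Wh, b, x, h_prev, variant):
    """Compute one gate activation, dropping one term per variant."""
    if variant == "no_input":        # gates ignore the input x_t
        z = Wh @ h_prev + b
    elif variant == "no_bias":       # gates carry no bias term
        z = Wx @ x + Wh @ h_prev
    elif variant == "no_hidden":     # gates ignore h_{t-1}
        z = Wx @ x + b
    else:                            # standard LSTM gate
        z = Wx @ x + Wh @ h_prev + b
    return sigmoid(z)

def lstm_step(x, h_prev, c_prev, p, variant="standard"):
    """One LSTM step; only the gates are simplified, while the
    candidate cell input g keeps its standard form."""
    i = gate(p["Wxi"], p["Whi"], p["bi"], x, h_prev, variant)  # input gate
    f = gate(p["Wxf"], p["Whf"], p["bf"], x, h_prev, variant)  # forget gate
    o = gate(p["Wxo"], p["Who"], p["bo"], x, h_prev, variant)  # output gate
    g = np.tanh(p["Wxg"] @ x + p["Whg"] @ h_prev + p["bg"])    # candidate
    c = f * c_prev + i * g           # new cell state
    h = o * np.tanh(c)               # new hidden state
    return h, c

# Toy usage with random parameters (shapes chosen for illustration).
rng = np.random.default_rng(0)
nx, nh = 4, 3
p = {}
for u in ["i", "f", "o", "g"]:
    p[f"Wx{u}"] = 0.1 * rng.standard_normal((nh, nx))
    p[f"Wh{u}"] = 0.1 * rng.standard_normal((nh, nh))
    p[f"b{u}"] = np.zeros(nh)
h, c = np.zeros(nh), np.zeros(nh)
h, c = lstm_step(rng.standard_normal(nx), h, c, p, variant="no_bias")
```

Under this reading, the parameter savings are easy to count for input size nx and hidden size nh: dropping the input term from all three gates removes 3·nh·nx weights, dropping the biases removes 3·nh, and dropping the hidden term removes 3·nh² weights, which is consistent with the abstract's claim of comparable performance with reduced parameters.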

Citations (2)
