Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ARMIN: Towards a More Efficient and Light-weight Recurrent Memory Network (1906.12087v1)

Published 28 Jun 2019 in cs.LG, cs.NE, and stat.ML

Abstract: In recent years, memory-augmented neural networks(MANNs) have shown promising power to enhance the memory ability of neural networks for sequential processing tasks. However, previous MANNs suffer from complex memory addressing mechanism, making them relatively hard to train and causing computational overheads. Moreover, many of them reuse the classical RNN structure such as LSTM for memory processing, causing inefficient exploitations of memory information. In this paper, we introduce a novel MANN, the Auto-addressing and Recurrent Memory Integrating Network (ARMIN) to address these issues. The ARMIN only utilizes hidden state ht for automatic memory addressing, and uses a novel RNN cell for refined integration of memory information. Empirical results on a variety of experiments demonstrate that the ARMIN is more light-weight and efficient compared to existing memory networks. Moreover, we demonstrate that the ARMIN can achieve much lower computational overhead than vanilla LSTM while keeping similar performances. Codes are available on github.com/zoharli/armin.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Zhangheng Li (6 papers)
  2. Jia-Xing Zhong (12 papers)
  3. Jingjia Huang (12 papers)
  4. Tao Zhang (481 papers)
  5. Thomas Li (21 papers)
  6. Ge Li (213 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.