Counting in Language with RNNs (1810.12411v2)
Published 29 Oct 2018 in cs.LG, cs.NE, and stat.ML
Abstract: In this paper we examine a possible reason why the LSTM outperforms the GRU on language modeling and, more specifically, machine translation. We hypothesize that this has to do with counting, a consistent theme across the literature on long-term dependencies, counting, and language modeling with RNNs. Using simplified forms of language -- context-free and context-sensitive languages -- we show exactly how the LSTM performs its counting, based on its cell states during inference, and why the GRU cannot perform as well.
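The counting mechanism the abstract alludes to can be illustrated with a minimal sketch: an LSTM cell whose gates are saturated so that the cell state acts as a counter over a context-free language such as a^n b^n. The weights below are hand-set for illustration (the paper's networks learn such behavior from data), and all function and variable names are this sketch's own.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Illustrative LSTM cell with hand-set, saturated weights.
# Input is one-hot over {a, b}. With the input and forget gates
# pinned near 1 and the candidate near +1 on 'a' and -1 on 'b',
# the cell state c accumulates (count of a's) - (count of b's).
W = 20.0  # large weight to saturate the sigmoid/tanh nonlinearities

def lstm_count(string):
    c = 0.0  # cell state, used here as the counter
    for ch in string:
        x_a = 1.0 if ch == 'a' else 0.0
        x_b = 1.0 if ch == 'b' else 0.0
        i = sigmoid(W)                    # input gate ~ 1 (always open)
        f = sigmoid(W)                    # forget gate ~ 1 (perfect memory)
        g = math.tanh(W * x_a - W * x_b)  # candidate: ~ +1 on 'a', ~ -1 on 'b'
        c = f * c + i * g                 # standard LSTM cell-state update
    return c

# Strings in a^n b^n drive the cell state back to ~0; unbalanced
# strings leave a residual equal to the surplus of a's.
print(round(lstm_count('aaabbb'), 2))  # ≈ 0.0
print(round(lstm_count('aaab'), 2))    # ≈ 2.0
```

Because the GRU's single update gate ties writing and forgetting together, it cannot maintain an unbounded counter in the same way, which is the asymmetry the paper probes.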