Survey of reasoning using Neural networks (1702.06186v2)

Published 14 Feb 2017 in cs.LG, cs.AI, and cs.NE

Abstract: Reasoning and inference require both processing and memory skills in humans. Neural networks can handle processing tasks such as image recognition (better than humans), but their memory is still limited (by the attention mechanism and by size). The Recurrent Neural Network (RNN) and its modified version, the LSTM, can handle small memory contexts, but they become difficult to use once the context grows beyond a threshold. The solution is to use a large external memory. This still poses many challenges, such as how to train neural networks on a discrete memory representation and how to describe long-term dependencies in sequential data. The most prominent neural architectures for such tasks are Memory Networks, which combine inference components with a long-term memory, and Neural Turing Machines, which are neural networks that use external memory resources. Additional techniques, such as attention mechanisms and end-to-end gradient descent on a discrete memory representation, are needed to support these solutions. Preliminary results of these architectures on simple algorithms (sorting, copying) and on question answering (based on stories and dialogs) are comparable with the state of the art. In this paper, I explain these architectures (in general), the additional techniques used, and the results of their application.
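
The abstract names attention over an external memory as a supporting technique. As a rough illustration only, and not the paper's own method, the sketch below shows a content-based soft read: a query is scored against every memory slot, the scores are normalized with a softmax, and the result is a weighted average of the slots. The names `softmax`, `soft_read`, `memory`, and `query`, and the use of a dot product as the similarity score, are assumptions made for this example.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_read(memory, query):
    """Hypothetical soft read over an external memory.

    memory: list of slots, each a list of floats of equal length
    query:  list of floats with the same length as a slot
    Returns (weights, read_vector).
    """
    # Assumed similarity measure: plain dot product per slot.
    scores = [sum(q * m for q, m in zip(query, slot)) for slot in memory]
    weights = softmax(scores)
    dim = len(memory[0])
    # Weighted average of all slots, one dimension at a time.
    read = [sum(w * slot[i] for w, slot in zip(weights, memory))
            for i in range(dim)]
    return weights, read


# Toy data, invented for this sketch.
memory = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]
weights, read = soft_read(memory, query)
print(weights, read)
```

Because every slot contributes with a smooth weight instead of a hard lookup, the read stays differentiable with respect to the query and the memory contents, which is what lets such a memory be trained with gradient descent.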

Authors (1)

Amit Sahu

Related Papers

Neural Turing Machines (2014)
Memory Capacity of Recurrent Neural Networks with Matrix Representation (2021)
Memory and attention in deep learning (2021)
A Taxonomy for Neural Memory Networks (2018)
Memory Augmented Neural Networks with Wormhole Connections (2017)
