From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction (1804.10974v1)

Published 29 Apr 2018 in cs.CL, cs.LG, and stat.ML

Abstract: In this work, we study the credit assignment problem in reward augmented maximum likelihood (RAML) learning, and establish a theoretical equivalence between the token-level counterpart of RAML and the entropy regularized reinforcement learning. Inspired by the connection, we propose two sequence prediction algorithms, one extending RAML with fine-grained credit assignment and the other improving Actor-Critic with a systematic entropy regularization. On two benchmark datasets, we show the proposed algorithms outperform RAML and Actor-Critic respectively, providing new alternatives to sequence prediction.

Citations (6)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - zihangdai/ERAC-VAML: From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction (9 stars)

From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction (1804.10974v1)

Summary

Related Papers

GitHub