Certified Reinforcement Learning with Logic Guidance (1902.00778v4)

Published 2 Feb 2019 in cs.LG and stat.ML

Abstract: Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied to a variety of control problems. However, applications in safety-critical domains require a systematic and formal approach to specifying requirements as tasks or goals. We propose a model-free RL algorithm that enables the use of Linear Temporal Logic (LTL) to formulate a goal for unknown continuous-state/action Markov Decision Processes (MDPs). The given LTL property is translated into a Limit-Deterministic Generalised Buchi Automaton (LDGBA), which is then used to shape a synchronous reward function on-the-fly. Under certain assumptions, the algorithm is guaranteed to synthesise a control policy whose traces satisfy the LTL specification with maximal probability.

Authors (3)

Hosein Hasanbeig (8 papers)
Daniel Kroening (80 papers)
Alessandro Abate (137 papers)

Citations (49)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Certified Reinforcement Learning with Logic Guidance (1902.00778v4)

Summary

Related Papers