Inverse Reinforcement Learning with Missing Data (1911.06930v1)

Published 16 Nov 2019 in cs.LG, cs.AI, and stat.ML

Abstract: We consider the problem of recovering an expert's reward function with inverse reinforcement learning (IRL) when there are missing/incomplete state-action pairs or observations in the demonstrated trajectories. This issue of missing trajectory data or information occurs in many situations, e.g., GPS signals from vehicles moving on a road network are intermittent. In this paper, we propose a tractable approach to directly compute the log-likelihood of demonstrated trajectories with incomplete/missing data. Our algorithm is efficient in handling a large number of missing segments in the demonstrated trajectories, as it performs the training with incomplete data by solving a sequence of systems of linear equations, and the number of such systems to be solved does not depend on the number of missing segments. Empirical evaluation on a real-world dataset shows that our training algorithm outperforms other conventional techniques.

Authors (4)

Tien Mai (33 papers)
Quoc Phong Nguyen (14 papers)
Kian Hsiang Low (32 papers)
Patrick Jaillet (100 papers)

Citations (3)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Inverse Reinforcement Learning with Missing Data (1911.06930v1)

Summary

Related Papers