Determining decomposability in standard RL benchmarks
Determine whether the decomposability condition—namely, that every pair of states is linked via finite sequences of shared predecessor relationships—is satisfied in widely used reinforcement learning benchmark environments such as MuJoCo continuous control tasks and Atari games, under their respective transition dynamics, to validate the assumptions needed for Adversarial IRL to recover state-only rewards up to a constant.
References
We have not been able to determine whether the decomposability condition is satisfied in standard RL benchmarks, such as MuJoCo tasks or Atari games.
— A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
(2203.11409 - Gleave et al., 2022) in Subsection "Recovering rewards" (following the definition and examples of the Decomposability Condition)