Termination guarantees for CVI with global stopping criteria

Prove that compositional value iteration (CVI) terminates in finitely many steps under the proposed global stopping criteria (GSCs), namely the optimistic GSC and the bottom-up GSC, for arbitrary input string diagrams of MDPs and weights. Such a proof would provide a general termination guarantee.

Background

The paper introduces two sound global stopping criteria (GSCs) for compositional value iteration (CVI): an optimistic variant based on optimistic value iteration (OVI) and a bottom-up criterion that uses Pareto caches. Both are sound: if the algorithm terminates, its output satisfies the given precision.

However, the authors do not prove that the algorithm terminates under these criteria. Termination in undiscounted settings is subtle because of obstacles such as end components, which can arise compositionally.
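To see why end components make termination subtle, consider the following minimal sketch. It is not the paper's CVI algorithm or either of its GSCs. It is plain two-sided value iteration for maximal reachability on a hypothetical three-state MDP, with a naive stopping rule that waits for the upper and lower bounds to come within `eps` of each other. The state names, the `MDP` dictionary layout, and the function names are all assumptions made for illustration.

```python
# Illustrative sketch only: naive two-sided value iteration on a toy MDP.
# The MDP, names, and stopping rule are hypothetical, not taken from the paper.

# Each state maps action names to successor distributions.
# "s0" has a self-loop action ("stay"), which forms an end component.
MDP = {
    "s0": {
        "stay": {"s0": 1.0},
        "try": {"goal": 0.5, "sink": 0.5},
    },
    "goal": {},  # target state, value fixed to 1
    "sink": {},  # losing state, value fixed to 0
}


def bellman_max(values, mdp):
    """Apply one maximizing Bellman update to every state that has actions."""
    updated = dict(values)
    for state, actions in mdp.items():
        if not actions:
            continue  # absorbing states keep their fixed value
        updated[state] = max(
            sum(prob * values[succ] for succ, prob in dist.items())
            for dist in actions.values()
        )
    return updated


def naive_interval_iteration(mdp, eps=1e-6, max_steps=1000):
    """Iterate a lower and an upper bound; stop when they are eps-close at s0."""
    lower = {"s0": 0.0, "goal": 1.0, "sink": 0.0}
    upper = {"s0": 1.0, "goal": 1.0, "sink": 0.0}
    for _ in range(max_steps):
        lower = bellman_max(lower, mdp)
        upper = bellman_max(upper, mdp)
        if upper["s0"] - lower["s0"] < eps:
            return lower["s0"], upper["s0"], True
    return lower["s0"], upper["s0"], False  # step budget exhausted


low, up, stopped = naive_interval_iteration(MDP)
print(low, up, stopped)
```

In this toy model the lower bound reaches the true value 0.5 after one update, but the upper bound stays at 1 because the `stay` action lets `s0` keep its own over-approximation indefinitely. The gap therefore never closes, and the naive rule never fires. Any termination proof for a sound stopping criterion has to rule out this kind of behavior.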

References

Although ensuring the termination of our algorithm in finite steps with our GSCs remains future work, we show that our GSCs are sound, that is, its output satisfies a given precision upon termination.

Compositional Value Iteration with Pareto Caching (2405.10099 - Watanabe et al., 16 May 2024) in Section 2, Global Stopping Criteria (GSCs)