Generalization to Real-World Planning Scenarios
Establish whether the documentation-retrieval-integrated PDDL generation pipelines—including Modular w/ Specific Doc, Once w/ Whole Doc, and Refinement w/ Code-Retrieved Doc—generalize beyond the evaluated benchmarks (Blocks World, Logistics, Barman, and Mystery Blocks World) to more diverse or real-world planning scenarios.
References
Lastly, our evaluation is confined to a few benchmark domains; generalization to more diverse or real-world planning scenarios remains to be verified.
— Documentation Retrieval Improves Planning Language Generation
(2509.19931 - Wang et al., 24 Sep 2025) in Section 6 (Limitations)