Simultaneous multi-object 9-DoF pose control in image generation
Establish methods that enable simultaneous, accurate control over the full 9-DoF poses—3D location, size, and orientation—of multiple objects within a single image generation process, ensuring reliable alignment between specified poses and generated content.
Sponsor
References
However, achieving simultaneous control over the 9D poses (location, size, and orientation) of multiple objects remains an open challenge.
— SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation
(2511.16666 - Qin et al., 20 Nov 2025) in Abstract (also reiterated in Section 1: Introduction)