REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment (2405.18525v1)

Published 28 May 2024 in cs.CV

Abstract: Traditional image-to-3D models often struggle with scenes containing multiple objects due to biases and occlusion complexities. To address this challenge, we present REPARO, a novel approach for compositional 3D asset generation from single images. REPARO employs a two-step process: first, it extracts individual objects from the scene and reconstructs their 3D meshes using off-the-shelf image-to-3D models; then, it optimizes the layout of these meshes through differentiable rendering techniques, ensuring coherent scene composition. By integrating an optimal transport-based long-range appearance loss term and a high-level semantic loss term in the differentiable rendering, REPARO can effectively recover the layout of 3D assets. The proposed method significantly enhances object independence, detail accuracy, and overall scene coherence. Extensive evaluation on multi-object scenes demonstrates that REPARO offers a comprehensive approach to the complexities of multi-object 3D scene generation from single images.


Summary

  • The paper presents a two-step method that first extracts individual objects from the scene for detailed 3D mesh reconstruction.
  • It then optimizes the layout of the reconstructed meshes through differentiable rendering, integrating an optimal transport-based appearance loss and a high-level semantic loss to achieve spatial and semantic coherence.
  • Evaluations demonstrate that REPARO significantly outperforms existing methods at handling occlusions and biases in multi-object 3D scene generation.

The paper "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment," published in May 2024, introduces a novel method called REPARO to tackle the challenge of generating 3D assets from single images, specifically in scenarios containing multiple objects. Traditional image-to-3D reconstruction methods often fail in such settings due to biases and the complexities arising from occlusions.

REPARO addresses these issues through a two-step compositional process:

  1. Extraction and Mesh Reconstruction: First, individual objects are extracted (segmented) from the input image, and each is reconstructed into a 3D mesh using an existing, off-the-shelf image-to-3D model. Reconstructing objects individually captures each object's geometric details without interference from the others.
  2. Differentiable Layout Optimization: Second, REPARO uses differentiable rendering to optimize the spatial layout of the individual 3D meshes. It integrates an optimal transport-based long-range appearance loss and a high-level semantic loss within the differentiable rendering framework: the appearance loss corrects spatial misalignments between the rendered composition and the input image, while the semantic loss keeps the layout semantically consistent. A minimal sketch of this optimization loop follows the list.
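
To make the layout stage concrete, here is a minimal sketch of such an optimization loop in PyTorch. The per-object parameterization (translation, log-scale, yaw), the loss weighting, and the callbacks render_fn (a differentiable renderer), appearance_loss, and semantic_loss are all illustrative assumptions, not the paper's actual implementation.

```python
# Hedged sketch of differentiable 3D layout alignment (stage 2 of REPARO).
# render_fn, appearance_loss, and semantic_loss are caller-supplied stand-ins;
# the parameterization and loss weighting below are assumptions.
import torch

def optimize_layout(meshes, target, render_fn, appearance_loss, semantic_loss,
                    steps=500, lr=1e-2, sem_weight=0.1):
    n = len(meshes)
    # Per-object layout parameters: 3D translation, log-scale (exp keeps the
    # scale positive without explicit constraints), and a yaw angle.
    t = torch.zeros(n, 3, requires_grad=True)
    log_s = torch.zeros(n, 1, requires_grad=True)
    yaw = torch.zeros(n, 1, requires_grad=True)
    opt = torch.optim.Adam([t, log_s, yaw], lr=lr)

    for _ in range(steps):
        opt.zero_grad()
        # Differentiably render the composed scene so gradients flow from the
        # image-space losses back to the layout parameters.
        rendered = render_fn(meshes, t, log_s.exp(), yaw)
        loss = appearance_loss(rendered, target) \
               + sem_weight * semantic_loss(rendered, target)
        loss.backward()
        opt.step()
    return t.detach(), log_s.exp().detach(), yaw.detach()
```

Any differentiable rasterizer can serve as render_fn here; the key design point is that both loss terms are computed in image space, yet their gradients update only the 3D layout parameters.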

The core innovation lies in these two loss terms, which together recover the layout of the 3D assets effectively. The dual-loss integration preserves object independence, enhances detail accuracy, and maintains overall scene coherence.
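
For intuition about the appearance term, the following is a hedged sketch of an entropic optimal transport (Sinkhorn) loss between two images, treating each image as a point cloud of pixels in a joint position-color space. The feature choice, eps, and iteration count are illustrative assumptions, and the paper's exact cost and formulation may differ; note the cost matrix is quadratic in pixel count, so images should be downsampled in practice.

```python
import torch

def pixel_cloud(img):
    """Flatten an (H, W, 3) image in [0, 1] into an (H*W, 5) point cloud of
    normalized (x, y) positions concatenated with (r, g, b) colors."""
    h, w, _ = img.shape
    ys, xs = torch.meshgrid(
        torch.linspace(0, 1, h, device=img.device),
        torch.linspace(0, 1, w, device=img.device),
        indexing="ij",
    )
    return torch.cat(
        [xs.reshape(-1, 1), ys.reshape(-1, 1), img.reshape(-1, 3)], dim=1
    )

def sinkhorn_appearance_loss(rendered, target, eps=0.05, iters=50):
    """Entropic OT cost between two pixel clouds via Sinkhorn iterations."""
    a, b = pixel_cloud(rendered), pixel_cloud(target)
    cost = torch.cdist(a, b) ** 2                   # squared Euclidean costs
    K = torch.exp(-cost / eps)                      # Gibbs kernel
    mu = torch.full((a.shape[0],), 1.0 / a.shape[0], device=a.device)
    nu = torch.full((b.shape[0],), 1.0 / b.shape[0], device=b.device)
    u, v = torch.ones_like(mu), torch.ones_like(nu)
    for _ in range(iters):                          # alternate marginal scaling
        u = mu / (K @ v + 1e-9)
        v = nu / (K.T @ u + 1e-9)
    plan = u[:, None] * K * v[None, :]              # approximate transport plan
    return (plan * cost).sum()                      # <plan, cost>
```

Because the transport plan matches pixels across the whole image rather than comparing them position by position, this loss provides the long-range gradient signal needed to move a badly misplaced object toward its correct location.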

The evaluations demonstrate that REPARO significantly improves upon existing methods on multi-object scenes, with notable gains in scene coherence and object detail accuracy, effectively overcoming the bias and occlusion issues common in multi-object 3D scene generation from single images. This comprehensive approach makes REPARO a promising solution for complex 3D asset generation tasks.
