
Learning to Infer Implicit Surfaces without 3D Supervision (1911.00767v1)

Published 2 Nov 2019 in cs.CV and cs.GR

Abstract: Recent advances in 3D deep learning have shown that it is possible to train highly effective deep models for 3D shape generation, directly from 2D images. This is particularly interesting since the availability of 3D models is still limited compared to the massive amount of accessible 2D images, which is invaluable for training. The representation of 3D surfaces itself is a key factor for the quality and resolution of the 3D output. While explicit representations, such as point clouds and voxels, can span a wide range of shape variations, their resolutions are often limited. Mesh-based representations are more efficient but are limited by their ability to handle varying topologies. Implicit surfaces, however, can robustly handle complex shapes, topologies, and also provide flexible resolution control. We address the fundamental problem of learning implicit surfaces for shape inference without the need of 3D supervision. Despite their advantages, it remains nontrivial to (1) formulate a differentiable connection between implicit surfaces and their 2D renderings, which is needed for image-based supervision; and (2) ensure precise geometric properties and control, such as local smoothness. In particular, sampling implicit surfaces densely is also known to be a computationally demanding and very slow operation. To this end, we propose a novel ray-based field probing technique for efficient image-to-field supervision, as well as a general geometric regularizer for implicit surfaces, which provides natural shape priors in unconstrained regions. We demonstrate the effectiveness of our framework on the task of single-view image-based 3D shape digitization and show how we outperform state-of-the-art techniques both quantitatively and qualitatively.

Authors (4)
  1. Shichen Liu (21 papers)
  2. Shunsuke Saito (56 papers)
  3. Weikai Chen (31 papers)
  4. Hao Li (803 papers)
Citations (192)

Summary

Learning to Infer Implicit Surfaces Without 3D Supervision

The paper "Learning to Infer Implicit Surfaces Without 3D Supervision" addresses a central challenge in 3D shape modeling and reconstruction: 3D ground-truth data is scarce, while 2D images are abundant and largely untapped for training. The researchers propose a framework for learning implicit surface representations directly from 2D images, eliminating the need for direct 3D supervision. This approach stands in contrast to prior methods that rely on explicit surface models such as voxels, point clouds, and meshes, which are often limited by resolution and topology constraints.

Implicit surfaces possess several key advantages, including the ability to represent complex topologies and continuous surfaces with flexible resolution control. However, learning implicit surfaces from 2D data alone poses significant challenges. The primary obstacles are establishing a differentiable link between the implicit surface and its 2D renderings, and enforcing precise geometric properties such as local surface smoothness.

To overcome these challenges, the authors introduce a novel ray-based field probing technique that facilitates efficient image-to-implicit-field supervision. This technique leverages a sparse set of 3D anchor points and probing rays to evaluate and aggregate field occupancy probabilities, allowing for the supervision of implicit field generation from 2D images. Additionally, a geometric regularizer is introduced to ensure desired geometric properties in the implicit surfaces. The regularizer utilizes finite difference methods to compute the derivatives of the implicit field, constraining surface properties like normals and curvature.
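The mechanics of this supervision can be pictured with a small sketch (illustrative only, not the authors' code): occupancy values are queried at anchor points sampled along each camera ray, aggregated into a per-ray "hit" probability, and compared against the 2D silhouette, while a finite-difference term penalizes rapid spatial variation of the field. Names such as `occupancy_net`, `probe_rays`, and the specific near/far bounds are hypothetical placeholders.

```python
import torch
import torch.nn.functional as F

def probe_rays(occupancy_net, ray_origins, ray_dirs, n_samples=64, near=0.5, far=2.5):
    """Sample anchor points along each ray and query the implicit occupancy field.

    occupancy_net: callable mapping (N, 3) points -> (N,) occupancy probabilities in [0, 1].
    ray_origins, ray_dirs: (R, 3) tensors describing camera rays.
    Returns per-ray occupancy samples of shape (R, n_samples).
    """
    t = torch.linspace(near, far, n_samples, device=ray_origins.device)         # depths along each ray
    points = ray_origins[:, None, :] + t[None, :, None] * ray_dirs[:, None, :]  # (R, S, 3) anchor points
    occ = occupancy_net(points.reshape(-1, 3)).reshape(points.shape[:2])         # (R, S) occupancies
    return occ

def silhouette_loss(occ_samples, mask):
    """Aggregate per-ray occupancies into a 'ray hits the shape' probability and
    supervise it with the 2D silhouette value for that ray."""
    hit_prob = 1.0 - torch.prod(1.0 - occ_samples, dim=-1)   # soft union over samples
    return F.binary_cross_entropy(hit_prob.clamp(1e-6, 1 - 1e-6), mask)

def smoothness_regularizer(occupancy_net, points, eps=1e-3):
    """Central finite-difference estimate of the field gradient at sample points,
    penalizing its squared norm to encourage locally smooth surfaces."""
    grads = []
    for axis in range(3):
        offset = torch.zeros_like(points)
        offset[:, axis] = eps
        grads.append((occupancy_net(points + offset) - occupancy_net(points - offset)) / (2 * eps))
    grad = torch.stack(grads, dim=-1)                          # (N, 3) approximate gradient
    return (grad.norm(dim=-1) ** 2).mean()
```

The product-based aggregation above is one common soft approximation of ray-shape intersection; the paper's own aggregation and regularization terms may differ in detail.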

The framework is validated through experiments using the ShapeNet dataset, demonstrating superior performance in single-view 3D shape reconstruction tasks compared to existing unsupervised methods based on explicit representations. Quantitative metrics such as 3D Intersection over Union (IoU) reinforce the effectiveness of the implicit surface modeling approach, achieving higher accuracy and visual fidelity, particularly in capturing complex topologies.
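For reference, the 3D IoU reported in such evaluations is typically computed on voxelized occupancy grids. A minimal sketch follows (the resolution and binarization threshold used in the paper may differ):

```python
import numpy as np

def voxel_iou(pred_occ, gt_occ, threshold=0.5):
    """3D Intersection over Union between predicted and ground-truth occupancy grids.

    pred_occ, gt_occ: arrays of the same shape holding occupancy values;
    both are binarized at `threshold` before comparison.
    """
    pred = pred_occ >= threshold
    gt = gt_occ >= threshold
    intersection = np.logical_and(pred, gt).sum()
    union = np.logical_or(pred, gt).sum()
    return intersection / union if union > 0 else 1.0
```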

The paper suggests that implicit representations provide a more expressive and flexible foundation for learning diverse 3D shapes from limited data. These findings have significant implications for computer vision, particularly for learning 3D models in settings where 3D data is scarce but 2D image data is abundant. Future directions highlighted by the authors include unsupervised learning of textured geometries and the use of hierarchical data structures, such as octrees, to make implicit field modeling more efficient. The work opens possibilities for broader applications in AI, especially in areas demanding high-resolution and topologically diverse 3D reconstructions.