
Pix2Point: Learning Outdoor 3D Using Sparse Point Clouds and Optimal Transport (2107.14498v1)

Published 30 Jul 2021 in cs.CV

Abstract: Good-quality reconstruction and comprehension of a scene rely on 3D estimation methods. 3D information has usually been obtained from images by stereo-photogrammetry, but deep learning has recently provided excellent results for monocular depth estimation. Building a sufficiently large and rich training dataset to achieve these results requires onerous processing. In this paper, we address the problem of learning outdoor 3D point clouds from monocular data using a sparse ground-truth dataset. We propose Pix2Point, a deep learning-based approach for monocular 3D point cloud prediction, able to deal with complete and challenging outdoor scenes. Our method relies on a 2D-3D hybrid neural network architecture and a supervised end-to-end minimisation of an optimal transport divergence between point clouds. We show that, when trained on sparse point clouds, our simple yet promising approach achieves better coverage of 3D outdoor scenes than efficient monocular depth methods.
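
The abstract names an optimal transport divergence between point clouds as the training loss but does not give its exact form. As a rough illustration only, the sketch below computes one common choice, a debiased entropic (Sinkhorn) divergence, between two point clouds with uniform weights. The function names, the squared-distance cost, the regularisation strength `eps` and the iteration count are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def _logsumexp(a, axis):
    # Numerically stable log(sum(exp(a))) along one axis.
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))


def entropic_ot_cost(x, y, eps=0.1, n_iters=200):
    """Entropy-regularised OT cost between point clouds x (n, 3) and y (m, 3).

    Hypothetical helper: uniform weights and a cost of 0.5 * ||x_i - y_j||^2
    are assumptions of this sketch.
    """
    n, m = x.shape[0], y.shape[0]
    cost = 0.5 * np.sum((x[:, None, :] - y[None, :, :]) ** 2, axis=-1)
    log_a = np.full(n, -np.log(n))  # log of uniform weights on x
    log_b = np.full(m, -np.log(m))  # log of uniform weights on y
    f = np.zeros(n)  # dual potential attached to x
    g = np.zeros(m)  # dual potential attached to y
    for _ in range(n_iters):
        # Log-domain Sinkhorn updates of the two dual potentials.
        f = -eps * _logsumexp(log_b[None, :] + (g[None, :] - cost) / eps, axis=1)
        g = -eps * _logsumexp(log_a[:, None] + (f[:, None] - cost) / eps, axis=0)
    # Dual objective value after the final update of g.
    return float(np.sum(np.exp(log_a) * f) + np.sum(np.exp(log_b) * g))


def sinkhorn_divergence(x, y, eps=0.1, n_iters=200):
    """Debiased divergence: OT(x, y) - 0.5 * OT(x, x) - 0.5 * OT(y, y)."""
    return (entropic_ot_cost(x, y, eps, n_iters)
            - 0.5 * entropic_ot_cost(x, x, eps, n_iters)
            - 0.5 * entropic_ot_cost(y, y, eps, n_iters))
```

A loss of this kind compares clouds as sets, so it needs no point-to-point correspondence and the two clouds may have different sizes, which is one reason such divergences suit sparse ground truth. For end-to-end training the same computation would have to be written in an automatic-differentiation framework; NumPy is used here only to show the quantity being minimised.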

Authors (5)
  1. Rémy Leroy (1 paper)
  2. Pauline Trouvé-Peloux (8 papers)
  3. Frédéric Champagnat (6 papers)
  4. Bertrand Le Saux (59 papers)
  5. Marcela Carvalho (5 papers)
Citations (3)
