Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Bayes3D: fast learning and inference in structured generative models of 3D objects and scenes (2312.08715v1)

Published 14 Dec 2023 in cs.RO

Abstract: Robots cannot yet match humans' ability to rapidly learn the shapes of novel 3D objects and recognize them robustly despite clutter and occlusion. We present Bayes3D, an uncertainty-aware perception system for structured 3D scenes, that reports accurate posterior uncertainty over 3D object shape, pose, and scene composition in the presence of clutter and occlusion. Bayes3D delivers these capabilities via a novel hierarchical Bayesian model for 3D scenes and a GPU-accelerated coarse-to-fine sequential Monte Carlo algorithm. Quantitative experiments show that Bayes3D can learn 3D models of novel objects from just a handful of views, recognizing them more robustly and with orders of magnitude less training data than neural baselines, and tracking 3D objects faster than real time on a single GPU. We also demonstrate that Bayes3D learns complex 3D object models and accurately infers 3D scene composition when used on a Panda robot in a tabletop scenario.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Nishad Gothoskar (11 papers)
  2. Matin Ghavami (3 papers)
  3. Eric Li (15 papers)
  4. Aidan Curtis (19 papers)
  5. Michael Noseworthy (12 papers)
  6. Karen Chung (2 papers)
  7. Brian Patton (17 papers)
  8. William T. Freeman (114 papers)
  9. Joshua B. Tenenbaum (257 papers)
  10. Mirko Klukas (8 papers)
  11. Vikash K. Mansinghka (42 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com