Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction (1903.03757v2)

Published 9 Mar 2019 in cs.CV

Abstract: Indoor scenes exhibit rich hierarchical structure in 3D object layouts. Many tasks in 3D scene understanding can benefit from reasoning jointly about the hierarchical context of a scene, and the identities of objects. We present a variational denoising recursive autoencoder (VDRAE) that generates and iteratively refines a hierarchical representation of 3D object layouts, interleaving bottom-up encoding for context aggregation and top-down decoding for propagation. We train our VDRAE on large-scale 3D scene datasets to predict both instance-level segmentations and a 3D object detections from an over-segmentation of an input point cloud. We show that our VDRAE improves object detection performance on real-world 3D point cloud datasets compared to baselines from prior work.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Yifei Shi (26 papers)
  2. Angel Xuan Chang (3 papers)
  3. Zhelun Wu (2 papers)
  4. Manolis Savva (64 papers)
  5. Kai Xu (312 papers)
Citations (25)