Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences (2103.14898v3)

Published 27 Mar 2021 in cs.CV and cs.LG

Abstract: Scene graphs are a compact and explicit representation successfully used in a variety of 2D scene understanding tasks. This work proposes a method to incrementally build up semantic scene graphs from a 3D environment given a sequence of RGB-D frames. To this end, we aggregate PointNet features from primitive scene components by means of a graph neural network. We also propose a novel attention mechanism well suited for partial and missing graph data present in such an incremental reconstruction scenario. Although our proposed method is designed to run on submaps of the scene, we show it also transfers to entire 3D scenes. Experiments show that our approach outperforms 3D scene graph prediction methods by a large margin and its accuracy is on par with other 3D semantic and panoptic segmentation methods while running at 35 Hz.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Shun-Cheng Wu (11 papers)
  2. Johanna Wald (9 papers)
  3. Keisuke Tateno (12 papers)
  4. Nassir Navab (459 papers)
  5. Federico Tombari (214 papers)
Citations (134)