Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding (2406.08009v1)

Published 12 Jun 2024 in cs.CV, cs.AI, and cs.RO

Abstract: In recent years, there has been a surge of interest in open-vocabulary 3D scene reconstruction facilitated by visual LLMs (VLMs), which showcase remarkable capabilities in open-set retrieval. However, existing methods face some limitations: they either focus on learning point-wise features, resulting in blurry semantic understanding, or solely tackle object-level reconstruction, thereby overlooking the intricate details of the object's interior. To address these challenges, we introduce OpenObj, an innovative approach to build open-vocabulary object-level Neural Radiance Fields (NeRF) with fine-grained understanding. In essence, OpenObj establishes a robust framework for efficient and watertight scene modeling and comprehension at the object-level. Moreover, we incorporate part-level features into the neural fields, enabling a nuanced representation of object interiors. This approach captures object-level instances while maintaining a fine-grained understanding. The results on multiple datasets demonstrate that OpenObj achieves superior performance in zero-shot semantic segmentation and retrieval tasks. Additionally, OpenObj supports real-world robotics tasks at multiple scales, including global movement and local manipulation.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yinan Deng (7 papers)
  2. Jiahui Wang (46 papers)
  3. Jingyu Zhao (14 papers)
  4. Jianyu Dou (1 paper)
  5. Yi Yang (856 papers)
  6. Yufeng Yue (28 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com