Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations (2012.09988v1)

Published 18 Dec 2020 in cs.CV

Abstract: 3D object detection has recently become popular due to many applications in robotics, augmented reality, autonomy, and image retrieval. We introduce the Objectron dataset to advance the state of the art in 3D object detection and foster new research and applications, such as 3D object tracking, view synthesis, and improved 3D shape representation. The dataset contains object-centric short videos with pose annotations for nine categories and includes 4 million annotated images in 14,819 annotated videos. We also propose a new evaluation metric, 3D Intersection over Union, for 3D object detection. We demonstrate the usefulness of our dataset in 3D object detection tasks by providing baseline models trained on this dataset. Our dataset and evaluation source code are available online at http://www.objectron.dev

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Adel Ahmadyan (4 papers)
  2. Liangkai Zhang (3 papers)
  3. Jianing Wei (7 papers)
  4. Artsiom Ablavatski (9 papers)
  5. Matthias Grundmann (31 papers)
Citations (156)

Summary

We haven't generated a summary for this paper yet.