Papers
Topics
Authors
Recent
Search
2000 character limit reached

nuScenes Revisited: Progress and Challenges in Autonomous Driving

Published 2 Dec 2025 in cs.CV and cs.RO | (2512.02448v1)

Abstract: Autonomous Vehicles (AV) and Advanced Driver Assistance Systems (ADAS) have been revolutionized by Deep Learning. As a data-driven approach, Deep Learning relies on vast amounts of driving data, typically labeled in great detail. As a result, datasets, alongside hardware and algorithms, are foundational building blocks for the development of AVs. In this work we revisit one of the most widely used autonomous driving datasets: the nuScenes dataset. nuScenes exemplifies key trends in AV development, being the first dataset to include radar data, to feature diverse urban driving scenes from two continents, and to be collected using a fully autonomous vehicle operating on public roads, while also promoting multi-modal sensor fusion, standardized benchmarks, and a broad range of tasks including perception, localization & mapping, prediction and planning. We provide an unprecedented look into the creation of nuScenes, as well as its extensions nuImages and Panoptic nuScenes, summarizing many technical details that have hitherto not been revealed in academic publications. Furthermore, we trace how the influence of nuScenes impacted a large number of other datasets that were released later and how it defined numerous standards that are used by the community to this day. Finally, we present an overview of both official and unofficial tasks using the nuScenes dataset and review major methodological developments, thereby offering a comprehensive survey of the autonomous driving literature, with a particular focus on nuScenes.

Summary

  • The paper demonstrates how the nuScenes dataset enhances AV research through advanced multimodal sensor fusion and precise calibration techniques.
  • It presents comprehensive data annotation methods and quality controls that ensure high accuracy in object detection, tracking, and localization.
  • The study outlines benchmark challenges and dataset extensions, highlighting both progress and remaining obstacles in autonomous driving.

Summary of "nuScenes Revisited: Progress and Challenges in Autonomous Driving"

The paper "nuScenes Revisited: Progress and Challenges in Autonomous Driving" (2512.02448) presents a comprehensive revisitation of the nuScenes dataset, detailing its foundational role in advancing autonomous vehicle (AV) research. The authors explore the dataset's conception, its expansive influence on subsequent datasets, and the numerous standards it established within the AV community. This essay explores the dataset's structure, technical intricacies, and its extensions, while emphasizing its implications for AV research and the challenges it addresses.

Dataset Overview

nuScenes, a large-scale multimodal autonomous driving dataset, was designed to overcome the limitations of previous datasets such as KITTI. It comprises 1000 driving scenes from urban environments in Boston and Singapore, incorporating data from cameras, lidar, radar, and GPS/IMU systems. The dataset is pivotal for tasks like object detection, tracking, and localization, offering 360-degree sensor coverage and detailed annotations for 23 diverse object classes. The dataset's creation involved rigorous synchronization and calibration of sensors, emphasizing precision in data collection.

Sensor Setup and Calibration:

Two Renault Zoe vehicles equipped with a Velodyne HDL32E lidar, multiple cameras, and Continental ARS 408-21 radars were used for data collection. The calibration processes ensured high-quality data, crucial for accurate perception and planning tasks. The meticulous calibration is evident in the alignment of camera and lidar data, exemplified in high-quality synchronization and calibration efforts. Figure 1

Figure 1: Overview of the sensors and their positions on the nuScenes vehicles.

Data Annotation and Quality Assurance

Annotation in nuScenes involves semantic labeling, instance identification, and bounding box delineation across a wide array of object classes, meticulously validated through multiple stages. The dataset employs privacy-preserving techniques, such as facial and license plate blurring, ensuring anonymity. Similarly, the annotation process includes validation steps that support attaining high-quality data, which is critical for AV applications.

Quality Control:

The quality assurance process in nuScenes is robust, utilizing manual and semi-automated checks to ensure annotation accuracy. This involves validating object scale, class, position, and orientation consistently across the scenes, surpassing typical outsourcing measures used in other datasets.

Extensions and Leaderboards

nuScenes has fostered the evolution of extensions like Panoptic nuScenes and nuImages, extending its utility to point-level annotations and 2D image tasks, respectively. These extensions enable detailed study into environmental interactions and traditional 2D detection challenges, broadening the scope of AV research possibilities.

Benchmark Challenges:

nuScenes hosts various benchmark tasks, such as 3D detection, tracking, and lidar segmentation, providing standardized metrics like the nuScenes Detection Score (NDS). This fosters a competitive and collaborative environment, enabling methodological advancements in AV systems. Figure 2

Figure 2: Results on the nuScenes detection leaderboard.

Radar and Mapping Developments

nuScenes was pioneering in integrating radar data, marking a shift towards robust sensor suites in adverse weather conditions. Despite the inherent challenges in radar data processing, it offered a significant advantage in velocity estimation and environmental perception, leading to broader radar dataset developments in the field.

Mapping Innovations:

The dataset's high-definition maps are integral for precise localization, offering geometric and semantic information crucial for AV operations in complex urban landscapes. These mapping features have inspired subsequent datasets to expand on map granularity and utility, enhancing navigation and planning capabilities. Figure 3

Figure 3: Semantic map annotations overlayed on a front camera image from a driving scene in Boston-Seaport.

Conclusion

The nuScenes dataset has had a monumental impact on AV research, setting benchmarks for multimodal data collection, annotation quality, and sensor fusion techniques. Its extensions and influence on subsequent datasets underscore its foundational role in the field. However, challenges such as sensor synchronization, data density, and geographic diversity remain pivotal areas for improvement. Future iterations and datasets, like nuPlan, focus on closed-loop simulation frameworks, addressing open challenges in perception and planning.

nuScenes continues to facilitate advancements in autonomous driving technologies, promoting robust and comprehensive AV solutions across diverse operational scenarios.

Paper to Video (Beta)

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 1 like about this paper.