IDD-AW: A Benchmark for Safe and Robust Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather (2311.14459v1)
Abstract: Large-scale deployment of fully autonomous vehicles requires a very high degree of robustness to unstructured traffic, and weather conditions, and should prevent unsafe mispredictions. While there are several datasets and benchmarks focusing on segmentation for drive scenes, they are not specifically focused on safety and robustness issues. We introduce the IDD-AW dataset, which provides 5000 pairs of high-quality images with pixel-level annotations, captured under rain, fog, low light, and snow in unstructured driving conditions. As compared to other adverse weather datasets, we provide i.) more annotated images, ii.) paired Near-Infrared (NIR) image for each frame, iii.) larger label set with a 4-level label hierarchy to capture unstructured traffic conditions. We benchmark state-of-the-art models for semantic segmentation in IDD-AW. We also propose a new metric called ''Safe mean Intersection over Union (Safe mIoU)'' for hierarchical datasets which penalizes dangerous mispredictions that are not captured in the traditional definition of mean Intersection over Union (mIoU). The results show that IDD-AW is one of the most challenging datasets to date for these tasks. The dataset and code will be available here: http://iddaw.github.io.
- Robust semantic segmentation by redundant networks with a layer-specific loss contribution and majority vote. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 332–333, 2020.
- nuscenes: A multimodal dataset for autonomous driving. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 11621–11631, 2020.
- All snow removed: Single image desnowing algorithm using hierarchical dual-tree complex wavelet representation and contradict channel loss. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 4196–4205, 2021.
- The cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3213–3223, 2016.
- The cityscapes dataset. In CVPR Workshop on the Future of Datasets in Vision, volume 2. sn, 2015.
- Ithaca365: Dataset and driving perception under repeated and challenging weather conditions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 21383–21392, June 2022.
- IDD-3D: Indian driving dataset for 3d unstructured road scenes. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 4482–4491, January 2023.
- The pascal visual object classes (VOC) challenge. International journal of computer vision, 88:303–338, 2010.
- Removing rain from single images via a deep detail network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3855–3863, 2017.
- Vision meets robotics: The KITTI dataset. The International Journal of Robotics Research, 32(11):1231–1237, 2013.
- Benchmarking the robustness of semantic segmentation models with respect to common corruptions. International journal of computer vision, 129:462–483, 2021.
- Multispectral pedestrian detection via simultaneous detection and segmentation. arXiv preprint arXiv:1808.04818, 2018.
- Deep hierarchical semantic segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1246–1257, 2022.
- Hierarchical semantic segmentation of image scene with object labeling. EURASIP Journal on Image and Video Processing, 2018:1–10, 2018.
- Rain streak removal using layer priors. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2736–2744, 2016.
- 1 year, 1000 km: The oxford robotcar dataset. The International Journal of Robotics Research, 36(1):3–15, 2017.
- The mapillary vistas dataset for semantic understanding of street scenes. In Proceedings of the IEEE international conference on computer vision, pages 4990–4999, 2017.
- Model adaptation with synthetic and real data for semantic dense foggy scene understanding. In Proceedings of the European Conference on Computer Vision (ECCV), September 2018.
- Semantic foggy scene understanding with synthetic data. International Journal of Computer Vision, 126(9):973–992, Sep 2018.
- ACDC: The adverse conditions dataset with correspondences for semantic driving scene understanding. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 10765–10775, 2021.
- Accelerating computer vision tasks on gpus using ramanujan graph product framework. In Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), pages 113–117, 2023.
- Prior-based domain adaptive object detection for hazy and rainy conditions. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XIV 16, pages 763–780. Springer, 2020.
- Piafusion: A progressive infrared and visible image fusion network based on illumination aware. Information Fusion, 83:79–92, 2022.
- Alexander Toet. The tno multiband image data collection. Data in Brief, 15:249–251, 2017.
- IDD: A dataset for exploring problems of autonomous navigation in unconstrained environments. In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pages 1743–1751. IEEE, 2019.
- Internimage: Exploring large-scale vision foundation models with deformable convolutions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14408–14419, 2023.
- Joint rain detection and removal from a single image with contextualized deep networks. IEEE transactions on pattern analysis and machine intelligence, 42(6):1377–1393, 2019.
- Bdd100k: A diverse driving dataset for heterogeneous multitask learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2636–2645, 2020.
- Wilddash - creating hazard-aware benchmarks. In Proceedings of the European Conference on Computer Vision (ECCV), September 2018.
- Unifying panoptic segmentation for autonomous driving. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21351–21360, 2022.
- Rainy wcity: A real rainfall dataset with diverse conditions for semantic driving scene understanding. In International Joint Conference on Artificial Intelligence, pages 1743–1749, 2022.
- Furqan Ahmed Shaik (2 papers)
- Abhishek Malreddy (1 paper)
- Nikhil Reddy Billa (1 paper)
- Kunal Chaudhary (3 papers)
- Sunny Manchanda (5 papers)
- Girish Varma (25 papers)