Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ShapeAug: Occlusion Augmentation for Event Camera Data (2401.02274v1)

Published 4 Jan 2024 in cs.CV

Abstract: Recently, Dynamic Vision Sensors (DVSs) sparked a lot of interest due to their inherent advantages over conventional RGB cameras. These advantages include a low latency, a high dynamic range and a low energy consumption. Nevertheless, the processing of DVS data using Deep Learning (DL) methods remains a challenge, particularly since the availability of event training data is still limited. This leads to a need for event data augmentation techniques in order to improve accuracy as well as to avoid over-fitting on the training data. Another challenge especially in real world automotive applications is occlusion, meaning one object is hindering the view onto the object behind it. In this paper, we present a novel event data augmentation approach, which addresses this problem by introducing synthetic events for randomly moving objects in a scene. We test our method on multiple DVS classification datasets, resulting in an relative improvement of up to 6.5 % in top1-accuracy. Moreover, we apply our augmentation technique on the real world Gen1 Automotive Event Dataset for object detection, where we especially improve the detection of pedestrians by up to 5 %.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (30)
  1. A low power, fully event-based gesture recognition system. In Conference on Computer Vision and Pattern Recognition (CVPR).
  2. Object detection with spiking neural networks on automotive event data. In International Joint Conference on Neural Networks (IJCNN).
  3. A large scale event-based detection dataset for automotive. arXiv preprint arXiv:2001.08499.
  4. Imagenet: A large-scale hierarchical image database. In Conference on Computer Vision and Pattern Recognition (CVPR).
  5. Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552.
  6. Incorporating learnable membrane time constant to enhance learning of spiking neural networks. In International Conference on Computer Vision (ICCV).
  7. Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. In Conference on Computer Vision and Pattern Recognition Workshop (CVPRW).
  8. Occlusions for effective data augmentation in image classification. In International Conference on Computer Vision Workshop (ICCVW).
  9. Recurrent vision transformers for object detection with event cameras. In Conference on Computer Vision and Pattern Recognition (CVPR).
  10. Spiking Neuron Models: Single Neurons, Populations, Plasticity. Cambridge University Press.
  11. Eventdrop: data augmentation for event-based learning. In International Joint Conferences on Artificial Intelligence (IIJCAI).
  12. Identity mappings in deep residual networks. In European Conference on Computer Vision (ECCV).
  13. Augment your batch: better training with larger batches. In arXiv.
  14. Densely connected convolutional networks. In Conference on Computer Vision and Pattern Recognition (CVPR).
  15. Intra-clip aggregation for video person re-identification. In International Conference on Image Processing (ICIP).
  16. Krizhevsky, A. (2012). Learning multiple layers of features from tiny images. University of Toronto.
  17. CIFAR10-DVS: An event-stream dataset for object classification. Frontiers in Neuroscience.
  18. Asynchronous spatio-temporal memory network for continuous event-based object detection. Transactions on Image Processing.
  19. Neuromorphic data augmentation for training spiking neural networks. In European Conference on Computer Vision (ECCV).
  20. Focal loss for dense object detection. Transactions on Pattern Analysis and Machine Intelligence (PAMI).
  21. SSD: Single shot MultiBox detector. In European Conference on Computer Vision (ECCV).
  22. Decoupled weight decay regularization. In International Conference on Learning Representations (ICLR).
  23. Converting static image datasets to spiking neuromorphic datasets using saccades. Frontiers in Neuroscience.
  24. Learning to detect objects with a 1 megapixel event camera. In Neural Information Processing Systems (NeurIPS).
  25. Eventmix: An efficient data augmentation strategy for event-based learning. Information Sciences.
  26. Hide-and-seek: Forcing a network to be meticulous for weakly-supervised object and action localization. In International Conference on Computer Vision (ICCV).
  27. Hats: Histograms of averaged time surfaces for robust event-based object classification. In Conference on Computer Vision and Pattern Recognition (CVPR).
  28. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research (JMLR).
  29. CutMix: Regularization strategy to train strong classifiers with localizable features. In International Conference on Computer Vision (ICCV).
  30. Random erasing data augmentation. In AAAI conference on artificial intelligence.
Citations (6)

Summary

We haven't generated a summary for this paper yet.