Event-based Shape from Polarization with Spiking Neural Networks (2312.16071v1)
Abstract: Recent advances in event-based shape determination from polarization offer a transformative approach that tackles the trade-off between speed and accuracy in capturing surface geometries. In this paper, we investigate event-based shape from polarization using Spiking Neural Networks (SNNs), introducing the Single-Timestep and Multi-Timestep Spiking UNets for effective and efficient surface normal estimation. Specificially, the Single-Timestep model processes event-based shape as a non-temporal task, updating the membrane potential of each spiking neuron only once, thereby reducing computational and energy demands. In contrast, the Multi-Timestep model exploits temporal dynamics for enhanced data extraction. Extensive evaluations on synthetic and real-world datasets demonstrate that our models match the performance of state-of-the-art Artifical Neural Networks (ANNs) in estimating surface normals, with the added advantage of superior energy efficiency. Our work not only contributes to the advancement of SNNs in event-based sensing but also sets the stage for future explorations in optimizing SNN architectures, integrating multi-modal data, and scaling for applications on neuromorphic hardware.
- Poisson surface reconstruction. In Proceedings of the fourth Eurographics symposium on Geometry processing, volume 7, page 0, 2006.
- Niid-net: adapting surface normal knowledge for intrinsic image decomposition in indoor scenes. IEEE Transactions on Visualization and Computer Graphics, 26(12):3434–3445, 2020.
- Recovering surface normal and arbitrary images: A dual regression network for photometric stereo. IEEE Transactions on Image Processing, 30:3676–3690, 2021.
- Multimodal feature-based surface material classification. IEEE transactions on haptics, 10(2):226–239, 2016.
- Fast and accurate computation of surface normals from range images. In 2011 IEEE International Conference on Robotics and Automation, pages 3084–3091. IEEE, 2011.
- Jason Geng. Structured-light 3d surface imaging: a tutorial. Advances in optics and photonics, 3(2):128–160, 2011.
- Berthold KP Horn. Shape from shading: A method for obtaining the shape of a smooth opaque object from one view. 1970.
- Using shadows in finding surface orientations. Computer Vision, Graphics, and Image Processing, 22(1):145–176, 1983.
- Andrew P Witkin. Recovering surface shape and orientation from texture. Artificial intelligence, 17(1-3):17–45, 1981.
- Carlos Hernández. Stereo and silhouette fusion for 3d object modeling from uncalibrated images under circular motion. These de doctorat, École Nationale Supérieure des Télécommunications, 2, 2004.
- Robert J Woodham. Photometric method for determining surface orientation from multiple images. Optical engineering, 19(1):139–144, 1980.
- Reconstruction of specular surfaces using polarization imaging. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, volume 1, pages I–I. IEEE, 2001.
- Polarized 3d: High-quality depth sensing with polarization cues. In Proceedings of the IEEE International Conference on Computer Vision, pages 3370–3378, 2015.
- Constraining object features using a polarization reflectance model. Phys. Based Vis. Princ. Pract. Radiom, 1:167, 1993.
- Linear differential constraints for photo-polarimetric height estimation. In Proceedings of the IEEE international conference on computer vision, pages 2279–2287, 2017.
- Deep shape from polarization. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXIV 16, pages 554–571. Springer, 2020.
- Shape from polarization for complex scenes in the wild. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12632–12641, 2022.
- Depth sensing using geometrically constrained polarization normals. International Journal of Computer Vision, 125:34–51, 2017.
- High-sensitivity analysis of polarization by surface reflection. Machine Vision and Applications, 29:1171–1189, 2018.
- Lawrence B Wolff. Polarization vision: a new sensory approach to image understanding. Image and Vision computing, 15(2):81–93, 1997.
- Lucid vision phoenix polarization camera. https://thinklucid.com/product/phoenix-5-0-mp-polarized-model/. 2018.
- Event-based shape from polarization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1547–1556, 2023.
- Event-based vision: A survey. IEEE transactions on pattern analysis and machine intelligence, 2020.
- Towards spike-based machine intelligence with neuromorphic computing. Nature, 575(7784):607–617, 2019.
- Rectifier nonlinearities improve neural network acoustic models. In Proc. icml, volume 30, page 3. Citeseer, 2013.
- Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853, 2015.
- Fast and accurate deep network learning by exponential linear units (elus). arXiv preprint arXiv:1511.07289, 2015.
- Wulfram Gerstner. Time structure of the activity in neural network models. Physical review E, 51(1):738, 1995.
- Larry F Abbott. Lapicque’s introduction of the integrate-and-fire model neuron (1907). Brain research bulletin, 50(5-6):303–304, 1999.
- Spiking neuron models: Single neurons, populations, plasticity. Cambridge university press, 2002.
- Deep learning. nature, 521(7553):436–444, 2015.
- Spikformer: When spiking neural network meets transformer. arXiv preprint arXiv:2209.15425, 2022.
- Spiking transformers for event-based single object tracking. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pages 8801–8810, 2022.
- Spiking graph convolutional networks. arXiv preprint arXiv:2205.02767, 2022.
- Spikegpt: Generative pre-trained language model with spiking neural networks. arXiv preprint arXiv:2302.13939, 2023.
- U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015.
- William A Shurcliff. Polarized light: production and use. Harvard University Press, 1962.
- Edward Collett. Field guide to polarization. Spie Bellingham, WA, 2005.
- Recent progress in shape from polarization. Advances in Photometric 3D-Reconstruction, pages 177–203, 2020.
- Polarization-based transparent surface modeling from two views. In Proceedings Ninth IEEE International Conference on Computer Vision, pages 1381–1386. IEEE, 2003.
- Surface normal estimation of black specular objects from multiview polarization images. Optical Engineering, 56(4):041303–041303, 2017.
- Polarimetric dense monocular slam. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3857–3866, 2018.
- Depth from a polarisation+ rgb stereo pair. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7586–7595, 2019.
- Shape from polarization: a method for solving zenithal angle ambiguity. Optics letters, 37(20):4218–4220, 2012.
- Improved diffuse reflection models for computer vision. International Journal of Computer Vision, 30:55–71, 1998.
- Shape-from-polarisation: a nonlinear least squares approach. In Proceedings of the IEEE International Conference on Computer Vision Workshops, pages 2969–2976, 2017.
- Height-from-polarisation with unknown lighting or albedo. IEEE transactions on pattern analysis and machine intelligence, 41(12):2875–2888, 2018.
- N Justin Marshall. A unique colour and polarization vision system in mantis shrimps. Nature, 333(6173):557–560, 1988.
- Pdavis: Bio-inspired polarization event camera. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3962–3971, 2023.
- Unsupervised event-based learning of optical flow, depth, and egomotion. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 989–997, 2019.
- Ed Bullmore and Olaf Sporns. The economy of brain network organization. Nature Reviews Neuroscience, 13(5):336–349, 2012.
- Distributed hierarchical processing in the primate cerebral cortex. In Cereb cortex. Citeseer, 1991.
- A million spiking-neuron integrated circuit with a scalable communication network and interface. Science, 345(6197):668–673, 2014.
- Advancing neuromorphic computing with loihi: A survey of results and outlook. Proceedings of the IEEE, 109(5):911–934, 2021.
- Spiking deep convolutional neural networks for energy-efficient object recognition. International Journal of Computer Vision, 113(1):54–66, 2015.
- Going deeper in spiking neural networks: Vgg and residual architectures. Frontiers in neuroscience, 13:95, 2019.
- Spatio-temporal backpropagation for training high-performance spiking neural networks. Frontiers in neuroscience, 12:331, 2018.
- Lisnn: Improving spiking neural networks with lateral interactions for robust object recognition. In Christian Bessiere, editor, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20, pages 1519–1525. International Joint Conferences on Artificial Intelligence Organization, 7 2020. Main track.
- Stereospike: Depth learning with a spiking neural network. IEEE Access, 10:127428–127439, 2022.
- Mss-depthnet: Depth prediction with multi-step spiking neural network. arXiv preprint arXiv:2211.12156, 2022.
- Spiking nerf: Making bio-inspired neural networks see through the real world. arXiv preprint arXiv:2309.10987, 2023.
- High speed and high dynamic range video with an event camera. IEEE transactions on pattern analysis and machine intelligence, 43(6):1964–1980, 2019.
- Time lens++: Event-based frame interpolation with parametric non-linear flow and multi-scale fusion. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 17755–17764, 2022.
- Incorporating learnable membrane time constant to enhance learning of spiking neural networks. In Proceedings of the IEEE/CVF international conference on computer vision, pages 2661–2671, 2021.
- Deconvolution and checkerboard artifacts. Distill, 1(10):e3, 2016.
- Integrating non-spiking interneurons in spiking neural networks. Frontiers in neuroscience, 15:633945, 2021.
- Liaf-net: Leaky integrate and analog fire network for lightweight and efficient spatiotemporal information processing. IEEE Transactions on Neural Networks and Learning Systems, 33(11):6249–6262, 2021.
- Robert Hecht-Nielsen. Theory of the backpropagation neural network. In Neural networks for perception, pages 65–93. Elsevier, 1992.
- Paul J Werbos. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10):1550–1560, 1990.
- Surrogate gradient learning in spiking neural networks: Bringing the power of gradient-based optimization to spiking neural networks. IEEE Signal Processing Magazine, 36(6):51–63, 2019.
- Spikingjelly: An open-source machine learning infrastructure platform for spike-based intelligence. Science Advances, 9(40):eadi1480, 2023.
- Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32, 2019.
- Training deep spiking neural networks. arXiv preprint arXiv:2006.04436, 2020.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Mitsuba 3 renderer, 2022. https://mitsuba-renderer.org.
- Esim: an open event camera simulator. In Conference on robot learning, pages 969–982. PMLR, 2018.
- A 1280x720 back-illuminated stacked temporal contrast event-based vision sensor with 4.86 um pixels, 1.066 geps readout, programmable event-rate controller and compressive data-formatting pipeline. In IEEE International Solid-State Circuits Conference, 2020.
- Breakthrough photography x4 polarizer. https://breakthrough.photography/products/x4-circular-polarizer.
- Esl: Event-based structured light. In 2021 International Conference on 3D Vision (3DV), pages 1165–1174. IEEE, 2021.
- Direct method for shape recovery from polarization and shading. In 2012 19th IEEE International Conference on Image Processing, pages 1769–1772. IEEE, 2012.
- Exploiting spiking dynamics with spatial-temporal feature normalization in graph learning. In Zhi-Hua Zhou, editor, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, pages 3207–3213. International Joint Conferences on Artificial Intelligence Organization, 8 2021. Main Track.
- Spike-flownet: event-based optical flow estimation with energy-efficient hybrid neural networks. In European Conference on Computer Vision, pages 366–382. Springer, 2020.
- Mark Horowitz. 1.1 computing’s energy problem (and what we can do about it). In 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), pages 10–14. IEEE, 2014.