ThermoNeRF: Joint RGB and Thermal Novel View Synthesis for Building Facades using Multimodal Neural Radiance Fields (2403.12154v2)
Abstract: Thermal scene reconstruction holds great potential for various applications, such as analyzing building energy consumption and performing non-destructive infrastructure testing. However, existing methods typically require dense scene measurements and often rely on RGB images for 3D geometry reconstruction, projecting thermal information post-reconstruction. This can lead to inconsistencies between the reconstructed geometry and temperature data and their actual values. To address this challenge, we propose ThermoNeRF, a novel multimodal approach based on Neural Radiance Fields that jointly renders new RGB and thermal views of a scene, and ThermoScenes, a dataset of paired RGB+thermal images comprising 8 scenes of building facades and 8 scenes of everyday objects. To address the lack of texture in thermal images, ThermoNeRF uses paired RGB and thermal images to learn scene density, while separate networks estimate color and temperature data. Unlike comparable studies, our focus is on temperature reconstruction and experimental results demonstrate that ThermoNeRF achieves an average mean absolute error of 1.13C and 0.41C for temperature estimation in buildings and other scenes, respectively, representing an improvement of over 50% compared to using concatenated RGB+thermal data as input to a standard NeRF. Code and dataset are available online.
- “Why are thermal images blurry”, 2023 eprint: 2307.15800
- “Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields” In ICCV, 2021
- “Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields” In CVPR, 2022
- “Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields” In ICCV, 2023
- Amanda Berg, Jörgen Ahlberg and Michael Felsberg “A thermal infrared dataset for evaluation of short-term tracking methods” In Swedish Symposium on image analysis, 2015
- Amanda Berg, Jörgen Ahlberg and Michael Felsberg “A thermal object tracking benchmark” In 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2015, pp. 1–6 IEEE
- Christopher Brooke “Thermal Imaging for the Archaeological Investigation of Historic Buildings” In Remote Sensing 10.9, 2018 DOI: 10.3390/rs10091401
- “3D Thermal Imaging System with Decoupled Acquisition for Industrial and Cultural Heritage Applications” Number: 3 Publisher: Multidisciplinary Digital Publishing Institute In Applied Sciences 10.3, 2020, pp. 828 DOI: 10.3390/app10030828
- Wenjie Chang, Yueyi Zhang and Zhiwei Xiong “Depth Estimation From Indoor Panoramas With Neural Scene Representation” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 899–908
- “Novel-view acoustic synthesis” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 6409–6419
- “3D Reconstruction from IR Thermal Images and Reprojective Evaluations” Publisher: Hindawi In Mathematical Problems in Engineering 2015, 2015, pp. e520534 DOI: 10.1155/2015/520534
- “Deep Thermal Imaging: Proximate Material Type Recognition in the Wild through Deep Learning of Spatial Surface Temperature Patterns” In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, CHI ’18 ACM, 2018 DOI: 10.1145/3173574.3173576
- “KAIST Multi-Spectral Day/Night Data Set for Autonomous and Assisted Driving” In IEEE Transactions on Intelligent Transportation Systems 19.3, 2018, pp. 934–948 DOI: 10.1109/TITS.2018.2791533
- “A Multi-spectral Dataset for Evaluating Motion Estimation Systems”, 2021 arXiv:2007.00622 [cs.CV]
- “Generation of 3D Thermal Models for the Analysis of Energy Efficiency in Buildings” In Advances in Design Engineering III Cham: Springer International Publishing, 2023, pp. 741–754
- “Depth-supervised NeRF: Fewer Views and Faster Training for Free” In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 12872–12881 URL: https://api.semanticscholar.org/CorpusID:235743051
- “Borrow from anywhere: Pseudo multi-modal object detection in thermal imagery” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019, pp. 0–0
- “C3I Thermal Automotive Dataset” IEEE Dataport, 2022 DOI: 10.21227/jf21-rt22
- Teledyne FLIR “FLIR Thermal Dataset for Algorithm Training” URL: https://www.flir.com/oem/adas/adas-dataset-form/
- “Flir Image Extractor CLI” In PyPI URL: https://pypi.org/project/flirimageextractor/
- “FLIR ONE Pro Thermal Imaging Camera for Smartphones | Teledyne FLIR” In FLIR ONE Pro Thermal Imaging Camera for Smartphones | Teledyne FLIR URL: https://www.flir.com/products/flir-one-pro/
- “Thermal cameras and applications: A survey” In Machine Vision and Applications 25, 2014, pp. 245–262 DOI: 10.1007/s00138-013-0570-5
- “Image Quality Metrics: PSNR vs. SSIM” In 2010 20th International Conference on Pattern Recognition, 2010, pp. 2366–2369 DOI: 10.1109/ICPR.2010.579
- “NeRF-RPN: A general framework for object detection in NeRFs” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 23528–23538
- “Thermal Radiation” In Encyclopedia of Wildfires and Wildland-Urban Interface (WUI) Fires Cham: Springer International Publishing, 2019, pp. 1–8 DOI: 10.1007/978-3-319-51727-8_78-1
- “Multispectral Pedestrian Detection: Benchmark Dataset and Baseline” In Integrated Comput.-Aided Eng. 20, 2015 DOI: 10.1109/CVPR.2015.7298706
- “Deep learning for infrared thermal image based machine health monitoring” In IEEE/ASME Transactions on Mechatronics 23.1 IEEE, 2017, pp. 151–159
- “Innovations in Building Diagnostics and Condition Monitoring: A Comprehensive Review of Infrared Thermography Applications” In Buildings 13.11, 2023 DOI: 10.3390/buildings13112829
- Marcin Kopaczka, Raphael Kolk and Dorit Merhof “A fully annotated thermal face database and its application for thermal facial expression recognition” In 2018 IEEE International Instrumentation and Measurement Technology Conference (I2MTC), 2018, pp. 1–6 DOI: 10.1109/I2MTC.2018.8409768
- Tadeusz Kruczek “Conditions for use of long-wave infrared camera to measure the temperature of the sky” In Energy 283, 2023, pp. 128466 DOI: 10.1016/j.energy.2023.128466
- “Energy efficiency studies through 3D laser scanning and thermographic technologies” In Energy and Buildings 43.6, 2011, pp. 1216–1221 DOI: 10.1016/j.enbuild.2010.12.031
- “ViViD++: Vision for visibility dataset” In IEEE Robotics and Automation Letters 7.3 IEEE, 2022, pp. 6282–6289
- “Spec-NeRF: Multi-spectral Neural Radiance Fields” arXiv:2310.12987 [cs, eess] arXiv, 2023 DOI: 10.48550/arXiv.2310.12987
- “Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 17386–17396
- “Deep learning thermal image translation for night vision perception” In ACM Transactions on Intelligent Systems and Technology (TIST) 12.1 ACM New York, NY, USA, 2020, pp. 1–18
- “Temporal and spatial deep learning network for infrared thermal defect detection” In Ndt & E International 108 Elsevier, 2019, pp. 102164
- “Infrared thermography in the built environment: A multi-scale review” In Renewable and Sustainable Energy Reviews 165, 2022, pp. 112540 DOI: 10.1016/j.rser.2022.112540
- “Nerf in the wild: Neural radiance fields for unconstrained photo collections” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 7210–7219
- “PHOTOGRAMMETRIC 3D BUILDING RECONSTRUCTION FROM THERMAL IMAGES” Conference Name: International Conference on Unmanned Aerial Vehicles in Geomatics (Volume IV-2/W3) - 4–7 September 2017, Bonn, Germany Publisher: Copernicus GmbH In ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences IV-2-W3, 2017, pp. 25–32 DOI: 10.5194/isprs-annals-IV-2-W3-25-2017
- “NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis” In ECCV, 2020
- “Instant Neural Graphics Primitives with a Multiresolution Hash Encoding” In ACM Trans. Graph. 41.4 New York, NY, USA: ACM, 2022, pp. 102:1–102:15 DOI: 10.1145/3528223.3530127
- Nobuyuki Otsu “A Threshold Selection Method from Gray-Level Histograms” In IEEE Transactions on Systems, Man, and Cybernetics 9.1, 1979, pp. 62–66 DOI: 10.1109/TSMC.1979.4310076
- Gunjan Parihar, Sumit Saha and Lalat Indu Giri “Application of infrared thermography for irrigation scheduling of horticulture plants” In Smart Agricultural Technology 1, 2021, pp. 100021 DOI: 10.1016/j.atech.2021.100021
- “Cross-Spectral Neural Radiance Fields” ISSN: 2475-7888 In 2022 International Conference on 3D Vision (3DV), 2022, pp. 606–616 DOI: 10.1109/3DV57658.2022.00071
- Amanda Ramón, Antonio Adán and Francisco Javier Castilla “Thermal point clouds of buildings: A review” In Energy and Buildings 274, 2022, pp. 112425 DOI: 10.1016/j.enbuild.2022.112425
- Johannes L. Schonberger and Jan-Michael Frahm “Structure-From-Motion Revisited” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
- “Pixelwise view selection for unstructured multi-view stereo” In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part III 14, 2016, pp. 501–518 Springer
- “Combining modern 3D reconstruction and thermal imaging: generation of large-scale 3D thermograms in real-time” Publisher: Taylor & Francis _eprint: https://doi.org/10.1080/17686733.2021.1991746 In Quantitative InfraRed Thermography Journal 19.5, 2022, pp. 295–311 DOI: 10.1080/17686733.2021.1991746
- “Automated thermal 3D reconstruction based on a robot equipped with uncalibrated infrared stereovision cameras” In Advanced Engineering Informatics 38, 2018, pp. 203–215 DOI: 10.1016/j.aei.2018.06.008
- “Nerfstudio: A Modular Framework for Neural Radiance Field Development” In Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Proceedings, SIGGRAPH ’23 ACM, 2023 DOI: 10.1145/3588432.3591516
- “NeRF-Supervised Deep Stereo” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 855–866
- Johan Vertens, Jannik Zürn and Wolfram Burgard “HeatNet: Bridging the Day-Night Domain Gap in Semantic Segmentation with Thermal Images” In arXiv preprint arXiv:2003.04645, 2020
- “HeatWave: A handheld 3D thermography system for energy auditing” In Energy and Buildings 66, 2013, pp. 445–460 DOI: 10.1016/j.enbuild.2013.07.030
- “Image Quality Assessment: From Error Visibility to Structural Similarity” In Image Processing, IEEE Transactions on 13, 2004, pp. 600–612 DOI: 10.1109/TIP.2003.819861
- “What is MSX?” In FLIR URL: https://www.flir.com/discover/professional-tools/what-is-msx/
- “NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection” In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 23320–23330
- “M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots” In IEEE Robotics and Automation Letters, 2021, pp. 1–1 DOI: 10.1109/LRA.2021.3138527
- “Plenoctrees for real-time rendering of neural radiance fields” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 5752–5761
- “NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields” In AAAI Conference on Artificial Intelligence (AAAI), 2024
- “The unreasonable effectiveness of deep features as a perceptual metric” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 586–595
- Cheng Zhao, Lei Zhang and Yu Zhang “All-sky longwave radiation modelling based on infrared images and machine learning” In Building and Environment 238, 2023, pp. 110369 DOI: 10.1016/j.buildenv.2023.110369
- “In-Place Scene Labelling and Understanding with Implicit Scene Representation” In ICCV, 2021
- “Multimodal Neural Radiance Field” In 2023 IEEE International Conference on Robotics and Automation (ICRA), 2023, pp. 9393–9399 DOI: 10.1109/ICRA48891.2023.10160388