Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane (2312.15253v1)
Abstract: Intraoperative imaging techniques for reconstructing deformable tissues in vivo are pivotal for advanced surgical systems. Existing methods either compromise on rendering quality or are excessively computationally intensive, often demanding dozens of hours to perform, which significantly hinders their practical application. In this paper, we introduce Fast Orthogonal Plane (Forplane), a novel, efficient framework based on neural radiance fields (NeRF) for the reconstruction of deformable tissues. We conceptualize surgical procedures as 4D volumes, and break them down into static and dynamic fields comprised of orthogonal neural planes. This factorization iscretizes the four-dimensional space, leading to a decreased memory usage and faster optimization. A spatiotemporal importance sampling scheme is introduced to improve performance in regions with tool occlusion as well as large motions and accelerate training. An efficient ray marching method is applied to skip sampling among empty regions, significantly improving inference speed. Forplane accommodates both binocular and monocular endoscopy videos, demonstrating its extensive applicability and flexibility. Our experiments, carried out on two in vivo datasets, the EndoNeRF and Hamlyn datasets, demonstrate the effectiveness of our framework. In all cases, Forplane substantially accelerates both the optimization process (by over 100 times) and the inference process (by over 15 times) while maintaining or even improving the quality across a variety of non-rigid deformations. This significant performance improvement promises to be a valuable asset for future intraoperative surgical applications. The code of our project is now available at https://github.com/Loping151/ForPlane.
- Flip: A difference evaluator for alternating images. PACMCGIT, 2020.
- Zoedepth: Zero-shot transfer by combining relative and metric depth. arXiv preprint arXiv:2302.12288, 2023.
- The impact of 3d digital reconstruction on the surgical planning of partial nephrectomy: A case-control study. still time for a novel surgical trend? Clinical Genitourinary Cancer, 18(6):e669–e678, 2020.
- Hexplane: A fast representation for dynamic scenes. In CVPR, 2023.
- Tensorf: Tensorial radiance fields. In ECCV, 2022.
- Mednerf: Medical neural radiance fields for reconstructing 3d-aware ct-projections from a single x-ray. In EMBC, 2022.
- Neural radiance flow for 4d view synthesis and video processing. In ICCV, 2021.
- Fast dynamic radiance fields with time-aware neural voxels. In SIGGRAPH Asia, 2022.
- Curvature-enhanced implicit function network for high-quality tooth model generation from cbct images. In MICCAI, 2022.
- K-planes: Explicit radiance fields in space, time, and appearance. In CVPR, 2023.
- Plenoxels: Radiance fields without neural networks. In CVPR, 2022.
- Surfelwarp: Efficient non-volumetric single view dynamic reconstruction, 2019.
- Implicit neural representations for medical imaging segmentation. In MICCAI, 2022.
- Segment anything. arXiv preprint arXiv:2304.02643, 2023.
- Virtual reality in surgical training. Surgical Oncology Clinics of North America, 9(1):61–79, 2000. Surgical Techniques and Outcomes.
- 3d ultrasound spine imaging with application of neural radiance field method. In IUS, 2021.
- Nerfacc: A general nerf acceleration toolbox. arXiv preprint arXiv:2210.04847, 2022.
- Super: A surgical perception framework for endoscopic tissue manipulation with surgical robotics. RA-L, 2020.
- Neural scene flow fields for space-time view synthesis of dynamic scenes. In CVPR, 2021.
- Integrating artificial intelligence and augmented reality in robotic surgery: An initial dvrk study using a surgical education scenario. In 2022 International Symposium on Medical Robotics (ISMR), pages 1–8, 2022.
- Robotic surgery remote mentoring via ar with 3d scene streaming and hand interaction, 2022.
- E-dssr: efficient dynamic surgical scene reconstruction with transformer-based stereoscopic depth perception. In MICCAI, 2021.
- Three-dimensional optical reconstruction of vocal fold kinematics using high-speed video with a laser projection system. TMI, 2015.
- Live tracking and dense reconstruction for handheld monocular endoscopy. TMI, 2018.
- Comparative validation of single-shot optical techniques for laparoscopic 3-d surface reconstruction. TMI, 2014.
- Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 2021.
- Implicit neural representation in medical imaging: A comparative survey. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 2381–2391, 2023.
- Three-dimensional tissue deformation recovery and tracking. IEEE Signal Processing Magazine, 2010.
- Instant neural graphics primitives with a multiresolution hash encoding. ToG, 2022.
- Neural importance sampling. ToG, 2019.
- Regnerf: Regularizing neural radiance fields for view synthesis from sparse inputs. In CVPR, 2022.
- Nerfies: Deformable neural radiance fields. In CVPR, 2021.
- D-nerf: Neural radiance fields for dynamic scenes. In CVPR, 2021.
- Deep implicit statistical shape models for 3d medical image delineation. In AAAI, 2022.
- Endo-depth-and-motion: reconstruction and tracking in endoscopic videos using depth networks and photometric constraints. RA-L, 2021.
- Dynamic ct reconstruction from limited views with implicit neural representations and parametric motion fields, 2021.
- Neat: Neural adaptive tomography. TOG, 2022.
- Three-dimensional digital reconstruction of renal model to guide preoperative planning of robot-assisted partial nephrectomy. International Journal of Urology, 26(9):931–932, 2019.
- Fast graph refinement and implicit neural representation for tissue tracking. In 2022 International Conference on Robotics and Automation (ICRA), pages 1281–1288. IEEE, 2022.
- Recurrent implicit neural graph for deformable tracking in endoscopic videos. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 478–488. Springer, 2022.
- Tracking and mapping in medical computer vision: A review, 2023.
- 3d reconstruction of human laryngeal dynamics based on endoscopic high-speed recordings. TMI, 2016.
- Dynamic reconstruction of deformable soft-tissue with stereo scope in minimal invasive surgery. RA-L, 2017.
- Soft-tissue motion tracking and structure estimation for robotic assisted mis procedures. In MICCAI, 2005.
- Coil: Coordinate-based internal learning for tomographic imaging. IEEE Transactions on Computational Imaging, 2021.
- Neural rendering for stereo 3d reconstruction of deformable tissues in robotic surgery. In MICCAI, 2022.
- Image quality assessment: From error visibility to structural similarity. TIP, 2004.
- Neural fields in visual computing and beyond. In Computer Graphics Forum. Wiley Online Library, 2022.
- Nesvor: Implicit neural representation for slice-to-volume reconstruction in mri. IEEE Transactions on Medical Imaging, 42(6):1707–1719, 2023.
- Nerfvs: Neural radiance fields for free view synthesis via geometry scaffolds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16549–16558, 2023.
- Neural lerplane representations for fast 4d reconstruction of deformable tissues. MICCAI, 2023.
- Implicitatlas: learning deformable shape templates in medical imaging. In CVPR, 2022.
- The unreasonable effectiveness of deep features as a perceptual metric. In CVPR, 2018.
- Real-time dense reconstruction of tissue surface from stereo optical video. TMI, 2019.
- Emdq-slam: Real-time high-resolution reconstruction of soft tissue surface from stereo laparoscopy videos. In MICCAI, 2021.
- Tiavox: Time-aware attenuation voxels for sparse-view 4d dsa reconstruction, 2023.