Papers
Topics
Authors
Recent
Search
2000 character limit reached

Light Field Reconstruction Using Shearlet Transform

Published 29 Sep 2015 in cs.CV | (1509.08969v1)

Abstract: In this article we develop an image based rendering technique based on light field reconstruction from a limited set of perspective views acquired by cameras. Our approach utilizes sparse representation of epipolar-plane images in a directionally sensitive transform domain, obtained by an adapted discrete shearlet transform. The used iterative thresholding algorithm provides high-quality reconstruction results for relatively big disparities between neighboring views. The generated densely sampled light field of a given 3D scene is thus suitable for all applications which requires light field reconstruction. The proposed algorithm is compared favorably against state of the art depth image based rendering techniques.

Citations (169)

Summary

  • The paper presents a novel method for reconstructing densely sampled light fields from a limited number of views by utilizing the sparse representation properties of the shearlet transform.
  • Numerical results indicate that the shearlet-based reconstruction outperforms benchmark methods like DIBR in PSNR and visual quality, effectively handling data from wide baseline cameras.
  • This technique offers high-quality light field reconstruction applicable to immersive technologies including virtual reality, holographic displays, and advanced digital refocusing in light field photography.

Light Field Reconstruction Using Shearlet Transform

The paper presents an advanced technique for reconstructing light fields (LF) using the shearlet transform. The authors introduce an image-based rendering method aimed at obtaining densely sampled light fields from a limited number of multiview images acquired through spatially distributed cameras. The core methodology of this paper harnesses the properties of shearlets in representing sparse epipolar-plane images (EPIs).

Overview and Methodology

The paper describes two primary approaches to view synthesis: depth image-based rendering (DIBR) and plenoptic function approximation. While DIBR depends on depth estimation to generate novel views, the plenoptic approach integrates light field sampling and reconstruction without depth information, treating each view pixel as a sample within a multi-dimensional LF function.

The authors advocate for LF reconstruction using shearlets, a technique surpassing wavelets in sparse approximation performance, particularly for cartoon-like images. Shearlets exhibit better directional sensitivity, crucial for representing LF EPIs characterized by distinct geometric lines corresponding to scene depth.

The presented algorithm decouples LF reconstruction into a two-stage process utilizing sparse shearlet-domain representation. First, it adapts and modifies shearlet transform, focusing on compact support and tight frequency distribution. Secondly, it employs an iterative thresholding approach with adaptive acceleration that effectively reconstructs densely sampled LFs from high-disparity input views.

Numerical Evaluation and Results

The efficacy of this approach is demonstrated across a range of datasets with varying complexities and depth disparities. The shearlet-based reconstruction method consistently performs better than the DIBR benchmark algorithm (DERS+VSRS) in terms of PSNR and visual quality, accounting for minor artifacts seen in specific datasets owing to circular convolution implementation.

Furthermore, the method shows superiority over existing methods like those utilizing sparse Fourier transforms and depth layering approaches, especially when dealing with Lambertian scenes. Moreover, it supports applications requiring full-parallax LF and enables digital refocusing capabilities with reduced ghosting effects, indicative of robust interpolation.

Implications and Future Directions

The demonstrated high-quality results imply significant potential for implementing this LF reconstruction technique in advanced immersive applications such as holographic displays, virtual reality systems, and digital refocusing in LF photography. The work notably extends to setups with wide baseline cameras, achieving smooth transition effects across views.

Theoretical predictions suggest applicability upgrades, potentially accommodating non-Lambertian scene reconstruction as a future step by adjusting the bases for reconstruction to fit different frequency domain areas.

In summary, this paper contributes a substantive and well-grounded improvement in LF reconstruction methodologies. It sets a foundation for further exploration into transforming diverse viewing experiences with applications spread across various visual processing domains.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.