- The paper recasts FWI as a recurrent neural network, leveraging deep learning tools to achieve faster convergence on a 2D dataset.
- It employs reverse-mode automatic differentiation in TensorFlow to simplify gradient computation compared to the traditional adjoint-state method, despite increased memory use.
- Empirical results demonstrate that the Adam optimizer with minibatch training reduces the computation required while improving the accuracy of the recovered subsurface model.
Seismic Full-Waveform Inversion Using Deep Learning Tools and Techniques
The paper by Alan Richardson explores the integration of deep learning methodologies into seismic full-waveform inversion (FWI), presenting an approach that implements FWI as a recurrent neural network (RNN) within a deep learning framework. Recasting the FWI algorithm in this form lets it inherit the advanced optimizers and computational tools built for training neural networks, demonstrating a direct convergence of seismic processing and deep learning.
Integration of Deep Learning and Conventional FWI
Full-waveform inversion is fundamentally a nonlinear optimization problem aimed at constructing subsurface models from seismic data. The conventional approach relies on iterative inversion using the adjoint-state method to compute gradients, which typically demands substantial computational resources. The paper refactors the FWI algorithm into the structure of a recurrent neural network: each time step of the finite-difference wave propagation applies the same update rule to the wavefield, so the forward model maps naturally onto an RNN whose shared weights are the velocity model (see the sketch below). This framing enables the use of deep learning platforms such as TensorFlow, which offer considerable benefits in development speed and computational efficiency.
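To make the mapping concrete, the following is a minimal sketch (not the paper's code) of a 1D constant-density acoustic propagator written with TensorFlow 2.x: the time-stepping loop plays the role of the RNN, the velocity model is the shared trainable parameter, and `src_ix`/`rec_ix` are arbitrary placeholder grid indices.

```python
import tensorflow as tf

def fwi_forward(velocity, source, dx, dt, num_steps,
                src_ix=10, rec_ix=-10):
    """Forward-model a 1D constant-density acoustic wavefield by explicit
    time stepping. Each loop iteration plays the role of one RNN cell whose
    shared weights are the velocity model `velocity`."""
    source = tf.convert_to_tensor(source, tf.float32)
    nx = velocity.shape[0]
    prev = tf.zeros([nx], tf.float32)   # wavefield at time step t-1
    curr = tf.zeros([nx], tf.float32)   # wavefield at time step t
    c2 = (velocity * (dt / dx)) ** 2    # squared Courant number, per cell
    trace = []                          # samples recorded at one receiver
    for step in range(num_steps):
        # 3-point Laplacian stencil (periodic edges, for brevity only)
        lap = tf.roll(curr, -1, axis=0) - 2.0 * curr + tf.roll(curr, 1, axis=0)
        nxt = 2.0 * curr - prev + c2 * lap
        # inject the source wavelet at one grid cell
        nxt = tf.tensor_scatter_nd_add(nxt, [[src_ix]],
                                       [source[step] * dt ** 2])
        prev, curr = curr, nxt
        trace.append(curr[rec_ix])      # record the receiver sample
    return tf.stack(trace)
```

The periodic boundaries via `tf.roll` are a simplification; a practical propagator would use absorbing boundaries and a 2D stencil, but the RNN structure of the loop is unchanged.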
A key element of the paper is the use of the Adam optimizer, which handles the non-convex FWI objective more effectively than plain Stochastic Gradient Descent (SGD) or L-BFGS-B, particularly when applied to minibatches of shots. Within this deep learning framework, Adam significantly accelerates convergence toward the true wave speed model, as demonstrated on a 2D dataset.
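A hedged sketch of such a training loop, assuming the `fwi_forward` propagator above and hypothetical `sources` and `observed_data` arrays; the learning rate, batch size, and epoch count are placeholders to be tuned, not values from the paper.

```python
import numpy as np
import tensorflow as tf

def adam_minibatch_inversion(velocity_init, sources, observed_data,
                             dx, dt, num_steps, batch_size=4, epochs=20):
    """Invert for velocity with Adam, drawing a random minibatch of shots
    per update rather than propagating every shot at every iteration."""
    velocity = tf.Variable(velocity_init, dtype=tf.float32)
    optimizer = tf.keras.optimizers.Adam(learning_rate=50.0)  # placeholder step size
    num_shots = len(sources)
    for epoch in range(epochs):
        shot_order = np.random.permutation(num_shots)
        for start in range(0, num_shots, batch_size):
            batch = shot_order[start:start + batch_size]
            with tf.GradientTape() as tape:
                loss = 0.0
                for s in batch:
                    synthetic = fwi_forward(velocity, sources[s], dx, dt, num_steps)
                    observed = tf.convert_to_tensor(observed_data[s], tf.float32)
                    loss += tf.reduce_sum((synthetic - observed) ** 2)
            grad = tape.gradient(loss, velocity)
            optimizer.apply_gradients([(grad, velocity)])
    return velocity
```

Each Adam step propagates only the shots in the current minibatch, which is what allows many cheap updates per pass through the data.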
Automatic Differentiation vs. Adjoint State Method
The use of reverse-mode automatic differentiation within the TensorFlow environment simplifies gradient computation compared to a manually implemented adjoint-state method. While both approaches yield comparable gradients, automatic differentiation reduces development complexity. However, it introduces substantial memory overhead, because the intermediate wavefields of the forward pass must be stored, which can become untenable for large-scale applications. Despite this, automatic differentiation provides flexibility that makes it easier to experiment with different model configurations and hyperparameters.
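The paper verifies the automatically differentiated gradient against the adjoint-state result; as a self-contained stand-in (not the paper's test), the sketch below compares the `tf.GradientTape` gradient at one velocity cell with a central finite-difference estimate, reusing the hypothetical `fwi_forward` propagator.

```python
import tensorflow as tf

def misfit(velocity, source, observed, dx, dt, num_steps):
    """Sum-of-squares data misfit for a single shot."""
    synthetic = fwi_forward(velocity, source, dx, dt, num_steps)
    observed = tf.convert_to_tensor(observed, tf.float32)
    return tf.reduce_sum((synthetic - observed) ** 2)

def check_gradient(velocity_np, source, observed, dx, dt, num_steps,
                   cell=50, eps=0.1):
    """Compare the reverse-mode autodiff gradient with a finite-difference
    estimate at one velocity cell (a stand-in for the adjoint-state check)."""
    velocity = tf.Variable(velocity_np, dtype=tf.float32)
    with tf.GradientTape() as tape:
        loss = misfit(velocity, source, observed, dx, dt, num_steps)
    ad_grad = tape.gradient(loss, velocity)[cell]

    # perturb one velocity cell by +/- eps and difference the misfit
    v_plus, v_minus = velocity_np.copy(), velocity_np.copy()
    v_plus[cell] += eps
    v_minus[cell] -= eps
    fd_grad = (misfit(tf.constant(v_plus, tf.float32), source, observed,
                      dx, dt, num_steps)
               - misfit(tf.constant(v_minus, tf.float32), source, observed,
                        dx, dt, num_steps)) / (2.0 * eps)
    return float(ad_grad), float(fd_grad)
```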
Empirical Evaluations
In a comparative analysis, Richardson demonstrates the superior convergence rate of the Adam optimizer with minibatch training relative to the other techniques. Notably, Adam reaches a near-optimal solution faster than the L-BFGS-B algorithm, which processes the entire dataset at each step. The empirical results show that Adam with carefully tuned hyperparameters produces models closely resembling the true subsurface structure with less computation (fewer wave propagations), underscoring the potential for substantial efficiency gains in seismic inversion.
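For contrast with the minibatch loop above, a full-batch L-BFGS-B inversion might be wired up as sketched below, using SciPy's `minimize` and the hypothetical `misfit` helper; every iteration propagates every shot, which is precisely the cost the minibatch approach avoids.

```python
import numpy as np
from scipy.optimize import minimize
import tensorflow as tf

def lbfgs_inversion(velocity_init, sources, observed_data, dx, dt, num_steps,
                    maxiter=100):
    """Full-batch L-BFGS-B: every iteration sums the misfit over every shot."""
    def loss_and_grad(v_flat):
        velocity = tf.Variable(v_flat.astype(np.float32))
        with tf.GradientTape() as tape:
            loss = tf.add_n([misfit(velocity, sources[s], observed_data[s],
                                    dx, dt, num_steps)
                             for s in range(len(sources))])
        grad = tape.gradient(loss, velocity)
        # SciPy expects float64 values
        return float(loss), grad.numpy().astype(np.float64)

    result = minimize(loss_and_grad, velocity_init.astype(np.float64),
                      jac=True, method="L-BFGS-B",
                      options={"maxiter": maxiter})
    return result.x  # recovered velocity model
```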
Discussion of Implications
The implications of this work are multifaceted. On a practical level, optimizing FWI with deep learning tools mitigates the computational burdens traditionally associated with seismic inversion, enabling faster turnaround times and potentially reducing costs. Theoretically, the integration promotes cross-pollination between the two fields, encouraging further innovation at the intersection of geophysics and machine learning.
The successes reported in this paper could pave the way for broader adoption of deep learning techniques in geophysical applications, extending beyond subsurface modeling to tasks such as real-time seismic data processing. Future developments could see more sophisticated neural network architectures, such as CNNs or generative models, applied to further refine inversion accuracy and computational performance.
Conclusion
This paper aligns seismic inversion technology with modern computational approaches, showing that deep learning techniques can enhance, and perhaps transform, traditional FWI methodologies. Richardson's findings support the premise that seismic algorithms can be expressed and optimized within deep learning frameworks, potentially yielding significant gains in processing efficiency and accuracy. While challenges remain, especially around memory and resource management in large-scale scenarios, the potential upsides offer exciting prospects for future research and application in AI-driven geophysical exploration.