Deep Learning for Hyperspectral Image Classification: An Overview (1910.12861v1)

Published 26 Oct 2019 in eess.IV and cs.CV

Abstract: Hyperspectral image (HSI) classification has become a hot topic in the field of remote sensing. In general, the complex characteristics of hyperspectral data make the accurate classification of such data challenging for traditional machine learning methods. In addition, hyperspectral imaging often deals with an inherently nonlinear relation between the captured spectral information and the corresponding materials. In recent years, deep learning has been recognized as a powerful feature-extraction tool to effectively address nonlinear problems and widely used in a number of image processing tasks. Motivated by those successful applications, deep learning has also been introduced to classify HSIs and demonstrated good performance. This survey paper presents a systematic review of deep learning-based HSI classification literatures and compares several strategies for this topic. Specifically, we first summarize the main challenges of HSI classification which cannot be effectively overcome by traditional machine learning methods, and also introduce the advantages of deep learning to handle these problems. Then, we build a framework which divides the corresponding works into spectral-feature networks, spatial-feature networks, and spectral-spatial-feature networks to systematically review the recent achievements in deep learning-based HSI classification. In addition, considering the fact that available training samples in the remote sensing field are usually very limited and training deep networks require a large number of samples, we include some strategies to improve classification performance, which can provide some guidelines for future studies on this topic. Finally, several representative deep learning-based classification methods are conducted on real HSIs in our experiments.

Citations (1,138)

View on Semantic Scholar

Summary

The paper systematically categorizes deep learning approaches into spectral, spatial, and spectral-spatial networks to enhance HSI classification accuracy.
It demonstrates that deep networks automatically learn complex, nonlinear features from raw hyperspectral data, reducing the need for handcrafted extraction.
Techniques like data augmentation, transfer learning, and unsupervised learning effectively mitigate the challenge of limited labeled samples in HSI classification.

Deep Learning for Hyperspectral Image Classification: An Expert Overview

In the domain of remote sensing, hyperspectral image (HSI) classification represents a substantive research area, primarily due to the complex nature of hyperspectral data. Traditional machine learning methods often struggle with the high dimensionality and nonlinear relationships characteristic of HSIs. In response, the advent of deep learning has provided significant advancements in overcoming these challenges. This paper by Shutao Li et al. presents a systematic survey of deep learning methodologies applied to HSI classification, emphasizing the advantages of these approaches and categorizing them into distinct frameworks for intellectual clarity.

Challenges in HSI Classification

HSI classification is confronted with two main issues: the large spatial variability of spectral signatures and the mismatch between high dimensionality and limited training samples. Factors such as varying illumination, atmospheric conditions, and sensor noise introduce variability, complicating the classification task. Additionally, despite the high-dimensional nature of HSIs, the available annotated data is generally sparse, posing a fundamental challenge for traditional classification methods that heavily rely on large training datasets.

Traditional Methods and Limitations

Initial efforts focused primarily on the spectral characteristics of each pixel, utilizing classifiers such as neural networks, support vector machines (SVMs), and various dimension-reduction techniques like principal component analysis (PCA) and linear discriminant analysis (LDA) to handle the high dimensionality. However, these methods largely overlooked the spatial context, leading to suboptimal classification performance. Subsequent research incorporated spatial information through methods such as extended morphological profiles (EMPs) and edge-preserving filtering (EPF), but these approaches often relied on hand-crafted feature extraction, limiting their adaptability and generalization capacity.

Deep Learning in HSI Classification

Deep learning offers a robust alternative by automatically learning complex, nonlinear features from raw data without the need for manual feature engineering. The paper categorizes deep learning-based HSI classification into spectral-feature networks, spatial-feature networks, and spectral-spatial-feature networks.

Spectral-Feature Networks:
- These networks focus exclusively on the spectral information of HSIs. Methods such as stacked auto-encoders (SAEs), deep belief networks (DBNs), and 1-D convolutional neural networks (CNNs) leverage spectral data to learn discriminative features. While these approaches yield better performance compared to traditional methods, they often ignore spatial information, which is critical for accurate classification.
Spatial-Feature Networks:
- Recognizing the importance of spatial context, these networks first reduce dimensionality via techniques like PCA and then apply 2-D CNNs to focus on spatial features. In some advanced frameworks, methodologies such as sparse representation-based CNNs integrate spatial detail more effectively. By emphasizing spatial context, these approaches improve classification performance but might still fall short in capturing spectral-spatial interactions fully.
Spectral-Spatial-Feature Networks:
- These networks aim to concomitantly exploit spectral and spatial features, generally categorized into preprocessing-based, integrated, and postprocessing-based networks. Preprocessing-based networks combine low-level spectral and spatial features before deep processing. Integrated networks, often using 3-D CNNs, extract features directly from the hyperspectral cubes. Postprocessing-based networks involve separate networks for spectral and spatial features, later fusing the outputs. Notably, architectures like feature fusion networks and generative adversarial networks (GANs) exemplify the efficacy of this integrated approach in producing high-accuracy classification results.

Addressing Limited Training Samples

One of the most pressing challenges in leveraging deep learning for HSI classification is the paucity of labeled training samples. The paper discusses several strategies:

Data Augmentation: Transformations and mixtures of existing samples can synthetically enrich the training dataset.
Transfer Learning: Pre-trained models on similar datasets can be fine-tuned on target HSI data, effectively mitigating the scarcity of labeled data.
Unsupervised/Semi-supervised Learning: Techniques like unsupervised feature extraction or semi-supervised learning can exploit large unlabeled datasets, enhancing feature representation without extensive labeled data.

Numerical Results and Comparisons

The paper includes comprehensive experimental results comparing traditional methods (e.g., SVM, EMP, JSR, EPF) with state-of-the-art deep learning approaches like 3D-CNN, Gabor-CNN, CNN-PPF, S-CNN, 3D-GAN, and DFFN across multiple HSIs. Deep learning methods consistently outperform traditional methods, particularly those leveraging both spectral and spatial information.

Future Directions

Theoretical improvements and practical implementations of deep learning for HSI classification are ongoing. Future research will likely focus on more efficient architecture designs, advanced training methods to better utilize limited labels, and cross-domain applications of learned features. As deep learning techniques continuously evolve, their application to HSI classification is poised to become even more robust and accurate.

Conclusion

Deep learning offers transformative potential for HSI classification, addressing the intricate challenges posed by high dimensionality and limited labeled data. By systematically exploiting spectral and spatial features, deep learning-based approaches substantively enhance classification accuracy and generalization capability, presenting substantial advancements over traditional methodologies.

PDF Markdown