Learnable Manifold Alignment (LeMA) is a semi-supervised cross-modality framework that integrates HS and MS data for enhanced land cover and land use classification.
It employs a joint optimization strategy that simultaneously learns a data-driven graph, a shared projection subspace, and a classifier to capture intrinsic data geometry.
Experiments on remote sensing datasets show LeMA achieving 5–10% improvements in overall accuracy and kappa coefficient over conventional methods.
Learnable Manifold Alignment (LeMA) is a semi-supervised cross-modality learning framework designed to exploit limited highly-discriminative hyperspectral (HS) data and abundant poorly-discriminative multispectral (MS) data for land cover and land use classification. Unlike prior manifold alignment techniques that rely on fixed Gaussian-kernel graphs, LeMA jointly learns a data-driven graph structure with the projection and classification parameters, enabling more effective cross-modality knowledge transfer and improved decision boundaries in the MS domain (Hong et al., 2019).
1. Problem Formulation
LeMA addresses cross-modality semi-supervised classification with the following structure:
Let N paired, labeled HS and MS samples be given:
XH​∈RdH​×N: hyperspectral feature matrix.
XM​∈RdM​×N: multispectral feature matrix.
Y∈{0,1}c×N: one-hot class label matrix for c classes.
Let NU​ unlabeled MS samples be provided: XU​∈RdM​×NU​.
The objective is to use the small set of labeled HS samples, in combination with the large set of unlabeled MS samples, to induce a classifier with good generalization on MS data.
Joint data matrices are constructed for paired and semi-supervised settings:
Extended data with unlabeled MS: X′=[XH​​0​0 0​XM​​XU​​]∈R(dH​+dM​)×(2N+NU​).
The framework seeks a shared XH​∈RdH​×N0-dimensional subspace, XH​∈RdH​×N1, via XH​∈RdH​×N2 with orthogonality XH​∈RdH​×N3, such that projected features XH​∈RdH​×N4 are maximally informative for classification. A linear regression/classification matrix XH​∈RdH​×N5 is learned to map XH​∈RdH​×N6 to labels.
2. Objective Function and Mathematical Formulation
The LeMA optimization problem is defined as:
XH​∈RdH​×N7
Subject to:
XH​∈RdH​×N8
XH​∈RdH​×N9, XM​∈RdM​×N0
XM​∈RdM​×N1 (scale constraint)
XM​∈RdM​×N2 for XM​∈RdM​×N3 both labeled in class XM​∈RdM​×N4
Here, XM​∈RdM​×N5 denotes the XM​∈RdM​×N6-th column of XM​∈RdM​×N7 and XM​∈RdM​×N8. The graph Laplacian is XM​∈RdM​×N9 with Y∈{0,1}c×N0. The sum Y∈{0,1}c×N1 captures the smoothness constraint. Critically, Y∈{0,1}c×N2—the adjacency matrix defining manifold structure—is learned jointly with Y∈{0,1}c×N3 and Y∈{0,1}c×N4.
Orthogonality on Y∈{0,1}c×N5 prevents degenerate scaling, and constraints on Y∈{0,1}c×N6 ensure it is a valid similarity graph. An upper-bound Y∈{0,1}c×N7 for class Y∈{0,1}c×N8 enforces degree comparability with an LDA-like graph.
3. Graph-based Label Propagation
After learning Y∈{0,1}c×N9 and c0, labels can be propagated via the regularized objective:
c1
with closed-form solution:
c2
or iteratively via:
c3
where c4.
This label propagation further sharpens the decision boundaries by leveraging the learned data manifold.
with splits NU​2 and NU​3. Updates involve Lagrange multipliers and auxiliary variables; the NU​4-update uses thin SVD.
NU​5-update (ADMM):NU​6 is partitioned into block matrices; labeled–labeled blocks are set by LDA-like rules, while the cross-part (labeled–unlabeled and unlabeled–unlabeled) is optimized subject to symmetry, nonnegativity, bounds, and scale constraints. Multiple auxiliary splits and soft-thresholding/proximal operators are used.
Convergence: Monitored by relative change in global objective or ADMM residual norms. Recommended hyperparameters: NU​7–NU​8, NU​9, XU​∈RdM​×NU​0–XU​∈RdM​×NU​1, XU​∈RdM​×NU​2.
5. Inference and Decision Boundary Construction
After learning XU​∈RdM​×NU​3, XU​∈RdM​×NU​4, and XU​∈RdM​×NU​5, classification of a novel MS sample XU​∈RdM​×NU​6 proceeds via:
Project XU​∈RdM​×NU​7 into the common subspace: XU​∈RdM​×NU​8.
Predict class scores: XU​∈RdM​×NU​9; assign class X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N0.
Optional: Extend graph and perform label propagation for refined predictions.
The decision boundary in latent space X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N1 is defined where two coordinates of X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N2 are equal, reflecting the linearity of X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N3. In practice, X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N4 can also be input to off-the-shelf classifiers such as linear SVM or Canonical Correlation Forest (CCF) for performance comparison.
6. Experimental Setup and Results
LeMA was evaluated on:
University of Houston and Chikusei (HS X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N5 simulated Sentinel-2 MS),
DFC2018 MS-LiDAR & HS data.
Evaluation metrics included Overall Accuracy (OA), Average Accuracy (AA), and Cohen’s X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N6. Comparisons were performed against:
On both OA and X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N7, LeMA outperformed baselines by 5–10%. Key findings:
Small amounts of HS labels can reliably guide the larger MS domain.
Learning X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N8 from the data instead of using a fixed Gaussian kernel graph produces a better manifold.
Semi-supervised alignment plus label propagation yields the most accurate decision boundaries.
Both linear SVM and CCF performed well on features learned via LeMA (Hong et al., 2019).
7. Algorithm Summary
An overview of the LeMA algorithm flow is as follows:
Initialization: Set random orthonormal X=[XH​​0 0​XM​​]∈R(dH​+dM​)×2N9, zero Y=[Y,Y]∈Rc×2N0, LDA-like assignment of Y=[Y,Y]∈Rc×2N1 on labeled blocks, random initialization of optimization blocks for Y=[Y,Y]∈Rc×2N2.
Assemble X′=[XH​​0​0 0​XM​​XU​​]∈R(dH​+dM​)×(2N+NU​)1 and compute Laplacian X′=[XH​​0​0 0​XM​​XU​​]∈R(dH​+dM​)×(2N+NU​)2.
Compute objective and check for convergence.
Return: The final classifier X′=[XH​​0​0 0​XM​​XU​​]∈R(dH​+dM​)×(2N+NU​)3, projection X′=[XH​​0​0 0​XM​​XU​​]∈R(dH​+dM​)×(2N+NU​)4, and graph X′=[XH​​0​0 0​XM​​XU​​]∈R(dH​+dM​)×(2N+NU​)5.
All update rules, ADMM decompositions, and convergence criteria are reported explicitly in (Hong et al., 2019). This encapsulation enables full reproducibility and systematic extension of the LeMA methodology for cross-modality semi-supervised learning.
“Emergent Mind helps me see which AI papers have caught fire online.”
Philip
Creator, AI Explained on YouTube
Sign up for free to explore the frontiers of research
Discover trending papers, chat with arXiv, and track the latest research shaping the future of science and technology.Discover trending papers, chat with arXiv, and more.