Class-Specific Multilinear Discriminant Analysis
- The paper introduces MCSDA, extending class-specific discriminant analysis to tensor data with mode-wise linear projections.
- It employs an alternating optimization strategy to update projection matrices for optimal in-class clustering and out-of-class separation.
- Empirical results demonstrate that MCSDA achieves faster computation and improved accuracy in applications like facial verification and stock prediction.
Multilinear Class-Specific Discriminant Analysis (MCSDA) is a tensor-based subspace learning technique that generalizes traditional class-specific discriminant analysis to high-order data. MCSDA maximizes the discrimination of each individual class in a feature space defined by low-dimensional multilinear projections, while maintaining the spatial and structural integrity of the original tensor representations. The method is formulated as an iterative optimization that alternately updates mode-wise projection matrices to achieve optimal class separation, with applications demonstrated in facial image verification and stock price prediction (Tran et al., 2017).
1. Problem Formulation and Objective
MCSDA addresses the challenge of transferring discriminant analysis techniques from vectorized data to high-order tensors. Given labeled samples with modes, the task is to learn, for each class , a set of mode-specific linear projection matrices satisfying .
Samples of class are treated as the positive class; all others are negative. The mean tensor for each class is . Each tensor is projected via
with the goal that in-class projected tensors cluster tightly around the projected mean, while out-of-class samples are maximally separated. This one-versus-rest construction yields a set of class-specific models, each optimized for its respective class in the tensor feature space.
2. Multilinear Class-Specific Scatter and Optimization Criterion
For a fixed class , MCSDA defines the following distances:
- The in-class (within-class) distance
- The out-of-class distance
These are rewritten in the trace-ratio form:
where aggregates in-class scatter and aggregates out-of-class scatter, with orthonormality constraints on each . The objective is to maximize for each class, ensuring that the projected out-of-class variance is maximized relative to the in-class variance.
3. Alternating Mode-Wise Optimization Algorithm
Due to the coupling of all mode-wise matrices in , optimization is performed via an alternating mode-wise strategy. At each iteration, for a given mode , two scatter matrices are computed after projecting along all modes except , followed by mode- unfolding:
- In-class scatter:
- Out-of-class scatter:
The optimization subproblem for each mode reduces to:
This is solved by computing the leading generalized eigenvectors of . The process iterates across all modes until convergence, sequentially updating the projections for each mode.
4. Structural Preservation and Computational Properties
MCSDA preserves the intrinsic spatial and structural information of tensor data by avoiding vectorization. Alternate projections along each tensor mode retain multilinear dependencies, essential for modeling high-order structures seen in images (spatial spatial) or time series (features time). The parameter count is reduced from a multiplicative scale (product of mode sizes in vectorization) to an additive one (sum of ). This reduction mitigates the curse of dimensionality and avoids small-sample-size pitfalls inherent in conventional vector approaches.
5. Empirical Evaluation
MCSDA's utility is demonstrated in two domains:
- Face verification (ORL, CMU-PIE): Each image is a tensor. Competing methods include vector-based CSDA, multilinear LDA (MDA), and enriched variants with HOG features. The metric is mean Average Precision (mAP) across class splits and train/test ratios. Findings show that MCSDA is $10$– faster than CSDA with only a slight mAP drop, consistently outperforms MDA, and benefits from feature enrichment when data are abundant.
- Stock price prediction (FI-2010 limit-order-book): Each sample is a tensor. Baselines include linear ridge regression, single-layer neural nets, BoF, neural BoF, CSDA, and MDA. With metrics such as per-class F1 and overall accuracy, MCSDA achieves the highest average F1 (), surpassing even deep bag-of-words variants, effectively capturing both feature and temporal multilinear structures.
6. Algorithmic Outline
The MCSDA learning procedure for a chosen class proceeds as follows:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
Algorithm MCSDA(Class c)
Input:
• Training tensors {X_j, l_j}
• Target subspace sizes I_1' × ... × I_K'
• Max iterations τ, threshold ε
Initialization:
For each mode n, set U_c^{(n)} ← all-ones or random orthonormal
For t = 1 to τ:
For n = 1 to K:
1. Compute in-class scatter S_I^{n(c)} via mode-n unfolding
2. Compute out-of-class scatter S_O^{n(c)} likewise
3. Solve S_O^{n(c)} u = λ S_I^{n(c)} u
and set U_c^{(n)} to top I_n' eigenvectors
End For
If sum_{n=1}^K || U_c^{(n)}(t) U_c^{(n)}(t–1)^T – I ||_F ≤ ε: stop
Output: {U_c^{(n)}}
Repeat for each class c to obtain class-specific projections. |