Gabor Surface Feature (GSF) for Face Recognition
- Gabor Surface Feature (GSF) is a face representation method that utilizes full local surface geometry from Gabor magnitude pictures, capturing magnitude, slope, and curvature for robust recognition.
- The technique computes first and second derivatives using symmetric difference filters and applies median-based binarization to ensure uniformity and resilience against intensity variations.
- Regional joint histogram aggregation combined with Ensemble of Piecewise FDA enables efficient dimensionality reduction and enhanced discriminative classification on benchmark datasets.
Gabor Surface Feature (GSF) is a face representation methodology that utilizes the full local surface geometry of Gabor magnitude pictures (GMPs). GSF distinguishes itself by encoding not only the magnitude information, but also the pixel-wise slope and curvature, thereby enabling richer, multi-scale, and multi-orientation characterization of facial structure critical for recognition tasks. The framework modularly applies Gabor filtering, computes local derivatives, binarizes these features, and then encodes them into compact codes, which are aggregated spatially for subspace-based classification.
1. Foundation: Gabor Magnitude Pictures
Gabor filters are defined for extraction across scales and orientations: with indexing scale and indexing orientation. The characteristic frequency vector is given by
where controls the Gaussian envelope and is the scale factor. The convolution of the preprocessed face image with results in complex responses, and GMPs are extracted as . Standard settings utilize orientations and scales, producing 40 GMPs per image (Rino, 2014).
2. Derivative Extraction: Local Slope and Curvature
GSF treats each GMP as a smooth surface and computes discrete first-order derivatives (slope) using symmetric difference filters: Second-order derivatives (curvature/convexity) are subsequently calculated:
This methodology captures both local spatial gradient and shape convexity—information not leveraged by prior methods limited to gradient encoding.
3. Binarization and Feature Encoding
A median-based binarization scheme is independently applied to magnitudes, first, and second derivatives for each GMP: Binary maps are formed such that
For convexity, the sum is binarized in the same fashion: This step ensures uniformity and robustness to intensity variation—a documented strength of the approach.
Each pixel’s binary triplet (or quadruplet, depending on variant) is translated to an integer code by logical weighting. Notable variants:
- GSF1 (magnitude + 1st + 2nd derivatives): (16 codes).
- GSF2 (only 1st + 2nd derivatives): (16 codes).
- Three-bit version: (8 codes).
4. Regional Joint Histogram Aggregation
All 40 code maps per face are segmented into non-overlapping grid regions, and each is further split into sub-regions. For each region and each sub-region , histograms over the code values ($0$ to , ) are constructed. The collection for region becomes a vector of length . The composite feature for the image is the concatenated set .
| Variant | Bit composition | Number of codes |
|---|---|---|
| GSF1 | Magnitude + + + | 16 |
| GSF2 | + + + | 16 |
| 3-bit | Magnitude + + | 8 |
Joint histograms capture fine spatial and geometric details across orientations and scales, facilitating discriminative representation.
5. Subspace Classification via Ensemble of Piecewise FDA
Classification utilizes Ensemble of Piecewise FDA (EPFDA), operating regionwise:
- Training vectors for all gallery faces yield within-class scatter and between-class scatter .
- The generalized eigenproblem is solved, with the top eigenvectors retained.
- Region features are projected: .
- Similarities between query and gallery face regions are quantified using the cosine metric, .
- The fused score is computed as , where is optionally set to a region’s standalone accuracy.
6. Experimental Protocols and Performance
Experiments on FERET, ORL, and FRGC-1.0.4 databases used rank-1 recognition rate as the principal metric. Preprocessing for illumination normalization was optionally applied through gamma correction, Difference of Gaussians, and contrast equalization.
On FERET (no preprocessing), results for probe sets fafb/fafc/dup I/dup II:
- GSF1: 99.2 / 95.9 / 80.0 / 56.4%
- GSF2: 99.3 / 93.8 / 84.9 / 69.2%
- GSF2 + weighting: 99.3 / 94.3 / 84.6 / 71.4%
With illumination preprocessing and region weighting:
- IP + GSF1 + W: 99.6 / 99.5 / 94.0 / 91.5%
- IP + GSF2 + W: 99.4 / 97.9 / 93.9 / 90.2%
On ORL, GSF1 achieved 97.3%, compared to 97.0% (LGBP+EPFDA) and 95.7% (GFC). On FRGC-1.0.4:
- GSF2: 94.9%; IP + GSF1: 97.2% (LGBP+EPFDA: 96.4%; GFC: 76.8%).
7. Discussion and Conceptual Implications
Treating GMPs as continuous surfaces allows GSF to exploit both local magnitude, slope, and curvature, improving over methods such as LGBP (which encode only gradient information). Median-based binarization imparts code uniformity and resilience against intensity scaling. Joint histograms across grid regions provide nuanced spatial statistics at multiple scales and orientations. EPFDA reduces feature dimensionality while selectively enhancing class separation per region. Region weighting further magnifies the contribution of highly discriminative facial areas to overall classification.
A plausible implication is that GSF’s physically interpretable encoding and compact binarization facilitate robust and efficient face recognition, with demonstrated superiority on benchmark datasets over prior magnitude-only or gradient-only Gabor-based methods (Rino, 2014).