Papers
Topics
Authors
Recent
2000 character limit reached

Gabor Surface Feature (GSF) for Face Recognition

Updated 14 December 2025
  • Gabor Surface Feature (GSF) is a face representation method that utilizes full local surface geometry from Gabor magnitude pictures, capturing magnitude, slope, and curvature for robust recognition.
  • The technique computes first and second derivatives using symmetric difference filters and applies median-based binarization to ensure uniformity and resilience against intensity variations.
  • Regional joint histogram aggregation combined with Ensemble of Piecewise FDA enables efficient dimensionality reduction and enhanced discriminative classification on benchmark datasets.

Gabor Surface Feature (GSF) is a face representation methodology that utilizes the full local surface geometry of Gabor magnitude pictures (GMPs). GSF distinguishes itself by encoding not only the magnitude information, but also the pixel-wise slope and curvature, thereby enabling richer, multi-scale, and multi-orientation characterization of facial structure critical for recognition tasks. The framework modularly applies Gabor filtering, computes local derivatives, binarizes these features, and then encodes them into compact codes, which are aggregated spatially for subspace-based classification.

1. Foundation: Gabor Magnitude Pictures

Gabor filters are defined for extraction across scales and orientations: gμ,ν(x,y)=kμ,ν2σ2exp(kμ,ν2(x2+y2)2σ2)[ejkμ,νT(x,y)eσ2/2]g_{\mu, \nu}(x, y) = \frac{\|k_{\mu, \nu}\|^2}{\sigma^2} \exp\left(-\frac{\|k_{\mu, \nu}\|^2(x^2 + y^2)}{2\sigma^2}\right) \left[e^{j k_{\mu, \nu}^T (x, y)} - e^{-\sigma^2/2}\right] with μ{0,,S1}\mu \in \{0, \ldots, S-1\} indexing scale and ν{0,,O1}\nu \in \{0, \ldots, O-1\} indexing orientation. The characteristic frequency vector kμ,νk_{\mu, \nu} is given by

kμ,ν=kmaxfμ(cos(νπ/O) sin(νπ/O))k_{\mu, \nu} = \frac{k_{\text{max}}}{f^\mu} \begin{pmatrix} \cos(\nu\pi/O) \ \sin(\nu\pi/O) \end{pmatrix}

where σ\sigma controls the Gaussian envelope and f>1f > 1 is the scale factor. The convolution of the preprocessed face image I(x,y)I(x, y) with gμ,νg_{\mu,\nu} results in complex responses, and GMPs are extracted as Gμ,ν(x,y)|G_{\mu, \nu}(x, y)|. Standard settings utilize O=8O=8 orientations and S=5S=5 scales, producing 40 GMPs per image (Rino, 2014).

2. Derivative Extraction: Local Slope and Curvature

GSF treats each GMP as a smooth surface G(x,y)G(x, y) and computes discrete first-order derivatives (slope) using symmetric difference filters: Gx(i,j)=G(i,j+1)G(i,j1),Gy(i,j)=G(i+1,j)G(i1,j)G_x(i, j) = G(i, j+1) - G(i, j-1), \quad G_y(i, j) = G(i+1, j) - G(i-1, j) Second-order derivatives (curvature/convexity) are subsequently calculated: Gxx(i,j)=Gx(i,j+1)Gx(i,j1),Gyy(i,j)=Gy(i,j+1)Gy(i,j1)G_{xx}(i, j) = G_x(i, j+1) - G_x(i, j-1), \quad G_{yy}(i, j) = G_y(i, j+1) - G_y(i, j-1)

Gxy(i,j)=Gx(i+1,j)Gx(i1,j)G_{xy}(i, j) = G_x(i+1, j) - G_x(i-1, j)

This methodology captures both local spatial gradient and shape convexity—information not leveraged by prior methods limited to gradient encoding.

3. Binarization and Feature Encoding

A median-based binarization scheme is independently applied to magnitudes, first, and second derivatives for each GMP: TG=median{G(i,j)},TGx=median{Gx(i,j)},T_G = \text{median}\{G(i, j)\}, \quad T_{G_x} = \text{median}\{G_x(i, j)\}, \ldots Binary maps are formed such that

B(i,j)=1 if G(i,j)TG, else 0B(i, j) = 1 \text{ if } G(i, j) \geq T_G, \text{ else } 0

For convexity, the sum C(i,j)=Gxx(i,j)+Gyy(i,j)C(i,j) = G_{xx}(i,j) + G_{yy}(i,j) is binarized in the same fashion: B2(i,j)=1 if C(i,j)median{C}, else 0B_2(i, j) = 1 \text{ if } C(i, j) \geq \text{median}\{C\}, \text{ else } 0 This step ensures uniformity and robustness to intensity variation—a documented strength of the approach.

Each pixel’s binary triplet (or quadruplet, depending on variant) is translated to an integer code by logical weighting. Notable variants:

  • GSF1 (magnitude + 1st + 2nd derivatives): F(i,j)=23B(i,j)+22Bx(i,j)+21By(i,j)+20B2(i,j)F(i,j) = 2^3 B(i,j) + 2^2 B_x(i,j) + 2^1 B_y(i,j) + 2^0 B_2(i,j) (16 codes).
  • GSF2 (only 1st + 2nd derivatives): F(i,j)=23B2(i,j)+22By(i,j)+21Bx(i,j)+20Byy(i,j)F(i,j) = 2^3 B_2(i,j) + 2^2 B_y(i,j) + 2^1 B_x(i,j) + 2^0 B_{yy}(i,j) (16 codes).
  • Three-bit version: F3(i,j)=22B(i,j)+21Bx(i,j)+20By(i,j)F_3(i,j) = 2^2 B(i,j) + 2^1 B_x(i,j) + 2^0 B_y(i,j) (8 codes).

4. Regional Joint Histogram Aggregation

All 40 code maps per face are segmented into M×NM \times N non-overlapping grid regions, and each is further split into SS sub-regions. For each region jj and each sub-region ss, histograms Hj,sH_{j,s} over the code values ($0$ to L1L-1, L=16L=16) are constructed. The collection for region jj becomes a vector XjX_j of length 40×S×L40 \times S \times L. The composite feature for the image is the concatenated set {X1,,XMN}\{X_1,\ldots,X_{MN}\}.

Variant Bit composition Number of codes
GSF1 Magnitude + GxG_x + GyG_y + B2B_2 16
GSF2 B2B_2 + GyG_y + GxG_x + ByyB_{yy} 16
3-bit Magnitude + GxG_x + GyG_y 8

Joint histograms capture fine spatial and geometric details across orientations and scales, facilitating discriminative representation.

5. Subspace Classification via Ensemble of Piecewise FDA

Classification utilizes Ensemble of Piecewise FDA (EPFDA), operating regionwise:

  • Training vectors XjX_j for all gallery faces yield within-class scatter SwjS_w^j and between-class scatter SbjS_b^j.
  • The generalized eigenproblem Sbjw=λSwjwS_b^j w = \lambda S_w^j w is solved, with the top RR eigenvectors WjR(40SL)×RW_j \in \mathbb{R}^{(40 S L) \times R} retained.
  • Region features are projected: Fj=WjTXjF_j = W_j^T X_j.
  • Similarities between query and gallery face regions are quantified using the cosine metric, sj=cos(Fj,Fj)s_j = \cos(F_j, F_j').
  • The fused score is computed as Score=j=1MNwjsj\text{Score} = \sum_{j=1}^{MN} w_j s_j, where wjw_j is optionally set to a region’s standalone accuracy.

6. Experimental Protocols and Performance

Experiments on FERET, ORL, and FRGC-1.0.4 databases used rank-1 recognition rate as the principal metric. Preprocessing for illumination normalization was optionally applied through gamma correction, Difference of Gaussians, and contrast equalization.

On FERET (no preprocessing), results for probe sets fafb/fafc/dup I/dup II:

  • GSF1: 99.2 / 95.9 / 80.0 / 56.4%
  • GSF2: 99.3 / 93.8 / 84.9 / 69.2%
  • GSF2 + weighting: 99.3 / 94.3 / 84.6 / 71.4%

With illumination preprocessing and region weighting:

  • IP + GSF1 + W: 99.6 / 99.5 / 94.0 / 91.5%
  • IP + GSF2 + W: 99.4 / 97.9 / 93.9 / 90.2%

On ORL, GSF1 achieved 97.3%, compared to 97.0% (LGBP+EPFDA) and 95.7% (GFC). On FRGC-1.0.4:

  • GSF2: 94.9%; IP + GSF1: 97.2% (LGBP+EPFDA: 96.4%; GFC: 76.8%).

7. Discussion and Conceptual Implications

Treating GMPs as continuous surfaces allows GSF to exploit both local magnitude, slope, and curvature, improving over methods such as LGBP (which encode only gradient information). Median-based binarization imparts code uniformity and resilience against intensity scaling. Joint histograms across grid regions provide nuanced spatial statistics at multiple scales and orientations. EPFDA reduces feature dimensionality while selectively enhancing class separation per region. Region weighting further magnifies the contribution of highly discriminative facial areas to overall classification.

A plausible implication is that GSF’s physically interpretable encoding and compact binarization facilitate robust and efficient face recognition, with demonstrated superiority on benchmark datasets over prior magnitude-only or gradient-only Gabor-based methods (Rino, 2014).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)
1.

Whiteboard

Follow Topic

Get notified by email when new papers are published related to Gabor Surface Feature (GSF).