
Degree of Contrast (DoC) Metric

Updated 22 February 2026
  • Degree of Contrast (DoC) is a generalized contrast energy metric based on center–surround filtering that quantifies spatial contrast structure in visual stimuli.
  • It extends classical RMS contrast by incorporating local spatial relationships through zero-sum Difference-of-Gaussians filters, ensuring mathematical robustness.
  • DoC links neuroanatomical parameters to psychophysical performance, offering practical insights for diagnosing contrast confounds and modeling visual crowding.

The Degree of Contrast (DoC), also referred to as the generalized contrast energy metric, is a quantitative framework for assessing the spatial contrast structure of visual stimuli, motivated by the receptive field architectures of early visual neurons. DoC extends classical measures such as RMS contrast by incorporating the local spatial relationships defined by center–surround filters in the retina, LGN, and primary visual cortex. The metric robustly predicts psychophysical outcomes of visual crowding, including classic and paradoxical effects across a broad range of tasks, and serves both as a diagnostic for contrast confounds and as a foundational component in models of visual perception (Rodriguez et al., 2020).

1. Theoretical Foundations: Center–Surround Filtering and Semi-Norm Structure

The mathematical grounding of DoC arises from the properties of center–surround receptive fields, classically modeled as a difference of Gaussians (DoG). A standard neuron’s response is modeled as a weighted sum of local luminances using two radially symmetric Gaussians—a narrow excitatory “center” and a broader inhibitory “surround”:

  • Foreground (center) mask:

$$\operatorname{MoF}(p, q) = \frac{1}{2\pi \sigma_F^2} \exp\left[-\frac{(p-x)^2 + (q-y)^2}{2\sigma_F^2}\right]$$

  • Background (surround) mask with $\sigma_B = k\sigma_F$ (empirically, $k \simeq 5$):

$$\operatorname{MoB}(p, q) = \frac{1}{2\pi \sigma_B^2} \exp\left[-\frac{(p-x)^2 + (q-y)^2}{2\sigma_B^2}\right]$$

  • DoG filter:

$$s(x, y; p, q) = \operatorname{MoF}(p, q) - \operatorname{MoB}(p, q)$$

Given an $H \times W$ image patch $i$, vectorized as $v = \operatorname{vec}(i)$, convolution with the DoG filter can be represented by a linear operator $J$ such that $Jv = \operatorname{vec}(s * i)$.

The generalized contrast energy is then defined as:

$$C(v) = v^T J^T J v$$

Because $J$ is constructed from a zero-sum DoG filter, $C(v) = \|Jv\|^2$ is non-negative and vanishes on constant (uniform-luminance) images; its square root $\sqrt{C(v)}$ is absolutely homogeneous and subadditive, and therefore defines a semi-norm on the image space.
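These properties can be checked numerically. The Python/NumPy sketch below (not from the original paper; the kernel size and $\sigma_F$ value are illustrative) builds a zero-sum DoG kernel and verifies that a uniform luminance shift contributes no contrast energy, while scaling the image scales the energy quadratically:

```python
import numpy as np
from scipy.signal import convolve2d

def dog_kernel(sigma_f, k=5.0):
    """Zero-sum difference-of-Gaussians kernel (narrow center minus broad surround)."""
    sigma_b = k * sigma_f
    half = int(np.ceil(3 * sigma_b))
    x, y = np.meshgrid(np.arange(-half, half + 1), np.arange(-half, half + 1))
    gauss = lambda s: np.exp(-(x**2 + y**2) / (2 * s**2)) / (2 * np.pi * s**2)
    h = gauss(sigma_f) - gauss(sigma_b)
    return h - h.mean()  # enforce an exactly zero sum on the truncated grid

def contrast_energy(img, h):
    """C(v) = ||Jv||^2: sum of squared DoG responses ('valid' avoids border effects)."""
    r = convolve2d(img, h, mode="valid")
    return float(np.sum(r**2))

rng = np.random.default_rng(0)
img = rng.uniform(0.0, 1.0, size=(64, 64))
h = dog_kernel(sigma_f=1.5)

c = contrast_energy(img, h)
print(np.isclose(contrast_energy(img + 0.3, h), c))        # True: shift-invariant
print(np.isclose(contrast_energy(2.0 * img, h), 4.0 * c))  # True: degree-2 homogeneous
```

The `"valid"` convolution mode is used so that every output pixel sees the full kernel; with zero padding (`"same"`), border pixels would pick up spurious energy from a constant image.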

2. Relation to Classical RMS Contrast

DoC subsumes and generalizes the concept of RMS (Root Mean Square) contrast. In the limiting case where the center mask is a Dirac delta and the surround mask is uniform (i.e., all pixel values equally weighted), the DoG simplifies so that:

$$Jv = v - \bar{v}\,\mathbf{1}$$

with $J^T J = I - \frac{1}{n}\mathbf{1}\mathbf{1}^T$ (for $n$ pixels). Thus,

$$C(v) = v^T\left(I - \frac{1}{n}\mathbf{1}\mathbf{1}^T\right)v = n \cdot \operatorname{Var}(v)$$

with $\operatorname{RMS}(v) = \sqrt{\frac{1}{n}C(v)}$, confirming that RMS contrast is a special (trivial) case of the generalized DoC metric.
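This reduction is easy to verify numerically. In the sketch below (Python/NumPy, illustrative), $J$ is the mean-centering operator $I - \frac{1}{n}\mathbf{1}\mathbf{1}^T$, and the resulting energy equals $n$ times the pixel variance:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 16
v = rng.uniform(0.0, 1.0, size=n)  # vectorized image patch

# Degenerate DoG: Dirac-delta center, uniform surround  ->  J = I - (1/n) 1 1^T
J = np.eye(n) - np.ones((n, n)) / n
C = v @ J.T @ J @ v                # C(v) = v^T J^T J v

print(np.isclose(C, n * np.var(v)))           # True: C(v) = n * Var(v)
print(np.isclose(np.sqrt(C / n), np.std(v)))  # True: RMS contrast as sqrt(C/n)
```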

3. Neuroanatomical Parameterization and Practical Computation

Parameters for the center–surround Gaussians are chosen to match anatomical constraints, specifically the dendritic field radii of midget ganglion cells that scale with eccentricity $e$ as established by Dacey & Petersen (1992). This gives:

  • $\sigma_F = f(e)$ for the center, with $f(e)$ from the empirical fit,
  • $\sigma_B = 5\sigma_F$ for the surround.

Practical computation of DoC for a grayscale patch $I$ with known eccentricity $e$ and viewing geometry is as follows:

  1. Calculate $\sigma_F$ in pixels: $\sigma_F = f(e) \cdot (\text{pixels/deg})$.
  2. Build $\operatorname{MoF}$ and $\operatorname{MoB}$ as Gaussians over $[-N, N]$, with $N = \lceil 3\sigma_B \rceil$.
  3. Compute the DoG filter $h = \operatorname{MoF} - \operatorname{MoB}$.
  4. Convolve $h$ with $I$: $R = \operatorname{conv2}(I, h, \text{'same'})$.
  5. Sum squared responses: $C = \sum_{x=1}^{H} \sum_{y=1}^{W} R(x, y)^2$.

Optionally, normalizing by $H \cdot W$ or taking the square root enables direct comparison with RMS contrast or device-invariant analyses.
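Steps 1–5 translate directly into a short NumPy sketch. Note that `f_eccentricity` below is a hypothetical linear stand-in for the empirical fit $f(e)$; the actual coefficients come from the Dacey & Petersen (1992) data:

```python
import numpy as np
from scipy.signal import convolve2d

def f_eccentricity(e_deg):
    """Hypothetical stand-in for the empirical center-size fit f(e), in degrees."""
    return 0.01 + 0.01 * e_deg

def degree_of_contrast(I, ecc_deg, pix_per_deg):
    sigma_f = f_eccentricity(ecc_deg) * pix_per_deg   # step 1: center SD in pixels
    sigma_b = 5.0 * sigma_f
    half = int(np.ceil(3 * sigma_b))                  # step 2: grid [-N, N]
    x, y = np.meshgrid(np.arange(-half, half + 1), np.arange(-half, half + 1))
    mof = np.exp(-(x**2 + y**2) / (2 * sigma_f**2)) / (2 * np.pi * sigma_f**2)
    mob = np.exp(-(x**2 + y**2) / (2 * sigma_b**2)) / (2 * np.pi * sigma_b**2)
    h = mof - mob                                     # step 3: DoG filter
    R = convolve2d(I, h, mode="same")                 # step 4: convolve
    return float(np.sum(R**2))                        # step 5: contrast energy

rng = np.random.default_rng(2)
patch = rng.uniform(0.0, 1.0, size=(48, 48))
C = degree_of_contrast(patch, ecc_deg=3.0, pix_per_deg=30.0)
print(C > 0)  # True for a noise patch
```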

4. Mapping DoC to Behavioral Performance in Crowding Tasks

The mapping from DoC to expected behavioral outcomes in crowding follows a parametric probabilistic model anchored in experimental psychophysics. For a given task:

  • $\mu_t$ is the DoC of the isolated target image.
  • A 10% reduction in $\mu_t$ maps to 1% correct identification.

Two Gaussian response curves are constructed:

  • Target:

$$G_t(x) = \exp\left(-\frac{(x - \mu_t)^2}{2\sigma_t^2}\right)$$

  • Pooling:

$$G_p(x) = \exp\left(-\frac{(x - \mu_t)^2}{2\sigma_p^2}\right)$$

with $\sigma_t = 0.1\mu_t/\sqrt{-2\ln 0.01}$ (obtained by solving $G_t(0.9\mu_t) = 0.01$, the 10%-reduction anchor above) and $\sigma_p = |E_x - \mu_t|/\sqrt{-2\ln 0.01}$, where $E_x$ is the empirical contrast at the crowding threshold. The predicted proportion correct is:

$$p(C) = G_t(C) + \left[1 - G_p(C)\right]$$

Optionally, $p(C)$ is multiplied by the empirical ceiling (typically $\approx 0.85$) for direct comparison to observed psychometric performance.
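The full mapping can be packaged as one function. The Python sketch below (illustrative values, not from the original paper) implements the two Gaussian curves and the ceiling; at $C = \mu_t$ the model returns exactly the ceiling, and at the crowding threshold $C = E_x$ the pooling term contributes $1 - G_p = 0.99$:

```python
import numpy as np

def predicted_correct(C, mu_t, Ex, ceiling=0.85):
    """Map a stimulus DoC value C to predicted proportion correct."""
    denom = np.sqrt(-2.0 * np.log(0.01))  # anchors both curves at the 1% level
    sigma_t = 0.1 * mu_t / denom          # 10% DoC drop -> G_t = 0.01
    sigma_p = abs(Ex - mu_t) / denom      # G_p(Ex) = 0.01
    g_t = np.exp(-(C - mu_t) ** 2 / (2 * sigma_t**2))
    g_p = np.exp(-(C - mu_t) ** 2 / (2 * sigma_p**2))
    return ceiling * (g_t + (1.0 - g_p))

mu_t, Ex = 1.0, 0.5
print(np.isclose(predicted_correct(mu_t, mu_t, Ex), 0.85))        # True: target alone
print(np.isclose(predicted_correct(Ex, mu_t, Ex), 0.85 * 0.99))   # True: at threshold
```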

5. Empirical Predictive Power for Classic and Paradoxical Crowding Effects

DoC has demonstrated quantitative accuracy in predicting outcomes from a variety of crowding experiments, including:

| Experiment / Authors | Key Stimulus Configuration | Predictive Result via DoC Model |
|---|---|---|
| Pelli & Tillman (2008); Freeman & Simoncelli (2011) | Letter "r" with "a" flankers; varying distance and $e$ | DoC reproduces crowding curves as a function of flanker separation |
| Flom, Weymouth & Kahneman (1963) | Landolt C with bar flankers at various gaps, $e = 3°$ | DoC predicts non-monotonic $p(d)$, mirroring the minimum at the critical distance |
| Manassi, Sayim & Herzog (2012), "uncrowding" | Vernier with 2, 4, 8, or 16 bar flankers, $e = 3.88°$ | DoC increases with more flankers and $p(C)$ improves, matching uncrowding |
| Harrison & Bex (2015); Pachai et al. (2016) | Landolt C with 0, 1, or 5 surrounding rings (gapped/solid) | A gap lowers DoC, producing more errors; more rings raise DoC, producing fewer errors |

The capacity of DoC to account both for standard (distance- and eccentricity-dependent) crowding effects and for counterintuitive “uncrowding” phenomena supports its role as a general explanatory framework for early-stage contrast-driven crowding (Rodriguez et al., 2020).

6. Implementation Recommendations and Pseudocode

Practical implementation can follow the MATLAB-style pseudocode below, which builds the DoG filter for the relevant eccentricity, convolves it with the image, and maps the resulting contrast energy to behavioral probabilities per the model described above:

function [C, p] = degree_of_contrast(I_patch, ecc_deg, Ex, pix_per_deg)
   % I_patch     : 2D luminance array
   % ecc_deg     : eccentricity in degrees
   % Ex          : empirical contrast at the crowding threshold
   % pix_per_deg : display scale in pixels per degree of visual angle
   sigma_F  = f_retina(ecc_deg) * pix_per_deg;  % center SD in pixels
   sigma_B  = 5 * sigma_F;                      % surround SD in pixels
   halfsize = ceil(3 * sigma_B);
   [X, Y] = meshgrid(-halfsize:halfsize, -halfsize:halfsize);
   MoF = exp(-(X.^2 + Y.^2) / (2*sigma_F^2)) / (2*pi*sigma_F^2);
   MoB = exp(-(X.^2 + Y.^2) / (2*sigma_B^2)) / (2*pi*sigma_B^2);
   h   = MoF - MoB;                             % zero-sum DoG filter
   R   = conv2(I_patch, h, 'same');
   C   = sum(R(:).^2);                          % generalized contrast energy
   mu_t    = C_target_alone;                    % DoC of the isolated target
   sigma_t = 0.1 * mu_t / sqrt(-2*log(0.01));
   sigma_p = abs(Ex - mu_t) / sqrt(-2*log(0.01));
   Gt = exp(-(C - mu_t)^2 / (2*sigma_t^2));
   Gp = exp(-(C - mu_t)^2 / (2*sigma_p^2));
   p  = Gt + (1 - Gp);
   p  = 0.85 * p;                               % empirical ceiling
end
In this context, $f_{\text{retina}}(\text{ecc\_deg})$ yields the retinal field width at the given eccentricity, and $C_{\text{target\_alone}}$ is the DoC for an isolated target image.

7. Scientific Significance and Implications

The DoC metric $C(v) = v^T J^T J v$ generalizes classical image contrast concepts by encoding the geometric and topological relationships induced by early center–surround processing. Its reduction to RMS contrast in degenerate cases ensures conceptual continuity, while its predictive validity across multiple psychophysical paradigms, including those previously considered paradoxical, indicates that much of visual crowding may be attributed to early-stage contrast computations. This suggests that higher-level grouping or pooling models, while potentially relevant for "configural" or "semantic" crowding cases, may not be required for the majority of basic crowding phenomena.

Available resources, including code and datasets for DoC benchmarking, can be found at https://github.com/DartmouthGrangerLab/Contrast/ (Rodriguez et al., 2020).
