Papers
Topics
Authors
Recent
Search
2000 character limit reached

Distance-based ICC (dbICC) Overview

Updated 24 May 2026
  • Distance-based ICC (dbICC) is a reliability metric defined using arbitrary pairwise distances to quantify variance between and within subjects in non-scalar data.
  • It employs between-subject and within-subject mean squared distances with bootstrap bias correction to improve empirical coverage in high-dimensional settings.
  • The framework generalizes classical ICC via an extended Spearman–Brown formula, making it applicable to vectors, curves, graphs, and fMRI connectivity matrices.

The distance-based intraclass correlation coefficient (dbICC) is a generalization of the classical intraclass correlation coefficient (ICC), extending measurement reliability to settings where observations are non-scalar or the measurement space admits no natural ICC. Defined in terms of arbitrary distances between observations, dbICC enables reliability assessment for data types such as vectors, curves, graphs, and covariance matrices. The framework provides a well-defined proportion of variance attributable to between-subject differences versus within-subject variability, operationalized entirely via pairwise distances. Bias correction procedures and theoretical extensions, such as a generalized Spearman–Brown formula, provide robust inference and study-planning tools for complex, high-dimensional, or structured measurement data (Xu et al., 2019).

1. Formal Definition and Sample Estimation

Let II denote the number of subjects, JiJ_i the number of repeated measurements for subject ii, and XijX_{ij} the jjth observation from subject ii. For an arbitrary distance function d(,)d(\cdot, \cdot) on the observation space, two principal quantities underpin dbICC:

  • Between-subject mean squared distance:

MSDb=Ei1i2,j1,j2[d(Xi1j1,Xi2j2)2]MSD_b = \mathbb{E}_{i_1 \neq i_2, j_1, j_2}[\,d(X_{i_1j_1}, X_{i_2j_2})^2\,]

  • Within-subject mean squared distance:

MSDw=Ei,j1j2[d(Xij1,Xij2)2]MSD_w = \mathbb{E}_{i, j_1 \neq j_2}[\,d(X_{ij_1}, X_{ij_2})^2\,]

The distance-based intraclass correlation coefficient is then formulated as:

ρ=1MSDwMSDb\rho = 1 - \frac{MSD_w}{MSD_b}

This ratio quantifies the proportion of total distance-based "variance" due to subject-level signal, analogous to variance components in classical settings. For empirical data, the expectations are replaced by averages over all appropriate index pairs:

JiJ_i0

JiJ_i1

JiJ_i2

2. Bias Correction for Bootstrap Confidence Intervals

Estimation of dbICC confidence intervals is complicated by the unknown sampling distribution of JiJ_i3. Standard practice is to use a subject-level nonparametric bootstrap:

  • Resample subjects with replacement.
  • For each bootstrap sample, compute JiJ_i4 as in the sample formulas.

However, naive resampling can produce pseudo "between-subject" pairs where both indices correspond to the same original subject, leading to downward-biased estimates of JiJ_i5 and, consequently, overstated reliability. The bias correction consists of excluding any JiJ_i6 pair from the JiJ_i7 sum if JiJ_i8 in the JiJ_i9th bootstrap sample. ii0 within a subject needs no such correction. The resulting quantiles of the corrected ii1 form valid confidence intervals, substantially improving empirical coverage, especially in settings with small or moderate ii2 (Xu et al., 2019).

3. Generalized Spearman–Brown Formula

The dbICC admits an extended version of the Spearman–Brown formula for predicting the reliability of averaged measurements. Under the general "true-score+error" Hilbert space model ii3, with appropriate orthogonality and variance-type distance definitions, the multi-replication dbICC for ii4 independent repeats is

ii5

where ii6 and ii7 is the within-subject mean squared error across ii8 repetitions. In the classical scalar case, ii9 recovers the usual Spearman–Brown formula:

XijX_{ij}0

For vector or covariance-matrix data, the growth rate of reliability with XijX_{ij}1 is governed by the specific form of XijX_{ij}2; e.g., for XijX_{ij}3 IID multivariate measurements, XijX_{ij}4, yielding XijX_{ij}5 (Xu et al., 2019).

4. Simulation Studies

Xu, Reiss, and Cribben conducted extensive simulation experiments to evaluate point and interval estimation behavior for dbICC. In their scenarios, XijX_{ij}6, XijX_{ij}7, with varying signal-to-noise ratios XijX_{ij}8, sample sizes XijX_{ij}9, and jj0 repeats. True dbICC values jj1 ranged from 0.2 to 0.8.

Findings include:

  • The point estimator jj2 exhibits negative bias at small jj3 (e.g., median bias for jj4), diminishing with larger jj5.
  • Naive bootstrap confidence intervals tend to under-cover the nominal 95% level, especially for small jj6 (e.g., 86.0% coverage at jj7, jj8), whereas the bias-corrected bootstrap achieves substantially better accuracy (e.g., 90.8% coverage at jj9, ii0).
  • Coverage rates approach nominal as ii1 increases or ii2 increases (Xu et al., 2019).

5. Application: Test–Retest Reliability for fMRI-Derived Connectivity Matrices

To demonstrate dbICC in a high-dimensional, non-scalar context, Xu et al. analyzed test–retest data on resting-state brain functional connectivity derived from fMRI scans. The dataset comprised 25 healthy adults, each scanned twice, generating 333 × 333 ROI correlation matrices (NYU TRT dataset). Preprocessing included motion correction, spatial normalization, tissue segmentation, nuisance regression, spatial smoothing (FWHM 6mm), and band-pass filtering.

Three distance metrics on correlation matrices were evaluated:

  • ii3 (Frobenius) norm on flattened lower-triangular entries,
  • ii4 entrywise norm,
  • ii5, with ii6 the Pearson correlation between lower triangles ("correlation of correlations").

dbICC values (point with 95% bias-corrected bootstrap CI):

Region ii7 ii8 ii9
All 333 ROIs 0.378 (0.329, 0.424) 0.382 (0.335, 0.426) 0.382 (0.338, 0.426)
Default Mode (41) 0.488 (0.403, 0.562) 0.493 (0.404, 0.570) 0.487 (0.414, 0.555)
Visual Network (39) 0.434 (0.362, 0.508) 0.435 (0.354, 0.515) 0.451 (0.401, 0.500)

dbICC for the full set was lower than for subnetworks (Default Mode, Visual), and soft-thresholding did not further improve reliability (Xu et al., 2019).

6. Practical Recommendations and Methodological Guidance

  • Selection of d(,)d(\cdot, \cdot)0 should be tailored to the data: Euclidean, Frobenius, dynamic-time-warping, or correlation-based distances are appropriate for vectors, matrices, or time series, respectively.
  • Compute the full d(,)d(\cdot, \cdot)1 distance matrix, average over off-diagonal within-subject and between-subject blocks to obtain d(,)d(\cdot, \cdot)2 and d(,)d(\cdot, \cdot)3.
  • Employ subject-level nonparametric bootstrap, applying the bias correction for d(,)d(\cdot, \cdot)4 by excluding pairs representing the same individual in the resampled set.
  • Plan studies using the generalized Spearman–Brown relation: plot d(,)d(\cdot, \cdot)5 versus d(,)d(\cdot, \cdot)6 (measurement intensity) to estimate the requisite d(,)d(\cdot, \cdot)7 for a target reliability.
  • Perform sensitivity analyses to alternative distance choices, thresholding strategies, and sample size.
  • An implementation in R is available at https://github.com/wtagr/dbicc (Xu et al., 2019).

7. Context and Theoretical Significance

dbICC advances measurement reliability analysis to accommodate non-scalar, structured, and high-dimensional data realms, overcoming limitations of classical ICC. The approach is applicable wherever pairwise measurement dissimilarities are meaningful, including connectomics, time series, and functional data. The theoretical coherence with classical reliability models, extensibility to Hilbert spaces, and empirical validity underpin dbICC as a robust and unifying tool for generalized reliability quantification (Xu et al., 2019).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Distance-based ICC (dbICC).