Papers
Topics
Authors
Recent
Search
2000 character limit reached

Experimental Unit Information Index (EUII)

Updated 25 November 2025
  • Experimental Unit Information Index (EUII) is a metric that quantifies evidentiary value per experimental unit by normalizing the diagnostic odds ratio in hypothesis testing.
  • It applies to both fixed-sample and adaptive designs, offering interpretation from frequentist and Bayesian perspectives to optimize study parameters.
  • Numerical and asymptotic analyses reveal that increasing power and employing early-stopping rules enhance EUII, leading to more efficient and ethically sound experiments.

The Experimental Unit Information Index (EUII) quantifies the evidentiary value contributed by a single experimental unit in the context of hypothesis testing. Designed to enable rigorous trade-offs between statistical power, Type I error, and sample size, the EUII provides a single, unit-normalized metric that characterizes the per-unit accumulation of evidentiary value in both fixed-sample and adaptive designs. It is interpretable from both frequentist and Bayesian perspectives and offers guidance for optimizing study designs, particularly in fields such as animal research where reduction in experimental units is ethically mandated (Held et al., 21 Nov 2025).

1. Definition in Fixed-Sample Designs

The EUII for a fixed-sample design derives directly from the diagnostic odds ratio (DOR), which combines the likelihood ratios for significant and non-significant outcomes under both null (H0H_0) and alternative (H1H_1) hypotheses. For a test with per-sample Type I error α=P0(reject H0)\alpha = P_0(\mathrm{reject}\ H_0) and power 1β=P1(reject H0)1-\beta = P_1(\mathrm{reject}\ H_0), define:

  • Positive likelihood ratio: LR+=Powerα\mathrm{LR}^+ = \frac{\mathrm{Power}}{\alpha}
  • Negative likelihood ratio: LR=1Power1α\mathrm{LR}^- = \frac{1-\mathrm{Power}}{1-\alpha}

The diagnostic odds ratio:

DOR=LR+LR=Power/α(1Power)/(1α)=Power/(1Power)α/(1α)\mathrm{DOR} = \frac{\mathrm{LR}^+}{\mathrm{LR}^-} = \frac{\mathrm{Power}/\alpha}{(1-\mathrm{Power})/(1-\alpha)} = \frac{\mathrm{Power}/(1-\mathrm{Power})}{\alpha/(1-\alpha)}

If nn independent units are used, each unit is attributed with the nnth root of the DOR:

EUIIn=DOR1/n=(Power/(1Power)α/(1α))1/n\mathrm{EUII}_n = \mathrm{DOR}^{1/n} = \left(\frac{\mathrm{Power}/(1-\mathrm{Power})}{\alpha/(1-\alpha)}\right)^{1/n}

A test is considered evidentially useful if EUIIn>1\mathrm{EUII}_n > 1. This metric is agnostic to the specific values of α\alpha, 1β1-\beta, or nn and is readily computed using design specifications.

2. Asymptotic Properties

The behavior of EUII as nn \to \infty elucidates its theoretical bounds. For a one-sided, one-sample zz-test with effect size δ>0\delta > 0:

Power=Φ(δnz1α)\mathrm{Power} = \Phi(\delta \sqrt{n} - z_{1-\alpha})

where Φ\Phi is the standard normal CDF. Setting xn=δnz1αx_n = \delta \sqrt{n} - z_{1-\alpha} yields:

Power Odds=Φ(xn)1Φ(xn)\mathrm{Power\ Odds} = \frac{\Phi(x_n)}{1 - \Phi(x_n)}

Since α/(1α)\alpha/(1-\alpha) is asymptotically constant, EUIIn\mathrm{EUII}_n converges as:

limnEUIIn=exp(δ22)\lim_{n \to \infty} \mathrm{EUII}_n = \exp\left(\frac{\delta^2}{2}\right)

For a two-sample zz-test of mean difference δ\delta, the corresponding limit is:

limnEUIIn=exp(δ28)\lim_{n \to \infty} \mathrm{EUII}_n = \exp\left(\frac{\delta^2}{8}\right)

This establishes that the per-unit informational gain exhibits asymptotic saturation, reflecting diminishing per-unit returns as sample size increases.

3. Interpretations: Frequentist and Bayesian Perspectives

Frequentist Interpretation: EUII represents the geometric mean increase in odds of a significant result under H1H_1 compared to H0H_0:

DOR=Power OddsT1E Odds\mathrm{DOR} = \frac{\mathrm{Power\ Odds}}{\mathrm{T1E\ Odds}}

Exponentiation by $1/n$ interprets EUII as the per-unit multiplicative increase in evidentiary odds.

Bayesian Interpretation: By Bayes’ theorem, observing a significant (or non-significant) result modifies the posterior odds for H1H_1:

  • Significant: Odds(H1sig)=LR+Odds(H1)\mathrm{Odds}(H_1 | \text{sig}) = \mathrm{LR}^+ \cdot \mathrm{Odds}(H_1)
  • Non-significant: Odds(H1nonsig)=LROdds(H1)\mathrm{Odds}(H_1 | \text{nonsig}) = \mathrm{LR}^- \cdot \mathrm{Odds}(H_1)

The DOR quantifies the ratio of posterior odds between significant and non-significant outcomes. Therefore, EUIIn\mathrm{EUII}_n is the per-unit geometric average change in Bayes-factor-equivalent posterior odds distinguishing significant from non-significant results.

4. Extension to Adaptive and Group-Sequential Designs

In adaptive or group-sequential studies, the sample size NN is a random variable contingent on interim stopping for efficacy or futility, differing under H0H_0 and H1H_1. Define:

  • E(N+)\mathbb{E}(N_+): Expected sample size when stopping with significance (“sig”)
  • E(N)\mathbb{E}(N_-): Expected sample size when stopping with non-significance (“nonsig”)

The generalized EUII becomes:

EUII=(LR+)1/E(N+)(LR)1/E(N)\mathrm{EUII} = (\mathrm{LR}^+)^{1/\mathbb{E}(N_+)} \cdot (\mathrm{LR}^-)^{-1/\mathbb{E}(N_-)}

Variability in NN can be accommodated by a second-order Taylor expansion:

EUII~=(LR+)(1+CV+2)/E(N+)(LR)(1+CV2)/E(N)\widetilde{\mathrm{EUII}} = (\mathrm{LR}^+)^{(1 + CV_+^2)/\mathbb{E}(N_+)} \cdot (\mathrm{LR}^-)^{-(1 + CV_-^2)/\mathbb{E}(N_-)}

where CV+=SD(N+)/E(N+)CV_+ = \mathrm{SD}(N_+)/\mathbb{E}(N_+) and similarly for CVCV_-.

Analytic or simulation-based estimation of E(N+), E(N), CV+, CV\mathbb{E}(N_+),\ \mathbb{E}(N_-),\ CV_+,\ CV_- enables concrete EUII calculation for varied adaptive designs, allowing precise evaluation of how early-stopping rules impact per-unit evidentiary value.

5. Numerical Examples

A two-arm, fixed-sample design with α=0.05\alpha = 0.05, power $0.80$, and effect size δ=0.50\delta = 0.50:

  • n126n \approx 126,
  • LR+=0.80/0.05=16\mathrm{LR}^+ = 0.80 / 0.05 = 16
  • LR=0.20/0.950.2105\mathrm{LR}^- = 0.20 / 0.95 \approx 0.2105
  • DOR=16/0.210576\mathrm{DOR} = 16 / 0.2105 \approx 76
  • EUII126761/1261.035\mathrm{EUII}_{126} \approx 76^{1/126} \approx 1.035

Each unit increases DOR by approximately 3.5%3.5\%.

A constant-bound Pocock group-sequential design (four looks), with α=0.05\alpha=0.05, δ=0.50\delta=0.50, and nmax136n_{\max} \approx 136:

  • E(N+)90\mathbb{E}(N_+)\approx90, E(N)136\mathbb{E}(N_-)\approx136
  • EUII=161/900.21051/1361.038\mathrm{EUII} = 16^{1/90} \cdot 0.2105^{-1/136} \approx 1.038

Early stopping confers an additional per-unit evidentiary value of 0.3%\approx0.3\% over the fixed-sample design.

6. Implications for Design Optimization

Maximizing EUII entails:

  • Recognizing that for a fixed effect size δ\delta, the asymptotic bound exp(δ2/2)\exp(\delta^2 / 2) (or exp(δ2/8)\exp(\delta^2 / 8) for two-sample tests) is unattainable by further increasing nn; per-unit information exhibits diminishing returns.
  • Tuning α\alpha at finite nn trades off between LR+\mathrm{LR}^+ and LR\mathrm{LR}^-. Standard choices for α\alpha are typically near-optimal.
  • Lower β\beta (higher power) always increases LR+\mathrm{LR}^+ and EUII but necessitates larger nn.
  • Maximizing power for fixed nn yields the uniformly most powerful test and, hence, the highest EUII.
  • Adaptive designs, notably those with effective early-stopping rules for efficacy or futility, reduce E(N+)\mathbb{E}(N_+) or E(N)\mathbb{E}(N_-), increasing EUII substantially. Two to four well-selected interim analyses and application of predictive-power futility boundaries (e.g., stop if predictive power <0.2<0.2–$0.3$) capture a significant fraction of possible evidentiary gains.
  • Unbalanced randomization reduces power at fixed nn and thus modestly lowers EUII; equal allocation optimizes EUII if this metric is prioritized.

In summary, the EUII provides a rigorous, unified measure of evidence efficiency per experimental unit in both frequentist and Bayesian contexts. It is maximized by adopting most powerful critical values at fixed α\alpha, minimizing α\alpha for given power, and employing adaptive early-stopping rules where feasible to reduce expected sample size while preserving evidentiary value (Held et al., 21 Nov 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Experimental Unit Information Index (EUII).