Papers
Topics
Authors
Recent
2000 character limit reached

Possibilistic IV Regression

Updated 25 November 2025
  • Possibilistic Instrumental Variable Regression is a method for causal inference that relaxes exogeneity assumptions by allowing for arbitrary instrument invalidity.
  • It leverages possibility theory to perform sensitivity analysis and derive posterior possibility inferences and valid confidence sets for treatment effects.
  • Empirical evaluations show that adjusting the violation set A can maintain near-nominal coverage even when instruments are weak or potentially invalid.

Possibilistic instrumental variable regression is a methodology for causal inference in structural models with endogenous treatments when the validity of instrumental variables (IVs) is uncertain. Grounded in possibility theory rather than classical probability, this approach allows principled posterior inference on treatment effects under user-specified relaxations of the exogeneity assumption, thus facilitating sensitivity analysis even in the presence of arbitrary instrument invalidity. The method offers valid confidence sets for the treatment effect that remain informative with a single, potentially invalid, instrument and does not require specification of prior distributions or reliance on Markov chain Monte Carlo (MCMC) (Steiner et al., 20 Nov 2025).

1. Structural Model and Problem Formulation

The observable data comprise independent, identically distributed (Yi,Xi,Zi)(Y_i, X_i, Z_i) generated by the triangular structural model: Yi=βXi+Ziα+ϵi,Xi=Ziγ2+ηi,Y_i = \beta X_i + Z_i \alpha + \epsilon_i, \qquad X_i = Z_i \gamma_2 + \eta_i, where ZiRpZ_i \in \mathbb{R}^p is a vector of pp instruments, XiX_i is the treatment, YiY_i is the outcome, and (ϵi,ηi)(\epsilon_i, \eta_i)^\top is a mean-zero jointly Gaussian error (ϵi,ηi)N(0,Σ)(\epsilon_i, \eta_i)^\top \sim N\left(0, \Sigma\right) with

Σ=(σ11σ12 σ12σ22).\Sigma = \begin{pmatrix} \sigma_{11} & \sigma_{12} \ \sigma_{12} & \sigma_{22} \end{pmatrix}.

The classical IV assumptions are: relevance (γ20\gamma_2 \neq 0), exogeneity (α=0\alpha = 0), and instrument validity (Zi(ϵi,ηi)Z_i \perp (\epsilon_i, \eta_i)). However, in possibilistic IV regression, exogeneity is not assumed a priori; instead, α\alpha is allowed to be nonzero, encoding the potential invalidity of instruments.

Key to the approach is the incorporation of a violation set ARpA \subset \mathbb{R}^p, representing plausible values for α\alpha and thus for the degree and direction of exogeneity violations. Sensitivity analysis is then performed by conditioning inference on the event αA\alpha \in A.

2. Possibility Theory Foundations

A possibility function f:Θ[0,1]f: \Theta \to [0, 1] on parameter space Θ\Theta encodes uncertainty by satisfying supθΘf(θ)=1\sup_{\theta \in \Theta} f(\theta) = 1. The associated outer measure is

Pf(A)=supθAf(θ),AΘ.\overline{\mathbb{P}}_f(A) = \sup_{\theta \in A} f(\theta), \quad A \subset \Theta.

For a random variable θ\boldsymbol{\theta} with uncertainty fθf_{\boldsymbol{\theta}}, joint and conditional outer measures are defined via suprema analogous to the above, for instance

fθψ(θψ)=fθ,ψ(θ,ψ)supθfθ,ψ(θ,ψ).f_{\boldsymbol{\theta} \mid \boldsymbol{\psi}}(\theta \mid \psi) = \frac{f_{\boldsymbol{\theta}, \boldsymbol{\psi}}(\theta, \psi)}{\sup_{\theta'} f_{\boldsymbol{\theta}, \boldsymbol{\psi}}(\theta', \psi)}.

This framework enables uncertainty quantification and posterior inference without full probabilistic modeling, aligning with the modeling uncertainty inherent in IV settings with ambiguous exogeneity.

3. Posterior Possibility Inference

3.1 Reduced-Form Posterior

The reduced-form likelihood is modeled with W=[Y,X]MN(ZΓ,In,Ψ)W = [Y, X] \sim MN(Z \Gamma, I_n, \Psi), where Γ=[γ1,γ2]\Gamma = [\gamma_1, \gamma_2] and Ψ=R(β)ΣR(β)\Psi = R(\beta) \Sigma R(\beta)^\top with R(β)=[1β 01]R(\beta) = \begin{bmatrix} 1 & \beta \ 0 & 1 \end{bmatrix}.

With a vacuous prior f(Γ,Ψ)=1f(\Gamma, \Psi) = 1, the reduced-form posterior possibility is

fRF(Γ,ΨW)=p(WΓ,Ψ)supΓ,Ψp(WΓ,Ψ).f_{\text{RF}}(\Gamma, \Psi \mid W) = \frac{p(W \mid \Gamma, \Psi)}{\sup_{\Gamma', \Psi'} p(W \mid \Gamma', \Psi')}.

3.2 Structural Posterior

The parameters (Γ,Ψ)(\Gamma, \Psi) are reparameterized as (α,β,Σ)(\alpha, \beta, \Sigma) via γ1=βγ2+α\gamma_1 = \beta \gamma_2 + \alpha, Ψ=R(β)ΣR(β)\Psi = R(\beta)\Sigma R(\beta)^\top. The structural posterior possibility is then

fS(α,β,ΣW)=sup(Γ,Ψ):Γ[1,β]=α, Ψ=R(β)ΣR(β)fRF(Γ,ΨW).f_{\text{S}}(\alpha, \beta, \Sigma \mid W) = \sup_{(\Gamma, \Psi): \Gamma [1, -\beta]^\top = \alpha,~ \Psi = R(\beta)\Sigma R(\beta)^\top} f_{\text{RF}}(\Gamma, \Psi \mid W).

Under an uninformative prior, a closed-form solution is available by profiling out Σ\Sigma and using

logfS(α,βW)=12(αt(β))(ZZσ11)(αt(β)),\log f_{\mathrm{S}}(\alpha, \beta \mid W) = -\frac{1}{2} (\alpha - t(\beta))^\top \left( \frac{Z^\top Z}{\sigma_{11}} \right) (\alpha - t(\beta)),

where t(β)=γ^1βγ^2t(\beta) = \hat{\gamma}_1 - \beta \hat{\gamma}_2 and σ11=Ψ^112βΨ^12+β2Ψ^22\sigma_{11} = \hat{\Psi}_{11} - 2\beta \hat{\Psi}_{12} + \beta^2\hat{\Psi}_{22}. Here, Γ^=(ZZ)1ZW\hat{\Gamma} = (Z^\top Z)^{-1} Z^\top W, Ψ^=1n(WZΓ^)(WZΓ^)\hat{\Psi} = \frac{1}{n}(W - Z\hat{\Gamma})^\top (W - Z\hat{\Gamma}).

4. Conditional Inference and Computation

To assess β\beta given possible instrument invalidity, the posterior possibility conditional on αA\alpha \in A is

f(βαA,W)=supαA,ΣfS(α,β,ΣW)supβ,αA,ΣfS(α,β,ΣW).f(\beta \mid \alpha \in A, W) = \frac{\sup_{\alpha \in A,\, \Sigma} f_{\text{S}}(\alpha, \beta, \Sigma \mid W)}{\sup_{\beta',\, \alpha \in A,\, \Sigma} f_{\text{S}}(\alpha, \beta', \Sigma \mid W)}.

The supremum in Σ\Sigma is solved via the MLE, as R(β)Σ^(β)R(β)=Ψ^R(\beta)\hat{\Sigma}(\beta) R(\beta)^\top = \hat{\Psi}. The optimization in α\alpha simplifies to projecting t(β)t(\beta) onto AA under the ZZZ^\top Z metric:

  • If t(β)At(\beta) \in A, the maximizer α^(β)=t(β)\hat{\alpha}(\beta) = t(\beta);
  • Otherwise, α^(β)\hat{\alpha}(\beta) is the projection of t(β)t(\beta) onto AA in the ZZZ^\top Z norm.

The practical computation reduces to:

  1. Estimating reduced form parameters;
  2. For each candidate β\beta, projecting t(β)t(\beta) onto AA;
  3. Normalizing to form f(βαA,w)f(\beta \mid \alpha \in A, w).

The overall complexity is dominated by a pp-dimensional quadratic program and a 2×22 \times 2 covariance update for each β\beta.

5. Validified Confidence Sets and Sensitivity Analysis

Following Martin–Liu (2013), the validified posterior possibility is defined as

πw(βA)=Pβ{f(βαA,W)f(βαA,w)},\pi_w(\beta \mid A) = \mathbb{P}_\beta\{ f(\beta \mid \alpha \in A, W) \leq f(\beta \mid \alpha \in A, w) \},

where Pβ\mathbb{P}_\beta denotes the sampling distribution under β\beta. This yields a valid (1δ)(1-\delta) confidence set {β:πw(βA)δ}\{ \beta : \pi_w(\beta \mid A) \geq \delta \}, satisfying

supβPβ{πW(βA)δ}δ,δ[0,1].\sup_{\beta} \mathbb{P}_\beta\{ \pi_W(\beta \mid A) \leq \delta \} \leq \delta, \quad \forall\, \delta \in [0, 1].

Empirically, these intervals attain near-nominal frequentist coverage when AA contains the true α\alpha.

Sensitivity analysis is facilitated by varying the violation set AA. Setting Aτ={α:ατ}A_\tau = \{\alpha : \|\alpha\| \leq \tau\} transitions inference from point-identified as τ0\tau \to 0 to uninformative as τ\tau \to \infty. Graphically displaying f(βαAτ,w)f(\beta \mid \alpha \in A_\tau, w) as a function of τ\tau produces a "sensitivity curve" indexing the stability of causal conclusions to exogeneity violations.

6. Empirical Evaluation

Simulation experiments address single- and multiple-instrument settings:

  • For a single instrument (p=1p=1) with possible violations α{0,0.25,0.5}\alpha \in \{0, 0.25, 0.5\} and n=100n=100, nominal 95%95\% coverage is maintained if AA is correctly specified. Allowing for plausible α\alpha values, i.e., A=[0.5,0.5]A = [-0.5, 0.5], restores coverage when true α0\alpha \neq 0, though the confidence interval widens.
  • For multiple instruments (p=5p=5) with up to all instruments invalid (αi=0.1\alpha_i = 0.1), possibilistic IV regression using A=[0.1,0.1]pA = [-0.1, 0.1]^p or A=[0,0.2]pA = [0, 0.2]^p achieves near-nominal coverage, unlike competing methods, which fail when instrument invalidity is widespread.

A real-data example using the Acemoglu–Johnson–Robinson (AJR) dataset (n=64n=64 countries; Y=logY = \log GDP/capita; X=X = institutional quality; Z=logZ = \log settler mortality) demonstrates that with A={0}A = \{0\}, inference on β\beta is tight, but relaxing to A=[0.4,0.4]A = [-0.4, 0.4] leads the confidence set to include zero. Posterior probabilities for β>0\beta > 0 remain robust under moderate exogeneity violations.

7. Advantages, Limitations, and Methodological Comparison

Possibilistic IV regression offers several advantages:

Feature Possibilistic IV Regression Existing Alternatives
Handles arbitrary invalidity Yes (even single instrument) Often fails
Interval estimation Yes (possibility/confidence sets) Rare or conservative
Sensitivity analysis Natural, via violation set AA Typically ad hoc
Optimization complexity Finite-dimensional (no MCMC) Often requires MCMC
Frequentist calibration Validified intervals for all β\beta May not hold
  • Limitations: Requires explicit specification of a plausible violation set AA; computational burden grows with instrument dimension pp, especially for complex AA; as AA expands, inference becomes uninformative.
  • Comparison with Existing Methods:
    • Two-stage least squares (TSLS): Fails with invalid or weak instruments.
    • Plausible GMM (PGMM): Dependent on Gaussian priors.
    • BudgetIV: Relies on a "budget" hyperparameter, often overly conservative.
    • CIIV: Restricted to certain linear settings.
    • gIVBMA: Sensitive to prior choices.
    • Partial IV: Requires strong instrument strength.

A plausible implication is that possibilistic IV regression generalizes many existing point- and interval-estimation approaches, providing a robust, practical framework for sensitivity analysis under instrument invalidity (Steiner et al., 20 Nov 2025). Empirically, its calibration and interval width are reasonable when AA is well-guided, and the method directly connects uncertainty about exogeneity violations with interval estimations for causal effects.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)
Slide Deck Streamline Icon: https://streamlinehq.com

Whiteboard

Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Possibilistic Instrumental Variable Regression.