Weighted Average Derivative Effects
- Weighted Average Derivative Effects (WADEs) are defined by integrating the local derivative of a conditional mean function with an arbitrary weighting measure, generalizing ADEs.
- They facilitate efficient semiparametric estimation and robust causal inference across frameworks like quantile IV and kernel regression.
- Optimal weighting, Riesz representer techniques, and kernel-based methods enhance WADEs’ practical implementation and policy interpretation.
Weighted average derivative effects (WADEs) generalize the concept of average derivative effects (ADEs) by integrating the local derivative of a structural or conditional mean function against an arbitrary weighting measure, producing a scalar estimand of central interest in nonparametric analysis, semiparametric efficiency, and causal inference. WADEs subsume the ADE as a special case, admit rich causal and policy interpretations, and serve as canonical functionals in quantile IV, kernel regression, and doubly robust estimation frameworks.
1. Formal Definition and General Properties
Let denote observed outcome, continuous exposure or treatment, and covariates, respectively. The conditional mean outcome function is , with differentiable. For a weight function satisfying , the WADE is
When is normalized such that , is the ordinary average of the local slopes. The special case yields the unweighted ADE, . In conditional mean regression , the WADE associated with is (Hines et al., 2023, Cattaneo et al., 2022).
2. Riesz Representer Characterization and Influence Functions
WADEs are bounded linear functionals on the Hilbert space of square-integrable functions of , yielding a representation via the Riesz representer :
By integration by parts,
where denotes the conditional density of . The double-robust, one-step estimator is then
This approach enables valid inference using cross-fitted or machine-learned nuisance estimators, provided mean-square consistency (Hines et al., 2023, Hines et al., 2021).
3. Semiparametric Efficiency, Optimal Weights, and Variance Bounds
The efficiency bound for WADEs is determined by the variance of the influence function. Denote heteroscedastic variance by . Among all admissible weights, the optimal minimizing the nonparametric efficiency bound satisfies
with defined as
For homoscedasticity, the optimal WADE simplifies to
estimable via sample-split cross-fitting:
The nonparametric efficiency bound for is
with consistent variance estimation via plug-in influence-function sample averages (Hines et al., 2023, Hines et al., 2021).
4. WADEs in Nonparametric Quantile IV and Ill-posed Inverse Problems
WADEs characterize key functionals in nonparametric quantile IV regression (NPQIV), which is associated with non-separable, nonlinear, and ill-posed inverse models:
with user-supplied and the weak derivative. Semiparametric efficiency bounds for NPQIV WADEs depend on unknown conditional derivative operators and ill-posedness degree, rendering the information bound singular or non-singular according to . Penalized sieve generalized empirical likelihood (GEL) estimators constructed from unconditional and growing unconditional moment restrictions achieve consistency and, under regular conditions, asymptotic normality (Chen et al., 2019).
5. Kernel-based Density-Weighted ADEs, Bandwidth, and Robust Inference
Density-weighted average derivative (DWAD) estimators operationalize WADEs in kernel regression contexts, with (density weighting):
estimable by symmetric kernel and bandwidth :
Classical asymptotically-linear (AL) theory requires restrictive smoothness and bandwidth conditions (, ); inference is invariant to . The small-bandwidth (SB) framework relaxes these, allowing to shrink faster, produces robust finite-sample performance, and yields Gaussian limits for linear and quadratic terms with debiased variance estimators:
SB inference achieves lower coverage error and improved robustness to relative to AL (Cattaneo et al., 2022). Edgeworth expansions explicitly characterize bias, skewness, and coverage accuracy for both AL and SB regimes.
6. Causal and Policy Interpretations
WADEs admit causal interpretations as weighted averages of local incremental or “infinitesimal shift” effects, quantifying average outcome response to marginal changes in exposure under observable or stochastic intervention laws. For ADEs (, ), is the average derivative effect for infinitesimal shifts. Under general and , the estimand
measures the average slope of under intervention law . The optimal —identified via calculus of variations—minimizes the semiparametric variance bound among all such interventions, generalizing Crump overlap-weighted ATE principles to continuous exposures. WADEs thus serve as key estimands for incremental dose-response effects, overlap weighting, and robust causal analyses (Hines et al., 2023, Hines et al., 2021).
7. Practical Implementation and Guidelines
- Basis Selection: Linear sieves (e.g., B-splines, wavelets, polynomials) for functional parameter approximation; component basis functions for instruments and exposures increase with sample size (Chen et al., 2019).
- Penalty Tuning: Cross-validation, information criteria, or rule-of-thumb sequences (e.g., ) subject to prior smoothness specifications (Chen et al., 2019).
- Bandwidth in DWAD: Prefer small bandwidth for bias reduction in density-weighted estimation, ensuring (Cattaneo et al., 2022).
- Variance Estimation: Use empirical influence-function plug-in estimators for WADE SEs; in kernel settings, apply SB variance estimators for robust coverage (Cattaneo et al., 2022).
- Cross-fitting: Adopt -fold sample splitting for nuisance function estimation in debiased machine-learning WADE estimators, facilitating root- consistency and valid inference under weak regularity (Hines et al., 2023, Hines et al., 2021).
WADEs unify average derivative methodologies in economics, causal inference, and semiparametric theory, facilitating efficient estimation, robust inference, and interpretable “local” effects in models with continuous exposures, arbitrary weighting, and ill-posed inverse structures.