Finite-sample theory for mean/median aggregation of DML across sample splits
Determine whether aggregating double/debiased machine learning estimators across multiple random sample splits using mean or median aggregation improves finite-sample performance relative to using a single cross-fitted DML estimator, and, if so, establish formal theoretical guarantees quantifying any improvement under standard conditions on nuisance-function estimation and sample splitting.
References
We are not aware of formal theoretical arguments that point to improved finite sample properties of mean or median aggregated DML estimators. Our recommendation should be taken as practical but heuristic.
                — An Introduction to Double/Debiased Machine Learning
                
                (2504.08324 - Ahrens et al., 11 Apr 2025) in Section 5.1 (Group-Time Average Treatment Effects of Hospitalization), footnote