Generalized Random Forests using Fixed-Point Trees (2306.11908v4)
Abstract: We propose a computationally efficient alternative to generalized random forests (GRFs) for estimating heterogeneous effects in large dimensions. While GRFs rely on a gradient-based splitting criterion, which in large dimensions is computationally expensive and unstable, our method introduces a fixed-point approximation that eliminates the need for Jacobian estimation. This gradient-free approach preserves GRF's theoretical guarantees of consistency and asymptotic normality while significantly improving computational efficiency. We demonstrate that our method achieves a speedup of multiple times over standard GRFs without compromising statistical accuracy. Experiments on both simulated and real-world data validate our approach. Our findings suggest that the proposed method is a scalable alternative for localized effect estimation in machine learning and causal inference applications
- Joshua D Angrist and Jörn-Steffen Pischke “Mostly harmless econometrics: An empiricist’s companion” Princeton university press, 2009
- “Recursive partitioning for heterogeneous causal effects” In Proceedings of the National Academy of Sciences 113.27 National Acad Sciences, 2016, pp. 7353–7360
- Susan Athey, Julie Tibshirani and Stefan Wager “Generalized random forests” In The Annals of Statistics 47.2 Institute of Mathematical Statistics, 2019, pp. 1148–1178 DOI: 10.1214/18-AOS1709
- Leo Breiman “Random forests” In Machine learning 45 Springer, 2001, pp. 5–32
- “Classification and regression trees” CRC, 1984
- Raymond J. Carroll, David Ruppert and Alan H. Welsh “Local estimating equations” In Journal of the American Statistical Association 93.441 Taylor & Francis, 1998, pp. 214–227
- “The elements of statistical learning: data mining, inference, and prediction” Springer, 2009
- Peter J. Huber “The behavior of maximum likelihood estimates under nonstandard conditions” In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1967, pp. 221–233
- Whitney K Newey “Kernel estimation of partial means and a general variance estimator” In Econometric Theory 10.2 Cambridge University Press, 1994, pp. 1–21
- “grf: Generalized Random Forests” R package version 2.2.1, 2023 URL: https://github.com/grf-labs/grf
- “Estimation and inference of heterogeneous treatment effects using random forests” In Journal of the American Statistical Association 113.523 Taylor & Francis, 2018, pp. 1228–1242
- “Adaptive concentration of regression trees, with application to random forests” In arXiv preprint arXiv:1503.06388, 2015
- “Flexible Regularized Estimating Equations: Some New Perspectives” In arXiv preprint arXiv:2110.11074, 2021