Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
8 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Generalized Random Forests using Fixed-Point Trees (2306.11908v4)

Published 20 Jun 2023 in stat.ML, cs.LG, and stat.ME

Abstract: We propose a computationally efficient alternative to generalized random forests (GRFs) for estimating heterogeneous effects in large dimensions. While GRFs rely on a gradient-based splitting criterion, which in large dimensions is computationally expensive and unstable, our method introduces a fixed-point approximation that eliminates the need for Jacobian estimation. This gradient-free approach preserves GRF's theoretical guarantees of consistency and asymptotic normality while significantly improving computational efficiency. We demonstrate that our method achieves a speedup of multiple times over standard GRFs without compromising statistical accuracy. Experiments on both simulated and real-world data validate our approach. Our findings suggest that the proposed method is a scalable alternative for localized effect estimation in machine learning and causal inference applications

Definition Search Book Streamline Icon: https://streamlinehq.com
References (13)
  1. Joshua D Angrist and Jörn-Steffen Pischke “Mostly harmless econometrics: An empiricist’s companion” Princeton university press, 2009
  2. “Recursive partitioning for heterogeneous causal effects” In Proceedings of the National Academy of Sciences 113.27 National Acad Sciences, 2016, pp. 7353–7360
  3. Susan Athey, Julie Tibshirani and Stefan Wager “Generalized random forests” In The Annals of Statistics 47.2 Institute of Mathematical Statistics, 2019, pp. 1148–1178 DOI: 10.1214/18-AOS1709
  4. Leo Breiman “Random forests” In Machine learning 45 Springer, 2001, pp. 5–32
  5. “Classification and regression trees” CRC, 1984
  6. Raymond J. Carroll, David Ruppert and Alan H. Welsh “Local estimating equations” In Journal of the American Statistical Association 93.441 Taylor & Francis, 1998, pp. 214–227
  7. “The elements of statistical learning: data mining, inference, and prediction” Springer, 2009
  8. Peter J. Huber “The behavior of maximum likelihood estimates under nonstandard conditions” In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1967, pp. 221–233
  9. Whitney K Newey “Kernel estimation of partial means and a general variance estimator” In Econometric Theory 10.2 Cambridge University Press, 1994, pp. 1–21
  10. “grf: Generalized Random Forests” R package version 2.2.1, 2023 URL: https://github.com/grf-labs/grf
  11. “Estimation and inference of heterogeneous treatment effects using random forests” In Journal of the American Statistical Association 113.523 Taylor & Francis, 2018, pp. 1228–1242
  12. “Adaptive concentration of regression trees, with application to random forests” In arXiv preprint arXiv:1503.06388, 2015
  13. “Flexible Regularized Estimating Equations: Some New Perspectives” In arXiv preprint arXiv:2110.11074, 2021

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com