
A Fast Sampling Gradient Tree Boosting Framework (1911.08820v1)

Published 20 Nov 2019 in cs.LG and stat.ML

Abstract: As an adaptive, interpretable, robust, and accurate meta-algorithm for arbitrary differentiable loss functions, gradient tree boosting is one of the most popular machine learning techniques, though its computational expense severely limits its usage. Stochastic gradient boosting can be adopted to accelerate gradient boosting by uniformly sampling training instances, but its estimator may introduce high variance. This motivates us to optimize gradient tree boosting. We combine gradient tree boosting with importance sampling, which achieves better performance by reducing the stochastic variance. Furthermore, we use a regularizer to improve the diagonal approximation in the Newton step of gradient boosting. Theoretical analysis shows that our strategies achieve a linear convergence rate on logistic loss. Empirical results show that our algorithm achieves a 2.5x--18x acceleration on two different gradient boosting algorithms (LogitBoost and LambdaMART) without appreciable performance loss.
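
To make the sampling idea concrete, below is a minimal sketch of one boosting round that samples training instances by importance (here, proportionally to gradient magnitude) rather than uniformly, and reweights the sampled instances to keep the estimator unbiased. This is not the authors' implementation: the sampling distribution, the use of scikit-learn's DecisionTreeRegressor as the base learner, and parameters such as sample_frac and max_depth are illustrative assumptions.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def importance_sampled_boosting_round(X, y, current_pred, sample_frac=0.3,
                                      max_depth=4, eps=1e-12):
    """One illustrative boosting round for logistic loss with
    importance-sampled instances (a sketch, not the paper's algorithm)."""
    # First- and (diagonal) second-order terms of the logistic loss
    p = 1.0 / (1.0 + np.exp(-current_pred))
    grad = p - y
    hess = p * (1.0 - p)

    # Importance-sampling distribution proportional to |gradient|
    probs = np.abs(grad) + eps
    probs /= probs.sum()

    n = X.shape[0]
    m = max(1, int(sample_frac * n))
    idx = np.random.choice(n, size=m, replace=False, p=probs)

    # Unbiased reweighting: a sampled instance carries weight 1 / (n * p_i)
    weights = 1.0 / (n * probs[idx])

    # Newton-style target -grad / hess, fitted with importance weights
    target = -grad[idx] / np.maximum(hess[idx], eps)
    tree = DecisionTreeRegressor(max_depth=max_depth)
    tree.fit(X[idx], target, sample_weight=weights * hess[idx])
    return tree
```

Sampling proportionally to |gradient| concentrates the per-round budget on instances the current model handles poorly, which is the intuition behind the variance reduction over uniform subsampling; the exact sampling weights and the regularized diagonal Newton step used in the paper differ in detail.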

Authors (3)
  1. Daniel Chao Zhou (1 paper)
  2. Zhongming Jin (13 papers)
  3. Tong Zhang (569 papers)
Citations (2)
