Back to Explorer
Research PaperResearchia:202601.09a86818[Data Science > Data Science]

What Functions Does XGBoost Learn?

Dohyeong Ki

Abstract

This paper establishes a rigorous theoretical foundation for the function class implicitly learned by XGBoost, bridging the gap between its empirical success and our theoretical understanding. We introduce an infinite-dimensional function class Fβˆžβˆ’STd,s\mathcal{F}^{d, s}_{\infty-\text{ST}} that extends finite ensembles of bounded-depth regression trees, together with a complexity measure Vβˆžβˆ’XGBd,s(β‹…)V^{d, s}_{\infty-\text{XGB}}(\cdot) that generalizes the L1L^1 regularization penalty used in XGBoost. We show that every optimizer of the XGBoost objective is also an optimizer of an equivalent penalized regression problem over Fβˆžβˆ’STd,s\mathcal{F}^{d, s}_{\infty-\text{ST}} with penalty Vβˆžβˆ’XGBd,s(β‹…)V^{d, s}_{\infty-\text{XGB}}(\cdot), providing an interpretation of XGBoost as implicitly targeting a broader function class. We also develop a smoothness-based interpretation of Fβˆžβˆ’STd,s\mathcal{F}^{d, s}_{\infty-\text{ST}} and Vβˆžβˆ’XGBd,s(β‹…)V^{d, s}_{\infty-\text{XGB}}(\cdot) in terms of Hardy--Krause variation. We prove that the least squares estimator over {f∈Fβˆžβˆ’STd,s:Vβˆžβˆ’XGBd,s(f)≀V}\{f \in \mathcal{F}^{d, s}_{\infty-\text{ST}}: V^{d, s}_{\infty-\text{XGB}}(f) \le V\} achieves a nearly minimax-optimal rate of convergence nβˆ’2/3(log⁑n)4(min⁑(s,d)βˆ’1)/3n^{-2/3} (\log n)^{4(\min(s, d) - 1)/3}, thereby avoiding the curse of dimensionality. Our results provide the first rigorous characterization of the function space underlying XGBoost, clarify its connection to classical notions of variation, and identify an important open problem: whether the XGBoost algorithm itself achieves minimax optimality over this class.

Submission:1/9/2026
Comments:0 comments
Subjects:Data Science; Data Science
Original Source:
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

What Functions Does XGBoost Learn? | Researchia