ExplorerData ScienceData Science
Research PaperResearchia:202601.10d08862

Diffusion Models with Heavy-Tailed Targets: Score Estimation and Sampling Guarantees

Yifeng Yu

Abstract

Score-based diffusion models have become a powerful framework for generative modeling, with score estimation as a central statistical bottleneck. Existing guarantees for score estimation largely focus on light-tailed targets or rely on restrictive assumptions such as compact support, which are often violated by heavy-tailed data in practice. In this work, we study conventional (Gaussian) score-based diffusion models when the target distribution is heavy-tailed and belongs to a Sobolev class with...

Submitted: January 10, 2026Subjects: Data Science; Data Science

Description / Details

Score-based diffusion models have become a powerful framework for generative modeling, with score estimation as a central statistical bottleneck. Existing guarantees for score estimation largely focus on light-tailed targets or rely on restrictive assumptions such as compact support, which are often violated by heavy-tailed data in practice. In this work, we study conventional (Gaussian) score-based diffusion models when the target distribution is heavy-tailed and belongs to a Sobolev class with smoothness parameter β>0. We consider both exponential and polynomial tail decay, indexed by a tail parameter γγ. Using kernel density estimation, we derive sharp minimax rates for score estimation, revealing a qualitative dichotomy: under exponential tails, the rate matches the light-tailed case up to polylogarithmic factors, whereas under polynomial tails the rate depends explicitly on γγ. We further provide sampling guarantees for the associated continuous reverse dynamics. In total variation, the generated distribution converges at the minimax optimal rate nβ/(2β+d)n^{-β/(2β+d)} under exponential tails (up to logarithmic factors), and at a γγ-dependent rate under polynomial tails. Whether the latter sampling rate is minimax optimal remains an open question. These results characterize the statistical limits of score estimation and the resulting sampling accuracy for heavy-tailed targets, extending diffusion theory beyond the light-tailed setting.

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jan 10, 2026
Topic:
Data Science
Area:
Data Science
Comments:
0
Bookmark
Diffusion Models with Heavy-Tailed Targets: Score Estimation and Sampling Guarantees | Researchia