ExplorerData ScienceStatistics
Research PaperResearchia:202603.23027

Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD

Emiel Hoogeboom

Abstract

It is currently difficult to distill discrete diffusion models. In contrast, continuous diffusion literature has many distillation approaches methods that can reduce sampling steps to a handful. Our method, Discrete Moment Matching Distillation (D-MMD), leverages ideas that have been highly successful in the continuous domain. Whereas previous discrete distillation methods collapse, D-MMD maintains high quality and diversity (given sufficient sampling steps). This is demonstrated on both text ...

Submitted: March 23, 2026Subjects: Statistics; Data Science

Description / Details

It is currently difficult to distill discrete diffusion models. In contrast, continuous diffusion literature has many distillation approaches methods that can reduce sampling steps to a handful. Our method, Discrete Moment Matching Distillation (D-MMD), leverages ideas that have been highly successful in the continuous domain. Whereas previous discrete distillation methods collapse, D-MMD maintains high quality and diversity (given sufficient sampling steps). This is demonstrated on both text and image datasets. Moreover, the newly distilled generators can outperform their teachers.


Source: arXiv:2603.20155v1 - http://arxiv.org/abs/2603.20155v1 PDF: https://arxiv.org/pdf/2603.20155v1 Original Link: http://arxiv.org/abs/2603.20155v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Mar 23, 2026
Topic:
Data Science
Area:
Statistics
Comments:
0
Bookmark
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD | Researchia