ExplorerComputational LinguisticsNLP
Research PaperResearchia:202606.05010

Self-Augmenting Retrieval for Diffusion Language Models

Paul Jünger

Abstract

Discrete diffusion language models generate text by iteratively denoising an entire response in parallel. At each step, they predict tentative tokens for every masked position, committing the confident predictions to the output and discarding the unconfident ones. We show that the discarded tokens are in fact a useful lookahead signal for retrieval-augmented generation: even low-confidence tokens often surface salient entities early in the denoising trajectory, enabling retrieval of stronger evi...

Submitted: June 5, 2026Subjects: NLP; Computational Linguistics

Description / Details

Discrete diffusion language models generate text by iteratively denoising an entire response in parallel. At each step, they predict tentative tokens for every masked position, committing the confident predictions to the output and discarding the unconfident ones. We show that the discarded tokens are in fact a useful lookahead signal for retrieval-augmented generation: even low-confidence tokens often surface salient entities early in the denoising trajectory, enabling retrieval of stronger evidence before the output is finalized. We exploit this through Self-Augmenting Retrieval for Diffusion Language Models (SARDI), a dynamic RAG framework that uses these lookahead tokens to guide retrieval during denoising. SARDI is training-free, retriever-agnostic, and applicable to any reasoning-capable discrete diffusion language model. Across five multi-hop QA benchmarks, SARDI outperforms current training-free diffusion and autoregressive retrieval baselines at up to 8×8\times higher throughput.


Source: arXiv:2606.06474v1 - http://arxiv.org/abs/2606.06474v1 PDF: https://arxiv.org/pdf/2606.06474v1 Original Link: http://arxiv.org/abs/2606.06474v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 5, 2026
Topic:
Computational Linguistics
Area:
NLP
Comments:
0
Bookmark
Self-Augmenting Retrieval for Diffusion Language Models | Researchia