ExplorerData ScienceMachine Learning
Research PaperResearchia:202605.27059

Towards Controllable Image Generation through Representation-Conditioned Diffusion Models

Nithesh Chandher Karthikeyan

Abstract

Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text prompts or semantic maps, which require extensively annotated datasets. In this preliminary work, we explore diffusion models conditioned on representations from a pre-trained self-supervised model. The self-conditioning mechanism not only improves the quality ...

Submitted: May 27, 2026Subjects: Machine Learning; Data Science

Description / Details

Diffusion models have emerged as powerful tools for high-quality image generation and editing, but guiding these models to produce specific outputs remains a challenge. Conventional approaches rely on conditioning mechanisms, such as text prompts or semantic maps, which require extensively annotated datasets. In this preliminary work, we explore diffusion models conditioned on representations from a pre-trained self-supervised model. The self-conditioning mechanism not only improves the quality of unconditional image generation, but also provides a representation space that can be used to control the generation. We explore this conditioning space by identifying directions of variations, and demonstrate promising properties in terms of smoothness and disentanglement.


Source: arXiv:2605.27343v1 - http://arxiv.org/abs/2605.27343v1 PDF: https://arxiv.org/pdf/2605.27343v1 Original Link: http://arxiv.org/abs/2605.27343v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
May 27, 2026
Topic:
Data Science
Area:
Machine Learning
Comments:
0
Bookmark