Back to Explorer
Research PaperResearchia:202601.29193[Statistics & ML > Statistics]

Generative Modeling of Discrete Data Using Geometric Latent Subspaces

Daniel Gonzalez-Alvarado

Abstract

We introduce the use of latent subspaces in the exponential parameter space of product manifolds of categorial distributions, as a tool for learning generative models of discrete data. The low-dimensional latent space encodes statistical dependencies and removes redundant degrees of freedom among the categorial variables. We equip the parameter domain with a Riemannian geometry such that the spaces and distances are related by isometries which enables consistent flow matching. In particular, geodesics become straight lines which makes model training by flow matching effective. Empirical results demonstrate that reduced latent dimensions suffice to represent data for generative modeling.


Source: arXiv:2601.21831v1 - http://arxiv.org/abs/2601.21831v1 PDF: https://arxiv.org/pdf/2601.21831v1 Original Link: http://arxiv.org/abs/2601.21831v1

Submission:1/29/2026
Comments:0 comments
Subjects:Statistics; Statistics & ML
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!