Back to Explorer
Research PaperResearchia:202602.28022[Chemistry > Chemistry]

Efficient training of generative models from multireference simulations and its application to the design of Dy complexes with large magnetic anisotropy

Zahra Khatibi

Abstract

Generative machine learning models can potentially provide direct access to novel and relevant portions of the full chemical space, overcoming the cost of systematic sampling. However, the training of these models generally requires a large amount of data, often precluding the use of expensive high-level ab initio simulations for this task. The generation of coordination compounds of Dy with large magnetic anisotropy represents a topical example, where multireference simulations of large molecules are necessary to perform reliable predictions. Here, we show that a semi-supervised chemically-inspired training-by-proxy of generative variational autoencoders can reduce the cost associated with building a training set from multireference simulations by two orders of magnitude. We illustrate the power of this approach by generating 100s of new organic ligands for Dy(III) pentagonal bipyramidal complexes exhibiting record values of magnetic anisotropy, while starting from datasets as small as 1k multireference calculations. This work thus paves the way to the computational generation of molecules as complex coordination compounds with target electronic and magnetic properties.


Source: arXiv:2602.23230v1 - http://arxiv.org/abs/2602.23230v1 PDF: https://arxiv.org/pdf/2602.23230v1 Original Link: http://arxiv.org/abs/2602.23230v1

Submission:2/28/2026
Comments:0 comments
Subjects:Chemistry; Chemistry
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!