LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation
2510.23040v1
cs.LG, cond-mat.mtrl-sci, cs.AI
2025-10-29
Авторы:
Subhojyoti Khastagir, Kishalay Das, Pawan Goyal, Seung-Cheol Lee, Satadeep Bhattacharjee, Niloy Ganguly
Abstract
Recent advances in generative modeling have shown significant promise in
designing novel periodic crystal structures. Existing approaches typically rely
on either large language models (LLMs) or equivariant denoising models, each
with complementary strengths: LLMs excel at handling discrete atomic types but
often struggle with continuous features such as atomic positions and lattice
parameters, while denoising models are effective at modeling continuous
variables but encounter difficulties in generating accurate atomic
compositions. To bridge this gap, we propose CrysLLMGen, a hybrid framework
that integrates an LLM with a diffusion model to leverage their complementary
strengths for crystal material generation. During sampling, CrysLLMGen first
employs a fine-tuned LLM to produce an intermediate representation of atom
types, atomic coordinates, and lattice structure. While retaining the predicted
atom types, it passes the atomic coordinates and lattice structure to a
pre-trained equivariant diffusion model for refinement. Our framework
outperforms state-of-the-art generative models across several benchmark tasks
and datasets. Specifically, CrysLLMGen not only achieves a balanced performance
in terms of structural and compositional validity but also generates more
stable and novel materials compared to LLM-based and denoisingbased models
Furthermore, CrysLLMGen exhibits strong conditional generation capabilities,
effectively producing materials that satisfy user-defined constraints. Code is
available at https://github.com/kdmsit/crysllmgen