MiAD: Mirage Atom Diffusion in Crystals
- MiAD is a generative diffusion framework that introduces a mirage atom type to enable dynamic atom counts in crystal synthesis.
- It employs an equivariant joint Markov diffusion model combining Gaussian, Wrapped Normal, and categorical diffusions for lattice, coordinates, and atom types.
- MiAD achieves a state-of-the-art 8.2% S.U.N. rate, outperforming baselines in stability and demonstrating improved uniqueness and novelty.
Mirage Atom Diffusion (MiAD) is a diffusion-based generative framework for de novo crystal generation that explicitly models the appearance and disappearance of atoms during the generative process. Introduced as a solution to the rigidity of fixed atom count in prior diffusion models, MiAD achieves state-of-the-art results on metrics of crystal stability, uniqueness, and novelty (S.U.N.) by employing a simple mirage infusion technique. This approach introduces an auxiliary “mirage” atom type, expands the generative domain to permit dynamic atom counts, and integrates these modifications within an equivariant joint Markov diffusion model (Okhotin et al., 18 Nov 2025).
1. Joint Diffusion Modeling for Crystal Structures
MiAD treats a crystal unit cell as a triplet , where is the lattice matrix, denotes fractional coordinates of atoms, and the atom types. The model employs a Markov noising process to corrupt data into pure noise, and a learnable reverse denoising process to synthesize samples.
The forward kernel factorizes as
with component-wise treatment as follows:
- Lattice: A Gaussian DDPM diffusion: .
- Fractional coordinates: Modeled on the torus via Wrapped Normal kernels , with score-matching loss leveraging Riemannian gradients.
- Atom types: Handled by categorical D3PM diffusion, .
The total loss combines lattice, coordinate, and atom-type terms as . The neural network is an equivariant GNN (CSPNet), respecting atom permutation, O(3) rotation, and periodicity.
2. Mirage Infusion: Dynamic Atom Count Handling
Traditional diffusion models for crystal generation fix (atom count) at the outset, constraining trajectory variability. MiAD's mirage infusion addresses this by:
- Introducing an extra “mirage” atom type (type 0).
- Fixing a large maximum atom count training atom count; working instead with
- Infusion operator: Extends a genuine crystal with real atoms to by appending mirage atoms (type 0) with uniformly random coordinates.
- Reduction operator: After sampling, genuine atoms are identified by type , while mirage atoms (type 0) are removed.
All model components and losses are computed in the expanded -atom domain, but only non-mirage atoms contribute to the coordinate loss, via a masked objective:
This enables the model to generate/deactivate atoms dynamically during denoising.
3. Mathematical Model and Equivariant Constraints
Both forward and reverse processes operate over the expanded joint space: The full training loss is
with the KL loss for the lattice, for masked coordinates, and for atom-type transitions across all atoms.
Equivariance under group actions is enforced so that, for any group element (e.g., permutation, rotation, translation),
4. Neural Network Architecture and Symmetry Handling
MiAD employs the CSPNet GNN architecture from DiffCSP. Key features include:
- Atom embeddings updated via message-passing layers, pooling over neighbors within a fixed cutoff.
- Lattice representation processed by a feed-forward network on for O(3) equivariance.
- Fractional-coordinate updates use pairwise Fourier features to impose periodicity.
The fixed-size representation guarantees the network is agnostic to the actual count of non-mirage atoms, handling dynamic atom populations transparently.
5. Sampling Algorithm
MiAD's generative process operates as follows:
- Initialization:
- (lattice)
- (fractional coordinates)
- (atom types)
- Denoising:
- For down to $1$: predict and sample from .
- Reduction:
- Discard all atoms with to obtain a valid sample .
This routine allows for emergence or disappearance of atoms at any reverse diffusion step.
6. Experimental Evaluation
Dataset and Training
Experiments are conducted on the MP-20 subset of the Materials Project, containing 45,231 stable crystals (train/val/test split 60/20/20%). MiAD and baseline models are evaluated using 10,000 generated samples each. Hyperparameters follow prior work (DiffCSP/CSPNet): hidden dimension 512, 6 GNN layers, maximum 100 atoms, maximum 20 neighbors, and cutoff 7.0 Å. For MiAD, is used (≈2.7× the mean atom count). Training uses batch size 256, Adam optimizer with learning rate .
Quantitative Results
Key performance is measured by the S.U.N. rate (fraction of generated crystals that are stable by DFT [E0 eV], unique via structure matching, and novel/unseen in training). Select results:
| Model | Stable % | Unique % | S.U.N. % |
|---|---|---|---|
| DiffCSP | 5.0 | 66.0 | 3.3 |
| FlowMM | 4.6 | 60.9 | 2.8 |
| FlowLLM | 13.9 | 33.8 | 4.7 |
| WyFormer | 5.2 | 92.3 | 4.8 |
| ADiT | 12.8 | 50.8 | 6.5 |
| MiAD | 12.5 | 65.2 | 8.2 |
MiAD achieves an 8.2% S.U.N. rate, a 2.5× improvement over the unmodified DiffCSP baseline and a 25% gain over the previous state of the art. Similar improvements are observed on ML-potential-based stability metrics and larger datasets.
7. Limitations and Prospective Research
Several limitations are identified:
- Computational Overhead: Mirage infusion increases the average atom count (by ≈2.7×), resulting in roughly 4× slower training and sampling and elevated memory requirements.
- Hyperparameter Sensitivity: Performance depends on and loss coefficient ; ablations suggest optimal and balanced loss weights ( lattice, coord, atom type).
- Failure Modes: Inadequate masking or improper initialization of mirage atoms may lead to degenerate behaviors such as trivial atom insertions or removals, or reduced generative diversity.
- Metrics Coverage: The S.U.N. metric, while comprehensive for stability and novelty, does not capture space-group diversity or structural variety, suggesting the need for additional evaluation criteria.
- Generalization: Mirage infusion is directly compatible with DiffCSP, MatterGen, FlowMM, CrystalFlow. Extending the approach to architectures such as transformers or to new domains (e.g., drug design, nanoporous materials) is proposed as a promising future direction.
A plausible implication is that the mirage infusion technique could be generalized beyond crystals to other structured generative domains that require variable count handling, provided equivariance and masking are appropriately enforced.
MiAD demonstrates that introducing a mirage atom type to joint diffusion models for crystal structure generation enables flexible atom count and substantially improves de novo inorganic materials discovery performance, as measured by the S.U.N. metric. The technique's conceptual simplicity and strong empirical results indicate its potential as a new standard for diffusion-based generative modeling in materials science (Okhotin et al., 18 Nov 2025).