Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Crystal-GFN: sampling crystals with desirable properties and constraints (2310.04925v2)

Published 7 Oct 2023 in cs.LG

Abstract: Accelerating material discovery holds the potential to greatly help mitigate the climate crisis. Discovering new solid-state materials such as electrocatalysts, super-ionic conductors or photovoltaic materials can have a crucial impact, for instance, in improving the efficiency of renewable energy production and storage. In this paper, we introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials, namely the space group, composition and lattice parameters. This domain-inspired approach enables the flexible incorporation of physical and structural hard constraints, as well as the use of any available predictive model of a desired physicochemical property as an objective function. To design stable materials, one must target the candidates with the lowest formation energy. Here, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench. The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. Free energy calculation of crystalline solids using normalizing flows. Modelling and Simulation in Materials Science and Engineering, 30(6):065007, 2022.
  2. Flow network based generative models for non-iterative diverse candidate generation. In Advances in Neural Information Processing Systems (NeurIPS), volume 34, 2021.
  3. Molgan: An implicit generative model for small molecular graphs. arXiv preprint arXiv: 1805.11973, 2018.
  4. Crystal structure prediction by combining graph network and optimization algorithm. Nature Communications, 13(1), March 2022. doi: 10.1038/s41467-022-29241-4. URL https://doi.org/10.1038/s41467-022-29241-4.
  5. 3-d inorganic crystal structure generation and property prediction via representation learning. Journal of Chemical Information and Modeling, 60(10):4518–4535, 2020.
  6. Benchmarking materials property prediction methods: The matbench test set and automatminer reference algorithm., 2020. URL https://matbench.materialsproject.org/#citing-matbench. Accessed 2023-09-28.
  7. Phast: Physics-aware, scalable, and task-specific gnns for accelerated catalyst design. arXiv preprint arXiv: 2211.12020, 2022. URL https://arxiv.org/abs/2211.12020v3.
  8. Symmetry-adapted generation of 3d point sets for the targeted discovery of molecules. Advances in neural information processing systems, 32, 2019.
  9. Multi-fidelity active learning with gflownets. arXiv preprint arXiv: 2306.11715, 2023. URL https://arxiv.org/abs/2306.11715v1.
  10. Data-driven approach to encoding and decoding 3-d crystal structures. arXiv preprint arXiv:1909.00949, 2019.
  11. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Materials, 1(1):011002, 07 2013. ISSN 2166-532X. doi: 10.1063/1.4812323. URL https://doi.org/10.1063/1.4812323.
  12. Biological sequence design with GFlowNets. In International Conference on Machine Learning (ICML), volume 162. PMLR, 2022.
  13. GFlowNets for AI-driven scientific discovery. Digital Discovery, 2023.
  14. Generative adversarial networks for crystal structure prediction. ACS central science, 6(8):1412–1420, 2020.
  15. A theory of continuous generative flow networks. In International Conference on Machine Learning (ICML), 2023.
  16. Constrained crystals deep convolutional generative adversarial network for the inverse design of crystal structures. npj Computational Materials, 7(1):66, 2021.
  17. Trajectory balance: Improved credit assignment in GFlowNets. In Advances in Neural Information Processing Systems (NeurIPS), volume 35, 2022.
  18. Crystalgan: Learning to discover crystallographic structures with generative adversarial networks. AAAI Spring Symposium Combining Machine Learning with Knowledge Engineering, 2018.
  19. Diffusion probabilistic models enhance variational autoencoder for crystal structure generative modeling. arXiv preprint arXiv:2308.02165, 2023.
  20. Inverse design of crystals using generalized invertible crystallographic representation. arXiv preprint arXiv:2005.07609, 3(6):7, 2020.
  21. An invertible crystallographic representation for general inverse design of inorganic crystals with targeted properties. Matter, 5(1):314–335, 2022.
  22. E(n) equivariant normalizing flows. Neural Information Processing Systems, 2021.
  23. Graphaf: a flow-based autoregressive model for molecular graph generation. arXiv preprint arXiv:2001.09382, 2020.
  24. Dual use of artificial-intelligence-powered drug discovery. Nature Machine Intelligence, 4(3):189–191, 2022.
  25. Crystal diffusion variational autoencoder for periodic material generation. International Conference On Learning Representations, 2021.
  26. Geodiff: A geometric diffusion model for molecular conformation generation. arXiv preprint arXiv:2203.02923, 2022.
  27. High-throughput discovery of novel cubic crystal materials using deep generative neural networks. Advanced Science, 8(20):2100566, 2021.
  28. Towards predicting equilibrium distributions for molecular systems with deep learning. arXiv preprint arXiv:2306.05445, 2023.
Citations (12)

Summary

  • The paper introduces Crystal-GFN, a generative model that sequentially samples space group, composition, and lattice parameters to produce stable inorganic crystals.
  • It employs a GFlowNet framework with Beta distributions to adhere to crystallography constraints, ensuring physical realism in the generated structures.
  • Experimental results reveal 95% of structures achieve formation energies below -2 eV/atom within 12 hours on a CPU, showcasing its efficiency and potential in material discovery.

Analysis of "Crystal-GFN: Sampling Crystals with Desirable Properties and Constraints"

The paper "Crystal-GFN: sampling crystals with desirable properties and constraints" introduces Crystal-GFN, a generative model targeting the challenge of generating inorganic crystal structures that adhere to specific properties and constraints. This research contributes to the broader context of rapidly evolving methods in accelerated material discovery, a domain pivotal to advancements in renewable energy solutions.

The model operates by sequentially sampling space group, composition, and lattice parameters of a unit cell, leveraging domain knowledge from crystallography and materials science. This approach not only facilitates the generation of stable crystal structures but also incorporates domain-specific constraints more effectively than models that attempt to learn these constraints implicitly.

Methodological Approach

  • Space Group and Composition Sampling: The paper details the sampling of space groups, composing the horce of crystallography knowledge to introduce a three-dimensional space encompassing crystal-lattice systems, point symmetry, and spatial arrangement. By delineating the sample space in this structured manner, Crystal-GFN allows the integration of rules from the internals of crystallography, such as Wyckoff positions.
  • Lattice Parameters: The method samples these parameters in a continuous space, employing mixtures of Beta distributions to model their range effectively. Constraints emerge from and are aligned with crystallography principles, enforcing geometrical compatibility and enhancing sampling efficiency.
  • GFlowNet Framework: With the use of GFlowNets, the model efficiently samples from potentially high-dimensional distributions, enabling the pursuit of diverse viable crystal structures, a task usually hampered by conventional MCMC methods due to mode complexity in scientific data and generation spaces.

Numerical Results

Remarkably, the empirical evaluation unearths the model's capacity to discover crystal structures with significantly low predicted formation energy. A training duration of 12 hours on a CPU-only setup yields a median formation energy of -3.1 eV/atom, with 95% of sampled structures achieving a predicted energy lower than -2 eV/atom. This highlights Crystal-GFN's aptitude for navigating the compositionally and structurally diverse search space effectively within resource constraints.

Despite operating on reduced subset domains and using predictive proxy models, the results provide pivotal insights:

  • The generative process results in high diversity across sampled space groups, compositions, and lattice parameters.
  • The approach ensures the stability and physical realism of generated structures through explicit domain constraints.

Implications and Future Directions

Practically, the model establishes a foundational tool for accelerating the design and discovery of solid-state materials central to advancements in energy storage and conversion technologies. Theoretically, the successful application of GFlowNet to sequential sampling in the hybrid crystallography space pushes methodological boundaries, suggesting pathways for robust and scalable crystal generation strategies, even with constraints.

Future research directions include model expansions to generate atomic coordinates directly or simulate conditions under broader element sets. Enhancements could also explore multifaceted reward frameworks targeting specific material properties, broadening the applicability of Crystal-GFN in diverse industrial problem settings.

In conclusion, "Crystal-GFN: sampling crystals with desirable properties and constraints" presents an adept and strategically grounded entry into designing generative models that capitalize on crystallography frameworks, offering substantive capacities for material innovation. Such developments appear strategically aligned with broader scientific objectives of sustainable technological advancement.

Github Logo Streamline Icon: https://streamlinehq.com
Youtube Logo Streamline Icon: https://streamlinehq.com