A Recipe for Charge Density Prediction (2405.19276v1)
Abstract: In density functional theory, charge density is the core attribute of atomic systems from which all chemical properties can be derived. Machine learning methods are promising in significantly accelerating charge density prediction, yet existing approaches either lack accuracy or scalability. We propose a recipe that can achieve both. In particular, we identify three key ingredients: (1) representing the charge density with atomic and virtual orbitals (spherical fields centered at atom/virtual coordinates); (2) using expressive and learnable orbital basis sets (basis function for the spherical fields); and (3) using high-capacity equivariant neural network architecture. Our method achieves state-of-the-art accuracy while being more than an order of magnitude faster than existing methods. Furthermore, our method enables flexible efficiency-accuracy trade-offs by adjusting the model/basis sizes.
- Computing and rendering point set surfaces. IEEE Transactions on visualization and computer graphics, 9(1):3–15, 2003.
- Cormorant: Covariant molecular neural networks. Advances in neural information processing systems, 32, 2019.
- Even-tempered atomic orbitals. vi. optimal orbital exponents and optimal contractions of gaussian primitives for hydrogen, carbon, and oxygen in molecules. The Journal of Chemical Physics, 60(3):918–931, 1974.
- Mace: Higher order equivariant message passing neural networks for fast and accurate force fields. Advances in Neural Information Processing Systems, 35:11423–11436, 2022.
- E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nature communications, 13(1):2453, 2022.
- Lukas Biewald. Experiment tracking with weights and biases, 2020. URL https://www.wandb.com/. Software available from wandb.com.
- EGraFFBench: evaluation of equivariant graph neural network force fields for atomistic simulations. Digital Discovery, 3(4):759–768, 2024.
- Open catalyst 2020 (oc20) dataset and community challenges. Acs Catalysis, 11(10):6059–6072, 2021.
- Equivariant neural operator learning with graphon convolution. In Thirty-seventh Conference on Neural Information Processing Systems, 2023. URL https://openreview.net/forum?id=EjiA3uWpnc.
- Even-tempered slater-type orbitals revisited: From hydrogen to krypton. Journal of computational chemistry, 25(8):1030–1036, 2004.
- Symphony: Symmetry-equivariant point-centered spherical harmonics for molecule generation. arXiv preprint arXiv:2311.16199, 2023.
- A deep learning framework to emulate density functional theory. npj Computational Materials, 9(1):158, 2023.
- A hitchhiker’s guide to geometric GNNs for 3D atomic systems. arXiv preprint arXiv:2312.07511, 2023.
- Auxiliary basis sets to approximate coulomb potentials. Chemical physics letters, 240(4):283–290, 1995.
- Electron density learning of non-covalent systems. Chemical science, 10(41):9424–9432, 2019.
- Linear jacobi-legendre expansion of the charge density for machine learning-accelerated electronic structure calculations. npj Computational Materials, 9(1):87, 2023.
- Voronoi deformation density (VDD) charges: Assessment of the Mulliken, Bader, Hirshfeld, Weinhold, and VDD methods for charge analysis. Journal of computational chemistry, 25(2):189–210, 2004.
- Forces are not enough: Benchmark and critical evaluation for machine learning force fields with molecular simulations. Transactions on Machine Learning Research, 2023. ISSN 2835-8856. URL https://openreview.net/forum?id=A8pqQipwkt. Survey Certification.
- Gemnet: Universal directional graph neural networks for molecules. Advances in Neural Information Processing Systems, 34:6790–6802, 2021.
- e3nn: Euclidean neural networks. arXiv preprint arXiv:2207.09453, 2022.
- Predicting charge density distribution of materials using a local-environment-based graph convolutional network. Physical Review B, 100(18):184103, 2019.
- Array programming with NumPy. Nature, 585(7825):357–362, September 2020. doi: 10.1038/s41586-020-2649-2. URL https://doi.org/10.1038/s41586-020-2649-2.
- Equivariant diffusion for molecule generation in 3D. In International conference on machine learning, pages 8867–8887. PMLR, 2022.
- J. D. Hunter. Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3):90–95, 2007. doi: 10.1109/MCSE.2007.55.
- Plotly Technologies Inc. Collaborative data science, 2015. URL https://plot.ly.
- Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL materials, 1(1), 2013.
- Computational predictions of energy materials using density functional theory. Nature Reviews Materials, 1(1):1–13, 2016.
- Equivariant scalar fields for molecular docking with fast fourier transforms. In The Twelfth International Conference on Learning Representations, 2024. URL https://openreview.net/forum?id=BIveOmD1Nh.
- Equivariant graph neural networks for fast electron density estimation of molecules, liquids, and solids. npj Computational Materials, 8(1):183, 2022.
- Gaussian plane-wave neural operator for electron density estimation. arXiv preprint arXiv:2402.04278, 2024.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Self-consistent equations including exchange and correlation effects. Physical review, 140(4A):A1133, 1965.
- Higher-order equivariant neural networks for charge density prediction in materials. arXiv preprint arXiv:2312.05388, 2023.
- The atomic simulation environment—a Python library for working with atoms. Journal of Physics: Condensed Matter, 29(27):273002, 2017.
- Equiformer: Equivariant graph attention transformer for 3D atomistic graphs. arXiv preprint arXiv:2206.11990, 2022.
- EquiformerV2: Improved equivariant transformer for scaling to higher-degree representations. arXiv preprint arXiv:2306.12059, 2023.
- Introducing DDEC6 atomic population analysis: part 4. efficient parallel computation of net atomic charges, atomic spin moments, bond orders, and more. RSC advances, 8(5):2678–2707, 2018.
- Scalable parallel programming with CUDA: Is CUDA the parallel programming model that application developers have been waiting for? Queue, 6(2):40–53, 2008.
- Python materials genomics (pymatgen): A robust, open-source python library for materials analysis. Computational Materials Science, 68:314–319, 2013.
- Reducing SO(3) convolutions to SO(2) for efficient equivariant GNNs. In International Conference on Machine Learning, pages 27420–27438. PMLR, 2023.
- Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32, 2019.
- Towards combinatorial generalization for catalysts: A kohn-sham charge-density approach. Advances in Neural Information Processing Systems, 36, 2023.
- New basis set exchange: An open, up-to-date resource for the molecular sciences community. Journal of chemical information and modeling, 59(11):4814–4820, 2019.
- Nearsightedness of electronic matter. Proceedings of the National Academy of Sciences, 102(33):11635–11638, 2005.
- Informing geometric deep learning with electronic interactions to accelerate quantum chemistry. Proceedings of the National Academy of Sciences, 119(31):e2205221119, 2022.
- A recipe for cracking the quantum scaling limit with machine learned electron densities. Machine Learning: Science and Technology, 4(1):015027, 2023.
- Quantum chemistry structures and properties of 134 kilo molecules. Scientific data, 1(1):1–7, 2014.
- Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. Journal of chemical information and modeling, 52(11):2864–2875, 2012.
- Basis set exchange: a community database for computational sciences. Journal of chemical information and modeling, 47(3):1045–1052, 2007.
- Equivariant message passing for the prediction of tensorial properties and molecular spectra. In International Conference on Machine Learning, pages 9377–9388. PMLR, 2021.
- A representation-independent electronic charge density database for crystalline materials. Scientific data, 9(1):661, 2022.
- Topological graph-based analysis of solid-state ion migration. npj Computational Materials, 9(1):99, 2023.
- John C Slater. Atomic shielding constants. Physical review, 36(1):57, 1930.
- Recent developments in the PySCF program package. The Journal of chemical physics, 153(2), 2020.
- Chemical properties from graph neural network-predicted electron densities. The Journal of Physical Chemistry C, 127(48):23459–23466, 2023.
- Tensor field networks: Rotation-and translation-equivariant neural networks for 3D point clouds. arXiv preprint arXiv:1802.08219, 2018.
- Machine learning force fields. Chemical Reviews, 121(16):10142–10186, 2021.
- Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy. Phys. Chem. Chem. Phys., 7:3297, 2005. doi: 10.1039/b508541a.
- 3D steerable CNNs: Learning rotationally equivariant features in volumetric data. Advances in Neural Information Processing Systems, 31, 2018.
- Geodiff: A geometric diffusion model for molecular conformation generation. arXiv preprint arXiv:2203.02923, 2022.
- Assessing the design rules of electrides. Journal of Materials Chemistry C, 2024.
- Local structure order parameters and site fingerprints for quantification of coordination environment and crystal structure similarity. RSC advances, 10(10):6063–6081, 2020.
- Spherical channels for modeling atomic interactions. Advances in Neural Information Processing Systems, 35:8054–8067, 2022.