Adaptive atomic basis sets (2404.16942v1)
Abstract: Atomic basis sets are widely employed within quantum mechanics based simulations of matter. We introduce a machine learning model that adapts the basis set to the local chemical environment of each atom, prior to the start of self consistent field (SCF) calculations. In particular, as a proof of principle and because of their historic popularity, we have studied the Gaussian type orbitals from the Pople basis set, i.e. the STO-3G, 3-21G, 6-31G and 6-31G*. We adapt the basis by scaling the variance of the radial Gaussian functions leading to contraction or expansion of the atomic orbitals.A data set of optimal scaling factors for C, H, O, N and F were obtained by variational minimization of the Hartree-Fock (HF) energy of the smallest 2500 organic molecules from the QM9 database. Kernel ridge regression based ML prediction errors of the change in scaling decay rapidly with training set size, typically reaching less than 1 % for training set size 2000. Overall, we find systematically lower variance, and consequently the larger training efficiencies, when going from hydrogen to carbon to nitrogen to oxygen. Using the scaled basis functions obtained from the ML model, we conducted HF calculations for the subsequent 30'000 molecules in QM9. In comparison to the corresponding default Pople basis set results we observed improved energetics in up to 99 % of all cases. With respect to the larger basis set 6-311G(2df,2pd), atomization energy errors are lowered on average by ~31, 107, 11, and 11 kcal/mol for STO-3G, 3-21G, 6-31G and 6-31G*, respectively -- with negligible computational overhead. We illustrate the high transferability of adaptive basis sets for larger out-of-domain molecules relevant to addiction, diabetes, pain, aging.
- F. Jensen, WIREs Computational Molecular Science 3, 273 (2013), https://wires.onlinelibrary.wiley.com/doi/pdf/10.1002/wcms.1123 .
- J. G. Hill, International Journal of Quantum Chemistry 113, 21–34 (2012).
- E. R. Davidson and D. Feller, Chemical Reviews 86, 681 (1986), https://doi.org/10.1021/cr00074a002 .
- J. Dunning, Thom H., The Journal of Chemical Physics 90, 1007 (1989), https://pubs.aip.org/aip/jcp/article-pdf/90/2/1007/15358102/1007_1_online.pdf .
- A. Szabo and N. S. Ostlund, Modern Quantum Chemistry: Introduction to Advanced Electronic Structure Theory, 1st ed. (Dover Publications, Inc., Mineola, 1996).
- D. Khan, A. J. A. Price, M. L. Ach, and O. A. von Lilienfeld, “Adaptive hybrid density functionals,” (2024a), arXiv:2402.14793 [physics.chem-ph] .
- J. Behler, The Journal of Chemical Physics 134, 074106 (2011), https://doi.org/10.1063/1.3553717 .
- D. C. Liu and J. Nocedal, Mathematical Programming 45, 503 (1989).
- V. Vapnik, The nature of statistical learning theory (Springer science & business media, 1999).
- C. E. Rasmussen et al., Gaussian processes for machine learning, Vol. 1 (Springer).
- B. Huang and O. A. von Lilienfeld, Nature Chemistry 12, 945 (2020).
- F. Neese, WIREs Computational Molecular Science 12, e1606 (2022).
- A. L. Hickey and C. N. Rowley, The Journal of Physical Chemistry A 118, 3678 (2014).
- D. Rappoport and F. Furche, The Journal of Chemical Physics 133, 134105 (2010), https://pubs.aip.org/aip/jcp/article-pdf/doi/10.1063/1.3484283/15431477/134105_1_online.pdf .
- D. Khan, A. J. A. Price, M. L. Ach, O. Trottier, and O. A. von Lilienfeld, “Adaptive hybrid density functionals,” (2024b), arXiv:2402.14793 [physics.chem-ph] .
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.