Directional Message Passing for Molecular Graphs (2003.03123v2)

Published 6 Mar 2020 in cs.LG, physics.comp-ph, and stat.ML

Abstract: Graph neural networks have recently achieved great successes in predicting quantum mechanical properties of molecules. These models represent a molecule as a graph using only the distance between atoms (nodes). They do not, however, consider the spatial direction from one atom to another, despite directional information playing a central role in empirical potentials for molecules, e.g. in angular potentials. To alleviate this limitation we propose directional message passing, in which we embed the messages passed between atoms instead of the atoms themselves. Each message is associated with a direction in coordinate space. These directional message embeddings are rotationally equivariant since the associated directions rotate with the molecule. We propose a message passing scheme analogous to belief propagation, which uses the directional information by transforming messages based on the angle between them. Additionally, we use spherical Bessel functions and spherical harmonics to construct theoretically well-founded, orthogonal representations that achieve better performance than the currently prevalent Gaussian radial basis representations while using fewer than 1/4 of the parameters. We leverage these innovations to construct the directional message passing neural network (DimeNet). DimeNet outperforms previous GNNs on average by 76% on MD17 and by 31% on QM9. Our implementation is available online.

Authors (3)

Johannes Gasteiger (18 papers)
Janek Groß (6 papers)
Stephan Günnemann (169 papers)

Citations (787)

View on Semantic Scholar

Summary

Directional Message Passing for Molecular Graphs

In the paper "Directional Message Passing for Molecular Graphs," Gasteiger, Groß, and Günnemann propose an innovative approach to enhancing the performance of Graph Neural Networks (GNNs) for the prediction of quantum mechanical properties of molecules. The authors introduce the concept of directional message passing, a novel embedding technique that incorporates spatial directionality into message embeddings, significantly improving the predictive power and efficiency of GNNs. This approach culminates in the creation of the Directional Message Passing Neural Network (DimeNet), which sets new performance benchmarks on popular datasets such as QM9 and MD17.

Key Innovations

Directional Message Passing

Traditional GNNs utilize the pairwise distances between atoms to update node embeddings via message passing, fundamentally limiting their ability to capture directional dependencies that play a critical role in molecular interactions. The authors propose directional message passing to address this limitation. Each message between nodes (atoms) is embedded with directional information corresponding to the spatial orientation between the nodes. This results in message embeddings that are rotationally equivariant, preserving relative directional information upon rotation of the molecule, a crucial property for accurate molecular modeling.

Belief Propagation Scheme

Extending the traditional belief propagation concept, the authors implement a message passing scheme wherein message embeddings interact based on both the distances and angles between pairs of atoms. This method leverages directional embeddings, which are updated through interactions that incorporate relative directions, akin to angular potentials used in classical empirical models. By doing so, this approach directly models the energy contributions from bond angles and torsional rotations, which are otherwise challenging to capture with non-directional embeddings.

Orthogonal Basis Representations

The paper introduces spherical Bessel functions and spherical harmonics to construct orthogonal basis representations of interatomic distances and angles, outperforming the commonly used Gaussian radial basis functions. This choice not only enhances performance but also reduces the number of required parameters by a factor of four. The orthogonality and physical grounding of these basis functions provide a more efficient and theoretically sound representation that ensures the stability and smoothness of predictions.

Numerical Results and Implications

DimeNet's architecture substantially improves prediction accuracy on both the QM9 and MD17 datasets. On the QM9 dataset, which includes a variety of quantum mechanical properties, DimeNet achieves a mean standardized MAE reduction of 31% over previous state-of-the-art models. Specifically, it sets new performance records on 11 out of 12 targets, with notable improvements in properties such as the highest occupied molecular orbital energy ( $\epsilon_\text{HOMO}$ ), the lowest unoccupied molecular orbital energy ( $\epsilon_\text{LUMO}$ ), and the dipole moment ( $\mu$ ).

On MD17, DimeNet demonstrates superior performance in predicting both energy and atomic forces using just 1000 training samples. This result is particularly impressive given the dataset's challenging nature, where accurate force predictions are critical for realistic molecular dynamics simulations. DimeNet's force predictions approach the accuracy of sGDML, a highly specialized model, while maintaining general applicability and scalability.

Practical and Theoretical Implications

The introduction of directional message passing and orthogonal basis representations has several profound implications:

Enhanced Modeling Capability: By directly incorporating angular and torsional potentials, DimeNet provides a more accurate and physically meaningful representation of complex molecular interactions.
Computational Efficiency: The reduced parameter space due to orthogonal basis representations allows for more efficient training and inference.
Versatility and Scalability: DimeNet's design is broadly applicable across different types of molecular systems and is suitable for large-scale datasets, which is particularly advantageous for real-world applications in drug discovery and material science.

Future Developments

The framework established by this paper opens up several avenues for further research. One promising direction involves incorporating long-range interactions to model non-bonded forces such as van der Waals and electrostatic interactions more explicitly. Another potential area of development is extending directional message passing to more complex molecular systems, including larger biomolecules and materials, by further refining the basis representations and interaction schemes to handle higher-order interactions more effectively.

In conclusion, this paper presents significant advancements in GNN-based molecular modeling by integrating directional information into message embeddings. DimeNet sets a new standard in the field, combining theoretical rigor with practical efficiency and laying the groundwork for future innovations in molecular graph neural networks.

PDF Markdown

Related Papers

Find Related Papers

YouTube

Show All Videos