This paper (Yu et al., 11 Jun 2025) introduces QHNetV2, a novel neural network designed for efficient and accurate prediction of Kohn-Sham Hamiltonian matrices in quantum systems. This task is crucial for accelerating electronic structure calculations in physics, chemistry, and materials science, which are computationally expensive with traditional methods such as Density Functional Theory (DFT).
The core challenge in predicting Hamiltonian matrices with neural networks lies in maintaining SO(3) equivariance: the prediction should transform predictably when the molecular structure undergoes a 3D rotation. Existing state-of-the-art equivariant networks often rely on SO(3) tensor products, whose computational cost scales as O(L^6) with the maximum angular momentum degree L required to represent the atomic orbitals (e.g., Hamiltonian blocks involving d orbitals require degrees up to L = 4).
QHNetV2 tackles this by proposing a framework that achieves global SO(3) equivariance without using costly SO(3) tensor products. The key idea is to perform feature updates and message passing within local SO(2) frames, leveraging the observation that off-diagonal blocks of the Hamiltonian matrix are inherently related to SO(2) local frames.
Key Technical Contributions and Implementation:
- SO(2) Local Frames: The network operates in both a global SO(3) coordinate system and local SO(2) frames. SO(3) irreducible representations (irreps) are transformed into local SO(2) irreps through a canonicalization procedure based on a reference vector (e.g., the vector between two interacting atoms for off-diagonal features, or the vector to the closest neighbor for node features); a minimal canonicalization sketch follows this list. Applying SO(2)-equivariant operations within the local frame and transforming the results back to the global SO(3) frame preserves overall SO(3) equivariance. This approach is linked to the concept of minimal frame averaging.
- Efficient SO(2) Equivariant Operations: The paper introduces and utilizes several SO(2)-equivariant building blocks:
- SO(2) Linear: A linear transformation on SO(2) irreps, formulated analogously to a complex-valued linear layer; it replaces the costly SO(3) tensor products used in message passing (a minimal sketch appears after this list).
- SO(2) Gate: A gating mechanism similar to those used in SO(3) networks, where scalar features modulate higher-order SO(2) features.
- SO(2) Layer Normalization: A normalization method applied to the norm of SO(2) irreps for training stability.
- SO(2) Tensor Product (TP): An operation that fuses SO(2) irreps following the fusion rule m1 ⊗ m2 → (m1 + m2), |m1 − m2|, at a cost that grows far more slowly with the maximum degree than the SO(3) TP, for both pairwise and many-body interactions (a toy illustration appears after this list). A continuous SO(2) TP module, inspired by MACE's symmetric contraction, is used for node feature updating to capture many-body interactions efficiently within the local SO(2) frame.
- Model Architecture (QHNetV2):
- The network uses a message-passing graph neural network architecture.
- Node and node pair embeddings are initialized.
- Node-wise Interaction: Messages between nodes are computed by transforming features to the pairwise SO(2) local frame, applying SO(2) Linear and Gate operations, and combining with radial basis functions and inner products of node features. The messages are then transformed back to the global frame and aggregated.
- Node Feature Updating: Aggregated node features are transformed into a node-centric SO(2) local frame (defined by the direction to the closest neighbor). Continuous SO(2) TP and SO(2) Linear layers are applied before transforming back to the global frame. Skip connections and Equivariant Layer Normalization are used.
- Off-diagonal Feature Updating: Pairwise features are updated directly in the pairwise SO(2) local frame using SO(2) Feed-Forward Networks (FFNs) composed of SO(2) Linear and Gate operations. Skip connections and SO(2) Layer Normalization are applied.
- Matrix Construction: The final predicted Hamiltonian matrix blocks are constructed from the updated node features (for diagonal blocks) and pairwise features (for off-diagonal blocks) using an expansion module that maps irreps to orbital pairs.
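To make the local-frame idea concrete, the following is a minimal sketch (not the authors' implementation) of how per-edge SO(3) irrep features could be rotated into a local frame defined by the edge direction, using e3nn's rotation utilities. The helper name `to_local_frame`, the feature layout, and fixing the residual rotation angle to zero are illustrative assumptions.

```python
import torch
from e3nn import o3

def to_local_frame(features: torch.Tensor, edge_vec: torch.Tensor,
                   irreps: o3.Irreps) -> torch.Tensor:
    """Rotate per-edge SO(3) irrep features into a local frame aligned
    with the edge direction (illustrative sketch, not the paper's code).

    features: [num_edges, irreps.dim], edge_vec: [num_edges, 3]
    """
    edge_dir = torch.nn.functional.normalize(edge_vec, dim=-1)
    # Euler angles (alpha, beta) that rotate e3nn's canonical axis onto edge_dir;
    # the remaining rotation about the edge axis (gamma) is fixed to zero here,
    # which is the canonicalization step: only an SO(2) degree of freedom remains.
    alpha, beta = o3.xyz_to_angles(edge_dir)
    gamma = torch.zeros_like(alpha)
    # Block-diagonal Wigner-D matrices for the full irrep layout, one per edge.
    D = irreps.D_from_angles(alpha, beta, gamma)
    # Apply the inverse rotation (Wigner-D matrices are orthogonal).
    return torch.einsum('eij,ej->ei', D.transpose(-1, -2), features)

# Example usage with hypothetical shapes:
# irreps = o3.Irreps("8x0e + 8x1o + 8x2e")
# x_local = to_local_frame(x_edge, pos[dst] - pos[src], irreps)
```

Transforming back to the global frame applies D instead of its transpose; global SO(3) equivariance follows because a rotation of the molecule changes the features and the local frames consistently.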
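The SO(2) Linear can be pictured as a complex-valued channel mixing acting on each (+m, −m) component pair. Below is a minimal single-order sketch under that interpretation; the class name and feature layout are assumptions, not the paper's API.

```python
import torch
import torch.nn as nn

class SO2LinearM(nn.Module):
    """Illustrative SO(2)-equivariant linear layer for a single order m > 0.

    The (+m, -m) components form a 2D vector that rotates by the angle m*phi
    under an SO(2) rotation of the local frame; mixing channels with a shared
    'complex' weight (w_r + i*w_i) commutes with that rotation.
    """
    def __init__(self, c_in: int, c_out: int):
        super().__init__()
        self.w_r = nn.Linear(c_in, c_out, bias=False)  # real part of the weight
        self.w_i = nn.Linear(c_in, c_out, bias=False)  # imaginary part of the weight

    def forward(self, x_pos: torch.Tensor, x_neg: torch.Tensor):
        # x_pos, x_neg: [..., c_in] channels of the +m and -m components.
        y_pos = self.w_r(x_pos) - self.w_i(x_neg)
        y_neg = self.w_i(x_pos) + self.w_r(x_neg)
        return y_pos, y_neg
```

For m = 0 the layer reduces to an ordinary linear map, and each order is processed independently, so the cost grows only with the number of orders rather than with the combinatorial number of SO(3) Clebsch-Gordan paths.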
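Similarly, the SO(2) tensor product can be sketched as component-wise complex multiplication that fuses orders m1 and m2 into m1 + m2 and |m1 − m2|. The function below is a toy illustration of that fusion rule for a single pair of orders; all names are hypothetical.

```python
import torch

def so2_tensor_product(a_pos, a_neg, b_pos, b_neg):
    """Fuse two SO(2) irreps of orders m1 and m2 (toy sketch).

    Treat each irrep as a complex number z = x_pos + i * x_neg.  The products
    z_a * z_b and z_a * conj(z_b) carry angular orders m1 + m2 and m1 - m2,
    respectively, realizing the fusion rule m1 (x) m2 -> (m1 + m2), |m1 - m2|.
    """
    # order (m1 + m2) component
    sum_pos = a_pos * b_pos - a_neg * b_neg
    sum_neg = a_pos * b_neg + a_neg * b_pos
    # order |m1 - m2| component
    diff_pos = a_pos * b_pos + a_neg * b_neg
    diff_neg = a_neg * b_pos - a_pos * b_neg
    return (sum_pos, sum_neg), (diff_pos, diff_neg)
```

Repeated contractions of this kind, interleaved with learned channel mixing, give the flavor of the continuous SO(2) TP used for many-body node updates, at a cost that stays far below that of repeated SO(3) Clebsch-Gordan products.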
Practical Implementation Aspects and Considerations:
- Software Stack: The implementation is based on PyTorch, PyTorch Geometric, and e3nn, common libraries for geometric deep learning. This suggests the model can be integrated into existing workflows using these tools.
- Computational Efficiency: The primary benefit is avoiding SO(3) TPs. The SO(2) operations, particularly the SO(2) Linear, offer significant speedups. Experiments show QHNetV2 runs up to 4.34x faster than previous SO(3)-based models such as QHNet on the QH9 dataset, with comparable or slightly higher memory usage than a sparsity-based method like SPHNet.
- Scalability: The reduced complexity of the SO(2) operations makes the model potentially more scalable to systems requiring a high maximum degree L, which is crucial for larger molecules and more complex electronic structures.
- Datasets: The model is evaluated on the QH9 benchmark (with splits testing random, out-of-distribution size, dynamic-geometry, and dynamic-molecule generalization) and on MD17 trajectories. The results indicate strong performance, especially on the QH9 datasets, with improvements in Hamiltonian (H) and orbital-energy (epsilon) MAE over prior methods. Gains on smaller datasets (such as MD17 Water) may be less pronounced, suggesting a need for sufficient training data.
- Hyperparameters: Key hyperparameters include the cutoff distance, the learning-rate schedule, the number of layers, the maximum degree L, and the dimensions of the hidden SO(3) and SO(2) irreps; these control model capacity and computational cost. The choice of L sets the size of the irreps and the cost of the SO(2) TP, which scales with the maximum order of the SO(2) irreps (itself determined by L).
- Ablation Studies: The ablation studies confirm that both the SO(2) TP module for node updates and SO(2) FFNs for off-diagonal updates contribute positively to the model's performance on the Hamiltonian prediction task.
- Deployment: Once trained, the model can predict Hamiltonian matrices for new molecular geometries in a single forward pass, providing significant speedup over iterative DFT calculations. The output is the Hamiltonian matrix itself, which can then be used to solve for eigenvalues (energies) and eigenvectors (wavefunctions) via standard linear algebra methods (e.g., diagonalization).
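As a concrete example of this deployment step, a predicted Hamiltonian H and the corresponding overlap matrix S can be diagonalized with standard routines. The sketch below assumes H and S are available as NumPy arrays in the same atomic-orbital basis; the helper name and the closed-shell occupation example are illustrative.

```python
import numpy as np
from scipy.linalg import eigh

def solve_kohn_sham(H: np.ndarray, S: np.ndarray):
    """Solve the generalized eigenvalue problem H C = S C eps.

    H: predicted Kohn-Sham Hamiltonian in an atomic-orbital basis.
    S: overlap matrix for the same basis (from the quantum-chemistry code).
    Returns orbital energies eps and MO coefficients C (one column per orbital).
    """
    eps, C = eigh(H, S)  # SciPy solves the generalized symmetric problem directly
    return eps, C

# Hypothetical usage: orbital energies and a HOMO-LUMO gap for a closed-shell system.
# eps, C = solve_kohn_sham(H_pred, S)
# n_occ = n_electrons // 2
# gap = eps[n_occ] - eps[n_occ - 1]
```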
Real-World Applications:
Predicting Hamiltonian matrices is a direct step towards replacing or augmenting expensive parts of DFT calculations. Accurate and fast Hamiltonian prediction can enable:
- Accelerated Molecular Dynamics Simulations: Running MD simulations requires computing forces and energies at each step, often derived from electronic structure. Fast Hamiltonian prediction can drastically speed up this process.
- High-Throughput Screening: Quickly calculating electronic properties for large databases of molecules or materials for drug discovery, battery materials design, or catalyst development.
- Solving the Kohn-Sham Equation: The predicted Hamiltonian matrix H enters the generalized eigenvalue problem HC = SCε (with overlap matrix S) to obtain orbital energies and wavefunctions. A machine learning model that predicts H can significantly accelerate finding self-consistent solutions, or potentially bypass the iterative SCF loop entirely if the prediction is sufficiently accurate.
- Materials Property Prediction: Properties derived from electronic structure (like band structure, density of states, polarizability) can be calculated faster once the Hamiltonian is known.
In summary, QHNetV2 offers a practical and efficient approach to learning SO(3)-equivariant representations for Hamiltonian matrix prediction by skillfully utilizing SO(2) operations within local frames, circumventing the computational bottleneck of SO(3) tensor products. This advancement has the potential to make electronic structure calculations more accessible for larger systems and high-throughput applications.