Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization (2407.01648v2)

Published 1 Jul 2024 in q-bio.BM, cs.LG, and q-bio.QM

Abstract: Generating ligand molecules for specific protein targets, known as structure-based drug design, is a fundamental problem in therapeutics development and biological discovery. Recently, target-aware generative models, especially diffusion models, have shown great promise in modeling protein-ligand interactions and generating candidate drugs. However, existing models primarily focus on learning the chemical distribution of all drug candidates, which lacks effective steerability on the chemical quality of model generations. In this paper, we propose a novel and general alignment framework to align pretrained target diffusion models with preferred functional properties, named AliDiff. AliDiff shifts the target-conditioned chemical distribution towards regions with higher binding affinity and structural rationality, specified by user-defined reward functions, via the preference optimization approach. To avoid the overfitting problem in common preference optimization objectives, we further develop an improved Exact Energy Preference Optimization method to yield an exact and efficient alignment of the diffusion models, and provide the closed-form expression for the converged distribution. Empirical studies on the CrossDocked2020 benchmark show that AliDiff can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score, while maintaining strong molecular properties. Code is available at https://github.com/MinkaiXu/AliDiff.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Siyi Gu (7 papers)
  2. Minkai Xu (40 papers)
  3. Alexander Powers (3 papers)
  4. Weili Nie (41 papers)
  5. Tomas Geffner (19 papers)
  6. Karsten Kreis (50 papers)
  7. Jure Leskovec (233 papers)
  8. Arash Vahdat (69 papers)
  9. Stefano Ermon (279 papers)
Citations (4)
X Twitter Logo Streamline Icon: https://streamlinehq.com