Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (2311.16208v2)

Published 27 Nov 2023 in q-bio.BM, cs.AI, and cs.LG

Abstract: The rapid evolution of artificial intelligence in drug discovery encounters challenges with generalization and extensive training, yet LLMs offer promise in reshaping interactions with complex molecular data. Our novel contribution, InstructMol, a multi-modal LLM, effectively aligns molecular structures with natural language via an instruction-tuning approach, utilizing a two-stage training strategy that adeptly combines limited domain-specific data with molecular and textual information. InstructMol showcases substantial performance improvements in drug discovery-related molecular tasks, surpassing leading LLMs and significantly reducing the gap with specialized models, thereby establishing a robust foundation for a versatile and dependable drug discovery assistant.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. He Cao (18 papers)
  2. Zijing Liu (21 papers)
  3. Xingyu Lu (28 papers)
  4. Yuan Yao (292 papers)
  5. Yu Li (377 papers)
Citations (41)
X Twitter Logo Streamline Icon: https://streamlinehq.com