Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Hierarchical Structure Enhances the Convergence and Generalizability of Linear Molecular Representation (2402.02164v4)

Published 3 Feb 2024 in cs.AI and q-bio.BM

Abstract: LLMs demonstrate fundamental abilities in syntax, semantics, and reasoning, though their performance often depends significantly on the inputs they process. This study introduces TSIS (Simplified TSID) and its variants:TSISD (TSIS with Depth-First Search), TSISO (TSIS in Order), and TSISR (TSIS in Random), as integral components of the t-SMILES framework. These additions complete the framework's design, providing diverse approaches to molecular representation. Through comprehensive analysis and experiments employing deep generative models, including GPT, diffusion models, and reinforcement learning, the findings reveal that the hierarchical structure of t-SMILES is more straightforward to parse than initially anticipated. Furthermore, t-SMILES consistently outperforms other linear representations such as SMILES, SELFIES, and SAFE, demonstrating superior convergence speed and enhanced generalization capabilities.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Juan-Ni Wu (2 papers)
  2. Tong Wang (144 papers)
  3. Li-Juan Tang (2 papers)
  4. Hai-Long Wu (2 papers)
  5. Ru-Qin Yu (2 papers)

Summary

We haven't generated a summary for this paper yet.