
Exploring the Protein Sequence Space with Global Generative Models (2305.01941v1)

Published 3 May 2023 in q-bio.BM and cs.LG

Abstract: Recent advances in specialized large-scale architectures for image and language modeling have profoundly impacted the fields of computer vision and natural language processing (NLP). Large language models (LLMs), such as the recent ChatGPT and GPT-4, have demonstrated exceptional capabilities in processing, translating, and generating human languages. These breakthroughs have also been reflected in protein research, leading to the rapid development of numerous new methods with unprecedented performance. LLMs in particular have seen widespread use in protein research, where they have been used to embed proteins, generate novel ones, and predict tertiary structures. In this book chapter, we provide an overview of protein generative models, reviewing 1) LLMs for the design of novel artificial proteins, 2) works that use non-Transformer architectures, and 3) applications in directed evolution approaches.

Authors (3)
  1. Sergio Romero-Romero (1 paper)
  2. Sebastian Lindner (2 papers)
  3. Noelia Ferruz (4 papers)
Citations (3)