Existence and form of a protein language grammar for pLMs
Determine whether protein sequences obey general biological rules analogous to a "protein language grammar" and, if so, characterize the form that grammar takes. In addition, identify which combinations of explainable artificial intelligence (XAI) methods and information sources (training sequences, input prompts, model components, and output sequence perturbations) are needed to extract these rules from decoder-only Transformer-based protein language models (pLMs) used for protein design.
References
However, because it is unclear if and in what form a protein language grammar exists, the combination of method and information category to enable this role remains unclear.
— Toward the Explainability of Protein Language Models for Sequence Design
(2506.19532 - Hunklinger et al., 24 Jun 2025) in Potential roles for XAI methods in protein design, Teacher role paragraph