Predicted Local Distance Difference Test (pLDDT)

Updated 7 April 2026

pLDDT is a per-residue confidence metric that quantifies local geometric agreement between predicted and experimental protein structures, scaled from 0 to 100.
Modern deep learning models like AlphaFold2 integrate pLDDT to filter and interpret predicted structures, enhancing reliability without alignment to experimental data.
High-speed implementations such as pLDDT-Predictor enable rapid proteome-wide screening, significantly reducing computational time in protein engineering.

The Predicted Local Distance Difference Test (pLDDT) is a per-residue confidence metric widely used in protein structure prediction. Derived from the Local Distance Difference Test (LDDT), pLDDT provides an intrinsic estimate of how well predicted atomic arrangements recapitulate true inter-residue geometries, specifically without requiring alignment or superposition to experimental references. Modern deep learning-based structure predictors, such as AlphaFold2 and ABodyBuilder3, incorporate pLDDT as a key model output, allowing users to interpret and filter predicted structures based on local and global reliability.

1. Definition and Mathematical Basis

LDDT is a superposition-free measure comparing predicted and reference local geometries by evaluating the agreement of inter-atomic (typically C $\alpha$ –C $\alpha$ ) distances. For a protein of length $L$ , and for each residue $i$ , a set of contact residues is defined:

$C_i = \left\{\, j \neq i \mid d_{ij}^{\mathrm{true}} < d_\mathrm{cut} \right\}$

where $d_{ij}^{\mathrm{true}}$ is the experimental (reference) C $\alpha$ –C $\alpha$ distance, and $d_\mathrm{cut}$ is typically 15 Å. The per-residue LDDT is:

$\mathrm{LDDT}_i = \frac{1}{|C_i|}\sum_{j\in C_i} \frac{1}{4} \sum_{t \in \{0.5,1.0,2.0,4.0\}} \mathbf{1}\left( |d_{ij}^{\mathrm{pred}} - d_{ij}^{\mathrm{true}} | < t \right)$

where $\alpha$ 0 is the indicator function. The global LDDT is averaged over all residues:

$\alpha$ 1

This score is scaled from 0 to 1, or to 0–100 in most implementations.

pLDDT is the expected value of the local LDDT estimated per residue by the neural network:

$\alpha$ 2

It is reported on a 0–100 scale, and reflects the network’s intrinsic confidence in the local structure irrespective of any post hoc benchmarking or reference data (Chae et al., 2024, Kenlay et al., 2024).

2. Model Implementations and Training Schemes

AlphaFold2 and ABodyBuilder3

AlphaFold2 and ABodyBuilder3 implement pLDDT as an additional model head operating on per-residue feature representations. In AlphaFold2, a softmax-based distance-error head approximates the expected per-residue LDDT, whereas ABodyBuilder3 employs a two-layer multilayer perceptron (MLP) with softmax activation, yielding a categorical distribution over 50 discrete bins covering the [0, 100] LDDT range:

$\alpha$ 3

where $\alpha$ 4 are the bin centers. The target, true per-residue LDDT, is discretized into bins and cross-entropy loss is applied. The resulting scalar prediction per residue is used as the pLDDT score, with the process unaffected by subsequent physics-based relaxation stages (Kenlay et al., 2024).

pLDDT-Predictor

pLDDT-Predictor introduces a distinct architecture leveraging pre-trained ESM2 LLM embeddings. The workflow consists of the following components:

Embedding Layer: ESM2-t6-8M-UR50D model produces 320-dimensional per-residue embeddings.
Transformer Encoder: Six-layer encoder (8 attention heads, $\alpha$ 5), processes embeddings.
Regression Head: Two fully connected layers transform per-residue representations into scalar predictions.
Aggregation: Global mean pooling converts the residue-level pLDDT to a single normalized value per protein, rescaled to [0,100] at inference.

Training uses the Huber (Smooth L1) loss:

$\alpha$ 6

The dataset comprises 1.5 million sequences from AlphaFold DB, split 80/10/10 for training, validation, and test, with all pLDDT values normalized to [0,1] for model stability (Chae et al., 2024).

3. Calibration, Benchmarking, and Interpretation

pLDDT values provide direct, per-residue assessments of model confidence. Both ABodyBuilder3 and pLDDT-Predictor validate pLDDT correlation against backbone RMSD. For instance, in ABodyBuilder3, the region-wise Pearson correlation (CDRH3 region) between pLDDT and RMSD is $\alpha$ 7 when using LM embeddings, higher than legacy ensemble-based approaches (see Table 1 below) (Kenlay et al., 2024).

Model	CDRH1	CDRH2	CDRH3	Fw-H	CDRL1	CDRL2	CDRL3	Fw-L
ABodyBuilder2 (ens.)	0.41	0.38	0.57	0.50	0.47	0.48	0.72	0.40
ABodyBuilder3	0.58	0.26	0.61	0.48	0.60	0.20	0.68	0.67
ABodyBuilder3-LM	0.69	0.36	0.73	0.39	0.72	0.52	0.68	0.58

Interpreting pLDDT ranges:

pLDDT ≥ 90: very high confidence, correlates with RMSD < 1.5 Å.
85 ≤ pLDDT < 90: high confidence, most RMSD < 2 Å.
70 ≤ pLDDT < 85: medium confidence, may require refinement.
pLDDT < 70: low confidence, likely inaccurate (Kenlay et al., 2024).

In pLDDT-Predictor, prediction accuracy for high-confidence structures (pLDDT > 70) reaches 91.2% with a mean squared error of 84.81 and mean absolute error of 5.85 on test data. Held-out set results report Pearson $\alpha$ 8 with reference AlphaFold2 pLDDT. Accuracy in high pLDDT bins is notably robust for sequence lengths below 1000 residues (Chae et al., 2024).

4. Computational Efficiency and Scalability

The computational cost of traditional structure-based pLDDT predictions is dominated by all-atom modeling (e.g., AlphaFold2: ~30 minutes/protein on RTX 4090). In contrast, pLDDT-Predictor achieves millisecond-per-protein inference (mean 0.007 s/protein), providing a 250,000× speedup over AlphaFold2 and making routine, genome-scale screening feasible on commodity hardware (Chae et al., 2024).

Such efficiency enables large-scale, sequence-level triage in protein engineering and design, including:

Pre-filtering millions of generated sequences before expensive structural modeling.
Metagenomic or proteome-wide structure confidence annotation.
Real-time feedback for LLM-driven protein design workflows.

5. Limitations and Caveats

Key limitations of pLDDT-based approaches include:

Scaling with Sequence Length: Transformer models exhibit quadratic complexity in sequence length, leading to accuracy degradation above 1000 residues (Chae et al., 2024).
Label Bias: Predictors trained on AlphaFold2 labels may inherit any systematic errors or biases present in AF2’s pLDDT calibration.
Interpretability: Per-residue scalar output is challenging to interpret mechanistically, especially in black-box models.
Coverage: pLDDT reports local confidence; it does not assess global topology, potential packing defects, or ligand/solvent interactions (Chae et al., 2024, Kenlay et al., 2024).

6. Future Directions and Extensions

Several directions are identified for advancing pLDDT-based confidence estimation:

Structural Priors: Augmenting predictive models with explicit geometric or physical constraints to improve error localization.
Transformer Advances: Adoption of linear-time or sparse attention mechanisms to mitigate sequence-length bottlenecks.
Model Compression: Distillation techniques targeting CPU/embedded deployment for decentralized screening.
Metric Generalization: Extending regression heads to additional metrics, such as Predicted Aligned Error (PAE), to provide complementary global uncertainty estimates (Chae et al., 2024).
Antibody-Specific Calibration: In antibodies, region-level averaging (e.g., CDRH3) provides actionable confidence boundaries for engineering campaigns, with established heuristics linking pLDDT to expected RMSD for CDR loops (Kenlay et al., 2024).

7. Workflow Summary and Practical Guidance

For model developers and practitioners, both pLDDT-Predictor and ABodyBuilder3 demonstrate workflows coupling per-residue embeddings, feed-forward architectures, and tailored loss functions to deliver actionable confidence scores. The single-model, single-pass nature obviates the need for expensive ensembles or multi-stage calibration.

In antibody modeling, region-level pLDDT enables targeted filtering of predicted structures, efficient uncertainty annotation, and practical integration with downstream design or experimental validation pipelines (Kenlay et al., 2024). For large-scale protein screening, high-speed predictors facilitate tractable confidence estimation at the scale of entire metagenomes or combinatorial protein libraries.

The pLDDT metric, through continued refinement and scalable modeling, constitutes a cornerstone of modern structural bioinformatics, underpinning both automated quality control and rational experimental planning.

Markdown Report Issue Upgrade to Chat

References (2)

pLDDT-Predictor: High-speed Protein Screening Using Transformer and ESM2 (2024)

ABodyBuilder3: Improved and scalable antibody structure predictions (2024)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Predicted Local Distance Difference Test (pLDDT).