Protograph-Based LDPC Codes
- Protograph-Based LDPC codes are structured error-correcting codes derived from a small bipartite graph (protograph) that is lifted to form larger code ensembles.
- They achieve near-capacity iterative decoding thresholds through techniques like spatial coupling and convolutional extensions.
- Incorporating quasi-cyclic designs and rate compatibility, these codes offer provable linear minimum distance growth essential for reliable digital communications.
Protograph-based @@@@1@@@@ (Low-Density Parity-Check) codes are a class of structured LDPC codes derived from a small bipartite base graph, known as a protograph. This approach enables the construction of code families with rich ensemble properties—such as rate compatibility, capacity-approaching iterative thresholds, linear growth of minimum Hamming distance, and efficient quasi-cyclic (QC) representations—while retaining manageable analysis and flexible implementation pathways. The protograph framework also accommodates spatial coupling and convolutional code generalizations, and admits powerful algebraic and combinatorial tools for detailed performance guarantees and optimization.
1. Foundations of Protograph-Based LDPC Codes
A protograph is defined by a small bipartite graph (base matrix) , where is the number of variable nodes and the number of check nodes; entries can exceed one, indicating multiple edges between node pairs. The protograph ensemble is generated by a “lifting” procedure: each node and edge in the base graph is replicated times, and the edges are permuted among the node copies. The full-length code has variables and checks, and the code inherits degree distributions, locality, and (when QC lifting is used) automorphism structure from the protograph.
This structured lifting preserves the ensemble properties of degree distributions and permits exact tracking of message types in iterative decoding, which can be leveraged for performance prediction and optimization (Mitchell et al., 2010, Ranganathan et al., 2017, Chen et al., 2014).
2. Convolutional and Terminated Protograph Codes
Spatially-coupled and terminated convolutional variants are obtained by “edge-spreading”: the base matrix is written as a sum of nonnegative matrices, each corresponding to connections at time offset and indicating syndrome former memory. The convolutional base matrix is bi-infinite, formed by diagonally stacking the 's. A terminated (length ) code is constructed by truncating to a banded of size . After lifting, the block length is .
The design rate is
and, as , .
Spatially-coupled (SC) and terminated protograph-based codes exhibit iterative decoding thresholds that approach the MAP (maximum a posteriori) threshold of the underlying uncoupled ensemble, a property known as threshold saturation. Degree profiles smoothly interpolate between perfectly regular (bulk) and lower-degree nodes at boundaries, with the precise local irregularities dependent on the choice of (Mitchell et al., 2010, 0805.0241).
3. Iterative Decoding Thresholds and Capacity Approach
Iterative decoding performance of a protograph code is evaluated through density evolution on the base graph. For the binary erasure channel (BEC) with erasure probability , the protograph structure allows exact per-edge recursion:
- At each iteration , message-erasure probabilities on edges are updated via standard variable-node and check-node rules.
The iterative (belief propagation) threshold is defined as the supremum for which as for all . For terminated spatially-coupled ensembles, (the MAP threshold of the uncoupled ensemble) as . Hence, terminated and SC protograph codes are capacity-approaching under iterative decoding (Mitchell et al., 2010, Pradhan et al., 2013, Chen et al., 2014).
4. Minimum Distance Growth and Ensemble Properties
Protograph-based LDPC codes enable rigorous combinatorial analysis of the minimum Hamming distance. Using average weight enumerators derived from the protograph spectrum, one can show that, for large , typical codes have for some , i.e., the ensemble is asymptotically good.
For quasi-cyclic (QC) sub-ensembles (where all liftings are circulant), explicit upper bounds on are available due to the combinatorial structure of circulant matrices (Ranganathan et al., 2017, Mitchell et al., 2010):
where denotes matrix permanent. This bound can be further increased by taking -fold covers of the base protograph (“arrays of circulants”), enhancing the distance for QC liftings.
Explicit edge-spreading (choice of ) and careful cover selection can dramatically increase —in concrete (3,6)-regular cases, bounds of 30, 56, 176, 210, etc., can be achieved by increasing the complexity of the cover (Mitchell et al., 2010, Ranganathan et al., 2017).
5. Quasi-Cyclic and Rate-Compatible Protograph Codes
QC-LDPC codes result from restricting the lifting permutations to circulants, which provide efficient encoder/decoder implementations and favorable hardware properties. Rate-compatible protograph-based codes, particularly “protograph-based Raptor-like (PBRL) codes,” have an explicit two-part protograph:
- A high-rate “precode” (an LDPC protograph)
- An “LT part” that appends degree-1 variable nodes, increasing redundancy and providing incremental rate steps.
PBRL ensembles enable design across a wide range of rates without sacrificing waterfall performance. Optimization of the base protograph for the minimum-distance upper bound—particularly in the QC sub-ensemble—substantially reduces error floors at short blocklengths, with finite-length codes constructed by QC-lifting and permutation-optimization (e.g., ACE, cPEG) (Ranganathan et al., 2017, Chen et al., 2014).
6. Graph Covers, Local Irregularity, and Advanced Optimizations
Graph covers of the protograph (arrays of circulants) allow the base graph to be “folded” into larger structures, further increasing achievable minimum distance in QC sub-ensembles. Beyond classical regular/irregular designs, local irregularity—where degree variation is introduced within the subgraph induced by a base matrix entry—can be exploited. This reduces decoding thresholds by tuning the local degree distribution without sacrificing the base graph's overall structure (Wüst et al., 23 May 2025).
Finite-length performance is further enhanced by incorporating Markov Chain Monte Carlo (MCMC) algorithms in the design process. The MC method uses Gibbs sampling in the space of lifting or partitioning arrangements to minimize the number of detrimental short cycles and absorbing sets, achieving significant reduction in error floors and construction complexity relative to classical deterministic approaches (Tanrıkulu et al., 22 Apr 2025).
7. Design Workflow and Practical Considerations
A canonical design procedure for protograph-based LDPC codes includes:
- Selection of variable/check degrees and syndrome-former memory
- Edge-spreading to optimize iterative thresholds and minimum distance
- Termination at length to arrive at a block code with rate
- Lifting by , restricting to circulant permutations for QC realization if desired
- If necessary, construction of -fold covers or introduction of local irregularity for further performance gains
The resulting codes combine near-capacity iterative thresholds, provable linear distance growth, and highly efficient QC-friendly encodings. The framework applies uniformly across regular, irregular, and generalized node types, for both block and convolutional settings, and admits extension to joint code-channel designs (e.g., for ISI, BICM) as well as non-binary generalizations (Mitchell et al., 2010, Ranganathan et al., 2017, Wüst et al., 23 May 2025, Tanrıkulu et al., 22 Apr 2025).
References:
- (Mitchell et al., 2010)
- (Ranganathan et al., 2017)
- (Chen et al., 2014)
- (0805.0241)
- (Pradhan et al., 2013)
- (Wüst et al., 23 May 2025)
- (Tanrıkulu et al., 22 Apr 2025)