SC-PCC Decoder: Efficient Iterative Threshold Decoding
- PCCC decoders are low-complexity iterative threshold decoders for turbo-like code constructions that use convolutional self-orthogonal component codes (CSOCs) for robust error correction.
- Spatial coupling divides information blocks into staggered sub-blocks, enabling sliding-window iterative decoding to attain near-MAP threshold performance with reduced latency.
- The SC-PCC architecture achieves significant coding gains and hardware efficiency by combining APP threshold decoding with scalable linear complexity and parallel processing.
A Parallel-Concatenated Code (PCC) decoder, and specifically its spatially coupled variant (SC-PCC), constitutes a class of low-complexity iterative threshold decoders designed for turbo-like code constructions. When equipped with convolutional self-orthogonal component codes (CSOCs) and suboptimal a posteriori probability (APP) threshold decoding, the SC-PCC decoder approaches the maximum a posteriori (MAP) performance threshold and achieves significant coding gains with reduced power, latency, and implementation complexity relative to optimal algorithms (Cummins et al., 2021).
1. Code Structure and Spatial Coupling
SC-PCCs are constructed using parallel concatenation of two CSOCs, each encoding a distinct permutation of the input. Each CSOC is a rate- systematic convolutional code defined by constraint length and memory . Self-orthogonality ensures that for each information error bit , there are parity-check equations , with no other error bit involved in more than one check. This structure guarantees a threshold decoding error-correction capability of .
Example parameterization for a rate-$2/3$ CSOC:
- , , 0
- Generator sequences (octal):
- 1
- 2
- 3, code rate 4
Spatial coupling is realized by dividing each information block 5 (length 6) into 7 sub-blocks, forming a “coupled source matrix” 8 such that each column is constructed from staggered sub-blocks across time and coupling depth. The matrix is zero-padded at boundaries, facilitating coupling across 9 blocks. Parallel interleaving and demultiplexing generate a permuted matrix 0.
At encoder output, for each time 1, both 2 and 3 are encoded by their respective CSOCs (each terminated by 4 zeros), resulting in parity outputs 5 and 6 concatenated with the systematic output 7: 8 The overall SC-PCC code rate is 9 for 0.
2. APP Threshold Decoding of CSOCs
Threshold decoding for CSOCs involves syndrome formation, LLR computation, and weighted thresholding:
- Syndrome Formation: For each received symbol 1, a hard decision 2 is made. The error bit 3 is computed. The syndrome is:
4
- LLR (Reliability Factor):
5
where 6 includes any a priori LLR from external decoder feedback.
- Self-Orthogonal Check Weighting: For each check 7, the extrinsic weight is formed:
8
using the box-plus operation.
- Threshold Decision Rule: The decoded estimate is:
9
where the full LLR argument includes self-orthogonal check extrinsic weight sums.
3. Sliding-Window Iterative Decoding Algorithm
The SC-PCC decoding algorithm employs a windowed message-passing procedure:
- A window size 0 is selected, targeting block 1.
- The received data for 2 contiguous blocks is demultiplexed into coupled channel inputs for the two parallel streams.
- Two APP threshold decoders process their respective streams, exchanging extrinsic LLRs per block via turbo-style iterations.
The schedule is divided into vertical iterations (3 forward sweeps per block) and horizontal iterations (4 full forward-reverse sweeps). For each target block,
5
component decoder calls are made. Hard decisions are output after 6 horizontal iterations or on successful CRC.
Pseudocode (verbatim from (Cummins et al., 2021)):
7
4. Latency, Memory, and Complexity
Latency
- Encoding: Requires buffering 7 input blocks, leading to latency 8.
- Decoding: Windowed decoding of size 9 adds latency 0. Minimum window for 1 is 2.
Memory
- Encoder memory: 3; for uncoupled PCC: 4.
- Decoder memory: Buffer for extrinsic LLRs 5, interleaver 6, syndrome/channel value registers per component 7; total 8.
Computational Complexity
- For block length 9, average nonzero taps per generator 0 with 1:
| Operation | Per Component Decoder |
|---|---|
| Multiplications | 2 |
| Additions | 3 |
| Box-plus ops | 4 |
Total for sliding window: 5
No BCJR trellis is employed, so the complexity scales linearly in 6. Parallelism is possible by decoding 7 information bits simultaneously ahead of syndrome updates.
5. Performance and Threshold Saturation
Performance evaluation in (Cummins et al., 2021) demonstrates:
- For rate 8 (9), SC-PCC ($2/3$0) achieves a $2/3$1 gain over standard PCC at BER $2/3$2 with similar complexity and latency.
- For rate $2/3$3 ($2/3$4), SC-PCC ($2/3$5) exhibits a $2/3$6 improvement at BER $2/3$7, and increasing $2/3$8 to $2/3$9 yields 0 gain.
Spatial coupling leads to threshold saturation: the iterative decoder’s waterfall region approaches the MAP threshold of the base PCC ensemble, and increases in coupling memory (1) or decoding window (2) further advance performance at the cost of higher latency and memory demands. A window 3 is generally sufficient to exploit most coupling gains. Excessively large 4 can degrade LLR reliability due to edge effects, which may be mitigated by LLR scaling or optimized message-passing schedules.
6. Hardware and Implementation Considerations
The SC-PCC decoder enables efficient VLSI design due to several attributes:
- Linear complexity in constraint length 5, avoiding the computational intensity of BCJR-based trellis decoding.
- High natural parallelism, as 6 information bits are independently threshold-decoded prior to syndrome updates.
- Hardware operations are restricted to additions, comparisons, and box-plus computations, precluding the need for exponentiation or division, resulting in reduced power consumption.
7. Significance and Broader Impact
The SC-PCC decoder with CSOC components and APP threshold decoding achieves notable coding gains over uncoupled PCCs at equivalent complexity and latency by leveraging spatial coupling. The resulting threshold saturation phenomenon demonstrates that performance can approach theoretical MAP limits using practical, scalable decoding rules. This framework highlights the utility of spatial coupling in enhancing the performance of turbo-like constructions while maintaining architectural efficiency (Cummins et al., 2021).