Reed–Muller Codes: Distance Distribution
- Distance Distribution of Reed–Muller Codes is the enumeration of codewords by Hamming weight, revealing the combinatorial structure and error-correcting capabilities.
- The methodology leverages recursive constructions like the (u, u+v)-technique and derivative methods to precisely characterize weight spectra in various regimes.
- Asymptotic analyses and probabilistic bounds, including binomial approximations, offer practical insights into weight plateaus and performance limits in large finite fields.
A Reed–Muller code is a family of linear codes with broad significance in both theoretical and applied coding theory, combinatorics, and computer science. The distance distribution—also known as the weight spectrum—of a Reed–Muller code refers to the enumeration of codewords by their Hamming weights, providing a precise profile of the code's combinatorial structure and its error-correcting performance. The study of distance distributions includes exact characterizations for certain parameter regimes, asymptotic error bounds, the structure of small-weight codewords, and unified frameworks for codes over various finite fields.
1. Fundamental Definitions and Notation
Let denote the finite field with elements. For integers , the -ary Reed–Muller code consists of evaluation vectors of all -variate polynomials of total degree at most , with entries in , evaluated over the points of : The parameters are:
- Length
- Dimension
- Minimum distance
For binary codes (), the weight (distance) spectrum is the set of Hamming weights of all codewords in , and the weight enumerator counts the number of codewords of weight .
2. Exact Weight Spectra: Families and
Recent work establishes explicit formulas for the weight spectra of two infinite families: for and for (Carlet et al., 2023). The determination proceeds via induction on utilizing the -construction:
- , the setwise sum of all possible weights.
Weight spectrum for ():
Equivalently,
Weight spectrum for ():
Or equivalently,
The proofs combine induction with exclusion of "forbidden holes" in the possible weight intervals, rigorously constrained by the Kasami–Tokura characterization:
- For weights in , the only allowable weights take the form for integer .
Explicit computations confirm this structure for specific small codes, such as and , whose complete spectra are tabulated (Carlet et al., 2023).
3. Structured Descriptions: Kasami–Tokura Bound and Forbidden Gaps
The interval is governed by the Kasami–Tokura theorem: only weights , for suitable , appear. As a result, the weight spectra of for fixed and large comprise:
- Isolated "small" weights fully prescribed by Kasami–Tokura
- Further isolated weights in , governed by the Kasami–Tokura–Azumi (KT–A) classification for weights
- A contiguous sequence of even weights ("central interval") in the middle
- Complements to of the isolated weights
For , the above structure is completely determined. For , this remains conjectural.
4. Asymptotic and Probabilistic Bounds: Character-Sum Framework
A general asymptotic analysis of over arbitrary finite fields is achieved via the character-sum method (Kolekar, 26 Jan 2026). For any received word and , the coset-weight distribution admits a binomial-approximation: with an explicit error bound: where and is polynomial in and . For fixed and , the ratio uniformly in .
This character-sum framework generalizes previous results for Reed–Solomon codes (the case) and reveals that, for large , the Reed–Muller distance distribution is sharply concentrated around the binomial estimate.
5. Global Weight Distribution: Plateaus and Combinatorial Structure
The cumulative weight distribution and the multiplicities at given weights exhibit a "plateau" phenomenon (0811.2356):
- For each , the distribution remains essentially constant in , rising exponentially at the cutoff points .
- For , .
- The asymptotics in each plateau obey as with fixed.
Upper and lower bounds for are established:
for constants , .
6. Techniques: -Construction and Derivative Methods
The recursive -construction underpins the inductive computation of spectra. For , each codeword can be written as with , . This implies , tightly constraining possible weights.
For asymptotic upper bounds and plateaus, the discrete derivative method is central (0811.2356). Mapping Boolean codewords into , repeated application of directional differences uncovers bias and allows for representations of low-weight words via a controlled number of derivatives, bounding the number of possible codewords at each weight level.
On the enumeration side, the character-sum approach utilizes:
- Lagrange-indicator polynomials for zero-set specification
- Additive and multiplicative characters on the quotient algebra of polynomials to enforce coefficient constraints
- Evaluation of Gauss sums and Möbius-inversion to control error terms, combined with the Li–Wan permutation sieve for distinctness in summations (Kolekar, 26 Jan 2026)
7. Open Problems and Conjectural Spectra
For the general family with fixed and , it is conjectured that the weight spectrum consists precisely of:
- Isolated gaps at small weights as predicted by Kasami–Tokura and KT–A results
- A single run of consecutive even weights ("central interval") in the middle
- Complements to of the exceptional weights
This conjecture remains open for (Carlet et al., 2023). The rigorous study of the coset-weight distribution for codes over large finite fields is now addressed systematically, but extensions to higher , detailed spectra for nonbinary codes, and explicit combinatorial characterizations in the intermediate regime remain active topics.
Key References:
- "The weight spectrum of two families of Reed–Muller codes" (Carlet et al., 2023)
- "On the Distance Distribution of Reed–Muller Codes" (Kolekar, 26 Jan 2026)
- "The List-Decoding Size of Reed-Muller Codes" (0811.2356)