Papers
Topics
Authors
Recent
Search
2000 character limit reached

Covering Arrays in Combinatorics

Updated 29 January 2026
  • Covering arrays are combinatorial matrices that ensure every t-wise interaction among parameters appears at least once, providing rigorous test coverage.
  • They are constructed using probabilistic methods, group-theoretic techniques, and algorithmic tools like the Lovász Local Lemma to achieve minimal test sizes.
  • Applications span software testing, combinatorial design, and extremal set theory, driving research on asymptotic bounds and optimal configurations.

A covering array is a fundamental combinatorial object, central to interaction testing, combinatorial design, and extremal set theory. It is an N×kN \times k array over a vv-ary alphabet in which every N×tN \times t subarray contains all possible vtv^t tuples in its rows, guaranteeing exhaustive coverage of all tt-way interactions among kk parameters, each with vv levels. The covering array number CAN(t,k,v)\mathrm{CAN}(t, k, v) is the minimum NN for which such an array exists.

1. Formal Definitions and Basic Properties

A covering array CA(N;t,k,v)\mathrm{CA}(N; t, k, v) is an N×kN \times k matrix AA over {0,1,,v1}\{0,1,\ldots,v-1\} with the property

T={i1,,it}{1,,k},and x{0,,v1}t, r{1,,N} such that (Ar,i1,,Ar,it)=x.\forall\,T=\{i_1,\ldots,i_t\}\subseteq\{1,\ldots,k\},\quad \text{and}\ x\in\{0,\ldots,v-1\}^t,\ \exists\,r\in\{1,\ldots,N\}\ \text{such that}\ (A_{r,i_1},\ldots,A_{r,i_t}) = x.

The covering array number: CAN(t,k,v)=min{N:CA(N;t,k,v) exists}.\mathrm{CAN}(t,k,v) = \min\{ N : \mathrm{CA}(N;t,k,v)\ \text{exists} \}. Orthogonal arrays OAλ(t,k,v)OA_\lambda(t,k,v) require each tt-tuple to appear exactly λ\lambda times in every tt-column subarray; covering arrays require at least one occurrence. Covering arrays generalize orthogonal arrays and provide the minimal test size needed to guarantee tt-wise coverage (Hiess et al., 20 Oct 2025).

2. Asymptotic Bounds and Constructions

Logarithmic Growth

The classical result (Katona–Kleitman–Godbole) is

CAN(t,k,v)=Θ(log2k)\mathrm{CAN}(t,k,v) = \Theta(\log_2 k)

for fixed tt, vv and kk \to \infty (Francetić et al., 2015).

Probabilistic and Analytical Bounds

The Lovász Local Lemma (LLL) yields: CAN(t,k,v)(t1)log2klog2(vt/(vt1))(1+o(1))\mathrm{CAN}(t,k,v) \leq \frac{(t-1)\,\log_2 k}{\log_2(v^t/(v^t-1))}(1+o(1)) with more refined constructions, e.g., fixed-weight columns, improving constants for small tt and vv (Yuan et al., 2014).

Entropy-compression (algorithmic LLL) further refines constants: d(t,v)v(t1)log2(vt1/(vt11))d(t,v) \leq \frac{v(t-1)}{\log_2(v^{t-1}/(v^{t-1}-1))} and, via multivariable optimization, to (Francetić et al., 2015): d(t,v)v(t1)(t1)log2(vv/(v1)v1)f0(t,v)d(t,v) \leq \frac{v(t-1)}{ (t-1)\log_2( v^v/(v-1)^{v-1} ) - f_0(t,v) } for d(t,v):=lim supkCAN(t,k,v)/log2kd(t,v) := \limsup_{k \to \infty}\mathrm{CAN}(t,k,v)/\log_2 k.

3. Exact Results for Small Parameters and Uniqueness

Binary Arrays and Strength Two

For q=2q=2, t=2t=2, maximal binary 2-covering arrays are unique up to equivalence, given by the standard maximal array—an m×(m1m/21)m \times \binom{m-1}{\lfloor m/2 \rfloor-1} matrix whose columns are all binary vectors of fixed weight (Choi et al., 2011).

Computational Determination

Modern computational methods (isomorph-free exhaustive search, canonical augmentation) have determined exact optimal covering arrays for 21 strength-two cases with v>2v>2, N>v2N>v^2 (Kokkala et al., 2019). All exhibit uniformity (equal symbol-frequency in each column).

4. Generalizations: Mixed, Hypergraph, and Sequence Covering Arrays

Mixed Covering Arrays

Generalized covering designs handle multi-part alphabets and block sizes, providing lower and upper bounds via Schőnheim's bound, edge-counting, and recursive constructions (Bailey et al., 2010). Partial and mixed-level arrays are realized by partial covering arrays and hypergraph-based models.

Hypergraph Covering Arrays

Covering arrays on rr-uniform hypergraphs restrict coverage to prescribed interactions, e.g., only certain triples. Inductive construction uses hooking operations (vertex/edge additions), achieving optimally small arrays for α\alpha-acyclic and conformal hypertrees (Akhtar et al., 2015).

Sequence and Perfect Sequence Covering Arrays

Sequence covering arrays (SCAs) are sets of permutations covering all ordered kk-subsequences; perfect sequence covering arrays (PSCAs) require exact multiplicity λ\lambda (Na et al., 2022, Gentle et al., 2022). The minimal such λ\lambda is denoted g(n,k)g(n,k). For (n,k){(5,3),(6,3),(7,3),(7,4)}(n,k) \in \{(5,3),(6,3),(7,3),(7,4)\}, g(n,k)=2g(n,k)=2; for (8,3)(8,3), g(8,3)=3g(8,3)=3 (Na et al., 2022). PSCAs are tightly connected to directed designs and deletion-correcting codes.

5. Partial and Relaxed Covering Arrays

Relaxed requirements give rise to partial covering arrays, covering only a fraction of tt-sets or tuples (Sarkar et al., 2016). Important results:

  • Partial covering arrays with fraction α\alpha:

N=O(vt(t1)lnkvtm+1)N = O\left( \frac{v^t(t-1)\,\ln k}{v^t - m + 1} \right)

  • ϵ\epsilon-almost covering arrays:

N=O(vtln(vt1/ϵ))N = O( v^t \ln( v^{t-1}/\epsilon ) )

Moser–Tardos resampling and Markov-type randomized algorithms ensure efficient generation, matching information-theoretic lower bounds up to constant factors.

6. Arrays with Higher Index (Replication) and Constraints

Covering Arrays of Index λ\lambda

For λ>1\lambda>1 (every tt-tuple occurs at least λ\lambda times), the main asymptotic bound is (Calbert et al., 2022): CANλ(t,k,v)=Θv,t(logk+λ)\mathrm{CAN}_\lambda(t,k,v) = \Theta_{v,t}(\log k + \lambda) removing previous λloglogk\lambda \log \log k terms. Improved leading constants are obtained via the Lovász Local Lemma and two-stage alteration schemes; graph coloring yields further reductions for higher λ\lambda.

SAT/MaxSAT-Based Construction

Satisfiability-based encodings of the covering array problem allow for exact and suboptimal solving, even under additional constraints (forbidden tuples, system-specific restrictions) (Ansótegui et al., 2021). MaxSAT variants minimize test suite size; incomplete MaxSAT is especially effective on large-scale constrained problems.

7. Algebraic and Group-Theoretic Constructions

Finite Field and Group Development Constructions

Maximal sequences (m-sequences), cyclic trace arrays, and group actions (e.g., PGL(2,q)(2,q)) are exploited to yield covering arrays of high strength and efficiency (Maity et al., 2015, Tzanakis, 2017). For strength t=4t=4, g=3g=3, explicit bounds 4-CAN(k,3)12k+34\text{-}CAN(k,3) \le 12k + 3 are achieved using projective general linear group constructions and starter vectors. For binary arrays, concatenation of cyclic Hamming codes, self-dual sequence families, interleaving, and primitive polynomial periodicity yield near-optimal covering sequences and arrays (Chee et al., 12 Feb 2025, Chee et al., 2024).

8. Open Problems and Current Research Directions

Key open areas include:

  • Determining CAN(t,k,v)\mathrm{CAN}(t,k,v) for more small (t,k,vt,k,v) and high strength (t>4t>4) parameter sets (Hiess et al., 20 Oct 2025).
  • Improving leading constants and sharpening lower bounds, especially for partial arrays and arrays of higher index.
  • Extending algebraic and group-theoretic constructions to broader parameter ranges, especially using cyclotomy and discrete logarithms (Tzanakis, 2017).
  • Classification and existence problems for mixed, hypergraph, and constrained covering arrays.
  • Connections to perfect sequence covering arrays, deletion codes, and directed tt-designs.
  • Systematic study of optimal arrays for non-binary alphabets and verification of the uniformity conjecture (Kokkala et al., 2019).
  • Efficient SAT/MaxSAT and enumeration algorithms capable of exact or near-optimal constructions in real-world, large-scale systems.

Table: Summary of Covering Array Number Bounds

Type Primary Bound or Complexity Reference
Classical (tt-way, full) O((t1)vtlogk)O((t-1)v^t\log k); Θ(vtlogk)\Theta(v^t\log k) (Sarkar et al., 2016)
Logarithmic growth (tt fixed) Θ(log2k)\Theta(\log_2 k) (Francetić et al., 2015)
Probabilistic/LLL bound (t1)log2k/log2(vt/(vt1))(t-1)\log_2 k/\log_2(v^t/(v^t-1)) (Yuan et al., 2014)
Entropy compression v(t1)/log2[vt1/(vt11)]v(t-1)/\log_2[v^{t-1}/(v^{t-1}-1)] (Francetić et al., 2015)
Higher index λ\lambda Θ(logk+λ)\Theta(\log k+\lambda) (Calbert et al., 2022)
Small q=2,t=2q=2, t=2 (exact) ((m1m/21))(m-1\choose \lfloor m/2 \rfloor-1) columns (Choi et al., 2011)
Partial/relaxed coverage O(vt1logk)O(v^{t-1}\log k) for small relaxations (Sarkar et al., 2016)
Algebraic/group development (t3t\ge3) Polynomial/construction-dependent (Maity et al., 2015)

Covering arrays remain a rich, continually evolving domain at the intersection of extremal combinatorics, algorithmic theory, algebraic design, and practical test suite optimization.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Covering Arrays in Combinatorics.