Arbitrary-length analogs to de Bruijn sequences (2108.07759v2)
Abstract: Let $\widetilde{\alpha}$ be a length-$L$ cyclic sequence of characters from a size-$K$ alphabet $\mathcal{A}$ such that the number of occurrences of any length-$m$ string on $\mathcal{A}$ as a substring of $\widetilde{\alpha}$ is $\lfloor L / Km \rfloor$ or $\lceil L / Km \rceil$. When $L = KN$ for any positive integer $N$, $\widetilde{\alpha}$ is a de Bruijn sequence of order $N$, and when $L \neq KN$, $\widetilde{\alpha}$ shares many properties with de Bruijn sequences. We describe an algorithm that outputs some $\widetilde{\alpha}$ for any combination of $K \geq 2$ and $L \geq 1$ in $O(L)$ time using $O(L \log K)$ space. This algorithm extends Lempel's recursive construction of a binary de Bruijn sequence. An implementation written in Python is available at https://github.com/nelloreward/pkl.