Papers
Topics
Authors
Recent
2000 character limit reached

Sturmian Characteristic Word

Updated 19 October 2025
  • Sturmian characteristic word is an infinite binary sequence defined by an irrational slope and its continued fraction expansion, yielding exactly n+1 distinct factors of length n.
  • It is generated by standard morphisms prescribed by a directive sequence, resulting in a fixed point that is balanced and uniformly recurrent.
  • The detailed combinatorial decomposition into singular and adjoining words interconnects symbolic dynamics, number theory, and quasicrystal theory for algorithmic analysis.

A Sturmian characteristic word is an infinite binary word cαc_\alpha associated to an irrational slope α(0,1)\alpha\in(0,1); it arises as the canonical symbolic coding of the minimal aperiodic sequences with factor complexity n+1n+1 for each length nn. The combinatorics, structure, and generation of cαc_\alpha are governed by the continued fraction expansion of α\alpha, encoded in a directive sequence that prescribes standard morphisms whose fixed points are precisely the characteristic words. The theory connects word combinatorics, symbolic dynamics, continued fractions, morphic substitutions, and number theory in a unified framework that supports explicit decomposition results, factorization, algorithmic generation, and deep classifications of infinite aperiodic order.

1. Definition, Standard Sequences, and Minimal Complexity

A characteristic Sturmian word cαc_\alpha over an alphabet A={a,b}\mathcal{A} = \{a, b\} is determined for irrational α(0,1)\alpha \in (0,1) by its continued fraction

α=[0;1+d1,d2,],di>0, d11.\alpha = [0; 1 + d_1, d_2, \dots], \quad d_i > 0, \ d_1 \geq 1.

Given the directive sequence (d1,d2,)(d_1, d_2, \ldots), define the standard sequence recursively: s1=b,s0=a,sn=sn1dnsn2(n1).s_{-1} = b, \quad s_0 = a, \quad s_n = s_{n-1}^{d_n} s_{n-2} \quad (n \geq 1). The infinite word

cα=limnsnc_\alpha = \lim_{n \to \infty} s_n

is the characteristic Sturmian word of slope α\alpha. It satisfies

n0,Card({factors of length n in cα})=n+1,\forall n \geq 0, \quad \mathrm{Card}(\{\text{factors of length }n\text{ in }c_\alpha\}) = n + 1,

giving the minimal complexity for an aperiodic word. c1αc_{1-\alpha} is defined analogously as the characteristic word for slope 1α1-\alpha.

These words are balanced (any two factors of equal length differ by at most one in the number of each letter), uniformly recurrent, and serve as the canonical "quasicrystals" in symbolic combinatorics (0708.4387).

2. Generation via Morphisms and Fixed Point Structure

Sturmian characteristic words are generated by "standard" morphisms built from the directive sequence. Consider the morphism σ\sigma associated with the continued fraction expansion as in (0708.4387):

  • The standard morphism σ\sigma is defined so that for all mm, σm(ab)=sm+1|\sigma^m(a b)| = |s_{m+1}|.
  • The letter exchange EE (involution aba\leftrightarrow b) defines σ^=EσE\hat{\sigma} = E \sigma E.

Fixed point property: cαc_\alpha is a fixed point of any power of the standard morphism: cα=limmσm(a),c1α=limmσ^m(b),c_\alpha = \lim_{m\rightarrow \infty} \sigma^m(a), \qquad c_{1-\alpha} = \lim_{m\rightarrow \infty} \hat{\sigma}^m(b), if and only if α=[0;1+d1,d2,,dn]\alpha = [0; 1 + d_1, d_2, \ldots, d_n] with dnd11d_n \geq d_1 \geq 1.

Thus, the morphic generation process is entirely determined by the continued fraction of α\alpha, and the standard sequence reflects the combinatorial structure of cαc_\alpha mirrored in the inflation structure of σ\sigma (0708.4387).

3. Conjugates, Singular Decomposition, and Structure

Conjugation extends classically: for infinite xx and kNk \in \mathbb{N}, the kk-th conjugate is the infinite word with prefix of length kk removed. The central result of (0708.4387) is that every conjugate of cαc_\alpha admits a decomposition into "generalized adjoining singular words" (Melançon's singular word decomposition).

For k=qm+1pk = q_{m+1} - p, 2pqm+1qm+12 \leq p \leq q_{m+1} - q_m + 1, where (qn)(q_n) are denominators of convergents of α\alpha, the kk-th conjugate

(σm)k(cα)(\sigma^m)^k(c_\alpha)

admits a decomposition

(σm)k(cα)=u1(UmUm+1Um+2),(\sigma^m)^k(c_\alpha) = u^{-1}(U_m U_{m+1} U_{m+2}\ldots),

where UjU_j are (generalized) adjoining singular words determined from the standard sequence sjs_j and uu is a prefix of a word Vm1V_{m-1} also expressible via qmq_m, pp. This generalizes decompositions previously available for the Fibonacci word (α=[0;2,1]\alpha = [0;2,1]).

The original singular word decomposition of cαc_\alpha takes the form

cα=W1U0U1U2c_\alpha = W_{-1} U_0 U_1 U_2 \ldots

with singular WnW_n and adjoining singular UnU_n built in terms of sns_n (0708.4387).

4. Continued Fraction Expansion and Recurrence Structure

The continued fraction expansion is fundamental:

  • The directive sequence (d1,d2,)(d_1, d_2, \ldots) both determines the standard sequence, and prescribes the combinatorial inflation for cαc_\alpha via σ\sigma.
  • The lengths of sns_n obey sn=qn|s_n| = q_n, where qnq_n is the denominator of the nn-th convergent to α\alpha: q0=1,q1=1+d1,qn=dnqn1+qn2.q_0 = 1, \quad q_1 = 1 + d_1, \quad q_n = d_n q_{n-1} + q_{n-2}.
  • The decomposition formulae for conjugates of cαc_\alpha involve arithmetic relationships directly in terms of qnq_{n} and the partial quotients dnd_n.

Thus, continued fraction expansions of slopes are not just auxiliary number-theoretic data—they impose the full combinatorial "directive order" on cαc_\alpha and its conjugates (0708.4387).

5. Explicit Example: the Case α=[0;2,7]\alpha = [0;2,7]

When α=[0;2,r]\alpha = [0;2, r] (e.g., r=7r = 7), (0708.4387) gives the complete singular decomposition for every conjugate:

  • The standard morphism σ\sigma has σm(ab)=sm+1=qm+1|\sigma^m(a b)| = |s_{m+1}| = q_{m+1} for every mm.
  • For each mm, there exists a word VmV_m of length Vm=qm+2qm+1|V_m| = q_{m+2} - q_{m+1} involved in the singular decomposition of conjugates.
  • For k=qm+1pk = q_{m+1} - p (with 2pqm+1qm+12 \leq p \leq q_{m+1}-q_m+1), one has

(σm)k(cα)=u1(Uj)jm,(\sigma^m)^k(c_\alpha) = u^{-1}(U_j)_{j\geq m},

where uu is a prefix of Vm1V_{m-1}, and the decomposition is fully explicit in terms of the standard sequence and continued fraction data.

This generalizes the earlier results for the infinite Fibonacci word and highlights the explicit influence of partial quotients and convergents on the combinatorics of conjugate decompositions (0708.4387).

6. Applications and Broader Context

The decompositions and structure furnished for characteristic Sturmian words generated by morphisms have several important ramifications:

  • In symbolic dynamics and combinatorics on words, these decompositions are essential for analyzing recurrence, palindromic factors, and fine balance properties within minimal complexity infinite words.
  • In number theory, the link between continued fractions and Sturmian word generation produces a direct bridge to Diophantine approximation and cutting-sequence algorithms.
  • In theoretical computer science, especially pattern recognition and formal languages, singular and adjoining singular word factorizations support efficient string matching and the analysis of self-similarity in aperiodic sequences.
  • In quasicrystal theory and aperiodic physical models, the self-similarity and hierarchical structure elucidated by morphic decompositions model hierarchical order and tiling properties.

The explicit decomposition results—where each conjugate can be built from generalized adjoining singular words whose structure is tightly controlled by the continued fraction coefficients—equip practitioners with robust combinatorial and algorithmic tools for dissecting and analyzing Sturmian characteristic words in a variety of mathematical and applied settings (0708.4387).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Follow Topic

Get notified by email when new papers are published related to Sturmian Characteristic Word.