Papers
Topics
Authors
Recent
Search
2000 character limit reached

Ins-Robust Primitive Words

Published 4 Jul 2017 in math.CO and cs.FL | (1707.01010v2)

Abstract: Let Q be the set of primitive words over a finite alphabet with at least two symbols. We characterize a class of primitive words, Q_I, referred to as ins-robust primitive words, which remain primitive on insertion of any letter from the alphabet and present some properties that characterizes words in the set Q_I. It is shown that the language Q_I is dense. We prove that the language of primitive words that are not ins-robust is not context-free. We also present a linear time algorithm to recognize ins-robust primitive words and give a lower bound on the number of n-length ins-robust primitive words.

Summary

  • The paper characterizes ins-robust primitive words ($Q_I$), defining them as primitive words that remain primitive after inserting any letter from the alphabet.
  • The authors develop a linear-time algorithm to recognize these words and establish properties, including their density and a lower bound on their number.
  • The study demonstrates that the language of non-ins-robust primitive words is not context-free, showing the complexity and relationship of $Q_I$ to formal language classes.

The paper characterizes a subclass of primitive words referred to as ins-robust primitive words, denoted as QIQ_I, which maintain primitivity upon insertion of any letter from the underlying alphabet VV. The study details several properties of QIQ_I, demonstrates its density, and provides a linear-time algorithm for recognizing these words. Furthermore, a lower bound on the number of nn-length ins-robust primitive words is established, and it is proven that the language of primitive words that are not ins-robust is not context-free.

Here's a breakdown:

Basic Definitions and Properties:

  • A word is defined as a sequence of symbols from a finite alphabet VV.
  • A word ww is primitive if it is not a power of any other word (i.e., w=vnw = v^n implies n=1n = 1 and w=vw = v).
  • An ins-robust primitive word ww of length nn remains primitive after inserting any letter a∈Va \in V at any position i∈{0,1,…,n}i \in \{0, 1, \ldots, n\}.
  • The language of ins-robust primitive words (QIQ_I) is a subset of the language of primitive words (QQ).
  • The paper references prior work showing that if u1u2≠anu_1 u_2 \neq a^n, then at least one of the words among u1u2u_1 u_2 and u1au2u_1 a u_2 is primitive.

Characterization of Ins-Robust Primitive Words:

  • A central theorem states that a primitive word ww is not ins-robust if and only if it can be expressed as w=uru1u2usw = u^{r} u_1 u_2 u^{s}, where u=u1cu2∈Qu = u_1 c u_2 \in Q, u1,u2∈V∗u_1, u_2 \in V^*, c∈Vc \in V, and r,s≥0r, s \geq 0 with r+s≥1r + s \geq 1.
  • The set of non-ins-robust primitive words is denoted as QI‾=Q∖QIQ_{\overline{I}} = Q \setminus Q_I.
  • If u,v∈Qu, v \in Q and um=u1u2u^m = u_1 u_2 and v=u1cu2v = u_1 c u_2 for some c∈Vc \in V, then umvn∈QI‾u^m v^n \in Q_{\overline{I}} for m,n≥2m, n \geq 2.
  • If w∈QIw \in Q_I, then rev(w)∈QIrev(w) \in Q_I, where rev(w)rev(w) is the reverse of ww.
  • The language QIQ_I is reflective; that is, if uv∈QIuv \in Q_I, then vu∈QIvu \in Q_I.

Density of Ins-Robust Primitive Words:

  • A language LL is dense if for every w∈V∗w \in V^*, there exist x,y∈V∗x, y \in V^* such that xwy∈Lxwy \in L.
  • If ∣w∣=n|w| = n and wan∈QI‾wa^n \in Q_{\overline{I}} where w∉a∗w \notin a^*, then wan=u2u1u2wa^n = u^2 u_1 u_2, where u=u1bu2u = u_1 b u_2 for b≠ab \neq a.
  • If wan∈QI‾wa^n \in Q_{\overline{I}}, then for b≠ab \neq a, wbn∈QIwb^n \in Q_I.
  • The language QIQ_I is dense over the alphabet VV.

Relation to Formal Languages:

  • The paper investigates the relationship between QIQ_I and traditional languages in the Chomsky hierarchy.
  • The language QIQ_I is shown to be non-regular. This is proven using a pumping lemma argument and leveraging the result that for any fixed integer kk, there exists a positive integer mm such that the equation system (k−j)xj+j=m(k-j)x_j + j = m, j=0,1,2,…,k−1j = 0, 1, 2, \ldots, k-1 has a nontrivial solution with positive integers x1,x2,…,xj>1x_1, x_2, \ldots, x_j > 1.
  • The language of non-ins-robust primitive words, QI‾Q_{\overline{I}}, is proven to be non-context-free for a binary alphabet. This is shown via contradiction using the pumping lemma for context-free languages.

Counting Ins-Robust Primitive Words:

  • Let Z(k)=Vk∖QZ(k) = V^k \setminus Q be the set of nn-length non-primitive words.
  • Let m∈Nm \in \mathbb{N} and m=m1r1m2r2…mtrtm = {m_1}^{r_1} {m_2}^{r_2} \ldots {m_t}^{r_t} be the factorization of mm, where all mi,1≤i≤tm_i, 1 \leq i \leq t, are prime and mi≠mjm_i \neq m_j for i≠ji \neq j, then the number of primitive words of length mm is equal to

    ∣V∣m−∑1≤i≤t∣V∣mmi+∑1≤i≤j≤t∣V∣mmimj −∑1≤i≤j≤k≤t∣V∣mmimjmk+⋯+(−1)t−1∣V∣mm1m2⋯mt\begin{array}{rl} |V|^m - & \sum\limits_{1\le i\le t}|V|^{\frac{m}{m_i}} + \sum\limits_{1\le i\le j \le t}|V|^{\frac{m}{m_i m_j}} \ - & \sum\limits_{1\le i\le j \le k \le t}|V|^{\frac{m}{m_i m_j m_k}} + \cdots + (-1)^{t-1} |V|^{\frac{m}{m_1 m_2 \cdots m_t}} \end{array}

  • An upper bound on the number of non-ins-robust primitive words of length n≥2n \geq 2 is found as:

    ∣QI‾(n)∣≤(n+1)⋅(∣Z(n+1)∣−∣V∣).|Q_{\overline{I}(n)}| \leq (n+1) \cdot (|Z(n+1)| - |V|).

  • The number of ins-robust-primitive words of length nn, QI(n)Q_I(n), over an alphabet VV is equal to ∣Qn∣−∣QI‾(n)∣|Q_n| - |Q_{\overline{I}(n)}|.

Algorithm for Recognizing Ins-Robust Primitive Words:

  • A linear-time algorithm is presented to determine if a given primitive word ww is ins-robust.
  • The algorithm leverages the property that a word uu is non-ins-robust if and only if uuuu contains at least one periodic word of length ∣u∣|u| with period pp such that pp divides of length ∣u∣+1|u| + 1 and p≤∣u∣p \leq |u|.
  • The algorithm utilizes an existing algorithm for finding maximal repetitions in linear time.

Open Problems:

The paper concludes by posing questions for future research:

  1. Is QIiQ_I^i for i≥2i \geq 2 regular?
  2. Is the language of ins-robust primitive words QIQ_I a deterministic context-free language?

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.