Unavoidable pattern

This is an old revision of this page, as edited by Jochen Burghardt (talk | contribs) at 14:46, 26 February 2020 (Avoidability / Unavoidability on a specific alphabet: suggest to give informal and formal def.n separately; italicizing <math> has no effect; italicize defined notions instead; indicate that equivalence of both def.ns is a theorem; link to def.n of "infinite word"). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In mathematics and theoretical computer science, a pattern is an unavoidable pattern if it's unavoidable on any finite alphabet.

Definitions

Pattern

Like a word, a pattern (also referred to as term) is a sequence of characters over some alphabet.

The minimum multiplicity of pattern   is   where   is the number of occurrence of character   in pattern  . In other words, it is the number of occurrences in   of the least frequently occurring character in  .

Instance

Given finite alphabets   and  , a word   is an instance of pattern  , if there exists a non-erasing semigroup morphism   such that  , where   denotes the Kleene star of  . Non-erasing means that   for all  , where   denotes the empty string.

Avoid / Match

A word   is said to match, or encounter, a pattern   if a substring of   is an instance of p. Otherwise,   is said to avoid  , or to be  -free.

Avoidability / Unavoidability on a specific alphabet

A pattern   is unavoidable on a finite alphabet   if each sufficiently long word   must match  ; formally: if  . Otherwise,   is avoidable on  , which implies there exist infinitely many words over alphabet   that avoid  . It can be shown that   is avoidable on   if, and only if, there exists an infinite word   that avoids  .

Avoidable / Unavoidable pattern

A pattern   is an unavoidable pattern(also reference as: blocking term) if   is unavoidable on any finite alphabet.

If a pattern is said to be unavoidable and not limited to a specific alphabet, then it's unavoidable for any finite alphabet by default. Conversely, if a pattern is said to be avoidable and not limited to a specific alphabet, then it's avoidable on some finite alphabet by default.

 -avoidable /  -unavoidable

A pattern   is  -avoidable if   is avoidable on an alphabet   of size  . Otherwise,   is  -unavoidable, which means   is unavoidable on every alphabet of size  .[1][2]

If pattern   is  -avoidable, then   is  -avoidable for all  .

Avoidability index

The avoidability index of a pattern   is the smallest   such that   is  -avoidable,   if   is unavoidable.[3]

Properties

  • Let pattern   be an instance of avoidable pattern  , then   is also avoidable.[4]
  • Let avoidable pattern   be a factor of pattern  , then   is also avoidable.[4]
  • A pattern   is unavoidable if and only if   is a factor of some unavoidable pattern  .
  • Given an unavoidable pattern   and a character   not in  , then   is unavoidable.[4]
  • Given an unavoidable pattern  , there exist a character   such that   occurs excatly once in  .[4]
  • Given a pattern  , let   represents the number of distinct characters of  . if  , then   is avoidable.[4]

Zimin words

Given alphabet  , Zimin words (patterns) are defined recursively   for   and base case  .

Unavoidability

All Zimin words are unavoidable.[5]

A word   is unavoidable if and only if it's a factor of a Zimin word.[5]

Given a finite alphabet  , let   represents the smallest   such that   matches   for all  . We have following properties:[6]

  •  
  •  
  •  
  •  

  is the longest unavoidable patterns constructed by alphabet   since  

Pattern reduction

Free letter

Given a pattern   over some alphabet  , we say   is free for   if there exist subsets   of   such that the following hold:

  1.   is a factor of   and   ↔    is a factor of   and  
  2.  

e.g. let  , then   is free for   since there exist   satisfied conditions above.

Reduce

A pattern   can reduce to pattern   if there exists a character   such that   is free for  , and   can be obtained by removing all occurrence of   from  . Denote as  .

e.g. let  , then   can reduce to   since   is free for  .

Transitivity

Given patterns  , if   can reduce to   and   can reduce to  , then   can reduce to  . Denote as  .

Unavoidability

A pattern   is unavoidable if and only if   can reduce to a word of length one, hence   where .[7][5]

Graph pattern avoidance[8]

Avoid / Match on a specific graph

Given a simple graph  , a edge coloring   matches pattern   if there exists a simple path   in   such that the sequence   matches  . Otherwise,   is said to avoid   or  -free.

Similarly, a vertex coloring   matches pattern   if there exists a simple path   in   such that the sequence   matches  .

Pattern chromatic number

The pattern chromatic number   is the minimal number of distinct colors needed for a  -free vertex coloring   over the graph  .

Let   where   is the set of all simple graphs with a maximum degree no more than  .

Similarly,   denote for edge coloring.

Avoidability / Unavoidability on graphs

A pattern   is avoidable on graphs if   is bounded by  , where   only depend on  .

  • Avoidance on words can be expressed as a specific case of avoidance on graphs, hence a pattern   is avoidable on any finite alphabet if and only if   for all   where   is a graph of   vertexs concatenated.


There exist an absolute constant  , such that   for all pattern   with  .

Given a pattern  , let   represents the number of distinct characters of  . if  , then   is avoidable on graphs.

Given a pattern   such that   is even for all  , then   for all   where   is the complete graph of n vertices.

Examples

  • The Thue–Morse sequence is cube-free and overlap-free, hence it avoids the patterns   and  .[1][2]
  • The patterns   and   are unavoidable on any alphabet, since they are factors of the Zimin words.[9][10]
  • The power patterns   for   are 2-avoidable.[1]
  • A square-free word is one avoiding the pattern  . An example is the word over the alphabet   obtained by taking the first difference of the Thue–Morse sequence.[11][12] See also Square-free word.
  • All binary patterns can divided into three categories:[13]
    •   are unavoidable.
    •   have avoidability index of 3.
    • others have avoidability index of 2.
  •   has avoidability index of 4, as well as other locked words.[14]
  •   has avoidability index of 5.[15]

Open problems

  • Is there an avoidable pattern   such that the avoidability index of   is 6?

References

  1. ^ a b c Lothaire (2011) p. 113
  2. ^ a b Berstel et al (2009) p.127
  3. ^ Lothaire (2011) p.124
  4. ^ a b c d e Schmidt, Ursula (1987-08-01). "Long unavoidable patterns". Acta Informatica. 24 (4): 433–445. doi:10.1007/BF00292112. ISSN 1432-0525.
  5. ^ a b c Zimin, A. I. (1984). "BLOCKING SETS OF TERMS". Mathematics of the USSR-Sbornik. 47 (2): 353. doi:10.1070/SM1984v047n02ABEH002647. ISSN 0025-5734.
  6. ^ Joshua, Cooper; Rorabaugh, Danny (2013). Bounds on Zimin Word Avoidance. arXiv:1409.3080. Bibcode:2014arXiv1409.3080C.
  7. ^ BEAN, DWIGHT RICHARD; EHRENFEUCHT, ANDRZEJ; MCNULTY, GEORGE FRANK (1979). Avoidable Patterns in Strings of Symbols (PDF).
  8. ^ Grytczuk, Jarosław (2007-05-28). "Pattern avoidance on graphs". Discrete Mathematics. The Fourth Caracow Conference on Graph Theory. 307 (11): 1341–1346. doi:10.1016/j.disc.2005.11.071. ISSN 0012-365X.
  9. ^ Allouche & Shallit (2003) p.24
  10. ^ Lothaire (2011) p.115
  11. ^ Pytheas Fogg (2002) p.104
  12. ^ Berstel et al (2009) p.97
  13. ^ Lothaire, M. (2002). Algebraic Combinatorics on Words.
  14. ^ Baker, Kirby A.; McNulty, George F.; Taylor, Walter (1989-12-18). "Growth problems for avoidable words". Theoretical Computer Science. 69 (3): 319–345. doi:10.1016/0304-3975(89)90071-6. ISSN 0304-3975.
  15. ^ Clark, Ronald J. (2006-04-01). "The existence of a pattern which is 5-avoidable but 4-unavoidable". International Journal of Algebra and Computation. 16 (02): 351–367. doi:10.1142/S0218196706002950. ISSN 0218-1967.