unavoidable pattern
In mathematics and theoretical computer science, a pattern is an unavoidable pattern if it is unavoidable on any finite alphabet.
Definitions
= Pattern =
Like a word, a pattern (also called term) is a sequence of symbols over some alphabet.
The minimum multiplicity of the pattern is where is the number of occurrence of symbol in pattern . In other words, it is the number of occurrences in of the least frequently occurring symbol in .
= Instance =
Given finite alphabets and , a word is an instance of the pattern if there exists a non-erasing semigroup morphism such that , where denotes the Kleene star of . Non-erasing means that for all , where denotes the empty string.
= Avoidance / Matching =
A word is said to match, or encounter, a pattern if a factor (also called subword or substring) of is an instance of . Otherwise, is said to avoid , or to be -free. This definition can be generalized to the case of an infinite , based on a generalized definition of "substring".
= Maximal ''p''-free word =
Given a pattern and an alphabet . A -free word is a maximal -free word over if and match .
= Avoidability index =
Properties
- A pattern is avoidable if is an instance of an avoidable pattern .{{Cite journal|last=Schmidt|first=Ursula|date=1987-08-01|title=Long unavoidable patterns|journal=Acta Informatica|language=en|volume=24|issue=4|pages=433–445|doi=10.1007/BF00292112|s2cid=7928450|issn=1432-0525}}
- Let avoidable pattern be a factor of pattern , then is also avoidable.
- A pattern is unavoidable if and only if is a factor of some unavoidable pattern .
- Given an unavoidable pattern and a symbol not in , then is unavoidable.
- Given an unavoidable pattern , then the reversal is unavoidable.
- Given an unavoidable pattern , there exists a symbol such that occurs exactly once in .
- Let represent the number of distinct symbols of pattern . If , then is avoidable.
Zimin words
{{main|Sesquipower}}
Given alphabet , Zimin words (patterns) are defined recursively for and .
Pattern reduction
= Free letter =
Given a pattern over some alphabet , we say is free for if there exist subsets of such that the following hold:
- is a factor of and ↔ is a factor of and
For example, let , then is free for since there exist satisfying the conditions above.
= Reduce =
A pattern reduces to pattern if there exists a symbol such that is free for , and can be obtained by removing all occurrence of from . Denote this relation by .
For example, let , then can reduce to since is free for .
= Locked =
A word is said to be locked if has no free letter; hence can not be reduced.{{Cite journal|last1=Baker|first1=Kirby A.|last2=McNulty|first2=George F.|last3=Taylor|first3=Walter|date=1989-12-18|title=Growth problems for avoidable words|journal=Theoretical Computer Science|language=en|volume=69|issue=3|pages=319–345|doi=10.1016/0304-3975(89)90071-6|issn=0304-3975|doi-access=free}}
= Transitivity =
Given patterns , if reduces to and reduces to , then reduces to . Denote this relation by .
Graph pattern avoidance<ref name=":4">{{Cite journal|last=Grytczuk|first=Jarosław|date=2007-05-28|title=Pattern avoidance on graphs|journal=Discrete Mathematics|series=The Fourth Caracow Conference on Graph Theory|language=en|volume=307|issue=11|pages=1341–1346|doi=10.1016/j.disc.2005.11.071|issn=0012-365X|doi-access=free}}</ref>
= Avoidance / Matching on a specific graph =
= Pattern chromatic number =
The pattern chromatic number is the minimal number of distinct colors needed for a -free vertex coloring over the graph .
Let where is the set of all simple graphs with a maximum degree no more than .
Similarly, and are defined for edge colorings.
= Probabilistic bound on ''{{pi}}{{sub|p}}(n)'' =
= Explicit colorings =
Given a pattern such that is even for all , then for all , where is the complete graph of vertices.
Given a pattern such that , and an arbitrary tree , let be the set of all avoidable subpatterns and their reflections of . Then .
Given a pattern such that , and a tree with degree . Let be the set of all avoidable subpatterns and their reflections of , then .
Examples
- The Thue–Morse sequence is cube-free and overlap-free; hence it avoids the patterns and .
- A square-free word is one avoiding the pattern . The word over the alphabet obtained by taking the first difference of the Thue–Morse sequence is an example of an infinite square-free word.{{Cite book|url=https://books.google.com/books?id=3w9eR3u8GN4C&q=Combinatorics+on+words.+Christoffel+words+and+repetitions+in+words+2009|title=Combinatorics on Words: Christoffel Words and Repetitions in Words|publisher=American Mathematical Soc.|isbn=978-0-8218-7325-0|pages=97|language=en}}{{Cite book|last=Fogg|first=N. Pytheas|url=https://books.google.com/books?id=qBIsuwEACAAJ&q=Substitutions+in+dynamics,+arithmetics+and+combinatorics.|title=Substitutions in Dynamics, Arithmetics and Combinatorics|date=2002-09-23|publisher=Springer Science & Business Media|isbn=978-3-540-44141-0|pages=104|language=en}}
- The patterns and are unavoidable on any alphabet, since they are factors of the Zimin words.{{Cite book|last1=Allouche|first1=Jean-Paul|url=https://books.google.com/books?id=2ZsSUStt96sC&dq=Automatic+Sequences%3A+Theory%2C+Applications%2C+Generalizations&pg=PR13|title=Automatic Sequences: Theory, Applications, Generalizations|last2=Shallit|first2=Jeffrey|last3=Shallit|first3=Professor Jeffrey|date=2003-07-21|publisher=Cambridge University Press|isbn=978-0-521-82332-6|pages=24|language=en}}
- The power patterns for are 2-avoidable.
- All binary patterns can be divided into three categories:{{cite book|last1=Lothaire|first1=M.|url=https://archive.org/details/algebraiccombina0000loth|url-access=registration|title=Algebraic Combinatorics on Words|publisher=Cambridge University Press|year=2002|isbn=9780521812207}}
- are unavoidable.
- have avoidability index of 3.
- others have avoidability index of 2.
- has avoidability index of 4, as well as other locked words.
- has avoidability index of 5.{{Cite journal|last=Clark|first=Ronald J.|date=2006-04-01|title=The existence of a pattern which is 5-avoidable but 4-unavoidable|journal=International Journal of Algebra and Computation|volume=16|issue=2|pages=351–367|doi=10.1142/S0218196706002950|issn=0218-1967}}
- The repetitive threshold is the infimum of exponents such that is avoidable on an alphabet of size . Also see Dejean's theorem.
Open problems
References
{{reflist}}
- {{cite book | last1 = Allouche | first1 = Jean-Paul | last2 = Shallit | first2 = Jeffrey | author2-link = Jeffrey Shallit | isbn = 978-0-521-82332-6 | publisher = Cambridge University Press | title = Automatic Sequences: Theory, Applications, Generalizations | year = 2003 | zbl=1086.11015 }}
- {{cite book | last1=Berstel | first1=Jean | last2=Lauve | first2=Aaron | last3=Reutenauer | first3=Christophe | last4=Saliola | first4=Franco V. | title=Combinatorics on words. Christoffel words and repetitions in words | series=CRM Monograph Series | volume=27 | location=Providence, RI | publisher=American Mathematical Society | year=2009 | isbn=978-0-8218-4480-9 | zbl=1161.68043 }}
- {{cite book | last=Lothaire | first=M. | author-link=M. Lothaire | title=Algebraic combinatorics on words | others=With preface by Jean Berstel and Dominique Perrin | edition=Reprint of the 2002 hardback | series=Encyclopedia of Mathematics and Its Applications | volume=90| publisher=Cambridge University Press | year=2011 | isbn=978-0-521-18071-9 | zbl=1221.68183 }}
- {{cite book | last=Pytheas Fogg | first=N. | editor1-first=Valérie | editor1-last = Berthé |editor1-link = Valérie Berthé| editor2-last = Ferenczi | editor2-first= Sébastien | editor3-last= Mauduit | editor3-first= Christian | editor4-last= Siegel | editor4-first= A. | title=Substitutions in dynamics, arithmetics and combinatorics | series=Lecture Notes in Mathematics | volume=1794 | location=Berlin | publisher=Springer-Verlag | year=2002 | isbn=3-540-44141-7 | zbl=1014.11015 }}