An automata for a Language scans the input string from left to right, processing one symbol at each step, and once finished, either rejects or accepts . That is, whether belongs to or not.
State
Whenever reads a symbol, the state of the automata may or may not change. We can classify into two types of automata based on its number of states: finite or infinite state automata.
Initial and accepting states
If is the set of all states of , one of these states has to be the initial state, usually denoted . We can then have one or more accepting states. If after finishing string our automata ends up at an accepting state, then we say that is accepted and belongs in the Language. Otherwise, is rejected.
Finite-state automata
If automata has finite state, it means there is a fixed number of positions that the machine can be at, at all possible times.
We transition to and from states every time a new symbol is processed. This determines the set of all possible next states, or if we move at all.
Deterministic or not?
We can further differentiate automata based on the number of transitions out of a state. If there’s only one, then we can deterministically say what our next state is. This is known as a DFA. Otherwise, it’s a nondeterministic finite automata.
Proving lower bound for number of automata states
Lemma 1
If a DFA accepts a Language and strings and are distinguishable with respect to , then
Proof by contradiction. Assume and and are distinguishable. By definition, there exists a string such that one of and is in , and the other is not.
However, will lead both and to the same state. either accepts or rejects both and . Therefore we have a contradiction.
Lemma 2
If is a set of pairwise, distinguishable strings, then every DFA for must have at least states.
This follows from the previous lemma that states that distinguishable strings will end up in different states, therefore there must exist at least of them.
Example
and
Consider . All of these strings are distinguishable with respect to . This means that the DFA for this set must have at least four states.
Example
and for each number , let
Here, every symbol up to the th symbol doesn’t really matter, but we have to transition through them. Then depending on or , we need two more states (one accepting, one rejecting). Therefore we need different states, because . we need an extra for .
We showed that is sufficient i.e. it’s enough. Now we need to show that is also necessary i.e. we can’t possibly use less states.
Showing necessity (at least )
The general strategy is to convert each of the states into a possible string that represents the state, and prove that every pair of strings is pairwise distinguishable. We may need to case on the strings in because they have different properties.
Example
Consider the set of strings . we have to prove that every pair of strings in this set is distinguishable w.r.t. .