Topics Automata Theory Grammars and Languages Complexities Formal Languages and Automata Theory 1 Why Automata Theory? To study abstract computing devices which are closely related to today’s computers. A simple example of finite state machine: 1 start on off 1 There are many different kinds of machines. Formal Languages and Automata Theory 2 Another Example 1 0 start 0 off off 1 on 0 1 When will this be on? Try 100, 1001, 1000, 111, 00, … Formal Languages and Automata Theory 3 Grammar and Languages Grammars and languages are closely related to automata theory and are the basis of many important software components like: Compilers and interpreters Text editors and processors Search engines System verification components Formal Languages and Automata Theory 4 Complexities Study the limits of computations. What kinds of problems can be solved with a computer? What kinds of problems can be solved efficiently? Formal Languages and Automata Theory 5 Preliminaries Alphabets Strings Languages Problems Formal Languages and Automata Theory 6 Alphabets An alphabet is a finite set of symbols. Usually, use to represent an alphabet. Examples: = {0,1}, the set of binary digits. = {a, b, … , z}, the set of all lower-case letters. = {(, )}, the set of open and close parentheses. Formal Languages and Automata Theory 7 Strings A string is a finite sequence of symbols from an alphabet. Examples: 0011 and 11 are strings from = {0,1} abc and bbb are strings from = {a, b, … , z} (()(())) and )(() are strings from = {(, )} Formal Languages and Automata Theory 8 Strings Empty string: Length of string: |0010| = 4, |aa| = 2, ||=0 Prefix of string: aaabc, aaabc, aaabc Proper prefix of string: aaabc, aaabc Suffix of string: aaabc, aaabc, aaabc Proper suffix of string: aaabc, aaabc Substring of string: aaabc, aaabc, aaabc Formal Languages and Automata Theory 9 Strings Concatenation: =abd, =ce, =abdce Exponentiation: =abd, 3=abdabdabd, 0= Reversal: =abd, R = dba k = set of all k-length strings formed by symbols in e.g., ={a,b}, 2={ab, ba, aa, bb}, 0={} What is 1? Is 1 different from ? How? Formal Languages and Automata Theory 10 Strings Kleene Closure * = 012… = k0 k e.g., ={a, b}, * = {, a, b, ab, aa, ba, bb, aaa, aab, abb, … } is the set of all strings formed by a’s and b’s. + = 123… = k>0 k i.e., * without the empty string. Formal Languages and Automata Theory 11 Languages A language is a set of strings over an alphabet. Examples: ={(, )}, L1={(), )(, (())} is a language over . ={a, b, c, … , z}, the set L of all legal English words is a language over . The set {} is a language over any alphabet. What is the difference between and {}? Formal Languages and Automata Theory 12 Languages Other Examples: ={0, 1}, L={0n1n | n1} is a language over consisting of the strings {01, 0011, 000111, … } ={0, 1}, L = {0i1j | ji0} is a language over consisting of the strings with some 0’s (possibly none) followed by at least as many 1’s. Formal Languages and Automata Theory 13 Problems In automata theory, a problem is to decide whether a given string is a member of some particular language. This formulation is general enough to capture the difficulty levels of all problems. Formal Languages and Automata Theory 14 Finite Automata ( or Finite State Machines) This is the simplest kind of machine. We will study 3 types of Finite Automata: Deterministic Finite Automata (DFA) Non-deterministic Finite Automata (NFA) Finite Automata with -transitions (-NFA) Formal Languages and Automata Theory 15 Deterministic Finite Automata (DFA) We have seen a simple example before: 1 start on off 1 There are some states and transitions (edges) between the states. The edge labels tell when we can move from one state to another. Formal Languages and Automata Theory 16 Definition of DFA A DFA is a 5-tuple (Q, , , q0, F) where Q is a finite set of states is a finite input alphabet is the transition function mapping Q to Q q0 in Q is the initial state (only one) F Q is a set of final states (zero or more) Formal Languages and Automata Theory 17 Definition of DFA For example: 1 start on off 1 Q is the set of states: {on, off} is the set of input symbols: {1} is the transitions: off 1 on; on 1 off q0 is the initial state: off F is the set of final states (double circle): {on} Formal Languages and Automata Theory 18 Definition of DFA Another Example: 1 start 0 q0 1 0 q2 q1 0 1 What are Q, , , q0 and F in this DFA? Formal Languages and Automata Theory 19 Transition Table For the previous example, the DFA is (Q,,,q0,F) where Q = {q0,q1,q2}, = {0,1}, F = {q2} and is such that Inputs States q0 q1 *q2 0 q1 q2 q1 1 q0 q0 q0 Note that there is one transition only for each input symbol from each state. Formal Languages and Automata Theory 20 Language of a DFA Given a DFA M, the language accepted (or recognized) by M is the set of all strings that, starting from the initial state, will reach one of the final states after the whole string is read. For example, the language accepted by the previous example is the string that ends with 00 Formal Languages and Automata Theory 21 DFA Example Consider the DFA M=(Q,,,q0,F) where Q = {q0,q1,q2,q3}, = {0,1}, F = {q0} and is: Inputs States q0 q1 q2 q3 0 q2 q3 q0 q1 Start 1 q1 q0 q3 q2 OR q0 1 1 0 0 q2 q1 0 0 1 1 q3 We can use a transition table or a transition diagram to specify the transitions. What input can take you to the final state in M? Formal Languages and Automata Theory 22