On the Provability of Consistency of PV Ebrahim A Larijani Submitted to the University of Wales in fulfilment for the Degree of Master of Research in Logic and Computation Department of Computer Science Swansea University October 2008 Declaration This work has not been previously accepted in substance for any degree and is not being concurrently submitted in candidature for any degree. Signed ............................................................ Date ............................................................ (candidate) Statement 1 This thesis is the result of my own investigations, except where otherwise stated. Other sources are acknowledged by footnotes giving explicit references. A bibliography is appended. Signed ............................................................ Date ............................................................ (candidate) Statement 2 I hereby give my consent for my thesis, if accepted, to be available for photocopying and for inter-library loan, and for the title and summary to be made available to outside organisations. Signed ............................................................ Date ............................................................ (candidate) Abstract This thesis is a contribution to the separation problem of Bounded Arithmetic. It is shown the consistency of induction free fragment of the equational theory PV of polynomially verifiable equations is not provable within PV. The main body of the thesis is a workout and improvement of the corresponding result by Buss and Ignjatovic [6]. The improvement has done by dropping the axiom concerning squaring function from PV. Contents 1 Introduction 1.1 Bounded Arithmetic(BA) . . . . . . . 1.2 BA hierarchy . . . . . . . . . . . . . . 1.3 Polynomial Hierarchy . . . . . . . . . 1.3.1 PH (machine dependent ) . . 1.3.2 PH (machine independent) . 1.4 Separation of BA hierarchy and PH . 1.5 Separation problem and consistency 1.6 Main results and structure of thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 4 5 6 6 6 9 10 12 2 Proof System PV 2.1 Language of PV . . . . . . . . . . . . . . . . 2.2 Axioms of PV . . . . . . . . . . . . . . . . . . 2.3 Proofs in PV . . . . . . . . . . . . . . . . . . 2.3.1 Structure of Proofs in PV . . . . . . 2.3.2 E − , P V − , P V . . . . . . . . . . . . . . 2.3.3 Interpretation of formulae as terms . 2.4 Arithmetisation of PV . . . . . . . . . . . . . 2.4.1 Gödelisation of Lp . . . . . . . . . . . 2.4.2 Gödelisation of Proofs . . . . . . . . 2.4.3 Provability Predicate for PV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 15 17 19 19 22 23 25 26 27 28 3 Examples of upper bounds for the length of proofs in P V − 29 4 Speed-up of induction for P V − 4.1 Speed-up of induction with quantifiers . . . . . . . . . . . . . . 4.2 Speed-up of induction for quantifier free equational theories . 4.3 Provability of consistency of P V − . . . . . . . . . . . . . . . . . 38 38 40 54 1 . . . . . . . . . . . . . . . . . . . . . . . . 4.4 Future works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 5 Appendix 58 Bibliography 59 2 Chapter 1 Introduction Logical complexity theory consists of logical frameworks to study problems in computational complexity. Such logical approaches give us better understanding of computational complexity classes like P, NP, co-NP, etc. which are first defined by S.Cook and L.Levin precisely [9] their investigations into the complexity theory raised famous open problem whether P equals NP. One framework to tackle latter question regarding proof complexity is equational theory PV that has been introduced by Cook [10] and deals with feasibly constructive proofs for reasoning about equations between polynomial time computable functions. In another aspect Buss showed [2] theories of Bounded Arithmetic (subsytems of Peano Arithmetic)are strongly related to the P V . It has been shown [12, 11] that separation problem for complexity classes (P, N P, co−N P, ...)are strongly related to the separation problem of Bounded Arithmetic theories which means whether the Bounded Arithmetic hierarchy is strict? A possible way of studying separation problem is to examine Gödel type consistency statements for Bounded Arithmetic theories. In this chapter, first we introduce Bounded Arithmetic and polynomial hierarchy and their relation. This will be followed by an overview of main results in separation problem. 3 1.1 Bounded Arithmetic(BA) Definition 1. We define language of BA, Lb as follows: 1 Lb = {0, S, +, ⋅, ⌊ a⌋, ♯, ≤, ∣a∣} 2 where 0, S, +, ⋅, ⌊ 21 a⌋ and ≤ are constant zero, successor function, multiplication, addition, shift right function and less-than-or-equal-to relation. ∣a∣ denotes the length of binary representation of a and a ♯ b = 2∣a∣⋅∣b∣ . In BA all quantifiers are bounded : ∀x ≤ t, ∃x ≤ t or sharply bounded: ∀x ≤ ∣t∣, ∃x ≤ ∣t∣. Now for defining Bounded Arithmetic hierarchy we need some definitions: Definition 2. For quantifier free formulae ϕ in the language of BA, the class of Σbi -fromulae(Πbi -formulae) is defined as follows: Σb1 ∶ ∃x1 ≤ s1 ∀y ≤ ∣t∣ϕ(x1 , y) Σb2 ∶ ∃x1 ≤ s1 ∀x2 ≤ s2 ∃y ≤ ∣t∣ϕ(x1 , x2 , y) ⋮ By replacing ∃and∀ in the above Σbi -formulae we obtain: Πb1 , Πb2 , .... The difference between Bounded Arithmetic and Peano Arithmetic is that induction in BA is restricted only to the bounded formulae Σbi or Πbi . In the following we define axiom scheme of BA. Definition 3. We define BA induction axioms as follows: Let Ψ be either the class of Σbi -formulae or Πbi -formulae and let A(x) ∈ Ψ then 1. Ψ − IN D ∶ A(0) ∧ (∀x)(A(x) ⊃ A(Sx)) ⊃ (∀x)A(x) 2. Ψ − P IN D ∶ A(0) ∧ (∀x)(A⌊ 12 x⌋) ⊃ A(x)) ⊃ (∀x)A(x) 3. Ψ − LIN D ∶ A(0) ∧ (∀x)(A(x) ⊃ A(Sx)) ⊃ (∀x)A(∣x∣) Now we can define theories of Bounded Arithmetic by using BASIC axioms (c.f. appendix A) and above induction schemes: Definition 4. S2i is defined as a first order theory over the language Lb and it has following sets of axioms: 4 • BASIC • Σbi − P IN D T2i is defined as a first order theory over the language Lb and it has following sets of axioms: • BASIC • Σbi − IN D Definition 5. S2 = ⋃ S2i i T2 = ⋃ T2i i 1.2 BA hierarchy With the above definition of S2i and T2i we can form a hierarchy of BA theories. In this line Buss stated following theorem to illustrate relationships between bounded arithmetic theories [2] : Theorem 1. For i ≥ 1, S2i and T2i in the Definition 4 we have T2i ⊢ S2i S2i ⊢ T2i−1 In particular we have following sequence of BA theories S21 ⊆ T21 ⊆ S22 ⊆ T22 ⊆ . . . It has been shown in [2] that S21 and equational theory P V are strongly related. Furthermore Buss proved some conservation results between P V and S21 . In BA hierarchy we are interested to know which functions can be defined at each level, in particular, Σbi -definable functions in S2i . Definition 6. Let A(x, y) be a Σbi (Πbi ) formula. A function f ∶ N → N is called Σbi -definable in S2i (T2i ) if 1. ∀x, A(x, f (x)) must be a valid formula . 2. S2i ⊢ (∀x)(∃y)A(x, y) 3. S2i ⊢ (∀x)(∀y)(∀z)(A(x, y) ∧ A(x, z) ⊃ y = z) 5 1.3 Polynomial Hierarchy Polynomial hierarchy (PH) first introduced by L.Stockmeyer [15] to generalise computational complexity classes P, NP, co-NP and beyond.There are two equivalent approaches to define PH, first approach is machine dependent definition in terms of Turing machine and the second is machine independent definition by using limited iteration and bounded quantification to determine complexity classes. 1.3.1 PH (machine dependent ) In this section we define PH with the aid of Turing machines. First we define classes for language or decision problems: [14] Definition 7. For every i ≥ 1 , Let L be a binary language (decision problem) we say L ∈ Σpi if there exist a polynomial time Turing machine M and a polynomial q such that : x ∈ L ⇔ ∃u1 ∈ {0, 1}q(∣x∣) ∀u2 ∈ {0, 1}q(∣x∣) . . . Qi ui ∈ {0, 1}q(∣x∣) M (x, u1 , . . . , ui ) = 1 Where Qi denotes ∃ or ∀ depending on whether i is even or odd. similarly by changing universal and existential quantifiers we obtain class of languages or decision problems, Πpi . Now we define PH: P H = ⋃ Σpi i p F P Σi denotes the class of functions which are computable by a polynomial time Turing machine with an oracle in Σpi . 1.3.2 PH (machine independent) Polynomial Hierarchy can be obtained by defining certain kinds of function classes. For this in this section first we define an algebra on a set of initial functions by operations “limited iteration” and composition then we introduce the set of “polynomial time growth rate” functions and finally we define PH in this setting.[2] Definition 8. Set of basic functions B: • 0 (the constant 0) 6 • x ↦ Sx (the successor function) • x ↦ ⌊ 12 x⌋ (the shift right function) • x ↦ 2 ⋅ x (shift left function) ⎧ ⎪ ⎪1 • (x, y) ↦ x ≤ y = ⎨ ⎪ ⎪ ⎩0 x ≤ y, x>0 ⎧ ⎪ ⎪y • (x, y, z) ↦ Choice(x, y, z) = ⎨ ⎪ ⎪ ⎩z x > 0, x=0 Definition 9. Let k ≥ 0, let g ∶ Nk → N and h ∶ Nk+2 → N be arbitrary functions let p and q be positive polynomials.We say f ∶ Nk → N is defined from g, h and time bound p and space bound q by “limited iteration” if the following conditions hold. For τ ∶ Nk+1 → N τ (x1 , . . . , xk , 0) = g(x1 , . . . , xk ) τ (x1 , . . . , xk , n + 1) = h(x1 , . . . , xk , n, τ (x1 , . . . , xk , n)) → → → (∀n ≤ p(∣Ð x ∣)(∣τ (Ð x , n)∣ ≤ q(∣Ð x ∣) → → → f (Ð x ) = τ (Ð x , p(∣Ð x ∣)) Definition of limited iteration is similar to the notion of limited recursion on notation which we will use it in the next chapter in order to define the language of PV. Definition 10. A function f ∶ Nk → N has polynomial growth rate iff there is a positive polynomial p(a polynomial with only nonnegative integer coef→ → → ficients.) such that for all Ð x , we have ∣f (Ð x )∣ < p(∣Ð x ∣). Let C be a set of functions of polynomial growth rate. The Polynomial-time closure of C, PTC(C), is the smallest class of functions which (1) contains C and B and (2) is closed under composition and limited iteration. Definition 11. PRED(C): is the set of predicates (boolean functions ) in the set a of functions C. 7 In particular, if Q and R are functions then (∀y ≤ Q(x1 , . . . , xn ))R(x1 , . . . , xn , y) is predicate (boolean function) which has the value 1 iff for all y ≤ Q(x1 , . . . , xn ) the value of R(x1 , . . . , xn , y) is nonzero. Accordingly we could define bounded quantification for existential quantifiers. Now we can explain “polynomially bounded quantification” and “logarithmically bounded quantification” by using different bounds for quantifiers: Definition 12. Let C be a set of functions closed under composition and Let ∣x∣ denote the length of binary representation of x . Then P B∃(C) is the set of predicates Q such that: 1. Q ∶ Ni → N for some i ∈ N 2. There is an R ∈ P RED(C) and a positive polynomial p such that for → all Ð x: Q(x1 , . . . , xk ) = ∃y ≤ 2p(∣x1 ∣,...,∣xk ∣) R(x1 , . . . , xk , y) P B∀(C) can be defined in a similar way by replacing existential quantifier with universal quantifier. Definition 13. Let C be a set of functions closed under composition. Then LB∃(C) is the set of predicates Q such that 1. Q ∶ Ni → N for some i ∈ N 2. There is an R ∈ P RED(C) and a positive polynomial p such that for → all Ð x: Q(x1 , . . . , xk ) = ∃y ≤ p(∣x1 ∣, . . . , ∣xk ∣)R(x1 , . . . , xk , y) LB∀(C) can be defined in a similar way by replacing existential quantifier with universal quantifier. Now we are ready to define Polynomial Hierarchy according to [2]: Definition 14. Polynomial Hierarchy(PH) is defined by induction on k as follows 1. ◻p0 is the smallest set of functions containing B and closed under composition, LB∃ and LB∀. 2. ∆p0 = Σp0 = Πp0 = P RED(◻p0 ). 8 3. ◻pk+1 = P T C(Σpk ) 4. ∆pk+1 = P RED(◻pk+1 ) 5. Σpk+1 = P B∃(∆pk+1 ) 6. Πpk+1 = P B∀(∆pk+1 ) 7. P H = ⋃k Σpk p In particular in the above definition we have ◻pk+1 = F P Σi (see last subsection). Furthermore in [15] it is shown that these two definition of PH (machine dependent and machine independent) are equivalent.(see following figure from [4] ) 1.4 Separation of BA hierarchy and PH One major open problem in Bounded Arithmetic is if the inclusion of theories as described in 1.2, is proper? or the hierarchy of Bounded Arithmetic could 9 be separated? It is shown by Buss et al. that hierarchy of BA and PH are strongly related in the following we present some results which show this relation. Theorem 2. Class of Σbi definable functions in S2i is equal to ◻pi (F P Σi−1 ) b This implies for example, in the case i = 1, the class of Σb1 definable functions in S21 is equivalent to the class of functions ◻p1 which are computed by a polynomial time Turing machine(F P ). For another example, the class of Σb2 definable functions in S22 is equivalent to the class of functions in ◻p2 which can be computed by a polynomial time Turing machine with an oracle in N P . We conclude this section by presenting following important theorems concerning the relation of separation problem in BA hierarchy and PH. Theorem 3. [12, 11] If the levels of the Polynomial time Hierarchy (PH) are separated, then the levels of bounded arithmetic theories (BA) are separated as well. In particular, if Σpi+2 ≠ Πpi+2 then S2i ≠ S2i+1 . Theorem 4. [17, 3] BA collapses iff PH collapses provably in BA 1.5 Separation problem and consistency As mentioned before in order to approach separation problem in BA hierarchy one way is to study Gödel type consistency statements of BA theories. We know from Gödel incompleteness theorem that, for subsystems of PA, namely IΣ1 where induction is restricted to the Σ1 formulae (of the form ∃xφ(x) where φ is bounded formula) we have: IΣ1 ⊬ Con(IΣ1 ) Where Con(IΣ1 ) denotes consistency of IΣ1 . Also for system IΣ2 where induction is applied on Σ2 formulae (of the form ∃x∀yφ(x, y)) we know that: IΣ2 ⊢ Con(IΣ1 ) Buss [2] showed the general notion of consistency is too strong since he proved: S2 ⊬ Con(S2−1 ) 10 where S2−1 is induction-free fragment of S21 . This fact lead Buss to introduce the notion of bounded consistency (BDCon) which is concerned with proofs containing only bounded formulae (formula which all quantifiers have to be bounded). Buss has shown in [2] that for at most one i: S2i+1 ⊢ BDCon(S2i ) This has been improved by Pudlak in [13]: S2 ⊬ BDCon(S21 ) In this line also Takeuti showed S2i can not prove the consistency of S2−1 where only Σbi+5 formulae appear.[16] On the other hand for equational theory P V Cook proved :[10] P V ⊬ Con(P V ) By similar argument to the Gödel incompleteness theorem. Furthermore Buss and Ignjatovic showed in [6] that even for induction-free fragment of P V , P V − , consistency is not provable in P V : P V ⊬ Con(P V − )(∗) The above negative results for provability of consistency led Takeuti to raised following conjecture [7]: “Let S2−∞ be the equational theory involving equations s = t, when s and t are closed terms in the language of S2 with natural rules based on the recursive definition of function symbols then S2 ⊬ Con(S2−∞ ).” The consequence of this conjecture would be impossibility of positive answer to the separation problem of BA by using consistency statements. Despite that in [1] it is shown that Takeuti conjecture is not true and consistency of S2−∞ is provable in S21 . In fact in [1] it is shown that S21 proves more general claim for consistency of “pure” equational theory S2−∞ . In particular, consistency of “pure” equational theory in the language of P V which is based solely on the recursive definition of function symbols is provable in S21 . This seems to contradict (∗) since S21 and P V are quiet similar (conservativity of S2 (P V ) over P V [2]). A deeper look reveals that these two results are sligthy different: 11 1. In [1] consistency statements consider proofs in pure equational theory .i.e. lines in proofs are equations and rules are from equational logic. 2. In (∗) [6] proofs considered by consistency statements have more liberal form such that each line consists of sequent and each sequent has formulae with boolean conjections. Also rules come from propositional logic and equational logic. Conclusion: The above comparison shows that consistency statements of the kind Con(P V − ) may be a candidate for separation of BA theories. 1.6 Main results and structure of thesis This thesis is a work out of results in [6] section (1,4) , regarding “unprovability of consistency of P V − ”. The contribution of the thesis is threefold: 1. Precision of arguments in [6] by providing full backgrounds which are necessary to get results. 2. New observation from proofs in [6] that theory P Vkr which can be obtained by restricting induction axioms to the “restricted k-induction” and has following schematic form ϕ(b) → ϕ(b + 1) ϕ(0) → ϕ(∣t∣k ) where ∣t∣k denotes the k-th binary length of term t, has the same expressive power to the P V (c.f. Theorem 12). 3. Improving results in [6] by weakening theory P V − . For this we dropped function symbol sq denoting squaring function, and its corresponding axiom. Furthermore we argued that the consistency of resulting theory which we also denote by Con(P V − ) is still unprovable in P V . So the structure of thesis as follows: In Chapter 2 we define theory P V and P V − precisely by fixing the language, axiom scheme, deduction scheme and Gödelisation of proof system P V . In Chapter 3 we show some examples of feasible proofs in P V − which we need them in Chapter 4. In particular Theorem 9 is proved in a new form different 12 than original proof (c.f. Theorem 8) [6]. Chapter 4 will discuss speed of induction in equational theories. This is based on speed-up technique in [6] but in a new form by introducing “restricted k-induction”. This is followed by a brief discussion of formalisation of proofs in P V − in a metalevel which enables us to get statements about consistency of P V − 13 Chapter 2 Proof System PV The proof system PV has been introduced by Cook [10] to capture the notion of feasibility of computation. The first motivation of Cook for developing PV came from the main question in complexity theory that whether P equals NP. His approach is to show that they are not equal, by trying to prove that the set of tautologies is not in NP or equivalently there is no propositional proof system with short proofs for all tautologies. The second motivation came from constructive mathematics; A constructive proof of a statement ∀xA provides efficient methods of finding a proof of A for each value of x, but it does not say anything about growth of proof’s length in terms of x and this led Cook to the notion of feasibly constructive proofs: If equation f (x) = g(x) has a proof Π in PV then there is a polynomial PΠ such that Π provides a uniform method of verifying f (x0 ) = g(x0 ), for a given natural number x0 , in PΠ (∣x0 ∣) steps. In this case we say f (x) = g(x) is Polynomially Verifiable . Like Church’s thesis for recursive functions, Cook proposed Verifiability thesis for equations in PV[10]: “An equation t = u of PV is provable in PV if and only if it is Polynomially Verifiable.” Cook used pure equational theory for the definition of PV. Obviously working with that proof system is very cumbersome therefore he developed proof system PV1 which allows logical connectives such as conjuction, negation and etc. Also in[10] it is shown that every equation is provable in PV1 if and only if it is provable in PV. We will develop PV1 further, and add new 14 symbols to the language as well as new set of axioms to obtain proofs in a more liberal way. 2.1 Language of PV In order to specify the syntax of PV we will add new symbols to the language of Bounded Arithmetic Lb and symbols for all polynomial computable functions. For this we shall introduce a mechanism for recursion called Limited Recursion on Notation (LRN) as well as Cobham’s theorem which ensures that all of polynomial computable functions could be defined in this framework. Definition 15. Limited Recursion on Notation (LRN): Function f (x, y1 , . . . , yn ) is obtained by LRN from functions g(y1 , . . . , yn ) , h(x, z, y1 , . . . , yn ) and k(x, y1 , . . . , yn ) if : 1. f (0, y1 , . . . , yn ) = g(y1 , . . . , yn ) . 2. f (x, y1 , . . . , yn ) = h(x, f (⌊ 12 x⌋, y1 , . . . , yn ), y1 , . . . , yn ) for x > 0 3. f (x, y1 , . . . , yn ) ≤ k(x, y1 , . . . , yn ) There is another characterisation of Limited Recursion on Notation by defining function Min which we normally use it in this thesis. Let M in(a, b) ∶= a (a b) Then we could rewrite LRN as follows: 1. f (0, y) = M in(g(y), k(0, y)) 2. f (x, y1 , . . . , yl ) = M in(h(x, f (⌊ 12 x⌋, y1 , . . . , yl ), y1 , . . . , yl ), k(x, y1 , . . . , yl )) Limited Recursion on Notation defines an algebra for all polynomial computable functions. This means that every polynomial computable functions can be defined by LRN and composition on initial functions. Later we use this fact to define the language of PV. The algebra of polynomial computable functions first studied by Cobham.[8] We state Cobham’s theorem in a sligthtly different way [6] rather than original one by Cobham and Cook [10] . 15 Definition 16. Cobham’s Class is the set of functions f ∶ Nl → N , such that for some Turing machine T and some polynomial p , T computes f (x1 , . . . , xl ) within p(∣x1 ∣ + . . . + ∣xl ∣) steps. Theorem 5. (Cobham’s theorem) Cobham’s Class is the least class of functions which include initial functions of Le and closed under operations LRN and Composition. Definition 17. Le is an extension of the language of Lb (see Definition 1) by the following new symbols: {Exp(a, b), , sq(a), ⟨a, b⟩, (a)1 , (a)2 } where Exp(a, b) = 2min{a,∣b∣} , is limited subtraction , sq(a) = a⋅a and ⟨a, b⟩, (a)1 , (a)2 are pairing function and its corresponding projections. Definition 18. Let ar(f ) denote arity of function symbol f and be defined in usual way. Then the language Lp can inductively be defined. For this the base clause is as follows: 1. Pil , which is the projection function (i.e. Pil (x1 , . . . , xl ) = xi ) and ar(Pil ) = l ) . 2. Zero function 0l , 0l (x1 , . . . , xl ) = 0 3. Le ⊆ Lp . And the inductive clause is : 1. Composition: If h, g1 . . . gk ∈ Lp , ar(h) = k and ar(g1 ) = . . . = ar(gk ) = l then : f ≡ Compk,l (h, g1 . . . , gk ) ∈ Lp and ar(Compk,l (h, g1 . . . , gk )) = l 2. LRN : If g, h, k ∈ Lp ,ar(g) = l , ar(h) = l + 2 then : f ≡ LRN l (g, h, k) ∈ Lp , ar(LRN l (g, h, k)) = l + 1 16 Notation:“≡” in the Definition 18 denotes defining function symbols in the meta language. ⎧ ⎪ ⎪y x = 0 Example:If Cond(x, y, z) = ⎨ then ⎪z x > 0 ⎪ ⎩ Cond ≡ LRN 3 (g, h, k) where g(y, z) = y (P12 ), h(x, w, y, z) = z(P44 ) and k(x, y, z) = y + z with this example we can easily defined following functions which we need them later: sg(x) = Cond(x, 0, s(0)) sg(x) = Cond(x, s(0), 0) Definition 19. By inductive definition we define terms in PV as following: 1. 0 is a term. 2. Every variable x and function symbol of arity 0 are terms. 3. If t1 , . . . , tk be terms then for function symbols f of arity k > 0, f (t1 , . . . , tk ) is a term. Definition 20. By inductive definition we define formulae in PV as following: 1. For terms t and u, t = u and t ≤ u are formulae. 2. If A and B are formulae then ¬A, A ⊃ B, A ∨ B, A ∧ B are formulae. 2.2 Axioms of PV Bounded Arithmetic is axiomatized by the finite set BASIC which is defined in [2][p 32]. Furthermore in this subsection we define theories corresponding to the languages in previous subsection . We will add new axioms to the BASIC to obtain the axiom scheme of PV which is defined by Lp . Earlier version of PV was a pure equational theory and had equality axioms only together with induction and substitution. Our PV also uses equality axioms as a part of its axiom scheme as well as new axioms which are introduced in the following definitions: 17 Definition 21. Equality axioms 1. t = t (Reflexivity) 2. s = t ⊃ t = s(Symmetry) 3. (s = t ∧ t = u) ⊃ s = u(Transitivity) 4. f ∈ Lp , ar(f ) = k (⋀ki=1 si = ti ) ⊃ f (s1 , . . . , sk ) = f (t1 , . . . , tk ) (Function compatibility) 5. in particular for the predicate symbol ≤ we have: s1 = t1 , s2 = t2 , t1 < t2 ⊃ s1 < s2 Definition 22. BASICe : The set of axioms BASICe can be obtained from the set of axiom BASIC by adding following new axioms: 1. ∣a∣ ≤ a 2. ∣a ⋅ b∣ ≤ ∣a∣ + ∣b∣ 3. Exp(a, 0) = 1 4. Exp(0, c) = 1 5. a + b ≤ ∣c∣ ⊃ Exp(a + b, c) = Exp(a, c) ⋅ Exp(b, c) 6. c ≠ 0 ⊃ (Exp(1, c) = 2 ∧ Exp(a, c) < 2 ⋅ c) 7. a ≤ b ↔ a b = 0 8. a b = 0 ↔ (b a) + a = b 9. sq(a) = a ⋅ a 10. ⟨(a, b⟩)1 = a 11. ⟨(a, b⟩)2 = b 12. ⟨(a)1 , (a)2 ⟩ = a 13. ∣⟨a, b⟩∣ ≤ 2 ⋅ (1 + ∣a∣ + ∣b∣) 18 14. ⟨a, b⟩ = ⌊ 12 ((a2 + b2 + 2a ⋅ b + a + 1) b))⌋ 15. t ≤ Exp(∣t∣, t) 16. ∣t∣ ≤ u ⊃ Exp(u, t) = Exp(∣t∣, t) Definition 23. BASICp is inductively defined as follows, the base clause is: BASICe ⊆ BASICp and the inductive clause is: 1. If f ∈ Lp and f ≡ Compk,l (h, g1 . . . , gk ) then: f (x, y1 , . . . , yl ) = h(g1 (y1 , . . . , yl ), . . . , gk (y1 , . . . , yl )) is in BASICp 2. If f ∈ Lp and f ≡ LRN l (g, h, k) then: f (0, y) = M in(g(y), k(0, y)) and x > 0 ⊃ f (x, y1 , . . . , yl ) = M in(h(x, f (⌊ 12 x⌋, y1 , . . . , yl ), y1 , . . . , yl ), k(x, y1 , . . . , yl )) are in BASICp 2.3 Proofs in PV In this section we describe structure of proofs in PV then we define P V − , and PV precisely. 2.3.1 Structure of Proofs in PV We use Gentzen sequent calculus PK[5] to define proofs in PV and its fragments. Each line of proofs in this system has expresions which are called “Sequents”. Eeach sequent consists of P V -formulae, A1 , . . . Ak and B1 , . . . Bl and sequent arrow “→”: A i , . . . , A k →B 1 , . . . B l The meaning of such sequent can be interpreted by the following formula: k l i=1 j=1 ⋀ Ai ⊃ ⋁ Bj Every proof in this system is a tree which every node represents a sequent. Every leaf in this tree has a sequent of the form →A where A is an axiom in 19 BASICp or logical axiom of the form A→A where A is a arbitrary atomic formula. Sequents can be derived by applying following inference rules to their antecedent sequents. Finally the root in the proof’s tree represents the endsequent of proof. Rules of inference in the sequent calculus PK are classified to the three groups: Weak Structural Rules, Cut Rule and Propositional Rules and illustrated by sequents as follows:(Γ, ∆ are sequences of formulae and A, B are PV-formulae) • Weak Structural Rules: (1) Exchange:left (2) Exchange:right 20 Γ, A, B, Π→∆ Γ, B, A, Π→∆ Γ→∆, A, B, Λ Γ→∆, B, A, Λ (3) Contraction:left (4) Contraction:right (5) Weakening:left A, A, Γ→∆ A, Γ→∆ Γ→∆, A, A Γ→∆, A Γ→∆ A, Γ→∆ (6) Weakening:right Γ→∆ Γ→∆, A • (7) The Cut Rule Γ→∆, A A, Γ→∆ Γ→∆ • The Propositional Rules: (8) ¬:left Γ→∆, A ¬A, Γ→∆ (9) ¬:right A, Γ→∆ Γ→∆, ¬A (10) ∧ ∶ left A, B, Γ→∆ A ∧ B, Γ→∆ (11) ∧ ∶ right (12) ∨ ∶ left Γ→∆, A Γ→∆, B Γ→∆, A ∧ B A, Γ→∆ B, Γ→∆ A ∨ B, Γ→∆ (13) ∨ ∶ right (14) ⊃∶ left Γ→∆, A, B Γ→∆, A ∨ B Γ→∆, A B, Γ→∆ A ⊃ B, Γ→∆ (15) ⊃∶ right 21 A, Γ→∆, B Γ→∆, A ⊃ B (16) Substitution (17) Induction Γ(a)→∆(a) Γ(t)→∆(t) Γ, A(b)→A(b + 1), ∆ → Γ, A(0)→A(t(Ð a )), ∆ In latter two cases t is an arbitrary term and b is an eigenvariable, that is a variable which must not appear in the lower sequent. Definition 24. Length of proof is the total number of occurrence of sequents in the proof. Definition 25. Size of proof P is the total number of symbols in P and is showed by ∣P ∣. 2.3.2 E −, P V −, P V In this subsection we define equational theories P V , P V − and E − . Every theory has a collection of language, set of axioms and set of inference rules. Proofs in these theories can be obtained by similar method in the subsection 2.3.1 w.r.t the suitable language, set of axioms and rules of inferences. Definition 26. E − is an equational theory which is determined by the language Le , set of axioms BASICe and equality axioms and rules of inference 1-16 (without induction rule) in the subsection 2.3.1 . Definition 27. P V − is an equational theory which is determined by the language Lp , set of axioms BASICp and equality axioms and rules of inference 1-16 (without induction rule) in the subsection 2.3.1 . Definition 28. P V is an equational theory which is determined by the language Lp , set of axioms BASICp and equality axioms and rules of inference 1-17 in the subsection 2.3.1 . Let B be a Lp -formula , we write P V ⊢ B if 22 →B is provable in P V . 2.3.3 Interpretation of formulae as terms In this section we explain how to interpret formulae as terms in P V − . This enables us to define functions with several conditions in P V by using P V formulae. We will use this kind of functions in Chapter 4 to speed-up induction in P V − . To this end first we define following P V functions:(for definition of sg(x) and sg(x) see example in section 2.1) • N OT (x) = 1 x • AN D(x, y) = M in(x, y) • M ax(a, b) = b + (a b) • OR(x, y) = M ax(x, y) ⎧ ⎪ ⎪1 • sg(x) = ⎨ ⎪ ⎪ ⎩0 ⎧ ⎪ ⎪1 • sg(x) = ⎨ ⎪ ⎪ ⎩0 x ≠ 0, x=0 x = 0, x≠0 Definition 29. We inductively define for each P V − f ormula, φ it’s interpretation φI : 1. if φ ≡ a ≤ b, then φI ≡ sg(a b). 2. if φ ≡ a = b, then φI ≡ AN D((a ≤ b)I , (b ≤ a)I ). 3. if φ ≡ ¬ψ, then φI ≡ N OT (ψ I ). 4. if φ ≡ ψ0 ∧ ψ1 , then φI ≡ AN D(ψ0I , ψ1I ). 5. if φ ≡ ψ0 ∨ ψ1 , then φI ≡ OR(ψ0I , ψ1I ) It is easy to check that if Lemma 1. Let φ be a P V − f ormula then: P V − ⊢ φ ↔ φI = 1 23 Proof. This lemma can be proved by induction on the structure of formula φ. For instance we prove the lemma for atomic formula a ≤ b and for the conjunction of two formulae. For atomic case φ ≡ a ≤ b we have from Axiom 7 of BASICe that: PV − ⊢ a ≤ b ↔ a b = 0 so from definition of sg we have: P V − ⊢ a ≤ b ↔ sg(a b) = 1 thus we proved that P V − ⊢ φ ↔ φI = 1. If φ ≡ ψ0 ∧ ψ1 then induction hypothesis implies: P V − ⊢ ψ0 ↔ ψ0I = 1 P V − ⊢ ψ1 ↔ ψ1I = 1 So we get the proof of P V − ⊢ ψ0 , ψ1 →M in(ψ0I , ψ1I ) = 1 P V − ⊢ ψ0 ∧ ψ1 →AN D(ψ0I , ψ1I ) = 1 This finishes the proof for conjunction of formulae. Above discussion helps us to define P V function with several conditions w.r.t P V formulae: Example: Let A be a PV-formula and let: Choice(A, x, y) ≡ x.sg(AI ) + y.sg(AI ) In particular from the Lemma 1 we have P V − ⊢ A→AI = 1 so: P V − ⊢ A→sg(AI ) = 1 P V − ⊢ A→sg(AI ) = 0 with the same argument for ¬A we obtain: P V − ⊢ ¬A→sg(AI ) = 0 P V − ⊢ ¬A→sg(AI ) = 1 Consequently we get the proofs of following sequents: 24 1. P V − ⊢ A→Choice(A, x, y) = x 2. P V − ⊢ ¬A→Choice(A, x, y) = y For example for f (x) such that : ⎧ ⎪ ⎪g(x) if A, f (x) = ⎨ ⎪ ⎪ ⎩h(x) if ¬A f is defined by f (x) = Choice(A, g(x), h(x))andf (x) ∈ Lp . Specifically we have following sequents P V − ⊢ A→f (x) = g(x) P V − ⊢ ¬A→f (x) = h(x) For another example if functions g1 (x), g2 (x), g3 (x) ∈ Lp and formulae A, B ∈ P V then for τ (x) such that: ⎧ g1 (x) if A, ⎪ ⎪ ⎪ ⎪ τ (x) = ⎨g2 (x) if ¬A ∧ B, ⎪ ⎪ ⎪ ⎪ ⎩g3 (x) if ¬A ∧ ¬B we have τ (x) ∈ Lp such that τ (x) = Choice(A, g1 (x), Choice(B, g2 (x), g3 (x))) . Also we get P V − ⊢ A→τ (x) = g1 (x) P V − ⊢ ¬A ∧ B →τ (x) = g2 (x) P V − ⊢ ¬A ∧ ¬B →τ (x) = g3 (x) 2.4 Arithmetisation of PV In order to study consistency of P V and P V − we need to arithmetize metamathematical concepts like proofs and formulas. In this section we describe a coding method for the syntax of PV by assigning Gödel number to symbols in Lp . We explain how to code terms, formulae and proofs in P V to Gödel numbers. Finally we will define provability predicate for P V . 25 2.4.1 Gödelisation of Lp Definition 30. (coding of sequence)To assign Gödel numbers to a sequence of natural numbers we use following method [2]: For the sequence a1 , . . . , an we reverse the order of binary representation of a1 , . . . , an which consists of 0,1 and commas. Then we replace each 0 by “10” and each 1 by “11” and each comma by “00”. Latter string of ones and zeros is binary representation of Gödel number of ⟨a1 . . . , an ⟩. The Gödel number of empty sequence ⟨⟩ = 0 First we assign to each symbol of PV a Gödel number as follows: • Logical Symbols: ¬−0 ⊃−1 ∧−2 ∨−3 (−4 )−5 ,−6 →−7 • Function Symbols: 0−9 S−13 1 ⋅−17 ⌊ 2 a⌋−19 ≤−23 Exp(a, b)−25 sq(a)−29 ⟨a, b⟩−31 (a)2 −35 +−15 ♯−21 −27 (a)1 −33 • Polynomial Computable Functions: As defined before, the language of P V , Lp , has symbols for every polynomial time computable functions. For assigning Gödel number to them we distinguish following cases: If f ≡ Compk,l (h, g1 . . . , gk ) then Gödel number of f is: ⌜f ⌝ = ⟨⟨0, k, l⟩, ⌜h⌝, ⌜g1 ⌝, . . . ⌜gk ⌝⟩ × 2 If f ≡ LRN l (g, h, k) then Gödel number of f is: ⌜f ⌝ = ⟨⟨1, l⟩, ⌜g⌝, ⌜h⌝, ⌜k⌝⟩ × 2 Free Variables: x1 - 37 x2 - 41 x3 - 45 ⋮ 26 Thus for every sequent “Γ→∆” of the form Ai , . . . , Ak →B1 , . . . , Bl we have: ⌜Γ→∆⌝ = ⌜⟨Ai , . . . , Ak →B1 , . . . , Bl ⟩⌝ Also for formula A, l(A) denotes ∣⌜A⌝∣ and for term t, l(t) denotes ∣⌜t⌝∣. 2.4.2 Gödelisation of Proofs As explained in 2.3.1, every proof in P V is a tree where each node corresponds to a sequent like Γ→∆. In order to arithmetize P V − proof s, first we assign to each node Γ→∆ a pair ⟨x, w⟩ where w is the Gödel number of that sequent which is computed according to the last subsection. x shows the number of inference rule which is applied to the immediate upper sequents of Γ→∆. For leaves of proof’s tree where it is a logical axiom of the form A→A or axioms from BASICp or equality axioms of the form →A, x is equal to zero. So far, we have assigned to every node in proof tree a number that codes inference details as well as sequent, now we can easily map every tree to a string by depth first order [2] with the aid of two symbols “[”, “]”, which denote moving down or up in a tree respectively. This can be shown by the following example: a b g c h e The above tree can be map into the following sequence: a[b[gh]c[e]] By assigning Gödel numbers to “[” and “]”, the Gödel number of above tree is equal to the code of following sequence: ⟨⌜a⌝, ⌜[⌝, ⌜b⌝, ⌜[⌝, ⌜g⌝, ⌜h⌝, ⌜]⌝, ⌜c⌝, ⌜[⌝, ⌜e⌝, ⌜]⌝, ⌜]⌝⟩ . Note: For xn = 16 and xn = 17, at every occurrence of substitution rule or induction rule we have to check that whether the substitution or induction are applied to the right variables or not? This can be done by introducing two polynomial computable PV-functions which get the number of sequent Γ→∆ as an input and gives all of occurring variables and terms in the sequent: 27 • V ar(⌜Γ→∆⌝) = {a1 , . . . , an } • T erm(⌜Γ→∆⌝) = {t1 , . . . tn } By above functions and Gödelisation method, verifying that substitution of variables or induction on variables is leading to the lower sequent with expected terms, can be done in polynomial time and that guarantees that substitution and induction are done in a proper way. Notation: For proof P , l(P ) denotes Gödel number of P : l(P ) = ∣⌜P ⌝∣. 2.4.3 Provability Predicate for PV Generally for a proof system, T, provability predicate can be defined as following: Definition 31. The provability predicate for a theory T can be defined as follows: P rovT (x) ≡ ∃yP rfT (y, x) which P rfT (y, x) is proof predicate and it is true when y is the Gödel number of a T − proof of formula with the Gödel number x. Definition 32. For a proof system T , Con(T ), consistency predicate is defined as follows: Con(T ) ≡ ¬P rovT (⌜0 = 1⌝) So by above definition of P rovT (x) we have : Con(P V ) ≡ ∀y¬P rfT (y, ⌜0 = 1⌝) Because our definition of PV is quantifier free, thus we drop ∀ from above formula and define consistency predicate for PV as follows: Con(P V ) ≡ ¬P rfP V (y, ⌜0 = 1⌝) 28 Chapter 3 Examples of upper bounds for the length of proofs in P V − In this chapter we study how to obtain upper bounds for terms in P V − such that the length of proving those upper bounds in P V − be polynomially bounded. We give proof of results in [2] concerning the length of P V − -proofs in detail and we will dicuss about the complexity of such proofs. Definition 33. Terms sq i (x) can be defined inductively as follows: 1. sq 0 (x) = x 2. sq k+1 (x) = sq(sq k (x)) Regarding the above definition we have following theorem: Lemma 2. BASICe ⊢ x ≤ y →sq i (x) ≤ sq i (y) for 0 ≤ i with a proof of length linear in i. Proof. From the set of axioms BASICe we can easily show that: x ≤ y →sq(x) ≤ sq(y) Applying i times substitution in (3.1) yields proof of Lemma 2. Theorem 6. E − ⊢ x ≤ sq m (y), y ≤ sq k (z)→x ≤ sq k+m (z) with a proof of length linear in m. 29 (3.1) Proof. By Lemma 2 we know a ≤ b→sq m (a) ≤ sq m (b) , applying substitution rules two times such that: a ↦ y and b ↦ sq k (z) gives: y ≤ sq k (z)→sq m (y) ≤ sq k+m (z). (3.2) Using the Axiom 8 of BASIC gives: x ≤ sq m (y), sq m (y) ≤ sq k+m (z)→x ≤ sq k+m (z) (3.3) finally applying Cut to the (3.2) and (3.3) gives the proof of: x ≤ sq m (y), y ≤ sq k (z)→x ≤ sq k+m (z) The length of proof in Theorem 6 is linear in m since we only used Lemma 2 and constant number of substitutions and cuts. Theorem 7. Let t(a1 , . . . , ak ) be an arbitrary Lp − term and c be a variable. Then P V − proves: ⋀(∣ai ∣ ≤ c), (1 < c)→∣t( a)∣ ≤ sq l(t) (c) → i≤k with a proof of length quadratic in l(t) We prove this theorem by induction on the complexity of terms in P V − . In the Lemma 4 we will prove above theorem for function symbols belong to the P V − . Later the general case for arbitrary terms will be discussed. Lemma 3. There is a constant 0 < B such that for every 0 < r, P V − proves: →c ≤ sqr (c) (3.4) With a proof of length bounded in B ⋅ l(r) Proof of Lemma 3. Proof is by induction on k, from axioms we have P V − ⊢ →c ≤ sq(c) by substitution sq r (c) to c in that we obtain P V − ⊢ →sq r (c) ≤ sq r+1 (c) 30 (3.5) From Axiom (8) of BASIC we have c ≤ sq r (c), sq r (c) ≤ sq r+1 (c)→c ≤ sq r+1 (c) (3.6) Also induction hypothesis implies : P V − ⊢ →c ≤ sq r (c) (3.7) with a proof of length bounded in B ⋅ l(r). So by applying cut (also weakening) to the (3.6) and (3.5), (3.7), respectively, we obtain: P V − ⊢ →c ≤ sq k+1 (c) (3.8) with a proof of length bounded in B ⋅ l(r). Lemma 4. There is a constant 0 < A such that for every function symbols f ∈ Lp , P V − proves: ⋀(∣ai ∣ ≤ c), (1 < c)→∣f ( a)∣ ≤ sq l(f ) (c) → i≤k with a proof of length bounded in A ⋅ l(f )2 . Proof of Lemma 4. We prove Lemma 4 by induction on the complexity of the definition of f . For this we have following cases: 1. If f is defined by LRN . 2. If f is defined by composition. → For the Case 1, f (x, Ð y ) is defined by limited recursion on notation (LRN) → → → from g(Ð y ) and h(x, z, Ð y ) and k(x, Ð y ). To prove this case, by induction hypothesis we have: P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣g( a)∣ ≤ sq l(g) (c) → (3.9) i≤k P V − ⊢ ⋀ (∣ai ∣ ≤ c), (1 < c)→∣h( a)∣ ≤ sq l(h) (c) → (3.10) i≤k+2 P V − ⊢ ⋀ (∣ai ∣ ≤ c), (1 < c)→∣k( a)∣ ≤ sq l(k) (c) → i≤k+1 31 (3.11) With proofs of length bounded in A⋅l(g)2 , A⋅l(h)2 and A⋅l(k)2 , respectively. By properties of M in (as described in Chapter 2) and axioms of P V concerning LRN and equality axioms we have: → → P V − ⊢ →∣f (Ð a )∣ ≤ ∣k(Ð a )∣with a “constant” length proof. (3.12) By Lemma 3 we have : P V − ⊢ →c ≤ sq l(f )−l(k) (c) with a proof of length bounded in B ⋅(l(f )−l(k)). Applying one substitution c ↦ sq l(k) (c) gives P V − ⊢ →sq l(k) (c) ≤ sq l(f ) (c) (3.13) with a proof length bounded in B ⋅ (l(f ) − l(k))(our coding guarantees that l(k) ≤ l(f ). By using Axiom (8)of BASIC we have following derivations : → → → → P V − ⊢ ∣f (Ð a )∣ ≤ ∣k(Ð a )∣, ∣k(Ð a )∣ ≤ sq l(k) (c)→∣f (Ð a )∣ ≤ sq l(k) (c) (3.14) → → P V − ⊢ ∣f (Ð a )∣ ≤ sq l(k) (c), sq l(k) (c) ≤ sq l(f ) (c)→∣f (Ð a )∣ ≤ sq l(f ) (c) (3.15) Now by applying cuts and weakening as before, to the (3.15) and (3.14), (3.13), induction hypothesis (3.11) also (3.12), respectively, we get the final derivation in the Lemma 4. Complexity Analysis: In order to measure the size of above proof first we count the number of sequents in the derivation: by using induction hypothesis (3.11) the number of sequents will be quadratic in l(k) namely A ⋅ l(k)2 derivation (3.13) increases the length of proof B ⋅ (l(f ) − l(k)) and for other derivations from axioms as well as using Cut rule and Weakenings we increase the length of proof constantly by C. Thus, since we have l(k) ≤ l(f ) and above analysis, the constants B and C are independent of l(f ) so we have: A ⋅ l(k)2 + B ⋅ (l(f ) − l(k)) + C ≤ A ⋅ l(f )2 This shows that the length of proof is quadratic in l(f ).◻ → → → → For the Case 2, f (x, Ð y ) is defined by “Composition” from h(Ð z ) and gi (Ð y ), . . . , gk (Ð y ). From induction hypothesis we have : P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣g1 ( a)∣ ≤ sq l(g1 ) (c) → i≤k 32 ⋮ P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣gk ( a)∣ ≤ sq l(gk ) (c) → i≤k With proofs of length bounded in A ⋅ l(g)2 , . . . , A ⋅ l(gk )2 , respectively. → P V − ⊢ ⋀(∣bi ∣ ≤ c), (1 < c)→∣h( b )∣ ≤ sq l(h) (c) (3.16) i≤k with a proof of length bounded in A ⋅ l(h)2 . → From induction hypothesis (3.16), applying k substitutions bi ↦ gi ( a) and the substitution c ↦ sq m (c) where m = max {l(gi ) ∣i ≤ k} yields: P V − ⊢ ⋀(∣gi ( a ∣) ≤ sq m (c)), (1 < sq m (c))→∣h(g1 ( a), . . . , gk ( a)∣ ≤ sq l(h)+m (c) → → → i≤k (3.17) Lemma 3 implies following derivations: P V − ⊢ →1 ≤ sq m (c) (3.18) And proves for each i we have: P V − ⊢ c ≤ sq m−l(gi ) (c) so we get P V − ⊢ →sq l(gi ) (c) ≤ sq m (c) (3.19) Using this and induction hypothesis for gi and Axiom (8) of BASIC gives: P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣g1 ( a)∣ ≤ sq m (c) → i≤k ⋮ P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣gk ( a)∣ ≤ sq m (c) → (3.20) i≤k Now cuts of (3.17) to every derivation in (3.20) and (3.18) yields: P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣h(gi ( a))∣ ≤ sq l(h)+m (c) → i≤k From axioms for composition of function we have → → → f (Ð a ) = h(g1 ( a), . . . , gk ( a) 33 (3.21) using this, (3.22)and Compatibility axiom concerning ∣x∣ and ≤ gives: P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣f ( a)∣ ≤ sq l(h)+m (c) → (3.22) i≤k Again by using Lemma 3 and considering l(f ) > l(h) + m according to our coding in the case of composition, we will derive: P V − ⊢ →sq l(h)+m (c) ≤ sq l(f ) (c) (3.23) With similar technique as before by using Axiom (8)of BASIC and applying cuts, finally we obtain final derivation in the Lemma 4. Complexity Analysis: At (3.17) the length of proof is A ⋅ l(h)2 (by induction hypothesis and using substitutions) for occurrence of cuts, weak structural rules and derivations from axioms we consider constant C. Each derivation in (3.20) has the complexity: A ⋅ l(gi )2 + B.(m − l(gi )) so that part of derivation increase the length by A ⋅ m2 (because of applying k cuts). Using (3.23) for the final derivation increase the length of proof by B ⋅ (l(f ) − (m + l(h))). Thus for the whole proof we have: A⋅l(h)2 +A⋅m2 +B ⋅(l(f )−(m+l(h)))+C. This shows the constants “C” and “B” can be chosen independent of l(f ) consequently the whole derivation has the length A ⋅ l(f )2 . Proof of Theorem 7. For arbitrary term t which is defined by composition of function f and sub terms t1 , . . . , tk : t = f (t1 , . . . , tk ) Since our coding implies l(f ) + l(t1 ) + . . . + l(tk ) ≤ l(t), by similar way to the Lemma 4, in the case of composition, we get the proof of Theorem 7. So far it is shown that Theorem 7 has a proof of length quadratic in l(t), namely A ⋅ l(t)2 . We observe by inspecting the proof of Theorem 7 that each sequent appears in the proof has the size linear in l(t) e.g. A ⋅ l(t). Consequently the size of proof of Theorem 7 is bounded by A ⋅ l(t)3 (cubic in l(t)). In the following we discuss proof of upper bounds for triple length of terms. [6] proves Theorem 8 by using Lemma 5 which is involved with squaring function. Because we do not follow this way and we will prove Theorem 8 instead of using squaring function, we only explain proof of Theorem 8 in [6] brief. 34 Lemma 5. [6] Let ∣∣x∣∣ denotes double length of x (binary length of binary length of x), for all n ∈ N, P V − can prove: ∣∣sq n (x)∣∣ ≤ n + ∣∣x∣∣ with a proof of length linear in l(t). Theorem 8. P V − ⊢ ⋀(∣ai ∣ ≤ c), (1 < c)→∣∣∣t( a)∣∣∣ ≤ l(t) + ∣∣c∣∣ → i≤k with a proof of length quadratic in l(t). Proof. : From previous lemma we have in P V − : →∣∣sql(t)(c) ≤ l(t) + ∣∣c∣∣ (3.24) and its proof has the length quadratic in l(t). Also from axiom (12) of BASIC we have following derivations in P V − : ∣t( a)∣ ≤ sq l(t) (c)→∣∣t( a)∣∣ ≤ ∣sq l(t) (c)∣ → → ∣∣t( a)∣∣ ≤ ∣sq l(t) (c)∣→∣∣∣t( a)∣∣∣ ≤ ∣∣sq l(t) (c)∣∣ → → (3.25) So together with (3.25), (3.26) we get in P V − the proof of: ∣t( a)∣ ≤ sq l(t) (c)→∣∣∣t( a)∣∣∣ ≤ l(t) + ∣∣c∣∣ → → (3.26) Applying cut to (3.27) and end sequent in the Theorem 7 gives the proof of Theorem 8. Complexity Analysis: The proof of theorem 8 consists of two parts one from theorem 7 another from Lemma 5 and each parts has the length quadratic in l(t) so the proof of theorem 8 is of the length quadratic in l(t). In Theorem 9 we will prove upper bounds for triple length of terms different than Theorem 8. Our proof involves successor function which is recursively defined in the following definition. Definition 34. We define k-th successor of x, S k (x) and k-th length of term t, ∣t∣k by recursion on k: 35 ⎧ ⎪ ⎪S 0 (x) = x, • ⎨ k+1 k ⎪ ⎪ ⎩S (x) = S(S (x)) • ∣t∣0 = t • ∣t∣k+1 = ∣∣t∣k ∣ → Theorem 9. Let t( a) be a P V term and 0 < A be a constant then: P V − ⊢ ⋀(∣ai ∣3 ≤ c), (1 < c)→∣t( a)∣3 ≤ S l(t) (c) → i≤k with a proof of length bounded by A ⋅ (l(t))2 Proof. Like Theorem 7 the proof of this theorem is by induction on the complexity of terms. First we prove it for function symbols. If f is defined from g, h and k by LRN, from definition of LRN and Compatibility axiom for ∣t∣3 and ≤ we have following derivations:(we omit variables of function for the sake of simplicity) P V − ⊢ →∣f ∣ ≤ ∣k∣ P V − ⊢ →∣f ∣3 ≤ ∣k∣3 (3.27) Induction hypothesis in this case implies: P V − ⊢ →∣k∣3 ≤ S l(k) (c) This and (3.27) show that: P V − ⊢ →∣f ∣3 ≤ S l(k) (c) (3.28) Similar to the Lemma 3, for constant 0 < A, P V − proves →c ≤ S l(f )−l(k)(c) with a proof of length bounded by A ⋅ (l(f ) − l(k)) Applying one substitution gives: P V − ⊢ →S l(k) (c) ≤ S l(f ) (c), where our coding shows l(k) ≤ l(f ) From above derivations we can easily get the proof of P V − ⊢ →∣f ∣3 ≤ S l(f ) (c) 36 (3.29) The complexity analysis is similar to the Theorem 7 in particular derivation (3.29) has a proof length bounded by A⋅(l(f )−l(k)). Also from induction hypothesis, (3.28) has a proof of length bounded by A ⋅ (l(k)2 ). Thus the length of proof of Theorem 9 is bounded by A ⋅ (l(f )2 ) If f is defined by composition from h and g1 , . . . , gk then we have following derivations: → P V − ⊢ ⋀ ∣gi ∣3 ≤ c→∣h(gi )∣3 ≤ S l(h)+m (c) i≤k applying cuts to this and induction hypothesis for every gi gives the proof of: P V − ⊢ ⋀ ∣ai ∣3 ≤ c→∣h(gi , . . . , gk )∣3 ≤ S l(h)+m (c) (3.30) i≤k With axiom for composition and also the fact that l(h) + m ≤ l(f ) as well as (3.30) we get the proof of: P V − ⊢ ⋀ ∣ai ∣3 →∣f ( a)∣3 ≤ S l(f ) (c) → i≤k The estimation of proof’ length is similar to the Theorem 7. We conclude that Theorem 9 has a proof of length bounded by A ⋅ (l(t))2 . 37 Chapter 4 Speed-up of induction for P V − As defined in chapter 2 induction is given schematically in P V by : (17) Induction Γ, A0 (b)→A0 (b + 1), ∆ → Γ, A0 (0)→A0 (t(Ð a )), ∆ where A0 (x) is an open formula and t is an arbitrary term with eigenvariable b. Thus in order to do full induction we need following implications: ((A0 (0) ⊃ A0 (1))∧(A0 (1) ⊃ A0 (2))∧. . .∧(A0 (t−1) ⊃ A0 (t))) ⊃ (A0 (0) ⊃ A0 (t)) (with applying repeatedly Modes Pones). But in speed-up technique which will be explained later, we introduce a formula A∗0 (z) and shorten (c.f. Theorem 10) the number of steps up to the length of t (∣t∣): ((A∗0 (0) ⊃ A∗0 (1))∧(A∗0 (1) ⊃ A∗0 (2))∧. . .∧(A∗0 (∣t∣−1) ⊃ A∗0 (∣t∣))) ⊃ (A0 (0) ⊃ A0 (t)) In P V − we do not have Induction rule. In order to simulate Induction feasibly, we use above technique which is called cut shortening [6] by the aid of polynomial time computable functions and applying bounded number of cuts. This is discussed in Section (4.2). 4.1 Speed-up of induction with quantifiers In this section we state results from [6] concerning the first order theory over P V − which is denoted by P V1− . 38 Definition 35. For an open P V − f ormula, A0 and P V − term, t we define A∗0 as follows: A∗0 (z) ≡ (∀y ≤ t)(∀y ′ ≤ t)(y ′ ≤ y ∧ (y ≤ y ′ + Exp(z, t)) ∧ A0 (y ′ ) ⊃ A0 (y) Theorem 10. For above definition of A∗ we have: 1. P V1− ⊢ (∀x < t)(A0 (x) ⊃ A0 (x + 1)) ⊃ A∗0 (0) 2. P V1− ⊢ A∗0 (z) ⊃ A∗0 (z + 1) 3. P V1− ⊢ A∗0 (∣t∣) ⊃ (A0 (0) ⊃ A0 (t)) Proof. To prove Theorem 10, (1) we argue informally in P V1− . By definition and axiom (4) of BASICe , A∗0 (0) is equivalent to (∀y ≤ t)(∀y ′ ≤ t)(y ′ ≤ y ∧ (y ≤ y ′ + 1) ∧ A0 (y ′ ) ⊃ A0 (y) For the critical case y = y ′ , (1) is trivial. For the critical case y = y ′ + 1, by using premise of implication in Theorem 10, (1) and axioms we finish the proof. We informally prove Theorem 10, (2) in P V1− by distinguishing following cases 39 For Case 1, By definition premise of implication in Theorem 10, (2) is equivalent to: (∀y ≤ t)(∀y ′ ≤ t)(y ′ ≤ y ∧ (y ≤ y ′ + Exp(z, t)) ∧ A0 (y ′ ) ⊃ A0 (y) Also axioms imply that Exp(z, t) ≤ Exp(z + 1, t). This and above formula gives the result. For Case 2, first we apply premise of implication in Theorem 10, (2) to the values y ′ and y ′ + Exp(z, t) this gives us: (y ′ ≤ y ′ +Exp(z, t)∧(y ′ +Exp(z, t) ≤ y ′ +Exp(z, t))∧A0 (y ′ ) ⊃ A0 (y ′ +Exp(z, t)) Secondly we apply the premise of implication in to the y ′ + Exp(z, t) and y ′ to get: (y ′ + Exp(z, t) ≤ y ∧ (y ≤ y ′ + Exp(z + 1, t)) ∧ A0 (y ′ + Exp(z, t)) ⊃ A0 (y) By combining above results we finish the proof of Theorem 10, (2). Instantiating premise of Theorem 10, (3) by 0, t for y ′ , y and using the axiom t ≤ Exp(∣t∣, t)(axiom (15) of BASICe ) gives: (0 ≤ t ∧ (t ≤ 0 + Exp(∣t∣, t)) ∧ A0 (0) ⊃ A0 (t) which implies (3). 4.2 Speed-up of induction for quantifier free equational theories Since we do not have quantifiers in P V − we have to develop a speed-up induction technique which is different from the one in the previous subsection. Theorem 11, Lemma 6 and Lemma 8 gives a method to speed-up induction in quantifier free equational theories, namely P V − by using polynomial time computable functions called Skolem functions. Theorem 11. Let a = ⟨y ′ , y⟩, A0 and t be a PV formula and a PV term also A′0 be defined as: A′0 (z, ⟨y ′ , y⟩) ≡ (y ≤ t) ∧ (y ′ ≤ y) ∧ (y ≤ y ′ + Exp(z, t)) ∧ A0 (y ′ ) ⊃ A0 (y) 40 then there is a polynomial-time computable function F (z, a) in P V − such that, for A′ as above, P V − proves: A′0 (z, F (z, a)) ⊃ A′0 (z + 1, F (z + 1, a)) (4.1) A′0 (∣t∣, F (∣t∣, a)) ←→ A′0 (∣t∣, a) (4.2) And Let Â(z, a) = A′ (z, F (z, a)) then we have: P V − ⊢ Â(z, a) ⊃ Â(z + 1, a) (4.3) P V − ⊢ (A0 ((F (0, a))1 ) ⊃ A0 ((F (0, a))1 + 1)) ⊃ Â(0, a) (4.4) P V − ⊢ (∣t∣ ≤ u) ∧ Â(u, ⟨0, t⟩) ⊃ (A0 (0) ⊃ A0 (t)) (4.5) It should be mentioned that the proof of Theorem 11 does not depend on the structure of formula A0 and we use it only schematically. Now we define a Skolem function τ (z, a) for the pair a = ⟨y ′ , y⟩ which enables us to eliminate quantifiers in the induction step. For this we prove Lemma 6 as follows: Lemma 6. Let A0 and t be a PV formula and a PV term A′0 be the formula as described in Theorem 11 and a = ⟨y ′ , y⟩, then there is a P V term τ such that: P V − ⊢ A′0 (z, τ (z, a)) ⊃ A′0 (z + 1, a) (4.6) Proof. We prove this lemma by direct construction of τ from A0 : ⎧ ⟨y ′ , y⟩ y ≤ y ′ + Exp(z, t), ⎪ ⎪ ⎪ ⎪ τ (z, a) = ⎨⟨y ′ + Exp(z, t), y⟩ y > y ′ + Exp(z, t), A0 (y ′ + Exp(z, t)) ⎪ ⎪ ⎪ ′ ′ ′ ⎪ ⎩⟨y, y + Exp(z + 1, t)⟩ y > y + Exp(z, t), ¬A0 (y + Exp(z, t)), Above construction shows that τ is 1. In P V because ⟨, ⟩ ∈ Lp and Exp ∈ Lp (see examples in section 2.3.3). 2. uniformly defined from A0 such that l(τ ) and the length of proof of lemma are linearly bounded by l(A0 ). We prove (4.6) by informal argument within P V . For this we distinguish three cases: 41 For the first case τ (z, ⟨y ′ , y⟩) = ⟨y ′ , y⟩ we must prove: A′0 (z, ⟨y ′ , y⟩)→A′0 (z + 1, ⟨y ′ , y⟩) And this is supported by axioms of BASICe since we have : y ≤ y ′ + Exp(z, t)→y ≤ y ′ + Exp(z + 1, t) For the case 2 we prove assertion (4.6) by contradiction. That is to show in PV − ¬A′0 (z + 1, a) ⊃ ¬A′0 (z, τ (z, a))(∗) In this case we have τ (z, a) = ⟨y ′ + Exp(z, t), y⟩. For proving (∗) we need to show A0 (y ′ ), ¬A0 (y)→A0 (y ′ + Exp(z, ∣t∣)) ∧ ¬A0 (y) which is true since by definition of τ , A0 (y ′ + Exp(z, ∣t∣)) is true. Similarly in the case 3 we prove (4.6) by contradiction. From definition of τ we know τ (z, a) = ⟨y ′ , y ′ + Exp(z, ∣t∣)⟩. So for proving (∗) we need to show A0 (y ′ ), ¬A0 (y)→A0 (y ′ ) ∧ ¬A0 (y ′ + Exp(z, t)) 42 which is true since from definition of τ in Case 3, ¬A0 (y ′ + Exp(z, t)) is true. Definition 36. Modified limited recursion on notation(MLRN) Let t be a PV term then f is defined by MLRN from g, h, k ∈ P V if: 1. f (0, y) = g(y) 2. f (u + 1, y) = h(u, f (u, y)) where u < ∣t∣ 3. f (u, y) = f (∣t∣, y) where ∣t∣ ≤ u 4. f (u, y) ≤ k(u, y) Lemma 7. Let t be a PV term and g, h, k ∈ P V and let f be defined by MLRN from g, h, k, then f is definable in P V such that the conditions of MLRN are provable in P V − Proof. To prove this lemma we define f from another function which is defined by LRN and is in Lp We define h∗ , g∗ , k∗ ∈ Lp as follows: g∗ (y) = g(y) u h∗ (u, z, y) = h(∣⌊ ⌋∣, z, y) 2 k∗ (u, y) = k(∣u∣, y) Let f∗ ∈ Lp be the function which is defined by LRN from h∗ , g∗ , k∗ . Then we define f (u, y): f∗ (Exp(u, t) 1, y) So f which is defined by MLRN can be defined by LRN. Also above definition of f shows that conditions 1-4 of MLRN is provable in P V − . For the first part of theorem we explicitly construct F (z, a). Let t∗ be a P V -term such that ⟨∣t∣, ∣t∣⟩ ≤ ∣t∗ ∣ now we define F∗ (z, a) by modified limited recursion on notation (MLRN) as following: 1. F∗ (0, ⟨y ′ , y⟩) = ⟨y ′ , y⟩ 2. F∗ (u, ⟨y ′ , y⟩) = τ (∣t∣ u, F∗ (u 1, ⟨y ′ , y⟩)) for 1 ≤ u ≤ ∣t∣ 3. F∗ (u, ⟨y ′ , y⟩) = F∗ (∣t∣, ⟨y ′ , y⟩) for ∣t∣ ≤ u 4. ∣F∗ (u, ⟨y ′ , y⟩)∣ ≤ ∣t∗ ∣ 43 Definition 37. F (z, a) ≡ F∗ (∣t∣ z, a) It is directly followed from Axiom 7 of BASIC that: F (∣t∣, a) = F∗ (0, a) = a Lemma 8. P V − ⊢ z < ∣t∣ ⊃ F (z, a) = τ (z, F (z + 1, a)) Proof. : We argue informally in P V − . First we observe from BASICe we have following properties of for b ≤ a: • (a b) 1 = a (b + 1) • a (a b) = b The definition of F (z, a) and above facts from BASICe show that for z ≤ ∣t∣ we have: F (z, a) = F∗ (∣t∣ z, a) = τ (∣t∣ (∣t∣ z), F∗ ((∣t∣ z) 1, a)) = = τ (z, F∗ (∣t∣ (z + 1), a)) = τ (z, F (z + 1, a)) . Proof of theorem 11. By substituting a in lemma 6 with F (z + 1, a) we have P V − ⊢ A′0 (z, τ (z, F (z + 1, a))) ⊃ A′0 (z + 1, F (z + 1, a)). This and Lemma 8 yield (4.1). (4.2) is immediately followed from the fact that F (∣t∣, a) = a. (4.3) is implied by rewriting of (4.1) with formula Â. In (4.4) we have Â(0, a) ≡ A′0 (0, F (0, a)). Let m′ = (F (0, a))1 and m = (F (0, a))2 then A′ (0, m′ , m) is equivalent to m ≤ t, m′ ≤ m ∧ m ≤ m′ + 1 ∧ A(m′ ) ⊃ A(m) Critical case m = m′ is trivial. Critical case m = m′ + 1 is implied by axioms and premise of (4.4). Finally for (4.5), definition of  and axiom (16) of BASICe imply that: P V − ⊢ (∣t∣ ≤ u) ∧ Â(u, ⟨x′ , x⟩) ⊃ Â(∣t∣, ⟨x′ , x⟩) 44 On the other hand Â(∣t∣, ⟨0, t⟩) is defined as A′ (∣t∣, F (∣t∣, ⟨0, t⟩) which is equivalent to A′ (∣t∣, 0, t) By definition the latter formula means: (t ≤ t) ∧ (0 ≤ t) ∧ (t ≤ Exp(∣t∣, t)) ∧ A0 (0) ⊃ A0 (t) Thus we proved Â(∣t∣, ⟨0, t⟩) ⊃ (A0 (0) ⊃ A0 (t)) Note: The proofs of (4.3)-(4.5) only use A schematically with constant number of cuts and substitutions so the length of proofs are linearly bounded by l(A0 ). In [6] authors repeated arguments in Theorem 11 three times for ∣t∣, ∣∣t∣∣ and ∣∣∣t∣∣∣ in order to speed up induction. In the following we introduce new kind of induction, k-Induction which is more general and allows to argue about ∣t∣k . This method simplifies the proofs in [6]. Definition 38. For an open formula B(b) we define: k-Induction Γ, B(b)→B(b + 1), ∆ Γ, B(0)→B(∣s∣k ), ∆ P Vkr is a theory with the language Lp , sets of axioms BASICp and equality axioms, inference rules 1-16 and k-induction . Lemma 9. r P Vkr = P Vk+1 r That is a sequent is provable in P Vkr if and only if it is provable in P Vk+1 . Furthermore the length of proof only increases linearly. r r Proof. To prove Lemma 9 we show P Vkr ⊇ P Vk+1 and P Vkr ⊆ P Vk+1 . For the r first case we have to show that every P Vk+1 -proof can be transformed to a P Vkr -proof. We prove this by induction on the height of proofs considering various rules for the last inference. The only nontrivial case happens when r -proof we do (k +1)-Induction in the last inference. Assume in this case P Vk+1 has the following form. 45 P1 Γ, B(b) → B(b + 1), ∆ (k + 1)-Induction Γ, B(0) → B(∣s∣k+1 ), ∆ Induction hypothesis implies: P Vkr ⊢ Γ, B(b) → B(b + 1), ∆ Applying one k-induction using the fact that ∣s∣k+1 = ∣∣s∣∣k gives: P Vkr ⊢ Γ, B(0) → B(∣s∣k+1 ), ∆ r This finishes the proof of first case. In the case P Vkr ⊆ P Vk+1 we have to r r show that every P Vk -proof can be transformed into a P Vk+1 -proof, this can be done by induction on the height of proofs. Like previous case we consider different rules for the last inference. If the last inference be substitution then the P Vkr -proof is of the form P1 Γ(b) → ∆(b) Substitution Γ(t) → ∆(t) induction hypothesis implies : r P Vk+1 ⊢ Γ(b) → ∆(b) applying one substitution gives: r P Vk+1 ⊢ Γ(t) → ∆(t) Other strong inferences except k-induction are similar to this case. For the case of induction the structure of P Vkr − proof is P1 Γ, B(b)→B(b + 1), ∆ k-Induction Γ, B(0)→B(∣s∣k ), ∆ From induction hypothesis we have: r P Vk+1 ⊢ Γ, B(b)→B(b + 1), ∆ Let B̂ and F (z, a) be defined as Theorem 11 for formula B. Substitution of b with F (z, a)1 and inference rule for implication gives the proof of: r P Vk+1 ⊢ Γ→B(F (z, a)1 ) ⊃ B(F (z, a)1 + 1), ∆ 46 On the other hand from Theorem 11, (4.4) we have P V − ⊢ (B((F (0, a))1 ) ⊃ B((F (0, a))1 + 1)) ⊃ B̂(0, a) By applying cut to this and above sequent we get r P Vk+1 ⊢ Γ→B̂(0, a), ∆ (4.7) Theorem 11, (4.3) implies that: r P Vk+1 ⊢ Γ, B̂(z, a)→B̂(z + 1, a), ∆ So by definition of k-induction we have the proof of: r P Vk+1 ⊢ Γ, B̂(0, a)→B̂(∣s∣k+1 , a), ∆ Applying cut to this and derivation in (4.7) yields: r P Vk+1 ⊢ Γ→B̂(∣∣s∣k ∣, a), ∆ By another cut to this and derivation in Theorem 11, (4.5)we get the proof of : r P Vk+1 ⊢ Γ, B(0)→B(∣s∣k ), ∆ The above proof is based on the Theorem 11 which used input formula B only schematically. In addition we used constant number of substitutions and cuts. This shows that the length of proof only increases linearly. Theorem 12. For a fixed k ≥ 0 there is a polynomial q such that for every P V -proof, P , of sequent Γ→∆ there is a P Vkr -proof, P ∗ , of Γ→∆ and ∣P ∗ ∣ ≤ q(∣P ∣) Proof. The proof is by induction on k. Base case is trivial: If k = 0 then P V0r = P V , by definition. Induction hypothesis says that there is a polynomial q such that for every P V -proof, P there is P Vkr -proof, P ∗ which proves the same end sequent as r . The size of P and also ∣P ∗ ∣ ≤ q(∣P ∣). By Lemma 9 we have P Vkr = P Vk+1 proof in the Lemma 9 is polynomial in ∣P ∣, this and induction hypothesis for q shows the whole proof has polynomially bounded size. Definition 39. A proof P is in “Variable Normal Form” (VNF) if for each cut occurring in P of the form: 47 Γ→∆, A A, Γ→∆ Γ→∆ we have: V ar(A) ⊆ V ar(Γ, ∆) . Lemma 10. For each P Vkr -proof, P , of sequent Γ→∆, there is a P Vkr -proof, P ∗ , in VNF of Γ→∆ such that ∣P ∗ ∣ is polynomial in ∣P ∣. Proof. The proof is by induction on the height of proofs. The only nontrivial case occurs when the last inference is an application of cut rule. In this case P has the form: P1 P2 Γ → ∆, A A, Γ → ∆ Cut Γ→∆ where V ar(A) ⊈ V ar(Γ, ∆). By induction hypothesis we have P Vkr -proofs P1∗ and P2∗ in VNF. Now we substitute each variable in V ar(A)∖V ar(Γ, ∆) by 0 to reach formula A′ which has the property V ar(A′ ) ⊆ V ar(Γ, ∆). Applying one cut to this gives us the final proof. This derivation is as follows: P1∗ P2∗ Γ → ∆, A A, Γ → ∆ Substitution ⋮ ⋮ ′ ′ Γ → ∆, A A ,Γ → ∆ Cut Γ→∆ Since at most we need ∣P ∣ substitutions, the size of ∣P ∗ ∣ is polynomially bounded by ∣P ∣. Definition 40. Γ→∆ has “numerically restricted variables ” if for every free variable aj , in Γ and ∆ there exists a corresponding formula of the form ∣aj ∣3 ≤ S nj (0), nj ∈ N occurring in Γ. Definition 41. For a proof P , si(P ) denotes the number of inferences different than W:l. 48 Theorem 13. There is a polynomial p such that every P V -proof , P , of sequent Γ→∆ with numerically restricted variables can be transformed to a P V − -proof, P ∗ , of Γ→∆ and ∣P ∗ ∣ ≤ p(∣P ∣) . Proof of Theorem 13. We prove Theorem 13 in the following steps: 1. Transforming every P V -proof, P into a P V2r -proof by Theorem 12. 2. Obtaining P V2r -proof of P in VNF by Lemma 10 3. Proving that every P V2r -proof, P in VNF of sequent Γ → ∆ with NRV can be transformed into a P V − -proof, P ∗ of Γ → ∆ such that si(P ∗ ) ≤ p(si(P )). We prove (3) by induction on si(P ). We distinguish cases according to the last inferences. We disuss weakening to the left later and we will finish the proof by complexity analysis. First We prove (3) if the last inference in the proof P of Γ → ∆ is : 1. Cut 2. Substitution 3. Induction Other cases are similar to these cases. Cut: The structure of proof in this case has the following form: P2 P1 Γ → ∆, A A, Γ → ∆ Cut Γ→∆ Induction hypothesis implies that for P V2r -proofs , P1 and P2 , there are P V − proofs, P1∗ and P2∗ with the same end sequents. Applying one cut gives us P V − -proof, P ∗ of Γ → ∆. Substitution: The structure of proof in the case of substitution is: (b is an eigenvariable) P1 Substitution Γ(b)→∆(b) Γ(t)→∆(t) 49 We define m = max {nj ∣j ≤ k − 1} (variables are numerically restricted)and s = l(t) + m for P V -term, t. By applying one W:l to P1 we obtain P V2r -proof of ∣b∣3 ≤ S s (0), Γ(b)→∆(b) We observe that induction hypothesis is applicable because W:l does not increase the si(P1 ) and also the last sequent has NRV. So induction hypothesis implies that there is a P V − -proof of P V − ⊢ ∣b∣3 ≤ S s (0), Γ(b)→∆(b) applying substitution to the above sequent gives: P V − ⊢ ∣t∣3 ≤ S s (0), Γ(t)→∆(t) Finally by doing cut to this and the result from Theorem 9, Chapter 3 which says: P V − ⊢ ⋀ ∣aj ∣3 ≤ S nj (0)→∣t∣3 ≤ S s (0) j≤k we get the proof of: P V − ⊢ Γ(t)→∆(t) Induction: If the last inference in the P V2r -proof of P is induction (restricted 2-induction, c.f.Definition 38) then the structure of proof will be: P1 Γ, A(b)→A(b + 1), ∆ 2-Induction Γ, A(0)→A(∣∣t∣∣), ∆ Let m = max {nj ∣j ≤ k − 1} (variables are numerically restricted)and s = l(t)+ m for P V -term, t. By applying one W:l to P1 we get P V2r ⊢ ∣b∣3 ≤ S s (0), Γ, A(b)→A(b + 1), ∆ (4.8) Now we can apply induction hypothesis since W:l does not change the si(P1 ) and the last end sequent has NRV. Thus we have P V − ⊢ ∣b∣3 ≤ S s (0), Γ, , b ≤ t, A(b)→A(b + 1)(∗∗) Theorem 9 in Chapter 3 implies P V − ⊢ Γ, ⋀ ∣aj ∣3 ≤ S nj (0)→∣t∣3 ≤ S s (0)(∗ ∗ ∗) j≤k 50 also from BASIC we have P V − ⊢ b ≤ t→∣b∣3 ≤ ∣t∣3 by using axiom (8) of BASIC and applying cuts to these sequents we get P V − ⊢ b ≤ t→∣b∣3 ≤ S s (0) Applying cut to this and derivation (∗∗) gives us P V − ⊢ Γ, b ≤ t, A(b)→A(b + 1), ∆ (4.9) From equality axioms we have b ≤ t→b ≤ t applying implication rule (left(14))gives: b ≤ t ⊃ A(b), Γ, b ≤ t→A(b + 1) with using cut to this and sequent b + 1 ≤ t→b ≤ t we get the proof of: b + 1 ≤ t, b ≤ t ⊃ A(b), Γ→A(b + 1), ∆ Using implication rule (right (15)) yields: Γ, b ≤ t ⊃ A(b)→b + 1 ≤ t ⊃ A(b + 1), ∆ Now we define A0 as follows: A0 (b) ≡ b ≤ t ⊃ A(b) thus by rewriting above sequent we have the proof of: P V − ⊢ Γ→∆, A0 (b) ⊃ A0 (b + 1) (4.10) Let u = ∣∣t∣∣ we define A′ , Â, F (z, a) w.r.t formula A0 like Theorem 11. Let a = ⟨0, u⟩, then substituting b in (4.10) with F (0, a)1 and applying cut to the (4.4) gives: P V − ⊢ Γ→Â(0, ⟨0, u⟩), ∆ (4.11) From (4.3) we obtain P V − ⊢ Â(z, ⟨0, u⟩)→Â(S(z), ⟨0, u⟩) 51 applying cut to this and (4.11), S s (0) many times gives us : Γ→Â(s, ⟨0, u⟩), ∆ (4.12) From Theorem 11 we have : P V − ⊢ (∣u∣ ≤ S s (0)) ∧ Â(S s (0), ⟨0, u⟩)→(A0 (0) ⊃ A0 (u)) By applying a cut to this and (4.12) and another cut to the (∗ ∗ ∗) we get the proof of: P V − ⊢ Γ→A0 (0) ⊃ A0 (∣∣t∣∣), ∆ . From definition of A0 and applying some inference rules we can replace formula A0 with A we get : P V − ⊢ Γ, A(0)→A(∣∣t∣∣), ∆ Weakening:left If the last inference in the P V2r -proofs of P be a weak structural rule then the si(P ) does not change, consequently we could not apply induction hypothesis. To solve this problem we should find the last strong inferences and apply induction hypothesis at that point. In the following we discuss a case which the last strong inference be the rule (12)∨ ∶ lef t of the form: A, Γ→∆ B, Γ→∆ A ∨ B, Γ→∆ And the last inference be a weakening to the left. Thus the structure of proof is : P1 P2 A, Γ → ∆ B, Γ → ∆ ∨ ∶ lef t A ∨ B, Γ → ∆ Weakening : left ⋮ Ð → C , A ∨ B, Γ → ∆ In order to apply induction hypothesis we need to obtain sequents in NRV thus first we apply weakenings to the end sequents of P1 and P2 so we have following subderivations: 52 P1 A, Γ → ∆ Weakening : left ⋮ Ð → C , A, Γ → ∆ P2 B, Γ → ∆ Weakening : left ⋮ Ð → C , B, Γ → ∆ By applying induction hypothesis to the above sub derivations we get their P V − − proof s, P1∗ and P2∗ using (12)∨ ∶ lef t and some exchange rule gives the proof of P ∗ , this derivation is as follows: P1∗ P2∗ Ð → C , A, Γ → ∆ Exchange : left Ð → A, C , Γ → ∆ ∨ ∶ lef t Ð → C , B, Γ → ∆ Exchange : left Ð → B, C , Γ → ∆ Ð → A ∨ B, C , Γ → ∆ Exchange : left Ð → C , A ∨ B, Γ → ∆ Complexity Analysis: For the second part of Theorem 13 concerning size of proofs, first we estimate the length of P V − -proof, P ∗ . We showed in the step 3 of proof of Theorem 13 by induction on si(P ) that there is a polynomial p such that every P V2r proof, P in VNF with NRV, can be transformed to a P V − -proof, P ∗ which proves the same end sequent such that si(P ∗ ) ≤ p(si(P )). As it is shown in the proof of Theorem 13 in the case of using cut, substitution and induction for the last inference, the length of proof only increases linearly and we did certain number of W:l for each occurrence of cut,substitution and induction which is bounded by C ⋅ ∣P ∣ for a constant C. So the whole proof consists of two parts one comes from P and another part consists of weakenings which are necessary to get NRV. Therefore the overall estimation of proof’s length will be length of P ∗ ≤ si(P ∗ ) ⋅ C ⋅ ∣P ∣ 53 Let bi and ti be eigenvariables and terms which are involved in every occurrance of substitution or induction. According to the premiss of Theorem 13 every parameter variable ai must satify the condition of NRV so they have the form ∣aj ∣3 ≤ S nj (0) Let m be maximum of nj . To ensure that the condition of NRV holds when we apply substitution or cut we used weakenings namely with fromulae of the form ∣bi ∣3 ≤ S ni (0) Where ni = m + l(ti ). Hence ni is bounded by sum of all l(ti ) and we have ni ≤ m + ∑ l(ti ) ≤ ∣P ∣ This shows the size of each sequent is bounded by C.∣P ∣2 for a constant C. Finally in contrast of above discussion for length of proof we conclude that the size of P ∗ is polynomial in ∣P ∣. 4.3 Provability of consistency of P V − In this section we want to show that the consistency of P V − is not provable in P V . For this first we formalise Theorem 13 in S21 (P V ) then by using conservativity of S21 (P V ) over P V we show in the Theorem 15 that P V ⊬ Con(P V − )The complete workout of theorem 14 and 15 is beyond the scope of this MRes thesis so we only sketch their proofs. Theorem 14. Theorem 13 can be formalised in S21 (P V ) in the following way: Let t(p) be a P V -term and N RV (s) be a predicate which is true when a sequent with the number s has numerical restricted variables. (this predicate is definable in P V ) Then S21 (P V ) proves ∀p∀s∃q(P rfP V (p, s) ∧ N RV (s) ⊃ q ≤ t(p) ∧ P rfP V − (q, s)) Proof. Proof of Theorem 13 is built by Theorem 11, Lemma 9 and Lemma 10. The proof of Lemma 11 does not involve induction and uses input formula only schematically. Formalisation of this theorem in S21 (P V ) is straightforward. In Lemma 9, induction is on the height of proofs. In order to formalize this lemma we follow these steps: 54 1. Let P be a P Vkr -proof and l be length of P , we form sequence of subproofs p1 . . . pl which are sorted according to the number of inferences which they use. r 2. we show by induction on j that there are P Vk+1 − proof s, qj , with size polynomial in the size of pj , and proves the same end sequent as pj . 3. The above construction is an application of Σb1 − LIN D which is applicable in S21 (P V ). Formalisation of Lemma 10 is similar to the Lemma 9. In the proof of Theorem 13 we observe that induction hypothesis is applied to modified subproofs(to ensure the condition of NRV is hold ) in other words we applied induction on h in the following formula: ∀p∀s(si(p) ≤ h ∧ P rfP V2r (p, s) ∧ V N F (p) ∧ N RV (s) ⊃ ∃qP rfP V − (q, s)) This induction is an application of Πb2 − LIN D which is not applicable in S21 (P V ) unless BA hierarchy collapses. One way to solve this problem is to construct a sequence of suitable P V2r -proofs (with VNF and NRV) such that the elements of that sequence can be transformed into P V − -proofs successively this can be done in the following steps: 1. Fix a P V2r -proof, P which has NRV and VNF. Let b1 . . . bl be the eigenvariables occurring in P and a1 . . . ak be parameter variables. Without loss of generality we assume b1 . . . bl have following property: “ If the inference which eliminates bi occurs earlier than inference which eliminates bj then i < j” Let ti be the term which bounds bi in its eliminating inference, e.g. Γ, A(bi ) → ∆, A(bi + 1) Γ, A(0) → ∆, A(ti ) thus we have a sequence of terms t1 . . . tl . 2. We fix numerical restrictions for b1 . . . bl in similar way to the proof of theorem 13. The end sequent of P contains formulae of the form ∣ai ∣3 ≤ S nj (0), which occurred in the anticedent of endsequent of P . Let m = max{n1 . . . nk }. Now for each bi we define numerical restriction as following formula : ∣bi ∣3 ≤ S m+l(ti )+⋯+l(tl ) (0) 55 3. Construct the list L1 of all subproofs of P which are not the premise of weakening left (W:l), including P . 4. For each p′ ∈ L1 we add necessary weakening to the left (W:l) with numerical restriction from (2.) in a canonical way such that the resulting proofs has NRV. Let L2 be the list of such proofs. 5. Let p1 , p2 , . . . be the list of proofs in L2 sorted according to the order of number of inferences different than (W:l) 6. By induction on the j it can be shown that there are P V − -proof, qj of size polynomial in the size of pj which has the same end sequent as pj . Apparently this is an application of Σb1 − LIN D which is available in S21 (P V ). In section 2.4 we introduced Con(P V ) as the formula ∀y¬P rfT (y, ⌜0 = 1⌝) The appropriate notion for P V is developed by Cook [10] as follows: ⎧ 1 If m is the number of an equation t = u, ⎪ ⎪ ⎪ ⎪ P roofP V (m, n)=⎨ and n is the number of a proof in P V of t = u ⎪ ⎪ ⎪ ⎪ ⎩0 Otherwise These two definition for proof predicate are equivalent in S21 (P V ). Theorem 15. P V ⊬ Con(P V − ) Proof. For the sake of contradiction suppose P V ⊢ Con(P V − ) that means: P V ⊢ P roofP V − (p, ⌜0 = 1⌝) = 0 S21 (P V ) ⊢ P rfP V − (p, ⌜0 = 1⌝) = 0 S21 (P V ) ⊢ ∀p¬P rfP V − (p, ⌜0 = 1⌝) Applying Theorem 14 we gives : S21 (P V ) ⊢ ∀p¬P rfP V (p, ⌜0 = 1⌝) = 0 56 and from conservativity of S21 (P V ) over P V [2] we get: P V ⊢ P roofP V (p, ⌜0 = 1⌝) = 0 which contradicts the result in [10] by Cook that P V ⊬ Con(P V ) 4.4 Future works In this thesis we have worked out the result P V ⊬ Con(P V − ) in detail. Our proof for this result does not involve with additional function symbol, Squaring function and its corresponding axiom. But still there are some other axioms which distinct P V − from pure equational theory, in particular in contrast of S2−∞ [1]. One way to approach this problem is to weaken P V − by dropping further axioms and still maintaining provability of consistency. This will include also carrying out speed-up techniques in pure equational theories. Another direction for future work is to strengthen techniques in [1] to show provability of consistency of pure equational theories which are only based on recursive definition of function symbols as axioms. 57 Chapter 5 Appendix BASIC 1. y ≤ x ⊃ y ≤ Sx 2. x ≠ Sx 3. 0 ≤ x 4. x ≤ y ∧ x ≠ y ↔ Sx ≤ y 5. x ≠ 0 ⊃ 2 ⋅ x ≠ 0 6. y ≤ x ∨ x ≤ y 7. x ≤ y ∧ y ≤ x ⊃ x = y 8. x ≤ y ∧ y ≤ z ⊃ x ≤ z 9. ∣0∣ = 0 10. x ≠ 0 ⊃ ∣2 ⋅ x∣ = S(∣x∣) ∧ ∣S(2 ⋅ x)∣ = S(∣x∣) 11. ∣S0∣ = 0 12. x ≤ y ⊃ ∣x∣ ≤ ∣y∣ 13. ∣x ♯ y∣ = S(∣x∣ ⋅ ∣y∣) 14. 0 ♯ y = S0 58 15. x ≠ 0 ⊃ 1 ♯(2 ⋅ x) = 2(1) ∧ 1 ♯(S(2 ⋅ x)) = 2(1) 16. x = y 17. ∣x∣ = ∣y∣ ⊃ x = y 18. ∣x∣ = ∣u∣ + ∣v∣ ⊃ x = (u ♯ y) ⋅ (v ♯ y) 19. x ≤ x + y 20. x ≤ y ∧ x ≠ y ⊃ S(2 ⋅ x) ≤ 2 ⋅ y ∧ S(2 ⋅ x) ≠ 2 ⋅ y 21. x + y = y + x 22. x + 0 = x 23. x + Sy = S(x + y) 24. (x + y) + z = x + (y + z) 25. x + y ≤ x + z ↔ y ≤ z 26. x ⋅ 0 = 0 27. x ⋅ (Sy) = (x.y) + x 28. x ⋅ y = y ⋅ x 29. x ⋅ (y + z) = (x ⋅ y) + (x ⋅ z) 30. x ≥ S0 ⊃ (x ⋅ y ≤ x ⋅ z ↔ y ≤ z) 31. x ≠ 0 ⊃ ∣x∣ = S(∣⌊ 12 x⌋∣) 32. x = ⌊ 12 y⌋ ↔ (2 ⋅ x = y ∨ S(2 ⋅ x) = y) 59 Bibliography [1] Arnold Beckmann. Proving consistency of equational theories in bounded arithmetic. J. Symbolic Logic, 67(1):279–296, 2002. [2] Samuel R. Buss. Bounded arithmetic, volume 3 of Studies in Proof Theory. Lecture Notes. Bibliopolis, Naples, 1986. [3] Samuel R. Buss. Relating the bounded arithmetic and polynomial time hierarchies. Ann. Pure Appl. Logic, 75(1-2):67–77, 1995. Proof theory, provability logic, and computation (Berne, 1994). [4] Samuel R. Buss. Bounded arithmetic and propositional proof complexity. In Logic of computation (Marktoberdorf, 1995), volume 157 of NATO Adv. Sci. Inst. Ser. F Comput. Systems Sci., pages 67–121. Springer, Berlin, 1997. [5] Samuel R. Buss, editor. Handbook of proof theory, volume 137 of Studies in Logic and the Foundations of Mathematics. North-Holland Publishing Co., Amsterdam, 1998. [6] Samuel R. Buss and Aleksandar Ignjatovic. Unprovability of consistency statements in fragments of bounded arithmetic. Ann. Pure Appl. Logic, 74(3):221–244, 1995. [7] Peter Clote and Jan Krajı́ček, editors. Arithmetic, proof theory, and computational complexity, volume 23 of Oxford Logic Guides. The Clarendon Press Oxford University Press, New York, 1993. Papers from the conference held in Prague, July 2–5, 1991, Oxford Science Publications. 60 [8] Alan Cobham. The intrinsic computational difficulty of functions. In Logic, Methodology and Philos. Sci. (Proc. 1964 Internat. Congr.), pages 24–30. North-Holland, Amsterdam, 1965. [9] Stephen A. Cook. The complexity of theorem-proving procedures. In STOC ’71: Proceedings of the third annual ACM symposium on Theory of computing, pages 151–158, New York, NY, USA, 1971. ACM. [10] Stephen A. Cook. Feasibly constructive proofs and the propositional calculus (preliminary version). In Seventh Annual ACM Symposium on Theory of Computing (Albuquerque, N.M., 1975), pages 83–97. Assoc. Comput. Mach., New York, 1975. [11] Jan Krajı́ček. Fragments of bounded arithmetic and bounded query classes. Trans. Amer. Math. Soc., 338(2):587–598, 1993. [12] Jan Krajı́ček, Pavel Pudlák, and Gaisi Takeuti. Bounded arithmetic and the polynomial hierarchy. Ann. Pure Appl. Logic, 52(1-2):143–153, 1991. International Symposium on Mathematical Logic and its Applications (Nagoya, 1988). [13] P. Pudlák. A note on bounded arithmetic. Fund. Math., 136(2):85–89, 1990. [14] Boaz Barak Sanjeev Arora. Complexity Theory: A Modern Approach. Cambridge University Press, 2009. [15] Larry J. Stockmeyer. The polynomial-time hierarchy. Theoret. Comput. Sci., 3(1):1–22 (1977), 1976. [16] Gaisi Takeuti. Some relations among systems for bounded arithmetic. In Mathematical logic, pages 139–154. Plenum, New York, 1990. [17] Domenico Zambella. Notes on polynomially bounded arithmetic. J. Symbolic Logic, 61(3):942–966, 1996. 61