Logical operations and invariance

Logical operations and invariance Enrique Casanovas University of Barcelona July 28, 2004. Revised September 8, 2005 Abstract I present a notion of invariance under arbitrary surjective mappings for operators on a relational finite type hierarchy generalizing the so called Tarski-Sher criterion for logicality and I characterize the invariant operators as definable in a fragment of the first-order language. These results are compared with those obtained by Feferman and it is argued that further clarification of the notion of invariance is needed if one wants to use it to characterize logicality. 1 Background The semantical analysis of logical constants stems from the early work of Tarski, Mautner and Mostowski, and has received a lot of attention recently mainly due to the contributions of Sher, McGee, van Benthem, and Feferman. Its main point is the proposal that logicality may be understood as invariance under the most general transformations. In [13] Tarski develops the idea of identifying logical notions with those notions that are invariant under all permutations of the universe of discourse. A similar point of view had been previously but independently maintained by Mautner in [7]. Lindenbaum and Tarski proved in [5] that all logical notions from Principia Mathematica are invariant in this sense. Mostowski in [9] addressed the problem of characterizing quantifiers. He distinguishes between limited and unlimited quantifiers. A limited quantifier is defined on a specific set and it is invariant under all permutations of this set. An unlimited quantifier determines a limited quantifier on each particular set and it is required to be invariant under all bijections between sets. This distinction is used by Gómez-Torrente in [4] to justify the difference between the Tarski criterion, namely invariance under permutations of the universe, and the Mostowski criterion, which is invariance under bijections across universes. Note that in order to be able to apply Mostowski’s criterion we have to assume that the notions and operations we are discussing are defined on every set. Sher [10] requires that a logical constant be defined over all models, not only over all sets. Her criterion for logicality includes, among other conditions, 1 invariance under isomorphisms, which in fact is a weaker condition than mere invariance under bijections. This difference between Sher’s and other previous accounts of logical constancy is frequently neglected. For instance, McGee in [8] and Feferman in [3] identify Sher’s position with the requirement of invariance under bijections for logical operations, which is Mostowski’s criterion according to Gómez-Torrente’s terminology. For a concise presentation of Sher’s views see [11]. The proposal of identifying logical operations and notions with those invariant under bijections across domains has received severe criticism. It is generally acknowledged that invariance is a necessary condition for logicality, but not a sufficient one. One reason for this is the observation that all notions formally defined in terms of cardinalities turn out to be invariant under bijections, whereas their logical character is more than dubious in many cases. Moreover bijections connect only sets of the same size, and consequently invariance under bijections does not exclude an arbitrary behavior on different cardinalities. In words of Machover in his review of Sher’s book in [6]: “ For example, let C be the class whose members are all natural numbers belonging to some non-analytic set, as well as all infinite cardinals α (regarded as initial ordinals) such that 2ℵα = ℵα+1 . Now let Q be a particle of the same syntactic type as the familiar standard quantifiers ∃ and ∀, interpreted as follows: in any domain whose cardinality belongs to C, Q is interpreted as ∃, while in any other domain Q is interpreted as ∀. Clearly, Q under its given interpretation satisfies Sher’s criterion, and hence she would have us regard it as a logical particle.” The defense of the criterion of bijection-invariance has been undertaken mainly by McGee. In [8] he proves, roughly stated, that any bijection-invariant operation can be defined in an infinitary logical language and maintains that “this confirms Tarski’s thesis that an operation invariant under permutations is a logical operation” ([8], page 568). To be more precise, McGee shows that if F is a bijection-invariant operation, then for each cardinal number κ there is a formula ϕk in the infinitary language L∞∞ which characterizes F on sets of cardinality κ. Therefore McGee’s result does not provide a single formula for F , but a different formula ϕκ for each cardinal number κ. McGee’s work seems to Feferman “faultless in its execution” but “blatantly implausible” ([3] page 32). He presents three basic criticisms of McGee’s approach. In his own words ([3] page 37): 1. The thesis assimilates logic to mathematics, more specifically to set theory. 2. The set-theoretical notions involved in explaining the semantics of L∞∞ are not robust. 3. No natural explanation is given by it of what constitutes the same logical operation over arbitrary basic domains. Point 3 is for him the main reason for rejecting McGee’s proposal. As mentioned above, this was also one of the objections of Machover to Sher’s criterion for logicality. Feferman proceeds then to present his own ideas, which try, according to him ([3] page 39), to give an explanation of logical operations “which 2 shows how an operation behaves when applied over one domain M0 connects naturally with how it behaves over any other domain M00 ”. The solution is to investigate invariance under mappings instead of invariance under bijections. He calls it “homomorphism invariance”, which I think it is an unlucky denomination since there is no structure to be preserved and it is really only a matter of surjective mappings from pure sets to pure sets. Feferman characterizes the operations (of monadic type) which are invariant in this sense and he finds out that they are exactly the ones which are definable in the λ-calculus without equality. In other words, the invariant operations are those which can be characterized using only pure first-order logic. There are some precedents of Feferman’s results in Van Benthem [14] but they are of very restricted nature. In the end Feferman is not completely satisfied with his characterization since it excludes equality, and after a brief discussion of the problem of the logical character of equality he says we should give it “a distinguished role in logic even if it should not turn out to be logical on its own under some cross-domain invariance criterion, such as homomorphism” ([3] page 44). 2 Invariance My approximation to the semantic analysis of logical operations has, in its initial steps, many points in common with the one developed by Feferman. Clearly, invariance under bijections across universes is not the more general kind of invariance one can think of. Equality (or better inequality) is invariant under bijections but not necessarily under arbitrary mappings. In mathematics much more general kinds of transformations – which do not respect inequality – are many times considered the natural ones. It is therefore convenient to investigate more general kinds of transformations and to find out which are the notions and operations invariant under them. Unexpectedly, I arrived at very different conclusions than those of Feferman. Invariant notions and operations in my sense turn out to be only a small fragment of all notions and operations definable in first-order logic. In particular, negation, arbitrary conjunctions and universal quantification are not invariant. On the other hand it follows from my results that some particular forms of equality are invariant. Some part of my work in the last part of this paper has been devoted to try to understand how we arrive at so distant conclusions starting with the same assumptions and very close intuitions. Feferman’s analysis takes place in a finite type hierarchy of universes similar to the one used first by van Benthem. Just to simplify the proofs, Feferman replaces relational types by functional types and replaces relations by their characteristic functions. At first sight one can think that his replacement is innocuous, but in fact it has serious consequences. As I show in section 7, Feferman’s invariance notion does not correspond to invariance under surjective mappings when translated back into the relational framework. His definition of invariance seems to be very reasonable in the functional type hierarchy but it is fairly unnatural when restated in the relational setting. As pointed out 3 in section 7, it is in fact a different notion of invariance than the one here investigated. For the first levels of the type hierarchy it coincides with invariance under preimages of arbitrary mappings. This is a capital point to explain the difference in conclusions between Feferman’s analysis and mine. If one does not make the simplification of replacing relations by their characteristic functions, one is forced to deal with the notion of invariance I am discussing here. The fact that invariance splits into several notions, some stronger than others, and that accordingly different criteria for logicality appear, should perhaps be taken as a sign that the logical character of notions and operations from a semantical point of view is a matter of perspective and a matter of degree. Even outside the semantical framework, the idea that classifying a notion as a logical notion is a question of more or less has been considered by some authors. For instance, Byrd discusses in [1] the position that the conditional and the universal quantifier play a prominent role among logical concepts and Warmbrōd in [15] proposes a distinction between a secure logical theory with a minimal set of notions and an extended setting where a great variety of notions is allowed. According to my notion of invariance, existential quantification and disjunction are particularly strong whereas negation, conjunction and universal quantification are not. I realize that it is not easy to accept than universal quantification and conjunction are less logical that existencial quantification and disjunction. But according to my results there is a natural sense in which they are less robust. It is not my purpose here to support any particular point of view concerning the logical character of abstract operations. My analysis can be used to show that the semantic characterization based on invariance under all possible transformations leads to absurdities. It can also be used to support the view that in some respect some operations are more logical than others. I think that there is still a need for exploration. Several notions of invariance have appeared and some other may still be relevant. My research shows that the notion of invariance is not as clear as one could have thought. Moreover the previous proposals have been only partially developed. Feferman has been able to characterize invariant operations only of a restricted class. They are operations transforming a tuple consisting of n subsets of a set M on a m-ary relation on M . It would be interesting to see whether his description generalizes to more general operations transforming tuples of mi -ary relations (for 1 ≤ i ≤ n) on m-ary relations, which is the most reasonable setting. In this article all the mapping-invariant (in my sense) operations of these more general kinds are investigated. I describe with some detail these operations and I show they are just the operations definable in a specific fragment of first-order logic. I do not claim that they constitute the answer to the question of which are the logical operations. My approach to invariance is influenced by the fact that in algebra the most general kind of transformation usually considered is not isomorphism but homomorphism. On the contrary to what happens with the relation of being an isomorphic image, the relation of being an homomorphic image is not symmetric and transitive. Thus, one need to construct its symmetric and transitive closure in order to obtain an equivalence relation analogous to isomorphism. These topics are studied in [2] with the main motivation of developing the basic model4 theoretic aspects of first-order logic without equality. The equivalence relation among structures obtained as the symmetric and transitive closure of being an homomorphic image is called relativeness in [2]. “Being relative” is the natural substitute for “being isomorphic” when one does not care of preserving equalities and inequalities. In the case we are interested here, only pure sets are present –we do not deal with arbitrary models– and therefore one has to work with bijections in place of isomorphisms and with surjective mappings in place of homomorphisms. It is a particular case of the general situation studied in [2], namely the case in which all structures considered have empty similarity type. Feferman in [3] is also concerned with the equivalence relation characterized as the symmetric and transitive closure of being an image under an arbitrary surjective mapping and he chooses the name similarity to refer to it. Thus, a similarity is a finite composition of surjective mappings and inverses of surjective mappings and one says that two sets are similar when there is a similarity between them. Of course, any two nonempty sets are similar in this sense, but the important tool is not the similarity relation itself but invariance under every particular similarity. Similarity is just relativeness particularized to the case of structures which are only pure sets. Here we will follow Feferman’s terminology. Of course, we can apply to the similarity relation all results obtained in [2] for the relativeness relation. This has been very helpful for my understanding of the situation. I now describe with some detail the content of the paper. We will look at notions and operations defined simultaneously over any nonempty set. Over any such set we will consider the relational and the functional finite type hierarchy of universes and we will show how different transformations of the basic sets can be extended to upper levels. In section 3, after fixing terminology, I define mapping-invariance and I clarify some points concerning invariance under mappings and under similarities. In section 4, I characterize the mapping-invariant objects at the lower levels of the relational type hierarchy. In sections 5 and 6, I present my main results: the characterization of all mapping-invariant operators that transform a tuple of relations among individuals of different arities on a relation among individuals. The particular case of operators transforming m-ary relations on n-ary relations is studied in detail in section 5. Then in section 6 I need only to indicate how to generalize the results from section 5 to the broader setting. Sections 7 and 8 are devoted to compare my approach with that of Feferman. In section 7, I introduce the finite functional type hierarchy and I prove the equivalence between similarity-invariance and homomorphism-invariance in the sense of Feferman [3]. In section 8, Feferman’s proposal is presented, and I translated it back into the relational setting. In the finite relational type hierarchy I define what I call Feferman invariance and I show that an operator is Feferman invariant if and only if it is invariant in the sense of Feferman when translated into the finite functional type framework. Then a last notion of invariance is introduced, the notion of preimage-invariance and I show that it is the same as Feferman invariance at the lower levels. This explains why every first-order definable operation is invariant according to Feferman but it is not 5 according to my investigation: his transformations of the universe are the result of taking preimages and as is well-known they preserve intersections and complements while taking images do not. The notation is mainly standard. P(X) denotes the power set of X. If R ⊆ M n and 1 ≤ j ≤ n we will use the notation fieldj (R) = {a ∈ M : (a1 , . . . , an ) ∈ R for some a1 , . . . , an such that aj = a}. Thus, for example, if R is a binary relation, then n = 2, field1 (R) = dom (R), and field2 (R) = rng (R). The union of fieldj (R) for all j is field (R). R−1 is used for the inverse of a relation R even in the case that R happens to be a non necessarily one-to-one mapping. If R and S are relations then their relational product or composition will be R ◦ S = {(a, b) : for some c, R(a, c) and S(c, b)}. If R or S is a mapping we still use this notation, identifying the mapping with its graph. Thus f ◦ g is not defined by f ◦ g(x) = f (g(x)). 3 Mapping-invariance The relational finite types are generated from the basic type 0 by the rule: if τ1 , . . . , τn are relational finite types, then (τ1 , . . . , τn ) is a relational finite type. We will call them in short just relational types. If M is a nonempty set and τ a relational type, the τ -universe of M is the set Mτ defined recursively as follows: 1. M0 = M . 2. M(τ1 ,...,τn ) = P(Mτ1 × . . . × Mτn ). When using the notation Mτ , we will always assume that M is nonempty. The type τ = (τ1 , . . . , τn ) where τi = 0 for every i = 1, . . . , n will be denoted by 0n . Hence M0n = P(M n ). Let f : M → N be a mapping and let τ be a relational type. The mapping f can be extended in a very natural manner to a mapping from the τ -universe Mτ in the τ -universe Nτ . The mappings fτ : Mτ → Nτ are defined recursively by the following rules: 1. f0 = f . 2. If τ = (τ1 , . . . , τn ), the mapping fτ : Mτ → Nτ is given by fτ (a) = {(fτ1 (a1 ), . . . , fτn (an )) : (a1 , . . . , an ) ∈ a} for all a ⊆ Mτ1 × . . . × Mτn . 6 Observe that each fτ is onto if f is onto, and that fτ is one-to-one if f is oneto-one. Sometimes we will write f instead of fτ when the context does not lead to confusion. Compositions of surjective mappings in every possible direction produce the notion of similarity. A relation π ⊆ M × N is a similarity relation between M and N if and only if for some n ≥ 2 there are sets M1 , . . . , Mn and mappings f1 , . . . , fn−1 such that M = M1 , N = Mn and for every i = 1, . . . , n, fi is a mapping from Mi onto Mi+1 or it is a mapping from Mi+1 onto Mi , and π is the relational product or composition R1 ◦ . . . ◦ Rn−1 where Ri = fi if fi is from Mi onto Mi+1 and Ri = fi−1 if it is from Mi+1 onto Mi . The following is easily proven by induction on the length n of the sequence of sets M1 , . . . , Mn in the definition of similarity. Moreover it follows from the proof of Proposition 2.6 in [2] since a similarity is just a relativeness relation for the case of the empty language. Fact 3.1. A binary relation π ⊆ M × N is a similarity relation between M and N if and only if for every a ∈ M there is some b ∈ N such that π(a, b) and for every b ∈ N there is some a ∈ M such that π(a, b). In other words, if dom π = M and rng π = N . A similarity relation π between M and N is also naturally extensible to a similarity relation πτ between the τ -universes Mτ and Nτ . Assume that there are sets M1 , . . . , Mn and mappings f1 , . . . , fn−1 such that M = M1 , N = Mn and for every i = 1, . . . , n, fi is a mapping from Mi onto Mi+1 or it is a mapping from Mi+1 onto Mi , and π is the relational product R1 ◦. . .◦Rn−1 where Ri = fi if fi is from Mi onto Mi+1 and Ri = fi−1 if it is from Mi+1 onto Mi . Then πτ is the relational product R1τ ◦ . . . ◦ Rn−1 τ where Riτ = fi τ if fi is from Mi onto Mi+1 and Riτ = fi −1 τ if it is from Mi+1 onto Mi . Remark 3.2. Let π a similarity relation between M and N . The induced similarity πτ ⊆ Mτ × Nτ can also be defined inductively starting from π0 = π by the rule that if τ = (τ1 , . . . , τn ) is a relational type, then the similarity πτ between Mτ and Nτ is given by: πτ (a, b) if and only if 1. for each (a1 , . . . , an ) ∈ a there is a tuple (b1 , . . . , bn ) ∈ b such that πτi (ai , bi ) for each i = 1, . . . , n, and 2. for each (b1 , . . . , bn ) ∈ b there is a tuple (a1 , . . . , an ) ∈ a such that πτi (ai , bi ) for each i = 1, . . . , n. We will be concerned with the invariance of objects and operators. Let τ be a relational type. An object of type τ is a function a which associates with every nonempty set M a corresponding element aM ∈ Mτ . We will say that the object a is mapping-invariant if and only if for every M, N and every surjective mapping f : M → N , f (aM ) = aN . Equivalently, a is mapping-invariant if and only if for every M, N , for every similarity relation π between M and N , π(aM , aN ). We call the object a bijection-invariant if and only if for every M, N and every bijection f : M → N , fτ (aM ) = aN . 7 Let τ1 , . . . , τn , τ be relational types. A (τ1 , . . . , τn ; τ )-ary operator is a function F which for each nonempty set M gives a mapping FM : Mτ1 × . . . × Mτn → Mτ . Sometimes we omit the subscript in FM to simplify notation if this is not misleading. We call F mapping-invariant if for all M, N and every mapping f : M → N from M onto N , for all a1 ∈ Mτ1 , . . . , an ∈ Mτn , fτ (FM (a1 , . . . , an )) = FN (fτ1 (a1 ), . . . , fτn (an )). It is easy to check that a (τ1 , . . . , τn ; τ )-ary operator F is mapping-invariant if and only if for all M, N and every similarity relation π between M and N , for all a1 ∈ Mτ1 , . . . , an ∈ Mτn , and for all b1 ∈ Nτ1 , . . . , bn ∈ Nτn , if πτi (ai , bi ) for all i = 1, . . . , n, then πτ (FM (a1 , . . . , an ), FN (b1 , . . . , bn )). Following more customary terminology, a (0m1 , . . . , 0mr ; 0n )-ary operator will be also called (m1 , . . . , mr , n)-ary. As mentioned in the introduction, Tarski in [13] talks of invariance under permutations of a fixed universe and not of invariance under bijections across universes. We may say that the element b ∈ Mτ is permutation-invariant if and only if for every permutation f of M , fτ (b) = b. Clearly, if a τ -object a is bijection-invariant, then for every set M , the corresponding element aM ∈ Mτ is permutation-invariant. On the other hand it is easy to show that for each set M and each permutation-invariant element b ∈ Mτ we can find a bijectioninvariant τ -object a such that aM = b. Hence both criteria of invariance coincide when applied to a particular universe. 4 Mapping-invariant objects It is clear that there are no bijection-invariant objects of type 0. This was observed by Tarski in [13] for permutation invariance. He also noticed that the only permutation-invariant objects of type (0) are the empty set and the universe and more generally, that there are only finitely many permutationinvariant objects of type 0n . For instance, the permutation-invariant objects of type 02 = (0, 0) are the empty relation, the universal relation, the identity and its complement. The same happens for bijection-invariance. It is not difficult to see that the bijection-invariant objects of type 0n are the n-ary relations first-order definable in the language of pure equality. But the defining formula might be not uniform for all cardinalities. Fact 4.1. An object a of type 0n is bijection-invariant if and only if for each cardinal κ there is a first-order formula ϕκ = ϕκ (x1 , . . . , xn ) in the language of pure equality such that for each set M of cardinality κ, aM = {(a1 , . . . , an ) ∈ M n : M |= ϕ(a1 , . . . , an )}. Mapping-invariance is a more restrictive condition. Of course, there is no mapping-invariant object of type 0. Concerning objects of type 0n , the basic 8 difference with respect to bijection-invariance is that intersections and complements of mapping-invariant relations are not necessarily mapping-invariant. Equality is mapping-invariant, but inequality is not. Theorem 4.2. An object a of type 0n is mapping-invariant if and only if it is the empty object (aM = ∅ for every set M ) or the universe (aM = M n for every M ) or it is uniformly a union of objects dI for I ⊆ {1, . . . , n}, where dIM = {(a1 , . . . , an ) ∈ M n : ai = aj for all i, j ∈ I} for every set M . Proof. It is clear that all these objects are mapping-invariant. Let a be a mapping-invariant object. We will show that a is of the prescribed form. If aM = ∅ then aN = ∅ for any other set N , since if π is a similarity relation, π(∅, aN ) implies aN = ∅. I claim that if dIM ∩ aM 6= ∅ for a set M such that |M | > n, then for any set N , dIN ⊆ aN . From this it will follow that a is a union of objects dI . This includes the case aM = M n , which corresponds to |I| ≤ 1. Assume |M | > n and let (a1 , . . . , an ) ∈ dIM ∩ aM and (b1 , . . . , bn ) ∈ dIN . We now show that (b1 , . . . , bn ) ∈ aN . Notice that π = {(ai , bi ) : i = 1, . . . , n} ∪ ((M r {a1 , . . . , an }) × N ) is a similarity relation between M and N . By mapping invariance of a, there is a tuple (c1 , . . . , cn ) ∈ aN such that π((a1 , . . . , an ), (c1 , . . . , cn )). By construction of π we see that (c1 , . . . , cn ) = (b1 , . . . , bn ). Hence (b1 , . . . , bn ) ∈ aN . Invariant objects of type ((0)) are closely related to quantifiers. Tarski observed that the permutation-invariant objects in this type are the “properties concerning the number of elements of these classes”. Cardinality also plays a role for the mapping-invariant objects of type ((0)), but essentially as upper bound. Theorem 4.3. An object a of type ((0)) is mapping-invariant if and only if a is the empty object or it is uniformly a union of the following objects: 1. the object e such that for every set M , eM = {∅}, 2. the object u such that for every set M , uM = {M }, 3. the object p such that for every set M , pM = P(M ) r {∅}, 4. for some cardinal number κ, the object bκ such that for every set M , bκM = {A ⊆ M : A 6= ∅ and |A| < κ}. Proof. It is easy to check that all the described objects are mapping-invariant. We prove now that all the mapping-invariant objects are of this kind. If aM = ∅ then for any other set N , aN = ∅. Similarly, if ∅ ∈ aM then ∅ ∈ aN for every N . Now assume aM has nonempty elements different from M and let κ be a cardinal such that for each cardinal µ smaller than κ, aM has such elements of cardinality µ. We prove that in this case, for any set N , bκN ⊆ aN . Let B ⊆ N be nonempty and of cardinality strictly less than κ. Choose a set A ∈ aM of cardinality ≥ |B| and such that A 6= M , choose a surjective mapping f : A → B such that f (A) = B, and put π = f ∪ ((M r A) × N ). It is a similarity 9 relation between M and N and hence, by mapping-invariance of a, π(A, C) for some C ∈ aN . By choice of π it follows that C = B. Thus B ∈ aN . If for arbitrarily large κ there are sets N such that bκN ⊆ aN , then clearly for every N , aN = P(N ) or for every N , aN = P(N ) r {∅}. Otherwise, let κ be the least cardinal such that for some set N , bκN 6⊆ aN . Then for every µ < κ, for every set N , bµN ⊆ aN . Obviously, κ is a successor cardinal number, say κ = λ+ . Moreover A 6∈ aN if A is a subset of N such that A 6= N and |A| ≥ λ. If there is not a set M such that |M | ≥ λ and M ∈ aM then either aN = bλN for every N or aN = bλN ∪ {∅} for every N . Assume now that there is a set M such that |M | ≥ λ and M ∈ aM . We prove that N ∈ aN for every N . From this it will follow that either aN = bλN ∪ {N } for every N or aN = bλN ∪ {∅} ∪ {N } for every N . Consider an arbitrary set N . If |N | < λ, then clearly N ∈ aN . In case |M | = |N | ≥ λ, there is a bijection f between M and N and we put π = f . If M , N have different cardinality ≥ λ we may assume that |M | > |N | ≥ λ, we choose A ⊆ M and B ⊆ N of the same cardinality ≥ λ such that A 6= M , we choose a bijection f between A and B, and we put π = f ∪ ((M r A) × N ). In either case π is a similarity relation between M and N and there is a set C ∈ aN such that π(M, C) and it follows that N = C. Hence N ∈ aN . The last case to be considered is when there is no set M such that aM has a nonempty element A 6= M . Then aM ⊆ {∅, M } for every M . It is easy to see that either aM = ∅ for every M , or aM = {M } for every M , or aM = {∅, M } for every M . After Theorem 4.3 one might think that all quantifiers ∃<κ xϕ(x) are at our disposal in constructing mapping-invariant operators and in defining mappinginvariant objects. But this is not the case. It is misleading to restrict the notion of a quantifier over a set M to a specification of a collection of subsets of M . This might be enough to quantify formulas with just one free variable. Quantification of a formula with more free variables involves transformation of n + 1-ary relations on n-ary relations. In the next section I will characterize all mapping-invariant transformations of this kind. The quantifier ∃≤1 is clearly not among them. A similar situation arises with respect to the logical connectives. In a footnote Corcoran, the editor of [13], explains that during the Buffalo lecture that originated the paper [13] Tarski indicated that truth-functions can be accommodated in the relational type hierarchy over the universe M by identifying the truth values T and F with the universe M and the empty set. With this identification every binary truth-function is an element of M((0),(0),(0)) and hence we can discuss its invariance. It is easy to check that all truth-function are mappinginvariant objects in this sense. However this is again misleading. One thing is conjunction acting on sentences and a completely different thing is conjunction acting on formulas with free variables. Conjunction of formulas with free variables involves the operation of intersection, which is not mapping-invariant. 10 5 Characterizing mapping-invariant (m, n)-ary operators This section is devoted to the characterization of all (m, n)-ary mapping-invariant operators. I start fixing some basic operators and fixing also some generating operations to obtain more complex operators. I will show that all mappinginvariant operators are generated with these operations from the basic ones. The (m, n)-ary operators are the following ones: m m 1. The constant (m, 1)-ary operators C> and C⊥ whose actions on R ⊆ M m are given by m (a) C> (R) = M m (b) C⊥ (R) = ∅ m 2. The (m, n · m)-ary diagonal operator ∆m n such that for any R ⊆ M , ∆m n (R) = {n × a : a ∈ R} where n × a is the n-fold concatenation of the tuple a, that is, n × a = (b1 , . . . , bn·m ) if a = (a1 , . . . , am ) and bk·m+i = ai for 1 ≤ i ≤ m. 3. The (m, m−1)-ary i-projection operator Πm i (where m ≥ 2 and 1 ≤ i ≤ m) such that for any R ⊆ M m , Πm i (R) = {(a1 , . . . , ai−1 , ai+1 , . . . , am ) : (a1 , . . . , am ) ∈ R}. 4. For any σ ∈ Sym{1, . . . , m}, the (m, m)-ary permutation operator Pσ such that for any R ⊆ M m , Pσ (R) = {(aσ(1) , . . . , aσ(n) ) : (a1 , . . . , an ) ∈ R}. The generating operations for operators are the following ones: 1. Product. If F is (m, n1 )-ary and G is (m, n2 )-ary, the product F × G is the (m, n1 + n2 )-operator such that for any R ⊆ M m , F × G(R) = F (R) × G(R). 2. Sum. If F and G are (m, n)-ary operators, the sum F ∪G is the (m, n)-ary operator such that for any R ⊆ M m , F ∪ G(R) = F (R) ∪ G(R). 3. Composition. If F is (m, n)-ary and G is (n, k)-ary, the composition of F and G is the (m, k)-ary operator G ◦ F such that for any R ⊆ M m , G ◦ F (R) = G(F (R)). 11 Operators generated from projections, diagonals and permutations by composition will be called intern. Remark 5.1. A (m, n)-ary operator F is intern if and only if there is a map σ : {1, . . . , n} → {1, . . . , m} such that for any R ⊆ M m , F (R) = {(a1 , . . . , an ) : for some (a01 . . . , a0m ) ∈ R, ai = a0σ(i) for all i = 1, . . . , n} For the next lemma we need the technical notion of free system. Let R ⊆ M m . We call (M, R) a free system if R is infinite, for all a ∈ M there is at most one tuple (a1 , . . . , am ) ∈ R such that a = ai for some i and, finally, for all (a1 , . . . , am ) ∈ R, ai 6= aj if i 6= j. Lemma 5.2. Let F be a (m, n)-ary mapping-invariant operator. Let R ⊆ M m and assume (M, R) is a free system. For any (a1 , . . . , an ) ∈ FM (R) there are a decomposition ˙ k {1, . . . , n} = I1 ∪˙ . . . ∪I and operators F1 , . . . , Fk such that 1. Fl is (m, nl )-ary, where nl = |Il |. 2. If i ∈ Il and ai = aj , then j ∈ Il . 3. Fl is intern if {ai : i ∈ Il } has more than one element or if it has just one element a and a ∈ fieldj (R) for some j. 4. If {ai : i ∈ Il } has only one element a and a 6∈ field (R), then Fl = m ∆1nl ◦ C> , that is Fl (S) = {nl × a : a ∈ N } for all S ⊆ N m . 5. (ai : i ∈ Il ) ∈ Fl (R). 6. There is a σ ∈ Sym{1, . . . , m} such that for all S ⊆ N m , if (N, S) is a free system, then Pσ (F1 (S) × . . . × Fk (S)) ⊆ F (S). Proof. If a1 ∈ field (R), we choose as I1 a maximal subset of {1, . . . , m} for which 1 ∈ I1 and there is an intern (m, n1 )-ary operator G such that (aj : j ∈ I1 ) ∈ G(R), and we put F1 = G. If a1 6∈ field (R), we take I1 = {i : a1 = ai } m and we put F1 = ∆1n1 ◦ C> . In either case (ai : i ∈ I1 ) ∈ F1 (R) and also j ∈ I1 whenever aj = ai for some i ∈ I1 . Now let l be the first element of {1, . . . , m} r I1 . Again, if al ∈ field (R) we choose as I2 a maximal subset of {1, . . . , m} r I1 for which l ∈ I2 and there is an intern (m, n2 )-ary operator G such that (ai : i ∈ I2 ) ∈ G(R), and we put F2 = G. In case al 6∈ field (R) m we set as before I2 = {i : al = ai } and F2 = ∆1n2 ◦ C> . This procedure eventually ends and the decomposition is finished. We will show that it verifies 6. Assume S ⊆ N m , (N, S) is a free system, and (b1 , . . . , bn ) is a tuple such that (bi : i ∈ Il ) ∈ Fl (S) for all l = 1, . . . , k. We will prove that (b1 , . . . , bn ) ∈ F (S). a a The permutation σ will be the natural enumeration of the sequence sa 1 s2 . . . sk 12 where every sl is the enumeration of Il in increasing order. Let J be the set of all l ∈ {1, . . . , k} which do not fall under case 4, that is, all l for which Fl is intern. If l ∈ J, we can find a mapping σl : Il → {1, . . . , m} such that for all T ⊆ K m, Fl (T ) = {(ci : i ∈ Il ) : for some (c01 , . . . , c0m ) ∈ T, ci = c0σl (i) for all i ∈ Il } and we can find (al1 , . . . , aln ) ∈ R and (bl1 , . . . , bln ) ∈ S such that ai = alσl (i) and bi = blσl (i) for all i ∈ Il . For l 6∈ J we choose al and bl such that ai = al and bi = bl for all i ∈ Il . Let L be the set of all l ∈ {1, . . . , k} r J such that bl ∈ field (S). For l ∈ L choose now a tuple (bl1 , . . . , blm ) ∈ S such that bl = bli for some i and choose arbitrary (al1 , . . . , alm ) ∈ R with the only requirement that these tuples are all different and also different from the chosen for l ∈ J. This is possible because R is infinite. Let A = M r ({ali : 1 ≤ i ≤ m and l ∈ J ∪ L} ∪ {al : l 6∈ J}) and B = N r ({bli : 1 ≤ i ≤ m and l ∈ J ∪ L} ∪ {bl : l 6∈ L}). Since R is infinite, A and B are nonempty sets. Let π = (A × B) ∪ {(ali , bli ) : 1 ≤ i ≤ m and l ∈ J ∪ L} ∪ {(al , bl ) : l ∈ {1, . . . , k} r J}. It is a similarity relation between M and N and moreover π(R, S). By mappinginvariance of F , π(F (R), F (S)). Hence (c1 , . . . , cn ) ∈ F (S) for some c1 , . . . , cn 0 such that π(ai , ci ) for all i = 1, . . . , n. By construction, ali 6= ali0 for (l, i) 6= (l0 , i0 ) 0 0 and also ali 6= al for all i, l, l0 and al 6= al for all l 6= l0 . Hence (c1 , . . . , cn ) = (b1 , . . . , bn ). Lemma 5.3. Let F be a (m, n)-ary mapping-invariant operator. Assume M is infinite. For any (a1 , . . . , an ) ∈ FM (∅) there are a decomposition ˙ k {1, . . . , n} = I1 ∪˙ . . . ∪I and operators F1 , . . . , Fk such that m 1. Fl = ∆1nl ◦ C> , where nl = |Il |. 2. If i ∈ Il and ai = aj , then j ∈ Il . 3. (ai : i ∈ Il ) ∈ Fl (∅). 4. There is a σ ∈ Sym{1, . . . , m} such that for any infinite set N , Pσ (F1N (∅) × . . . × FkN (∅)) ⊆ FN (∅). Proof. It is basically the same reasoning as in the proof of Lemma 5.2 except that intern operators are not needed. Given (b1 , . . . , bn ) such that (bi : i ∈ Il ) ∈ FlN (∅) for each l = 1, . . . , k we have to show that (b1 , . . . , bn ) ∈ FN (∅). Once al and bl are chosen, define A = M r {al : l = 1, . . . , k}, B = N r {bl : l = 1, . . . , k} and π = (A × B) ∪ {(al , bl ) : l = 1, . . . , k}. Since M and N 13 are infinite, A and B are nonempty sets and π is a similarity relation between M and N . Clearly π(∅, ∅), and by mapping-invariance π(FM (∅), FN (∅)). Then π((a1 , . . . , an ), (c1 , . . . , cn )) for some (c1 , . . . , cn ) ∈ FN (∅). It follows that (c1 , . . . , cn ) = (b1 , . . . , bn ). Lemma 5.4. For all R, M such that R ⊆ M m and R 6= ∅, there are S, N, f such that S ⊆ N m , (N, S) is a free system and f : N → M is a mapping from N onto M such that f (S) = R. Proof. Choose an enumeration {(ai1 , . . . , aim ) : i ∈ I} of R whose index set I is infinite and choose an enumeration (bi : i ∈ J) of M r field (R) such that J ∩ (I × {1, . . . , m}) = ∅. Let N = J ∪ (I × {1, . . . , m}) and let f : N → M be the mapping defined by f (i) = bi for i ∈ J and f ((i, j)) = aij for (i, j) ∈ I × {1, . . . , m}. If S = f −1 (R), then (N, S) is a free system, f is surjective and f (S) = R. Lemma 5.5. Let F be an (m, n)-ary mapping-invariant operator. Let R ⊆ M m , S ⊆ N m and assume f : N → M is a surjective mapping such that f (S) = R. Then 1. f (FN (S)) = FM (R). 2. If G is also an (m, n)-ary mapping-invariant operator and FN (S) = GN (S), then FM (R) = GM (R). Proof. 1 is clear, since f is a similarity relation connecting R and S. 2 follows from 1 since FM (R) = f (FN (S)) = f (GN (S)) = GM (R). Theorem 5.6. An (m, n)-ary operator F is mapping-invariant if and only if there are two (m, n)-ary operators G, H generated from the basic operators by sum, product and composition and such that: 1. For all M , FM (∅) = GM (∅). 2. For all M and all nonempty R ⊆ M m , FM (R) = HM (R). Proof. Let us consider first the case of nonempty arguments. By lemmas 5.4 and 5.5 we may restrict our attention to free systems. By Lemma 5.2 if (M, R) is a free system, then FM (R) is a union of components, where a component is a m permutation of products of intern operators and operators of the form ∆1nl ◦C> . Since there are only finitely many choices for components, it is in fact a finite union of them. If we take all components for F arising in all free (M, R), by point 6 of Lemma 5.2 we see that in fact their union works for obtaining the value of F in all free (M, R). The case of the empty set is similar but using Lemma 5.3 and observing that for any M there are N, f such that N is infinite and f : N → M is onto. 14 Definition 5.1. Let P be a m-ary relation symbol and let ϕ = ϕ(x1 , . . . , xn ) be a first-order formula having P as its only extralogical symbol. The (m, n)-operator Gϕ attached to ϕ is defined as follows: for any nonempty M and R ⊆ M m , n Gϕ M (R) = {(a1 , . . . , an ) ∈ M : (M, R) |= ϕ(a1 , . . . , an )}. Definition 5.2. Consider the first-order language L whose only extralogical symbol is the m-ary relation symbol P . For convenience we assume that > and ⊥ are formulas. The mapping-invariant formulas are all L-formulas generated with the following rules: 1. > and ⊥ are mapping-invariant. 2. P (y1 , . . . , ym ) is mapping-invariant for any distinct variables y1 , . . . , ym . 3. ¬∃y1 . . . ym P (y1 , . . . , ym ) is mapping-invariant for any distinct variables y1 , . . . , y m . 4. If ϕ, ψ are mapping-invariant, then (ϕ ∨ ψ) is mapping-invariant. 5. If ϕ, ψ are mapping-invariant and have no common free variable, then (ϕ ∧ ψ) is mapping-invariant. 6. If ϕ = ϕ(x1 , . . . , xn ) is mapping-invariant and y is different from all the variables x1 , . . . , xn , then for all i = 1, . . . , n, (ϕ(x1 , . . . , xn ) ∧ xi = y) is mapping-invariant 7. If ϕ is mapping-invariant, then ∃xϕ is mapping-invariant. Observe that > and ⊥ are dispensable since we can replace them respectively by the mapping-invariant formulas (ψ ∨ ¬ψ) and (ψ ∧ ¬ψ) where ψ = ∃y1 . . . ym P (y1 , . . . , ym ). Theorem 5.7. An (m, n)-ary operator F is mapping-invariant if and only if F = Gϕ for some mapping-invariant formula ϕ = ϕ(x1 , . . . , xn ) in a language whose only extralogical symbol is the m-ary relation symbol P . Proof. It is easy to check that by induction on ϕ that all operators Gϕ for mapping-invariant ϕ are mapping-invariant. For the rest we first need to show that any operator generated by sum, product and composition from the basic ones is definable by a mapping-invariant formula. This can be easily done by induction on the length of the generating sequence of the operator. Now using Theorem 5.6 we see that there are two mapping-invariant formulas ψ(x1 , . . . , xn ) and χ(x1 , . . . , xn ) such that for all M , FM (∅) = Gψ M (∅) and for all M and all nonempty R ⊆ M m , FM (R) = GχM (R). Let ϕ(x1 , . . . , xn ) be the formula (ψ(x1 , . . . , xn ) ∧ ¬∃yP (y)) ∨ (χ(x1 , . . . , xn ) ∧ ∃yP (y)) where y = y1 , . . . , ym is a tuple of distinct variables. It is a mapping-invariant ψ ϕ χ formula and clearly Gϕ M (∅) = GM (∅) while for nonempty R, GM (R) = GM (R). ϕ Hence F = G . 15 6 More general operators Recall that an (m1 , . . . , mr , n)-ary operator is an operator F which for every nonempty set M gives a mapping FM : P(M m1 ) × . . . × P(M mr ) → P(M n ). Sometimes I will call operators with these arities general operators. Here I characterize the mapping-invariant general operators. The methods are closely parallel to the ones used in the previous section. Therefore I will state the results and give only brief indications about the proofs. The basic general operators are the basic operators introduced in the previous section and N 1. The product. It is the (m, n, m + n)-ary operator such that for all M , N and all R ⊆ M m and S ⊆ M n , M (R, S) = R × S. S 2. The sum. It is the (m, m, m)-ary operator such that for all M , and all S R ⊆ M m and S ⊆ M m , M (R, S) = R ∪ S. The general composition is the operation which for any (m1 , . . . , ms , n)-ary operator F and any (mi1 , . . . , miri , mi )-ary operators Fi (where i = 1, . . . , s), gives an (m11 , . . . , m1r1 , m21 , . . . , m2r2 , . . . , ms1 , . . . , msrs , n)-ary operator compF1 ,...,Fs ,F such i that for every set M and all subsets Rji ⊆ M mj (i = 1, . . . , s and j = 1, . . . , ri ), 1 s compF1 ,...,Fs ,F (R11 , . . . , Rrss ) = F (F1 (R11 , . . . , Rm ), . . . , Fs (R1s , . . . , Rm )) r1 rs A general operator is intern if it is generated from projections, diagonals, and permutations by general composition. I generalize now the technical notion of free system to make it useful for general operators. Let R1 ⊆ M m1 , . . . , Rr ⊆ M mr . We say that (M, R1 , . . . , Rr ) is a general free system if each (M, Ri ) is a free system and the Ri have disjoint fields. Lemma 6.1. Let F be a (m1 , . . . , mr , n)-ary mapping-invariant operator. Let R1 ⊆ M m1 , . . . , Rr ⊆ M mr , let L ⊆ {1, . . . , r} and assume (M, Ri )i∈L is a general free system and Ri = ∅ for i 6∈ L. For any (a1 , . . . , an ) ∈ F (R1 , . . . , Rr ) there is a decomposition ˙ k11 ∪˙ . . . ∪I ˙ 1r ∪˙ . . . ∪I ˙ krr {1, . . . , n} = I11 ∪˙ . . . ∪I and operators F11 , . . . , Fk11 , . . . , F1r , . . . , Fkrr such that 1. Flp is (mp , nl )-ary, where nl = |Ilp |. 2. If i ∈ Ilp and ai = aj , then j ∈ Il . 3. Flp is intern if {ai : i ∈ Ilp } has more than one element or it has just one element a and a ∈ fieldj (Rl ) for some j. 16 4. If {ai : i ∈ Ilp } has only one element a and a 6∈ field (R), then Flp = m ∆1nl ◦ C> p . 5. (ai : i ∈ Ilp ) ∈ Flp (Rl ). 6. There is a σ ∈ Sym{1, . . . , m1 , m1 + 1, . . . , m1 + m2 + . . . + mr } such that for all S1 ⊆ N m1 , . . . , Sr ⊆ N mr , if (N, Si )i∈L is a general free system and Si = ∅ for i 6∈ L, then 1 r Pσ (F11 (S1 )×. . .×Fm (S1 )×. . .×F1r (Sr )×. . .×Fm (Sr )) ⊆ F (S1 , . . . , Sr ). 1 r Proof. The fact that in a general free system (M, Ri )i∈L all the relations Ri have disjoint fields allows us to adapt the proof of Lemma 5.2 in a straightforward way to this setting. The details are left to the reader. Theorem 6.2. Let F be an (m1 , . . . , mr , n)-ary operator. Then F is mappinginvariant if and only if for each L ⊆ {1, . . . , r} there is an (m1 , . . . , mr , n)-ary operators GL generated from the general basic operators by general composition and such that for any set M and all R1 ⊆ M m1 , . . . , Rr ⊆ M mr such that Ri 6= ∅ if and only if i ∈ L, F (R1 , . . . , Rr ) = GL (R1 , . . . , Rr ). Proof. General free systems satisfy the corresponding version of Lemma 5.4 and therefore can be used in the same fashion as free systems were used in the proof of Theorem 5.6. For the case L = ∅ one has to use an infinite set M instead of a general free system. Definition 6.1. Let P1 , . . . , Pr be relation symbols (where Pi is mi -ary) and let ϕ = ϕ(x1 , . . . , xn ) be a first-order formula having P1 , . . . , Pr as its only extralogical symbol. The (m1 , . . . , mr , n)-operator Gϕ attached to ϕ is defined as follows: for any nonempty M and Ri ⊆ M mi for i = 1, . . . , r, n Gϕ M (R1 , . . . , Rr ) = {(a1 , . . . , an ) ∈ M : (M, R1 , . . . , Rr ) |= ϕ(a1 , . . . , an )}. The mapping-invariant formulas of the first order language whose extralogical symbols are P1 , . . . , Pr are all formulas generated with the rules described in the previous section except that instead of rules 2 and 3 we now have: 2 Pi (y1 , . . . , ymi ) is mapping-invariant for all distinct y1 , . . . , ymi for all i = 1, . . . , r. 3 ¬∃y1 . . . ymi Pi (y1 , . . . , ymi ) is mapping-invariant for all distinct y1 , . . . , ymi for all i = 1, . . . , r. Theorem 6.3. Let F be an (m1 , . . . , mr , n)-ary operator. Then F is mappinginvariant if and only if F = Gϕ for some mapping-invariant formula ϕ = ϕ(x1 , . . . , xn ) in a language whose only extralogical symbols are the relational symbols P1 , . . . , Pr (where Pi is mi -ary and corresponds to Ri ). 17 Proof. As in the proof of Theorem 5.7, we find a mapping-invariant formula ψL (x1 , . . . , xn ) for each L ⊆ {1, . . . , r} in such a way that FM (R1 , . . . , Rr ) = L m1 , . . . , Rr ⊆ M mr and Ri 6= ∅ for i ∈ L. The trick Gψ M (R1 , . . . , Rr ) if R1 ⊆ M for coding all the ψL in only one mapping-invariant formula ϕ(x1 , . . . , xn ) is the same as the one presented in the proof of Theorem 5.7. We take for ϕ the disjunction of all formulas of the form ^ ^ ψL (x1 , . . . , xn ) ∧ ∃yPi (y) ∧ ¬∃yPi (y) i6∈L i∈L 7 The functional type setting Let us consider now a modification of the finite type hierarchy. We start with the basic type 0 of individuals and the boolean type b. The finite functional types are obtained from the types 0 and b by the rule that whenever τ1 , . . . , τn , µ are finite functional types, then also (τ1 , . . . , τn → µ) is a finite functional type. We will call them more briefly functional types. Given a nonempty set M , we associate to each functional type τ a corresponding τ -universe Mτ over M . The starting universes are the universe of individuals M0 = M and the boolean universe Mb = {0, 1}. The remainder universes are obtained recursively according to the rule that if τ = (τ1 , . . . , τn → µ) then Mτ is the set of all mappings h : Mτ1 × . . . × Mτn → Mµ . Relational types, universes, objects, and operators can be represented in the functional setting by replacing hereditarily every relation by its characteristic function. For every relational type τ there is a corresponding functional type τ ∗ defined recursively as follows: 0∗ = 0 and (τ1 , . . . , τn )∗ = (τ1∗ , . . . , τn∗ → b). Every element a of Mτ corresponds also to an element a∗ of Mτ ∗ . This correspondence is in fact a bijection between Mτ and Mτ ∗ . For a ∈ M0 we just take a∗ = a. If τ = (τ1 , . . . , τn ) and a ∈ Mτ , we define a∗ as the mapping from Mτ1∗ × . . . × Mτn∗ into {0, 1} such that a∗ (a∗1 , . . . , a∗n ) = χa (a1 , . . . , an ) for all a1 ∈ Mτ1 , . . . , an ∈ Mτn , where χa is the characteristic function of a as a subset of Mτ1 × . . . × Mτn . Let τ be a functional type and let f : M → N be a surjective mapping. In general it is not possible to extend f naturally to a corresponding total mapping fτ : Mτ → Nτ for any functional type τ as I did for relational types, but nevertheless we can extend it as a partial surjective mapping. If τ = 0 we put fτ = f and if τ = b we take for fτ the identity on Mb = {0, 1}. If τ = (τ1 , . . . , τn → µ), and every fτi is surjective, we can define fτ as follows. 18 1. Its domain is the subset of Mτ which consists of all mappings a : Mτ1 × . . . × Mτn → Mµ such that (a) a(a1 , . . . , an ) ∈ dom (fµ ) for all a1 ∈ dom (fτ1 ), . . . , an ∈ dom (fτn ) (b) for all a1 ∈ dom (fτ1 ), . . . , an ∈ dom (fτn ), b1 ∈ dom (fτ1 ), . . . , bn ∈ dom (fτn ), if fτi (ai ) = fτi (bi ) for all i = 1, . . . , n, then fµ (a(a1 , . . . , an )) = fµ (a(b1 , . . . , bn )). 2. For any such a we define fτ (a) as the mapping from Nτ1 × . . . , ×Nτn into Nµ such that for all a1 ∈ dom (fτ1 ), . . . , an ∈ dom (fτn ), fτ (a)(fτ1 (a1 ), . . . , fτn (an )) = fµ (a(a1 , . . . , an )). We check now that fτ is surjective. Let b ∈ Nτ . For any c ∈ Nµ choose a corresponding c0 ∈ dom (fµ ) such that fµ (c0 ) = c. Define a ∈ Mτ as the mapping a : Mτ1 × . . . × Mτn → Mµ such that for all a1 ∈ Mτ1 , . . . , an ∈ Mτn , a(a1 , . . . , an ) = (b(fτ1 (a1 ), . . . , fτn (an )))0 . Observe that a ∈ dom fτ . Now, given b1 ∈ Nτ1 , . . . , bn ∈ Nτn take a1 ∈ Mτ1 , . . . , an ∈ Mτn with fτi (ai ) = bi for all i = 1, . . . , n, and note that b(b1 , . . . , bn ) = fµ (a(a1 , . . . , an )). Therefore b = fτ (a). We can also think of extending a similarity relation π between M and N to every τ -universe for any functional type τ . The extension πτ will be a similarity relation between its domain dom (πτ ) ⊆ Mτ and its range rng (πτ ) ⊆ Nτ , so not necessarily a similarity between Mτ and Nτ . We start by taking π0 = π and by taking as πb the identity in the boolean universe Mb = Nb = {0, 1}. Let τ = (τ1 , . . . , τn → µ). We define πτ ⊆ Mτ × Nτ by stipulating that for a ∈ Mτ an b ∈ Nτ , (∗) πτ (a, b) if and only if for all a1 ∈ Mτ1 , . . . , an ∈ Mτn , b1 ∈ Nτ1 , . . . , bn ∈ Nτn if πτi (ai , bi ) for i = 1, . . . , n, then πµ (a(a1 , . . . , an ), b(b1 , . . . , bn )). This last proposal of extensions seems very natural when working with similarities. For any given similarity π between M and N there is a unique system (πτ )τ a functional type of relations πτ ⊆ Mτ × Nτ which satisfy the defining condition (∗) stated above. But if π is a surjective mapping, we obtain the same extension πτ if we consider it only as a similarity relation. Moreover, if π is a relational composition of mappings and inverses of mappings, then the extensions πτ are nothing more than the corresponding composition of extensions of mappings. This has not been noticed by Feferman in [3] and therefore he distinguises between what he calls homomorphism-invariance and similarity-invariance. Before proving the equivalence we need to show an easy result according to with every chain of surjective mappings and inverses of them can be shortened to length two. 19 Lemma 7.1. If π is a similarity relation between M and N , there is a set K an surjective mappings f : K → M and g : K → N such that π = f −1 ◦ g. Proof. Let K be a set of indexes big enough to enumerate π as a set of pairs in the form π = {(ai , bi ) : i ∈ K} and define f : K → M by f (i) = ai and g : K → N by g(i) = bi . Proposition 7.2. Let τ be a functional type. 1. If f : M → N is a surjective mapping, then it is a similarity relation between M and N and its extension fτ as a mapping coincides with its extension as a similarity relation. 2. Assume that M1 , . . . , Mn are sets and f1 , . . . , fn−1 are mappings such that M = M1 , N = Mn and for every i = 1, . . . , n, fi is from Mi onto Mi+1 or it is from Mi+1 onto Mi , and π is the relational product R1 ◦ . . . ◦ Rn−1 where Ri = fi if fi : Mi → Mi+1 and Ri = fi−1 if fi : Mi+1 → Mi . Then πτ is the relational product R1τ ◦ . . . ◦ Rn−1 τ , where Riτ is the partial mapping fi τ if fi : Mi → Mi+1 and it is its inverse if fi : Mi+1 → Mi . Proof. 1. It is easy to check by induction on the complexity of τ that the system of partial mappings (fτ )τ a functional type satisfies condition (∗). 2. This is proven by induction on n, but using Lemma 7.1 we see that it will be enough to prove the result in three particular cases: Case 1. f : K → M , g : K → N are surjective and π = f −1 ◦ g. We need only to check that the system (fτ−1 ◦ gτ )τ a functional type satisfies condition (∗) and this is done by induction on the complexity of τ . Consider the case τ = (τ1 , . . . , τn → µ). Let a ∈ Mτ and b ∈ Nτ . Let R = fτ−1 ◦ gτ . If R(a, b) then for some c ∈ dom (fτ ) ∩ dom (gτ ) we have fτ (c) = a and gτ (c) = b. Let Si = fτ−1 ◦ gτi and assume Si (ai , bi ) for all i = 1, . . . , n. Then for each i there is i some ci ∈ dom (fτi ) ∩ dom (gτi ) such that fτi (ci ) = ai and gτi (ci ) = bi . Clearly c(c1 , . . . , cn ) ∈ dom (fµ ) ∩ dom (gµ ) and fµ (c(c1 , . . . , cn )) = a(a1 , . . . , an ), and gµ (c(c1 , . . . , cn )) = b(b1 , . . . , bn ). Hence T (a(a1 , . . . , an ), b(b1 , . . . , bn )) for T = fµ−1 ◦ gµ . Now we assume the righthand side of (∗) holds for a, b and we show that R(a, b). We have to find some c ∈ dom (fτ ) ∩ dom (gτ ) with fτ (c) = a and gτ (c) = b. Let ci ∈ Kτi for each i = 1, . . . , n. If some ci is not in dom (fτi ) ∪ dom (gτi ), we choose for c(c1 , . . . , cn ) some arbitrary element of Kµ . If all ci ∈ dom (fτi ) but some ci 6∈ dom (gτi ) we observe that a(fτ1 (c1 ), . . . , fτn (cn )) ∈ Mµ and since fµ is surjective we can define c(c1 , . . . , cn ) as an element d ∈ dom (fµ ) such that fµ (d) = a(fτ1 (c1 ), . . . , fτn (cn )). The procedure is similar if all ci ∈ dom (gτi ) but some ci 6∈ dom (fτi ). Assume now ci ∈ dom (fτi )∩dom (gτi ) for all i = 1, . . . , n. Let Si = fτ−1 ◦ gτi . Then Si (fτi (ci ), gτi (ci )) for all i and we can use i the righthand side of (∗) to make sure that there is some d ∈ dom (fµ )∩dom (gµ ) such that fµ (d) = a(fτ1 (c1 ), . . . , fτn (cn )) and gµ (d) = b(gτ1 (c1 ), . . . , gτn (cn )). We take as c(c1 , . . . , cn ) such a d in this last case. It follows that c ∈ dom (fτ ) ∩ dom (gτ ), fτ (c) = a and gτ (c) = b. Hence R(a, b). 20 Case 2. f : M → N , g : K → N , h : K → P are surjective and π = f ◦g −1 ◦h. Similar to the previous case. Given a ∈ Mτ and b ∈ Pτ which satisfy the righthand side of (∗) we check that a ∈ dom (fτ ) and we define in a similar way as before some c ∈ dom (gτ ) ∩ dom (hτ ) such that fτ (a) = gτ (c) and hτ (c) = b. Case 3. f : N → M , g : K → N , h : K → P are surjective and π = f −1 ◦ g −1 ◦ h. Also similar to the other cases. Now the main point is to start with a ∈ Mτ and b ∈ Pτ for which the righthand side of (∗) holds and to find c ∈ dom (gτ ) ∩ dom (hτ ) such that gτ (c) ∈ dom (fτ ), fτ (gτ (c)) = a and hτ (c) = b In the functional type setting, every operator can be identified with an object. Therefore it would be enough to consider only objects for all invariance purposes. Nevertheless, it is convenient to have both notions (object and operator) to compare them with objects and operators of the relational type setting. The definitions of invariance are similar to the ones given in the relational case, but types are functional now. Let us say that an object a of functional type τ is mapping-invariant if for all M, N and every surjective mapping f : M → N , aM ∈ dom (fτ ) and fτ (aM ) = aN . Feferman in [3] calls this notion homomorphism-invariance although the sets M, N are pure sets, without any added structure. He calls an object a similarity-invariant if for all M, N and every similarity relation π between M and N , πτ (aM , aN ). This notion of invariance can be easily defined also for (τ1 , . . . , τn ; µ)-ary operators as I did in the relational case. The details are left to the reader. Corollary 7.3. In the functional type hierarchy an object or an operator is mapping-invariant (i.e., homomorphism-invariant in the sense of Feferman) if and only if it is similarity-invariant. Proof. By Proposition 7.2. 8 Feferman’s analysis of mapping-invariance We have seen that every relational type τ can be represented as a functional type τ ∗ and every a ∈ Mτ has a corresponding element a∗ ∈ Mτ ∗ . In a similar way, to any operator F in the relational type setting corresponds an operator F ∗ in the functional type setting. It is defined by ∗ FM (a∗1 , . . . , a∗n ) = FM (a1 , . . . , an )∗ . Now I raise the question of what kind of invariance on relational types τ is the correct translation of mapping-invariance on the corresponding functional types τ ∗ . We will see that it is closely related to preimage-invariance and that it is just preimage-invariance at the first levels of the type hierarchy. Before discussing preimage-invariance I introduce a new version of invariance, which is exactly the relational counterpart of mapping-invariance on functional types. Let f : M → N be a surjective mapping. I present another way of extending f to every universe Mτ for any relational type τ . I will call it the Feferman 21 extension of f to τ and I will use the notation fτF e for it. We start, as usual, putting f0F e = f and define inductively fτF e for τ = (τ1 , . . . , τn ). In general it is going to be a partial surjective mapping. 1. Its domain consists of all a ⊆ Mτ1 × . . . × Mτn such that for all a1 ∈ dom (fτF1e ), . . . , an ∈ dom (fτFne ), b1 ∈ dom (fτF1e ), . . . , bn ∈ dom (fτFne ), if fτFi e (ai ) = fτFi e (bi ) for all i = 1, . . . , n, then (a1 , . . . , an ) ∈ a if and only if (b1 , . . . , bn ) ∈ a. 2. For any such a, we define fτF e (a) as {(fτF1e (a1 ), . . . , fτFne (an )) : (a1 , . . . , an ) ∈ a∩(dom (fτF1e )×. . .×dom (fτFne ))} Lemma 8.1. Let f : M → N be surjective, let τ be a relational type and assume a ∈ Mτ . Then 1. a ∈ dom (fτF e ) iff a∗ ∈ dom (fτ ∗ ) 2. If a ∈ dom (fτF e ), then fτF e (a)∗ = fτ ∗ (a∗ ). Proof. It is an easy induction on the complexity of τ . The notion of invariance for objects and operators in the context of these extensions fτF e is defined as in the other cases. We will call it Feferman invariance. For instance, an object a of type τ is Feferman invariant if and only if for any surjective mapping f : M → N , aM ∈ dom (fτF e ) and fτF e (aM ) = aN . Invariance for operators is defined in the obvious way. Proposition 8.2. An operator F is Feferman invariant if and only if its corresponding operator F ∗ in the functional type setting is mapping-invariant, i.e., homomorphism-invariant in the sense of Feferman. Proof. Let F be a (τ1 , . . . , τn ; µ)-ary operator. Assume F is Feferman invariant. Let f : M → N be surjective and let a1 ∈ Mτ1 , . . . , an ∈ Mτn be such that a∗i ∈ dom (fτi∗ ) for each i = 1, . . . , n. By Lemma 8.1, ai ∈ dom (fτFi e ) and fτFi e (ai )∗ = fτi∗ (a∗i ) for every i = 1, . . . , n. By Feferman invariance of F , FM (a1 , . . . , an ) ∈ dom (fµF e ) and fµF e (FM (a1 , . . . , an )) = FN (fτF1e (a1 ), . . . , fτFne (an )). ∗ (a∗1 , . . . , a∗n ) = FM (a1 , . . . , an )∗ ∈ dom (fµ∗ ) and By Lemma 8.1 FM ∗ fµ∗ (FM (a∗1 , . . . , a∗n )) = fµF e (FM (a1 , . . . , an ))∗ . Hence ∗ fµ∗ (FM (a∗1 , . . . , a∗n )) = FN (fτF1e (a1 ), . . . , fτFne (an ))∗ = FN∗ (fτ1∗ (a∗1 ), . . . , fτn∗ (a∗n )). The other direction has a similar proof. 22 Let f : M → N be a surjective mapping. I propose a last way of extending f to every universe Mτ for any relational type τ . We well call it the preimage extension of f to τ and we will use the notation fτp for it. The extension fτp will not be in general a total mapping from Mτ onto Nτ . It is a mapping but its domain is only a subset of Mτ . As usual, we start with f0p = f . Now let τ = (τ1 , . . . , τn ). 1. The domain of fτp consists of all relations a ⊆ Mτ1 , × . . . × Mτn for which there is some b ⊆ Nτ1 × . . . × Nτn such that a = {(a1 , . . . , an ) ∈ dom (fτp1 )×. . .×dom (fτpn ) : (fτp1 (a1 ), . . . , fτpn (an )) ∈ b}. 2. For each such a ∈ dom (fτp ) we set fτp (a) = {(fτp1 (a1 ), . . . , fτpn (an )) : (a1 , . . . , an ) ∈ a}. Let F be an operator of relational type (τ1 , . . . , τn ; τ ). We call F preimageinvariant if for any sets M, N , for any surjective mapping f : M → N , if a1 ∈ dom (fτp1 ), . . . , an ∈ dom (fτpn ), then FM (a1 , . . . , an ) ∈ dom (fτp ) and fτp (FM (a1 , . . . , an )) = FN (fτp1 (a1 ), . . . , fτpn (an )). An object a of relational type τ is preimage-invariant if for any sets M, N , for any surjective mapping f : M → N , aM ∈ dom (fτp ) and fτp (aM ) = aN . We will concentrate the following discussion on operators, but the results can be easily extended also to objects. Lemma 8.3. If τ = 0n and f : M → N is surjective, then fτF e = fτp . Proof. We show that dom (fτF e ) = dom (fτp ) and fτF e (a) = fτp (a) for every a ∈ dom (fτF e ). Let a ∈ dom (fτF e ) and let b = {(f (a1 ), . . . , f (an )) : (a1 , . . . , an ) ∈ a}. It follows that a = {(a1 , . . . , an ) : (f (a1 ), . . . , f (an )) ∈ b} and therefore a ∈ dom (fτp ). Clearly in this case fτF e (a) = b = fτp (a). On the other hand, if a ∈ dom (fτp ) there is some b ⊆ N n such that a = {(a1 , . . . , an ) : (f (a1 ), . . . , f (an ) ∈ b}. Whenever a1 , . . . , an , b1 , . . . , bn ∈ M , (a1 , . . . , an ) ∈ a, and f (ai ) = f (bi ) for every i = 1, . . . , n, we have that (f (b1 ), . . . , f (bn )) = (f (a1 ), . . . , f (an )) ∈ b and therefore (b1 , . . . , bn ) ∈ a. This means that a ∈ dom (fτp ). Theorem 8.4. An (m1 , . . . , mr , n)-ary operator F is preimage-invariant if and only if its corresponding operator in the functional type setting F ∗ is mappinginvariant, i.e., homomorphism-invariant in the sense of Feferman. Proof. By Proposition 8.2 is enough to prove that F is preimage-invariant if and only if it is Feferman invariant. Let τi = 0mi for every i = 1, . . . , r. Assume F is preimage-invariant and let f : M → N be surjective. Let R1 ⊆ M m1 , . . . , Rr ⊆ M mr and assume Ri ∈ dom (fτFi e ) for each i = 1, . . . , r. By Lemma 8.3 Ri ∈ dom (fτpi ) and fτpi (Ri ) = fτFi e (Ri ) for every i = 1, . . . , r. By preimage-invariance, FM (R1 , . . . , Rr ) ∈ dom (f0pn ) and f0pn (FM (R1 , . . . , Rr )) = FN (fτp1 (R1 ), . . . , fτpr (Rr )). 23 By Lemma 8.3 again, FM (R1 , . . . , Rr ) ∈ dom (f0Fne ) and f0Fne (FM (R1 , . . . , Rr )) = f0pn (FM (R1 , . . . , Rr )). Therefore f0Fne (FM (R1 , . . . , Rr )) = FN (fτF1e (R1 ), . . . , fτFre (Rr )) and F is Feferman invariant. The other direction is proven in a similar way. ACKNOWLEDGMENT Work partially supported by grant BFM2002-01034 of Spanish MCYT and grant 2002SGR 00126 of Catalan DURSI. I thank Xavier Caicedo for correcting some mistaken statements concerning Linsdtröm quantifiers made in a previous version. I also thank Genoveva Martı́ for some comments improving the exposition. References [1] M. Byrd. Russell, logicism and the choice of logical constants. Notre Dame Journal of Formal Logic, 30(3):343–361, 1989. [2] E. Casanovas, P. Dellunde, and R. Jansana. On elementary equivalence for equality–free logic. Notre Dame Journal of Formal Logic, 37:506–522, 1996. [3] S. Feferman. Logic, logics and logicism. Notre Dame Journal of Formal Logic, 40(1):31–54, 1999. [4] M. Gómez-Torrente. The problem of logical constants. The Bulletin of Symbolic Logic, 8:1–37, 2002. [5] A. Lindenbaum and A. Tarski. Über die Beschränktheit des Ausdrucksmittel deduktiver Theorien. Ergebnisse eines mathematischen Kolloquiums, 7:15–22, 1934-35. English translation in [12] pages 384-392. [6] M. Machover. Review of [10]. British Journal for the Philosophy of Science, 45:1078–1083, 1994. [7] F. I. Mautner. An extension of Klein’s Erlanger program: logic as invarianttheory. American Journal of Mathematics, 68:345–384, 1946. [8] V. McGee. Logical operations. Journal of Philosophical Logic, 25:567–580, 1996. [9] A. Mostowski. On a generalization of quantifiers. Fundamenta Mathematicae, 44:12–36, 1957. [10] G. Sher. The bounds of logic. A generalized viewpoint. The MIT Press, Cambridge, Massachusetts, 1991. [11] G. Sher. A characterization of logical constants is possible. 18/2(47):189–198, 2003. 24 Theoria, [12] A. Tarski. Logic, Semantics, Metamathematics. Papers from 1923 to 1938. Hackett Publishing Company, Indianapolis, second edition, 1983. Edited and introduced by J. Corcoran. Translated by J.H. Woodger. [13] A. Tarski. What are logical notions ? History and Philosophy of Logic, 7:143–154, 1986. Edited by John Corcoran. [14] J. Van Benthem. Logical constants across varying types. Notre Dame Journal of Formal Logic, 30(3):315–342, 1989. [15] K. Warmbrōd. Logical constants. Mind, 108(431):503–538, 1999. 25

Logical operations and invariance

Related documents

Products

Support

Logical operations and invariance

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib