A FIRST COURSE IN MATHEMATICAL LOGIC AND SET THEORY A FIRST COURSE IN MATHEMATICAL LOGIC AND SET THEORY Michael L. O’Leary College of DuPage c Copyright 2016 by John Wiley & Sons, Inc. All rights reserved Published by John Wiley & Sons, Inc., Hoboken, New Jersey Published simultaneously in Canada No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, (978) 750-8400, fax (978) 750-4470, or on the web at www.copyright.com. Requests to the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/permission. Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives or written sales materials. The advice and strategies contained herein may not be suitable for your situation. You should consult with a professional where appropriate. Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages. For general information on our other products and services or for technical support, please contact our Customer Care Department within the United States at (800) 762-2974, outside the United States at (317) 572-3993 or fax (317) 5724002. Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic formats. For more information about Wiley products, visit our web site at www.wiley.com. Library of Congress Cataloging-in-Publication Data applied for. ISBN: 9780470905883 Printed in the United States of America 10 9 8 7 6 5 4 3 2 1 For my parents CONTENTS Preface xiii Acknowledgments List of Symbols 1 Propositional Logic 1.1 1.2 1.3 Symbolic Logic Propositions Propositional Forms Interpreting Propositional Forms Valuations and Truth Tables Inference Semantics Syntactics Replacement Semantics Syntactics xv xvii 1 1 2 5 7 10 19 21 23 31 31 34 vii viii CONTENTS 1.4 1.5 2 First-Order Logic 2.1 2.2 2.3 2.4 3 Proof Methods Deduction Theorem Direct Proof Indirect Proof The Three Properties Consistency Soundness Completeness Languages Predicates Alphabets Terms Formulas Substitution Terms Free Variables Formulas Syntactics Quantifier Negation Proofs with Universal Formulas Proofs with Existential Formulas Proof Methods Universal Proofs Existential Proofs Multiple Quantifiers Counterexamples Direct Proof Existence and Uniqueness Indirect Proof Biconditional Proof Proof of Disunctions Proof by Cases 40 40 44 47 51 51 55 58 63 63 63 67 70 71 75 75 76 78 85 85 87 90 96 97 99 100 102 103 104 105 107 111 112 Set Theory 117 3.1 117 118 Sets and Elements Rosters CONTENTS 3.2 3.3 3.4 4 Famous Sets Abstraction Set Operations Union and Intersection Set Difference Cartesian Products Order of Operations Sets within Sets Subsets Equality Families of Sets Power Set Union and Intersection Disjoint and Pairwise Disjoint ix 119 121 126 126 127 130 132 135 135 137 148 151 151 155 Relations and Functions 161 4.1 161 163 165 168 171 172 177 180 181 183 189 194 195 196 197 203 205 208 211 212 216 4.2 4.3 4.4 4.5 4.6 Relations Composition Inverses Equivalence Relations Equivalence Classes Partitions Partial Orders Bounds Comparable and Compatible Elements Well-Ordered Sets Functions Equality Composition Restrictions and Extensions Binary Operations Injections and Surjections Injections Surjections Bijections Order Isomorphims Images and Inverse Images x 5 CONTENTS Axiomatic Set Theory 225 5.1 225 226 227 228 229 230 234 237 239 242 243 249 250 253 256 257 260 264 268 268 271 274 275 278 280 5.2 5.3 5.4 5.5 5.6 6 Axioms Equality Axioms Existence and Uniqueness Axioms Construction Axioms Replacement Axioms Axiom of Choice Axiom of Regularity Natural Numbers Order Recursion Arithmetic Integers and Rational Numbers Integers Rational Numbers Actual Numbers Mathematical Induction Combinatorics Euclid’s Lemma Strong Induction Fibonacci Sequence Unique Factorization Real Numbers Dedekind Cuts Arithmetic Complex Numbers Ordinals and Cardinals 283 6.1 283 286 290 292 293 298 300 303 307 308 6.2 6.3 Ordinal Numbers Ordinals Classification Burali-Forti and Hartogs Transfinite Recursion Equinumerosity Order Diagonalization Cardinal Numbers Finite Sets CONTENTS 6.4 6.5 7 Countable Sets Alephs Arithmetic Ordinals Cardinals Large Cardinals Regular and Singular Cardinals Inaccessible Cardinals xi 310 313 316 316 322 327 328 331 Models 333 7.1 333 335 340 346 348 353 361 363 366 368 374 380 384 388 394 394 397 399 409 410 414 415 417 7.2 7.3 7.4 7.5 First-Order Semantics Satisfaction Groups Consequence Coincidence Rings Substructures Subgroups Subrings Ideals Homomorphisms Isomorphisms Elementary Equivalence Elementary Substructures The Three Properties Revisited Consistency Soundness Completeness Models of Different Cardinalities Peano Arithmetic Compactness Theorem Löwenheim–Skolem Theorems The von Neumann Hierarchy Appendix: Alphabets 427 References 429 Index 435 PREFACE This book is inspired by The Structure of Proof: With Logic and Set Theory published by Prentice Hall in 2002. My motivation for that text was to use symbolic logic as a means by which to learn how to write proofs. The purpose of this book is to present mathematical logic and set theory to prepare the reader for more advanced courses that deal with these subjects either directly or indirectly. It does this by starting with propositional logic and first-order logic with sections dedicated to the connection of logic to proof-writing. Building on this, set theory is developed using first-order formulas. Set operations, subsets, equality, and families of sets are covered followed by relations and functions. The axioms of set theory are introduced next, and then sets of numbers are constructed. Finite numbers, such as the natural numbers and the integers, are defined first. All of these numbers are actually sets constructed so that they resemble the numbers that are their namesakes. Then, the infinite ordinal and cardinal numbers appear. The last chapter of the book is an introduction to model theory, which includes applications to abstract algebra and the proofs of the completeness and compactness theorems. The text concludes with a note on Gödel’s incompleteness theorems. MICHAEL L. O’LEARY Glen Ellyn, Illinois July 2015 xiii ACKNOWLEDGMENTS Thanks are due to Susanne Steitz-Filler, Senior Editor at Wiley, for her support of this project. Thanks are also due to Sari Friedman and Katrina Maceda, both at Wiley, for their work in producing this book. Lastly, I wish to thank the anonymous reviewer whose comments proved beneficial. On a personal note, I would like to express my gratitude to my parents for their continued caring and support; to my brother and his wife, who will make sure my niece learns her math; to my dissertation advisor, Paul Eklof, who taught me both set theory and model theory; to Robert Meyer, who introduced me to symbol logic; to David Elfman, who taught me about logic through programming on an Apple II; and to my wife, Barb, whose love and patience supported me as I finished this book. xv SYMBOLS Symbol := ¬ ∧ ∨ → ↔ ๐ โจ ฬธโจ ⇒ โข ⇔ โข∗ โฌ ๐ข๐๐ ๐ต๐ ๐ฑ ∀ ∃ Page(s) 7 7, 11 7, 11 7, 11 7, 12 7, 13 10 21, 336, 346, 352 23, 338 24 26 34 40 52 52, 395 67 68, 72 68, 72 Symbol = ๐ ๐ฒ๐ณ ๐ญ๐ณ ๐ ๐ฑ ๐ ๐ฑ′ ๐ฆ๐ฑ ๐ฑ๐จ ๐ฎ๐ฅ ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ ) ๐ฒ ๐ซ(๐ฒ) ๐ก ๐ฅ ๐(๐ฅ) ๐ฬ โฃ |๐| Page(s) 68, 226 68 68 69 69 69 69 69 69 70 70 73 75 80 88 98 116, 307 xvii A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. xviii LIST OF SYMBOLS Symbol ∈ ∉ ∅ ๐ ๐ ๐ ๐ ๐ ∞ [๐, ๐] (๐, ๐) {๐ฅ : ๐(๐ฅ)} ๐+ ๐− ∪ ∩ โงต ๐ด × ๐2 ๐๐ ⊆ * ⊂ gcd P(๐ด) โ โฑ โ โฑ (๐ด, ๐ ) ๐ผ๐ด dom(๐ ) ran(๐ ) โฆ ๐ −1 ๐๐ ๐ ๐๐ ฬธ ๐ ≡ mod ๐ [๐]๐ [๐]๐ ๐ดโ๐ ๐๐ 4 ๐ธ∗ Page(s) 118 118 118 119 119 119 119 119 120 120 120 122 123 123 126 126 127 128 130 130 131 135, 361 135 136 140 151 152 152 161 162 162 162 163, 195 166, 203 168 168 170, 387 170 171 171 172 172 178 178 Symbol โ ๐ (๐) ๐ :๐ท→๐ต ๐ท→๐ต โฆ๐ฅโง ๐ด๐ต ๐๐ ∗ ≅ ๐[๐ถ] ๐−1 [๐ถ] ๐ข๐(๐ ) ๐ญ ๐ญ๐ ๐ญ๐๐ M ๐+ ๐ ๐! โค โ ๐(๐)๐ ๐ ๐ ๐น๐ ๐ ๐๐พ๐4 (๐ด, ๐) โ 0 1 โ ๐ ๐๐พ๐(๐ผ, ๐ฝ) <๐ผ๐ด ≈ ๐โค โ โชฏ ฬธโชฏ โบ ๐๐ต ℵ0 ℵ Page(s) 178 183 190 190 191 192 193 194 197, 352 198 212, 380 216 217 232 235 235 235 236 237, 290 238 242 250 254 262 263 269 271 275 276 278 278 281 281 286 294 298 298 298 300 300 300 304 313 313 LIST OF SYMBOLS Symbol ๐ข๐ง ๐ฆ๐ข๐ง i ๐ผ๐ฟ ๐ (๐ด, ๐) ๐ผ๐ฅ๐ ๐ฒ๐บ๐๐ฆ M๐ (โ) M∗๐ (โ) GL(๐, โ) ๐ฒ๐บ๐ ๐๐ (โ) โจ๐โฉ โ ๐พ∈๐ ๐๐พ ๐:๐→๐ Page(s) 313 314 316 328 333 333 336 340 345 345 345 352 355 363 372 375 Symbol ๐→๐ ker(๐) โชฏ ๐∼ ๐ณ๐(๐) ๐ ๐ฃ ๐ ๐ฃ๐ ๐′ ๐๐ผ ๐ฉ TC(๐ด) ๐๐ผ Page(s) 375 379 389 393 404 408 410 410 410 411 411 417 419 419 420 xix CHAPTER 1 PROPOSITIONAL LOGIC 1.1 SYMBOLIC LOGIC Let us define mathematics as the study of number and space. Although representations can be found in the physical world, the subject of mathematics is not physical. Instead, mathematical objects are abstract, such as equations in algebra or points and lines in geometry. They are found only as ideas in minds. These ideas sometimes lead to the discovery of other ideas that do not manifest themselves in the physical world as when studying various magnitudes of infinity, while others lead to the creation of tangible objects, such as bridges or computers. Let us define logic as the study of arguments. In other words, logic attempts to codify what counts as legitimate means by which to draw conclusions from given information. There are many variations of logic, but they all can be classified into one of two types. There is inductive logic in which if the argument is good, the conclusion will probably follow from the hypotheses. This is because inductive logic rests on evidence and observation, so there can never be complete certainty whether the conclusions reached do indeed describe the universe. An example of an inductive argument is: 1 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 2 Chapter 1 PROPOSITIONAL LOGIC A red sky in the morning means that a storm is coming. We see a red sky this morning. Therefore, there will be a storm today. Whether this is a trust-worthy argument or not rests on the strength of the predictive abilities of a red sky, and we know about that by past observations. Thus, the argument is inductive. The other type is deductive logic. Here the methods yield conclusions with complete certainty, provided, of course, that no errors in reasoning were made. An example of a deductive argument is: All geometers are mathematicians. Euclid is a geometer. Therefore, Euclid is a mathematician. Whether Euclid refers to the author of the Elements or is Mr. Euclid from down the street is irrelevant. The argument works because the third sentence must follow from the first two. As anyone who has solved an equation or written a proof can attest, deductive logic is the realm of the mathematician. This is not to say that there are not other aspects to the discovery of mathematical results, such as drawing conclusions from diagrams or patterns, using computational software, or simply making a lucky guess, but it is to say that to accept a mathematical statement requires the production of a deductive proof of that statement. For example, in elementary algebra, we know that given 2๐ฅ − 5 = 11, we can conclude 2๐ฅ = 6 and then ๐ฅ = 3. As each of the steps is legal, it is certain that the conclusion of ๐ฅ = 3 follows. In geometry, we can write a two-column proof that shows that ∠๐ต ≅ ∠๐ท is guaranteed to follow from ABCD is a parallelogram. The study of these types of arguments, those that are deductive and mathematical in content, is called mathematical logic. Propositions To study arguments, one must first study sentences because they are the main parts of arguments. However, not just any type of sentence will do. Consider all squares are rectangles. Section 1.1 SYMBOLIC LOGIC 3 The purpose of this sentence is to affirm that things called squares also belong to the category of things called rectangles. In this case, the assertion made by the sentence is correct. Also, consider, circles are not round. This sentence denies that things called circles have the property of being round. This denial is incorrect. If a sentence asserts or denies accurately, the sentence is true, but if it asserts or denies inaccurately, the sentence is false. These are the only truth values that a sentence can have, and if a sentence has one, it does not have the other. As arguments intend to draw true conclusions from presumably true given sentences, we limit the sentences that we study to only those with a truth value. This leads us to our first definition. DEFINITION 1.1.1 A sentence that is either true or false is called a proposition. Not all sentences are propositions, however. Questions, exclamations, commands, or self-contradictory sentences like the following examples can neither be asserted nor be denied. โ Is mathematics logic? โ Hey there! โ Do not panic. โ This sentence is false. Sometimes it is unclear whether a sentence identifies a proposition. This can be due to factors such as imprecision or poor sentence structure. Another example is the sentence it is a triangle. Is this true or false? It is impossible to know because, unlike the other words of the sentence, the meaning of the word it is not determined. In this sentence, the word it is acting like a variable as in ๐ฅ + 2 = 5. As the value of ๐ฅ is undetermined, the sentence ๐ฅ + 2 = 5 is neither true nor false. However, if ๐ฅ represents a particular value, we could make a determination. For example, if ๐ฅ = 3, the sentence is true, and if ๐ฅ = 10, the sentence is false. Likewise, if it refers to a particular object, then it is a triangle would identify a proposition. There are two types of propositions. An atom is a proposition that is not comprised of other propositions. Examples include the angle sum of a triangle equals two right angles and some quadratic equations have real solutions. 4 Chapter 1 PROPOSITIONAL LOGIC A proposition that is not an atom but is constructed using other propositions is called a compound proposition. There are five types. โ A negation of a given proposition is a proposition that denies the truth of the given proposition. For example, the negation of 3 + 8 = 5 is 3 + 8 ≠ 5. In this case, we say that 3 + 8 = 5 has been negated. Negating the proposition the sine function is periodic yields the sine function is not periodic. โ A conjunction is a proposition formed by combining two propositions (called conjuncts) with the word and. For example, the base angles of an isosceles triangle are congruent, and a square has no right angles is a conjunction with the base angles of an isosceles triangle are congruent and a square has no right angles as conjuncts. โ A disjunction is a proposition formed by combining two propositions (called disjuncts) with the word or. The sentence the base angles of an isosceles triangle are congruent, or a square has no right angles is a disjunction. โ An implication is a proposition that claims a given proposition (called the antecedent) entails another proposition (called the consequent). Implications are also known as conditional propositions. For example, if rectangles have four sides, then squares have for sides (1.1) is a conditional proposition. Its antecedent is rectangles have four sides, and its consequent is squares have four sides. This implication can also be written as rectangles have four sides implies that squares have four sides, squares have four sides if rectangles have four sides, rectangles have four sides only if squares have four sides, and if rectangles have four sides, squares have four sides. A conditional proposition can also be written using the words sufficient and necessary. The word sufficient means “adequate” or “enough,” and necessary means “needed” or “required.” Thus, the sentence Section 1.1 SYMBOLIC LOGIC 5 rectangles having four sides is sufficient for squares to have four sides translates (1.1). In other words, the fact that rectangles have four sides is enough for us to know that squares have four sides. Likewise, squares having four sides is necessary for rectangles to have four sides is another translation of the implication because it means that squares must have four sides because rectangle have four sides. Summing up, the antecedent is sufficient for the consequent, and the consequent is necessary for the antecedent. โ A biconditional proposition is the conjunction of two implications formed by exchanging their antecedents and consequents. For example, if rectangles have four sides, then squares have four sides, and if squares have four sides, then rectangles have four sides. To remove the redundancy in this sentence, notice that the first conditional can be written as rectangles have four sides only if squares have four sides and the second conditional can be written as rectangles have four sides if squares have four sides, resulting in the biconditional being written as rectangles have four sides if and only if squares have four sides or the equivalent rectangles having four sides is necessary and sufficient for squares to have four sides. Propositional Forms As a typical human language has many ways to express the same thought, it is beneficial to study propositions by translating them into a notation that has a very limited collection of symbols yet is still able to express the basic logic of the propositions. Once this is done, rules that determine the truth values of propositions using the new notation can be developed. Any such system designed to concisely study human reasoning is called a symbolic logic. Mathematical logic is an example of symbolic logic. Let ๐ be a finite sequence of characters from a given collection of symbols. Call the collection an alphabet. Call ๐ a string over the alphabet. The alphabet chosen so that ๐ can represent a mathematical proposition is called the proposition alphabet and consists of the following symbols. 6 Chapter 1 PROPOSITIONAL LOGIC โ Propositional variables: Uppercase English letters, ๐ , ๐, ๐ , … , or uppercase English letters with subscripts, ๐๐ , ๐๐ , ๐ ๐ , … , where ๐ = 0, 1, 2, … โ Connectives: ¬, ∧, ∨, →, ↔ โ Grouping symbols: (, ), [, ]. The sequences ๐ ∨ ๐ and ๐1 ๐1 ∧ ↔ ((( and the empty string, a string with no characters, are examples of strings over this alphabet, but only certain strings will be chosen for our study. A string is selected because it is able to represent a proposition. These strings will be determined by a method called a grammar. The grammar chosen for our present purposes is given in the next definition. It is given recursively. That is, the definition is first given for at least one special case, and then the definition is given for other cases in terms of itself. DEFINITION 1.1.2 A propositional form is a nonempty string over the proposition alphabet such that โ every propositional variable is a propositional form. โ ¬๐ is a propositional form if ๐ is a propositional form. โ (๐ ∧ ๐), (๐ ∨ ๐), (๐ → ๐), and (๐ ↔ ๐) are propositional forms if ๐ and ๐ are propositional forms. We follow the convention that parentheses can be replaced with brackets and outermost parenthesis or brackets can be omitted. As with propositions, a propositional form that consists only of a propositional variable is an atom. Otherwise, it is compound. The strings ๐ , ๐1 , ¬๐ , (๐1 ∨ ๐2 ) ∧ ๐3 , and (๐ → ๐) ∧ (๐ ↔ ¬๐ ) are examples of propositional forms. To prove that the last string is a propositional form, proceed using Definition 1.1.2 by noting that (๐ → ๐) ∧ (๐ ↔ ¬๐ ) is the result of combining ๐ → ๐ and ๐ ↔ ¬๐ with ∧. The propositional form ๐ → ๐ is from ๐ and ๐ combined with →, and ๐ ↔ ¬๐ is from ๐ and ¬๐ combined with ↔. These and ¬๐ are propositional forms because ๐ , ๐, and ๐ are propositional variables. This derivation yields the following parsing tree: (P → Q) ∧ (R ↔ ¬P) P→Q P R ↔ ¬P Q R ¬P P Section 1.1 SYMBOLIC LOGIC 7 The parsing tree yields the formation sequence of the propositional form: ๐ , ๐, ๐ , ¬๐ , ๐ → ๐, ๐ ↔ ¬๐ , (๐ → ๐) ∧ (๐ ↔ ¬๐ ). The sequence is formed by listing each distinct term of the tree starting at the bottom row and moving upwards. EXAMPLE 1.1.3 Make the following assignments: ๐ := ๐ ↔ (๐ ∧ ๐), ๐ := (๐ ↔ ๐ ) ∧ ๐. The symbol := indicates that an assignment has been made. It means that the propositional form on the right has been assigned to the lowercase letter on the left. Using these designations, we can write new propositional forms using ๐ and ๐. The propositional form ๐ ∧ ๐ is [๐ ↔ (๐ ∧ ๐)] ∧ [(๐ ↔ ๐ ) ∧ ๐] with the formation sequence, ๐ , ๐, ๐ , ๐ ∧ ๐, ๐ ↔ ๐ , ๐ ↔ (๐ ∧ ๐), (๐ ↔ ๐ ) ∧ ๐, [๐ ↔ (๐ ∧ ๐)] ∧ [(๐ ↔ ๐ ) ∧ ๐], and ¬๐ → ๐ is ¬[(๐ ↔ ๐ ) ∧ ๐] → [๐ ↔ (๐ ∧ ๐)] with the formation sequence ๐ , ๐, ๐ , ๐ ↔ ๐ , ๐ ∧ ๐, (๐ ↔ ๐ ) ∧ ๐, ๐ ↔ (๐ ∧ ๐), ¬[(๐ ↔ ๐ ) ∧ ๐], ¬[(๐ ↔ ๐ ) ∧ ๐] → [๐ ↔ (๐ ∧ ๐)]. Interpreting Propositional Forms Notice that determining whether a string is a propositional form is independent of the meaning that we give the symbols. However, as we do want these symbols to convey meaning, we assume that the propositional variables represent atoms and set this interpretation on the connectives: ¬ ∧ ∨ → ↔ not and or implies if and only if Because of this interpretation, name the compound propositional forms as follows: 8 Chapter 1 PROPOSITIONAL LOGIC ¬๐ ๐∧๐ ๐∨๐ ๐→๐ ๐↔๐ negation conjunction disjunction implication biconditional EXAMPLE 1.1.4 To see how this works, assign some propositions to some propositional variables: ๐ := The sine function is not one-to-one. ๐ := The square root function is one-to-one. ๐ := The absolute value function is not onto. The following symbols represent the indicated propositions: โ ¬๐ The absolute value function is onto. โ ¬๐ ∨ ¬๐ The sine function is one-to-one, or the square root function is not one-toone. โ ๐→๐ If the square root function is one-to-one, the absolute function is not onto. โ ๐ ↔๐ The absolute value function is not onto if and only if the sine function is not one-to-one. โ ๐ ∧๐ The sine function is not one-to-one, and the square root function is one-toone. โ ¬๐ ∧ ๐ The sine function is one-to-one, and the square root function is one-to-one. โ ¬(๐ ∧ ๐) It is not the case that the sine function is not one-to-one and the square root function is one-to-one. The proposition the absolute value function is not onto if and only if both the sine function is not one-to-one and the square root function is one-to-one is translated as ๐ ↔ (๐ ∧ ๐) and Section 1.1 SYMBOLIC LOGIC 9 the absolute value function is not onto if and only if the sine function is not one-to-one, and the square root function is one-to-one is translated as (๐ ↔ ๐ ) ∧ ๐. If the parenthesis are removed, the resulting string is ๐ ↔ ๐ ∧ ๐. It is simpler, but it is not clear how it should be interpreted. To eliminate its ambiguity, we introduce an order of connectives as in algebra. In this way, certain strings without parentheses can be read as propositional forms. DEFINITION 1.1.5 [Order of Connectives] To interpret a propositional form, read from left to right and use the following precedence: โ propositional forms within parentheses or brackets (innermost first), โ negations, โ conjunctions, โ disjunctions, โ conditionals, โ biconditionals. EXAMPLE 1.1.6 To write the propositional form ¬๐ ∨ ๐ ∧ ๐ with parentheses, we begin by interpreting ¬๐ . According to the order of operations, the conjunction is next, so we evaluate ๐∧๐ . This is followed by the disjunction, and we have the propositional form ¬๐ ∨ (๐ ∧ ๐ ). EXAMPLE 1.1.7 To interpret ๐ ∧ ๐ ∨ ๐ correctly, use the order of operations. We discover that it has the same meaning as (๐ ∧ ๐) ∨ ๐ , but how is this distinguished from ๐ ∧ (๐ ∨ ๐ ) in English? Parentheses are not appropriate because they are not used as grouping symbols in sentences. Instead, use either. . . or. Then, using the assignments from Example 1.1.4, (๐ ∧ ๐) ∨ ๐ can be translated as either the sine function is not one-to-one and the square root function is one-to-one, or the absolute value function is not onto. Notice that either. . . or works as a set of parentheses. We can use this to translate ๐ ∧ (๐ ∨ ๐ ): the sine function is not one-to-one, and either the square root function is one-to-one or the absolute value function is not onto. 10 Chapter 1 PROPOSITIONAL LOGIC Be careful to note that the either-or phrasing is logically inclusive. For instance, some colleges require their students to take either logic or mathematics. This choice is meant to be exclusive in the sense that only one is needed for graduation. However, it is not logically exclusive. A student can take logic to satisfy the requirement yet still take a math class. EXAMPLE 1.1.8 Let us interpret ¬(๐ ∧ ๐). We can try translating this as not ๐ and ๐, but this represents ¬๐ ∧๐ according to the order of operations. To handle a propositional form such as ¬(๐ ∧ ๐), use a phrase like it is not the case or it is false and the word both. Therefore, ¬(๐ ∧ ๐) becomes it is not the case that both ๐ and ๐ or it is false that both ๐ and ๐. For instance, make the assignments. ๐ := quadratic equations have at most two real solutions, ๐ := the discriminant can be negative. Then, quadratic equations do not have at most two real solutions, and the discriminant can be negative is a translation of ¬๐ ∧ ๐. On the other hand, ¬(๐ ∧ ๐) can be it is not the case that both quadratic equations have at most two real solutions and the discriminant can be negative. To interpret ¬๐ ∧ ¬๐, use neither-nor: neither do quadratic equations have at most two real solutions, nor can the discriminant be negative. Valuations and Truth Tables Propositions have truth values, but propositional forms do not. This is because every propositional form represents any one of infinitely many propositions. However, once a propositional form is identified with a proposition, there should be a process by which the truth value of the proposition is associated with the propositional form. This is done with a rule ๐ called a valuation. The input of ๐ is a propositional form, and its output is T or F. Suppose that ๐ is a propositional variable. If ๐ has been assigned a proposition, { T if ๐ is true, ๐(๐ ) = F if ๐ is false. Section 1.1 SYMBOLIC LOGIC 11 For example, if ๐ := 2 + 3 = 5, then ๐(๐ ) = T, and if ๐ := 2 + 3 = 7, then ๐(๐ ) = F. If ๐ has not been assigned a proposition, then ๐(๐ ) can be defined arbitrarily as either T or F. The valuation of a compound propositional form is defined using truth tables. Let ๐ and ๐ be given propositional forms. Along the top row write ๐ and, if needed, ๐. Draw a vertical line. To its right identify the desired propositional form consisting of ๐, possibly ๐, and a single connective. In the body of the table, on the left of the vertical line are all combinations of T and F for ๐ and possibly ๐. On the right are the results of applying the connective. Each connective will have its own truth table, and we want to define these tables so that they match our understanding of the meaning of each connective. Since the truth value of the negation of a given proposition is the opposite of that proposition’s truth value, ๐ ¬๐ T F F T This means that ๐(¬๐) = F if ๐(๐) = T and ๐(¬๐) = T if ๐(๐) = F. The conjunction, 3 + 6 = 9, and all even integers are divisible by two, is true, but all integers are rational, and 4 is odd is false because the second conjunct is false. The disjunction 3 + 7 = 9, or all even integers are divisible by three, is false since both disjuncts are false. On the other hand, 3 + 7 = 9, or circles are round is true. This illustrates that โ a conjunction is true when both of its conjuncts are true, and false otherwise, and โ a disjunction is true when at least one disjunct is true, and false otherwise. We use these principles to define the truth tables for ๐ ∧ ๐ and ๐ ∨ ๐: ๐ T T F F ๐ ๐∧๐ T T F F T F F F ๐ ๐ ๐∨๐ T T T T F T F T T F F F We must remember that only one disjunct needs to be true for the entire disjunction to be true. For this reason, the logical disjunction is sometimes called an inclusive or. The propositional form for the exclusive or is (๐ ∨ ๐) ∧ ¬(๐ ∧ ๐). 12 Chapter 1 PROPOSITIONAL LOGIC There are many ways to understand an implication. Sometimes it represents causation as in if I score at least 70 on the exam, I will earn a passing grade. Other times it indicates what would have been the case if some past event had gone differently as in if I had not slept late, I would not have missed the meeting. Study of such conditional propositions is a very involved subject, one that need not concern us here because in mathematics a simpler understanding of the implication is enough. Suppose that ๐ and ๐ are assigned propositions so that ๐ → ๐ is a true implication. In mathematics, this means that it is not the case that ๐ is true but ๐ is false. This understanding of the conditional is known as material implication. For example, if rectangles have four sides, then squares have four sides, if rectangles have three sides, then squares have four sides, and if rectangles have three sides, then squares have three sides are all true, but if rectangles have four sides, then squares have three sides is false. Generalizing, in mathematics, ๐ → ๐ means ¬(๐∧¬๐), which has the following truth table: ๐ T T F F ๐ ¬๐ T F F T T F F T ๐ ∧ ¬๐ F T F F ¬(๐ ∧ ¬๐) T F T T The truth table for ๐ → ๐ is then defined as follows: ๐ ๐ ๐→๐ T T T T F F F T T F F T The truth table for ๐ ↔ ๐ is simpler because we understand ๐ ↔ ๐ to mean (๐ → ๐) ∧ (๐ → ๐). The truth table for this propositional form requires five columns: Section 1.1 SYMBOLIC LOGIC ๐ ๐ ๐→๐ T T T T F F F T T F F T 13 ๐ → ๐ (๐ → ๐) ∧ (๐ → ๐) T T T F F F T T Therefore, define the truth table of ๐ ↔ ๐ as: ๐ ๐ ๐↔๐ T T T T F F F T F F F T This understanding of the biconditional is known as material equivalence. Using these truth tables, the valuation of an arbitrary propositional form can be defined. DEFINITION 1.1.9 Let ๐ and ๐ be propositional forms. { T if ๐(๐) = F, โ ๐(¬๐) = F if ๐(๐) = T. { T if ๐(๐) = T and ๐(๐) = T, โ ๐(๐ ∧ ๐) = F otherwise. { F if ๐(๐) = F and ๐(๐) = F, โ ๐(๐ ∨ ๐) = T otherwise. { F if ๐(๐) = T and ๐(๐) = F, โ ๐(๐ → ๐) = T otherwise. { T if ๐(๐) = ๐(๐), โ ๐(๐ ↔ ๐) = F otherwise. EXAMPLE 1.1.10 Consider the propositional form (๐ ↔ ๐) ∨ (๐ → ๐ ) where ๐(๐ ) = F, ๐(๐) = T, and ๐(๐ ) = F. Then, ๐(๐ ↔ ๐) = F because ๐(๐ ) ≠ ๐(๐), and ๐(๐ → ๐ ) = T because ๐(๐ ) = F. Therefore, because ๐(๐ → ๐ ) = T, ๐([๐ ↔ ๐] ∨ [๐ → ๐ ]) = T, 14 Chapter 1 PROPOSITIONAL LOGIC We now generalize the definition of a truth table to create truth tables for more complicated propositional forms and then use the tables to find the valuation of a propositional form given the valuations of its proposition variables. EXAMPLE 1.1.11 To write the truth table of ๐ → ๐∧¬๐ , identify the column headings by drawing the parsing tree for this form: P → Q ∧ ¬P P Q ∧ ¬P Q ¬P P Reading from the bottom, we see that a formation sequence for the propositional form is ๐ , ๐, ¬๐ , ๐ ∧ ¬๐ , ๐ → ๐ ∧ ¬๐ . Hence, the truth table for this form is ๐ T T F F ๐ ¬๐ T F F F T T F T ๐ ∧ ¬๐ F F T F ๐ → ๐ ∧ ¬๐ F F T T So, if ๐(๐ ) = T and ๐(๐) = F, ๐(๐ → ๐ ∧ ¬๐ ) = F. That is, any proposition represented by ๐ → ๐∧¬๐ is false when the proposition assigned to ๐ is true and the proposition assigned to ๐ is false. The propositional form in the next example has three propositional variables. To make clear the truth value pattern that is to the left of the vertical line, note that if there are ๐ variables, the number of rows is twice the number of rows for ๐ − 1 variables. To see this, start with one propositional variable. Such a truth table has only two rows. Add a variable, we obtain four rows. The pattern is obtained by writing the one variable case twice. For the first time, it has a T written in front of each row. The second copy has an F in front of each row. To obtain the pattern for three variables, copy the twovariable pattern twice as in Figure 1.1. To generalize, if there are ๐ variables, there will be 2๐ rows. Section 1.1 SYMBOLIC LOGIC Two rows ๐ T F { −→ Four rows โง โช โจ โช โฉ ๐ T T F F Figure 1.1 ๐ T F T F −→ Eight rows โง โช โช โช โช โจ โช โช โช โช โฉ ๐ T T T T F F F F ๐ T T F F T T F F ๐ T F T F T F T F Valuation patterns. EXAMPLE 1.1.12 Use a truth table to find the truth value of if the derivative of the sine function is the cosine function and the second derivative of the sine function is the sine function, then the third derivative of the sine function is the cosine function. Define: ๐ := the derivative of the sine function is the cosine function, ๐ := the second derivative of the sine function is the sine function, ๐ := the third derivative of the sine function is the cosine function. So ๐ represents a true proposition, but ๐ and ๐ represent false propositions. The proposition is represented by ๐ ∧๐→๐ with truth table: ๐ T T T T F F F F ๐ T T F F T T F F ๐ ๐ ∧๐ T T F T T F F F T F F F T F F F ๐ ∧๐→๐ T F T T T T T T Notice that we could have determined the truth value by simply writing one line from the truth table: ๐ T ๐ F ๐ ๐ ∧๐ F F ๐ ∧๐→๐ T We see that ๐(๐ ∧ ๐ → ๐ ) = T when ๐(๐ ) = T, ๐(๐) = F, and ๐(๐ ) = F. Therefore, the proposition is true. 15 16 Chapter 1 PROPOSITIONAL LOGIC EXAMPLE 1.1.13 Both ๐ ∨ ¬๐ and ๐ → ๐ share an important property. Their columns in their truth tables are all T. For example, the truth table of ๐ ∨ ¬๐ is: ๐ T F ¬๐ F T ๐ ∨ ¬๐ T T Therefore, ๐(๐ ∨ ¬๐ ) always equals T, no matter the choice of ๐. However, the columns for ๐ ∧ ¬๐ and ๐ ↔ ¬๐ are all F. To check the first one, examine its truth table: ๐ T F ¬๐ F T ๐ ∧ ¬๐ F F This means that ๐(๐ ∧ ¬๐ ) is always F for every valuation ๐. Based on the last example, we make the next definition. DEFINITION 1.1.14 A propositional form ๐ is a tautology if ๐(๐) always equals T for every valuation ๐, and ๐ is a contradiction if ๐(๐) always equals F for every ๐. A propositional form that is neither a tautology nor a contradiction is called a contingency. Exercises 1. Identify each sentence as either a proposition or not a proposition. Explain. (a) Trisect the angle. (b) Some exponential functions are increasing. (c) All exponential functions are increasing. (d) 3 + 8 = 18 (e) 3 + ๐ฅ = 18 (f) Yea, logic! (g) A triangle is a three-sided polygon. (h) The function is differentiable. (i) This proposition is true. (j) This proposition is not true. 2. Identify the antecedent and the consequent for the given implications. (a) If the triangle has two congruent sides, it is isosceles. (b) The polynomial has at most two roots if it is a quadratic. (c) The data is widely spread only if the standard deviation is large. (d) The function being constant implies that its derivative is zero. (e) The system of equations is consistent is necessary for it to have a solution. Section 1.1 SYMBOLIC LOGIC 17 (f) A function is even is sufficient for its square to be even. 3. Give the truth value of each proposition. (a) A system of equations always has a solution, or a quadratic equation always has a real solution. (b) It is false that every polynomial function in one variable is differentiable. (c) Vertical lines have no slope, and lines through the origin have a positive ๐ฆ-intercept. (d) Every integer is even, or every even natural number is an integer. (e) If every parabola intersects the ๐ฅ-axis, then an ellipse has only one vertex. (f) The sine function is periodic if and only if every exponential function is always nonnegative. (g) It is not the case that 2 + 4 ≠ 6. (h) The distance between two points is always positive if every line segment is horizontal. (i) The derivative of a constant function is zero is necessary for the product rule to be true. (j) The derivative of the sine function being cosine is sufficient for the derivative of the cosine function being sine. (k) Any real number is negative or positive, but not both. 4. For each sentence, fill in the blank using as many of the words and, or, if, and if and only if as possible to make the proposition true. (a) Triangles have three sides 3 + 5 = 6. (b) 3 + 5 = 6 triangles have three sides. (c) Ten is the largest integer zero is the smallest integer. (d) The derivative of a constant function is zero tangent lines for increasing functions have positive slope. 5. Extend Figure 1.1 by writing the typical pattern of Ts and Fs for the truth table of a propositional form with four propositional variables and then with five propositional variables. 6. Use a parsing tree to show that the given string is a propositional form. (a) ๐ ∧ ๐ ∨ ๐ (b) ๐ ↔ ๐ ∨ ¬๐ (c) ๐ → ๐ → ๐ → ๐ (d) ¬๐ ∧ ๐ ∨ (๐ → ๐) ∧ ¬๐ (e) (๐ ∧ ๐ → ๐) ∧ ๐ → ๐ (f) ¬¬๐ ∨ ๐ ∧ ๐ → ๐ ∨ [๐ → ¬๐ → ¬(๐ ∨ ๐ )] 7. Define: ๐ := the angle sum of a triangle is 180, ๐ := 3 + 7 = 10, ๐ := the sine function is continuous. 18 Chapter 1 PROPOSITIONAL LOGIC Translate the given propositional forms into English. (a) ๐ ∨ ๐ (b) ๐ ∧ ๐ (c) ๐ ∧ ¬๐ (d) ๐ ∨ ¬๐ (e) ๐ ↔ ¬๐ (f) ๐ → ๐ (g) ๐ ∨ ๐ → ¬๐ (h) ๐ ↔ ๐ ∧ ¬๐ (i) ¬(๐ ∧ ๐) (j) ¬(๐ ∨ ๐) (k) ๐ ∨ ๐ ∧ ๐ (l) (๐ ∨ ๐) ∧ ๐ 8. Write the following sentences as propositional forms using the variables ๐ , ๐, and ๐ as defined in Exercise 7. (a) The sine function is continuous, and 3 + 7 = 10. (b) The angle sum of a triangle is 180, or the angle sum of a triangle is 180. (c) If 3 + 7 = 10, then the sine function is not continuous. (d) The angle sum of a triangle is 180 if and only if the sine function is continuous. (e) The sine function is continuous if and only if 3 + 7 = 10 implies that the angle sum of a triangle is not 180. (f) It is not the case that 3 + 7 ≠ 10. 9. Let ๐(๐ ) = T, ๐(๐) = T, ๐(๐ ) = F, and ๐(๐) = F. Find the given valuations. (See Exercise 6.) (a) ๐ ∧ ๐ ∨ ๐ (b) ๐ ↔ ๐ ∨ ¬๐ (c) ๐ → ๐ → ๐ → ๐ (d) ¬๐ ∧ ๐ ∨ (๐ → ๐) ∧ ¬๐ (e) (๐ ∧ ๐ → ๐) ∧ ๐ → ๐ (f) ¬¬๐ ∨ ๐ ∧ ๐ → ๐ ∨ [๐ → ¬๐ → ¬(๐ ∨ ๐ )] 10. Write the truth table for each of the given propositional forms. (a) ¬๐ → ๐ (b) ๐ → ¬๐ (c) (๐ ∨ ๐) ∧ ¬(๐ ∧ ๐) (d) (๐ → ๐) ∨ (๐ ↔ ๐ ) (e) ๐ ∧ (๐ ∨ ๐ ) (f) ๐ ∨ ๐ → ๐ (g) ๐ → ๐ ∧ ¬(๐ ∨ ๐ ) (h) ๐ → ๐ ↔ ๐ → ๐ (i) ๐ ∨ (¬๐ ↔ ๐ ) ∧ ๐ Section 1.2 INFERENCE 19 (j) (¬๐ ∨ ๐) ∧ ([๐ → ๐] ∨ ¬๐) 11. Check the truth value of these propositions using truth tables as in Example 1.1.12. (a) If 2 + 3 = 7, then 5 − 9 ≠ 0. (b) If a square is round implies that some functions have a derivative at ๐ฅ = 2, then every function has a derivative at ๐ฅ = 2. (c) Either four is odd or two is even implies that three is even. (d) Every even integer is divisible by 4 if and only if either 7 divides 21 or 9 divides 12. (e) The graph of the tangent function has asymptotes, and if sine is an increasing function, then cosine is a decreasing function. 12. If possible, find propositional forms ๐ and ๐ such that (a) ๐ ∧ ๐ is a tautology. (b) ๐ ∨ ๐ is a contradiction. (c) ¬๐ is a tautology. (d) ๐ → ๐ is a contradiction. (e) ๐ ↔ ๐ is a tautology. (f) ๐ ↔ ๐ is a contradiction. 1.2 INFERENCE Now that we have a collection of propositional forms and a means by which to interpret them as either true or false, we want to define a system that expands these ideas to include methods by which we can prove certain propositional forms from given propositional forms. What we will define is familiar because it is similar to what Euclid did with his geometry. Take, for example, the familiar result, opposite angles in a parallelogram are congruent. In other words, if ๐ด๐ต๐ถ๐ท is a parallelogram, then ∠๐ต ≅ ∠๐ท, which translates to: Given: Prove: ๐ด๐ต๐ถ๐ท is a parallelogram, ∠๐ต ≅ ∠๐ท. To demonstrate this, we draw a diagram, A D and then write a proof: B C 20 Chapter 1 PROPOSITIONAL LOGIC 1. 2. 3. 4. 5. 6. 7. ๐ด๐ต๐ถ๐ท is a parallelogram Join ๐ด๐ถ ∠๐ท๐ด๐ถ ≅ ∠๐ต๐ถ๐ด ∠๐ด๐ถ๐ท ≅ ∠๐ถ๐ด๐ต ๐ด๐ถ ≅ ๐ด๐ถ โณ๐ด๐ถ๐ท ≅ โณ๐ถ๐ด๐ต ∠๐ต ≅ ∠๐ท Given Postulate Alternate interior angles Alternate interior angles Reflexive ASA Corresponding parts Euclid’s geometry consists of geometric propositions that are established by proofs like the above. These proofs rely on rules of logic, previously proved propositions (lemmas, theorems, and corollaries), and propositions that are assumed to be true (the postulates). Using this system of thought, we can show which geometric propositions follow from the postulates and conclude which propositions are true, whatever it means for a geometric proposition to be true. Euclidean geometry serves as a model for the following modern definition. DEFINITION 1.2.1 A logical system consists of the following: โ An alphabet โ A grammar โ Propositional forms that require no proof โ Rules that determine truth โ Rules that are used to write proofs. Although Euclid did not provide an alphabet or a grammar specifically for his geometry, his system did include the last three aspects of a logical system. In this chapter we develop the logical system known as propositional logic. Its alphabet, grammar, and rules that determine truth were defined in Section 1.1. The remainder of this chapter is spent establishing the other two components. Consider the following collection of propositions: If squares are rectangles, then squares are quadrilaterals. Squares are rectangles. Therefore, squares are quadrilaterals. This is an example of a deduction, a collection of propositions of which one is supposed to follow necessarily from the others. In this particular case, if squares are rectangles, then squares are quadrilaterals and Section 1.2 INFERENCE 21 squares are rectangles are the premises, and squares are quadrilaterals is the conclusion. We recognize that in this case, the conclusion does follow from the premises because whenever the premises are true, the conclusion must also be true. When this is the case, the deduction is semantically valid, else it is semantically invalid. Notice that not only do we see that the deduction works because of the meaning of the propositions, but we also see that it is valid based on the forms of the sentences. In other words, we also recognize this deduction as valid: If Hausdorff spaces are preregular, their points can be separated. Hausdorff spaces are preregular. Therefore, their points can be separated. Although we might not know the terms Hausdorff space, preregular, and separated, we recognize the deduction as valid because it is of the same pattern as the first deduction: ๐→๐ ๐ ∴ ๐ (1.2) When the deduction is found to work based on its form, the deduction is syntactically valid, else it is syntactically invalid. We study both types of validity by examining general patterns of deductions and choosing rules that determine which forms correspond to deductions that are valid semantically and which forms correspond to deductions that are valid syntactically. Semantics The study of meaning is called semantics. We began this study when we wrote truth tables. These are characterized as semantic because the truth value of a proposition is based on its meaning. Our goal is to use truth tables to determine when an argument form, an example being (1.2), corresponds to a deduction that is semantically valid. We begin with a definition. DEFINITION 1.2.2 Let ๐0 , ๐1 , … , ๐๐−1 and ๐ be propositional forms. โ If ๐ is a tautology, write โจ ๐. โ Define ๐0 , ๐1 , … , ๐๐−1 to logically imply ๐ if โจ ๐0 ∧ ๐1 ∧ · · · ∧ ๐๐−1 → ๐. 22 Chapter 1 PROPOSITIONAL LOGIC When ๐0 , ๐1 , … , ๐๐−1 logically imply ๐, write ๐0 , ๐1 , … , ๐๐−1 โจ ๐. and say that ๐ is a consequence of ๐0 , ๐1 , … , ๐๐−1 . Call the propositional forms ๐0 , ๐1 , … , ๐๐−1 the premises of the implication and ๐ the conclusion. Notice that if ๐0 , ๐1 , … , ๐๐−1 โจ ๐, then for any valuation ๐, whenever ๐(๐๐ ) = T for all ๐ = 0, 1, … , ๐ − 1, it must be the case that ๐(๐) = T. Moreover, any deduction with premises represented by ๐0 , ๐1 , … , ๐๐−1 and conclusion by ๐ is semantically valid if ๐0 , ๐1 , … , ๐๐−1 โจ ๐. EXAMPLE 1.2.3 Because of Example 1.1.13, both โจ ๐ → ๐ and โจ ๐ ∨ ¬๐ . EXAMPLE 1.2.4 Prove: ๐ → ๐, ๐ โจ ๐ To accomplish this, show that the propositional form (๐ → ๐) ∧ ๐ → ๐, with antecedent equal to the conjunction of the premises and consequent consisting of the conclusion is a tautology. ๐ ๐ → ๐ (๐ → ๐) ∧ ๐ T T T F F F T T F F T F ๐ T T F F (๐ → ๐) ∧ ๐ → ๐ T T T T Therefore, โจ (๐ → ๐) ∧ ๐ → ๐, so ๐ → ๐, ๐ โจ ๐, and any deduction based on this form is semantically valid. EXAMPLE 1.2.5 Prove: ๐ ∨ ๐ → ๐, ๐ โจ ๐ ๐ T T F F ๐ T F T F ๐ ∨๐ T T T F ๐ ∨๐→๐ T F T T (๐ ∨ ๐ → ๐) ∧ ๐ T F F F (๐ ∨ ๐ → ๐) ∧ ๐ → ๐ T T T T If it is possible for ๐(๐๐ ) = T for ๐ = 0, 1, … , ๐ − 1 yet ๐(๐) = F, the propositional form ๐0 ∧ ๐1 ∧ · · · ∧ ๐๐−1 → ๐ Section 1.2 INFERENCE 23 is not a tautology, so ๐ is not a consequence of ๐0 , ๐1 , … , ๐๐−1 . If this is the case, write ๐0 , ๐1 , … , ๐๐−1 ฬธโจ ๐. EXAMPLE 1.2.6 Prove: ๐ ∧ ๐ → ๐, ๐ ฬธโจ ๐ ๐ T T F F ๐ T F T F ๐ ∧๐ T F F F ๐ ∧๐→๐ T T T T (๐ ∧ ๐ → ๐) ∧ ๐ T T F F (๐ ∧ ๐ → ๐) ∧ ๐ → ๐ T F T T Notice that F appears for (๐ ∧ ๐ → ๐) ∧ ๐ → ๐ on a line when ๐(๐ ∧ ๐ → ๐) = ๐(๐ ) = T yet ๐(๐) = F. Because of this, we can shorten the procedure for showing that a propositional form is not a consequence of other propositional forms. EXAMPLE 1.2.7 Prove: ๐ ∧ ๐ → ๐ ฬธโจ ๐ → ๐ ๐ T ๐ F ๐ ๐ ∧๐→๐ F T ๐ →๐ F Observe that this shows that the valuation of ๐ ∧ ๐ → ๐ can be T at the same time that the valuation of ๐ → ๐ is F. Syntactics Although we will return to semantics, it is important to note that using truth tables to check for logical implication has its limitations. If the argument form involves many propositional forms or if the propositional forms are complicated, the truth table used to show or disprove the logical implication can become unwieldy. Another issue is that in practice, truth tables are not the method of choice when determining whether a conclusion follows from the premises. What is typically done is to follow Euclid’s example (page 20), basing the conclusions on the syntax of the argument form, namely, based only on its pattern and structure. Let us return to the deduction on page 20 and work with it differently. Start with the two propositions, if squares are rectangles, then squares are quadrilaterals and squares are rectangles. Because of the combined structure of the two sentences, we know that we can write 24 Chapter 1 PROPOSITIONAL LOGIC squares are quadrilaterals. This act of writing (on paper or a blackboard or in the mind) means that we have the proposition and that it follows from the first two. Similarly, if we start with squares are triangles, or squares are rectangles and squares are not triangles, we can write squares are rectangles. Determining a method that will model this reasoning requires us to find rules by which propositional forms can be written from other propositional forms. Since every logical system requires a starting point, the first step in this process is to choose which propositional forms can be written without any prior justification. Each such propositional form is called an axiom. Playing the same role as that of a postulate in Euclidean geometry, an axiom can be considered as a rule of the game. Certain propositional forms lend themselves as good candidates for axioms because they are regarded as obvious. That is, they are self-evident. Other propositional forms are good candidates to be axioms, not because they are necessarily self-evident, but because they are helpful. In either case, the number of axioms should be as few as possible so as to minimize the number of assumptions. For propositional logic, we choose only three. They were first found in work of Gottlob Frege (1879) and later in that of Jan ลukasiewicz (1930). AXIOMS 1.2.8 [Frege–ลukasiewicz] Let ๐, ๐, and ๐ be propositional forms. โ [FL1] ๐ → (๐ → ๐) โ [FL2] ๐ → (๐ → ๐) → (๐ → ๐ → [๐ → ๐]) โ [FL3] ¬๐ → ¬๐ → (๐ → ๐). The next step in defining propositional logic is to state when it is legal to write a propositional form from given propositional forms. DEFINITION 1.2.9 The propositional forms ๐0 , ๐1 , … , ๐๐−1 infer ๐ if ๐ can be written whenever ๐0 , ๐1 , … , ๐๐−1 are written. Denote this by ๐0 , ๐1 , … , ๐๐−1 ⇒ ๐. This is known as an inference. To make rigorous which propositional forms can be inferred from given forms, we establish some rules. These are chosen because they model basic reasoning. They are also not proved, so they serve as postulates for our logic. Section 1.2 INFERENCE 25 INFERENCE RULES 1.2.10 Let ๐, ๐, ๐, and ๐ be propositional forms. โ Modus Ponens [MP] ๐ → ๐, ๐ ⇒ ๐ โ Modus Tolens [MT] ๐ → ๐, ¬๐ ⇒ ¬๐ โ Constructive Dilemma [CD] (๐ → ๐) ∧ (๐ → ๐ ), ๐ ∨ ๐ ⇒ ๐ ∨ ๐ โ Destructive Dilemma [DD] (๐ → ๐) ∧ (๐ → ๐ ), ¬๐ ∨ ¬๐ ⇒ ¬๐ ∨ ¬๐ โ Disjunctive Syllogism [DS] ๐ ∨ ๐, ¬๐ ⇒ ๐ โ Hypothetical Syllogism [HS] ๐ → ๐, ๐ → ๐ ⇒ ๐ → ๐ โ Conjunction [Conj] ๐, ๐ ⇒ ๐ ∧ ๐ โ Simplification [Simp] ๐∧๐ ⇒๐ โ Addition [Add] ๐ ⇒ ๐ ∨ ๐. To use Inference Rules 1.2.10, match the form exactly. For example, even though ๐ → ๐ appears to follow from (๐ ∧ ๐) → ๐ as an application of simplification, it does not. The problem is that simplification can only be applied to propositional forms with the ๐ ∧ ๐ pattern, but (๐ ∧ ๐) → ๐ is of the form ๐ → ๐. With this detail in mind, we make some inferences. EXAMPLE 1.2.11 Each inference is justified by the indicated rule. โ Modus ponens ๐ ∧ ๐ → ¬๐ , ๐ ∧ ๐ ⇒ ¬๐ โ Addition ๐ ⇒๐ ∨๐∧๐ โ Modus tolens ¬¬๐ , ๐ ∨ ๐ → ¬๐ ⇒ ¬(๐ ∨ ๐ ). 26 Chapter 1 PROPOSITIONAL LOGIC EXAMPLE 1.2.12 Since it is possible that some propositional forms are not needed for the inference, we also have the following: โ Modus ponens ๐ ∧ ๐ → ¬๐ , ๐ ∨ ๐, ๐ ∧ ๐, ๐ ↔ ๐ ⇒ ¬๐ โ Addition ๐ , ๐ , ๐ → ๐ ⇒ ๐ ∨ ๐ ∧ ๐ โ Modus tolens ¬๐, ¬¬๐ , ๐ ∧ ๐ , ๐ ∨ ๐ → ¬๐ ⇒ ¬(๐ ∨ ๐ ). Inference is a powerful tool, but it can only be used to check simple deductions. Sometimes multiple inferences are needed to move from a collection of premises to a conclusion. For example, if we write ๐ ∨ ๐, ¬๐, ๐ → ๐, based on the first two propositional forms, we can write ๐ by DS, and then based on this propositional form and the third of the given propositional forms, we can write ๐ by MP. This is a simple example of the next definition. DEFINITION 1.2.13 โ A formal proof of the propositional form ๐ (the conclusion) from the propositional forms ๐0 , ๐1 , … , ๐๐−1 (the premises) is a sequence of propositional forms, ๐0 , ๐1 , … , ๐๐−1 , ๐0 , ๐1 , … , ๐๐−1 , such that ๐๐−1 = ๐, and for all ๐ = 0, 1, … , ๐ − 1, either ๐๐ is an axiom, if ๐ = 0, then ๐0 , ๐1 , … , ๐๐−1 ⇒ ๐๐ , or if ๐ > 0, then ๐0 , ๐1 , … , ๐๐−1 , ๐0 , ๐1 , … , ๐๐−1 ⇒ ๐๐ . If there exists a formal proof of ๐ from ๐0 , ๐1 , … , ๐๐−1 , then ๐ is proved or deduced from ๐0 , ๐1 , … , ๐๐−1 and we write ๐0 , ๐1 , … , ๐๐−1 โข ๐. โ If there are no premises, a formal proof of ๐ is a sequence, ๐0 , ๐1 , … , ๐๐−1 , Section 1.2 INFERENCE 27 such that ๐0 is an axiom, ๐๐−1 = ๐, and for all ๐ > 0, either ๐๐ is an axiom or ๐0 , ๐1 , … , ๐๐−1 ⇒ ๐๐ . In this case, write โข ๐ and call ๐ a theorem. Observe that any deduction with premises represented by ๐0 , ๐1 , … , ๐๐−1 and conclusion by ๐ is syntactically valid if ๐0 , ๐1 , … , ๐๐−1 โข ๐. We should note that although ⇒ and โข have different meanings as syntactic symbols, they are equivalent. If ๐ ⇒ ๐, then ๐ โข ๐ using the proof ๐, ๐. Conversely, suppose ๐ โข ๐. This means that there exists a proof ๐, ๐0 , ๐1 , … , ๐๐−1 , ๐, so every time we write down ๐, we can also write down ๐. That is, ๐ ⇒ ๐. We summarize this as follows. THEOREM 1.2.14 For all propositional forms ๐ and ๐, ๐ ⇒ ๐ if and only if ๐ โข ๐. We use a particular style to write formal proofs. They will be in two-column format with each line being numbered. In the first column will be the sequence of propositional forms that make up the proof. In the second column will be the reasons that allowed us to include each form. The only reasons that we will use are โ Given (for premises), โ FL1, FL2, or FL3 (for an axiom), โ An inference rule. An inference rule is cited by giving the line numbers used as the premises followed by the abbreviation for the rule. Thus, the following proves ๐ ∨ ๐ → ๐ ∧ ๐ , ๐ โข ๐: 1. 2. 3. 4. 5. ๐ ∨๐→๐∧๐ ๐ ๐ ∨๐ ๐∧๐ ๐ Given Given 2 Add 1, 3 MP 4 Simp Despite the style, we should remember that a proof is a sequence of propositional forms that satisfy Definition 1.2.13. In this case, the sequence is ๐ ∨ ๐ → ๐ ∧ ๐ , ๐ , ๐ ∨ ๐, ๐ ∧ ๐ , ๐. The first two examples involve proofs that use the axioms. 28 Chapter 1 PROPOSITIONAL LOGIC EXAMPLE 1.2.15 Prove: โข ๐ → ๐ → (๐ → ๐ ) 1. 2. 3. ๐ → (๐ → ๐ ) ๐ → (๐ → ๐ ) → (๐ → ๐ → [๐ → ๐ ]) ๐ → ๐ → (๐ → ๐ ) FL1 FL2 MP This proves that ๐ → ๐ → (๐ → ๐ ) is a theorem. Also, by adding ๐ → ๐ as a given and an application of MP at the end, we can prove ๐ → ๐ โข ๐ → ๐. This result should not be surprising since ๐ → ๐ is a tautology. We would expect any premise to be able to prove it. EXAMPLE 1.2.16 Prove: ¬(๐ → ๐ ), ¬๐ โข ¬๐ 1. 2. 3. 4. 5. ¬(๐ → ๐ ) ¬๐ ¬๐ → ¬๐ → (๐ → ๐ ) ¬๐ → ¬๐ ¬๐ Given Given FL3 1, 3 MT 2, 4 MP The next three examples do not use an axiom in their proofs. EXAMPLE 1.2.17 Prove: ๐ → ๐, ๐ → ๐ , ๐ ∨ ¬๐ , ¬๐ โข ¬๐ 1. 2. 3. 4. 5. 6. 7. ๐ →๐ ๐→๐ ๐ ∨ ¬๐ ¬๐ ๐ →๐ ¬๐ ¬๐ Given Given Given Given 1, 2 HS 3, 4 DS 5, 6 MT EXAMPLE 1.2.18 Prove: ๐ → ๐, ๐ → ๐ → (๐ → ๐), ๐ ∨ ๐ , ¬๐ โข ๐ 1. 2. 3. ๐ →๐ ๐ → ๐ → (๐ → ๐) ๐ ∨๐ Given Given Given Section 1.2 INFERENCE 4. 5. 6. 7. 8. ¬๐ ๐ →๐ (๐ → ๐) ∧ (๐ → ๐) ๐∨๐ ๐ Given 1, 2 MP 1, 5 Conj 3, 6 CD 4, 7 DS EXAMPLE 1.2.19 Prove: ๐ → ๐, ๐ → ๐ , ¬๐ โข ¬๐ ∨ ¬๐ 1. 2. 3. 4. 5. 6. ๐ →๐ ๐→๐ ¬๐ (๐ → ๐ ) ∧ (๐ → ๐) ¬๐ ∨ ¬๐ ¬๐ ∨ ¬๐ Given Given Given 1, 2 Conj 3 Add 4, 5 DD Exercises 1. Show using truth tables. (a) ¬๐ ∨ ๐, ¬๐ โจ ¬๐ (b) ¬(๐ ∧ ๐), ๐ โจ ¬๐ (c) ๐ → ๐, ๐ โจ ๐ ∨ ๐ (d) ๐ → ๐, ๐ → ๐ , ๐ โจ ๐ (e) ๐ ∨ ๐ ∧ ๐ , ¬๐ โจ ๐ 2. Show the following using truth tables. (a) ¬(๐ ∧ ๐) ฬธโจ ¬๐ (b) ๐ → ๐ ∨ ๐ , ๐ ฬธโจ ๐ (c) ๐ ∧ ๐ → ๐ ฬธโจ ๐ → ๐ (d) (๐ → ๐) ∨ (๐ → ๐), ๐ ∨ ๐ ฬธโจ ๐ ∨ ๐ (e) ¬(๐ ∧ ๐) ∨ ๐ , ๐ ∧ ๐ ∨ ๐ ฬธโจ ๐ ∧ ๐ (f) ๐ ∨ ๐ , ๐ ∨ ๐, ๐ ↔ ๐ ฬธโจ ๐ ∧ ๐ 3. Identify the rule from Inference Rules 1.2.10. (a) ๐ → ๐ → ๐ , ๐ → ๐ ⇒ ๐ (b) ๐ , ๐ ∨ ๐ ⇒ ๐ ∧ (๐ ∨ ๐ ) (c) ๐ ⇒ ๐ ∨ (๐ ↔ ¬๐ ∧ ¬[๐ → ๐]) (d) ๐ , ๐ → (๐ ↔ ๐) ⇒ ๐ ↔ ๐ (e) ๐ ∨ ๐ ∨ ๐, (๐ ∨ ๐ → ๐) ∧ (๐ → ๐ ∧ ๐ ) ⇒ ๐ ∨ ๐ ∧ ๐ (f) ๐ ∨ (๐ ∨ ๐), ¬๐ ⇒ ๐ ∨ ๐ (g) ๐ → ¬๐, ¬¬๐ ⇒ ¬๐ (h) (๐ → ๐) ∧ (๐ → ๐ ), ¬๐ ∨ ¬๐ ⇒ ¬๐ ∨ ¬๐ (i) (๐ → ๐) ∧ (๐ → ๐ ) ⇒ ๐ → ๐ 29 30 Chapter 1 PROPOSITIONAL LOGIC 4. Arrange each collection of propositional forms into a proof for the given deductions and supply the appropriate reasons. (a) ๐ → ๐, ๐ → ๐, ๐ โข ๐ ∨ ๐ โ ๐ โ ๐∨๐ โ ๐ →๐ โ (๐ → ๐) ∧ (๐ → ๐) โ ๐ ∨๐ โ ๐ →๐ (b) ๐ → ๐, ๐ → ๐ , ๐ โข ๐ ∨ ๐ โ ๐ โ ๐ →๐ โ ๐ →๐ โ ๐ โ ๐→๐ โ ๐ ∨๐ (c) (๐ → ๐) ∨ (๐ → ๐ ), ¬(๐ → ๐), ¬๐ , ๐ ∨ ๐ โข ๐ โ ¬(๐ → ๐) โ ๐ โ (๐ → ๐) ∨ (๐ → ๐ ) โ ๐→๐ โ ¬๐ โ ¬๐ โ ๐∨๐ (d) (๐ ∨ ๐) ∧ ๐ , ๐ ∨ ๐ → ๐ , ¬๐ โข ¬๐ ∧ ๐ โ ๐ ∨๐ โ ๐ โ ๐∨๐ โ ๐ โ (๐ ∨ ๐) ∧ ๐ โ ๐∨๐ →๐ โ ¬๐ โ ¬๐ ∧ ๐ 5. Prove using Axioms 1.2.8. (a) โข ๐ → ๐ (b) โข ¬¬๐ → ๐ (c) โข ๐ → (๐ → [๐ → ๐ ]) (d) ¬๐ โข ๐ → ๐ (e) ๐ → ๐, ๐ → ๐ โข ๐ → ๐ (Do not use HS.) (f) ๐ → ๐, ¬๐ โข ¬๐ (Do not use MT.) 31 Section 1.3 REPLACEMENT 6. Prove. Axioms 1.2.8 are not required. (a) ๐ → ๐, ๐ ∨ (๐ → ๐), ¬๐ โข ๐ → ๐ (b) ๐ → ๐, ๐ → ๐ , ¬๐ โข ¬๐ (c) ๐ → ๐, ๐ → ๐, ¬๐ ∨ ¬๐ โข ¬๐ ∨ ¬๐ (d) [๐ → (๐ → ๐ )] ∧ [๐ → (๐ → ๐ )], ๐ ∨ ๐, ¬(๐ → ๐ ), ¬๐ โข ¬๐ (e) ๐ → ๐ → (๐ → ๐), ๐ → ๐ , ๐ → ๐, ๐ โข ๐ (f) ๐ → ๐ ∧ ๐ , ๐ ∨ ๐ → ๐ ∧ ๐ , ๐ โข ๐ (g) ๐ → ๐ ∧ ๐ , ¬(๐ ∧ ๐ ), ๐ ∧ ๐ ∨ (¬๐ → ๐) โข ๐ (h) ๐ ∨ ๐ → ¬๐ ∧ ¬๐, ๐ → ๐ , ๐ โข ¬๐ (i) ๐ → ๐ , ๐ ∨ ๐ ∨ ๐ → ๐ ∨ ๐ , ๐ ∨ ๐ → ๐ , ๐ โข ๐ (j) ๐ → ๐, ๐ → ๐ , ๐ → ๐, ๐ → ๐ , ๐ ∨ ๐ , ¬๐ โข ๐ (k) ๐ ∨ ๐ → ๐ ∨ ๐, (๐ → ๐ ) ∧ (๐ → ๐ ), ๐ , ¬๐ โข ๐ (l) ๐ → ๐, ๐ → ๐ , ๐ → ๐, (๐ ∨ ๐) ∧ (๐ ∨ ๐) โข ๐ ∨ ๐ (m) ๐ ∨ ¬๐ ∨ ๐ → (๐ → ๐ ), ๐ ∨ ¬๐ → (๐ → ๐ ), ๐ โข ๐ → ๐ 1.3 REPLACEMENT There are times when writing a formal proof that we want to substitute one propositional form for another. This happens when two propositional forms have the same valuations. It also happens when a particular sentence pattern should be able to replace another sentence pattern. We give rules in this section that codify both ideas. Semantics Consider the propositional form ¬(๐ ∨ ๐). Its valuation equals T when it is not the case that ๐(๐ ) = T or ๐(๐) = T (Definition 1.1.9). This implies that ๐(๐ ) = F and ๐(๐) = F, so ๐(¬๐ ∧ ¬๐) = T. Conversely, the valuation of ¬๐ ∧ ¬๐ is T implies that the valuation of ¬(๐ ∨ ๐) is T for similar reasons. Since no additional premises were assumed in this discussion, we conclude that ๐(¬[๐ ∨ ๐]) = ๐(¬๐ ∧ ¬๐), (1.3) and this means that ¬(๐ ∨ ๐) ↔ ¬๐ ∧ ¬๐ is a tautology. There is a name for this. DEFINITION 1.3.1 Two propositional forms ๐ and ๐ are logically equivalent if โจ ๐ ↔ ๐. Observe that Definition 1.3.1 implies the following result. THEOREM 1.3.2 All tautologies are logically equivalent, and all contradictions are logically equivalent. 32 Chapter 1 PROPOSITIONAL LOGIC Because of (1.3), we can use a truth table to prove logical equivalence. EXAMPLE 1.3.3 Prove: โจ ¬(๐ ∨ ๐) ↔ ¬๐ ∧ ¬๐. ๐ T T F F ๐ ๐ ∨ ๐ ¬(๐ ∨ ๐) T T F F T F T T F F F T ¬๐ F F T T ¬๐ ¬๐ ∧ ¬๐ F F T F F F T T EXAMPLE 1.3.4 Because โจ ๐ → ๐ ↔ ¬๐ ∨ ๐, we can replace ๐ → ๐ with ¬๐ ∨ ๐, and vice versa, at any time. When this is done, the resulting propositional form is logically equivalent to the original. For example, โจ ๐ ∧ (๐ → ๐) ↔ ๐ ∧ (¬๐ ∨ ๐). To see this, examine the truth table ๐ T T F F ๐ ๐ → ๐ ๐ธ ∧ (๐ท → ๐ธ) ¬๐ T T T F F F F F T T T T F T F T ¬๐ ∨ ๐ ๐ธ ∧ (¬๐ท ∨ ๐ธ) T T F F T T T F EXAMPLE 1.3.5 Consider ๐ ∧ (๐ → ๐). Since โจ ๐ → ๐ ↔ ¬๐ → ¬๐ [Exercise 2(f)], we can replace ๐ → ๐ with ¬๐ → ¬๐ giving โจ ๐ ∧ (๐ → ๐) ↔ ๐ ∧ (¬๐ → ¬๐ ). When studying an implication, we sometimes need to investigate the different ways that its antecedent and consequent relate to each other. DEFINITION 1.3.6 The converse of a given implication is the conditional proposition formed by exchanging the antecedent and consequent of the implication (Figure 1.2). DEFINITION 1.3.7 The contrapositive of a given implication is the conditional proposition formed by exchanging the antecedent and consequent of the implication and then replacing them with their negations (Figure 1.3). Section 1.3 REPLACEMENT If If the antecedent, the consequent, Figure 1.2 If If then the consequent. then the antecedent. Writing the converse. the antecedent, then not the consequent, then Figure 1.3 33 the consequent. not the antecedent. Writing the contrapositive. For example, the converse of if rectangles have four sides, squares have for sides is if squares have four sides, rectangles have four sides, and its contrapositive is if squares do not have four sides, rectangles do not have four sides. Notice that a biconditional proposition is simply the conjunction of a conditional with its converse. EXAMPLE 1.3.8 The propositional form ๐ → ๐ has ๐ → ๐ as its converse and ¬๐ → ¬๐ as its contrapositive. The first and fourth columns on the right of the next truth table show โจ ๐ → ๐ ↔ ¬๐ → ¬๐ , while ฬธโจ ๐ → ๐ ↔ ๐ → ๐ is shown by the first and last columns. ๐ ๐ ๐ → ๐ ¬๐ ¬๐ ¬๐ → ¬๐ ๐ → ๐ T T T F F T T T F F T F F T F T T F T T F F F T T T T T 34 Chapter 1 PROPOSITIONAL LOGIC Syntactics If we limit our proofs to Inference Rules 1.2.10, we quickly realize that there will be little of interest that we can prove. We would have no reason on which to base such clear inferences as ๐ โข๐∨๐ or ๐ ∨ ๐, ¬๐ โข ๐ . To fix this, we expand our collection of inference rules with a new type. Suppose that we know that the form ๐ ∧ ๐ can replace ๐ ∧ ๐ at any time and vice versa. For example, in the propositional form ๐ ∧ ๐ → ๐ , (1.4) ๐ ∧ ๐ can be replaced with ๐ ∧ ๐ so that we can write the new form ๐ ∧ ๐ → ๐ . (1.5) This type of rule is called a replacement rule and is written using the ⇔ symbol. For example, the replacement rule that allowed us to write (1.5) from (1.4) is ๐ ∧ ๐ ⇔ ๐ ∧ ๐. Similarly, when the replacement rule ¬(๐ ∧ ๐) ⇔ ¬๐ ∨ ¬๐ is applied to ๐ ∨ (¬๐ ∨ ¬๐ ), the result is ๐ ∨ ¬(๐ ∧ ๐ ). We state without proof the standard replacement rules. REPLACEMENT RULES 1.3.9 Let ๐, ๐, and ๐ be propositional forms. โ Associative Laws [Assoc] ๐ ∧ ๐ ∧ ๐ ⇔ ๐ ∧ (๐ ∧ ๐) ๐ ∨ ๐ ∨ ๐ ⇔ ๐ ∨ (๐ ∨ ๐) โ Commutative Laws [Com] ๐∧๐ ⇔๐∧๐ ๐∨๐ ⇔๐∨๐ โ Distributive Laws [Distr] ๐ ∧ (๐ ∨ ๐) ⇔ ๐ ∧ ๐ ∨ ๐ ∧ ๐ ๐ ∨ ๐ ∧ ๐ ⇔ (๐ ∨ ๐) ∧ (๐ ∨ ๐) โ Contrapositive Law [Contra] ๐ → ๐ ⇔ ¬๐ → ¬๐ Section 1.3 REPLACEMENT 35 โ Double Negation [DN] ๐ ⇔ ¬¬๐ โ De Morgan’s Laws [DeM] ¬(๐ ∧ ๐) ⇔ ¬๐ ∨ ¬๐ ¬(๐ ∨ ๐) ⇔ ¬๐ ∧ ¬๐ โ Idempotency [Idem] ๐∧๐⇔๐ ๐∨๐⇔๐ โ Material Equivalence [Equiv] ๐ ↔ ๐ ⇔ (๐ → ๐) ∧ (๐ → ๐) ๐ ↔ ๐ ⇔ ๐ ∧ ๐ ∨ ¬๐ ∧ ¬๐ โ Material Implication [Impl] ๐ → ๐ ⇔ ¬๐ ∨ ๐ โ Exportation [Exp] ๐ ∧ ๐ → ๐ ⇔ ๐ → (๐ → ๐). A replacement rule is used in a formal proof by appealing to the next inference rule. INFERENCE RULE 1.3.10 For all propositional forms ๐ and ๐, if ๐ if obtained from ๐ using a replacement rule, ๐ ⇒ ๐ and ๐ ⇒ ๐. As with Inference Rules 1.2.10, the replacement rule used when applying Inference Rule 1.3.10 must be used exactly as stated. This includes times when it seems unnecessary because it appears obvious. For example, ๐ ∨ ๐ does not follow directly from ¬๐ → ๐ using Impl. Instead, include a use of DN to give the correct sequence, ¬๐ → ๐, ¬¬๐ ∨ ๐, ๐ ∨ ๐. Similarly, the inference rule Add does not allow for ๐∨๐ to be derived from ๐ . Instead, we derive ๐ ∨ ๐. Follow this by Com to conclude ๐ ∨ ๐ . When writing formal proofs that appeal to Inference Rule 1.3.10, do not cite that particular rule but reference the replacement rule’s abbreviation using Replacement Rule 1.3.9 and the line which serves as a premise to the replacement. We use this practice in the following examples. EXAMPLE 1.3.11 Although it is common practice to move parentheses freely when solving equations, inferences such as ๐ ∧ ๐ ∧ (๐ ∧ ๐) โข ๐ ∧ (๐ ∧ ๐ ) ∧ ๐ 36 Chapter 1 PROPOSITIONAL LOGIC must be carefully demonstrated. Fortunately, this example only requires two applications of the associative law. Using the boxes as a guide, notice that ๐ ∧๐ ∧( ๐ ∧ ๐ ) is of the same form as the right-hand side of the associative law. Hence, we can remove the parentheses to obtain ๐ ∧๐ ∧ ๐ ∧ ๐ . Next, view the propositional form as ๐ ∧๐∧๐ ∧ ๐ . One more application within the first box yields the result, ๐ ∧ (๐ ∧ ๐ ) ∧ ๐ . We may, therefore, write a sequence of inferences by Inference Rule 1.3.10, ๐ ∧ ๐ ∧ (๐ ∧ ๐) ⇒ (๐ ∧ ๐ ∧ ๐ ) ∧ ๐ ⇒ [๐ ∧ (๐ ∧ ๐ )] ∧ ๐, and we have a proof of [๐ ∧ (๐ ∧ ๐ )] ∧ ๐ from ๐ ∧ ๐ ∧ (๐ ∧ ๐): 1. 2. 3. ๐ ∧ ๐ ∧ (๐ ∧ ๐) ๐ ∧๐∧๐ ∧๐ ๐ ∧ (๐ ∧ ๐ ) ∧ ๐ Given 1 Assoc 2 Assoc EXAMPLE 1.3.12 Prove: ๐ ∧ ๐ โข ¬๐ → ๐ 1. 2. 3. 4. 5. ๐ ∧๐ ๐ ๐ ∨๐ ¬¬๐ ∨ ๐ ¬๐ → ๐ Given 1 Simp 2 Add 3 DN 4 Impl EXAMPLE 1.3.13 Prove: ๐ → ๐, ๐ → ๐ โข ๐ ∨ ๐ → ๐ 1. 2. 3. 4. 5. ๐ →๐ ๐ →๐ (๐ → ๐) ∧ (๐ → ๐) (¬๐ ∨ ๐) ∧ (¬๐ ∨ ๐) (๐ ∨ ¬๐ ) ∧ (๐ ∨ ¬๐ ) Given Given 1, 2 Conj 3 Impl 4 Com Section 1.3 REPLACEMENT 6. 7. 8. 9. ๐ ∨ ¬๐ ∧ ¬๐ ¬๐ ∧ ¬๐ ∨ ๐ ¬(๐ ∨ ๐ ) ∨ ๐ ๐ ∨๐ →๐ 37 5 Dist 6 Com 7 DeM 8 Impl EXAMPLE 1.3.14 Prove: ๐ ∧ ๐ ∨ ๐ ∧ ๐ โข (๐ ∨ ๐) ∧ (๐ ∨ ๐ ) 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. ๐ ∧๐∨๐ ∧๐ (๐ ∧ ๐ ∨ ๐ ) ∧ (๐ ∧ ๐ ∨ ๐) (๐ ∨ ๐ ∧ ๐) ∧ (๐ ∨ ๐ ∧ ๐) (๐ ∨ ๐ ) ∧ (๐ ∨ ๐) ∧ [(๐ ∨ ๐ ) ∧ (๐ ∨ ๐)] (๐ ∨ ๐) ∧ (๐ ∨ ๐ ) ∧ [(๐ ∨ ๐ ) ∧ (๐ ∨ ๐)] (๐ ∨ ๐) ∧ ([๐ ∨ ๐ ] ∧ [(๐ ∨ ๐ ) ∧ (๐ ∨ ๐)]) ๐ ∨๐ (๐ ∨ ๐ ) ∧ (๐ ∨ ๐) ∧ [(๐ ∨ ๐) ∧ (๐ ∨ ๐ )] (๐ ∨ ๐ ) ∧ (๐ ∨ ๐) ๐ ∨๐ (๐ ∨ ๐ ) ∧ (๐ ∨ ๐) (๐ ∨ ๐) ∧ (๐ ∨ ๐ ) Given 1 Dist 2 Com 3 Dist 4 Com 5 Assoc 6 Simp 5 Com 8 Simp 9 Simp 7, 10 Conj 11 Com EXAMPLE 1.3.15 Both โข ๐ → ๐ and โข ๐ ∨ ¬๐ . Here is the proof for the second theorem: 1. 2. 3. 4. 5. ๐ → (๐ → ๐ ) ¬๐ ∨ (¬๐ ∨ ๐ ) (¬๐ ∨ ¬๐ ) ∨ ๐ ¬๐ ∨ ๐ ๐ ∨ ¬๐ FL1 1 Impl 2 Assoc 3 Idem 4 Com This proof can be generalized to any propositional form ๐, so that we also have โข ๐ → ๐ and โข ๐ ∨ ¬๐. The theorem ๐ ∨ ¬๐ is known as the law of the excluded middle, and the theorem ¬(๐ ∧ ¬๐) is the law of noncontradiction. Notice that by De Morgan’s law with Com and DN, we have for all propositional forms ๐ โข (๐ ∨ ¬๐) ↔ ¬(๐ ∧ ¬๐). Exercises 1. The inverse of a given implication is the contrapositive of the implication’s converse. Write the converse, contrapositive, and inverse for each conditional proposition in Exercise 1.1.2. 38 Chapter 1 PROPOSITIONAL LOGIC 2. Prove using truth tables. (a) โจ ๐ ∨ ¬๐ ↔ ๐ → ๐ (b) โจ ๐ ∨ ¬๐ ↔ ๐ ∨ ๐ ∨ ¬(๐ ∧ ๐) (c) โจ ๐ ∧ ๐ ↔ (๐ ↔ ๐) ∧ (๐ ∨ ๐) (d) โจ ๐ ∧ (๐ → ๐) ↔ ๐ ∧ (¬๐ → ¬๐ ) (e) โจ ๐ ∧ ๐ → ๐ ↔ ๐ ∧ ¬๐ → ¬๐ (f) โจ ๐ → ๐ ↔ ¬๐ → ¬๐ (g) โจ ๐ → ๐ ∧ ๐ ↔ (๐ → ๐) ∧ (๐ → ๐ ) (h) โจ ๐ → ๐ → (๐ → ๐ ) ↔ (๐ → ๐) ∧ ๐ → ๐ 3. A propositional form is in disjunctive normal form if it is a disjunction of conjunctions. For example, the propositional form ๐ ∧ (¬๐ ∨ ๐ ) is logically equivalent to (๐ ∧ ๐ ∧ ๐ ) ∨ (๐ ∧ ¬๐ ∧ ¬๐ ) ∨ (¬๐ ∧ ๐ ∧ ¬๐ ), which is in disjunctive normal form. Find propositional forms in disjunctive normal form that are logically equivalent to each of the following. (a) ๐ ∨ ๐ ∧ (๐ ∨ ¬๐ ) (b) (๐ ∨ ๐) ∧ (¬๐ ∨ ¬๐) (c) (๐ ∧ ¬๐ ∨ ๐ ) ∧ (๐ ∧ ๐ ∨ ๐ ∧ ¬๐ ) (d) ๐ ∨ (¬๐ ∨ [๐ ∧ ¬๐ ∨ ๐ ∧ ¬๐]) 4. Identify the rule from Replacement Rule 1.3.9. (a) (๐ → ๐) ∨ (๐ → ๐ ) ∨ ๐ ⇔ (๐ → ๐) ∨ ([๐ → ๐ ] ∨ ๐) (b) ¬¬๐ ↔ ๐ ∧ ๐ ⇔ ๐ ↔ ๐ ∧ ๐ (c) ๐ ∨ ๐ ∨ ๐ ⇔ ๐ ∨ ๐ ∨ ๐ (d) ¬๐ ∧ ¬(๐ ∨ ๐ ) ⇔ ¬(๐ ∨ [๐ ∨ ๐ ]) (e) ¬(๐ ∨ ๐) ∨ ๐ ⇔ ๐ ∨ ๐ → ๐ (f) ๐ ↔ ๐ → ๐ ⇔ ๐ ∧ (๐ → ๐ ) ∨ ¬๐ ∧ ¬(๐ → ๐ ) (g) ๐ ∨ ๐ ↔ ๐ ∧ ๐ ⇔ ๐ ∨ ๐ ↔ ๐ (h) ๐ ∧ ๐ ∧ ๐ ⇔ ๐ ∧ (๐ ∧ ๐) (i) (๐ ∨ ๐) ∧ (๐ ∨ ๐ ) ⇔ (๐ ∨ ๐) ∧ ๐ ∨ (๐ ∨ ๐) ∧ ๐ (j) (๐ ∧ [๐ → ๐]) → ๐ ⇔ ๐ → (๐ → ๐ → ๐) 5. For each given propositional form ๐, find another propositional form ๐ such that ๐ ⇔ ๐ using Replacement Rules 1.3.9. (a) ¬¬๐ (b) ๐ ∨ ๐ (c) ๐ → ๐ (d) ¬(๐ ∧ ๐) (e) (๐ ↔ ๐) ∧ (¬๐ ↔ ๐) (f) (๐ → ๐) ∨ (๐ → ๐) (g) (๐ → ๐) ∨ ๐ (h) ¬(๐ → ๐) (i) (๐ → ๐) ∧ (๐ ↔ ๐ ) Section 1.3 REPLACEMENT 39 (j) (๐ ∨ ¬๐) ↔ ๐ ∧ ๐ (k) ๐ ∨ (¬๐ ↔ ๐ ) ∧ ๐ 6. Arrange each collection of propositional forms into a proof for the given deduction and supply the appropriate reasons. (a) ๐ ∨ ๐ → ๐ โข (๐ → ๐ ) ∧ (๐ → ๐ ) โ ๐ ∨ ¬๐ ∧ ¬๐ โ (¬๐ ∨ ๐ ) ∧ (¬๐ ∨ ๐ ) โ ¬(๐ ∨ ๐) ∨ ๐ โ (๐ → ๐ ) ∧ (๐ → ๐ ) โ (๐ ∨ ¬๐ ) ∧ (๐ ∨ ¬๐) โ ¬๐ ∧ ¬๐ ∨ ๐ โ ๐ ∨๐→๐ (b) ¬(๐ ∧ ๐) → ๐ ∨ ๐, ¬๐ , ¬๐ โข ๐ โ ¬(๐ ∧ ๐) → ๐ ∨ ๐ โ ¬๐ โ ๐ ∨๐ โ ๐ โ ¬(๐ ∧ ๐) โ ¬๐ โ ๐ ∨๐ โ ¬๐ ∨ ¬๐ (c) ๐ → (๐ → ๐ ), ¬๐ → ๐, ¬๐ → ๐ , ๐ → ¬๐ โข ¬๐ → ๐ โ ๐ ∨๐ โ ¬๐ ∨ ¬๐ โ ๐ → ¬๐ โ ¬๐ โ ¬(๐ ∧ ๐) โ ๐ ∨๐ โ ¬¬๐ ∨ ๐ โ ¬๐ ∨ ¬๐ โ ๐ ∧๐→๐ โ ๐ → (๐ → ๐ ) โ ¬๐ → ๐ โ ¬๐ → ๐ โ ¬๐ → ๐ โ (¬๐ → ๐) ∧ (¬๐ → ๐ ) 7. Prove. (a) ¬๐ โข ๐ → ๐ (b) ๐ โข ¬๐ → ๐ (c) ¬๐ ∨ (¬๐ ∨ ¬๐ ) โข ๐ → ¬(๐ ∧ ๐ ) (d) ๐ → ๐ โข ๐ ∧ ๐ → ๐ 40 Chapter 1 PROPOSITIONAL LOGIC (e) (f) (g) (h) (i) (j) (k) (l) (m) (n) (o) (p) (q) (r) (s) (t) (u) (v) (w) (x) (y) (z) 1.4 ๐ →๐∧๐ โข๐ →๐ ๐ ∨ ๐ → ๐ โข ¬๐ → ¬๐ ๐ → (๐ → ๐ ) โข ๐ ∧ ¬๐ → ¬๐ ๐ → (๐ → ๐ ) โข ๐ → (๐ → ๐ ) ๐ ∧ ๐ ∨ ๐ ∧ ๐ โข ¬๐ → ๐ ∧ ๐ ๐ → ๐ โข ๐ → (๐ → ๐ ) ๐ → ¬(๐ → ๐ ) โข ๐ → ¬๐ ๐ ∨ (๐ ∨ ๐ ∨ ๐) โข (๐ ∨ ๐) ∨ (๐ ∨ ๐) ๐∨๐ →๐ ∧๐ โข๐→๐ ๐ ↔๐∧๐ โข๐ →๐ ๐ ↔๐∨๐ โข๐→๐ ๐ ∨๐∨๐ →๐ โข๐→๐ (๐ ∨ ๐) ∧ (๐ ∨ ๐) โข ๐ ∧ ๐ ∨ ๐ ∧ ๐ ∨ (๐ ∧ ๐ ∨ ๐ ∧ ๐) ๐ ∧ (๐ ∨ ๐ ) → ๐ ∧ ๐ โข ๐ → (๐ → ๐ ) ๐ ↔ ๐, ¬๐ โข ¬๐ ๐ → ๐ → ๐ , ¬๐ โข ¬๐ ๐ → (๐ → ๐ ), ๐ → ๐ ∧ ๐ โข ๐ → (๐ → ๐ ) ๐ → (๐ → ๐ ), ๐ → ๐ ∨ ๐ โข (๐ → ๐) ∨ (๐ → ๐ ) ๐ → ๐, ๐ → ๐ โข ๐ → ๐ ∧ ๐ ๐ ∨ ๐ → ๐ ∧ ๐, ¬๐ → (๐ → ¬๐ ), ¬๐ โข ¬๐ (๐ → ๐) ∧ (๐ → ๐), ๐ ∨ ๐ , (๐ → ¬๐) ∧ (๐ → ¬๐) โข ๐ ↔ ¬๐ ๐ ∧ (๐ ∧ ๐ ), ๐ ∧ ๐ → ๐ ∨ (๐ ∨ ๐), ¬๐ ∧ ¬๐ โข ๐ PROOF METHODS The methods of Sections 1.2 and 1.3 provide a good start for writing formal proofs. However, in practice we rarely limit ourselves to these rules. We often use inference rules that give a straightforward way to prove conditional propositions and allow us to prove a proposition when it is easier to disprove its negation. In both the cases, the new inference rules will be justified using the rules we already know. Deduction Theorem Because of Axioms 1.2.8, not all of the inference rules are needed to write the proofs found in Sections 1.2 and 1.3. This motivates the next definition. DEFINITION 1.4.1 Let ๐ and ๐ be propositional forms. The notation ๐ โข∗ ๐ means that there exists a formal proof of ๐ from ๐ using only Axioms 1.2.8, MP, and Inference Rule 1.3.10, and the notation โข∗ ๐ Section 1.4 PROOF METHODS 41 means that there exists a formal proof of ๐ from Axioms 1.2.8 using only MP and Inference Rule 1.3.10 For example, ๐ , ๐, ¬๐ ∨ (๐ → ๐ ) โข∗ ๐ because 1. 2. 3. 4. 5. 6. ๐ ๐ ¬๐ ∨ (๐ → ๐ ) ๐ → (๐ → ๐ ) ๐→๐ ๐ Given Given Given 3 Impl 1, 4 MP 2, 5 MP is a formal proof using MP and Impl as the only inference rules. We now observe that the propositional forms that can be proved using the full collection of rules from Sections 1.2 and 1.3 are exactly the propositional forms that can be proved when all rules are deleted from Inference Rules 1.2.10 except for MP. THEOREM 1.4.2 For all propositional forms ๐ and ๐, ๐ โข∗ ๐ if and only if ๐ โข ๐. PROOF Trivially, ๐ โข∗ ๐ implies ๐ โข ๐, so suppose that ๐ โข ๐. We show that the remaining parts of Inference Rules 1.2.10 are equivalent to using only Axioms 1.2.8, MP, and Replacement Rules 1.3.9. We show three examples and leave the proofs of the remaining inference rules to Exercise 6. The proof 1. 2. 3. 4. 5. 6. ๐→๐ ¬๐ ¬¬๐ → ¬¬๐ ¬¬๐ → ¬¬๐ → (¬๐ → ¬๐) ¬๐ → ¬๐ ¬๐ Given Given DN FL3 3, 4 MP 2, 5 MP shows that ๐ → ๐, ¬๐ โข∗ ¬๐. Thus, we do not need MT. The proof 1. 2. 3. 4. 5. 6. ๐ ๐ → (¬๐ → ๐) ¬๐ → ๐ ¬¬๐ ∨ ๐ ๐∨๐ ๐∨๐ shows that ๐ โข∗ ๐ ∨ ๐, Given FL1 1, 2 MP 3 Impl 4 DN 5 Com 42 Chapter 1 PROPOSITIONAL LOGIC so we do not need Add. This implies that we can use Add to demonstrate that ๐ → ๐, ๐ → ๐ โข∗ ๐ → ๐. The proof is as follows: 1. 2. 3. 4. 5. 6. 7. 8. 9. ๐→๐ ๐→๐ ¬๐ ∨ ๐ ¬๐ ∨ ๐ ∨ ¬๐ ¬๐ ∨ (¬๐ ∨ ๐) ๐ → (๐ → ๐) ๐ → (๐ → ๐) → (๐ → ๐ → [๐ → ๐]) ๐ → ๐ → (๐ → ๐) ๐→๐ Given Given 2 Impl 3 Add 4 Com 5 Impl FL2 6, 7 MP 1, 8 MP This implies that we do not need HS. At this point, there is an obvious question: If only MP is needed from Inference Rules 1.2.10, why were the other rules included? The answer is because the other inference rules are examples of common reasoning and excluding them would introduce unnecessary complications to the formal proofs. Try reproving some of the deductions of Section 1.2 with only MP and the axioms to confirm this. When a formal proof in Section 1.3 involved proving an implication, the replacement rule Impl would often appear in the proof. However, as we know from geometry, this is not the typical strategy used to prove an implication. What is usually done is that the antecedent is assumed and then the consequent is shown to follow. That this procedure justifies the given conditional is the next theorem. Its proof requires a lemma. LEMMA 1.4.3 Let ๐ and ๐ be propositional forms. If โข∗ ๐, then โข∗ ๐ → ๐. PROOF Let โข∗ ๐. By FL1, we have that โข∗ ๐ → (๐ → ๐), so โข∗ ๐ → ๐ follows by MP. THEOREM 1.4.4 [Deduction] For all propositional forms ๐ and ๐, ๐ โข ๐ if and only if โข ๐ → ๐. PROOF Let ๐ and ๐ be propositional forms. Assume that โข ๐ → ๐, so there exists propositional forms ๐0 , ๐1 , … , ๐๐−1 such that ๐0 , ๐1 , … , ๐๐−1 , ๐ → ๐ is a proof, where ๐0 is an axiom, ๐๐ is an axiom or ๐0 , ๐1 , … , ๐๐−1 ⇒ ๐๐ for ๐ > 0, and ๐ → ๐ follows from ๐0 , ๐1 , … , ๐๐−1 . Then, Section 1.4 PROOF METHODS ๐, ๐0 , ๐1 , … , ๐๐−1 , ๐ → ๐, ๐ is also a proof, where the last inference is due to MP. Therefore, ๐ โข ๐. By Theorem 1.4.2, to prove the converse, we only need to prove that if ๐ โข∗ ๐, then โข∗ ๐ → ๐. Assume ๐ โข∗ ๐. First note that if ๐ is an axiom, then โข∗ ๐, so โข∗ ๐ → ๐ by Lemma 1.4.3. Therefore, assume that ๐ is not an axiom. We begin by checking four cases. โ Suppose that the proof has only one propositional form. In this case, we have that ๐ = ๐, so the inference is of the form ๐ โข∗ ๐. By FL1, โข∗ ๐ → (๐ → ๐). Because ๐ → (๐ → ๐) ⇒ ¬๐ ∨ (¬๐ ∨ ๐) ⇒ (¬๐ ∨ ¬๐) ∨ ๐ ⇒ ¬๐ ∨ ๐ ⇒ ๐ → ๐, we conclude that โข∗ ๐ → ๐. โ Next, suppose the proof has two propositional forms and cannot be reduced to the first case. This implies that ๐ โข∗ ๐ by a single application of a replacement rule. Thus, ๐ → (¬๐ → ๐) โข∗ ๐ → (¬๐ → ๐) by a single application of the same replacement rule. Therefore, ๐ → (¬๐ → ๐) ⇒ ๐ → (¬๐ → ๐) ⇒ ๐ → (¬¬๐ ∨ ๐) ⇒ ๐ → (๐ ∨ ๐) ⇒ ๐ → ๐. This implies that โข∗ ๐ → ๐. โ We now consider the case when the proof of ๐ โข∗ ๐ has three propositional forms and ๐ follows by a rule of replacement. Let ๐, ๐, ๐ be the proof. This implies that ๐ โข∗ ๐, which implies โข∗ ๐ → ๐ (1.6) because either ๐ is an axiom and Lemma 1.4.3 applies or the previous two cases apply. If ๐ follows from ๐ by a rule of replacement, then โข ๐ → ๐ by 43 44 Chapter 1 PROPOSITIONAL LOGIC the previous case, so assume that ๐ follows from ๐ by a rule of replacement. Thus, โข∗ ๐ → ๐, which implies by Lemma 1.4.3 that โข∗ ๐ → (๐ → ๐). (1.7) โข∗ ๐ → (๐ → ๐) → (๐ → ๐ → [๐ → ๐]). (1.8) By FL2, Therefore, by (1.7) and (1.8) with MP, โข∗ ๐ → ๐ → (๐ → ๐), (1.9) and by (1.6) and (1.9) with MP, โข∗ ๐ → ๐. โ Again, let the proof have three propositional forms and write it as ๐, ๐ , ๐. Suppose that the inference that leads to ๐ is MP. This means that ๐ and ๐ → ๐ are in the proof. Because either ๐ = ๐ and ๐ โข∗ ๐ → ๐ or ๐ = ๐ → ๐ and ๐ โข∗ ๐, โข∗ ๐ → ๐ and โข∗ ๐ → (๐ → ๐). Thus, as in the previous case, using (1.8), we obtain โข∗ ๐ → ๐. These four cases exhaust the ways by which ๐ can be proved from ๐ with a proof with at most three propositional forms. Therefore, since these cases can be generalized to proofs of arbitrary length (Exercise 7), we conclude that ๐ โข ๐ implies โข ๐ → ๐. The deduction theorem (1.4.4) yields the next result. Its proof is left to Exercise 8. COROLLARY 1.4.5 For all propositional forms ๐0 , ๐1 , … , ๐๐−1 , ๐, ๐, ๐0 , ๐1 , … , ๐๐−1 , ๐ โข ๐ if and only if ๐0 , ๐1 , … , ๐๐−1 โข ๐ → ๐. Direct Proof Most propositions that mathematicians prove are implications. For example, if a function is differentiable at a point, it is continuous at that same point. As we know, this means that whenever the function ๐ is differentiable at ๐ฅ = ๐, it must also be the case that ๐ is continuous at ๐ฅ = ๐. Proofs of conditionals like this Section 1.4 PROOF METHODS 45 are typically very difficult if we are only allowed to use Inference Rules 1.2.10 and Replacement Rules 1.3.9. Fortunately, in practice another inference rule is used. To prove the differentiability result, what is usually done is that ๐ is assumed to be differential at ๐ฅ = ๐ and then a series of steps that lead to the conclusion that ๐ is continuous at ๐ฅ = ๐ are followed. We copy this strategy in our formal proofs using the next rule. Sometimes known as conditional proof, this inference rule follows by Corollary 1.4.5 and Theorem 1.2.14. INFERENCE RULE 1.4.6 [Direct Proof (DP)] For propositional forms ๐0 , ๐1 , … , ๐๐−1 , ๐, ๐, if ๐0 , ๐1 , … , ๐๐−1 , ๐ โข ๐, then ๐0 , ๐1 , … , ๐๐−1 ⇒ ๐ → ๐. PROOF Suppose ๐0 , ๐1 , … , ๐๐−1 , ๐ โข ๐. Then, by Corollary 1.4.5, ๐0 , ๐1 , … , ๐๐−1 โข ๐ → ๐. Therefore, by Simp and Com, ๐0 ∧ ๐1 ∧ · · · ∧ ๐๐−1 โข ๐ → ๐, so by Theorem 1.2.14, ๐0 ∧ ๐1 ∧ · · · ∧ ๐๐−1 ⇒ ๐ → ๐. Finally, we have by Conj and Theorem 1.2.14 that ๐0 , ๐1 , … , ๐๐−1 ⇒ ๐ → ๐. To see how this works, let us use direct proof to prove ๐ ∨ ๐ → (๐ ∧ ๐) โข ๐ → ๐ . To do this, we first prove ๐ ∨ ๐ → (๐ ∧ ๐), ๐ โข ๐ . Here is the proof: 1. 2. 3. 4. 5. ๐ ∨ ๐ → (๐ ∧ ๐) ๐ ๐ ∨๐ ๐ ∧๐ ๐ Given Given 2 Add 1, 3 MP 4 Simp 46 Chapter 1 PROPOSITIONAL LOGIC Therefore, by Inference Rule 1.4.6, ๐ ∨ ๐ → (๐ ∧ ๐) ⇒ ๐ → ๐ . A proof of the original deduction can now be written as 1. 2. ๐ ∨ ๐ → (๐ ∧ ๐) ๐ →๐ Given 1 DP However, instead of writing the first proof off to the side, it is typically incorporated into the proof as follows: 1. 2. 3. 4. 5. 6. ๐ ∨ ๐ → (๐ ∧ ๐) →๐ ๐ ∨๐ ๐ ∧๐ ๐ ๐ →๐ Given Assumption 2 Add 1, 3 MP 4 Simp 2–5 DP The proof that ๐ infers ๐ is a subproof of the main proof. To separate the propositional forms of the subproof from the rest of the proof, they are indented with a vertical line. The line begins with the assumption of ๐ in line 2 as an additional premise. Hence, its reason is Assumption. This assumption can only be used in the subproof. Consider it a local hypothesis. It is only used to prove ๐ → ๐ . If we were allowed to use it in other places of the proof, we would be proving a theorem that had different premises than those that were given. Similarly, all lines within the subproof cannot be referenced from the outside. We use the indentation to isolate the assumption and the propositional forms that follow from it. When we arrive at ๐ , we know that we have proved ๐ → ๐ . The next line is this propositional form. It is entered into the proof with the reason DP. The lines that are referenced are the lines of the subproof. EXAMPLE 1.4.7 Prove: ๐ → ¬๐, ¬๐ ∨ ๐ โข ๐ ∨ ๐ → (๐ → ๐) 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. ๐ → ¬๐ ¬๐ ∨ ๐ →๐ ∨ ๐ →๐ ¬๐ ๐∨๐ ๐ ¬¬๐ ๐ ๐ →๐ (๐ ∨ ๐) → (๐ → ๐) Given Given Assumption Assumption 1, 4 MP 3 Com 5, 6 DS 7 DN 2, 8 DS 4–9 DP 3–10 DP Section 1.4 PROOF METHODS 47 EXAMPLE 1.4.8 Prove: ๐ ∧ ๐ → ๐ → ๐, ¬๐ ∨ ๐ โข ๐ 1. 2. 3. 4. 5. 6. 7. 8. 9. Given Given Assumption 3 Com 4 Simp 5 DN 2, 6 DS 3–7 DP 1, 8 MP ๐ ∧๐→๐ →๐ ¬๐ ∨ ๐ →๐ ∧๐ ๐∧๐ ๐ ¬¬๐ ๐ ๐ ∧๐→๐ ๐ EXAMPLE 1.4.9 Prove: โข ๐ ∨ ¬๐ 1. 2. 3. 4. →๐ ๐ →๐ ¬๐ ∨ ๐ ๐ ∨ ¬๐ Assumption 1 DP 2 Impl 3 Com Note that lines 1–2 prove โข ๐ → ๐ . Indirect Proof When direct proof is either too difficult or not appropriate, there is another common approach to writing formal proofs. Sometimes going by the name of proof by contradiction or reductio ad absurdum, this inference rule can also be used to prove propositional forms that are not implications. INFERENCE RULE 1.4.10 [Indirect Proof (IP)] For all propositional forms ๐ and ๐, ¬๐ → (๐ ∧ ¬๐) ⇒ ๐. PROOF Notice that instead of repeating the argument from Example 1.4.9 in this proof, the example is simply cited as the reason on line 2. 1. 2. 3. 4. ¬๐ → (๐ ∧ ¬๐) ๐→๐ ¬(๐ ∧ ¬๐) → ¬¬๐ ¬(๐ ∧ ¬๐) → ๐ Given Example 1.4.9 1 Contra 3 DN 48 Chapter 1 PROPOSITIONAL LOGIC 5. 6. 7. 8. 4 DeM 5 DN 6 Impl 2, 7 MP ¬๐ ∨ ¬¬๐ → ๐ ¬๐ ∨ ๐ → ๐ ๐→๐→๐ ๐ The rule follows from Theorem 1.2.14. To use indirect proof, assume each premise and assume the negation of the conclusion. Then, proceed with the proof until a contradiction is reached. (In Inference Rule 1.4.10, the contradiction is represented by ๐ ∧ ¬๐.) At this point, deduce the original conclusion. EXAMPLE 1.4.11 Prove: ๐ ∨ ๐ → ๐ , ๐ ∨ ๐ → ¬๐ ∧ ๐ โข ¬๐ . 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. ๐ ∨๐→๐ ๐ ∨ ๐ → ¬๐ ∧ ๐ →¬¬๐ ๐ ๐ ∨๐ ๐ ๐ ∨๐ ¬๐ ∧ ๐ ¬๐ ๐ ∧ ¬๐ ¬๐ Given Given Assumption 3 DN 4 Add 1, 5 MP 6 Add 2, 7 MP 8 Simp 4, 9 Conj 3–10 IP Since IP involves proving an implication, the formal proof takes the same form as a proof involving DP. Indirect proof can also be nested within another indirect subproof. As with direct proof, we cannot appeal to lines within a subproof from outside of it. EXAMPLE 1.4.12 Prove: ๐ → ๐ ∧ ๐ , ๐ → ๐, ¬๐ → ๐ โข ๐. 1. 2. 3. 4. 5. 6. 7. 8. 9. ๐ →๐∧๐ ๐→๐ ¬๐ → ๐ →¬๐ ¬๐ →๐ ๐∧๐ ๐ ๐ ∧ ¬๐ Given Given Given Assumption 2, 4 MT Assumption 1, 6 MP 7 Simp 5, 8 Conj Section 1.4 PROOF METHODS 10. 11. 12. 13. 6-9 IP 3, 10 MP 4, 11 Conj 4–12 IP ¬๐ ๐ ๐ ∧ ¬๐ ๐ Notice that line 11 was not the end of the proof since it was within the first subproof. It followed under the added hypothesis of ¬๐. EXAMPLE 1.4.13 Prove: ๐ → ๐ โข ๐ ∧ ๐ → ๐ ∨ ๐. 1. 2. 3. 4. 5. 6. 7. 8. 9. Given Assumption Assumption 1, 3 MT 2 Simp 4, 5 Conj 3-6 IP 7 Add 2–8 DP ๐ →๐ →๐ ∧ ๐ →¬๐ ¬๐ ๐ ๐ ∧ ¬๐ ๐ ๐ ∨๐ ๐ ∧๐→๐ ∨๐ Exercises 1. Find all mistakes in the given proofs. (a) “๐ ∨ ๐ → ¬๐ , ๐ → ¬๐ → ๐ ∨ ๐ โข ๐” Attempted Proof 1. ๐ ∨ ๐ → ¬๐ 2. ๐ → ¬๐ → ๐ ∨ ๐ 3. →๐ 4. ¬¬๐ 5. ¬(๐ ∨ ๐) 6. ¬๐ ∧ ¬๐ 7. ¬๐ 8. ๐ → ¬๐ 9. ๐ ∨ ๐ 10. ๐ ∨ ๐ 11. ๐ Given Given Assumption Assumption 1, 4 MT 5 DeM 6 Simp 3–7 DP 2, 8 MP 9 Com 7, 10 DS (b) “¬๐ ∨ ๐ โข ๐ → ๐ → ๐ ” Attempted Proof 1. ¬๐ ∨ ๐ 2. →๐ Given Assumption 49 50 Chapter 1 PROPOSITIONAL LOGIC 3. 4. 5. 6. 7. ¬¬๐ ๐ ๐ →๐ ๐ ๐ →๐→๐ 2 DN 1, 3 DS 2–4 DP MP 2–6 DP (c) “¬๐ ∧ ๐, ¬๐ ∨ ๐ → ๐ โข ¬๐ ∨ ๐ → ๐” Attempted Proof 1. ¬๐ ∧ ๐ 2. ¬๐ ∨ ๐ → ๐ → ¬๐ 3. →¬๐ ∨ ๐ 4. 5. ๐ ¬๐ 6. 7. ๐ ∧ ¬๐ 8. ๐ 9. ¬¬๐ ๐ 10. 11. ¬๐ ∨ ๐ → ๐ Given Given Assumption Assumption 2, 3 MP 1 Simp 5, 6 Conj 4–7 IP 8 DN 4, 9 DS 3–10 DP 2. Prove using direct proof. (a) ๐ → ๐ ∧ ๐ โข ๐ → ๐ (b) ๐ ∨ ๐ → ๐ โข ๐ → ๐ (c) ๐ ∨ (๐ ∨ ๐ ) → ๐ โข ๐ → ๐ (d) ๐ → ๐, ๐ → ๐ โข ๐ ∨ ๐ → ๐ (e) ๐ → ๐, ๐ → ๐ โข ๐ → ๐ ∧ ๐ (f) ๐ → (๐ → ๐ ) โข ๐ ∧ ¬๐ → ¬๐ (g) ๐ → ¬๐ โข ๐ ∧ ๐ → (๐ → ¬๐) (h) ๐ → (๐ → ๐ ) โข ๐ → (๐ → ๐ ) (i) ๐ → (๐ → ๐ ), ๐ → (๐ → ๐) โข ๐ → (๐ → ๐) (j) ๐ → (๐ → ๐ ), ๐ → ๐ ∧ ๐ โข ๐ → (๐ → ๐ ) (k) ๐ ↔ ๐ ∧ ๐ โข ๐ → ๐ (l) ๐ ∧ ๐ ∨ ๐ → ๐ ∧ ๐ โข ๐ → (๐ → ๐ ) 3. Prove using indirect proof. (a) ๐ → ๐, ๐ → ๐ , ¬๐ โข ¬๐ (b) ๐ ∨ ๐ ∧ ๐ , ๐ → ๐, ๐ → ๐ โข ๐ (c) ๐ ∨ ๐ ∧ ¬๐ , ๐ → ๐, ๐ → ๐ โข ๐ (d) ๐ ⇔ ๐, ¬๐ โข ¬๐ (e) ๐ ∨ ๐ ∨ ๐ → ๐ ∧ ๐ โข ¬๐ ∨ ๐ ∧ ๐ (f) ๐ → ๐, ๐ → ๐ , ๐ → ๐ , ๐ ∨ ๐ , ¬๐ โข ๐ (g) ๐ → ¬๐, ๐ → ¬๐, ๐ → ๐, ๐ → ๐, ๐ ∨ ๐ โข ¬๐ ∨ ¬๐ Section 1.5 THE THREE PROPERTIES 51 (h) ๐ → ๐ ∧ ๐ , ๐ ∨ ๐ → ๐ ∧ ๐ , ๐ โข ๐ 4. Prove using both direct and indirect proof. (a) ๐ → ๐, ๐ ∨ (๐ → ๐), ¬๐ โข ๐ → ๐ (b) ๐ → ¬(๐ → ¬๐ ) โข ๐ → ๐ (c) ๐ → ๐ โข ๐ ∧ ๐ → ๐ (d) ๐ ⇔ ๐ ∨ ๐ โข ๐ → ๐ 5. Prove by using direct proof to prove the contrapositive. (a) ๐ → ๐, ๐ → ๐, ๐ → ๐ , ¬๐ โข ¬๐ → ¬(๐ ∨ ๐ ) (b) ๐ ∧ ๐ ∨ ๐ ∧ ๐ โข ¬๐ → ๐ ∧ ๐ (c) ๐ ∨ ๐ → ¬๐ , ๐ → ๐ โข ๐ → ¬๐ (d) ¬๐ → ¬๐, (¬๐ ∨ ๐) ∧ (๐ ∨ ๐) โข ¬๐ → ๐ 6. Prove to complete the proof of Theorem 1.4.2. (a) ๐ ∨ ๐, ¬๐ โข∗ ๐ (b) ๐, ๐ โข∗ ๐ ∧ ๐ (c) (๐ → ๐) ∧ (๐ → ๐ ), ๐ ∨ ๐ โข∗ ๐ ∨ ๐ (d) (๐ → ๐) ∧ (๐ → ๐ ), ¬๐ ∨ ¬๐ โข∗ ¬๐ ∨ ¬๐ (e) ๐ ∧ ๐ โข∗ ๐ 7. Given there is a proof of ๐ from ๐ with four propositional forms, prove โข ๐ → ๐. Generalize the proof for ๐ propositional forms. 8. Prove Corollary 1.4.5. 9. Can MP be replaced with another inference rule in Definition 1.4.1 and still have Theorem 1.4.2 hold true? If so, find the inference rules. 10. Can any of the replacement rules be removed from Definition 1.4.1 and still have Theorem 1.4.2 hold true? If so, how many can be removed and which ones? 1.5 THE THREE PROPERTIES We finish our introduction to propositional logic by showing that this logical system has three important properties. These are properties that are shared with Euclid’s geometry, but they are not common to all logical systems. Consistency Since we can need to consider infinitely many propositional forms, we now write our lists of propositional forms as ๐0 , ๐1 , ๐2 , … , allowing this sequence to be finite or infinite. Since proofs are finite, the notation ๐0 , ๐1 , ๐2 , … โข ๐ 52 Chapter 1 PROPOSITIONAL LOGIC means that there exists a subsequence ๐0 , ๐1 , … , ๐๐−1 of 0, 1, 2, … such that ๐๐0 , ๐๐1 , … , ๐๐๐−1 โข ๐. The notation ๐0 , ๐1 , ๐2 , … โฌ ๐ means that no such subsequence exists. DEFINITION 1.5.1 โ The propositional forms ๐0 , ๐1 , ๐2 , … are consistent if for every propositional form ๐, ๐0 , ๐1 , ๐2 , … โฌ ๐ ∧ ¬๐, and we write ๐ข๐๐(๐0 , ๐1 , ๐2 , … ). Otherwise, ๐0 , ๐1 , ๐2 , … is inconsistent. โ A logical system is consistent if no contradiction is a theorem. We have two goals. The first is to show that propositional logic is consistent. The second is to discover properties of sequences of consistent propositional forms that will aid in proving other properties of propositional logic. The next theorem is important to meet both of these goals. The equivalence of the first two parts is known as the compactness theorem THEOREM 1.5.2 If ๐0 , ๐1 , ๐2 , … are propositional forms, the following are equivalent in propositional logic. โ ๐ข๐๐(๐0 , ๐1 , ๐2 , … ). โ Every finite subsequence of ๐0 , ๐1 , ๐2 , … is consistent. โ There exists a propositional form ๐ such that ๐0 , ๐1 , ๐2 , … โฌ ๐. PROOF We have three implications to prove. โ Suppose there is a finite subsequence ๐๐0 , ๐๐1 , … , ๐๐๐−1 that proves ๐ ∧ ¬๐ for some propositional form ๐. This implies that there is a formal proof of ๐ ∧ ¬๐ from ๐0 , ๐1 , ๐2 , … , therefore, not ๐ข๐๐(๐0 , ๐1 , ๐2 , … ). โ Assume that ๐0 , ๐1 , ๐2 , … proves every propositional form. In particular, if we take a propositional form ๐, we have that ๐0 , ๐1 , ๐2 , … โข ๐ ∧ ¬๐. This means that there is a finite subsequence ๐๐0 , ๐๐1 , … , ๐๐๐−1 that proves ๐ ∧¬๐. โ Lastly, assume that there exists a propositional form ๐ such that ๐0 , ๐1 , ๐2 , … โข ๐ ∧ ¬๐. Section 1.5 THE THREE PROPERTIES 53 This means that there exist subscripts ๐0 , ๐1 , … , ๐๐−1 and propositional forms ๐0 , ๐1 , … , ๐๐−1 such that ๐๐0 , ๐๐1 , … , ๐๐๐−1 , ๐0 , ๐1 , … , ๐๐−1 , ๐ ∧ ¬๐ is a proof. Take any propositional form ๐. Then, ๐๐0 , ๐๐1 , … , ๐๐๐−1 , ๐0 , ๐1 , … , ๐๐−1 , ¬๐, ๐ ∧ ¬๐, ๐ is a proof of ๐ by IP. Therefore, ๐0 , ๐1 , ๐2 , … โข ๐. A sequence of propositional forms, such as ๐ → ๐, ๐ , ๐, although consistent, has the property that there are propositional forms that can be added to the sequence so that the resulting list remains consistent. When the sequence can no longer take new forms and remain consistent, we have arrived at a sequence that satisfies the next definition. DEFINITION 1.5.3 A sequence of propositional forms ๐0 , ๐1 , ๐2 , … is maximally consistent whenever ๐ข๐๐(๐0 , ๐1 , ๐2 , … ) and for all propositional forms ๐, ๐ข๐๐(๐, ๐0 , ๐1 , ๐2 , … ) implies that ๐ = ๐๐ for some ๐. It is a convenient result of propositional logic that every consistent sequence of propositional forms can be extended to a maximally consistent sequence. This is possible because all possible propositional forms can be put into a list. Following Definition 1.1.2, we first list the propositional variables: ๐ด, ๐ต, ๐ถ, ๐ด0 , ๐ด1 , ๐ด2 , ๐ต0 , ๐ต1 , ๐ต2 , โฎ โฎ โฎ ๐0 , ๐1 , ๐2 , … … … ๐, ๐, ๐, … Then, we list all propositional forms with only one propositional variable: ¬๐ด, ¬๐ด0 , ¬๐ต0 , โฎ ¬๐0 , ¬๐ต, ¬๐ด1 , ¬๐ต1 , โฎ ¬๐1 , ¬๐ถ, ¬๐ด2 , ¬๐ต2 , โฎ ¬๐2 , … … … ¬๐, ¬๐ , ¬๐, … Next, we list all propositional forms with exactly two propositional variables starting by writing ๐ด on the right: ๐ด ∨ ๐ด, ๐ต ∨ ๐ด, ๐ถ ∨ ๐ด, ๐ด0 ∨ ๐ด, ๐ด1 ∨ ๐ด, ๐ด2 ∨ ๐ด, ๐ต0 ∨ ๐ด, ๐ต1 ∨ ๐ด, ๐ต2 ∨ ๐ด, โฎ โฎ โฎ ๐0 ∨ ๐ด, ๐1 ∨ ๐ด, ๐2 ∨ ๐ด, … … … … ๐ ∨ ๐ด, ๐ ∨ ๐ด, ๐ ∨ ๐ด, 54 Chapter 1 PROPOSITIONAL LOGIC ๐ด ∧ ๐ด, ๐ต ∧ ๐ด, ๐ถ ∧ ๐ด, ๐ด0 ∧ ๐ด, ๐ด1 ∧ ๐ด, ๐ด2 ∧ ๐ด, ๐ต0 ∧ ๐ด, ๐ต1 ∧ ๐ด, ๐ต2 ∧ ๐ด, โฎ โฎ โฎ ๐0 ∧ ๐ด, ๐1 ∧ ๐ด, ๐2 ∧ ๐ด, … … … ๐ด → ๐ด, ๐ต → ๐ด, ๐ถ → ๐ด, ๐ด0 → ๐ด, ๐ด1 → ๐ด, ๐ด2 → ๐ด, ๐ต0 → ๐ด, ๐ต1 → ๐ด, ๐ต2 → ๐ด, โฎ โฎ โฎ ๐0 → ๐ด, ๐1 → ๐ด, ๐2 → ๐ด, … … … ๐ด ↔ ๐ด, ๐ต ↔ ๐ด, ๐ถ ↔ ๐ด, ๐ด0 ↔ ๐ด, ๐ด1 ↔ ๐ด, ๐ด2 ↔ ๐ด, ๐ต0 ↔ ๐ด, ๐ต1 ↔ ๐ด, ๐ต2 ↔ ๐ด, โฎ โฎ โฎ ๐0 ↔ ๐ด, ๐1 ↔ ๐ด, ๐2 ↔ ๐ด, … … … ๐ ∧ ๐ด, ๐ ∧ ๐ด, ๐ ∧ ๐ด, … ๐ → ๐ด, ๐ → ๐ด, ๐ → ๐ด, ๐ ↔ ๐ด, ๐ ↔ ๐ด, ๐ ↔ ๐ด, … … After following the same pattern by attaching ๐ด on the left, we continue by writing ¬๐ด on the right, and then on the left, and then we adjoin ๐ต and ¬๐ต, etc., and then use 3 propositional variables, and then four, etc. Following a careful path through this infinite list, we arrive at a sequence ๐0 , ๐1 , ๐2 , … of all propositional forms. We are now ready for the theorem. THEOREM 1.5.4 A consistent sequence of propositional forms is a subsequence of a maximally consistent sequence of propositional forms. PROOF Let ๐0 , ๐1 , ๐2 , … be consistent and ๐0 , ๐1 , ๐2 , … be a sequence of all propositional forms. Define the sequence ๐๐ as follows: โ Let ๐0 = ๐0 and { ๐0 ๐1 = ๐0 if ๐ข๐๐(๐0 , ๐0 , ๐1 , ๐2 , … ), otherwise, so the sequence at this stage is ๐0 , ๐0 or ๐0 , ๐0 . Both of these are consistent. โ Let ๐2 = ๐1 and { ๐1 ๐3 = ๐1 if ๐ข๐๐(๐1 , ๐0 , ๐1 , ๐2 , ๐0 , ๐1 , ๐2 , … ), otherwise. Section 1.5 THE THREE PROPERTIES 55 At this stage, the sequence is still consistent and of the form ๐0 , ๐1 , ๐1 , ๐1 or ๐0 , ๐1 , ๐1 , ๐1 . The first sequence is consistent by the definition of ๐3 , and the second sequence is consistent because ๐ข๐๐(๐0 , ๐1 , ๐1 ). โ Generalizing, let ๐2๐ = ๐๐ and ๐2๐+1 { ๐๐ = ๐๐ if ๐ข๐๐(๐๐ , ๐0 , ๐1 , … , ๐2๐ , ๐0 , ๐1 , ๐2 , … ), otherwise, resulting in a consistent sequence of the form ๐0 , ๐1 , ๐2 , ๐3 , … , ๐๐ , ๐๐ or ๐0 , ๐1 , ๐2 , ๐3 , … , ๐๐ , ๐๐ . Since ๐0 , ๐1 , ๐3 , … is a subsequence of ๐0 , ๐1 , ๐2 , … , it only remains to show that the new sequence is maximally consistent. โ Let ๐ be a propositional form such that ๐0 , ๐1 , ๐2 , … โข ๐ ∧ ¬๐ . This implies that there exists a sequence ๐๐ such that ๐0 < ๐1 < · · · < ๐๐ and ๐๐0 , ๐๐1 , … , ๐๐๐ โข ๐ ∧ ¬๐ , but by Theorem 1.5.2, this is impossible because ๐ข๐๐(๐0 , ๐1 , … , ๐๐๐ ). โ Suppose that ๐ is a propositional form so that ๐ข๐๐(๐ , ๐0 , ๐1 , ๐2 , … ). Write ๐ = ๐๐ for some ๐. Therefore, ๐ข๐๐(๐๐ , ๐0 , ๐1 , … , ๐2๐ , ๐0 , ๐1 , ๐2 , … ), which means that ๐ is a term of the sequence ๐๐ because ๐๐ was added at step 2๐ + 1. Soundness We have defined two separate tracks in propositional logic. One track is used to assign T or F to a propositional form, and thus it can be used to determine the truth value of a proposition. The other track focused on developing methods by which one propositional form can be proved from other propositional forms. These methods are used to write proofs in various fields of mathematics. The question arises whether these two tracks have been defined in such a way that they get along with each other. In other words, we want the propositional forms that we prove always to be assigned T, and we want the propositional forms that we always assign T to be provable. This means that we want semantic methods to yield syntactic results and syntactic methods to yield semantic results. 56 Chapter 1 PROPOSITIONAL LOGIC Sound The statement form is a tautology. The statement form is a theorem. Complete Figure 1.4 Sound and complete logics. DEFINITION 1.5.5 โ A logic is sound if every theorem is a tautology. โ A logic is complete if every tautology is a theorem. There is no guarantee that the construction of the two tracks for a logic will have these two properties (Figure 1.4), but it does in the case of propositional logic. To prove that propositional logic is sound, we need three lemmas. The proof of the first is left to Exercise 1. LEMMA 1.5.6 The propositional forms of Axioms 1.2.8 are tautologies. LEMMA 1.5.7 Let ๐, ๐, and ๐ be propositional forms. โ If ๐ ⇒ ๐, then ๐ → ๐ is a tautology. โ If ๐, ๐ ⇒ ๐, then ๐ ∧ ๐ → ๐ is a tautology. PROOF This is simply a matter of checking Inference Rules 1.2.10 and 1.3.10. For example, to check the theorem for De Morgan’s law, we must show that ¬(๐ ∧ ๐) → ¬๐ ∨ ¬๐ and ¬๐ ∨ ¬๐ → ¬(๐ ∧ ๐) are tautologies. To do this, examine the truth table ๐ T T F F ๐ T F T F ๐∧๐ T F F F ¬(๐ ∧ ๐) F T T T ¬๐ F F T T ¬๐ F T F T ¬๐ ∨ ¬๐ F T T T ¬(๐ ∧ ๐) → ¬๐ ∨ ¬๐ T T T T ¬๐ ∨ ¬๐ → ¬(๐ ∧ ๐) T T T T Section 1.5 THE THREE PROPERTIES We also have to show that ¬(๐ ∨ ๐) → ¬๐ ∧ ¬๐ and ¬๐ ∧ ¬๐ → ¬(๐ ∨ ๐) are tautologies. As another example, to check that the disjunctive syllogism leads to an implication that is a tautology, examining the truth table: ๐ T T F F ๐ ๐∨๐ T T F T T T F F ¬๐ (๐ ∨ ๐) ∧ ¬๐ F F F F T T T F (๐ ∨ ๐) ∧ ¬๐ → ๐ T T T T The other rules are checked similarly (Exercise 2). LEMMA 1.5.8 If ๐ → ๐ and ๐ are tautologies, then ๐ is a tautology. PROOF This is done by examining the truth table of ๐ → ๐ (page 12) where we see that ๐(๐) is constant and equal to T because ๐(๐) and ๐(๐ → ๐) are constant and both equal to T. We now prove the first important property of propositional logic. THEOREM 1.5.9 [Soundness] Every theorem of propositional logic is a tautology. PROOF Let ๐ be a theorem. Let ๐0 , ๐1 , … , ๐๐−1 be propositional forms such that ๐0 , ๐1 , … , ๐๐−1 is a proof for ๐ such that ๐1 is an axiom, ๐๐ with ๐ > 0 is an axiom or follows by a rule of inference, and ๐๐−1 = ๐ (Definition 1.2.13). We now examine the propositional forms of the proof. โ By Lemma 1.5.6, ๐0 is a tautology. โ The propositional form ๐1 is a tautology for one of two reasons. If ๐1 is an axiom, it is a tautology (Lemma 1.5.6). If it follows from ๐0 because ๐0 ⇒ ๐1 , then ๐0 → ๐1 is a tautology (Lemma 1.5.7), so ๐1 is a tautology by Lemma 1.5.8. 57 58 Chapter 1 PROPOSITIONAL LOGIC โ If ๐2 follows from ๐0 or ๐1 , reason as in the previous case. Suppose ๐0 , ๐1 ⇒ ๐2 . Then, by Lemma 1.5.7, (๐0 ∧ ๐1 ) → ๐2 is a tautology. Because ๐0 , ๐1 ⇒ ๐0 ∧ ๐1 by Conj, ๐0 ∧ ๐1 is a tautology. Thus, ๐2 is a tautology by Lemma 1.5.8. Since every ๐๐ with ๐ > 0 is an axiom, follows from some ๐๐ with ๐ < ๐, or follows from some ๐๐ , ๐๐ with ๐, ๐ < ๐, continuing in this manner, we find after finitely many steps that ๐ is a tautology. COROLLARY 1.5.10 For all propositional forms ๐0 , ๐1 , … , ๐๐−1 , ๐, if ๐0 , ๐1 , … , ๐๐−1 โข ๐, then ๐0 , ๐1 , … , ๐๐−1 โจ ๐. The Law of Noncontradiction being a theorem of propositional logic (page 37) suggests that we have the following result, which is the second important property of propositional logic. COROLLARY 1.5.11 Propositional logic is consistent. PROOF Let ๐ be a propositional form. Suppose that ๐ ∧ ¬๐ is a theorem. This implies that it is a tautology by the soundness theorem (1.5.9), but ๐ ∧ ¬๐ is a contradiction. Completeness We use the consistency of propositional logic to prove that propositional logic is complete. For this we need a few lemmas. LEMMA 1.5.12 If not ๐ข๐๐(¬๐, ๐0 , ๐1 , ๐2 , … ), then ๐0 , ๐1 , ๐2 , … โข ๐. PROOF If not ๐ข๐๐(๐0 , ๐1 , ๐2 , … ), then ๐0 , ๐1 , ๐2 , … โข ๐ by Theorem 1.5.2, so suppose ๐ข๐๐(๐0 , ๐1 , ๐2 , … ). Assume that there exists a propositional form ๐ such that ¬๐, ๐0 , ๐1 , ๐2 , … โข ๐ ∧ ¬๐. This implies that there exists a formal proof ๐๐0 , ๐๐1 , … , ๐๐๐−1 , ¬๐, ๐ 0 , ๐ 1 , … , ๐ ๐−1 , ๐ ∧ ¬๐, where ¬๐ is in the proof because ๐ข๐๐(๐0 , ๐1 , ๐2 , … ). Then, ๐๐0 , ๐๐1 , … , ๐๐๐−1 , ๐ (1.10) 59 Section 1.5 THE THREE PROPERTIES is a proof, where ๐ follows by IP with (1.10) as the subproof. Therefore, ๐0 , ๐1 , ๐2 , … โข ๐. LEMMA 1.5.13 If ๐0 , ๐1 , ๐2 , … are maximally consistent, then for every propositional form ๐, either ๐ = ๐๐ or ¬๐ = ๐๐ for some ๐. PROOF Since ๐0 , ๐1 , ๐2 , … are consistent, both ๐ and ¬๐ cannot be terms of the sequence. Suppose that it is ¬๐ that is not in the list. By the definition of maximal consistency, we conclude that ¬๐, ๐0 , ๐1 , ๐2 , … are not consistent. Therefore, by Lemma 1.5.12, we conclude that ๐0 , ๐1 , ๐2 , … โข ๐, and since the sequence is maximally consistent, ๐ = ๐๐ for some ๐. To prove the next lemma, we use a technique called induction on propositional forms. It states that a property will hold true for all propositional forms if two conditions are met: โ The property holds for all propositional variables. โ If the property holds for ๐ and ๐, the property holds for ¬๐, ๐ ∧ ๐, ๐ ∨ ๐, ๐ → ๐, and ๐ ↔ ๐. In proving the second condition, we first assume that the property holds for the propositional forms ๐ and ๐. (1.11) This assumption (1.11) is known as an induction hypothesis. Because ๐ ∨ ๐ ⇔ ¬๐ → ๐, ๐ ∧ ๐ ⇔ ¬(๐ → ¬๐), and ๐ ↔ ๐ ⇔ (๐ → ¬๐) → ¬(¬๐ → ๐), we need only to show that the induction hypothesis implies that the property holds for ¬๐ and ๐ → ๐. LEMMA 1.5.14 If ๐ข๐๐(๐0 , ๐1 , ๐2 , … ), there exists a valuation ๐ such that ๐(๐) = T if and only if ๐ = ๐๐ for some ๐ = 0, 1, 2, … . 60 Chapter 1 PROPOSITIONAL LOGIC PROOF Since ๐ข๐๐(๐0 , ๐1 , ๐2 , … ), we know that ๐0 , ๐1 , ๐2 , … (1.12) can be extended to a maximally consistent sequence of propositional forms by Theorem 1.5.4. If we find the desired valuation for the extended sequence, that valuation will also work for the original sequence, so assume that (1.12) is maximally consistent. Let ๐0 , ๐1 , ๐2 , … represent all of the possible propositional variables (page 53). Define { T if ๐๐ is a propositional variable of ๐๐ for some ๐, ๐(๐๐ ) = F otherwise. We prove that this is the desired valuation by induction on propositional forms. We first claim that for all ๐, ๐(๐๐ ) = T if and only if ๐๐ = ๐๐ for some ๐. To prove this, first note that if ๐๐ is a term of (1.12), then ๐(๐๐ ) = T by definition of ๐. To show the converse, suppose that ๐(๐๐ ) = T but ๐๐ is not a term of (1.12). By Lemma 1.5.13, there exists ๐ such that ¬๐๐ = ๐๐ . This implies that ๐(¬๐๐ ) = T, and then ๐(๐๐ ) = F (Definition 1.1.9), a contradiction. Now assume that ๐(๐) = T if and only if ๐ = ๐๐ for some ๐ and ๐(๐) = T if and only if ๐ = ๐๐ for some ๐. We first prove that ๐(¬๐) = T if and only if ¬๐ = ๐๐ for some ๐. โ Suppose that ๐(¬๐) = T. Then, ๐(๐) = F, and by induction, ๐ is not in (1.12). Since ๐0 , ๐1 , ๐2 , … is maximally consistent, ¬๐ is in the list (Lemma 1.5.13). โ Conversely, let ¬๐ = ๐๐ for some ๐. By consistency, ๐ is not in the sequence. Therefore, by the induction hypothesis, ๐(๐) = F, which implies that ๐(¬๐) = T. We next prove that ๐(๐ → ๐) = T if and only if ๐ → ๐ = ๐๐ for some ๐. โ Assume that ๐(๐ → ๐) = T. We have two cases to check. First, let ๐(๐) = ๐(๐) = T, so both ๐ and ๐ are in (1.12) by the induction hypothesis. Suppose ๐ → ๐ is not a term of the sequence. This implies that its Section 1.5 THE THREE PROPERTIES 61 negation ¬(๐ → ๐) is a term of the sequence by Lemma 1.5.13. Therefore, ๐ ∧ ¬๐ is in the sequence by maximal consistency, so ¬๐ is also in the sequence, a contradiction. Second, let ๐(๐) = F. By induction, ๐ is not a term of the sequence, which implies that ¬๐ is a term. Hence, ¬๐ ∨ ๐ is in the sequence, which implies that ๐ → ๐ is also in the sequence. โ To prove the converse, suppose that ๐ → ๐ = ๐๐ for some ๐. Assume ๐(๐ → ๐) = F. This means that ๐(๐) = T and ๐(๐) = F. Therefore, by induction, ๐ is a term of the sequence but ๐ is not. This implies that ¬๐ is in the sequence, so ¬๐ is in the sequence by MT. This contradicts the consistency of (1.12). THEOREM 1.5.15 [Completeness] Every tautology of propositional logic is a theorem. PROOF Let ๐ be a propositional form such that FL1, FL2, FL3 โฌ ๐. By Lemma 1.5.12, ๐ข๐๐(FL1, FL2, FL3, ¬๐). Therefore, by Lemma 1.5.14, there exists a valuation such that ๐(๐) = F, which implies that ฬธโจ ๐. COROLLARY 1.5.16 If ๐0 , ๐1 , ๐2 , … โจ ๐, then ๐0 , ๐1 , ๐2 โข ๐ for all propositional forms ๐0 , ๐1 , ๐2 , … . We conclude by Theorems 1.5.9 and 1.5.15 and their corollaries that the notions of semantically valid and syntactically valid coincide for deductions in propositional logic. Exercises 1. Prove Lemma 1.5.6. 2. Provide the remaining parts of the proof of Lemma 1.5.7. 3. Let ๐, ๐0 , ๐1 , ๐2 , … be propositional forms. Prove the following. (a) If ๐0 , ๐1 , ๐2 , · · · โฌ ๐, then ๐ข๐๐(¬๐, ๐0 , ๐1 , ๐2 , … ). (b) If ๐ข๐๐(๐0 , ๐1 , ๐2 , … ) and ๐0 , ๐1 , ๐2 , · · · โข ๐, then ๐ข๐๐(๐, ๐0 , ๐1 , ๐2 , … ). (c) If ๐ข๐๐(๐0 , ๐1 , ๐2 , … ), then ๐ข๐๐(๐, ๐0 , ๐1 , ๐2 , … ) or ๐ข๐๐(¬๐, ๐0 , ๐1 , ๐2 , … ). 4. Let ๐0 , ๐1 , ๐2 , … be a maximally consistent sequence of propositional forms. Let ๐ and ๐ be propositional forms. Prove the following. (a) If ๐0 , ๐1 , ๐2 , · · · โข ๐, then ๐ = ๐๐ for some ๐. (b) ๐ ∧ ๐ = ๐๐ for some ๐ if and only if ๐ = ๐๐ and ๐ = ๐๐ for some ๐ and ๐. (c) If (๐ → ๐) = ๐๐ and ๐ = ๐๐ for some ๐ and ๐, then ๐ = ๐๐ for some ๐. 5. Use truth tables to prove the following. Explain why this is a legitimate technique. (a) ¬๐ ∨ (¬๐ ∨ ¬๐ ) โข ๐ → ¬(๐ ∧ ๐ ) (b) ๐ ∨ ๐ → ๐ โข ¬๐ → ¬๐ 62 Chapter 1 PROPOSITIONAL LOGIC (c) ๐ → ¬(๐ → ๐ ) โข ๐ → ¬๐ (d) ๐ ↔ ๐ ∨ ๐ โข ๐ → ๐ (e) ๐ ∨ ๐ → ๐ ∧ ๐, ¬๐ → (๐ → ¬๐ ), ¬๐ โข ¬๐ 6. Write a formal proof to show the following. Explain why this is a legitimate technique. (a) ¬๐ ∨ ๐, ¬๐ โจ ¬๐ (b) ¬(๐ ∧ ๐), ๐ โจ ¬๐ (c) ๐ → ๐, ๐ โจ ๐ ∨ ๐ (d) ๐ → ๐, ๐ → ๐ , ๐ โจ ๐ (e) ๐ ∨ ๐ ∧ ๐ , ¬๐ โจ ๐ 7. Write a formal proof to show the following. (a) ¬(๐ ∧ ๐) ฬธโจ ¬๐ (b) (๐ → ๐) ∨ (๐ → ๐), ๐ ∨ ๐ ฬธโจ ๐ ∨ ๐ (c) ๐ ∨ ๐ , ๐ ∨ ๐, ๐ ↔ ๐ ฬธโจ ๐ ∧ ๐ 8. Write a formal proof to demonstrate the following. (a) ๐ ∨ ¬๐ is a tautology. (b) ๐ ∧ ¬๐ is a contradiction. 9. Modify propositional logic by removing all replacement rules (1.3.9). Is the resulting logic consistent? sound? complete? 10. Modify propositional logic by removing all inference rules (1.2.10) except for Inference Rule 1.3.10. Is the resulting logic consistent? sound? complete? CHAPTER 2 FIRST-ORDER LOGIC 2.1 LANGUAGES We developed propositional logic to model basic proof and truth. We did so by using propositional forms to represent sentences that were either true or false. We saw that propositional logic is consistent, sound, and complete. However, the sentences of mathematics involve ideas that cannot be fully represented in propositional logic. These sentences are able to characterize objects, such as numbers or geometric figures, by describing properties of the objects, such as being even or being a rectangle, and the relationships between them, such as equality or congruence. Since propositional logic is not well suited to handle these ideas, we extend propositional logic to a richer system. Predicates Consider the sentence it is a real number. This sentence has no truth value because the meaning of the word it is undetermined. As noted on page 3, the word it is like a gap in the sentence. It is as if the sentence was written as 63 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 64 Chapter 2 FIRST-ORDER LOGIC is a real number. However, that gap can be filled. Let us make some substitutions for it: 5 is a real number. ๐โ7 is a real number. Fido is a real number. My niece’s toy is a real number. With each replacement, the word that is undetermined is given meaning, and then the sentence has a truth value. In the examples, the first two propositions are true, and the last two are false. Notice that changes, whereas is a real number remains fixed. This is because these two parts of the sentence have different purposes. The first is a reference to an object, and the second is a property of the object. What we did in the examples was to choose a property and then test whether various objects have that property: It ↑ 5 is a real number. is a real number. True ๐โ7 is a real number. True Fido is a real number. False My niece’s toy is a real number. False Depending on the result, these two parts are put together to form a sentence that either accurately or inaccurately affirms that a particular property is an attribute of an object. For example, the first sentence states that being a real number is a characteristic of 5, which is correct, and the last sentence states that being a real number is a characteristic of my niece’s toy, which is not correct. We have terminology for all of this. โ The subject of a sentence is the expression that refers to an object. โ The predicate of a sentence is the expression that ascribes a property to the object identified by the subject. Thus, 5, ๐โ7, Fido, and my niece’s toy are subjects that are substituted for it, and is a real number is a predicate. Substituting for the subject but not substituting for the predicate is a restriction that we make. We limit the extension of propositional logic this way because it is to be part of mathematical logic and this is what we do in mathematics. For example, take the sentence, ๐ฅ + 2 = 7. On its own it has no truth value, but when we substitute ๐ฅ = 5 or ๐ฅ = 10, the sentence becomes a proposition. In this sense, ๐ฅ + 2 = 7 is like it is a real number. Both the sentences have a gap that is assigned a meaning giving the sentence a truth value. 65 Section 2.1 LANGUAGES However, the mathematical sentence is different from the English sentence in that it is unclear as to what part is the subject. Is it ๐ฅ or ๐ฅ + 2? For our purposes, the answer is irrelevant. This is because in mathematical logic, the subject is replaced with a variable or, sometimes, with multiple variables. This change leads to a modification of what a predicate is. DEFINITION 2.1.1 A predicate is an expression that ascribes a property to the objects identified by the variables of the sentence. Therefore, the sentence ๐ฅ + 2 = 7 is a predicate. It describes a characteristic of ๐ฅ. When expressions are substituted for ๐ฅ, the resulting sentence will be either a proposition that affirms that the value added to 2 equals 5 or another predicate. If ๐ฅ = 5 is substituted, the result is the true proposition 5 + 2 = 7. That is, ๐ฅ + 2 = 7 is satisfied by 5. If ๐ฅ = 10 and the substitution is made, the resulting proposition 10+2 = 7 is false. In mathematics, it is also common to substitute with undetermined values. For example, if the substitution is ๐ฅ = ๐ฆ, the result is the predicate ๐ฆ + 2 = 7, and if the substitution is ๐ฅ = sin ๐, the result is the predicate sin ๐ + 2 = 7. Substituting ๐ฅ = ๐ฆ + 2๐ง2 yields ๐ฆ + 2๐ง2 + 2 = 7, a predicate with multiple variables. If we substitute ๐ฅ = ๐ฆ2 − 7๐ฆ, the result is ๐ฆ2 − 7๐ฆ + 2 = 7, which is a predicate with multiple occurrences of the same variable. Assume that ๐ฅ represents a real number and consider the sentence there exists x such that ๐ฅ + 2 = 7. (2.1) Although there is a variable with 2 occurrences, the sentence is a proposition, so in propositional logic, it is an atom and would be represented by ๐ . This does not tell us much about the sentence, so we instead break the sentence into two parts: Quantifier ↓ there exists ๐ฅ such that Predicate ↓ ๐ฅ+2=7 A quantifier indicates how many objects satisfy the sentence. In this case, the quantifier is the existential quantifier, making the sentence an existential proposition. Such a proposition claims that there is at least one object that satisfies the predicate. In particular, although other sentences mean the same as (2.1) assuming that ๐ฅ represents a real number, such as there exists a real number ๐ฅ such that ๐ฅ + 2 = 7, ๐ฅ + 2 = 7 for some real number x, and some real number ๐ฅ satisfies ๐ฅ + 2 = 7, they all claim along with (2.1) that there is a real number ๐ฅ such that ๐ฅ + 2 = 7. This is true because 5 satisfies ๐ฅ + 2 = 7 (Figure 2.1). 66 Chapter 2 FIRST-ORDER LOGIC The real numbers 1 3 Satisfies x + 2 = 7 2 5 4 There is at least one object that satisfies ๐ฅ + 2 = 7. Therefore, there exists a real number ๐ฅ such that ๐ฅ + 2 = 7 is true. Figure 2.1 Again, assume that ๐ฅ is a real number. The sentence for all ๐ฅ, ๐ฅ + 5 = 5 + ๐ฅ (2.2) claims that ๐ฅ + 5 = 5 + ๐ฅ is true for every real number ๐ฅ. To see this, break (2.2) like (2.1): Quantifier ↓ for all ๐ฅ , Predicate ↓ ๐ฅ+5=5+๐ฅ This quantifier is called the universal quantifier. It states that the predicate is satisfied by each and every object. Including propositions that have the same meaning as (2.2), such as for all real numbers ๐ฅ, ๐ฅ + 5 = 5 + ๐ฅ, and ๐ฅ + 5 = 5 + ๐ฅ for every real number x, (2.2) is true because each substitution of a real number for ๐ฅ satisfies the predicate (Figure 2.2). These are examples of universal propositions,. A proposition can have multiple quantifiers. Take the equation ๐ฆ = 2๐ฅ2 + 1. Before we learned the various techniques that make graphing this equation simple, we graphed it by writing a table with one column holding the ๐ฅ values and another holding the ๐ฆ values. We then chose numbers to substitute for ๐ฅ and calculated the corresponding ๐ฆ. Although we did not explicitly write it this way, we learned that for every real number ๐ฅ, there exists a real number ๐ฆ such that ๐ฆ = 2๐ฅ2 + 1. This is a universal proposition, and its predicate is there exists a real number ๐ฆ such that ๐ฆ = 2๐ฅ2 + 1. (2.3) Section 2.1 LANGUAGES The real numbers 1 3 Satisfies x + 5 = 5 + x 2 4 67 5 Every object satisfies ๐ฅ + 5 = 5 + ๐ฅ. Therefore, for all real numbers ๐ฅ, ๐ฅ + 5 = 5 + ๐ฅ is true. Figure 2.2 We conclude that (2.3) is true because whenever ๐ฅ is replaced with any real number, there is a real number ๐ฆ that satisfies ๐ฆ = 2๐ฅ2 + 1. The symbolic logic that we define in this chapter is intended to model propositions that have a predicate and possibly a quantifier. As with propositional logic, an alphabet and a grammar will be chosen that enable us to write down the appropriate symbols to represent these sentences. Unlike propositional logic where we worked on both its syntax and semantics at the same time, this logic starts with a study only of its syntax. Alphabets No matter what mathematical subject we study, whether it is algebra, number theory, or something else, we can write our conclusions as propositions. These sentences usually involve mathematical symbols particular to the subject being studied. For example, both (2.1) and (2.2) are algebraic propositions. We know this because we recognize the symbols from algebra class. For this reason, the symbolic logic that we define will consist of two types of symbols. DEFINITION 2.1.2 Logic symbols consist of the following: โ Variable symbols: Sometimes simply referred to as variables, these symbols serve as placeholders. On their own, they are without meaning but can be replaced with symbols that do have meaning. A common example are the variable symbols in an algebraic equation. Variable symbols can be lowercase English letters ๐ฅ, ๐ฆ, and ๐ง, or lowercase English letters with subscripts, ๐ฅ๐ , ๐ฆ๐ , and ๐ง๐ . Depending on the context, variable symbols are sometimes expanded to include uppercase English letters, with or without subscripts, as well as Greek and Hebrew letters. Denote the collection of variable symbols by ๐ต๐ ๐ฑ. โ Connectives: ¬, ∧, ∨, →, ↔ 68 Chapter 2 FIRST-ORDER LOGIC โ Quantifier symbols: ∀, ∃ โ Equality symbol: = โ Grouping symbols: (, ), [, ], {, } Theory symbols consist of the following: โ Constant symbols: These are used to represent important objects in the subject that do not change. Common constant symbols are 0 and ๐. โ N-ary function symbols: The term n-ary refers to the number of arguments. For example, these symbols can represent unary functions that take one argument such as cosine or binary functions such as multiplication that take two arguments. โ N-ary relation symbols: These symbols are used to represent relations. For example, < represents the binary relation of less than and ๐ can represent the unary relation is an even number. It is not necessary to choose any theory symbols. However, if there are any, theory symbols must be chosen so that they are not connectives, quantifier symbols, the equality symbol, or grouping symbols, and the selection of theory symbols has precedence over the selection of variable symbols. This means that these two collections must have no common symbols. Moreover, although this is not the case in general, we assume that the logic symbols that are not variable symbols will be the same for all applications. On the other hand, theory symbols (if there are any) vary depending on the current subject of study. A collection of all logic symbols and any theory symbols is called a first-order alphabet and is denoted by ๐ . The term theory refers to a collection of propositions all surrounding a particular subject. Since different theories have different notation (think about how algebraic notation differs from geometric notation), alphabets change depending on the subject matter. Let us then consider the alphabets for a number of theories that will be introduced later in the text. The foundational theory is the first example. EXAMPLE 2.1.3 Set theory is the study of collections of objects. The ∈ is the only relation symbol, and it is binary. It has no other theory symbols. The theory symbols of set theory are denoted by ๐ฒ๐ณ and are summarized in the following table. ๐ฒ๐ณ Constant symbols Function symbols Relation symbol ∈ Section 2.1 LANGUAGES 69 EXAMPLE 2.1.4 Number theory is the study of the natural numbers. The symbols + and ⋅ represent regular addition and multiplication, respectively. As such they are binary function symbols. These and its other theory symbols are indicated in the following table and are denoted by ๐ญ๐ณ. ๐ญ๐ณ Constant symbols 0 1 Function symbols + ⋅ Relation symbols Another approach to number theory is called Peano arithmetic (Peano 1889). It is the study of the natural numbers using Peano’s axioms. It has a constant symbol 0 and a unary function symbol ๐. Denote these symbols by ๐ ๐ฑ. ๐ ๐ฑ Constant symbol 0 Function symbol ๐ Relation symbols The Peano arithmetic symbols are sometimes extended to include symbols for the operations of addition and multiplication and the less-than relation. Denote this extended collection by ๐ ๐ฑ′ . ๐ ๐ฑ′ Constant symbol 0 Function symbols ๐ + ⋅ Relation symbol < EXAMPLE 2.1.5 Group theory is the study of groups. A group is a set with an operation that satisfies certain properties. Typically, the operation is denoted by the binary function symbol โฆ. There is also a constant represented by ๐. Ring theory is the study of rings. A ring is a set with two operations that satisfy certain properties. The operations are usually denoted by the binary function symbols ⊕ and ⊗. The constant symbol is โ. These collections of theory symbols are denoted by ๐ฆ๐ฑ and ๐ฑ๐จ, respectively, and are summarized in the following tables. ๐ฆ๐ฑ Constant symbol ๐ Function symbol โฆ ๐ฑ๐จ Constant symbols โ Function symbols ⊕ ⊗ Relation symbols Relation symbol Field theory is the study of fields. A field is a type of ring with extra properties, so the theory symbols ๐ฑ๐จ can be used to write about fields. However, if an order is defined on a field, a binary relation symbol is needed, and the result is the theory of ordered fields. Denote these symbols by ๐ฎ๐ฅ. ๐ฎ๐ฅ Constant symbol โ Function symbols ⊕ ⊗ Relation symbol < Notice that the collections of theory symbols in the previous examples had at most two constant symbols. This is typical since subjects of study usually have only a few 70 Chapter 2 FIRST-ORDER LOGIC objects that require special recognition. However, there will be times when some extra constants are needed to reference objects that may or may not be named by the constant theory symbols. To handle these situations, we expand the given theory symbols by adding new constant symbols. DEFINITION 2.1.6 Let ๐ be a first-order alphabet with theory symbols ๐ฒ. When constant symbols not in ๐ฒ are combined with the symbols of ๐ฒ, the resulting collection of theory symbols is denoted by ๐ฒ. The number of new constant symbols varies depending on need. For example, suppose that we are working in a situation where we need four constant symbols in addition to the ones in ๐ฎ๐ฅ. Denote these new symbols by ๐1 , ๐2 , ๐3 , and ๐4 . Then, ๐ฎ๐ฅ consists of these four constant symbols and the symbols from ๐ฎ๐ฅ. Terms For a string to represent a proposition or a predicate from a particular theory, each nonlogic symbol of the string must be a theory symbol of that subject. For example, ∀๐ฅ(∈ ๐ฅ๐ด) and ∨∃ ∈ ๐๐4 {} ∧ ๐ฅ¬๐ฆ are strings for set theory. However, some strings have a reasonable chance of being given meaning, others do not. As with propositional logic, we need a grammar that will determine which strings are legal for the logic. Because a predicate might have variables, the types of representations that we want to make are more complicated than those of propositional logic. Hence, the grammar also will be more complicated. We begin with the next inductive definition (compare page 59). DEFINITION 2.1.7 Let ๐ be a first-order alphabet with theory symbols ๐ฒ. An ๐ฆ-term is a string over ๐ such that โ a variable symbol from ๐ is an ๐ฒ-term, โ a constant symbol from ๐ฒ is an ๐ฒ-term, โ ๐ ๐ก0 ๐ก1 · · · ๐ก๐−1 is an ๐ฒ-term if ๐ก0 , ๐ก1 , … , ๐ก๐−1 are ๐ฒ-terms and ๐ is a function symbol from ๐ฒ. Denote the collection of strings over ๐ that are ๐ฒ-terms by ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ ). The string ๐ ๐ก0 ๐ก1 · · · ๐ก๐−1 is often written as ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) because it is common to write functions using this notation. We must remember, however, that this notation can Section 2.1 LANGUAGES 71 always be replaced with the notation of Definition 2.1.7. Furthermore, when writing about a general ๐ฒ-term where it is not important to mention ๐ฒ, we often simply write using the word term without the ๐ฒ. We will follow this convention when writing about similar definitions that require the theory symbols ๐ฒ. EXAMPLE 2.1.8 Here are examples of terms for each of the indicated theories. Assume that ๐ฅ, ๐ฆ and ๐ง1 are variable symbols. โ 0 [Peano arithmetic] โ +๐ฅ๐ฆ [number theory] โ โฆ0๐ง1 [group theory]. The string +๐ฅ๐ฆ is typically written as ๐ฅ+๐ฆ, and the string โฆ0๐1 is typically written as 0 โฆ ๐1 . If ๐ญ๐ณ is expanded to ๐ญ๐ณ by adding the constants ๐ and ๐, then +๐๐ is an ๐ญ๐ณ-term. As suggested by Example 2.1.8, the purpose of a term is to represent an object of study. A variable symbol represents an undetermined object. A constant symbol represents an object that does not change, such as a number. A function symbol is used to represent a particular object given another object. For example, the ๐ญ๐ณ-term +๐ฅ2 represents the number that is obtained when ๐ฅ is added to 2. Formulas As propositional forms are used to symbolize propositions, the next definition is the grammar used to represent propositional forms and predicates. The definition is given inductively and resembles the parentheses-less prefix notation invented by ลukasiewicz (1951) in the early 1920s. DEFINITION 2.1.9 Let ๐ฒ be the theory symbols from a first-order alphabet ๐ . An ๐ฆ-formula is a string over ๐ such that โ ๐ก0 = ๐ก1 is an ๐ฒ-formula if ๐ก0 and ๐ก1 are ๐ฒ-terms. โ ๐ ๐ก0 ๐ก1 · · · ๐ก๐−1 is an ๐ฒ-formula if ๐ is an ๐-ary relation symbol from ๐ฒ and ๐ก0 , ๐ก1 , … , ๐ก๐−1 are ๐ฒ-terms. โ ¬๐ is an ๐ฒ-formula if ๐ is an ๐ฒ-formula. โ ๐ ∧ ๐, ๐ ∨ ๐, ๐ → ๐, and ๐ ↔ ๐ are ๐ฒ-formulas if ๐ and ๐ are ๐ฒ-formulas. โ ∀๐ฅ๐ and ∃๐ฅ๐ are ๐ฒ-formulas if ๐ is an ๐ฒ-formula and ๐ฅ is a variable symbol from ๐ . 72 Chapter 2 FIRST-ORDER LOGIC The string ๐ ๐ก0 ๐ก1 · · · ๐ก๐−1 is often represented as ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ). A formula of the form ∀๐ฅ๐ is called a universal formula, and a formula of the form ∃๐ฅ๐ is an existential formula. Parentheses can be used around an ๐ฒ-formula for readability, especially when quantifier symbols are involved. For example, ∀๐ฅ∃๐ฆ๐ is the same formula as ∀๐ฅ(∃๐ฆ๐) and ∀๐ฅ∃๐ฆ(๐). EXAMPLE 2.1.10 Let ๐ฅ and ๐ฆ be variable symbols. Let ๐ be a constant symbol; ๐ , ๐, and โ be unary function symbols; and ๐ be a binary relation symbol from ๐ฒ. The following are ๐ฒ-formulas. โ ๐ฅ=๐ โ ๐ ๐๐ ๐ฆ โ ¬(๐ฆ = ๐๐) โ ๐ ๐ฅ๐ ๐ฅ → ๐ ๐ ๐ฅ๐ฅ โ ∀๐ฅ¬(๐ ๐ฅ = ๐ ๐) โ ∃๐ฅ∀๐ฆ(๐ ๐ ๐ฅ๐๐ฆ ∧ ๐ ๐ ๐ฅโ๐ฆ). In practice, ๐ ๐๐ ๐ฆ is usually written as ๐ (๐, ๐ (๐ฆ)), ¬(๐ฆ = ๐๐) as ๐ฆ ≠ ๐(๐), ๐ ๐ฅ๐ ๐ฅ as ๐ (๐ฅ, ๐ (๐ฅ)), ๐ ๐ ๐ฅ๐ฅ as ๐ (๐ (๐ฅ), ๐ฅ), and ๐ ๐ ๐ฅ๐๐ฆ as ๐ (๐ (๐ฅ), ๐(๐ฆ)). EXAMPLE 2.1.11 These are some ๐ฒ๐ณ-formulas with their standard translations. โ ¬ ∈ ๐ฅ{ } ๐ฅ is not an element of { }. โ ∀๐ฅ(∈ ๐ฅ๐ด → ∈๐ฅ๐ต) For all ๐ฅ, if ๐ฅ ∈ ๐ด, then ๐ฅ ∈ ๐ต. โ ¬∃๐ฅ∀๐ฆ(∈๐ฆ๐ฅ) It is not the case that there exists ๐ฅ such that for all ๐ฆ, ๐ฆ ∈ ๐ฅ. โ ๐ฅ = ๐ฆ ∨ ∈๐ฆ๐ฅ ∨ ∈๐ฅ๐ฆ ๐ฅ = ๐ฆ or ๐ฆ ∈ ๐ฅ or ๐ฅ ∈ ๐ฆ. EXAMPLE 2.1.12 Here are some ๐ญ๐ณ-formulas with their corresponding predicates. โ ∀๐ฅ∀๐ฆ∀๐ง [++๐ฅ๐ฆ๐ง = +๐ฅ+๐ฆ๐ง] (๐ฅ + ๐ฆ) + ๐ง = ๐ฅ + (๐ฆ + ๐ง) for all ๐ฅ, ๐ฆ, and ๐ง. โ ¬(๐ฅ = 0) → ∀๐ฆ∃๐ง(๐ง = ⋅๐ฅ๐ฆ) If ๐ฅ ≠ 0, then for all ๐ฆ, there exists ๐ง such that ๐ง = ๐ฅ๐ฆ. Section 2.1 LANGUAGES 73 โ ∀๐ฅ(⋅1๐ฅ = ๐ฅ) For all ๐ฅ, 1๐ฅ = ๐ฅ. We now name the system just developed. DEFINITION 2.1.13 An alphabet combined with a grammar is called a language. The language given by Definitions 2.1.2 and 2.1.9 is known as a first-order language. A formula of a first-order language is a first-order formula, and all of the formulas of the first-order language with theory symbols ๐ฒ is denoted by ๐ซ(๐ฒ). The first-order language developed to represent predicates, either with quantifiers or without them, is summarized in Figure 2.3. An alphabet that has both logic and theory symbols is chosen. Using a grammar, terms are defined, and then by extending the grammar, formulas are defined. A natural question to ask regarding this system is what makes it first-order? Look at the last part of Definition 2.1.9. It gives the rule that allows the addition of a quantifier symbol in a formula. Only ∀๐ฅ or ∃๐ฅ are permitted, where ๐ฅ is a variable symbol representing an object of study. Thus, only propositions that begin as for all x . . . or there exists x such that . . . can be represented as a firstorder formula. To quantify over function and relation symbols, we need to define a second-order formula. Augment the alphabet ๐ with function and relation symbols, which creates what is know as a second-order alphabet, and then add ∀๐ ๐ and ∃๐ ๐ are ๐ฒ-formulas if ๐ is an ๐ฒ-formula and ๐ is a function symbol from ๐ and ∀๐ ๐ and ∃๐ ๐ are ๐ฒ-formulas if ๐ is an ๐ฒ-formula and ๐ is a relation symbol from ๐ to Definition 2.1.9. For example, if the first-order formula ∀๐ฅ(∈ ๐ฅ๐ด →∈ ๐ฅ๐ต) is intended to be true for any ๐ด and ๐ต, it can be written as the second-order formula ∀๐ด∀๐ต∀๐ฅ(๐ด๐ฅ → ๐ต๐ฅ), where ๐ด๐ฅ represents the relation ๐ฅ ∈ ๐ด and ๐ต๐ฅ represents the relation ๐ฅ ∈ ๐ต. Logic symbols Grammar Theory symbols Terms Grammar Represent objects Alphabet Figure 2.3 A first-order language. Formulas Properties of objects 74 Chapter 2 FIRST-ORDER LOGIC Exercises 1. Determine whether the given strings are ๐ฆ๐ฑ-terms. If a string is not a ๐ฆ๐ฑ-term, find all issues that prevent it from being one. (a) 2๐ (b) ๐ฅ๐ฆโฆ (c) โฆ (d) <๐ฅ๐ (e) โฆโฆ๐ฅ๐ฆ๐ 2. Write the ๐ฆ๐ฑ-terms from Exercise 1 in their usual form (Example 2.1.10). 3. Extend ๐ฒ๐ณ to ๐ฒ๐ณ′ by adding the constant symbol ∅ and the unary function symbol P. Determine whether the given strings are ๐ฒ๐ณ′ -terms. If a string is not a ๐ฒ๐ณ′ -term, find all issues that prevent it from being one. (a) ∈๐ฅ∅ (b) ∅ (c) ๐ฅ (d) P∈๐ฅ๐ฆ (e) P๐ฅ 4. Determine whether the given strings are ๐ฎ๐ฅ-formulas. If a given string is not a ๐ฎ๐ฅ-formula, find all issues that prevent it from being one. (a) < ⊕๐ฅ๐ฆ⊗๐ฅ๐ฆ (b) ⊕๐ฅ = ⊗๐ฆ (c) ∀๐ฅ∃๐ฆ(+๐ฅ๐ฆ = 0) (d) ∀๐ฅ∀๐ฆ(⊗๐ฅ๐ฆ = ⊕๐ฆ๐ฅ) → <๐ฅ๐ฆ (e) ¬(<⊗๐ฅ๐ฆ ∧ ⊕๐ฅ๐ฆ) ↔ ∀๐ข∀๐ฃ(<⊗๐ฅ๐ฆ⊕๐ข๐ฃ) 5. Write the ๐ฑ๐จ-formulas from Exercise 4 in their usual form. 6. Extend ๐ฒ๐ณ to ๐ฒ๐ณ′′ by adding the constant symbol ∅ and the binary relation symbol ๐ . Determine whether the given strings are ๐ฒ๐ณ′′ -formulas. If a given string is not a ๐ฒ๐ณ′′ -formula, find all issues that prevent it from being one. (a) ๐ ๐ฅ∅ ∨ ∈๐ฅ๐ฆ (b) ๐ ๐ฅ∅ → ∈+๐ฅ๐ฆ๐ง (c) ∀๐ฅ∃∅(¬∈๐ฅ∅) (d) ∀๐ฅ ∨ ∀๐ฆ ∨ ∀๐ง(∈∅๐ฅ๐ฆ๐ง) ∨ ∈∅∅ (e) [∃๐ฅ(๐ ๐ฅ∅) → ∃๐ข∃๐ฃ∃๐ค(¬∈๐ข๐ฃ ↔ ¬∈๐ข๐ค)] ∨ ∅ = ๐ฅ1 7. Write the ๐ฒ๐ณ′′ -formulas from Exercise 6 in their usual form. 8. Translate the given sentence to an ๐ฒ-formula for the given theory symbol. (a) For all ๐ฅ, ๐(๐ฅ) ≠ 0. (๐ฒ = ๐ ๐ฑ) (b) For every number ๐ฅ, there is a number ๐ฆ such that ๐ฅ โฆ ๐ฆ = ๐. (๐ฒ = ๐ฆ๐ฑ) (c) If ๐ฅ < ๐ฆ and ๐ฆ < ๐ง, then ๐ฅ < ๐ง. (๐ฒ = ๐ฎ๐ฅ) (d) It is false that ๐ฅ ∈ ๐ฆ, ๐ฆ ∈ ๐ง, and ๐ง ∈ ๐ฅ for all ๐ฅ, ๐ฆ, and ๐ง. (๐ฒ = ๐ฒ๐ณ) Section 2.2 SUBSTITUTION 75 (e) For every ๐ข and ๐ฃ, there exists ๐ค such that if ๐ข = ๐ฃ, then ๐ข + ๐ค = ๐ฃ + ๐ค. (๐ฒ = ๐ฑ๐จ and + should be translated as ⊕ in the ๐ฑ๐จ-formula.) 9. Extend ๐ญ๐ณ to ๐ญ๐ณ′ by adding the numerals 2, 3, … , 9. Answer the following questions. (a) Is +34 = 7 a ๐ญ๐ณ′ -term? Explain. (b) Is ⋅4+39 a ๐ญ๐ณ′ -term? Explain. (c) Is ∃๐ฅ(+⋅4๐ฅ8 = +3๐ฅ) a ๐ญ๐ณ′ -formula? If it is, find ๐ฅ. (d) If possible, give an example of a ๐ญ๐ณ-formula that is not a ๐ญ๐ณ′ -formula. (e) If possible, give an example of a ๐ญ๐ณ′ -formula that is not a ๐ญ๐ณ-formula. 2.2 SUBSTITUTION As noted in the beginning of Section 2.1, there are times when a substitution will be made for a predicate’s variable. For example, in algebra, if ๐ (๐ฅ) = 3๐ฅ2 + ๐ฅ + 1, then ๐ (๐ฆ) = 3๐ฆ2 + ๐ฆ + 1, ๐ (2) = 3(2)2 + 2 + 1, and ๐ (sin ๐ฅ) = 3 sin2 ๐ฅ + sin ๐ฅ + 1. That is, we can substitute with variables, constants, and the results from functions. Therefore, to represent this in formulas, we can replace a variable with a term. Terms We begin by defining what it means to substitute for a variable in a term. We use ⇔ because one string can be replaced with the other string. DEFINITION 2.2.1 [Substitution in Terms] Let ๐ฒ be theory symbols from a first-order alphabet ๐ . Let ๐ฅ be a variable symbol from ๐ and ๐ก be an ๐ฒ-term. The notation ๐ก ๐ฅ means that ๐ฅ is replaced with ๐ก at every appropriate occurrence of ๐ฅ. For terms, this means the following. โ If ๐ฆ is a variable symbol from ๐ , ๐ก ๐ฆ ⇔ ๐ฅ { ๐ก if ๐ฅ = ๐ฆ, ๐ฆ if ๐ฅ ≠ ๐ฆ. โ If ๐ is a constant symbol from ๐ฒ, ๐ ๐ก ⇔ ๐. ๐ฅ โ If ๐ is an ๐-ary function symbol from ๐ฒ and ๐ 0 , ๐ 1 , … , ๐ ๐−1 are ๐ฒ-terms, ( ) ๐ก ๐ก ๐ก ๐ก (๐ ๐ 0 ๐ 1 · · · ๐ ๐−1 ) ⇔ ๐ ๐ 0 ๐ 1 · · · ๐ ๐−1 . ๐ฅ ๐ฅ ๐ฅ ๐ฅ 76 Chapter 2 FIRST-ORDER LOGIC Observe that when a substitution of a term is made into a term, the result is another term. EXAMPLE 2.2.2 Let ๐ฅ, ๐ฆ, and ๐ง be distinct variable symbols; ๐ and ๐ be constant symbols; ๐ be a binary function symbol; and ๐ and โ be unary function symbols. This means that ๐ ๐๐๐๐ is typically written as ๐ (๐, ๐(๐(๐))). ๐ฆ โ ๐ฅ ⇔๐ฆ ๐ฅ ๐ โ ๐ฅ ⇔๐ ๐ฅ ๐ ๐ฅ๐ง ⇔๐ฆ ๐ฅ ๐ฅ ๐ ⇔๐ ๐ฅ ๐ ๐ ⇔๐ ๐ฅ ๐๐ฅ ๐ ⇔๐ ๐ฅ ๐ (๐ ๐ฅ๐ฆ) ⇔ ๐ ๐ฅ๐ ๐ฆ ( ) โ๐ ๐๐ฆ (๐ ๐ฆ๐ง) ⇔ ๐ โ๐๐ง ๐ฆ ๐ฅ ([( ) ] ) ๐ ๐ ๐ ๐ ๐ง ⇔๐ ๐ฅ ๐ฆ ๐ง ๐ฅ ) ] ) ([( ๐ ๐๐ฅ ๐ฅ ๐๐ ⇔ ๐ ๐๐๐๐. ๐ ๐ฅ๐ฆ ๐ฅ ๐ฆ ๐ง ๐ฅ โ ๐ฆ โ โ โ โ โ โ โ Free Variables Substitution for a variable in a formula is a bit more involved. This is because of the influence of any possible quantifiers. For example, take the formula ๐ฅ = ๐ฆ ∨ ๐ฅ = ๐ ๐ฆ. (2.4) By Definition 2.2.1, we know that we can substitute constants ๐ for ๐ฅ and ๐ for ๐ฆ in the terms ๐ฅ, ๐ฆ, and ๐ ๐ฆ. We expect that we should also be able to make this substitution into the formula resulting in ๐ = ๐ ∨ ๐ = ๐ ๐. However, in the formula ∀๐ฅ∃๐ฆ(๐ฅ = ๐ฆ ∨ ๐ฅ = ๐ ๐ฆ), (2.5) Section 2.2 SUBSTITUTION 77 the situation is different because of the quantifiers. Consider the corresponding occurrences in (2.4) and its quantified (2.5): x=y∨x=fy ∀x∃y(x = y ∨ x = f y) Even though each occurrence of ๐ฅ and ๐ฆ in (2.4) can receive a substitution, each corresponding occurrence in (2.5) cannot because of the quantifiers. DEFINITION 2.2.3 Let ๐ฒ be theory symbols. Let ๐ก0 , ๐ก1 , … , ๐ก๐−1 be ๐ฒ-terms, ๐ be an ๐-ary relation symbol from ๐ฒ, and ๐ and ๐ be ๐ฒ-formulas. An occurrence of a variable in a formula is free or not free only according to the following rules: โ A variable occurrence in ๐ก0 = ๐ก1 and ๐ ๐ก0 ๐ก1 · · · ๐ก๐−1 is free. โ A variable occurrence in ¬๐ is free if and only if the corresponding occurrence in ๐ is free. โ A variable occurrence in ๐ ∧ ๐, ๐ ∨ ๐, ๐ → ๐, and ๐ ↔ ๐ is free if and only if the corresponding occurrence in ๐ or ๐ is free. โ Any occurrence of ๐ฅ in ∀๐ฅ๐ and ∃๐ฅ๐ is not free. โ Any occurrence of ๐ฆ ≠ ๐ฅ is free in ∀๐ฅ๐ and ∃๐ฅ๐ if and only if the corresponding occurrence of ๐ฆ is free in ๐. If an occurrence of a variable is not free, it is bound. All free occurrences of ๐ฅ in ๐ are within the scope of the universal quantifier in ∀๐ฅ๐ and the existential quantifier in ∃๐ฅ๐. EXAMPLE 2.2.4 Let ๐ be a unary function symbol and ๐ be a 3-ary relation symbol. In the formula ∀๐ฅ∃๐ฆ(๐ฅ = ๐ฆ ∨ ๐ ๐ฅ = ๐ฆ → ๐ ๐ฅ๐ฆ๐ง), all occurrences of ๐ฅ and ๐ฆ are bound, but the occurrence of ๐ง is free. In the formula ∃๐ฆ(๐ฅ = ๐ฆ ∨ ๐ ๐ฅ = ๐ฆ → ๐ ๐ฅ๐ฆ๐ง), all occurrences of ๐ฅ and ๐ง are free, but the occurrences of ๐ฆ are bound. In ๐ฅ = ๐ฆ ∨ ๐ ๐ฅ = ๐ฆ → ๐ ๐ฅ๐ฆ๐ง, all occurrences are free because all occurrences are free in ๐ฅ = ๐ฆ, ๐ ๐ฅ = ๐ฆ, and ๐ ๐ฅ๐ฆ๐ง. We need to know whether a formula has a free occurrence of a variable, so we make the next definition. 78 Chapter 2 FIRST-ORDER LOGIC DEFINITION 2.2.5 A free variable of the ๐ฒ-formula ๐ is a variable that has a free occurrence in ๐. EXAMPLE 2.2.6 Using the ๐ and ๐ from Example 2.2.4, both ๐ฅ and ๐ฆ are free variables of the formula ๐ ๐ฅ๐ฆ๐ → ∃๐ฅ(๐ ๐ฆ = ๐), even though ๐ฅ has both free and bound occurrences. Formulas We can now define what it means to make a substitution into a formula. DEFINITION 2.2.7 [Substitution in Formulas] Let ๐ฒ be theory symbols from a first-order alphabet ๐ . Let ๐ฅ and ๐ฆ be variable symbols of ๐ and ๐ be an ๐-ary relation symbol from ๐ฒ. Suppose that ๐ and ๐ are ๐ฒ-formulas and ๐ก, ๐ก0 , ๐ก1 , … , ๐ก๐−1 are ๐ฒ-terms. ๐ก ๐ก ๐ก ⇔ ๐ก0 = ๐ก1 ๐ฅ ๐ฅ ๐ฅ ๐ก ๐ก ๐ก ๐ก (๐ ๐ก0 ๐ก1 · · · ๐ก๐−1 ) ⇔ ๐ ๐ก0 ๐ก1 · · · ๐ก๐−1 ๐ฅ ๐ฅ ๐ฅ ๐ฅ ( ) ๐ก ๐ก (¬๐) ⇔ ¬ ๐ ๐ฅ ๐ฅ ๐ก ๐ก ๐ก (๐ ∧ ๐) ⇔ ๐ ∧ ๐ ๐ฅ ๐ฅ ๐ฅ ๐ก ๐ก ๐ก (๐ ∨ ๐) ⇔ ๐ ∨ ๐ ๐ฅ ๐ฅ ๐ฅ ๐ก ๐ก ๐ก (๐ → ๐) ⇔ ๐ → ๐ ๐ฅ ๐ฅ ๐ฅ ๐ก ๐ก ๐ก (๐ ↔ ๐) ⇔ ๐ ↔ ๐ ๐ฅ ๐ฅ ๐ฅ { ( ๐ก) ∀๐ฆ ๐ if ๐ฅ ≠ ๐ฆ and ๐ฆ is not a symbol of ๐ก ๐ก ๐ฅ (∀๐ฆ๐) ⇔ ๐ฅ ∀๐ฆ๐ otherwise ( ) { ๐ก ∃๐ฆ ๐ if ๐ฅ ≠ ๐ฆ and ๐ฆ is not a symbol of ๐ก ๐ก ๐ฅ (∃๐ฆ๐) ⇔ ๐ฅ ∃๐ฆ๐ otherwise. โ (๐ก0 = ๐ก1 ) โ โ โ โ โ โ โ โ The condition on the term ๐ก in the last two parts of Definition 2.2.7 is important. Consider the ๐ฑ๐จ-formula ๐ := ∃๐ฆ(๐ฅ ⊕ ๐ฆ = โ). 79 Section 2.2 SUBSTITUTION The usual interpretation of ๐ is that given ๐ฅ, there exists an number ๐ฆ such that ๐ฅ+๐ฆ = 0. Let ๐ be a unary function symbol and ๐ง be a variable symbol. Since ๐ฆ is not a symbol of ๐ง, ๐ง ๐ ⇔ ∃๐ฆ(๐ง ⊕ ๐ฆ = โ), ๐ฅ which has the same standard interpretation as ๐. Since ๐ฆ is not a symbol of ๐ ๐ง, we can substitute to find that ๐๐ง ⇔ ∃๐ฆ(๐ ๐ง ⊕ ๐ฆ = โ). ๐ ๐ฅ This is a reasonable substitution because it states that for the number given by ๐ ๐ง, there is a number ๐ฆ that when added to ๐ ๐ง gives 0. This is very similar to the standard interpretation of ๐. Both of these substitutions work because the number of free occurrences is unchanged by the substitution. However, if we allow the term to include ๐ฆ among its ๐ฆ symbols, the substitution ๐ would yield ๐ฅ ∃๐ฆ(๐ฆ ⊕ ๐ฆ = โ). (2.6) The typical interpretation of (2.6) is that there exists a number ๐ฆ such that ๐ฆ + ๐ฆ = 0. This is a reasonable proposition, but not in the spirit of the original formula ๐. The change of interpretation is due to the change in the number of free occurrences. The formula ๐ has one free occurrence, while (2.6) has none. Therefore, when making a substitution, the number of free occurrences should not change, and for this reason, ∃๐ฆ(๐ฅ ⊕ ๐ฆ = โ) ๐ฆ ⇔ ∃๐ฆ(๐ฅ ⊕ ๐ฆ = โ). ๐ฅ EXAMPLE 2.2.8 Let ๐ be the ๐ญ๐ณ formula ∀๐ฅ0 ∀๐ฅ1 (๐ฅ0 = ๐ฅ1 → ๐ฅ0 + ๐ฅ2 = ๐ฅ1 + ๐ฅ2 ). Notice that ๐ฅ2 is a free variable of ๐. However, all occurrences of ๐ฅ0 and ๐ฅ1 are bound. Therefore, ๐ 0 ⇔ ∀๐ฅ0 ∀๐ฅ1 (๐ฅ0 = ๐ฅ1 → ๐ฅ0 + ๐ฅ2 = ๐ฅ1 + ๐ฅ2 ), ๐ฅ0 and then letting ๐ฆ be a variable symbol, ( ) ๐ฆ 0 ๐ ⇔ ∀๐ฅ0 ∀๐ฅ1 (๐ฅ0 = ๐ฅ1 → ๐ฅ0 + ๐ฅ2 = ๐ฅ1 + ๐ฅ2 ), ๐ฅ0 ๐ฅ1 and finally, [( ) ] ๐ฆ 1 0 ๐ ⇔ ∀๐ฅ0 ∀๐ฅ1 (๐ฅ0 = ๐ฅ1 → ๐ฅ0 + 1 = ๐ฅ1 + 1). ๐ฅ0 ๐ฅ1 ๐ฅ2 This last formula has no free variables. A standard interpretation of this formula is 80 Chapter 2 FIRST-ORDER LOGIC for every integer ๐ฅ0 and ๐ฅ1 , if ๐ฅ0 = ๐ฅ1 , then ๐ฅ0 + 1 = ๐ฅ1 + 1. Furthermore, ๐ ๐ฅ ๐ฅ1 ⇔ [∀๐ฅ0 ∀๐ฅ1 (๐ฅ0 = ๐ฅ1 → ๐ฅ0 + ๐ฅ2 = ๐ฅ1 + ๐ฅ2 )] 1 ๐ฅ2 ๐ฅ2 ๐ฅ1 ⇔ ∀๐ฅ0 [∀๐ฅ1 (๐ฅ0 = ๐ฅ1 → ๐ฅ0 + ๐ฅ2 = ๐ฅ1 + ๐ฅ2 )] ๐ฅ2 ⇔ ∀๐ฅ0 ∀๐ฅ1 (๐ฅ0 = ๐ฅ1 → ๐ฅ0 + ๐ฅ2 = ๐ฅ1 + ๐ฅ2 ). Let ๐ represent the ๐ญ๐ณ-formula +๐ฆ2 = 7. Observe that ๐ has ๐ฆ as a free variable. To emphasize this, instead of writing ๐ := +๐ฆ2 = 7, we often denote the formula by ๐(๐ฆ) and write ๐(๐ฆ) := +๐ฆ2 = 7. Although ๐ฅ is not a variable in the equation, we can also write ๐(๐ฅ, ๐ฆ) := +๐ฆ2 = 7. For example, if we wanted to interpret the formula as an equation with one variable, we would use ๐(๐ฆ). If we wanted to view it as the horizontal line ๐ฆ = 5, we would use ๐(๐ฅ, ๐ฆ). DEFINITION 2.2.9 Let ๐ be a first-order alphabet with theory symbols ๐ฒ. Let ๐ be an ๐ฒ-formula. If ๐ has no free variables or the free variables of ๐ are among the distinct variables ๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 from ๐ , define ๐(๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 ) ⇔ ๐. The notation of Definition 2.2.9 can also be used to represent substitutions. Consider the formula ๐ := ๐ฅ + ๐ฆ = 0. Observe that ( ) ๐ฅ ๐ฆ ๐ ⇔๐ฆ+๐ฆ=0 ๐ฆ ๐ฅ and ( ๐ฆ) ๐ฅ ๐ ⇔ ๐ฅ + ๐ฅ = 0. ๐ฅ ๐ฆ However, suppose that we want to substitute into ๐ so that the result is ๐ฆ + ๐ฅ = 0. To accomplish this, we need two new and distinct variable symbols, ๐ข and ๐ฃ. Then, 81 Section 2.2 SUBSTITUTION ( ) ๐ข ๐ฃ ๐ ⇔ ๐ข + ๐ฃ = 0, ๐ฅ ๐ฆ (2.7) ( ๐ฆ) ๐ฅ ⇔ ๐ฆ + ๐ฅ = 0. ๐ ๐ข ๐ฃ (2.8) and then, Therefore, ([( ) ] ) ๐ข ๐ฃ ๐ฆ ๐ฅ ๐ ⇔ ๐ฆ + ๐ฅ = 0. ๐ฅ ๐ฆ ๐ข ๐ฃ This works because in (2.7), each variable symbol is replaced by a new symbol in such a way that the resulting formula has the same meaning as the original. In this way, when the original variable symbols are brought back in (2.8), all of the substitutions are made into distinct variables so that there are no conflicts and the switch can be made. This process can be generalized to any number of variable symbols in any order. DEFINITION 2.2.10 Let ๐(๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 ) be an ๐ฒ-formula and ๐ข0 , ๐ข1 , … , ๐ข๐−1 be distinct variable symbols not among ๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 . Define ๐(๐ข0 , ๐ข1 , … , ๐ข๐−1 ) ⇔ ) ] ) ( [( ๐ข๐−1 ๐ข ๐ข1 ··· ··· ๐ 0 . ๐ฅ0 ๐ฅ1 ๐ฅ๐−1 Then, for all ๐ฒ-terms ๐ก0 , ๐ก1 , … , ๐ก๐−1 , define ) ] ) ( [( ๐ก๐−1 ๐ก0 ๐ก1 ๐(๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ⇔ · · · ๐ ··· . ๐ข0 ๐ข1 ๐ข๐−1 This is called a simultaneous substitution and is equivalent to replacing all free occurrences of ๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 in ๐ with ๐ก0 , ๐ก1 , … , ๐ก๐−1 , respectively. Observe that if ๐ does not have free variables, then ๐(๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ⇔ ๐. EXAMPLE 2.2.11 To illustrate Definition 2.2.10, let ๐ฑ๐จ′ be the theory symbols of ๐ฑ๐จ combined with constant symbols 1 and 2. Let ๐ be the ๐ฑ๐จ′ -formula ๐ฅ ⊗ (๐ฆ ⊕ ๐ง) = ๐ฅ ⊗ ๐ฆ ⊕ ๐ฅ ⊗ ๐ง. 82 Chapter 2 FIRST-ORDER LOGIC Since ๐ฅ, ๐ฆ, and ๐ง are free variables, represent ๐ by ๐(๐ฅ, ๐ฆ, ๐ง, ๐ค). Then, ([( ) ] ) 1 ๐ฅ ๐ฆ 2 ๐(1, ๐ฅ, ๐ฆ, 2) ⇔ ๐ ๐ข1 ๐ข2 ๐ข3 ๐ข4 ) ] ) ([( [ ] 1 ๐ฅ ๐ฆ 2 ⇔ ๐ข1 ⊗ (๐ข2 ⊕ ๐ข3 ) = ๐ข1 ⊗ ๐ข2 ⊕ ๐ข1 ⊗ ๐ข3 ๐ข1 ๐ข2 ๐ข3 ๐ค ([ ] ) ( ) ๐ฅ ๐ฆ 2 ⇔ 1 ⊗ (๐ข2 ⊕ ๐ข3 ) = 1 ⊗ ๐ข2 ⊕ 1 ⊗ ๐ข3 ๐ข ๐ข ๐ค )2 3 ( [ ] ๐ฆ 2 ⇔ 1 ⊗ (๐ฅ ⊕ ๐ข3 ) = 1 ⊗ ๐ฅ ⊕ 1 ⊗ ๐ข3 ๐ข3 ๐ค 2 ⇔ (1 ⊗ (๐ฅ ⊕ ๐ฆ) = 1 ⊗ ๐ฅ ⊕ 1 ⊗ ๐ฆ) ๐ค ⇔ 1 ⊗ (๐ฅ ⊕ ๐ฆ) = 1 ⊗ ๐ฅ ⊕ 1 ⊗ ๐ฆ. EXAMPLE 2.2.12 Let ๐ฒ have constant symbols 5 and 9. Define ๐(๐ฅ, ๐ฆ) to be the ๐ฒ-formula ∀๐ฅ∃๐ง [๐(๐ฅ, ๐ฆ) ∧ ๐(๐ง)] ∨ ∃๐ฆ [๐(๐ฆ) → ๐ (๐ฅ)] . The first occurrence of ๐ฆ is free, and since ∀๐ฅ applies only to variables within the brackets on the left, the last occurrence of ๐ฅ is also free. Therefore, ๐(๐ฅ, ๐ฆ) is the disjunction of two formulas that we call ๐ข(๐ฆ) and ๐ฃ(๐ฅ): ๐ข(๐ฆ) ๐ฃ(๐ฅ) โโโโโโโโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโ ∀๐ฅ∃๐ง [๐(๐ฅ, ๐ฆ) ∧ ๐(๐ง)] ∨ ∃๐ฆ [๐(๐ฆ) → ๐ (๐ฅ)], from which we derive ๐(9, 5) ⇔ ๐ข(5) ∨ ๐ฃ(9) ⇔ ∀๐ฅ∃๐ง[๐(๐ฅ, 5) ∧ ๐(๐ง)] ∨ ∃๐ฆ[๐(๐ฆ) → ๐ (9)] As in Example 2.2.11, finding ๐(9, 5) is equivalent to simply replacing the free occurrences of ๐ฅ with 9 and the occurrences of ๐ฆ with 5. EXAMPLE 2.2.13 Consider the ๐ญ๐ณ-formula, ∀๐ฅ∀๐ฆ∀๐ง [++๐ฅ๐ฆ๐ง = +๐ฅ+๐ฆ๐ง] , (2.9) ∀๐ฅ∀๐ฆ∀๐ง [(๐ฅ + ๐ฆ) + ๐ง = ๐ฅ + (๐ฆ + ๐ง)] . (2.10) which is often written as The formula within the scope of the first quantifier symbol in (2.10) is ๐(๐ฅ) := ∀๐ฆ∀๐ง [(๐ฅ + ๐ฆ) + ๐ง = ๐ฅ + (๐ฆ + ๐ง)] . (2.11) Section 2.2 SUBSTITUTION 83 Notice that the occurrences of ๐ฅ are free in (2.11), but the occurrences of ๐ฆ and ๐ง are bound. For example, we can make the substitution ๐(2) ⇔ ∀๐ฆ∀๐ง[(2 + ๐ฆ) + ๐ง = 2 + (๐ฆ + ๐ง)]. Now, letting ๐(๐ฅ, ๐ฆ) := ∀๐ง[(๐ฅ + ๐ฆ) + ๐ง = ๐ฅ + (๐ฆ + ๐ง)], we have that ๐(๐ฅ) ⇔ ∀๐ฆ๐(๐ฅ, ๐ฆ). We can also define ๐(๐ฅ, ๐ฆ, ๐ง) := (๐ฅ + ๐ฆ) + ๐ง = ๐ฅ + (๐ฆ + ๐ง) so that ๐(๐ฅ, ๐ฆ) ⇔ ∀๐ง ๐(๐ฅ, ๐ฆ, ๐ง). What we have done is to break apart an ๐ญ๐ณformula that contains multiple quantifiers into a sequence of formulas: ๐(๐ฅ) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ∀๐ฅ ∀๐ฆ ∀๐ง [(๐ฅ + ๐ฆ) + ๐ง = ๐ฅ + (๐ฆ + ๐ง)] . โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐(๐ฅ, ๐ฆ, ๐ง) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐(๐ฅ, ๐ฆ) We conclude that (2.10) can be written as ∀๐ฅ ๐(๐ฅ), ∀๐ฅ∀๐ฆ ๐(๐ฅ, ๐ฆ), or ∀๐ฅ∀๐ฆ∀๐ง ๐(๐ฅ, ๐ฆ, ๐ง). Observe that (2.9) has no free variables. It is a type of formula of particular importance because it represents a proposition. In other words, (2.9) is a propositional form. DEFINITION 2.2.14 An ๐ฒ-formula with no free variables is called an ๐ฆ-sentence. Exercises 1. Given a term, make the indicated substitution. ( ) 5 6 (a) 4 ๐ฆ [(๐ฅ ) ] ๐ ๐ (๐ฅ) ๐ (b) ๐ฅ ๐ฆ ๐ฅ ๐ฅ [ ] ๐ ๐(8) (c) (๐ฅ + ๐ฆ) ๐ฅ ๐ฆ [([ ] ) ] ๐ ๐ ๐ 5 (d) (๐ฅ + ๐ฆ + ๐ง) ๐ฅ ๐ฆ ๐ง ๐ฅ 84 Chapter 2 FIRST-ORDER LOGIC 2. Given a formula, make the indicated substitution. [ ๐ฆ] 4 (a) (๐ฅ < 2) ๐ฅ ๐ฅ ) ([ ๐ฆ] ๐ฅ 3 (b) (๐ฅ < 5 → ๐ฅ + 4 < 9) ๐ฅ ๐ฆ ๐ฅ ([([ ] ) ] ) 5 ๐ข ๐ฃ 6 ๐ (c) (๐ฅ = 5 ∧ ๐ฅ = ๐ฆ) ๐ฅ ๐ฆ ๐ฅ ๐ข ๐ฃ ( ) ๐ฅ ๐ฅ (d) [∃๐ฅ(๐ฅ − 4 = ๐ฆ) ∨ ∀๐ฆ(๐ฆ + ๐ง = ๐ฅ + 3)] ๐ฆ ๐ง 3. Identify all free occurrences of ๐ฅ in the given formulas. (a) ๐ฅ + 4 < 10 (b) ∃๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = 0) (c) ∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = ๐ฆ + ๐ฅ) ∨ ∀๐ง(๐ฅ + ๐ฆ = ๐ง − 3) (d) ∀๐ฅ[∀๐ง∀๐ฆ(๐ฅ + ๐ฆ = 2 ⋅ ๐ง) ↔ ๐ฅ + ๐ฅ = 2 ⋅ ๐ฅ] 4. Identify all bound occurrences of ๐ฅ in the given formulas. (a) (∀๐ฅ) [๐(๐ฅ) → (∃๐ฆ)๐(๐ฆ)] (b) (∃๐ฅ)๐(๐ฅ, ๐ฆ) → (∀๐ฆ)๐(๐ฅ) (c) (∃๐ฅ)(๐ฅ > 4) ∧ ๐ฅ < 10 (d) [(∀๐ฅ)(๐ฅ + 3 = 1) ∧ ๐ฅ = 9] ∨ (∃๐ฆ)(๐ฅ < 0) 5. Make the simultaneous substitution ๐(๐) for each of the given formulas. (a) ๐(๐ฅ) := 2๐ฅ + ๐ = 9 (b) ๐(๐ฅ) := ∀๐ฅ(๐ฅ + ๐ฆ = ๐ฆ + ๐ฅ) (c) ๐(๐ฅ) := ∃๐ฅ๐(๐ฅ) → ∀๐ฆ๐(๐ฅ, ๐ฆ) (d) ๐(๐ฅ) := ๐ฅ + 4 = 0 ∧ ∃๐ฅ(๐ฆ + ๐ง = ๐ฅ) ∨ ∃๐ง(๐ฅ + ๐ง = 3) 6. Make the simultaneous substitution ๐(1, 2, 3) for each of the given formulas. (a) ๐(๐ฅ, ๐ฆ, ๐ง) := ๐ข + ๐ฃ + ๐ค (b) ๐(๐ฅ, ๐ฆ, ๐ง) := ๐(๐ฅ, ๐ฆ) → (∀๐ฅ) [๐(๐ฅ) ∧ (∃๐ฆ)๐ก(๐ฆ, ๐ง)] . (c) ๐(๐ฅ, ๐ฆ, ๐ง) := ∃๐ฅ(๐ฅ + ๐ฆ = ๐ง) ∧ ∀๐ฆ(๐ฅ + ๐ฆ + ๐ง) ∨ ∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ + ๐ง) (d) ๐(๐ฅ, ๐ฆ, ๐ง) := ∃๐ฅ[๐(๐ฅ) ∧ ∀๐ฆ(๐(๐ฅ) ∧ ๐(๐ฆ) → ๐ฅ = ๐ฆ ∨ ๐ฆ = ๐ง)] 7. Identify the formula to the right of each quantifiers in the given formulas. (a) (∀๐ฅ) [๐(๐ฅ) → (∃๐ฆ)๐(๐ฆ)] (b) (∃๐ฅ)๐(๐ฅ, ๐ฆ) → (∀๐ฆ)๐(๐ฆ) (c) (∀๐ฅ)(∃๐ฆ)(∃๐ง)๐(๐ฅ, ๐ฆ, ๐ง) ∧ ๐(๐ค) (d) ๐(๐ฅ) ∧ (∃๐ฅ)(∀๐ฆ)๐(๐ฅ, ๐ฆ) ∨ (∀๐ง)๐(๐ง) 8. Which of the following are sentences. (a) ∀๐ฅ∀๐ฆ[๐(๐ฅ) ∨ ๐(๐ฆ)] ⇒ ∀๐ฆ[๐(๐) ∨ ๐(๐ฆ)] (b) ๐(2) ∧ ๐ก(3) ⇒ ∃๐ฆ[๐(2) ∧ ๐ก(๐ฆ)] (c) ∃๐ค[∃๐ฅ[๐(๐ฅ) ∨ ∃๐ง๐(๐ง) ↔ ∃๐ฆ[๐(๐ฅ) ∧ ๐(๐ฆ)] → ∀๐ฅ๐(๐ฅ)] → ∀๐ฆ๐(๐ฅ)] (d) ∀๐ฅ∀๐ฆ(๐(๐ฅ) → [๐(๐ฆ) ∧ ๐(๐ง)]) → ∃๐ฅ¬๐(๐ฅ) Section 2.3 SYNTACTICS 2.3 85 SYNTACTICS Since formulas without free variables are propositional forms, we can write proofs involving them using the rules from Sections 1.2 through 1.4. However, since the inference rules did not involve quantification, we need new rules to deal with universal and existential formulas. We need rules covering not only negations but also rules that enable the removal (instantiation) and adjoining (generalization) of quantifiers. We add these rules to Definition 1.2.13 to obtain a stronger notion of proof. Furthermore, since every sentence is a propositional form, every reference to a propositional form in Definition 1.2.13 can be considered to be a reference to a sentence. This allows us to write formal proofs using first-order languages. Quantifier Negation Consider the proposition all rectangles are squares. This sentence is false because there is a rectangle in which one side is twice the length of the adjacent side, so not all rectangles are squares. That is, some rectangle is not a square is true. Generalizing, we conclude that the negation of all P are Q is some P are not Q. This should be translated as an inference rule for formulas, so we assume ¬∀๐ฅ๐ ⇒ ∃๐ฅ¬๐. (2.12) Now consider the proposition some rectangles are round. This is false because there are no round rectangles, so all rectangles are not round is true. Generalizing, we conclude that the negation of some P are Q is all P are not Q. Again, this should be translated as an inference rule for formulas, so we assume ¬∃๐ฅ๐ ⇒ ∀๐ฅ¬๐. (2.13) 86 Chapter 2 FIRST-ORDER LOGIC ∀x p ∀x ¬p Negations ∃x p Figure 2.4 ∃x ¬p The modern square of opposition. Furthermore, by DN and (2.13), ∃๐ฅ¬๐ ⇒ ¬¬∃๐ฅ¬๐ ⇒ ¬∀๐ฅ¬¬๐ ⇒ ¬∀๐ฅ๐, and by DN with (2.12), ∀๐ฅ¬๐ ⇒ ¬¬∀๐ฅ¬๐ ⇒ ¬∃๐ฅ¬¬๐ ⇒ ¬∃๐ฅ๐. We summarize assumptions (2.12) and (2.13) and the two conclusions in the following replacement rule. REPLACEMENT RULES 2.3.1 [Quantifier Negation (QN)] Let ๐ฒ be theory symbols and ๐ be an ๐ฒ-formula. โ ¬∀๐ฅ๐ ⇔ ∃๐ฅ¬๐ โ ¬∃๐ฅ๐ ⇔ ∀๐ฅ¬๐. QN is illustrated with the modern square of opposition (Figure 2.4). Negations of quantified formulas are found at opposite corners. A version of the Square is found in Aristotle’s De Interpretatione, dating around 350 BC (Aristotle 1984). Whenever we negate formulas of the form ∀๐ฅ๐ or ∃๐ฅ๐, to make it easier to read, the final form should not have a negation immediately to the left of any quantifier, and using the replacement rules, the negation should be as far into the formula ๐ as possible. We say that such negations are in positive form. EXAMPLE 2.3.2 Find the negation of ∀๐ฅ(๐ ∧ ๐) and put the final answer in positive form. ¬∀๐ฅ(๐ ∧ ๐) ⇔ ∃๐ฅ¬(๐ ∧ ๐) ⇔ ∃๐ฅ(¬๐ ∨ ¬๐). The next example will use De Morgan’s law as the last one did. It also needs material implication and double negation. Section 2.3 SYNTACTICS 87 EXAMPLE 2.3.3 Find the negation of ∀๐ฅ∃๐ฆ[๐(๐ฅ) → ๐(๐ฆ)] and put it into positive form. ¬∀๐ฅ∃๐ฆ[๐(๐ฅ) → ๐(๐ฆ)] ⇔ ∃๐ฅ¬∃๐ฆ[๐(๐ฅ) → ๐(๐ฆ)] ⇔ ∃๐ฅ∀๐ฆ¬[๐(๐ฅ) → ๐(๐ฆ)] ⇔ ∃๐ฅ∀๐ฆ¬[¬๐(๐ฅ) ∨ ๐(๐ฆ)] ⇔ ∃๐ฅ∀๐ฆ[๐(๐ฅ) ∧ ¬๐(๐ฆ)]. Proofs with Universal Formulas Consider the sentence all multiples of 4 are even. This implies, for instance, that 8, 100, and −16 are even. To generalize this reasoning to formulas means that whenever we have ∀๐ฅ๐(๐ฅ), we also have ๐(๐), where ๐ might be either a constant symbol such as 8, 100, and −16 or a randomly chosen constant symbol. This gives the first inference rule that involves a quantifier. We do not prove it, but take it to be an axiom. Observe the use of Definition 2.1.6 INFERENCE RULE 2.3.4 [Universal Instantiation (UI)] If ๐(๐ฅ) is an ๐ฒ-formula, then for every constant symbol ๐ from ๐ฒ, ∀๐ฅ๐(๐ฅ) ⇒ ๐(๐), We make two observations about UI. First, since the resulting formula is to be part of a proof, the substitution must yield a sentence, so ๐ must be a constant symbol. Second, we use the notation ๐(๐ฅ) to represent the formula instead of ๐ because ∀๐ฅ๐(๐ฅ) will be part of a proof, which means, again, that it must be a sentence. Writing ๐(๐ฅ) limits the formula to have only ๐ฅ as a possible free variable, so ∀๐ฅ๐(๐ฅ) is a sentence. If the formula ๐ had free variables other than ๐ฅ, then ∀๐ฅ๐ would not be a sentence and not suitable for a proof. EXAMPLE 2.3.5 Let ๐, ๐, and ๐ be ๐ฒ-formulas and ๐ and ๐ be constant symbols from ๐ฒ. The following are legitimate uses of UI. Notice that each of the inferences results in an ๐ฒ-formula. โ ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] ⇒ ๐(๐) → ๐(๐) โ ∀๐ฅ [๐(๐ฅ) ∨ ∀๐ฆ๐(๐ฆ)] ⇒ ๐(๐) ∨ ∀๐ฆ๐(๐ฆ) โ ∀๐ฅ∀๐ฆ [๐(๐ฅ) ∨ ๐(๐ฆ)] ⇒ ∀๐ฆ [๐(๐) ∨ ๐(๐ฆ)] โ ∀๐ฆ [๐(๐) ∨ ๐(๐ฆ)] ⇒ ๐(๐) ∧ ๐(๐) โ ∀๐ฆ [๐(๐) ∨ ๐(๐ฆ)] ⇒ ๐(๐) ∧ ๐(๐). 88 Chapter 2 FIRST-ORDER LOGIC Before we can write formal proofs, we need a rule that will attach a universal quantifier. It will be different from universal instantiation, for it requires a criterion on the constant. DEFINITION 2.3.6 Let ๐ be a constant symbol introduced into a formal proof by a substitution. If the first occurrence of ๐ is in a sentence that follows by UI, then ๐ is arbitrary. Otherwise, ๐ is particular and can be denoted by ๐ฬ to serve as a reminder that the symbol is not arbitrary. The idea behind Definition 2.3.6 is that if a constant symbol ๐ is arbitrary, it represents a randomly selected object, but if ๐ is particular, then ๐ represents an object with at least one known property. This property can be identified by a formula ๐(๐ฅ) so that we have ๐(๐). EXAMPLE 2.3.7 The constant symbol ๐ in the following is not arbitrary because its first occurrence of ๐ in line 1 is not the result of UI. 1. 2. ๐(๐) ๐(๐) ∨ ๐(๐) Given Add Therefore, these two lines should be written using ๐ฬ instead of ๐. 1. 2. ๐(๐) ฬ ๐(๐) ฬ ∨ ๐(๐) ฬ Given Add However, ๐ in the following is arbitrary because its first occurrence is in line 2, and line 2 follows from line 1 by UI. 1. 2. 3. ∀๐ฅ๐(๐ฅ) ๐(๐) ๐(๐) ∨ ๐(๐) Given 1 UI Add We can now introduce the rule of inference that allows us to attach universal quantifiers to formulas. We state it as an axiom. INFERENCE RULE 2.3.8 [Universal Generalization (UG)] If ๐(๐ฅ) is an ๐ฒ-formula with no particular constant symbols and ๐ is an arbitrary constant symbol from ๐ฒ, ๐(๐) ⇒ ∀๐ฅ๐(๐ฅ). Consider the following argument: All squares are rectangles. All rectangles are quadrilaterals. Therefore, all squares are quadrilaterals. Section 2.3 SYNTACTICS 89 Representing the premises by ∀๐ฅ[๐ (๐ฅ) → ๐(๐ฅ)] and ∀๐ฅ[๐(๐ฅ) → ๐(๐ฅ)], a formal proof of this includes UG: 1. 2. 3. 4. 5. 6. ∀๐ฅ[๐ (๐ฅ) → ๐(๐ฅ)] ∀๐ฅ[๐(๐ฅ) → ๐(๐ฅ)] ๐ (๐) → ๐(๐) ๐(๐) → ๐(๐) ๐ (๐) → ๐(๐) ∀๐ฅ[๐ (๐ฅ) → ๐(๐ฅ)] Given Given 1 UI 2 UI 3, 4 HS 5 UG Since ๐ was introduced in line 3 by UI, it is an arbitrary constant symbol. In addition, ๐ (๐) → ๐(๐) contains no particular constant symbols that appeared in the proof by substitution. Hence, the application of UG in line 6 is legal. EXAMPLE 2.3.9 These are illegal uses of universal generalization. โ Let ๐(๐ฅ) be an ๐ฒ-formula with ๐ a constant symbol. 1. 2. ๐(๐) ∀๐ฅ๐(๐ฅ) Given 1 UG [error] The constant symbol ๐ in line 1 is particular, even without being written as ๐. ฬ It was not introduced to the proof by UI. Therefore, universal generalization does not apply. โ Suppose that ๐ is an arbitrary constant symbol. 1. 2. ๐ + ๐ฬ = 0 ∀๐ฅ(๐ฅ + ๐ฬ = 0) 1 UG [error] The restriction against ๐(๐ฅ) containing particular symbols prevents the errant conclusion in line 2. โ The following is an attempt to prove ∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = 2 ⋅ ๐ฅ) from the formula ∀๐ฅ(๐ฅ + ๐ฅ = 2 ⋅ ๐ฅ). 1. 2. 3. 4. ∀๐ฅ(๐ฅ + ๐ฅ = 2 ⋅ ๐ฅ) ๐+๐=2⋅๐ ∀๐ฆ(๐ + ๐ฆ = 2 ⋅ ๐) ∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = 2 ⋅ ๐ฅ) Given 1 UI 2 UG [error] 3 UG Although the constant symbol ๐ in line 2 is arbitrary, the proof is not valid. The reason is that an illegal substitution was made in line 3. To see this, let ๐(๐ฅ) := ๐ฅ + ๐ฅ = 2 ⋅ ๐ฅ. 90 Chapter 2 FIRST-ORDER LOGIC Applying universal generalization gives ∀๐ฆ(๐ฆ + ๐ฆ = 2 ⋅ ๐ฆ) because ๐(๐ฆ) ⇔ ๐ฆ + ๐ฆ = 2 ⋅ ๐ฆ, but this is not what was written in line 3. Now for some formal proofs. EXAMPLE 2.3.10 Prove: ∀๐ฅ∀๐ฆ๐(๐ฅ, ๐ฆ) โข ∀๐ฆ∀๐ฅ๐(๐ฅ, ๐ฆ) 1. 2. 3. 4. 5. ∀๐ฅ∀๐ฆ๐(๐ฅ, ๐ฆ) ∀๐ฆ๐(๐, ๐ฆ) ๐(๐, ๐) ∀๐ฅ๐(๐ฅ, ๐) ∀๐ฆ∀๐ฅ๐(๐ฅ, ๐ฆ) Given 1 UI 2 UI 3 UG 4 UG Since both ๐ and ๐ first appear because of UI, they are arbitrary and universal generalization can be applied to both constant symbols. EXAMPLE 2.3.11 Prove: ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] , ∀๐ฅ¬ [๐(๐ฅ) ∨ ๐(๐ฅ)] โข ∀๐ฅ¬๐(๐ฅ) 1. 2. 3. 4. 5. 6. 7. 8. ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] ∀๐ฅ¬ [๐(๐ฅ) ∨ ๐(๐ฅ)] ๐(๐) → ๐(๐) ¬ [๐(๐) ∨ ๐(๐)] ¬๐(๐) ∧ ¬๐(๐) ¬๐(๐) ¬๐(๐) ∀๐ฅ¬๐(๐ฅ) Given Given 1 UI 2 UI 4 DeM 5 Simp 3, 6 MT 7 UG Notice that the ๐ in line 3 was introduced because of UI. Hence, ๐ is arbitrary throughout the proof. Proofs with Existential Formulas Since 4 + 5 = 9, we conclude that there exists ๐ฅ such that ๐ฅ + 5 = 9. Since we can construct an isosceles triangle, we conclude that isosceles triangles exist. This motivates the next inference rule. 91 Section 2.3 SYNTACTICS THEOREM 2.3.12 [Existential Generalization (EG)] If ๐(๐ฅ) is an ๐ฒ-formula and ๐ is a constant symbol from ๐ฒ, ๐(๐) ⇒ ∃๐ฅ๐(๐ฅ). PROOF Assume ๐(๐). Suppose that ∀๐ฅ¬๐(๐ฅ). By UI we have that ¬๐(๐). Therefore, ¬∀๐ฅ¬๐(๐ฅ) by IP, from which ∃๐ฅ๐(๐ฅ) follows by QN. EXAMPLE 2.3.13 Each of the following is a valid use of existential generalization. โ ๐(๐) ∧ ¬๐(๐) ⇒ ∃๐ฅ [๐(๐ฅ) ∧ ¬๐(๐ฅ)] โ ๐(๐) ∧ ๐ก(๐) ⇒ ∃๐ฆ [๐(๐) ∧ ๐ก(๐ฆ)]. Before we write some proofs, here is the inference rule that allows us to detach existential quantifiers. THEOREM 2.3.14 [Existential Instantiation (EI)] If ๐(๐ฅ) is an ๐ฒ-formula, ∃๐ฅ๐(๐ฅ) ⇒ ๐(๐), ฬ where ๐ is a constant symbol from ๐ฒ that has no occurrence in the formal proof prior to ๐(๐). ฬ PROOF Assume that ∃๐ฅ๐(๐ฅ) does not infer ๐(๐) for any constant symbol ๐. Suppose ∃๐ฅ๐(๐ฅ). Combined with the assumption, this implies that ¬๐(๐) for every constant symbol ๐ by the law of the excluded middle (page 37) and DS. That is, ¬๐(๐) for an arbitrary constant symbol ๐. Therefore, ∀๐ฅ¬๐(๐ฅ) by UG, so ¬∃๐ฅ๐(๐ฅ) by QN, which is a contradiction. Therefore, ∃๐ฅ๐(๐ฅ) ⇒ ๐(๐) for some constant symbol ๐, and ๐ is particular by Definition 2.3.6. The constant symbol ๐ฬ obtained by EI is called a witness of ∃๐ฅ๐(๐ฅ). The reason that the constant symbol ๐ must have no prior occurrence in a proof when applying EI is because a used symbol already represents some object, so if ๐ had appeared in an earlier line, we would have no reason to assume that we could write ๐(๐) from ∃๐ฅ๐(๐ฅ). For example, given ∃๐ฅ(๐ฅ + 4 = 5) (2.14) ∃๐ฅ(๐ฅ + 2 = 13), (2.15) and we write ๐ + 4 = 5 for some constant symbol ๐ by EI and (2.14). By EI and (2.15), we conclude that ๐ + 2 = 13 for some constant symbol ๐, but inferring ๐ + 2 = 13 is invalid because ๐ ≠ ๐. 92 Chapter 2 FIRST-ORDER LOGIC EXAMPLE 2.3.15 The following are legal uses of existential instantiation assuming that ๐ and ๐ have no prior occurrences. โ ∃๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)] ⇒ ๐(๐) ฬ ∧ ๐(๐) ฬ ฬ ๐) → ๐(๐, ๐, ฬ ๐) โ ∃๐ฆ [๐(๐, ๐ฆ, ๐) → ๐(๐, ๐ฆ, ๐)] ⇒ ๐(๐, ๐, โ ∃๐ฅ∀๐ฆ∃๐ง๐(๐ฅ, ๐ฆ, ๐ง) ⇒ ∀๐ฆ∃๐ง๐(๐, ฬ ๐ฆ, ๐ง). EXAMPLE 2.3.16 Existential Instantiation cannot be used to justify either of the following. ฬ ∨ ๐(๐ง) by EI because the substitution โ ∃๐ง [๐(๐ง) ∨ ๐(๐ง)] does not imply ๐(๐) ฬ ∨ ๐(๐). ฬ was not made correctly. The result should have been ๐(๐) โ ∃๐ฅ∃๐ฆ๐(๐, ๐ฅ, ๐ฆ) does not imply ∃๐ฆ๐(๐, ๐, ๐ฆ) by EI because ๐ has a prior occurrence. Also, notice that the hat notation was not used. EXAMPLE 2.3.17 Assume that ๐ฅ is a real number and consider the following. 1. 2. 3. 4. 5. ∀๐ฅ∃๐ฆ(๐ฅ + ๐ฆ = 0) ∃๐ฆ(๐ + ๐ฆ = 0) ๐ + ๐ฬ = 0 ∀๐ฅ(๐ฅ + ๐ฬ = 0) ∃๐ฆ∀๐ฅ(๐ฅ + ๐ฆ = 0) Given 1 UI 2 EG 3 UG [error] 4 EG The conclusion in line 5 is incorrect. The problem lies in line 4 where UG was applied despite the particular constant symbol in line 3 (compare Example 2.3.9). This example makes clear why there is a restriction on particular elements in UG (Inference Rule 2.3.8). Since ๐ represents a particular real number, line 3 cannot be used to conclude that all real numbers plus that particular ๐ equals 0. We know that ๐ is the witness to line 2, but that is all we know about it. To correct the argument, we can essentially reverse the steps to arrive back at the premise. 1. 2. 3. 4. 5. ∀๐ฅ∃๐ฆ(๐ฅ + ๐ฆ = 0) ∃๐ฆ(๐ + ๐ฆ = 0) ๐ + ๐ฬ = 0 ∃๐ฆ(๐ฅ + ๐ฆ = 0) ∀๐ฅ∃๐ฆ(๐ฅ + ๐ฆ = 0) Given 1 UI 2 EG 3 EG 4 UG Notice that there is no particular constant symbol in line 4, so line 5 does legally follow by UG. Here are some formal proofs that use existential instantiation and generalization. EXAMPLE 2.3.18 Prove: ∃๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)] , ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] โข ∃๐ฅ๐(๐ฅ) Section 2.3 SYNTACTICS 1. 2. 3. 4. 5. 6. 7. ∃๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)] ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] ๐(๐) ฬ ∧ ๐(๐) ฬ ๐(๐) ฬ → ๐(๐) ฬ ๐(๐) ฬ ๐(๐) ฬ ∃๐ฅ๐(๐ฅ) 93 Given Given 1 EI 2 UI 3 Simp 4, 5 MP 6 EG EXAMPLE 2.3.19 Prove: ∀๐ฅ∃๐ฆ [๐(๐ฅ) ∧ ๐ก(๐ฆ)] โข ∀๐ฅ [๐(๐ฅ) ∧ (∃๐ฆ)๐ก(๐ฆ)] 1. 2. 3. 4. 5. 6. 7. 8. 9. ∀๐ฅ∃๐ฆ [๐(๐ฅ) ∧ ๐ก(๐ฆ)] ∃๐ฆ [๐(๐) ∧ ๐ก(๐ฆ)] ๐(๐) ∧ ๐ก(๐) ฬ ๐ก(๐) ฬ ∧ ๐(๐) ๐ก(๐) ฬ ∃๐ฆ๐ก(๐ฆ) ๐(๐) ๐(๐) ∧ (∃๐ฆ)๐ก(๐ฆ) ∀๐ฅ [๐(๐ฅ) ∧ (∃๐ฆ)๐ก(๐ฆ)] Given 1 UI 2 EI 3 Com 4 Simp 5 EG 3 Simp 6, 7 Conj 8 UG EXAMPLE 2.3.20 Prove : ๐(๐) → ∃๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)] , ๐(๐) โข ∃๐ฅ๐(๐ฅ) 1. 2. 3. 4. 5. 6. 7. ๐(๐) ฬ → ∃๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)] ๐(๐) ฬ ∃๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)] ฬ ∧ ๐(๐) ฬ ๐(๐) ฬ ฬ ๐(๐) ∧ ๐(๐) ฬ ๐(๐) ∃๐ฅ๐(๐ฅ) Given Given 1, 2 MP 3 EI 4 Com 5 Simp 6 EG Exercises 1. Use QN and other replacement rules to determine whether the following are legal replacements. (a) ¬∀๐ฅ๐(๐ฅ) ⇔ ∀๐ฅ¬๐(๐ฅ) (b) ¬∃๐ฅ๐(๐ฅ) ⇔ ¬∀๐ฅ๐(๐ฅ) (c) ∀๐ฅ¬๐(๐ฅ) ⇔ ∃๐ฅ๐(๐ฅ) (d) ∃๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] ⇔ ¬∀๐ฅ [¬๐(๐ฅ) → ¬๐(๐ฅ)] (e) ¬∀๐ฅ∃๐ฆ๐(๐ฅ, ๐ฆ) ⇔ ∃๐ฅ∀๐ฆ¬๐(๐ฅ, ๐ฆ) (f) ¬∀๐ฅ∃๐ฆ๐(๐ฅ, ๐ฆ) ⇔ ∃๐ฆ∀๐ฅ๐(๐ฅ, ๐ฆ) 94 Chapter 2 FIRST-ORDER LOGIC 2. Negate and put into positive form. (a) ∃๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] (b) ∀๐ฅ∃๐ฆ [๐(๐ฅ) ∧ ๐(๐ฆ)] (c) ∃๐ฅ∃๐ฆ [๐(๐ฅ) ∨ ๐(๐ฅ, ๐ฆ)] (d) ∀๐ฅ∀๐ฆ [๐(๐ฅ) ∨ (∃๐ง)๐(๐ฆ, ๐ง)] (e) ∃๐ฅ¬๐(๐ฅ) ∨ ∀๐ฅ [๐(๐ฅ) ↔ ¬๐(๐ฅ)] (f) ∀๐ฅ∀๐ฆ∃๐ง(๐(๐ฅ) → [๐(๐ฆ) ∧ ๐(๐ง)]) → ∃๐ฅ¬๐(๐ฅ) 3. Determine whether each pair of propositions are negations. If they are not, write the negation of both. (a) Every real number has a square root. Every real number does not have a square root. (b) Every multiple of four is a multiple of two. Some multiples of two are multiples of four. (c) For all ๐ฅ, if ๐ฅ is odd, then ๐ฅ2 is odd. There exists ๐ฅ such that if ๐ฅ is odd, then ๐ฅ2 is even. (d) There exists an integer ๐ฅ such that ๐ฅ + 1 = 10. For all integers ๐ฅ, ๐ฅ + 1 ≠ 10. 4. Write the negation of the following propositions in positive form and in English. (a) For all ๐ฅ, there exists ๐ฆ such that ๐ฆโ๐ฅ = 9. (b) There exists ๐ฅ so that ๐ฅ๐ฆ = 1 for all real numbers ๐ฆ. (c) Every multiple of ten is a multiple of five. (d) No interval contains a rational number. (e) There is an interval that contains a rational number. 5. Let ๐ be a function and ๐ be a real number in the open interval ๐ผ. Then, ๐ is continuous at ๐ if for every ๐ > 0, there exists ๐ฟ > 0 such that for all ๐ฅ in ๐ผ, if 0 < |๐ฅ − ๐| < ๐ฟ, then |๐ (๐ฅ) − ๐ (๐)| < ๐. (a) Write what it means for ๐ to be not continuous at ๐. (b) The function ๐ is continuous on an open interval if it is continuous at every point of the interval. Write what it means for ๐ to be not continuous on an open interval. 6. Let ๐ be a function and ๐ a real number in the open interval ๐ผ. Then, ๐ is uniformly continuous on ๐ผ means that for every ๐ > 0, there exists ๐ฟ > 0 such that for all ๐ and ๐ฅ in ๐ผ, if 0 < |๐ฅ − ๐| < ๐ฟ, then |๐ (๐ฅ) − ๐ (๐)| < ๐. (a) Write what it means for ๐ to be not uniformly continuous on ๐ผ. (b) How does ๐ being continuous on ๐ผ differ from ๐ being uniformly continuous on ๐ผ? 7. Prove using QN. (a) ∃๐ฅ๐(๐ฅ) โข ∀๐ฅ¬๐(๐ฅ) → ∀๐ฅ๐(๐ฅ) (b) ∀๐ฅ[๐(๐ฅ) → ๐(๐ฅ)], ∀๐ฅ[๐(๐ฅ) → ๐(๐ฅ)], ¬∀๐ฅ๐(๐ฅ) โข ∃๐ฅ¬๐(๐ฅ) (c) ∀๐ฅ๐(๐ฅ) → ∀๐ฆ[๐(๐ฆ) → ๐(๐ฆ)], ∃๐ฅ[๐(๐ฅ) ∧ ¬๐(๐ฅ)] โข ∃๐ฅ¬๐(๐ฅ) (d) ∃๐ฅ๐(๐ฅ) → ∃๐ฆ๐(๐ฆ), ∀๐ฅ¬๐(๐ฅ) โข ∀๐ฅ¬๐(๐ฅ) Section 2.3 SYNTACTICS 8. Find all errors in the given proofs. (a) “∃๐ฅ [๐(๐ฅ) ∨ ๐(๐ฅ)] , ∃๐ฅ¬๐(๐ฅ) โข ∃๐ฅ๐(๐ฅ)” Attempted Proof 1. ∃๐ฅ [๐(๐ฅ) ∨ ๐(๐ฅ)] 2. ∃๐ฅ¬๐(๐ฅ) 3. ๐(๐) ∨ ๐(๐) 4. ¬๐(๐) 5. ¬๐(๐) 6. ∃๐ฅ¬๐(๐ฅ) Given Given 1 EI 2 EI 3, 4 DS 5 EG (b) “∀๐ฅ๐(๐ฅ) โข ∃๐ฅ∀๐ฆ [๐(๐ฅ) ∨ ๐(๐ฆ)]” Attempted Proof 1. ∀๐ฅ๐(๐ฅ) 2. ๐(๐) ฬ 3. ๐(๐) ฬ ∨ ๐(๐) 4. ∀๐ฆ [๐(๐) ฬ ∨ ๐(๐ฆ)] 5. ∃๐ฆ∀๐ฅ [๐(๐ฅ) ∨ ๐(๐ฆ)] Given 1 UI 2 Add 3 UG 4 EG (c) “∃๐ฅ๐(๐ฅ), ∃๐ฅ๐(๐ฅ) โข ∀๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)]” Attempted Proof 1. ∃๐ฅ๐(๐ฅ) 2. ∃๐ฅ๐(๐ฅ) 3. ๐(๐) ฬ 4. ๐(๐) ฬ 5. ๐(๐) ฬ ∧ ๐(๐) 6. ∀๐ฅ [๐(๐ฅ) ∧ ๐(๐ฅ)] Given Given 1 EI 2 EI 3, 4 Conj 5 UG 9. Prove. (a) ∀๐ฅ๐(๐ฅ) โข ∀๐ฅ [๐(๐ฅ) ∨ ๐(๐ฅ)] (b) ∀๐ฅ๐(๐ฅ), ∀๐ฅ [๐(๐ฅ) → ¬๐(๐ฅ)] โข ∀๐ฅ¬๐(๐ฅ) (c) ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] , ∀๐ฅ๐(๐ฅ) โข ∀๐ฅ๐(๐ฅ) (d) ∀๐ฅ [๐(๐ฅ) ∨ ๐(๐ฅ)] , ∀๐ฅ¬๐(๐ฅ) โข ∀๐ฅ๐(๐ฅ) (e) ∃๐ฅ๐(๐ฅ) โข ∃๐ฅ [๐(๐ฅ) ∨ ๐(๐ฅ)] (f) ∃๐ฅ∃๐ฆ๐(๐ฅ, ๐ฆ) โข ∃๐ฆ∃๐ฅ๐(๐ฅ, ๐ฆ) (g) ∀๐ฅ∀๐ฆ๐(๐ฅ, ๐ฆ) โข ∀๐ฆ∀๐ฅ๐(๐ฅ, ๐ฆ) (h) ∀๐ฅ¬๐(๐ฅ), ∃๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] โข ∃๐ฅ¬๐(๐ฅ) (i) ∃๐ฅ๐(๐ฅ), ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] โข ∃๐ฅ๐(๐ฅ) (j) ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] , ∀๐ฅ [๐(๐ฅ) → ๐ (๐ฅ)] , ∃๐ฅ [๐(๐ฅ) ∨ ๐(๐ฅ)] โข ∃๐ฅ [๐(๐ฅ) ∨ ๐ (๐ฅ)] (k) ∃๐ฅ [๐(๐ฅ) ∧ ¬๐(๐ฅ)] , ∀๐ฅ [๐(๐ฅ) → ๐(๐ฅ)] โข ∃๐ฅ¬๐(๐ฅ) (l) ∀๐ฅ๐(๐ฅ), ∀๐ฅ([๐(๐ฅ) ∨ ๐(๐ฅ)] → [๐(๐ฅ) ∧ ๐ (๐ฅ)]) โข ∃๐ฅ๐ (๐ฅ) (m) ∃๐ฅ๐(๐ฅ), ∃๐ฅ๐(๐ฅ) → ∀๐ฅ∃๐ฆ [๐(๐ฅ) → ๐(๐ฆ)] โข ∀๐ฅ๐(๐ฅ) ∨ ∃๐ฅ๐(๐ฅ) 95 96 Chapter 2 FIRST-ORDER LOGIC (n) ∃๐ฅ๐(๐ฅ), ∃๐ฅ๐(๐ฅ), ∃๐ฅ∃๐ฆ [๐(๐ฅ) ∧ ๐(๐ฆ)] → ∀๐ฅ๐(๐ฅ) โข ∀๐ฅ๐(๐ฅ) 10. Prove the following: (a) ๐(๐ฅ) ∧ ∃๐ฆ๐(๐ฆ) ⇔ ∃๐ฆ[๐(๐ฅ) ∧ ๐(๐ฆ)] (b) ๐(๐ฅ) ∨ ∃๐ฆ๐(๐ฆ) ⇔ ∃๐ฆ[๐(๐ฅ) ∨ ๐(๐ฆ)] (c) ๐(๐ฅ) ∧ ∀๐ฆ๐(๐ฆ) ⇔ ∀๐ฆ[๐(๐ฅ) ∧ ๐(๐ฆ)] (d) ๐(๐ฅ) ∨ ∀๐ฆ๐(๐ฆ) ⇔ ∀๐ฆ[๐(๐ฅ) ∨ ๐(๐ฆ)] (e) ∃๐ฅ๐(๐ฅ) → ๐(๐ฆ) ⇔ ∀๐ฅ[๐(๐ฅ) → ๐(๐ฆ)] (f) ∀๐ฅ๐(๐ฅ) → ๐(๐ฆ) ⇔ ∃๐ฅ[๐(๐ฅ) → ๐(๐ฆ)] (g) ๐(๐ฅ) → ∃๐ฆ๐(๐) ⇔ ∃๐ฆ[๐(๐ฅ) → ๐(๐ฆ)] (h) ๐(๐ฅ) → ∀๐ฆ๐(๐) ⇔ ∀๐ฆ[๐(๐ฅ) → ๐(๐ฆ)] 11. An ๐ฒ-formula is in prenex normal form if it is of the form ๐0 ๐ฅ0 ๐1 ๐ฅ1 … ๐๐−1 ๐ฅ๐−1 ๐, where ๐0 , ๐1 , … , ๐๐−1 are quantifier symbols and ๐ is a ๐ฒ-formula. Every ๐ฒ-formula can be replaced with an ๐ฒ-formula in prenex normal form, although variables might need to be renamed. Use Exercise 10 and QN to put the given formulas in prenex normal form. (a) [๐(๐ฅ) ∨ ๐(๐ฅ)] → ∃๐ฆ[๐(๐ฆ) → ๐(๐ฆ)] (b) ¬∀๐ฅ[๐(๐ฅ) → ¬∃๐ฆ๐(๐ฆ)] (c) ∃๐ฅ๐(๐ฅ) → ∀๐ฅ∃๐ฆ [๐(๐ฅ) → ๐(๐ฆ)] (d) ∀๐ฅ[๐(๐ฅ) → ๐(๐ฆ)] ∧ ¬∃๐ฆ∀๐ง[๐(๐ฆ) → ๐ (๐ง)] 2.4 PROOF METHODS The purpose of the propositional logic of Chapter 1 is to model the basic reasoning that one does in mathematics. Rules that determine truth (semantics) and establish valid forms of proof (syntactics) were developed. The logic developed in this chapter is an extension of propositional logic. It is called first-order logic. As with propositional logic, first-order logic provides a working model of a type of deductive reasoning that happens when studying mathematics, but with a greater emphasis on what a particular proposition is communicating about its subject. Geometry can serve as an example. To solve geometric problems and prove geometric propositions means to work with the axioms and theorems of geometry. What steps are legal when doing this are dictated by the choice of the logic. Because the subjects of geometric propositions are mathematical objects, such as points, lines, and planes, first-order logic is a good choice. However, sometimes other logical systems are used. An example of such an alternative is secondorder logic, which allows quantification over function and relation symbols (page 73) in addition to quantification over variable symbols. Whichever logic is chosen, that logic provides the general rules of reasoning for the mathematical theory. Since it has its own axioms and theorems, a logic itself is a theory, but because it is intended to provide rules for other theories, it is sometimes called a metatheory (Figure 2.5). Although all mathematicians use logic, they usually do not use symbolic logic. Instead, their proofs are written using sentences in English or some other human language Section 2.4 PROOF METHODS 97 First-order logic (metatheory) Mathematics Geometry (theory) Figure 2.5 Calculus (theory) First-order logic is a metatheory of mathematical theories. and usually do not provide all of the details. Call these paragraph proofs. Their intention is to lead the reader from the premises to the conclusion, making the result convincing. In many instances, a proof could be translated into the first-order logic, but this is not needed to meet the need of the mathematician. However, that it could be translated means that we can use first-order logic to help us write paragraph proofs, and in this section, we make that connection. Universal Proofs Our first paragraph proofs will be for propositions with universal quantifiers. To prove ∀๐ฅ๐(๐ฅ) from a given set of premises, we show that every object satisfies ๐(๐ฅ) assuming those premises. Since the proofs are mathematical, we can restrict the objects to a particular universe. A proposition of the form ∀๐ฅ๐(๐ฅ) is then interpreted to mean that ๐(๐ฅ) holds for all x from a given universe. This restriction is reasonable because we are studying mathematical things, not airplanes or puppies. To indicate that we have made a restriction to a universe, we randomly select an object of the universe by writing an introduction. This is a proposition that declares the type of object represented by the variable. The following are examples of introductions: Let ๐ be a real number. Take ๐ to be an integer. Suppose ๐ is a positive integer. From here we show that ๐(๐) is true. This process is exemplified by the next diagram: 98 Chapter 2 FIRST-ORDER LOGIC ∀x p(x) Let a be an object in the universe. Prove p(a). These types of proofs are called universal proofs. EXAMPLE 2.4.1 To prove that for all real numbers ๐ฅ, (๐ฅ − 1)3 = ๐ฅ3 − 3๐ฅ2 + 3๐ฅ − 1, we introduce a real number and then check the equation. PROOF Let ๐ be a real number. Then, (๐ − 1)3 = (๐ − 1)(๐ − 1)2 = (๐ − 1)(๐2 − 2๐ + 1) = ๐3 − 3๐2 + 3๐ − 1. For our next example, we need some terminology. DEFINITION 2.4.2 For all integers ๐ and ๐, ๐ divides ๐ (written as ๐ โฃ ๐) if ๐ ≠ 0 and there exists an integer ๐ such that ๐ = ๐๐. Therefore, 6 divides 18, but 8 does not divide 18. In this case, write 6 โฃ 18 and 8 - 18. If ๐ โฃ ๐, we can also write that ๐ is divisible by ๐, ๐ is a divisor or a factor of ๐, or ๐ is a multiple of ๐. For this reason, to translate a predicate like 4 divides ๐, write ๐ = 4๐ for some integer ๐. A common usage of divisibility is to check whether an integer is divisible by 2 or not. DEFINITION 2.4.3 An integer ๐ is even if 2 โฃ ๐, and ๐ is odd if 2 โฃ (๐ − 1). We are now ready for the example. EXAMPLE 2.4.4 Let us prove the proposition Section 2.4 PROOF METHODS 99 the square of every even integer is even. This can be rewritten using a variable: for all even integers ๐, ๐2 is even. The proof goes like this. PROOF Let ๐ be an even integer. This means that there exists an integer ๐ such that ๐ = 2๐, so we can calculate: ๐2 = (2๐)2 = 4๐2 = 2(2๐2 ). Since 2๐2 is an integer, ๐2 is even. Notice how the definition was used in the proof. After the even number was introduced, a proposition that translated the introduction into a form that was easier to use was written. This was done using Definitions 2.4.2 and 2.4.3. Existential Proofs Suppose that we want to write a paragraph proof for ∃๐ฅ๐(๐ฅ). This means that we must show that there exists at least one object of the universe that satisfies the formula ๐(๐ฅ). It will be our job to find that object. To do this directly, we pick an object that we think will satisfy ๐(๐ฅ). This object is called a candidate. We then check that it does satisfy ๐(๐ฅ). This type of a proof is called a direct existential proof, and its structure is illustrated as follows: ∃x p(x) Choose a candidate from the universe. Check that the candidate satisfies p(x). EXAMPLE 2.4.5 To prove that there exists an integer ๐ฅ such that ๐ฅ2 + 2๐ฅ − 3 = 0, we find an integer that satisfies the equation. A basic factorization yields (๐ฅ + 3)(๐ฅ − 1) = 0. Since ๐ฅ = −3 or ๐ฅ = 1 will work, we choose (arbitrarily) ๐ฅ = 1. Therefore, (1)2 + 2(1) − 3 = 0, proving the existence of the desired integer. 100 Chapter 2 FIRST-ORDER LOGIC Suppose that we want to prove that there is a function ๐ such that the derivative of ๐ is 2๐ฅ. After a quick mental calculation, we choose ๐ (๐ฅ) = ๐ฅ2 as a candidate and check to find that ๐ 2 ๐ฅ = 2๐ฅ. (2.16) ๐๐ฅ Notice that ๐โ๐๐ฅ is a function that has functions as its inputs and outputs. Let ๐ represent this function. That is, ๐ is a function symbol such that ๐(๐ )(๐ฅ) = ๐ ๐ (๐ฅ). ๐๐ฅ A formula that represents the proposition that was just proved is ∃๐ (๐(๐ )(๐ฅ) = 2๐ฅ). (2.17) This is a second-order formula (page 73) because the variable symbol ๐ฅ represents a real number (an object of the universe) and ๐ is a function symbol taking real numbers as arguments. Although this kind of reasoning is common to mathematics, it cannot be written as a first-order formula. This shows that there is a purpose to second-order logic. It is not a novelty. EXAMPLE 2.4.6 To represent (2.16) as a first-order formula, let ๐ be a unary function symbol representing the derivative and write ∀๐ฅ[๐(๐ฅ2 ) = 2๐ฅ]. Notice, however, that this formula does not convey the same meaning as (2.17). Multiple Quantifiers Let us take what we have learned concerning the universal and existential quantifiers and write paragraph proofs involving both. The first example is a simple one from algebra but will nicely illustrate our method. EXAMPLE 2.4.7 Prove that for every real number ๐ฅ, there exists a real number ๐ฆ so that ๐ฅ + ๐ฆ = 2. This translates to ∀๐ฅ∃๐ฆ(๐ฅ + ๐ฆ = 2) with the universe equal to the real numbers. Remembering that a universal quantifier must apply to all objects of the universe and an existential quantifier means that we must find the desired object, we have the following: Section 2.4 PROOF METHODS Let x be a real number. 101 Find a real number y. ∀x ∃y (x + y = 2) Check that x + y = 2. After taking an arbitrary ๐ฅ, our candidate will be ๐ฆ = 2 − ๐ฅ. PROOF Let ๐ฅ be a real number. Choose ๐ฆ = 2 − ๐ฅ and calculate: ๐ฅ + ๐ฆ = ๐ฅ + (2 − ๐ฅ) = 2. Now let us switch the order of the quantifiers. EXAMPLE 2.4.8 To see that there exists an integer ๐ฅ such that for all integers ๐ฆ, ๐ฅ + ๐ฆ = ๐ฆ, we use the following: Find an integer x. Let y be an integer. ∃x ∀y (x + y = y) Show x + y = y. In the proof, the first goal is to identify a candidate. Then, we must show that it works with every real number. PROOF We claim that 0 is the sought after object. To see this, let ๐ฆ be an integer. Then 0 + ๐ฆ = ๐ฆ. The next example will involve two existential quantifiers. Therefore, we have to find two candidates. EXAMPLE 2.4.9 We prove that there exist real numbers ๐ and ๐ so that for every real number ๐ฅ, (๐ + 2๐)๐ฅ + (2๐ − ๐) = 2๐ฅ − 6. Translating we arrive at 102 Chapter 2 FIRST-ORDER LOGIC Find a real number a. Find a real number b. ∃a ∃b ∀x[(a + 2b)x + (2a − b) = 2x − 6] Let x be a real number. Show (a + 2b)x + (2a − b) = 2x − 6. We have to choose two candidates, one for ๐ and one for ๐, and then check by taking an arbitrary ๐ฅ. PROOF Solving the system, ๐ + 2๐ = 2, 2๐ − ๐ = −6, we choose ๐ = −2 and ๐ = 2. Let ๐ฅ be a real number. Then, (๐ + 2๐)๐ฅ + (2๐ − ๐) = (−2 + 2 ⋅ 2)๐ฅ + (2 ⋅ (−2) − 2) = 2๐ฅ − 6. Counterexamples There are many times in mathematics when we must show that a proposition of the form ∀๐ฅ๐(๐ฅ) is false. This can be accomplished by proving ∃๐ฅ¬๐(๐ฅ) is true, and this is done by showing that an object ๐ exists in the universe such that ๐(๐) is false. This object is called a counterexample to ∀๐ฅ๐(๐ฅ). EXAMPLE 2.4.10 Show false: ๐ฅ + 2 = 7 for all real numbers ๐ฅ. To do this, we show that ∃๐ฅ(๐ฅ + 2 ≠ 7) is true by noting that the real number 0 satisfies ๐ฅ + 2 ≠ 7. Hence, 0 is a counterexample to ∀๐ฅ(๐ฅ + 2 = 7). The idea of a counterexample can be generalized to cases with multiple universal quantifiers. EXAMPLE 2.4.11 We know from algebra that every quadratic polynomial with real coefficients has a real zero is false. This can be symbolized as ∀๐∀๐∀๐∃๐ฅ(๐๐ฅ2 + ๐๐ฅ + ๐ = 0), where the universe is the collection of real numbers. The counterexample is found by demonstrating ∃๐∃๐∃๐∀๐ฅ(๐๐ฅ2 + ๐๐ฅ + ๐ ≠ 0). Section 2.4 PROOF METHODS 103 Many polynomials could be chosen, but we select ๐ฅ2 + 1. Its only zeros are ๐ and −๐. This means that ๐ = 1, ๐ = 0, and ๐ = 1 is our counterexample. Direct Proof Direct Proof is the preferred method for proving implications. To use Direct Proof to write a paragraph proof identify the antecedent and consequent of the implication and then follow these steps: โ assume the antecedent, โ translate the antecedent, โ translate the consequent so that the goal of the proof is known, โ deduce the consequent. Our first example will use Definitions 2.4.2 and 2.4.3. Notice the introductions in the proof. EXAMPLE 2.4.12 We use Direct Proof to write a paragraph proof of the proposition for all integers ๐ฅ, if 4 divides ๐ฅ, then ๐ฅ is even. First, randomly choose an integer ๐ and then identify the antecedent and consequent: if 4 divides ๐ฅ , then ๐ฅ is even . Use these to identify the structure of the proof: 4 divides a. This means a = 4k. … Show a = 2l. 2 divides a. Assume the antecedent. Translate the antecedent. Translate the consequent. Deduce the consequent. Now write the final version from this structure. PROOF Assume that 4 divides the integer ๐. This means ๐ = 4๐ for some integer ๐. We must show that ๐ = 2๐ for some integer ๐, but we know that ๐ = 4๐ = 2(2๐). Hence, let ๐ = 2๐. Sometimes it is difficult to prove a conditional directly. An alternative is to prove the contrapositive. This is sometimes easier or simply requires fewer lines. The next example shows this method in a paragraph proof. 104 Chapter 2 FIRST-ORDER LOGIC EXAMPLE 2.4.13 Let us show that for all integers ๐, if ๐2 is odd, then ๐ is odd. A direct proof of this is a problem. Instead, we prove its contrapositive, if ๐ is not odd, then ๐2 is not odd. In other words, we prove that if ๐ is even, then ๐2 is even. This will be done using Direct Proof: Let n be even. This means n = 2k. … Show n2 = 2l. 2 divides n 2 . Assume the antecedent. Translate the antecedent. Translate the consequent. Deduce the consequent. This leads to the final proof. PROOF Let ๐ be an even integer. This means that ๐ = 2๐ for some integer ๐. To see that ๐2 is even, calculate to find ๐2 = (2๐)2 = 4๐2 = 2(2๐2 ). Notice that this proof is basically the same as the proof for the square of every even integer is even on page 99. This illustrates that there is a connection between universal proofs and direct proofs (Exercise 4). Existence and Uniqueness There will be times when we want to show that there exists exactly one object that satisfies a given predicate ๐(๐ฅ). In other words, there exists a unique object that satisfies ๐(๐ฅ). This is a two-step process. โ Existence: Show that there is at least one object that satisfies ๐(๐ฅ). โ Uniqueness: Show that there is at most one object that satisfies ๐(๐ฅ). This is usually done by assuming both ๐ and ๐ satisfy ๐(๐ฅ) and then proving ๐ = ๐. This means to prove that there exists a unique ๐ฅ such that ๐(๐ฅ), we prove ∃๐ฅ๐(๐ฅ) ∧ ∀๐ฅ∀๐ฆ(๐(๐ฅ) ∧ ๐(๐ฆ) → ๐ฅ = ๐ฆ). Use direct or indirect existential proof to demonstrate that an object exists. The next example illustrates proving uniqueness. Section 2.4 PROOF METHODS 105 EXAMPLE 2.4.14 Let ๐ and ๐ be nonnegative integers with ๐ ≠ 0. To show that there is at most one pair of nonnegative integers ๐ and ๐ such that ๐ < ๐ and ๐ = ๐๐ + ๐, suppose in addition to ๐ and ๐ that there exists nonnegative integers ๐′ and ๐ ′ such that ๐′ < ๐ and ๐ = ๐๐ ′ + ๐′ . Assume that ๐′ > ๐. By Exercise 13, ๐ > ๐ ′ , so there exists ๐ข, ๐ฃ > 0 such that ๐′ = ๐ + ๐ข and ๐ = ๐ ′ + ๐ฃ. Therefore, ๐(๐ ′ + ๐ฃ) + ๐ = ๐๐ ′ + ๐ + ๐ข, ๐๐ ′ + ๐๐ฃ + ๐ = ๐๐ ′ + ๐ + ๐ข, ๐๐ฃ = ๐ข. Since ๐ฃ > 0, there exists ๐ค such that ๐ฃ = ๐ค + 1. Hence, ๐๐ค + ๐ = ๐(๐ค + 1) = ๐๐ฃ = ๐ข, so ๐ ≤ ๐ข. However, since ๐ < ๐ and ๐′ < ๐, we have that ๐ข < ๐ (Exercise 13), which is impossible. Lastly, the assumption ๐ > ๐′ leads to a similar contradiction. Therefore, ๐ = ๐′ , which implies that ๐ = ๐ ′ . EXAMPLE 2.4.15 To prove 2๐ฅ + 1 = 5 has a unique real solution, we show ∃๐ฅ(2๐ฅ + 1 = 5) and ∀๐ฅ∀๐ฆ(2๐ฅ + 1 = 5 ∧ 2๐ฆ + 1 = 5 → ๐ฅ = ๐ฆ). โ We know that ๐ฅ = 2 is a solution since 2(2) + 1 = 5. โ Suppose that ๐ and ๐ are solutions. We know that both 2๐ + 1 = 5 and 2๐ + 1 = 5, so we calculate: 2๐ + 1 = 2๐ + 1, 2๐ = 2๐, ๐ = ๐. Indirect Proof To use indirect proof, assume each premise and then assume the negation of the conclusion. Then proceed with the proof until a contradiction is reached. At this point, deduce the original conclusion. 106 Chapter 2 FIRST-ORDER LOGIC EXAMPLE 2.4.16 Earlier we proved for all integers ๐, if ๐2 is odd, then ๐ is odd by showing its contrapositive. Here we nest an Indirect Proof within a Direct Proof: Assume is odd. Suppose n is even. Assume the antecedent. Assume the negation. … n2 Contradiction n is odd. Deduce a contradiction. Conclude the consequent. We use this structure to write the paragraph proof. PROOF Take an integer ๐ and let ๐2 be odd. In order to obtain a contradiction, assume that ๐ is even. So, ๐ = 2๐ for some integer ๐. Substituting, we have ๐2 = (2๐)2 = 2(2๐2 ), showing that ๐2 is even. This is a contradiction. Therefore, ๐ is an odd integer. Indirect proof has been used to prove many famous mathematical results including the next example. EXAMPLE 2.4.17 We show that √ 2 is an irrational number. Suppose instead that √ ๐ 2= , ๐ where ๐ and ๐ are integers, ๐ ≠ 0, and the fraction ๐โ๐ has been reduced. Then, 2= ๐2 , ๐2 so ๐2 = 2๐2 . Therefore, ๐ = 2๐ for some integer ๐ [Exercise 11(c)]. We conclude that ๐2 = 2๐2 , which implies that ๐ is also even. However, this is impossible because the fraction was reduced. There are times when it is difficult or impossible to find a candidate for an existential proof. When this happens, an indirect existential proof can sometimes help. Section 2.4 PROOF METHODS 107 EXAMPLE 2.4.18 If ๐ is an integer such that ๐ > 1, then ๐ is prime means that the only positive divisors of ๐ are 1 and ๐; else ๐ is composite. From the definition, if ๐ is composite, there exist ๐ and ๐ such that ๐ = ๐๐ and 1 < ๐ ≤ ๐ < ๐. For example, 2, 11, and 97 are prime, and 4 and 87 are composite. Euclid proved that there are infinitely many prime numbers (Elements IX.20). Since it is impossible to find all of these numbers, we prove this theorem indirectly. Suppose that there are finitely many prime numbers and list them as ๐0 , ๐1 , … , ๐๐−1 . Consider ๐ = ๐0 ๐1 · · · ๐๐−1 + 1. If ๐ is prime, ๐ = ๐๐ for some ๐ = 0, 1, … , ๐ − 1, which would imply that ๐๐ divides 1. Therefore, ๐ is composite, but this is also impossible because ๐ must have a prime factor, which again means that 1 would be divisible by a prime. Biconditional Proof The last three types of proofs that we will examine rely on direct proof. The first of these takes three forms, and they provide the usual method of proving biconditionals. The first form follows because ๐ → ๐ and ๐ → ๐ imply (๐ → ๐) ∧ (๐ → ๐) by Conj, which implies ๐ ↔ ๐ by Equiv. INFERENCE RULE 2.4.19 [Biconditional Proof (BP)] ๐ → ๐, ๐ → ๐ ⇒ ๐ ↔ ๐. The rule states that a biconditional can be proved by showing both implications. Each is usually proved with direct proof. As is seen in the next example, the ๐ → ๐ subproof is introduced by (→) and its converse with (←). The conclusions of the two applications of direct proof are combined in line 7. EXAMPLE 2.4.20 Prove: ๐ → ๐ โข ๐ ∧ ๐ ↔ ๐ 1. (→) 2. 3. (←) 4. 5. 6. 7. ๐→๐ →๐ ∧ ๐ ๐ →๐ ๐ ๐∧๐ ๐∧๐ ↔๐ Given Assumption 2 Simp Assumption 1, 4 MP 4, 5 Conj 2–6 BP 108 Chapter 2 FIRST-ORDER LOGIC Sometimes the steps for one part are simply the steps for the other in reverse. When this happens, our work is cut in half, and we can use the short rule of biconditional proof. These proofs are simply a sequence of replacements with or without the reasons. This is a good method when only rules of replacement are used as in the next example. (Notice that there are no hypotheses to assume.) EXAMPLE 2.4.21 Prove: โข ๐ → ๐ ↔ ๐ ∧ ¬๐ → ¬๐ ๐ → ๐ ⇔ ¬๐ ∨ ๐ ⇔ ¬๐ ∨ ¬๐ ∨ ๐ ⇔ ¬๐ ∨ (¬๐ ∨ ๐) ⇔ ¬๐ ∨ ๐ ∨ ¬๐ ⇔ ¬๐ ∨ ¬¬๐ ∨ ¬๐ ⇔ ¬(๐ ∧ ¬๐) ∨ ¬๐ ⇔ ๐ ∧ ¬๐ → ¬๐ Impl Idem Assoc Com DN DM Impl EXAMPLE 2.4.22 Let us use biconditional proof to show that for all integers ๐, ๐ is even if and only if ๐3 is even. Since this is a biconditional, we must show both implications: if ๐ is even , then ๐3 is even , and if ๐3 is even , then ๐ is even . To prove the second conditional, we prove its contrapositive. Therefore, using the pattern of the previous example, the structure is (→) (←) →Assume ๐ is even Then, ๐ = 2๐ for some integer ๐ โฎ ๐3 = 2๐, with ๐ being an integer ๐3 is even →Assume ๐ is odd Then, ๐ = 2๐ + 1 for some integer ๐ โฎ ๐3 = 2๐ + 1, with ๐ being an integer ๐3 is odd Section 2.4 PROOF METHODS 109 PROOF Let ๐ be an integer. โ Assume ๐ is even. Then, ๐ = 2๐ for some integer ๐. We show that ๐3 is even. To do this, we calculate: ๐3 = (2๐)3 = 2(4๐3 ), which means that ๐3 is even. โ Now suppose that ๐ is odd. This means that ๐ = 2๐ + 1 for some integer ๐. To show that ๐3 is odd, we again calculate: ๐3 = (2๐ + 1)3 = 8๐3 + 12๐2 + 6๐ + 1 = 2(4๐3 + 6๐2 + 3๐) + 1. Hence, ๐3 is odd. Notice that the words were chosen carefully to make the proof more readable. Furthermore, the example could have been written with the words necessary and sufficient introducing the two subproofs. The (→) step could have been introduced with a phrase like to show sufficiency, and the (←) could have opened with as for necessity. There will be times when we need to prove a sequence of biconditionals. The propositional forms ๐0 , ๐1 , … , ๐๐−1 are pairwise equivalent if for all ๐, ๐, ๐๐ ↔ ๐๐ . In other words, ๐0 ↔ ๐1 , ๐0 ↔ ๐2 , … , ๐1 ↔ ๐2 , ๐1 ↔ ๐3 , … , ๐๐−2 ↔ ๐๐−1 . To prove all of these, we make use of the Hypothetical Syllogism. For example, if we know that ๐0 → ๐1 , ๐1 → ๐2 , and ๐2 → ๐0 , then ๐0 → ๐2 (because ๐0 → ๐1 ∧ ๐1 → ๐2 ), ๐1 → ๐0 (because ๐1 → ๐2 ∧ ๐2 → ๐0 ), ๐2 → ๐1 (because ๐2 → ๐0 ∧ ๐0 → ๐1 ). The result is the equivalence rule. 110 Chapter 2 FIRST-ORDER LOGIC INFERENCE RULE 2.4.23 [Equivalence Rule] To prove that the propositional forms ๐0 , ๐1 , … , ๐๐−1 are pairwise equivalent, prove: ๐0 → ๐1 , ๐1 → ๐2 , … , ๐๐−2 → ๐๐−1 , ๐๐−1 → ๐0 . In practice, the equivalence rule will typically be used to prove propositions that include the phrase the following are equivalent. EXAMPLE 2.4.24 Let ๐ (๐ฅ) = ๐๐ ๐ฅ๐ + ๐๐−1 ๐ฅ๐−1 + · · · + ๐1 ๐ฅ + ๐0 be a polynomial with real coefficients. That is, each ๐๐ is a real number and ๐ is a nonnegative integer. An integer ๐ is a zero of ๐ (๐ฅ) if ๐๐ ๐๐ + ๐๐−1 ๐๐−1 + · · · + ๐1 ๐ + ๐0 = 0, written as ๐ (๐) = 0, and a polynomial ๐(๐ฅ) is a factor of ๐ (๐ฅ) if there is a polynomial โ(๐ฅ) such that ๐ (๐ฅ) = ๐(๐ฅ)โ(๐ฅ). Whether ๐(๐ฅ) is a factor of ๐ (๐ฅ) or not, there exist unique polynomials ๐(๐ฅ) and ๐(๐ฅ) such that ๐ (๐ฅ) = ๐(๐ฅ)๐(๐ฅ) + ๐(๐ฅ) (2.18) the degree of ๐(๐ฅ) is less than the degree of ๐(๐ฅ). (2.19) and This result is called the polynomial division algorithm, The polynomial ๐(๐ฅ) is the quotient and ๐(๐ฅ) is the remainder. We prove that the following are equivalent: โ ๐ is a zero of ๐ (๐ฅ). โ ๐ is a solution to ๐ (๐ฅ) = 0. โ ๐ฅ − ๐ is a factor of ๐ (๐ฅ). To do this, we prove three conditionals: if ๐ is a zero of ๐ (๐ฅ) , then ๐ is a solution to ๐ (๐ฅ) = 0 , if ๐ is a solution to ๐ (๐ฅ) = 0 , then ๐ฅ − ๐ is a factor of ๐ (๐ฅ) , and if ๐ฅ − ๐ is a factor of ๐ (๐ฅ) , then ๐ is a zero of ๐ (๐ฅ) . Section 2.4 PROOF METHODS 111 We use direct proof on each. PROOF Let ๐๐ ๐ฅ๐ + ๐๐−1 ๐ฅ๐−1 + · · · + ๐1 ๐ฅ + ๐0 be a polynomial and assume that the coefficients are real numbers. Denote the polynomial by ๐ (๐ฅ). โ Let ๐ be a zero of ๐ (๐ฅ). By definition, this means ๐ (๐) = 0, so ๐ is a solution to ๐ (๐ฅ) = 0. โ Suppose ๐ is a solution to ๐ (๐ฅ) = 0. The polynomial division algorithm (2.18) gives polynomials ๐(๐ฅ) and ๐(๐ฅ) such that ๐ (๐ฅ) = ๐(๐ฅ)(๐ฅ − ๐) + ๐(๐ฅ) and the degree of ๐(๐ฅ) is less than 1 (2.19). Hence, ๐(๐ฅ) is a constant that we simply write as ๐. Now, 0 = ๐ (๐) = ๐(๐)(๐ − ๐) + ๐ = 0 + ๐ = ๐. Therefore, ๐ (๐ฅ) = ๐(๐ฅ)(๐ฅ − ๐), so ๐ฅ − ๐ is a factor of ๐ (๐ฅ). โ Lastly, assume ๐ฅ − ๐ is a factor of ๐ (๐ฅ). This means that there exists a polynomial ๐(๐ฅ) so that ๐ (๐ฅ) = (๐ฅ − ๐)๐(๐ฅ). Thus, ๐ (๐) = (๐ − ๐)๐(๐) = 0, which means ๐ is a zero of ๐ (๐ฅ). Proof of Disunctions The second type of proof that relies on direct proof is the proof of a disjunction. To prove ๐ ∨ ๐, it is standard to assume ¬๐ and show ๐. This means that we would be using direct proof to show ¬๐ → ๐. This is what we want because ¬๐ → ๐ ⇔ ¬¬๐ ∨ ๐ ⇔ ๐ ∨ ๐. The intuition behind the strategy goes like this. If we need to prove ๐ ∨ ๐ from some hypotheses, it is not reasonable to believe that we can simply prove ๐ and then use Addition to conclude ๐ ∨ ๐. Indeed, if we could simply prove ๐, we would expect the conclusion to be stated as ๐ and not ๐ ∨ ๐. Hence, we need to incorporate both disjuncts into the proof. We do this by assuming the negation of one of the disjuncts. If we can prove the other, the disjunction must be true. 112 Chapter 2 FIRST-ORDER LOGIC EXAMPLE 2.4.25 To prove for all integers ๐ and ๐, if ๐๐ = 0, then ๐ = 0 or ๐ = 0, we assume ๐๐ = 0 and show that ๐ ≠ 0 implies ๐ = 0. PROOF Let ๐ and ๐ be integers. Let ๐๐ = 0 and suppose ๐ ≠ 0. Then, ๐−1 exists. Multiplying both sides of the equation by ๐−1 gives ๐−1 ๐๐ = ๐−1 ⋅ 0, so ๐ = 0. Proof by Cases The last type of proof that relies on direct proof is proof by cases. Suppose that we want to prove ๐ → ๐ and this is difficult for some reason. We notice, however, that ๐ can be broken into cases. Namely, there exist ๐0 , ๐1 , … , ๐๐−1 such that ๐ ⇔ ๐0 ∨ ๐1 ∨ · · · ∨ ๐๐−1 . If we can prove ๐๐ → ๐ for each ๐, we have proved ๐ → ๐. If ๐ = 2, then ๐ ⇔ ๐0 ∨ ๐1 , and the justification of this is as follows: (๐0 → ๐) ∧ (๐1 → ๐) ⇔ (¬๐0 ∨ ๐) ∧ (¬๐1 ∨ ๐) ⇔ (๐ ∨ ¬๐0 ) ∧ (๐ ∨ ¬๐1 ) ⇔ ๐ ∨ ¬๐0 ∧ ¬๐1 ⇔ ¬๐0 ∧ ¬๐1 ∨ ๐ ⇔ ¬(๐0 ∨ ๐1 ) ∨ ๐ ⇔ ๐0 ∨ ๐1 → ๐ ⇔ ๐ → ๐. This generalizes to the next rule. INFERENCE RULE 2.4.26 [Proof by Cases (CP)] For every positive integer ๐, if ๐ ⇔ ๐0 ∨ ๐1 ∨ · · · ∨ ๐๐−1 , then ๐0 → ๐, ๐1 → ๐, … , ๐๐−1 → ๐ ⇒ ๐ → ๐. For example, since ๐ is a real number if and only if ๐ > 0, ๐ = 0, or ๐ < 0, if we needed to prove a proposition about an arbitrary real number, it would suffice to prove the result individually for ๐ > 0, ๐ = 0, and ๐ < 0. Section 2.4 PROOF METHODS 113 EXAMPLE 2.4.27 Our example of a proof by cases is a well-known one: for all integers ๐ and ๐, if ๐ = ±๐, then ๐ divides ๐ and ๐ divides ๐. The antecedent means ๐ = ๐ or ๐ = −๐, which are the two cases, so we have to show both if ๐ = ๐, then ๐ divides ๐ and ๐ divides ๐ and if ๐ = −๐, then ๐ divides ๐ and ๐ divides ๐. This leads to the following structure: (Case 1) (Case 2) →Suppose ๐ = ±๐ Thus, ๐ = ๐ or ๐ = −๐ →Assume ๐ = ๐ โฎ ๐ divides ๐ and ๐ divides ๐ →Assume ๐ = −๐ โฎ ๐ divides ๐ and ๐ divides ๐ ∴ ๐ divides ๐ and ๐ divides ๐ The final proof looks something like this. PROOF Let ๐ and ๐ be integers and suppose ๐ = ±๐. To show ๐ divides ๐ and ๐ divides ๐, we have two cases to prove. โ Assume ๐ = ๐. Then, ๐ = ๐ ⋅ 1 and ๐ = ๐ ⋅ 1. โ Next assume ๐ = −๐. This means that ๐ = ๐ ⋅ (−1) and ๐ = ๐ ⋅ (−1). In both the cases, we have proved that ๐ divides ๐ and ๐ divides ๐. Exercises 1. Let ๐ be an integer. Demonstrate that each of the following are divisible by 6. (a) 18 (b) −24 (c) 0 (d) 6๐ + 12 (e) 23 ⋅ 34 ⋅ 75 (f) (2๐ + 2)(3๐ + 6) 2. Let ๐ be a nonzero integer. Write paragraph proofs for each proposition. 114 Chapter 2 FIRST-ORDER LOGIC (a) 1 divides ๐. (b) ๐ divides 0. (c) ๐ divides ๐. 3. Write a universal proof for each proposition. (a) For all real numbers ๐ฅ, (๐ฅ + 2)2 = ๐ฅ2 + 4๐ฅ + 4. (b) For all integers ๐ฅ, ๐ฅ − 1 divides ๐ฅ3 − 1. (c) The square of every even integer is even. 4. Show why a proof of (∀๐ฅ)๐(๐ฅ) is an application of direct proof. 5. Write existential proofs for each proposition. (a) There exists a real number ๐ฅ such that ๐ฅ − ๐ = 9. (b) There exists an integer ๐ฅ such that ๐ฅ2 + 2๐ฅ − 3 = 0. (c) The square of some integer is odd. 6. Prove each of the following by writing a paragraph proof. (a) For all real numbers ๐ฅ, ๐ฆ, and ๐ง, ๐ฅ2 ๐ง + 2๐ฅ๐ฆ๐ง + ๐ฆ2 ๐ง = ๐ง(๐ฅ + ๐ฆ)2 . (b) There exist real numbers ๐ข and ๐ฃ such that 2๐ข + 5๐ฃ = −29. (c) For all real numbers ๐ฅ, there exists a real number ๐ฆ so that ๐ฅ − ๐ฆ = 10. (d) There exists an integer ๐ฅ such that for all integers ๐ฆ, ๐ฆ๐ฅ = ๐ฅ. (e) For all real numbers ๐, ๐, and ๐, there exists a complex number ๐ฅ such that ๐๐ฅ2 + ๐๐ฅ + ๐ = 0. (f) There exists an integer that divides every integer. 7. Provide counterexamples for each of the following false propositions. (a) Every integer is a solution to ๐ฅ + 1 = 0. (b) For every integer ๐ฅ, there exists an integer ๐ฆ such that ๐ฅ๐ฆ = 1. (c) The product of any two integers is even. (d) For every integer ๐, if ๐ is even, then ๐2 is a multiple of eight. 8. Assuming that ๐, ๐, ๐, and ๐ are integers with ๐ ≠ 0 and ๐ ≠ 0, give paragraph proofs using direct proof for the following divisibility results. (a) If ๐ divides ๐, then ๐ divides ๐๐. (b) If ๐ divides ๐ and ๐ divides ๐, then ๐2 divides ๐๐. (c) If ๐ divides ๐ and ๐ divides ๐, then ๐๐ divides ๐๐. (d) If ๐ divides ๐ and ๐ divides ๐, then ๐ divides ๐. 9. Write paragraph proofs using direct proof. (a) The sum of two even integers is even. (b) The sum of two odd integers is even. (c) The sum of an even and an odd is odd. (d) The product of two even integers is even. (e) The product of two odd integers is odd. (f) The product of an even and an odd is even. 10. Let ๐ and ๐ be integers. Write paragraph proofs. Section 2.4 PROOF METHODS 115 (a) If ๐ and ๐ are even, then ๐4 + ๐4 + 32 is divisible by 8. (b) If ๐ and ๐ are odd, then 4 divides ๐4 + ๐4 + 6. 11. Prove the following by using direct proof to prove the contrapositive. (a) For every integer ๐, if ๐4 is even, then ๐ is even. (b) For all integers ๐, if ๐3 + ๐2 is odd, then ๐ is odd. (c) For all integers ๐ and ๐, if ๐๐ is even, then ๐ is even or ๐ is even. 12. Write paragraph proofs. (a) The equation ๐ฅ − 10 = 23 has a unique solution. √ (b) The equation 2๐ฅ − 5 = 2 has a unique solution. (c) For every real number ๐ฆ, the equation 2๐ฅ + 5๐ฆ = 10 has a unique solution. (d) The equation ๐ฅ2 + 5๐ฅ + 6 = 0 has at most two integer solutions. 13. From the proof of Example 2.4.14, prove that ๐ > ๐ ′ and ๐ข < ๐. 14. Prove the results of Exercise 9 indirectly. 15. Prove using the method of biconditional proof. (a) ๐ ∨ ๐ → ¬๐, ๐ → ๐, ¬๐ ∨ ๐ → ๐ โข ๐ ↔ ¬๐ (b) ๐ ∨ (¬๐ ∨ ๐), ๐ ∨ (¬๐ ∨ ๐) โข ๐ ↔ ๐ (c) (๐ → ๐) ∧ (๐ → ๐ ), (๐ → ¬๐ ) ∧ (๐ → ¬๐), ๐ ∨ ๐ โข ๐ ↔ ¬๐ (d) ๐ ∧ ๐ → ¬(๐ ∨ ๐ก), ¬๐ ∨ ¬๐ก → ๐ ∧ ๐ โข ๐ ↔ ๐ก 16. Prove using the short rule of biconditional proof. (a) ๐ → ๐ ∧ ๐ ⇔ (๐ → ๐) ∧ (๐ → ๐) (b) ๐ → ๐ ∨ ๐ ⇔ ๐ ∧ ¬๐ → ๐ (c) ๐ ∨ ๐ → ๐ ⇔ (๐ → ๐) ∧ (๐ → ๐) (d) ๐ ∧ ๐ → ๐ ⇔ (๐ → ๐) ∨ (๐ → ๐) 17. Let ๐, ๐, ๐, and ๐ be integers. Write paragraph proofs using biconditional proof. (a) ๐ is even if and only if ๐2 is even. (b) ๐ is odd if and only if ๐ + 1 is even. (c) ๐ is even if and only if ๐ + 2 is even. (d) ๐3 + ๐2 + ๐ is even if and only if ๐ is even. (e) If ๐ ≠ 0, then ๐ divides ๐ if and only if ๐๐ divides ๐๐. 18. Suppose that ๐ and ๐ are integers. Prove that the following are equivalent. โ ๐ divides ๐. โ ๐ divides −๐. โ −๐ divides ๐. โ −๐ divides −๐. 19. Let ๐ be an integer. Prove that the following are equivalent. โ ๐ is divisible by 3. โ 3๐ is divisible by 9. โ ๐ + 3 is divisible by 3. 116 Chapter 2 FIRST-ORDER LOGIC 20. Prove by using direct proof but do not use the contrapositive: for all integers ๐ and ๐, if ๐๐ is even, then ๐ is even or ๐ is even. 21. Prove using proof by cases (Inference Rule 2.4.26). (a) For all integers ๐, if ๐ = 0 or ๐ = 0, then ๐๐ = 0. (b) The square of every odd integer is of the form 8๐ + 1 for some integer ๐. (Hint: Square 2๐ + 1. Then consider two cases: ๐ is even and ๐ is odd.) (c) ๐2 + ๐ + 1 is odd for every integer ๐. (d) The fourth power of every odd integer is of the form 16๐+1 for some integer ๐. (e) Every nonhorizontal line intersects the ๐ฅ-axis. 22. Let ๐ be an integer. Prove by cases. (a) 2 divides ๐(๐ + 1) (b) 3 divides ๐(๐ + 1)(๐ + 2) 23. For any real number ๐, the absolute value of ๐ is defined as { ๐ if ๐ ≥ 0, |๐| = −๐ if ๐ < 0. Let ๐ be a positive real number. Prove the following about absolute value for every real number ๐ฅ. (a) | − ๐ฅ| = |๐ฅ| (b) |๐ฅ2 | = |๐ฅ|2 (c) ๐ฅ ≤ |๐ฅ| (d) |๐ฅ๐ฆ| = |๐ฅ| |๐ฆ| (e) |๐ฅ| < ๐ if and only if −๐ < ๐ฅ < ๐. (f) |๐ฅ| > ๐ if and only if ๐ฅ > ๐ or ๐ฅ < −๐. 24. Take ๐ and ๐ to be nonzero integers and prove the given propositions. (a) ๐ divides 1 if and only if ๐ = ±1. (b) If ๐ = ±๐, then |๐| = |๐|. CHAPTER 3 SET THEORY 3.1 SETS AND ELEMENTS The development of logic that resulted in the work of Chapters 1 and 2 went through many stages and benefited from the work of various mathematicians and logicians through the centuries. Although modern logic can trace its roots to Descartes with his mathesis universalis and Gottfried Leibniz’s De Arte Combinatoria (1666), the beginnings of modern symbolic logic is generally attributed to Augustus De Morgan [Formal Logic (1847)], George Boole [Mathematical Analysis of Logic (1847) and An Investigation of the Laws of Thought (1847)], and Frege [Begriffsschrift (1879), Die Grundlagen der Arithmetik (1884), and Grundgesetze der Arithmetik (1893)]. However, when it comes to set theory, it was Georg Cantor who, with his first paper, “Ueber eine Eigenschaft des Inbegriffs aller reellen algebraischen Zahlen” (1874), and over a decade of research, is the founder of the subject. For the next four chapters, Cantor’s set theory will be our focus. A set is a collection of objects known as elements. An element can be almost anything, such as numbers, functions, or lines. A set is a single object that can contain many elements. Think of it as a box with things inside. The box is the set, and the things are the elements. We use uppercase letters to label sets, and elements will usu117 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 118 Chapter 3 SET THEORY ally be represented by lowercase letters. The symbol ∈ (fashioned after the Greek letter epsilon) is used to mean “element of,” so if ๐ด is a set and ๐ is an element of ๐ด, write ∈ ๐๐ด or, the more standard, ๐ ∈ ๐ด. The notation ๐, ๐ ∈ ๐ด means ๐ ∈ ๐ด and ๐ ∈ ๐ด. If ๐ is not an element of ๐ด, write ๐ ∉ ๐ด. If ๐ด contains no elements, it is the empty set. It is represented by the symbol ∅. Think of the empty set as a box with no things inside. Rosters Since the elements are those that distinguish one set from another, one method that is used to write a set is to list its elements and surround them with braces. This is called the roster method of writing a set, and the list is known as a roster. The braces signify that a set has been defined. For example, the set of all integers between 1 and 10 inclusive is {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}. Read this as “the set containing 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10.” The set of all integers between 1 and 10 exclusive is {2, 3, 4, 5, 6, 7, 8, 9}. If the roster is too long, use ellipses (… ). When there is a pattern to the elements of the set, write down enough members so that the pattern is clear. Then use the ellipses to represent the continuing pattern. For example, the set of all integers inclusively between 1 and 1,000,000 can be written as {1, 2, 3, … , 999,999, 1,000,000}. Follow this strategy to write infinite sets as rosters. For instance, the set of even integers can be written as {… , −4, −2, 0, 2, 4, … }. EXAMPLE 3.1.1 โ As a roster, { } denotes the empty set. Warning: Never write {∅} for the empty set. This set has one element in it. โ A set that contains exactly one element is called a singleton. Hence, the sets {1}, {๐ }, and {∅} are singletons written in roster form. Also, 1 ∈ {1}, ๐ ∈ {๐ }, and ∅ ∈ {∅}. โ The set of linear functions that intersect the origin with an integer slope can be written as: {… , −2๐ฅ, −๐ฅ, 0, ๐ฅ, 2๐ฅ, … }. (Note: Here 0 represents the function ๐ (๐ฅ) = 0.) Let ๐ด and ๐ต be sets. These are equal if they contain exactly the same elements. The notation for this is ๐ด = ๐ต. What this means is if any element is in ๐ด, it is also in ๐ต, and Section 3.1 SETS AND ELEMENTS 119 conversely, if an element is in ๐ต, it is in ๐ด. To fully understand set equality, consider again the analogy between sets and boxes. Suppose that we have a box containing a carrot and a rabbit. We could describe it with the phrase the box that contains the carrot and the rabbit. Alternately, it could be referred to as the box that contains the orange vegetable and the furry, cotton-tailed animal with long ears. Although these are different descriptions, they do not refer to different boxes. Similarly, the set {1, 3} and the set containing the solutions of (๐ฅ − 1)(๐ฅ − 3) = 0 are equal because they contain the same elements. Furthermore, the order in which the elements are listed does not matter. The box can just as easily be described as the box with the rabbit and the carrot. Likewise, {1, 3} = {3, 1}. Lastly, suppose that the box is described as containing the carrot, the rabbit, and the carrot, forgetting that the carrot had already been mentioned. This should not be confusing, for one understands that such mistakes are possible. It is similar with sets. A repeated element does not add to the set. Hence, {1, 3} = {1, 3, 1}. Famous Sets Although sets can contain many different types of elements, numbers are probably the most common for mathematics. For this reason particular important sets of numbers have been given their own symbols. Symbol ๐ ๐ ๐ ๐ ๐ Name The set of natural numbers The set of integers The set of rational numbers The set of real numbers The set of complex numbers As rosters, ๐ = {0, 1, 2, … } and ๐ = {… , −2, −1, 0, 1, 2, … }. Notice that we define the set of natural numbers to include zero and do not make a distinction between counting numbers and whole numbers. Instead, write ๐+ = {1, 2, 3, … } and ๐− = {… , −3, −2, −1}. EXAMPLE 3.1.2 โ 10 ∈ ๐+ , but 0 ∉ ๐+ . โ 4 ∈ ๐, but −5 ∉ ๐. โ −5 ∈ ๐, but .65 ∉ ๐. 120 Chapter 3 SET THEORY โ .65 ∈ ๐ and 1โ2 ∈ ๐, but ๐ ∉ ๐. โ ๐ ∈ ๐, but 3 − 2๐ ∉ ๐. โ 3 − 2๐ ∈ ๐. Of the sets mentioned above, the real numbers are probably the most familiar. It is the set of numbers most frequently used in calculus and is often represented by a number line. The line can be subdivided into intervals. Given two endpoints, an interval includes all real numbers between the endpoints and possibly the endpoints themselves. Interval notation is used to name these sets. A parenthesis next to an endpoint means that the endpoint is not included in the set, while a bracket means that the endpoint is included. If the endpoints are included, the interval is closed. If they are excluded, the interval is open. If one endpoint is included and the other is not, the interval is half-open. If the interval has only one endpoint, then the set is called a ray and is defined using the infinity symbol (∞), with or without the negative sign. DEFINITION 3.1.3 Let ๐, ๐ ∈ ๐ such that ๐ < ๐. closed interval open interval half-open interval half-open interval [๐, ๐] (๐, ๐) [๐, ๐) (๐, ๐] closed ray closed ray open ray open ray [๐, ∞) (−∞, ๐] (๐, ∞) (−∞, ๐) EXAMPLE 3.1.4 We can describe (4, 7) as the interval of real numbers between 4 and 7 exclusive and [4, 7] as all real numbers between 4 and 7 inclusive. There is not a straightforward way to name the half-open intervals. For (4, 7], we can try the set of all real numbers ๐ฅ such that 4 < ๐ฅ ≤ 7 or the set of all real numbers greater than 4 and less than or equal to 7. The infinity symbol does not represent a real number, so a parenthesis must be used with it. Furthermore, the interval (−∞, ∞) can be used to denote ๐. Section 3.1 SETS AND ELEMENTS −4 −3 −2 −1 0 1 2 3 4 −4 −3 −2 −1 (a) (−1, 3] 0 1 2 3 121 4 (b) (−∞, 2] Figure 3.1 A half-open interval and a closed ray. EXAMPLE 3.1.5 The interval (−1, 3] contains all real numbers that are greater than −1 but less than or equal to 3 [Figure 3.1(a)]. A common mistake is to equate (−1, 3] with {0, 1, 2, 3}. It is important to remember that (−1, 3] includes all real numbers between −1 and 3. Hence, this set is infinite, as is (−∞, 2). It contains all real numbers less than 2 [Figure 3.1(b)]. EXAMPLE 3.1.6 Let ๐(๐ฅ) := ๐ฅ + 2 = 7. Since ๐(5) and there is no other real number ๐ such that ๐(๐), there exists a unique ๐ฅ ∈ ๐ such that ๐(๐ฅ). However, if ๐ is an element of ๐− or (−∞, 5), then ¬๐(๐). EXAMPLE 3.1.7 If ๐(๐ฅ) := ๐ฅ ≥ 10, then ๐(๐ฅ) for all ๐ฅ ∈ [20, 100], there exists ๐ฅ ∈ ๐ such that ¬๐(๐ฅ), and there is no element ๐ of {1, 2, 3} such that ๐(๐). Abstraction When trying to write sets as rosters, we quickly discover issues with the technique. Since the rational numbers are defined using integers, we suspect that ๐ can be written as a roster, but when we try to begin a list, such as 1 1 2 1, , , … , 2, , … , 2 3 3 we realize that there are complications with the pattern and are not quite sure that the we will exhaust them all. When considering ๐, we know immediately that a roster is out of the question. We conclude that we need another method. Fix a first-order alphabet with theory symbols ๐ฒ. Let ๐ด be a set and ๐ฒ-formula ๐(๐ฅ) have the property that for every ๐, ๐ ∈ ๐ด ⇔ ๐(๐). Notice that ๐(๐ฅ) completely describes the members of ๐ด. Namely, whenever we write ๐ ∈ ๐ด, we can also write ๐(๐), and, conversely, whenever we write ๐(๐), we can also write ๐ ∈ ๐ด. For example, let ๐ธ be the set of even integers. As a roster, ๐ธ = {… , −4, −2, 0, 2, 4, … }. 122 Chapter 3 SET THEORY Let ๐(๐ฅ) denote the formula ∃๐(๐ ∈ ๐ ∧ ๐ฅ = 2๐). (3.1) The even integers are exactly those numbers ๐ฅ such that ๐(๐ฅ). In particular, we have ๐(2) and ๐(−4) but not ๐(5). Therefore, 2 and −4 are elements of ๐ธ, but 5 is not. DEFINITION 3.1.8 Let ๐ be a first-order alphabet with theory symbols ๐ฒ. Let ๐(๐ฅ) be an ๐ฒ-formula and ๐ด be a set. Write ๐ด = {๐ฅ : ๐(๐ฅ)} to mean ๐ ∈ ๐ด ⇔ ๐(๐). Using {๐ฅ : ๐(๐ฅ)} to identify ๐ด is called the method of abstraction. Using Definition 3.1.8 and (3.1), write ๐ธ using abstraction as ๐ธ = {๐ฅ : ∃๐(๐ ∈ ๐ ∧ ๐ฅ = 2๐)} or ๐ธ = {๐ฅ : ๐(๐ฅ)}. Read this as “the set of ๐ฅ such that ๐(๐ฅ).” Because ๐ฅ = 2๐, it is customary to remove ๐ฅ from the definition of sets like ๐ธ and write ๐ธ = {2๐ : ∈ ๐๐}, or ๐ธ = {2๐ : ๐ ∈ ๐}. Read this as “the set of all 2๐ such that ๐ is an integer.” This simplified notation is still considered abstraction. Its form can be summarized as {elements : condition}. That is, what the elements look like come before the colon, and the condition that must be satisfied to be an element of the set comes after the colon. EXAMPLE 3.1.9 Given the quadratic equation ๐ฅ2 − ๐ฅ − 2 = 0, we know that its solutions are −1 and 2. Thus, its solution set is ๐ด = {−1, 2}. Using the method of abstraction, this can be written as ๐ด = {๐ฅ : − − ⋅ ๐ฅ๐ฅ๐ฅ2 = 0 ∧ ∈๐ฅ๐}. However, as we have seen, it is customary to write the formula so that it is easier to read, so ๐ด : ๐ฅ2 − ๐ฅ − 2 = 0 ∧ ๐ฅ ∈ ๐}, Section 3.1 SETS AND ELEMENTS 123 or, using a common notation, ๐ด{๐ฅ ∈ ๐ : ๐ฅ2 − ๐ฅ − 2 = 0}. Therefore, given an arbitrary polynomial ๐ (๐ฅ), its solution set over the real numbers is {๐ฅ ∈ ๐ : ๐ (๐ฅ) = 0}. EXAMPLE 3.1.10 Since ๐ฅ ∉ ∅ is always true, to write the empty set using abstraction, we use a formula like ๐ฅ ≠ ๐ฅ or a contradiction like ๐ ∧ ¬๐ , where ๐ is a propositional form. Then, ∅ = {๐ฅ ∈ ๐ : ๐ฅ ≠ ๐ฅ} = {๐ฅ : ๐ ∧ ¬๐ }. EXAMPLE 3.1.11 Using the natural numbers as the starting point, ๐ and ๐ can be defined using the abstraction method by writing ๐ = {๐ : ๐ ∈ ๐ ∨ −๐ ∈ ๐} and ๐= { } ๐ : ๐, ๐ ∈ ๐ ∧ ๐ ≠ 0 . ๐ Notice the redundancy in the definition of ๐. The fraction 1โ2 is named multiple times like 2โ4 or 9โ18, but remember that this does not mean that the numbers appear infinitely many times in the set. They appear only once. EXAMPLE 3.1.12 The open intervals can be defined using abstraction. (๐, ๐) = {๐ฅ ∈ ๐ : ๐ < ๐ฅ < ๐}, (๐, ∞) = {๐ฅ ∈ ๐ : ๐ < ๐ฅ}, (−∞, ๐) = {๐ฅ ∈ ๐ : ๐ฅ < ๐}. See Exercise 9 for the closed and half-open intervals. Also, as with ๐+ and ๐− , the superscript + or − is always used to denote the positive or negative numbers, respectively, of a set. For example, ๐+ = {๐ฅ ∈ ๐ : ๐ฅ > 0} and ๐− = {๐ฅ ∈ ๐ : ๐ฅ < 0}. 124 Chapter 3 SET THEORY EXAMPLE 3.1.13 Each of the following are written using the abstraction method and, where appropriate, as a roster. โ The set of all rational numbers with denominator equal to 3 in roster form is } { 3 2 1 0 1 2 3 …,− ,− ,− , , , , ,… . 3 3 3 3 3 3 3 Using abstraction, it is { } ๐ :๐∈๐ . 3 โ The set of all linear polynomials with integer coefficients and leading coefficient equal to 1 is {… , ๐ฅ − 2, ๐ฅ − 1, ๐ฅ, ๐ฅ + 1, ๐ฅ + 2, … }. Using abstraction it is {๐ฅ + ๐ : ๐ ∈ ๐}. โ The set of all polynomials of degree at most 5 can be written as {๐5 ๐ฅ5 + · · · + ๐1 ๐ฅ + ๐0 : ๐๐ ∈ ๐ ∧ ๐ = 0, 1, … , 5}. Exercises 1. Determine whether the given propositions are true or false. (a) 0 ∈ ๐ (b) 1โ2 ∈ ๐ (c) −4 ∈ ๐ (d) 4 + ๐ ∈ ๐ (e) 4.34534 ∈ ๐ (f) {1, 2} = {2, 1} (g) {1, 2} = {1, 2, 1} (h) [1, 2] = {1, 2} (i) (1, 3) = {2} (j) −1 ∈ (−∞, −1) (k) −1 ∈ [−1, ∞) (l) ∅ ∈ (−2, 2) (m) ∅ ∈ ∅ (n) 0 ∈ ∅ 2. Write the given sets as rosters. (a) The set of all integers between 1 and 5 inclusive (b) The set of all odd integers (c) The set of all nonnegative integers Section 3.1 SETS AND ELEMENTS 125 (d) The set of integers in the interval (−3, 7] (e) The set of rational numbers in the interval (0, 1) that can be represented with exactly two decimal places 3. If possible, find an element ๐ in the given set such that ๐ + 3.14 = 0. (a) ๐ (b) ๐ (c) ๐+ (d) ๐− (e) ๐ (f) ๐ (g) (0, 6) (h) (−∞, −1) 4. Determine whether the following are true or false. (a) 1 ∈ {๐ฅ : ๐(๐ฅ)} when ¬๐(1) (b) 7 ∈ {๐ฅ ∈ ๐ : ๐ฅ2 − 5๐ฅ − 14 = 0} (c) 7๐ฅ2 − 0.5๐ฅ ∈ {๐2 ๐ฅ2 + ๐1 ๐ฅ + ๐0 : ๐๐ ∈ ๐} (d) ๐ฅ๐ฆ ∈ {2๐ : ๐ ∈ ๐} if ๐ฅ is even and ๐ฆ is odd. (e) cos ๐ ∈ {๐ cos ๐ + ๐ sin ๐ : ๐, ๐ ∈ ๐} (f) {1, 3} = {๐ฅ : (๐ฅ − 1)(๐ฅ − 3) = 0} (g) {1, 3} = {๐ฅ : (๐ฅ − 1)(๐ฅ − 3)2 = 0} } {[ ๐ 0 ] } {[ ๐ฅ 0 ] (h) 0 ๐ฆ : ๐ฅ ∈ ๐ and ๐ฆ = 0 00 :๐∈๐ = 5. Write the following given sets in roster form. (a) {−3๐ : ๐ ∈ ๐} (b) {0 ⋅ ๐ : ๐ ∈ ๐} (c) {๐ cos ๐ฅ : ๐ ∈ ๐} (d) {๐๐ฅ2 + ๐๐ฅ + ๐ : ๐ ∈ ๐} {[ ๐ 0 ] } (e) 00 :๐∈๐ 6. Use a formula to uniquely describe the elements in the following sets. For example, ๐ฅ ∈ ๐ if and only if ๐ฅ ∈ ๐ ∧ ๐ฅ ≥ 0. (a) (0, 1) (b) (−3, 3] (c) [0, ∞) (d) ๐+ (e) {… , −2, −1, 0, 1, 2, … } (f) {2๐, 4๐, 6๐, … } 7. Write each set using the method of abstraction. (a) All odd integers (b) All positive rational numbers (c) All integer multiples of 7 (d) All integers that have a remainder of 1 when divided by 3 126 Chapter 3 SET THEORY (e) All ordered pairs of real numbers in which the ๐ฅ-coordinate is positive and the ๐ฆ-coordinate is negative (f) All complex numbers whose real part is 0 (g) All closed intervals that contain ๐ (h) All open intervals that do not contain a rational number (i) All closed rays that contain no numbers in (−∞, 3] (j) All 2 × 2 matrices with real entries that have a diagonal sum of 0 (k) All polynomials of degree at most 3 with real coefficients 8. Write the given sets of real numbers using interval notation. (a) The set of real numbers greater than 4 (b) The set of real numbers between −6 and −5 inclusive (c) The set of real numbers ๐ฅ so that ๐ฅ < 5 (d) The set of real numbers ๐ฅ such that 10 < ๐ฅ ≤ 14 9. Let ๐, ๐ ∈ ๐ with ๐ < ๐. Write the given intervals using the abstraction method. (a) [๐, ๐] (b) [๐, ∞) (c) (−∞, ๐] (d) (๐, ๐] 3.2 SET OPERATIONS We now use connectives to define the set operations. These allow us to build new sets from given ones. Union and Intersection The first set operation is defined using the ∨ connective. DEFINITION 3.2.1 The union of ๐ด and ๐ต is ๐ด ∪ ๐ต = {๐ฅ : ๐ฅ ∈ ๐ด ∨ ๐ฅ ∈ ๐ต}. The union of sets can be viewed as the combination of all elements from the sets. On the other hand, the next set operation is defined with ∧ and can be considered as the overlap between the given sets. DEFINITION 3.2.2 The intersection of ๐ด and ๐ต is ๐ด ∩ ๐ต = {๐ฅ : ๐ฅ ∈ ๐ด ∧ ๐ฅ ∈ ๐ต}. Section 3.2 SET OPERATIONS U A B U A (a) ๐ด ∪ ๐ต Figure 3.2 127 B (b) ๐ด ∩ ๐ต Venn diagrams for union and intersection. For example, if ๐ด = {1, 2, 3, 4} and ๐ต = {3, 4, 5, 6}, then ๐ด ∪ ๐ต = {1, 2, 3, 4, 5, 6} and ๐ด ∩ ๐ต = {3, 4}. The operations of union and intersection can be illustrated with pictures called Venn diagrams, named after the logician John Venn who used a variation of these drawings in his text on symbolic logic (Venn 1894). First, assume that all elements are members of a fixed universe ๐ (page 97). In set theory, the universe is considered to be the set of all possible elements in a given situation. Use circles to represent sets and a rectangle to represent ๐ . The space inside these shapes represent where elements might exist. Shading is used to represent where elements might exist after applying some set operations. The Venn diagram for union is in Figure 3.2(a) and the one for intersection is in Figure 3.2(b). If sets have no elements in their intersection, we can use the next definition to name them. DEFINITION 3.2.3 The sets ๐ด and ๐ต are disjoint or mutually exclusive when ๐ด ∩ ๐ต = ∅. The sets {1, 2, 3} and {6, 7} are disjoint. A Venn diagram for two disjoint sets is given in Figure 3.3. Set Difference The next two set operations take all of the elements of one set that are not in another. They are defined using the not connective. DEFINITION 3.2.4 The set difference of ๐ต from ๐ด is ๐ด โงต ๐ต = {๐ฅ : ๐ฅ ∈ ๐ด ∧ ๐ฅ ∉ ๐ต}. 128 Chapter 3 SET THEORY U A B Figure 3.3 A Venn diagram for disjoint sets. The complement of ๐ด is defined as ๐ด = ๐ โงต ๐ด = {๐ฅ : ๐ฅ ∈ ๐ ∧ ๐ฅ ∉ ๐ด}. Read ๐ด โงต ๐ต as “๐ด minus ๐ต” or “๐ด without ๐ต.” See Figure 3.4(a) for the Venn diagram of the set difference of sets and Figure 3.4(b) for the complement of a set. EXAMPLE 3.2.5 The following equalities use set difference. โ ๐ = ๐ โงต ๐− โ The set of irrational numbers is ๐ โงต ๐. โ ๐ = ๐ โงต {๐ + ๐๐ : ๐, ๐ ∈ ๐ ∧ ๐ ≠ 0} U A B (a) ๐ด โงต ๐ต Figure 3.4 U A (b) ๐ด Venn diagrams for set difference and complement. Section 3.2 SET OPERATIONS 129 EXAMPLE 3.2.6 Let ๐ = {1, 2, … , 10}. Use a Venn diagram to find the results of the set operations on ๐ด = {1, 2, 3, 4, 5} and ๐ต = {3, 4, 5, 6, 7, 8}. Each element will be represented as a point and labeled with a number. U A 1 2 3 4 5 6 B 7 8 9 10 โ ๐ด ∪ ๐ต = {1, 2, 3, 4, 5, 6, 7, 8} โ ๐ด ∩ ๐ต = {3, 4, 5} โ ๐ด โงต ๐ต = {1, 2} โ ๐ด = {6, 7, 8, 9, 10} EXAMPLE 3.2.7 Let ๐ถ = (−4, 2), ๐ท = [−1, 3], and ๐ = ๐. Use the diagram to perform the set operations. C∪D −4 −3 −2 −1 0 1 2 3 4 2 3 4 C∩D −4 −3 −2 −1 C๎D โ ๐ถ ∪ ๐ท = (−4, 3] โ ๐ถ ∩ ๐ท = [−1, 2) โ ๐ถ โงต ๐ท = (−4, −1) โ ๐ถ = (−∞, −4] ∪ [2, ∞) 0 1 130 Chapter 3 SET THEORY Cartesian Products The last set operation is not related to the logic connectives as the others, but it is nonetheless very important to mathematics. Let ๐ด and ๐ต be sets. Given elements ๐ ∈ ๐ด and ๐ ∈ ๐ต, we call (๐, ๐) an ordered pair. In this context, ๐ and ๐ are called coordinates. It is similar to the set {๐, ๐} except that the order matters. The definition is due to Kazimierz Kuratowski (1921). DEFINITION 3.2.8 If ๐ ∈ ๐ด and ๐ ∈ ๐ต, (๐, ๐) = {{๐}, {๐, ๐}}. Notice that (๐, ๐) = (๐′ , ๐′ ) means that {{๐}, {๐, ๐}} = {{๐′ }, {๐′ , ๐′ }}, which implies that ๐ = ๐′ and ๐ = ๐′ . Therefore, (๐, ๐) = (๐′ , ๐′ ) if and only if ๐ = ๐′ and ๐ = ๐′ . The set of all ordered pairs with the first coordinate from ๐ด and the second from ๐ต is named after René Descartes. DEFINITION 3.2.9 The Cartesian product of ๐ด and ๐ต is ๐ด × ๐ต = {(๐, ๐) : ๐ ∈ ๐ด ∧ ๐ ∈ ๐ต}. The product ๐2 = ๐ × ๐ is the set of ordered pairs of the Cartesian plane. EXAMPLE 3.2.10 โ Since (1, 2) ≠ (2, 1), {1, 2} × {0, 1, 2} = {(1, 0), (1, 1), (1, 2), (2, 0), (2, 1), (2, 2)}. Even though we have a set definition for an ordered pair, we can still visually represent this set on a grid as in Figure 3.5(a). โ If ๐ด = {1, 2, 7} and ๐ต = {∅, {1, 5}}, ๐ด × ๐ต = {(1, ∅), (1, {1, 5}), (2, ∅), (2, {1, 5}), (7, ∅), (7, {1, 5})}. See Figure 3.5(b). โ For any set ๐ด, ∅ × ๐ด = ๐ด × ∅ = ∅ because ¬∃๐ฅ(๐ฅ ∈ ∅ ∧ ๐ฆ ∈ ๐ด). Section 3.2 SET OPERATIONS (1, 2) (2, {1, 5}) 2 {1, 5} 1 ∅ 0 1 (7, ∅) 2 1 (a) {1, 2} × {0, 1, 2} Figure 3.5 2 7 (b) {1, 2, 7} × {∅, {1, 5}} Two Cartesian products. We generalize Definition 3.2.8 by defining (๐, ๐, ๐) = {{๐}, {๐, ๐}, {๐, ๐, ๐}}, (๐, ๐, ๐, ๐) = {{๐}, {๐, ๐}, {๐, ๐, ๐}, {๐, ๐, ๐, ๐}}, and for ๐ ∈ ๐, (๐0 , ๐1 , … , ๐๐−1 ) = {{๐0 }, {๐0 , ๐1 }, … , {๐0 , ๐1 , … , ๐๐−1 }}, which is called an ordered n-tuple. Then, ๐ด × ๐ต × ๐ถ = {(๐, ๐, ๐) : ๐ ∈ ๐ด ∧ ๐ ∈ ๐ต ∧ ๐ ∈ ๐ถ}, ๐ด × ๐ต × ๐ถ × ๐ท = {(๐, ๐, ๐, ๐) : ๐ ∈ ๐ด ∧ ๐ ∈ ๐ต ∧ ๐ ∈ ๐ถ ∧ ๐ ∈ ๐ท}, and ๐ด๐ = {(๐0 , ๐1 , … , ๐๐−1 ) : ๐๐ ∈ ๐ด ∧ ๐ = 0, 1, … , ๐ − 1}. Specifically, ๐3 = ๐ × ๐ × ๐, and ๐๐ = ๐ × ๐ × · · · × ๐, โโโโโโโโโโโโโโโโโโโ ๐ times which is known as Cartesian n-space. EXAMPLE 3.2.11 Let ๐ด = {1}, ๐ต = {2}, ๐ถ = {3}, and ๐ท = {4}. Then, ๐ด × ๐ต × ๐ถ × ๐ท = {(1, 2, 3, 4)} is a singleton containing an ordered 4-tuple. Also, (๐ด × ๐ต) ∪ (๐ถ × ๐ท) = {(1, 2), (3, 4)}, but ๐ด × (๐ต ∪ ๐ถ) × ๐ท = {(1, 2, 4), (1, 3, 4)}. 131 132 Chapter 3 SET THEORY Order of Operations As with the logical connectives, we need an order to make sense of expressions that involve many operations. To do this, we note the association between the set operations and certain logical connectives. } ๐ด ¬ ๐ดโงต๐ต ๐ด∩๐ต ∧ ๐ด∪๐ต ∨ From this we derive the order for the set operations. DEFINITION 3.2.12 [Order of Operations] To find a set determined by set operations, read from left to right and use the following precedence. โ sets within parentheses (innermost first), โ complements, โ set differences, โ intersections, โ unions. EXAMPLE 3.2.13 If the universe is taken to be {1, 2, 3, 4, 5}, then {5} ∪ {1, 2} ∩ {2, 3} = {5} ∪ {3, 4, 5} ∩ {2, 3} = {5} ∪ {3} = {3, 5} This set can be written using parentheses as {5} ∪ ({1, 2} ∩ {2, 3}). However, ({5} ∪ {1, 2}) ∩ {2, 3} = ({5} ∪ {3, 4, 5}) ∩ {2, 3} = {3, 4, 5} ∩ {2, 3} = {3}, showing that {5} ∪ {1, 2} ∩ {2, 3} ≠ ({5} ∪ {1, 2}) ∩ {2, 3}. Section 3.2 SET OPERATIONS 133 EXAMPLE 3.2.14 Define ๐ด = {1}, ๐ต = {2}, ๐ถ = {3}, and ๐ท = {4}. Then, we have ๐ด ∪ ๐ต ∪ ๐ถ ∪ ๐ท = {1, 2, 3, 4}. Written with parentheses and brackets, ๐ด ∪ ๐ต ∪ ๐ถ ∪ ๐ท = ([(๐ด ∪ ๐ต) ∪ ๐ถ] ∪ ๐ท). With the given assignments, ๐ด ∩ ๐ต ∩ ๐ถ ∩ ๐ท is empty. Exercises 1. Each of the given propositions are false. Replace the underlined word with another word to make the proposition true. (a) Intersection is defined using a disjunction. (b) Set diagrams are used to illustrate set operations. (c) ๐ โงต (๐ โงต ๐) is the set of irrational numbers. (d) Set difference has a higher order of precedence than complements. (e) The complement of ๐ด is equal to ๐ set minus ๐ด. (f) The intersection of two intervals is always an interval. (g) The union of two intervals is never an interval. (h) ๐ด × ๐ต is equal to ∅ if ๐ต does not contain ordered pairs. 2. Let ๐ด = {0, 2, 4, 6}, ๐ต = {3, 4, 5, 6}, ๐ถ = {0, 1, 2}, and ๐ = {0, 1, … , 9, 10}. Write the given sets in roster notation. (a) ๐ด ∪ ๐ต (b) ๐ด ∩ ๐ต (c) ๐ด โงต ๐ต (d) ๐ต โงต ๐ด (e) ๐ด (f) ๐ด × ๐ต (g) ๐ด ∪ ๐ต ∩ ๐ถ (h) ๐ด ∩ ๐ต ∪ ๐ถ (i) ๐ด ∩ ๐ต (j) ๐ด ∪ ๐ด ∩ ๐ต (k) ๐ด โงต ๐ต โงต ๐ถ (l) ๐ด ∪ ๐ต โงต ๐ด ∩ ๐ถ 3. Write each of the given sets using interval notation. ] (a) [2, 3] ∩ (5โ2, 7 (b) (−∞, 4) ∪ (−6, ∞) (c) (−2, 4) ∩ [−6, ∞) (d) ∅ ∪ (4, 12] 134 Chapter 3 SET THEORY 4. Identify each of the the given sets. (a) [6, 17] ∩ [17, 32) (b) [6, 17) ∩ [17, 32) (c) [6, 17] ∪ (17, 32) (d) [6, 17) ∪ (17, 32) 5. Draw Venn diagrams. (a) ๐ด ∩ ๐ต (b) ๐ด ∪ ๐ต (c) ๐ด ∩ ๐ต ∪ ๐ถ (d) (๐ด ∪ ๐ต) โงต ๐ถ (e) ๐ด ∩ ๐ถ ∩ ๐ต (f) ๐ด โงต ๐ต ∩ ๐ถ 6. Match each Venn diagram to as many sets as possible. (A) (B) (C) (D) (E) (F) (a) (b) (c) (d) (e) (f) (g) (h) (i) (j) ๐ด∪๐ต ๐ดโงต๐ต ๐ด∩๐ต (๐ด ∪ ๐ต) โงต (๐ด ∩ ๐ต) ๐ด∩๐ต∪๐ด∩๐ถ ∪๐ต∩๐ถ ๐ด∩๐ต∪๐ดโงต๐ต (๐ด ∪ ๐ต) ∩ (๐ด ∪ ๐ต) [(๐ด ∪ ๐ต) ∩ ๐ถ] ∪ [(๐ด ∪ ๐ต) ∩ ๐ถ] [๐ด ∩ ๐ต ∩ ๐ถ] ∪ [๐ด ∩ ๐ต ∩ ๐ถ] ๐ด โงต (๐ต ∪ ๐ถ) ∩ ๐ต โงต (๐ด ∪ ๐ถ) ∩ ๐ถ โงต (๐ด ∪ ๐ต) 7. The ordered pair (1, 2) is paired with the ordered pair (2, 1) using a set operation. Write each resulting set as a roster. (a) (1, 2) ∪ (2, 1) (b) (1, 2) ∩ (2, 1) (c) (1, 2) โงต (2, 1) 8. Let ๐(๐ฅ) be a formula. Prove the following. (a) ∀๐ฅ[๐ฅ ∈ ๐ด ∪ ๐ต → ๐(๐ฅ)] ⇒ ∀๐ฅ[๐ฅ ∈ ๐ด ∩ ๐ต → ๐(๐ฅ)] Section 3.3 SETS WITHIN SETS 135 (b) ∀๐ฅ[๐ฅ ∈ ๐ด ∪ ๐ต → ๐(๐ฅ)] ⇔ ∀๐ฅ[๐ฅ ∈ ๐ด → ๐(๐ฅ)] ∧ ∀๐ฅ[๐ฅ ∈ ๐ต → ๐(๐ฅ)] (c) ∃๐ฅ[๐ฅ ∈ ๐ด ∩ ๐ต ∧ ๐(๐ฅ)] ⇒ ∃๐ฅ[๐ฅ ∈ ๐ด ∧ ๐(๐ฅ)] ∧ ∃๐ฅ[๐ฅ ∈ ๐ต ∧ ๐(๐ฅ)] (d) ∃๐ฅ[๐ฅ ∈ ๐ด ∪ ๐ต ∧ ๐(๐ฅ)] ⇔ ∃๐ฅ[๐ฅ ∈ ๐ด ∧ ๐(๐ฅ)] ∨ ∃๐ฅ[๐ฅ ∈ ๐ต ∧ ๐(๐ฅ)] 9. Find a formula ๐(๐ฅ) and sets ๐ด and ๐ต to show that the following are false. (a) ∀๐ฅ[๐ฅ ∈ ๐ด → ๐(๐ฅ)] ∨ ∀๐ฅ[๐ฅ ∈ ๐ต → ๐(๐ฅ)] ⇒ ∀๐ฅ[๐ฅ ∈ ๐ด ∪ ๐ต → ๐(๐ฅ)] (b) ∃๐ฅ[๐ฅ ∈ ๐ด ∧ ๐(๐ฅ)] ∧ ∃๐ฅ[๐ฅ ∈ ๐ต ∧ ๐(๐ฅ)] ⇒ ∃๐ฅ[๐ฅ ∈ ๐ด ∩ ๐ต ∧ ๐(๐ฅ)] 3.3 SETS WITHIN SETS An important relation between any two sets is when one is contained within another. Subsets Let ๐ด and ๐ต be sets. ๐ด is a subset of ๐ต exactly when every element of ๐ด is also an element of ๐ต, in symbols ๐ด ⊆ ๐ต. This is represented in a Venn diagram by the circle for ๐ด being within the circle for ๐ต [Figure 3.6(a)]. DEFINITION 3.3.1 For all sets ๐ด and ๐ต, ๐ด ⊆ ๐ต ⇔ ∀๐ฅ(๐ฅ ∈ ๐ด → ๐ฅ ∈ ๐ต). If ๐ด is not a subset of ๐ต, write ๐ด * ๐ต. This is represented in a Venn diagram by ๐ด overlapping ๐ต with a point in ๐ด but not within ๐ต [Figure 3.6(b)]. Logically, this means ๐ด * ๐ต ⇔ ¬∀๐ฅ(๐ฅ ∈ ๐ด → ๐ฅ ∈ ๐ต) ⇔ ∃๐ฅ(๐ฅ ∈ ๐ด ∧ ๐ฅ ∉ ๐ต). Thus, to show ๐ด * ๐ต find an element in ๐ด that is not in ๐ต. For example, if we let ๐ด = {1, 2, 3} and ๐ต = {1, 2, 5}, then ๐ด * ๐ต because 3 ∈ ๐ด but 3 ∉ ๐ต. U B A U B A a (a) ๐ด ⊆ ๐ต Figure 3.6 (b) ๐ด * ๐ต Venn diagrams for a subset and a nonsubset. 136 Chapter 3 SET THEORY EXAMPLE 3.3.2 The proposition for all sets ๐ด and ๐ต, ๐ด ∪ ๐ต ⊆ ๐ด is false. To see this, we must prove that there exists ๐ด and ๐ต such that ๐ด ∪ ๐ต * ๐ด. We take ๐ด = {1} and ๐ต = {2} as our candidates. Since ๐ด∪๐ต = {1, 2} and 2 ∉ ๐ด, we have ๐ด∪๐ต * ๐ด. Every set is the improper subset of itself. The notation ๐ด ⊂ ๐ต means ๐ด ⊆ ๐ต but ๐ด ≠ ๐ต. In this case, ๐ด is a proper subset of ๐ต. EXAMPLE 3.3.3 โ {1, 2, 3} ⊂ {1, 2, 3, 4, 5} and {1, 2, 3} ⊆ {1, 2, 3} โ ๐⊂๐⊂๐⊂๐⊂๐ โ (4, 5) ⊂ (4, 5] ⊂ [4, 5] โ {1, 2, 3} is not a subset of {1, 2, 4}. Our study of subsets begins with a fundamental theorem. It states that the empty set is a subset of every set. THEOREM 3.3.4 ∅ ⊆ ๐ด. PROOF Let ๐ด be a set. Since ๐ฅ ∉ ∅, we have ∀๐ฅ(๐ฅ ∉ ∅ ∨ ๐ฅ ∈ ๐ด) ⇔ ∀๐ฅ(๐ฅ ∈ ∅ → ๐ฅ ∈ ๐ด) ⇔ ∅ ⊆ ๐ด. Proving that one set is a subset of another always means proving an implication, so Direct Proof is the primary tool in these proofs. That is, usually to prove that a set ๐ด is a subset of a set ๐ต, take an element ๐ฅ ∈ ๐ด and show ๐ฅ ∈ ๐ต. EXAMPLE 3.3.5 โ Let ๐ฅ ∈ ๐ โงต ๐. Then, ๐ฅ ∈ ๐ but ๐ฅ ∉ ๐, so ๐ฅ ∈ ๐ by Simp. Thus, ๐ โงต ๐ ⊆ ๐. โ Let ๐ด = {๐ฅ : ∃๐ ∈ ๐ [(๐ฅ − 2๐)(๐ฅ − 2๐ − 2) = 0]} and ๐ต = {2๐ : ๐ ∈ ๐}. Take ๐ฅ ∈ ๐ด. This means that ∃๐ ∈ ๐ [(๐ฅ − 2๐)(๐ฅ − 2๐ − 2) = 0] . In other words, (๐ฅ − 2๐)(๐ฅ − 2๐ − 2) = 0 Section 3.3 SETS WITHIN SETS 137 for some ๐ ∈ ๐. Hence, ๐ฅ − 2๐ = 0 or ๐ฅ − 2๐ − 2 = 0. We have two cases to check. If ๐ฅ − 2๐ = 0, then ๐ฅ = 2๐. This means ๐ฅ ∈ ๐ต. If ๐ฅ − 2๐ − 2 = 0, then ๐ฅ = 2๐ + 2 = 2(๐ + 1), which also means ๐ฅ ∈ ๐ต since ๐ + 1 is an integer. Hence, ๐ด ⊆ ๐ต. โ Fix ๐, ๐ ∈ ๐. Let ๐ฅ = ๐๐, some ๐ ∈ ๐. This means ๐ฅ = ๐๐ + 0๐, which implies that {๐๐ : ๐ ∈ ๐} ⊆ {๐๐ + ๐๐ : ๐, ๐ ∈ ๐}. This next result is based only on the definitions and Inference Rules 1.2.10. As with Section 3.2, we see the close ties between set theory and logic. THEOREM 3.3.6 โ ๐ด ⊆ ๐ด. โ If ๐ด ⊆ ๐ต and ๐ต ⊆ ๐ถ, then ๐ด ⊆ ๐ถ. โ If ๐ด ⊆ ๐ต and ๐ฅ ∈ ๐ด, then ๐ฅ ∈ ๐ต. โ If ๐ด ⊆ ๐ต and ๐ฅ ∉ ๐ต, then ๐ฅ ∉ ๐ด. โ Let ๐ด ⊆ ๐ต and ๐ถ ⊆ ๐ท. If ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ถ, then ๐ฅ ∈ ๐ต or ๐ฅ ∈ ๐ท. โ Let ๐ด ⊆ ๐ต and ๐ถ ⊆ ๐ท. If ๐ฅ ∉ ๐ต or ๐ฅ ∉ ๐ท, then ๐ฅ ∉ ๐ด or ๐ฅ ∉ ๐ถ. PROOF Assume ๐ด ⊆ ๐ต and ๐ต ⊆ ๐ถ. By definition, ๐ฅ ∈ ๐ด implies ๐ฅ ∈ ๐ต and ๐ฅ ∈ ๐ต implies ๐ฅ ∈ ๐ถ. Therefore, by HS, if ๐ฅ ∈ ๐ด, then ๐ฅ ∈ ๐ถ. In other words, ๐ด ⊆ ๐ถ. The remaining parts are left to Exercise 4. Note that it is not the case that for all sets ๐ด and ๐ต, ๐ด ⊆ ๐ต implies that ๐ต ⊆ ๐ด. To prove this, choose ๐ด = ∅ and ๐ต = {0}. Then, ๐ด ⊆ ๐ต yet ๐ต * ๐ด because 0 ∈ ๐ต and 0 ∉ ∅. Equality For two sets to be equal, they must contain exactly the same elements (page 118 of Section 3.1). This can be stated more precisely using the idea of a subset. DEFINITION 3.3.7 ๐ด = ๐ต means ๐ด ⊆ ๐ต and ๐ต ⊆ ๐ด. To prove that two sets are equal, we show both inclusions. Let us do this to prove ๐ด ∪ ๐ต = ๐ด ∩ ๐ต. This is one of De Morgan’s laws. To prove it, we demonstrate both ๐ด∪๐ต ⊆๐ด∩๐ต 138 Chapter 3 SET THEORY and ๐ด ∩ ๐ต ⊆ ๐ด ∪ ๐ต. This amounts to proving a biconditional, which means we will use the rule of biconditional proof. Look at the first direction: ๐ฅ∈๐ด∪๐ต ¬(๐ฅ ∈ ๐ด ∪ ๐ต) ¬(๐ฅ ∈ ๐ด ∨ ๐ฅ ∈ ๐ต) ๐ฅ∉๐ด∧๐ฅ∉๐ต ๐ฅ∈๐ด∧๐ฅ∈๐ต ๐ฅ∈๐ด∩๐ต Given Definition of complement Definition of union De Morgan’s law Definition of complement Definition of intersection Now read backward through those steps. Each follows logically when read in this order because only definitions and replacement rules were used. This means that the steps are reversible. Hence, we have a series of biconditionals, and we can use the short rule of biconditional proof (page 108): ๐ฅ ∈ ๐ด ∪ ๐ต ⇔ ¬(๐ฅ ∈ ๐ด ∪ ๐ต) ⇔ ¬(๐ฅ ∈ ๐ด ∨ ๐ฅ ∈ ๐ต) ⇔๐ฅ∉๐ด∧๐ฅ∉๐ต ⇔๐ฅ∈๐ด∧๐ฅ∈๐ต ⇔ ๐ฅ ∈ ๐ด ∩ ๐ต. Hence, ๐ด ∪ ๐ต = ๐ด ∩ ๐ต. We must be careful when writing these types of proofs since it is easy to confuse the notation. โ The correct translation for ๐ฅ ∈ ๐ด ∩ ๐ต is ๐ฅ ∈ ๐ด ∧ ๐ฅ ∈ ๐ต. Common mistakes for this translation include using formulas with set operations: x∈A ∩ x∈B Incorrect. A set should be on either side of a set operation. . . . and sets with connectives: x∈A ∧ B Correct. Incorrect. A formula should be present here. Section 3.3 SETS WITHIN SETS 139 Remember that connectives connect formulas and set operations connect sets. โ Negations also pose problems. If a complement is used, first translate using ๐ฅ∈๐ด⇔๐ฅ∈๐ ∧๐ฅ∉๐ด⇔๐ฅ∉๐ด and then proceed with the proof. Similarly, the formula ๐ฅ ∉ ๐ด∪๐ต can be written as neither ๐ฅ ∉ ๐ด ∪ ๐ฅ ∉ ๐ต nor ๐ฅ ∉ ๐ด ∨ ๐ฅ ∉ ๐ต. Instead, use DeM: ๐ฅ ∉ ๐ด ∪ ๐ต ⇔ ¬(๐ฅ ∈ ๐ด ∪ ๐ต) ⇔ ¬(๐ฅ ∈ ๐ด ∨ ๐ฅ ∈ ๐ต) ⇔๐ฅ∉๐ด∧๐ฅ∉๐ต ⇔ ๐ฅ ∈ ๐ด ∩ ๐ต. We can now prove many basic properties about set operations. Notice how the following are closely related to their corresponding replacement rules (1.3.9). THEOREM 3.3.8 โ Associative Laws ๐ด ∩ ๐ต ∩ ๐ถ = ๐ด ∩ (๐ต ∩ ๐ถ) ๐ด ∪ ๐ต ∪ ๐ถ = ๐ด ∪ (๐ต ∪ ๐ถ) โ Commutative Laws ๐ด∩๐ต =๐ต∩๐ด ๐ด∪๐ต =๐ต∪๐ด โ De Morgan’s Laws ๐ด∪๐ต =๐ด∩๐ต ๐ด∩๐ต =๐ด∪๐ต โ Distributive Laws ๐ด ∩ (๐ต ∪ ๐ถ) = ๐ด ∩ ๐ต ∪ ๐ด ∩ ๐ถ ๐ด ∪ ๐ต ∩ ๐ถ = (๐ด ∪ ๐ต) ∩ (๐ด ∪ ๐ถ) โ Idempotent Laws ๐ด∩๐ด=๐ด ๐ด ∪ ๐ด = ๐ด. EXAMPLE 3.3.9 We use the short rule of biconditional proof (page 108) to prove the equality ๐ด ∩ ๐ต ∩ ๐ถ = ๐ด ∩ (๐ต ∩ ๐ถ). ๐ฅ∈๐ด∩๐ต∩๐ถ ⇔๐ฅ∈๐ด∩๐ต∧๐ฅ∈๐ถ ⇔๐ฅ∈๐ด∧๐ฅ∈๐ต∧๐ฅ∈๐ถ ⇔ ๐ฅ ∈ ๐ด ∧ (๐ฅ ∈ ๐ต ∧ ๐ฅ ∈ ๐ถ) ⇔ ๐ฅ ∈ ๐ด ∧ (๐ฅ ∈ ๐ต ∩ ๐ถ) ⇔ ๐ฅ ∈ ๐ด ∩ (๐ต ∩ ๐ถ). 140 Chapter 3 SET THEORY Another way to prove it is to use a chain of equal signs. ๐ด ∩ ๐ต ∩ ๐ถ = {๐ฅ : ๐ฅ ∈ ๐ด ∩ ๐ต ∧ ๐ฅ ∈ ๐ถ} = {๐ฅ : ๐ฅ ∈ ๐ด ∧ ๐ฅ ∈ ๐ต ∧ ๐ฅ ∈ ๐ถ} = {๐ฅ : ๐ฅ ∈ ๐ด ∧ (๐ฅ ∈ ๐ต ∧ ๐ฅ ∈ ๐ถ)} = {๐ฅ : ๐ฅ ∈ ๐ด ∧ ๐ฅ ∈ ๐ต ∩ ๐ถ} = ๐ด ∩ (๐ต ∩ ๐ถ). We have to be careful when proving equality. If two sets are equal, there are always proofs for both inclusions. However, the steps needed for the one implication might not simply be the steps for the converse in reverse. The next example illustrates this. It is always true that ๐ด ∩ ๐ต ⊆ ๐ด. However, the premise is needed to show the other inclusion. EXAMPLE 3.3.10 Let ๐ด ⊆ ๐ต. Prove ๐ด ∩ ๐ต = ๐ด. โ Let ๐ฅ ∈ ๐ด ∩ ๐ต. This means that ๐ฅ ∈ ๐ด and ๐ฅ ∈ ๐ต. Then, ๐ฅ ∈ ๐ด (Simp). โ Assume that ๐ฅ is an element of ๐ด. Since ๐ด ⊆ ๐ต, ๐ฅ is also an element of ๐ต. Hence, ๐ฅ ∈ ๐ด ∩ ๐ต. A more involved example of this uses the concept of divisibility. Let ๐, ๐ ∈ ๐, not both equal to zero. A common divisor of ๐ and ๐ is ๐ when ๐ โฃ ๐ and ๐ โฃ ๐. For example, 4 is a common divisor of 48 and 36, but it is not the largest. DEFINITION 3.3.11 Let ๐, ๐ ∈ ๐ with ๐ and ๐ not both zero. The integer ๐ is the greatest common divisor of ๐ and ๐ if ๐ is a common divisor of ๐ and ๐ and ๐ ≤ ๐ for every common divisor ๐ ∈ ๐. In this case, write ๐ = gcd(๐, ๐). For example, 12 = gcd(48, 36) and 7 = gcd(0, 7). Notice that it is important that at least one of the integers is not zero. The gcd(0, 0) is undefined since all ๐ such that ๐ ≠ 0 divide 0. Further notice that the greatest common divisor is positive. EXAMPLE 3.3.12 Let ๐, ๐ ∈ ๐. Prove for all ๐ ∈ ๐, gcd(๐ + ๐๐, ๐) = gcd(๐, ๐). Section 3.3 SETS WITHIN SETS 141 If both pairs of numbers have the same common divisors, their greatest common divisors must be equal. So, define ๐ = {๐ ∈ ๐ : ๐ โฃ ๐ + ๐๐ ∧ ๐ โฃ ๐} and ๐ = {๐ ∈ ๐ : ๐ โฃ ๐ ∧ ๐ โฃ ๐}. To show that the greatest common divisors are equal, prove ๐ = ๐ . โ Let ๐ ∈ ๐. Then ๐ โฃ ๐ + ๐๐ and ๐ โฃ ๐. This means ๐ + ๐๐ = ๐๐ and ๐ = ๐๐ for some ๐, ๐ ∈ ๐. We are left to show ๐ โฃ ๐. By substitution, ๐+๐๐๐ = ๐๐. Hence, ๐ ∈ ๐ because ๐ = ๐๐ − ๐๐๐ = ๐(๐ − ๐๐). โ Now take ๐ ∈ ๐ . This means ๐ โฃ ๐ and ๐ โฃ ๐. Thus, there exists ๐, ๐ ∈ ๐ such that ๐ = ๐๐ and ๐ = ๐๐. Then, ๐ + ๐๐ = ๐๐ + ๐๐๐ = ๐(๐ + ๐๐). Therefore, ๐ โฃ ๐ + ๐๐ and ๐ ∈ ๐. As with subsets, let us now prove some results concerning the empty set and the universe. We use two strategies. โ Let ๐ด be a set. We know that ๐ด = ∅ if and only if ∀๐ฅ(๐ฅ ∉ ๐ด). Therefore, to prove that ๐ด is empty, take an arbitrary ๐ and show ๐ ∉ ๐ด. This can sometimes be done directly, but more often an indirect proof is better. That is, assume ๐ ∈ ๐ด and derive a contradiction. Since the contradiction arose simply by assuming ๐ ∈ ๐ด, this formula must be the problem. Hence, ๐ด can have no elements. โ Let ๐ be a universe. To prove ๐ด = ๐ , we must show that ๐ด ⊆ ๐ and ๐ ⊆ ๐ด. The first subset relation is always true. To prove the second, take an arbitrary element and show that it belongs to ๐ด. This works because ๐ contains all possible elements. EXAMPLE 3.3.13 โ Suppose ๐ฅ ∈ ๐ด ∩ ∅. Then ๐ฅ ∈ ∅, which is impossible. Therefore, ๐ด ∩ ∅ = ∅. โ Certainly, ๐ด ⊆ ๐ด ∪ ∅, so to show the opposite inclusion take ๐ฅ ∈ ๐ด ∪ ∅. Since ๐ฅ ∉ ∅, it must be the case that ๐ฅ ∈ ๐ด. Thus, ๐ด ∪ ∅ ⊆ ๐ด, and we have that ๐ด ∪ ∅ = ๐ด. โ From Exercise 5(b), we know ๐ด ∩ ๐ ⊆ ๐ด, so let ๐ฅ ∈ ๐ด. This means that ๐ฅ must also belong to the universe. Hence, ๐ฅ ∈ ๐ด ∩ ๐ , so ๐ด ∩ ๐ = ๐ด. โ Certainly, ๐ด ∪ ๐ ⊆ ๐ . Moreover, by Exercise 5(c), we have the other inclusion. Thus, ๐ด ∪ ๐ = ๐ . 142 Chapter 3 SET THEORY EXAMPLE 3.3.14 To prove that a set ๐ด is not equal to the empty set, show ¬∀๐ฅ(๐ฅ ∉ ๐ด), but this is equivalent to ∃๐ฅ(๐ฅ ∈ ๐ด). For instance, let ๐ด = {๐ฅ ∈ ๐ : ๐ฅ2 + 6๐ฅ + 5 = 0}. We know that ๐ด is nonempty since −1 ∈ ๐ด. EXAMPLE 3.3.15 Let ๐ด = {(๐, ๐) ∈ ๐ × ๐ : ๐ + ๐ = 0} and ๐ต = {(0, ๐) : ๐ ∈ ๐}. These sets are not disjoint since (0, 0) is an element of both ๐ด and ๐ต. However, ๐ด ≠ ๐ต because (1, −1) ∈ ๐ด but (1, −1) ∉ ๐ต. Let us combine the two strategies to show a relationship between ∅ and ๐ . THEOREM 3.3.16 For all sets ๐ด and ๐ต and universe ๐ , the following are equivalent. โ ๐ด⊆๐ต โ ๐ด∪๐ต =๐ โ ๐ด ∩ ๐ต = ∅. PROOF โ Assume ๐ด ⊆ ๐ต. Suppose ๐ฅ ∉ ๐ด. Then, ๐ฅ ∈ ๐ด, which implies that ๐ฅ ∈ ๐ต. Hence, for every element ๐ฅ, we have that ๐ฅ ∈ ๐ด or ๐ฅ ∈ ๐ต, and we conclude that ๐ด ∪ ๐ต = ๐. โ Suppose ๐ด ∪ ๐ต = ๐ . In order to obtain a contradiction, take ๐ฅ ∈ ๐ด ∩ ๐ต. Then, ๐ฅ ∈ ๐ต. Since ๐ฅ ∈ ๐ด, the supposition also gives ๐ฅ ∈ ๐ต, a contradiction. Therefore, ๐ด ∩ ๐ต = ∅. โ Let ๐ด ∩ ๐ต = ∅. Assume ๐ฅ ∈ ๐ด. By hypothesis, ๐ฅ cannot be a member of ๐ต, otherwise the intersection would be nonempty. Hence, ๐ฅ ∈ ๐ต. The following theorem is a generalization of the corresponding result concerning subsets. The proof could have been written using the short rule of biconditional proof or by appealing to Lemma 3.3.6 (Exercise 21). THEOREM 3.3.17 If ๐ด = ๐ต and ๐ต = ๐ถ, then ๐ด = ๐ถ. Section 3.3 SETS WITHIN SETS 143 PROOF Assume ๐ด = ๐ต and ๐ต = ๐ถ. This means ๐ด ⊆ ๐ต, ๐ต ⊆ ๐ด, ๐ต ⊆ ๐ถ, and ๐ถ ⊆ ๐ต. We must show that ๐ด = ๐ถ. โ Let ๐ฅ ∈ ๐ด. By hypothesis, ๐ฅ is then an element of ๐ต, which implies that ๐ฅ ∈ ๐ถ. โ Let ๐ฅ ∈ ๐ถ. Then, ๐ฅ ∈ ๐ต, from which ๐ฅ ∈ ๐ด follows. The last result of the section involves the Cartesian product. The first part is illustrated in Figure 3.7. The sets ๐ต and ๐ถ are illustrated along the vertical axis and ๐ด is illustrated along the horizontal axis. The Cartesian products are represented as boxes. The other parts of the theorem can be similarly visualized. THEOREM 3.3.18 โ ๐ด × (๐ต ∩ ๐ถ) = (๐ด × ๐ต) ∩ (๐ด × ๐ถ). โ ๐ด × (๐ต ∪ ๐ถ) = (๐ด × ๐ต) ∪ (๐ด × ๐ถ). โ ๐ด × (๐ต โงต ๐ถ) = (๐ด × ๐ต) โงต (๐ด × ๐ถ). โ (๐ด × ๐ต) ∩ (๐ถ × ๐ท) = (๐ด ∩ ๐ถ) × (๐ต ∩ ๐ท). PROOF We prove the first equation. The last three are left to Exercise 19. Take three sets ๐ด, ๐ต, and ๐ถ. Then, (๐, ๐) ∈ ๐ด × (๐ต ∩ ๐ถ) ⇔ ๐ ∈ ๐ด ∧ ๐ ∈ ๐ต ∩ ๐ถ ⇔๐∈๐ด∧๐∈๐ต∧๐∈๐ด∧๐∈๐ถ ⇔ (๐, ๐) ∈ ๐ด × ๐ต ∧ (๐, ๐) ∈ ๐ด × ๐ถ ⇔ (๐, ๐) ∈ (๐ด × ๐ต) ∩ (๐ด × ๐ถ). A×B B A × (B ∩ C) C A×C A Figure 3.7 ๐ด × (๐ต ∩ ๐ถ) = (๐ด × ๐ต) ∩ (๐ด × ๐ถ). 144 Chapter 3 SET THEORY Inspired by the last result, we might try proving that (๐ด × ๐ต) ∪ (๐ถ × ๐ท) is always equal to (๐ด ∪ ๐ถ) × (๐ต ∪ ๐ท), but no such proof exists. To show this, take ๐ด = ๐ต = {1} and ๐ถ = ๐ท = {2}. Then (๐ด ∪ ๐ถ) × (๐ต ∪ ๐ท) = {1, 2} × {1, 2} = {(1, 1), (1, 2), (2, 1), (2, 2)}. but (๐ด × ๐ต) ∪ (๐ถ × ๐ท) = {(1, 1)} ∪ {(2, 2)} = {(1, 1), (2, 2)}, Hence, (๐ด ∪ ๐ถ) × (๐ต ∪ ๐ท) * (๐ด × ๐ต) ∪ (๐ถ × ๐ท). Notice, however, that the opposite inclusion is always true (Exercise 3.3.8). Exercises 1. Answer true or false. (a) ∅ ∈ ∅ (b) ∅ ⊆ {1} (c) 1 ∈ ๐ (d) 1 ⊆ ๐ (e) 1 ∈ ∅ (f) {1} ⊆ ∅ (g) 0 ∈ ∅ (h) {1} ∈ ๐ (i) ∅ ⊆ ∅ 2. Answer true or false. For each false proposition, find one element that is in the first set but is not in the second. (a) ๐+ ⊆ ๐ (b) ๐+ ⊆ ๐+ (c) ๐ โงต ๐ ⊆ ๐ (d) ๐ โงต ๐ ⊆ ๐ (e) ๐ ∩ (−1, 1) ⊆ ๐ (f) (0, 1) ⊆ ๐+ (g) (0, 1) ⊆ {0, 1, 2} (h) (0, 1) ⊆ (0, 1] 3. Prove. (a) {๐ฅ ∈ ๐ : ๐ฅ2 − 3๐ฅ + 2 = 0} ⊆ ๐ (b) (0, 1) ⊆ [0, 1] (c) [0, 1] * (0, 1) (d) ๐ × ๐ * ๐ × ๐ (e) (0, 1) ∩ ๐ * [0, 1] ∩ ๐ (f) ๐ ⊆ ๐ (g) {๐๐ : ๐ ∈ ๐} ⊆ ๐ (h) ๐ * ๐ Section 3.3 SETS WITHIN SETS 145 4. Prove the remaining parts of Theorem 3.3.6. 5. Prove. (a) ๐ด ⊆ ๐ (b) ๐ด ∩ ๐ต ⊆ ๐ด (c) ๐ด ⊆ ๐ด ∪ ๐ต (d) ๐ด โงต ๐ต ⊆ ๐ด (e) If ๐ด ⊆ ๐ต, then ๐ด ∪ ๐ถ ⊆ ๐ต ∪ ๐ถ. (f) If ๐ด ⊆ ๐ต, then ๐ด ∩ ๐ถ ⊆ ๐ต ∩ ๐ถ. (g) If ๐ด ⊆ ๐ต, then ๐ถ โงต ๐ต ⊆ ๐ถ โงต ๐ด. (h) If ๐ด ≠ ∅, then ๐ด * ๐ด. (i) If ๐ด ⊆ ๐ต, then ๐ต ⊆ ๐ด. (j) If ๐ด ⊆ ๐ต, then {1} × ๐ด ⊆ {1} × ๐ต. (k) If ๐ด ⊆ ๐ถ and ๐ต ⊆ ๐ท, then ๐ด × ๐ต ⊆ ๐ถ × ๐ท. (l) If ๐ด ⊆ ๐ถ and ๐ต ⊆ ๐ท, then ๐ถ × ๐ท ⊆ ๐ด × ๐ต. 6. Prove that ๐ด ⊆ ๐ต if and only if ๐ต ⊆ ๐ด. 7. Show that the given proposition is false: for all sets ๐ด and ๐ต, if ๐ด ∩ ๐ต ≠ ∅, then ๐ด * ๐ด ∩ ๐ต. 8. Prove: (๐ด × ๐ต) ∪ (๐ถ × ๐ท) ⊆ (๐ด ∪ ๐ถ) × (๐ต ∪ ๐ท). 9. Take ๐, ๐, ๐ ∈ ๐. Let ๐ด = {๐ ∈ ๐ : ๐ โฃ ๐} and ๐ถ = {๐ ∈ ๐ : ๐ โฃ ๐}. Suppose ๐ โฃ ๐ and ๐ โฃ ๐. Prove ๐ด ⊆ ๐ถ. 10. Prove. (a) ๐ต ⊆ ๐ด and ๐ถ ⊆ ๐ด if and only if ๐ต ∪ ๐ถ ⊆ ๐ด. (b) ๐ด ⊆ ๐ต and ๐ด ⊆ ๐ถ if and only if ๐ด ⊆ ๐ต ∩ ๐ถ. 11. Prove the unproven parts of Theorem 3.3.8. 12. Prove each equality. (a) ∅ = ๐ (b) ๐ = ∅ (c) ๐ด ∩ ๐ด = ∅ (d) ๐ด ∪ ๐ด = ๐ (e) (f) (g) (h) (i) (j) (k) ๐ด=๐ด ๐ดโงต๐ด=∅ ๐ดโงต∅=๐ด ๐ด∩๐ต =๐ตโงต๐ด ๐ดโงต๐ต =๐ด∩๐ต ๐ด∪๐ต∩๐ต =∅ ๐ด∩๐ตโงต๐ด=∅ 146 Chapter 3 SET THEORY 13. Sketch a Venn diagram for each problem and then write a proof. (a) ๐ด = ๐ด ∩ ๐ต ∪ ๐ด ∩ ๐ต (b) ๐ด ∪ ๐ต = ๐ด ∪ ๐ด ∩ ๐ต (c) ๐ด โงต (๐ต โงต ๐ถ) = ๐ด ∩ (๐ต ∪ ๐ถ) (d) ๐ด โงต (๐ด ∩ ๐ต) = ๐ด โงต ๐ต (e) ๐ด ∩ ๐ต ∪ ๐ด ∩ ๐ต ∪ ๐ด ∩ ๐ต = ๐ด ∪ ๐ต (f) (๐ด ∪ ๐ต) โงต ๐ถ = ๐ด โงต ๐ถ ∪ ๐ต โงต ๐ถ (g) ๐ด โงต (๐ต โงต ๐ถ) = ๐ด โงต ๐ต ∪ ๐ด ∩ ๐ถ (h) ๐ด โงต ๐ต โงต ๐ถ = ๐ด โงต (๐ต ∪ ๐ถ) 14. Prove. (a) If ๐ด ⊆ ๐ต, then ๐ด โงต ๐ต = ∅. (b) If ๐ด ⊆ ∅, then ๐ด = ∅. (c) Let ๐ be a universe. If ๐ ⊆ ๐ด, then ๐ด = ๐ . (d) If ๐ด ⊆ ๐ต, then ๐ต โงต (๐ต โงต ๐ด) = ๐ด. (e) ๐ด × ๐ต = ∅ if and only if ๐ด = ∅ or 15. Let ๐, ๐, ๐ ∈ ๐ and define ๐ด = {๐ + ๐๐ : ๐ ∈ ๐} and ๐ต = {๐ + ๐(๐ + ๐) : ๐ ∈ ๐}. Show ๐ด = ๐ต. 16. Prove ๐ด = ๐ต, where ๐ด= and ๐ต= {[ ๐ ๐ ] ๐ ๐ {[ ๐ 0 ] 0๐ } : ๐ + ๐ = 0 ∧ ๐, ๐ ∈ ๐ } : ๐ = −๐ ∧ ๐2 + ๐ 2 = 0 ∧ ๐, ๐, ๐, ๐ ∈ ๐ . 17. Prove. (a) ๐ ≠ ๐ (b) ๐ ≠ ๐ (c) {0} × ๐ ≠ ๐ (d) ๐ × ๐ ≠ ๐ × ๐ (e) If ๐ด = {๐๐ฅ3 + ๐ : ๐, ๐ ∈ ๐} and ๐ต = {๐ฅ3 + ๐ : ๐ ∈ ๐}, then ๐ด ≠ ๐ต. (f) If ๐ด = {๐๐ฅ3 + ๐ : ๐, ๐ ∈ ๐} and ๐ต = {๐๐ฅ3 + ๐ : ๐, ๐ ∈ ๐}, then ๐ด ≠ ๐ต. (g) ๐ด ≠ ๐ต, where {[ ] } ๐ด = ๐0 0๐ : ๐, ๐ ∈ ๐ and ๐ต= {[ ๐ ๐ ] ๐ ๐ } : ๐, ๐, ๐, ๐ ∈ ๐ . 18. Find ๐ด and ๐ต to illustrate the given inequalities. (a) ๐ด โงต ๐ต ≠ ๐ต โงต ๐ด (b) (๐ด × ๐ต) × ๐ถ ≠ ๐ด × (๐ต × ๐ถ) (c) ๐ด × ๐ต ≠ ๐ต × ๐ด 19. For the remaining parts of Theorem 3.3.18, draw diagrams as in Figure 3.7 and prove the results. Section 3.3 SETS WITHIN SETS 147 20. Is it possible for ๐ด = ๐ด? Explain. 21. Prove Theorem 3.3.17 by first using the short rule of biconditional proof and then by directly appealing to Theorem 3.3.6. 22. Prove that the following are equivalent. โ ๐ด⊆๐ต โ ๐ด∪๐ต =๐ต โ ๐ดโงต๐ต =∅ โ ๐ด∩๐ต =๐ด 23. Prove that the following are equivalent. โ ๐ด∩๐ต =∅ โ ๐ดโงต๐ต =∅ โ ๐ด⊆๐ต 24. Find an example of sets ๐ด, ๐ต, and ๐ถ such that ๐ด ∩ ๐ต = ๐ด ∩ ๐ถ but ๐ต ≠ ๐ถ. 25. Does ๐ด ∪ ๐ต = ๐ด ∪ ๐ถ imply ๐ต = ๐ถ for all sets ๐ด and ๐ต? Explain. 26. Prove. (a) If ๐ด ∪ ๐ต ⊆ ๐ด ∩ ๐ต, then ๐ด = ๐ต. (b) If ๐ด ∩ ๐ต = ๐ด ∩ ๐ถ and ๐ด ∪ ๐ต = ๐ด ∪ ๐ถ, then ๐ต = ๐ถ. 27. Prove that there is a cancellation law with the Cartesian product. Namely, if ๐ด ≠ ∅ and ๐ด × ๐ต = ๐ด × ๐ถ, then ๐ต = ๐ถ. 28. When does ๐ด × ๐ต = ๐ถ × ๐ท imply that ๐ด = ๐ถ and ๐ต = ๐ท? 29. Prove. (a) If ๐ด ∪ ๐ต ≠ ∅, then ๐ด ≠ ∅ or ๐ต ≠ ∅. (b) If ๐ด ∩ ๐ต ≠ ∅, then ๐ด ≠ ∅ and ๐ต ≠ ∅. 30. Find the greatest common divisors of each pair. (a) 12 and 18 (b) 3 and 9 (c) 14 and 0 (d) 7 and 15 31. Let ๐ be a positive integer. Find the following and prove the result. (a) gcd(๐, ๐ + 1) (b) gcd(๐, 2๐) (c) gcd(๐, ๐2 ) (d) gcd(๐, 0) 148 Chapter 3 SET THEORY 3.4 FAMILIES OF SETS The elements of a set can be sets themselves. We call such a collection a family of sets and often use capital script letters to name them. For example, let โฐ = {{1, 2, 3}, {2, 3, 4}, {3, 4, 5}}. (3.2) The set โฐ has three elements: {1, 2, 3}, {2, 3, 4}, and {3, 4, 5}. EXAMPLE 3.4.1 โ {1, 2, 3} ∈ {{1, 2, 3}, {1, 4, 9}} โ 1 ∉ {{1, 2, 3}, {1, 4, 9}} โ {1, 2, 3} * {{1, 2, 3}, {1, 4, 9}} โ {{1, 2, 3}} ⊆ {{1, 2, 3}, {1, 4, 9}}. EXAMPLE 3.4.2 โ ∅ ⊆ {∅, {∅}} by Theorem 3.3.4. โ {∅} ⊆ {∅, {∅}} because ∅ ∈ {∅, {∅}}. โ {{∅}} ⊆ {∅, {∅}} because {∅} ∈ {∅, {∅}}. Families of sets can have infinitely many elements. For example, let โฑ = {[๐, ๐ + 1] : ๐ ∈ ๐}. (3.3) In roster notation, โฑ = {… , [−2, −1] , [−1, 0] , [0, 1] , [1, 2] , … }. Notice that in this case, abstraction is more convenient. For each integer ๐, the closed interval [๐, ๐ + 1] is in โฑ . The set ๐ plays the role of an index set, a set whose only purpose is to enumerate the elements of the family. Each element of an index set is called an index. If we let ๐ผ = ๐ and ๐ด๐ = [๐, ๐ + 1], the family can be written as โฑ = {๐ด๐ : ๐ ∈ ๐ผ}. To write the family โฐ (3.2) using an index set, let ๐ผ = {0, 1, 2} and define ๐ด0 = {1, 2, 3}, ๐ด1 = {2, 3, 4}, ๐ด2 = {3, 4, 5}. Then, the family illustrated in Figure 3.8 is โฐ = {๐ด๐ : ๐ ∈ ๐ผ} = {๐ด1 , ๐ด2 , ๐ด3 }. Section 3.4 FAMILIES OF SETS A0 1 โฐ 2 4 A2 Figure 3.8 3 149 2 3 3 4 A1 5 The family of sets โฐ = {๐ด๐ : ๐ ∈ ๐ผ} with ๐ผ = {1, 2, 3}. There is no reason why ๐ผ must be {0, 1, 2}. Any three-element set will do. The order in which the sets are defined is also irrelevant. For instance, we could have defined ๐ผ = {๐ค, ๐, 99} and ๐ด๐ค = {3, 4, 5}, ๐ด๐ = {2, 3, 4}, ๐ด99 = {1, 2, 3}. The goal is to have each set in the family referenced or indexed by at least one element of the set. We will still have โฐ = {๐ด๐ : ๐ ∈ ๐ผ} with a similar diagram (Figure 3.9). EXAMPLE 3.4.3 Write โฑ (3.3) using ๐ as the index set instead of ๐. Use the even natural numbers to index the intervals with a nonnegative integer left-hand endpoint. The odd natural numbers will index the intervals with a negative integer left-hand endpoint. To do this, define the sets ๐ต๐ as follows: A99 Aw Figure 3.9 1 3 โฐ 2 4 2 3 3 4 Aπ 5 The family of sets โฐ = {๐ด๐ : ๐ ∈ ๐ผ} with ๐ผ = {๐ค, ๐, 99}. 150 Chapter 3 SET THEORY ๐ต5 ๐ต3 ๐ต1 ๐ต0 ๐ต2 ๐ต4 โฎ = [−3, −2] = [−2, −1] = [−1, 0] = [0, 1] = [1, 2] = [2, 3] โฎ Use 2๐ + 1 to represent the odd natural numbers and 2๐ to represent the even natural numbers (๐ ∈ ๐). Then, ๐ต2(2)+1 โฎ = [−2 − 1, −2] ๐ต2(1)+1 = [−1 − 1, −1] ๐ต2(0)+1 = [−0 − 1, −0] and ๐ต2(0) = [0, 0 + 1] ๐ต2(1) = [1, 1 + 1] ๐ต2(2) = [2, 2 + 1] โฎ Therefore, define for all natural numbers ๐, ๐ต2๐+1 = [−๐ − 1, −๐] and ๐ต2๐ = [๐, ๐ + 1] . We have indexed the elements of โฑ as ๐ต0 ๐ต1 ๐ต2 ๐ต3 ๐ต4 ๐ต5 = [0, 1] = [−1, 0] = [1, 2] = [−2, −1] = [2, 3] = [−3, −2] โฎ So under this definition, โฑ = {๐ต๐ : ๐ ∈ ๐}. Section 3.4 FAMILIES OF SETS 151 Power Set There is a natural way to form a family of sets. Take a set ๐ด. The collection of all subsets of ๐ด is called the power set of ๐ด. It is represented by P(๐ด). DEFINITION 3.4.4 For any set ๐ด, P(๐ด) = {๐ต : ๐ต ⊆ ๐ด}. Notice that ∅ ∈ P(๐ด) by Theorem 3.3.4. EXAMPLE 3.4.5 โ P({1, 2, 3}) = {∅, {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3}} โ P({∅, {∅}}) = {∅, {∅}, {{∅}}, {∅, {∅}}} โ P(๐) = {∅, {0}, {1}, … , {0, 1}, {0, 2}, … }. Consider ๐ด = {2, 6} and ๐ต = {2, 6, 10}, so ๐ด ⊆ ๐ต. Examining the power sets of each, we find that P(๐ด) = {∅, {2}, {6}, {2, 6}} and P(๐ต) = {∅, {2}, {6}, {10}, {2, 6}, {2, 10}, {6, 10}, {2, 6, 10}}. Hence, P(๐ด) ⊆ P(๐ต). This result is generalized in the next lemma. LEMMA 3.4.6 ๐ด ⊆ ๐ต if and only if P(๐ด) ⊆ P(๐ต). PROOF โ Let ๐ด ⊆ ๐ต. Assume ๐ ∈ P(๐ด). Then, ๐ ⊆ ๐ด, which gives ๐ ⊆ ๐ต by Theorem 3.3.6. Hence, ๐ ∈ P(๐ต). โ Assume P(๐ด) ⊆ P(๐ต). Let ๐ฅ ∈ ๐ด. In other words, {๐ฅ} ⊆ ๐ด, but this means that {๐ฅ} ∈ P(๐ด). Hence, {๐ฅ} ∈ P(๐ต) by hypothesis, so ๐ฅ ∈ ๐ต. The definition of set equality and Lemma 3.4.6 are used to prove the next theorem. Its proof is left to Exercise 9. THEOREM 3.4.7 ๐ด = ๐ต if and only if P(๐ด) = P(๐ต). Union and Intersection We now generalize the set operations. Define the union of a family of sets to be the set of all elements that belong to some member of the family. This union is denoted by the same notation as in Definition 3.2.1 and can be defined using the abstraction method. 152 Chapter 3 SET THEORY DEFINITION 3.4.8 Let โฑ be a family of sets. Define โ โฑ = {๐ฅ : ∃๐ด(๐ด ∈ โฑ ∧ ๐ฅ ∈ ๐ด)}. If the family is indexed so that โฑ = {๐ด๐ : ๐ ∈ ๐ผ}, define โ ๐ด๐ = {๐ฅ : ∃๐(๐ ∈ ๐ผ ∧ ๐ฅ ∈ ๐ด๐ )}. ๐∈๐ผ Observe that โ โฑ = โ ๐∈๐ผ ๐ด๐ [Exercise 16(a)]. We generalize the notion of intersection similarly. The intersection of a family of sets is the set of all elements that belong to each member of the family. DEFINITION 3.4.9 Let โฑ be a family of sets. โ โฑ = {๐ฅ : ∀๐ด(๐ด ∈ โฑ → ๐ฅ ∈ ๐ด)}. If the family is indexed so that โฑ = {๐ด๐ : ๐ ∈ ๐ผ}, define โ ๐ด๐ = {๐ฅ : ∀๐(๐ ∈ ๐ผ → ๐ฅ ∈ ๐ด๐ )}. ๐∈๐ผ Observe that โ โฑ = โ ๐∈๐ผ ๐ด๐ [Exercise 16(b)]. Furthermore, notice that both definitions are indeed generalizations of the operations of Section 3.2 because as noted in Exercise 17, โ {๐ด, ๐ต} = ๐ด ∪ ๐ต, and โ {๐ด, ๐ต} = ๐ด ∩ ๐ต. EXAMPLE 3.4.10 Define โฐ = {[๐, ๐ + 1] : ๐ ∈ ๐}. โ When all of these intervals are combined, the result is the real line. This means that โ โฐ = ๐. โ There is not one element that is common to all of the intervals. Hence, โ โฐ = ∅. The next example illustrates how to write the union or intersection of a family of sets as a roster. Section 3.4 FAMILIES OF SETS 153 EXAMPLE 3.4.11 Let โฑ = {{1, 2, 3}, {2, 3, 4}, {3, 4, 5}}. โ โ Since 1 is in the first set of โฑ , 1 ∈ โฑ . The others can be explained similarly, so โ โฑ = {1, 2, 3, 4, 5}. Notice that mechanically this amounts to removing the braces around the sets of the family and setting the union to the resulting set: Remove braces {{1, 2, 3}, {2, 3, 4}, {3, 4, 5}} โฑ {1, 2, 3, 2, 3, 4, 3, 4, 5} {1, 2, 3, 4, 5} โ โฑ โ The generalized intersection โ is simply the overlap of all of the sets. Hence, 3 is the only element of โฑ . That is, โ โฑ = {3}, and this is illustrated by the following diagram: {{1, 2, 3 }, {2, 3 , 4}, { 3 , 4, 5}} โฑ {3} โโฑ EXAMPLE 3.4.12 Since a family of sets can be empty, we must โ be able to take the โunion and intersection of the empty set. To prove that ∅ = ∅, take ๐ฅ ∈ ∅. This means that there exists ๐ด ∈ ∅ such that ๐ฅ ∈ ๐ด, but this is impossible. We leave the fact โ that ∅ is equal to the universe to Exercise 24. The next theorem generalizes the Distributive Laws. Exercise 2.4.10 plays an important role in its proof. It allows us to move the quantifier. 154 Chapter 3 SET THEORY THEOREM 3.4.13 Let ๐ด๐ (๐ ∈ ๐ผ) and ๐ต be sets. โ โ ๐ด๐ = (๐ต ∪ ๐ด๐ ) โ ๐ต∪ ๐∈๐ผ ๐∈๐ผ โ ๐ต∩ โ โ ๐ด๐ = (๐ต ∩ ๐ด๐ ). ๐∈๐ผ ๐∈๐ผ PROOF We leave the second part to Exercise 19. The first part is demonstrated by the following biconditional proof: โ โ ๐ฅ∈๐ต∪ ๐ด๐ ⇔ ๐ฅ ∈ ๐ต ∨ ๐ฅ ∈ ๐ด๐ ๐∈๐ผ ๐∈๐ผ ⇔ ๐ฅ ∈ ๐ต ∨ ∀๐(๐ ∈ ๐ผ → ๐ฅ ∈ ๐ด๐ ) ⇔ ๐ฅ ∈ ๐ต ∨ ∀๐(๐ ∉ ๐ผ ∨ ๐ฅ ∈ ๐ด๐ ) ⇔ ∀๐(๐ฅ ∈ ๐ต ∨ ๐ ∉ ๐ผ ∨ ๐ฅ ∈ ๐ด๐ ) ⇔ ∀๐(๐ ∉ ๐ผ ∨ ๐ฅ ∈ ๐ต ∨ ๐ฅ ∈ ๐ด๐ ) ⇔ ∀๐(๐ ∈ ๐ผ → ๐ฅ ∈ ๐ต ∨ ๐ฅ ∈ ๐ด๐ ) ⇔ ∀๐(๐ ∈ ๐ผ → ๐ฅ ∈ ๐ต ∪ ๐ด๐ ) โ ⇔๐ฅ∈ (๐ต ∪ ๐ด๐ ). ๐∈๐ผ To understand the next result, consider the following example. Choose the universe to be {1, 2, 3, 4, 5} and perform some set operations on the family {{1, 2, 3}, {2, 3, 4}, {2, 3, 5}}. First, โ {{1, 2, 3}, {2, 3, 4}, {2, 3, 5}} = {2, 3} = {1, 4, 5}, and second, โ โ {{1, 2, 3}, {2, 3, 4}, {2, 3, 5}} = {{4, 5}, {1, 5}, {1, 4}} = {1, 4, 5}. This leads us to the next generalization of De Morgan’s laws. Its proof is left to Exercise 23. THEOREM 3.4.14 Let {๐ด๐ : ๐ ∈ ๐ผ} be a family of sets. โ โ โ ๐ด๐ = ๐ด๐ โ ๐∈๐ผ ๐∈๐ผ โ โ ๐∈๐ผ ๐ด๐ = ๐∈๐ผ ๐ด๐ . Section 3.4 FAMILIES OF SETS 155 Disjoint and Pairwise Disjoint What it means for two sets to be disjoint was defined in Section 3.2 (Definition 3.2.3). The next definition generalizes that notion to families of sets. Because a family can have more than two elements, it is appropriate to expand the concept of disjointness. DEFINITION 3.4.15 Let โฑ be a family of sets. โ โฑ is disjoint when โ โฑ = ∅. โ โฑ is pairwise disjoint when for all ๐ด, ๐ต ∈ โฑ , if ๐ด ≠ ๐ต, then ๐ด ∩ ๐ต = ∅. Observe that {{1, 2}, {3, 4}, {5, 6}} is both disjoint and pairwise disjoint because its elements have no common members. EXAMPLE 3.4.16 Let ๐ด be a set. We see that โ P(๐ด) = {๐ฅ : ∃๐ต[๐ต ∈ P(๐ด) ∧ ๐ฅ ∈ ๐ต]} = ๐ด. Although P(๐ด) is not a pairwise disjoint family of sets, it is a disjoint because ∅ ∈ P(๐ด). If the family is indexed, we can use another test to determine if it is pairwise disjoint. Let โฑ = {๐ด๐ : ๐ ∈ ๐ผ} be a family of sets. If for all ๐, ๐ ∈ ๐ผ, ๐ ≠ ๐ implies ๐ด๐ ∩ ๐ด๐ = ∅, (3.4) then โฑ is pairwise disjoint. To prove this, let โฑ be a family that satisfies (3.4). Take ๐ด๐ , ๐ด๐ ∈ โฑ for some ๐, ๐ ∈ ๐ผ and assume ๐ด๐ ≠ ๐ด๐ . Therefore, ๐ ≠ ๐, for otherwise they would be the equal. Hence, ๐ด๐ ∩ ๐ด๐ = ∅ by Condition 3.4. The next result illustrates the relationship between these two terms. One must be careful, though. The converse is false (Exercise 14). THEOREM 3.4.17 Let โฑ be a family of sets with at least two elements. If โฑ is pairwise disjoint, then โฑ is disjoint. PROOF Assume โฑ is pairwise disjoint. Since โฑ contains at least two sets, let ๐ด, ๐ต ∈ โฑ โ such that ๐ด ≠ ๐ต. Then, using Exercise 27, โฑ ⊆ ๐ด ∩ ๐ต = ∅. Exercises 1. Let ๐ผ = {1, 2, 3, 4, 5}, ๐ด1 = {1, 2}, ๐ด2 = {3, 4}, ๐ด3 = {1, 4}, ๐ด4 = {3, 4}, and ๐ด5 = {1, 3}. Write the given families of sets as a rosters. 156 Chapter 3 SET THEORY (a) (b) (c) (d) (e) (f) {๐ด๐ {๐ด๐ {๐ด๐ {๐ด๐ {๐ด๐ {๐ด๐ : ๐ ∈ ๐ผ} : ๐ ∈ {2, 4}} : ๐ = 1} : ๐ = 1, 2} : ๐ ∈ ∅} : ๐ ∈ ๐ด5 } 2. Answer true or false. (a) 1 ∈ {{1}, {2}, {1, 2}} (b) {1} ∈ {{1}, {2}, {1, 2}} (c) {1} ⊆ {{1}, {2}, {1, 2}} (d) {1, 2} ∈ {{{1, 2}, {3, 4}}, {1, 2}} (e) {1, 2} ⊆ {{{1, 2}, {3, 4}}, {1, 2}} (f) {3, 4} ∈ {{{1, 2}, {3, 4}}, {1, 2}} (g) {3, 4} ⊆ {{{1, 2}, {3, 4}}, {1, 2}} (h) ∅ ∈ {{{1, 2}, {3, 4}}, {1, 2}} (i) ∅ ⊆ {{{1, 2}, {3, 4}}, {1, 2}} (j) {∅} ∈ {∅, {∅, {∅}}} (k) {∅} ⊆ {∅, {∅, {∅}}} (l) ∅ ∈ {∅, {∅, {∅}}} (m) ∅ ⊆ {∅, {∅, {∅}}} (n) {∅} ⊆ ∅ (o) {∅} ⊆ {∅, {∅}} (p) {∅} ⊆ {{∅, {∅}}} (q) {{∅}} ⊆ {∅, {∅}} (r) {1} ∈ P(๐) (s) {1} ⊆ P(๐) (t) ∅ ∈ P(∅) (u) {∅} ∈ P(∅) (v) {∅} ⊆ P(∅) 3. Find sets such that๐ผ ∩ ๐ฝ = ∅ but {๐ด๐ : ๐ ∈ ๐ผ} and {๐ด๐ : ๐ ∈ ๐ฝ } are not disjoint. 4. Let {๐ด๐ : ๐ ∈ ๐พ} be a family of sets and let ๐ผ and ๐ฝ be subsets of ๐พ. Define โฐ = {๐ด๐ : ๐ ∈ ๐ผ} and โฑ = {๐ด๐ : ๐ ∈ ๐ฝ } and prove the following. (a) If ๐ผ ⊆ ๐ฝ , then โฐ ⊆ โฑ . (b) โฐ ∪ โฑ = {๐ด๐ : ๐ ∈ ๐ผ ∪ ๐ฝ } (c) {๐ด๐ : ๐ ∈ ๐ผ ∩ ๐ฝ } ⊆ โฐ ∩ โฑ 5. Using the same notation as in the previous problem, find a family {๐ด๐ : ๐ ∈ ๐พ} and subsets ๐ผ and ๐ฝ of ๐พ such that: (a) โฐ ∩ โฑ * {๐ด๐ : ๐ ∈ ๐ผ ∩ ๐ฝ } (b) {๐ด๐ : ๐ ∈ ๐ผ โงต ๐ฝ } * โฐ โงต โฑ 6. Show {๐ด๐ : ๐ ∈ ๐ผ} = ∅ if and only if ๐ผ = ∅. Section 3.4 FAMILIES OF SETS 157 7. Let ๐ด be a finite set. How many elements are in P(๐ด) if ๐ด has ๐ elements? Explain. 8. Find the given power sets. (a) P({1, 2}) (b) P(P({1, 2})) (c) P(∅) (d) P(P(∅)) (e) P({{∅}}) (f) P({∅, {∅}, {∅, {∅}}}) 9. Prove Theorem 3.4.7. 10. For each of the given equalities, prove or show false. If one is false, prove any true inclusion. (a) P(๐ด ∪ ๐ต) = P(๐ด) ∪ P(๐ต) (b) P(๐ด ∩ ๐ต) = P(๐ด) ∩ P(๐ต) (c) P(๐ด โงต ๐ต) = P(๐ด) โงต P(๐ต) (d) P(๐ด × ๐ต) = P(๐ด) × P(๐ต) 11. Prove P(๐ด) ⊆ P(๐ต) implies ๐ด ⊆ ๐ต by using the fact that ๐ด ∈ P(๐ด). 12. Write the following sets in roster form. โ (a) {{1, 2}, {1, 2}, {1, 3}, {1, 4}} โ (b) {{1, 2}, {1, 2}, {1, 3}, {1, 4}} โ (c) P(∅) โ (d) P(∅) โโ (e) {{{1}}, {{1, 2}}, {{1, 3}}, {{1, 4}}} โโ (f) {{{1}}, {{1, 2}}, {{1, 3}}, {{1, 4}}} โโ (g) {{{1}}, {{1, 2}}, {{1, 3}}, {{1, 4}}} โโ (h) {{{1}}, {{1, 2}}, {{1, 3}}, {{1, 4}}} โโ (i) ∅ โโ (j) ∅ 13. Draw Venn diagrams for a disjoint family of sets that is not pairwise disjoint and for a pairwise disjoint family of sets. 14. Show by example that a disjoint family of sets might not be pairwise disjoint. 15. Given a family of sets {๐ด๐ : ๐ ∈ ๐ผ}, find a family โฌ = {๐ต๐ : ๐ ∈ ๐ผ} such that {๐ด๐ × ๐ต๐ : ๐ ∈ ๐ผ} is pairwise disjoint. 16. Let โฑ = {๐ด๐ : ๐ ∈ ๐ผ} be a family of sets. Prove the given equations. โ โ (a) โฑ = ๐∈๐ผ ๐ด๐ โ โ (b) โฑ = ๐∈๐ผ ๐ด๐ โ โ 17. Prove for any sets ๐ด and ๐ต, {๐ด, ๐ต} = ๐ด ∪ ๐ต and {๐ด, ๐ต} = ๐ด ∩ ๐ต. โ โ 18. Let ๐ = {(0, ๐) : ๐ ∈ ๐+ }. Prove that ๐ = (0, ∞) and ๐ = (0, 1). 158 Chapter 3 SET THEORY 19. Prove the second part of Theorem 3.4.13. 20. Prove Theorem 3.4.17 indirectly. 21. Is Theorem 3.4.17 still true if the family of sets โฑ has at most one element? Explain. 22. Let {๐ด๐ : ๐ ∈ ๐ผ} be a family of sets and prove the following. โ (a) If ๐ต ⊆ ๐ด๐ for some ๐ ∈ ๐ผ, then ๐ต ⊆ ๐∈๐ผ ๐ด๐ . โ (b) If ๐ด๐ ⊆ ๐ต for all ๐ ∈ ๐ผ, ๐∈๐ผ ๐ด๐ ⊆ ๐ต. โ (c) If ๐ต ⊆ ๐∈๐ผ ๐ด๐ , then ๐ต ⊆ ๐ด๐ for all ๐ ∈ ๐ผ. 23. Prove Theorem 3.4.14. โ 24. Show ∅ = ๐ where ๐ is a universe. โ 25. Let โฑ be a family of sets such that ∅ ∈ โฑ . Prove โฑ = ∅. โ โ 26. Find families of sets โฐ and โฑ so that โฐ = โฑ but โฐ ≠ โฑ . Can this be repeated by replacing union with intersection? โ โ 27. Let โฑ be a family of sets, and let ๐ด ∈ โฑ . Prove โฑ ⊆ ๐ด ⊆ โฑ . 28. Let โฐ and โฑ be families of sets. Show the following. โ (a) {โฑ } = โฑ โ (b) {โฑ } = โฑ โ โ (c) If โฐ ⊆ โฑ , then โฐ ⊆ โฑ . โ โ (d) If โฐ ⊆ โฑ , then โฑ ⊆ โฐ . โ โ โ (e) (โฐ ∪ โฑ ) = โฐ ∪ โฑ โ โ โ (f) (โฐ ∪ โฑ ) = โฐ ∩ โฑ 29. Find families of sets โฐ and โฑ that make the following false. โ โ โ (a) (โฐ ∩ โฑ ) = โฐ ∩ โฑ โ โ โ (b) (โฐ ∩ โฑ ) = โฐ ∩ โฑ 30. Let โฑ be a family of sets. For each of the given equalities, prove true or show that it is false by finding a counter-example. โ (a) P(โฑ ) = โฑ โ (b) P(โฑ ) = โฑ โ (c) P( โฑ ) = โฑ โ (d) P( โฑ ) = โฑ 31. Prove these alternate forms of De Morgan’s laws. โ โ (a) ๐ด โงต ๐∈๐ผ ๐ต๐ = ๐∈๐ผ ๐ด โงต ๐ต๐ โ โ (b) ๐ด โงต ๐∈๐ผ ๐ต๐ = ๐∈๐ผ ๐ด โงต ๐ต๐ 32. Assume that the universe ๐ contains only sets such that for all ๐ด ∈ ๐ , ๐ด ⊆ ๐ and P(๐ด) ∈ ๐ . Prove the following equalities. โ (a) ๐ =๐ โ (b) ๐ =∅ Section 3.4 FAMILIES OF SETS 33. Let ๐ผ๐ = [๐, ๐ + 1] and ๐ฝ๐ = [๐, ๐ + 1], ๐ ∈ ๐. Define โฑ = {๐ผ๐ × ๐ฝ๐ : ๐, ๐ ∈ ๐}. Show โ โฑ = ๐2 and โ โฑ = ∅. Is โฑ pairwise disjoint? 159 CHAPTER 4 RELATIONS AND FUNCTIONS 4.1 RELATIONS A relation is an association between objects. A book on a table is an example of the relation of one object being on another. It is especially common to speak of relations among people. For example, one person could be the niece of another. In mathematics, there are many relations such as equals and less-than that describe associations between numbers. To formalize this idea, we make the next definition. DEFINITION 4.1.1 A set ๐ is an (n-ary) relation if there exist sets ๐ด0 , ๐ด1 , … , ๐ด๐−1 such that ๐ ⊆ ๐ด0 × ๐ด1 × · · · × ๐ด๐−1 . In particular, ๐ is a unary relation if ๐ = 1 and a binary relation if ๐ = 2. If ๐ ⊆ ๐ด × ๐ด for some set ๐ด, then ๐ is a relation on ๐ด and we write (๐ด, ๐ ). The relation on can be represented as a subset of the Cartesian product of the set of all books and the set of all tables. We could then write (dictionary, desk) to mean that the 161 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 162 Chapter 4 RELATIONS AND FUNCTIONS dictionary is on the desk. Similarly, the set {(2, 4), (7, 3), (0, 0)} is a relation because it is a subset of ๐ × ๐. The ordered pair (2, 4) means that 2 is related to 4. Likewise, ๐ × ๐ is a relation where every real number is related to every rational number, and according to Definition 4.1.1, the empty set is also a relation because ∅ = ∅ × ∅. EXAMPLE 4.1.2 For any set ๐ด, define ๐ผ๐ด = {(๐, ๐) : ๐ ∈ ๐ด}. Call this set the identity on ๐ด. In particular, the identity on ๐ is ๐ผ๐ = {(๐ฅ, ๐ฅ) : ๐ฅ ∈ ๐}, and the identity on ๐ is ๐ผ๐ = {(๐ฅ, ๐ฅ) : ๐ฅ ∈ ๐}. Notice that ∅ is the identity on ∅. EXAMPLE 4.1.3 The less-than relation on ๐ is defined as ๐ฟ = {(๐, ๐) : ๐, ๐ ∈ ๐ ∧ ๐ < ๐}. Another approach is to use membership in the set of positive integers as our condition. That is, ๐ฟ = {(๐, ๐) : ๐, ๐ ∈ ๐ ∧ ๐ − ๐ ∈ ๐+ }. Hence, (4, 7) ∈ ๐ฟ because 7 − 4 ∈ ๐+ . See Exercise 1 for another definition of ๐ฟ. When a relation ๐ ⊆ ๐ด × ๐ต is defined, all elements in ๐ด or ๐ต might not be used. For this reason, it is important to identify the sets that comprise all possible values for the two coordinates of the relation. DEFINITION 4.1.4 Let ๐ ⊆ ๐ด × ๐ต. The domain of ๐ is the set dom(๐ ) = {๐ฅ ∈ ๐ด : ∃๐ฆ(๐ฆ ∈ ๐ต ∧ (๐ฅ, ๐ฆ) ∈ ๐ )}, and the range of ๐ is the set ran(๐ ) = {๐ฆ ∈ ๐ต : ∃๐ฅ(๐ฅ ∈ ๐ด ∧ (๐ฅ, ๐ฆ) ∈ ๐ )}. EXAMPLE 4.1.5 If ๐ = {(1, 3), (2, 4), (2, 5)}, then dom(๐ ) = {1, 2} and ran(๐ ) = {3, 4, 5}. We represent this in Figure 4.1 where ๐ด and ๐ต are sets so that {1, 2} ⊆ ๐ด and {3, 4, 5} ⊆ ๐ต. The ordered pair (1, 3) is denoted by the arrow pointing from 1 to 3. Also, both dom(๐ ) and ran(๐ ) are shaded. Section 4.1 RELATIONS 163 R A 1 2 Figure 4.1 3 B 4 5 ๐ = {(1, 3), (2, 4), (2, 5)}. EXAMPLE 4.1.6 Let ๐ = {(๐ฅ, ๐ฆ) : |๐ฅ| = ๐ฆ ∧ ๐ฅ, ๐ฆ ∈ ๐}. Notice that both (2, 2) and (−2, 2) are elements of ๐. Furthermore, dom(๐) = ๐ and ran(๐) = [0, ∞). The domain and range of a relation can be the same set as in the next two examples. EXAMPLE 4.1.7 If ๐ = {(0, 1), (0, 2), (1, 0), (2, 0)}, then ๐ is a relation on the set {0, 1, 2} with dom(๐ ) = ran(๐ ) = {0, 1, 2}. EXAMPLE 4.1.8 For the relation } { √ ๐ = (๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 + ๐ฆ2 ≥ 1 , both the domain and range equal ๐. First, note that it is clear by the definition of ๐ that both dom(๐) and ran(๐) are subsets of ๐. For the other inclusion, take ๐ฅ ∈ ๐. Since (๐ฅ, 1) ∈ ๐, ๐ฅ ∈ dom(๐), and since (1, ๐ฆ) ∈ ๐, ๐ฆ ∈ ran(๐). Composition Given the relations ๐ and ๐, let us define a new relation. Suppose that (๐, ๐) ∈ ๐ and (๐, ๐) ∈ ๐ . Therefore, ๐ is related to ๐ through ๐. The new relation will contain the ordered pair (๐, ๐) to represent this relationship. DEFINITION 4.1.9 Let ๐ ⊆ ๐ด × ๐ต and ๐ ⊆ ๐ต × ๐ถ. The composition of ๐ and ๐ is the subset of ๐ด × ๐ถ defined as ๐ โฆ ๐ = {(๐ฅ, ๐ง) : ∃๐ฆ(๐ฆ ∈ ๐ต ∧ (๐ฅ, ๐ฆ) ∈ ๐ ∧ (๐ฆ, ๐ง) ∈ ๐)}. As illustrated in Figure 4.2, the reason that (๐, ๐) ∈ ๐ โฆ ๐ is because (๐, ๐) ∈ ๐ and (๐, ๐) ∈ ๐ . That is, ๐ is related to ๐ via ๐ โฆ ๐ because ๐ is related to ๐ via ๐ and then ๐ 164 Chapter 4 RELATIONS AND FUNCTIONS A C RโฆS c a S R B b Figure 4.2 A composition of relations. is related to ๐ via ๐ . The composition can be viewed as the direct path from ๐ to ๐ that does not require the intermediary ๐. EXAMPLE 4.1.10 To clarify the definition, let ๐ = {(2, 4), (1, 3), (2, 5)} and ๐ = {(0, 1), (1, 0), (0, 2), (2, 0)}. We have that ๐ โฆ ๐ = {(0, 3), (0, 4), (0, 5)}. Notice that (0, 3) ∈ ๐ โฆ ๐ because (0, 1) ∈ ๐ and (1, 3) ∈ ๐ . However, ๐ โฆ ๐ is empty since ran(๐ ) and dom(๐) are disjoint. EXAMPLE 4.1.11 Define and ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 + ๐ฆ2 = 1} ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฆ = ๐ฅ + 1}. Notice that ๐ is the unit circle and ๐ is the line with slope of 1 and ๐ฆ-intercept of (0, 1). Let us find ๐ โฆ ๐. ๐ โฆ ๐ = {(๐ฅ, ๐ง) ∈ ๐2 : ∃๐ฆ(๐ฆ ∈ ๐ ∧ (๐ฅ, ๐ฆ) ∈ ๐ ∧ (๐ฆ, ๐ง) ∈ ๐ )} = {(๐ฅ, ๐ง) ∈ ๐2 : ∃๐ฆ(๐ฆ ∈ ๐ ∧ ๐ฆ = ๐ฅ + 1 ∧ ๐ฆ2 + ๐ง2 = 1)} = {(๐ฅ, ๐ง) ∈ ๐2 : (๐ฅ + 1)2 + ๐ง2 = 1}. Section 4.1 RELATIONS 165 Therefore, ๐ โฆ ๐ is the circle with center (−1, 0) and radius 1. Example 4.1.10 shows that it is possible that ๐ โฆ ๐ ≠ ๐ โฆ ๐. However, we can change the order of the composition. THEOREM 4.1.12 If ๐ ⊆ ๐ด × ๐ต, ๐ ⊆ ๐ต × ๐ถ, and ๐ ⊆ ๐ถ × ๐ท, then ๐ โฆ (๐ โฆ ๐ ) = (๐ โฆ ๐) โฆ ๐ . PROOF Assume that ๐ ⊆ ๐ด × ๐ต, ๐ ⊆ ๐ต × ๐ถ, and ๐ ⊆ ๐ถ × ๐ท. Then, (๐, ๐) ∈ ๐ โฆ (๐ โฆ ๐ ) ⇔ ∃๐(๐ ∈ ๐ถ ∧ (๐, ๐) ∈ ๐ โฆ ๐ ∧ (๐, ๐) ∈ ๐ ) ⇔ ∃๐(๐ ∈ ๐ถ ∧ ∃๐(๐ ∈ ๐ต ∧ (๐, ๐) ∈ ๐ ∧ (๐, ๐) ∈ ๐) ∧ (๐, ๐) ∈ ๐ ) ⇔ ∃๐∃๐(๐ ∈ ๐ถ ∧ ๐ ∈ ๐ต ∧ (๐, ๐) ∈ ๐ ∧ (๐, ๐) ∈ ๐ ∧ (๐, ๐) ∈ ๐ ) ⇔ ∃๐∃๐(๐ ∈ ๐ต ∧ (๐, ๐) ∈ ๐ ∧ ๐ ∈ ๐ถ ∧ (๐, ๐) ∈ ๐ ∧ (๐, ๐) ∈ ๐ ) ⇔ ∃๐(๐ ∈ ๐ต ∧ (๐, ๐) ∈ ๐ ∧ ∃๐[๐ ∈ ๐ถ ∧ (๐, ๐) ∈ ๐ ∧ (๐, ๐) ∈ ๐ ]) ⇔ ∃๐(๐ ∈ ๐ต ∧ (๐, ๐) ∈ ๐ ∧ (๐, ๐) ∈ ๐ โฆ ๐) ⇔ (๐, ๐) ∈ (๐ โฆ ๐) โฆ ๐ . Inverses Let ๐ ⊆ ๐ด × ๐ต. We know that ๐ โฆ ๐ผ๐ด = ๐ and ๐ผ๐ต โฆ ๐ = ๐ [Exercise 10(a)]. If we want ๐ผ๐ด = ๐ผ๐ต , we need ๐ด = ๐ต so that ๐ is a relation on ๐ด. Then, ๐ โฆ ๐ผ๐ด = ๐ผ๐ด โฆ ๐ = ๐ . For example, if we again define ๐ = {(2, 4), (1, 3), (2, 5)} and view it as a relation on ๐, then ๐ composed on either side by ๐ผ๐ yields ๐ . To illustrate, consider the ordered pair (1, 3). It is an element of ๐ โฆ ๐ผ๐ because (1, 1) ∈ ๐ผ๐ ∧ (1, 3) ∈ ๐ , and it is also an element of ๐ผ๐ โฆ ๐ because (1, 3) ∈ ๐ ∧ (3, 3) ∈ ๐ผ๐ . Notice that not every identity relation will have this property. Using the same definition of ๐ as above, ๐ โฆ ๐ผ{1,2} = ๐ , but ๐ผ{1,2} โฆ ๐ = ∅. Now let us change the problem. Given a relation ๐ on ๐ด, can we find a relation ๐ on ๐ด such that ๐ โฆ ๐ = ๐ โฆ ๐ = ๐ผ๐ด ? The next definition is used to try to answer this question. 166 Chapter 4 RELATIONS AND FUNCTIONS DEFINITION 4.1.13 Let ๐ be a binary relation. The inverse of ๐ is the set ๐ −1 = {(๐ฆ, ๐ฅ) : (๐ฅ, ๐ฆ) ∈ ๐ }. For a relation ๐, we say that ๐ and ๐ are inverse relations if ๐ −1 = ๐. EXAMPLE 4.1.14 Let ๐ฟ be the less-than relation on ๐ (Example 4.1.3). Then, ๐ฟ−1 = {(๐ฆ, ๐ฅ) ∈ ๐2 : (๐ฅ, ๐ฆ) ∈ ๐ฟ} = {(๐ฆ, ๐ฅ) ∈ ๐2 : ๐ฅ < ๐ฆ} = {(๐ฆ, ๐ฅ) ∈ ๐2 : ๐ฆ > ๐ฅ}. This shows that less-than and greater-than are inverse relations. We now check whether ๐ โฆ ๐ −1 = ๐ −1 โฆ ๐ = ๐ผ๐ด for any relation ๐ on ๐ด. Consider ๐ = {(2, 1), (4, 3)}, which is a relation on {1, 2, 3, 4}. Then, ๐ −1 = {(1, 2), (3, 4)}, and we see that composing does not yield the identity on {1, 2, 3, 4} because ๐ โฆ ๐ −1 = {(1, 1), (3, 3)} = ๐ผ{1,3} and ๐ −1 โฆ ๐ = {(2, 2), (4, 4)} = ๐ผ{2,4} . The situation is worse when we define ๐ = {(2, 1), (2, 3), (4, 3)}. In this case, we have that ๐ −1 = {(1, 2), (3, 2), (3, 4)} but ๐ โฆ ๐ −1 = {(1, 1), (1, 3), (3, 1), (3, 3)} and ๐ −1 โฆ ๐ = {(2, 2), (2, 4), (4, 4)}. Neither of these compositions leads to an identity, but at least we have that {(1, 1), (3, 3)} ⊆ ๐ โฆ ๐ −1 and {(2, 2), (4, 4)} ⊆ ๐ −1 โฆ ๐. This can be generalized. THEOREM 4.1.15 ๐ผran(๐ ) ⊆ ๐ โฆ ๐ −1 and ๐ผdom(๐ ) ⊆ ๐ −1 โฆ ๐ for any binary relation ๐ . Section 4.1 RELATIONS 167 PROOF The first inclusion is proved in Exercise 12. To see the second inclusion, let ๐ฅ ∈ dom(๐ ). By definition, there exists ๐ฆ ∈ ran(๐ ) so that (๐ฅ, ๐ฆ) ∈ ๐ . Hence, (๐ฆ, ๐ฅ) ∈ ๐ −1 , which implies that (๐ฅ, ๐ฅ) ∈ ๐ −1 โฆ ๐ . Exercises 1. Let ๐ฟ ⊆ ๐ × ๐ be the less-than relation as defined in Example 4.1.3. Prove that ๐ฟ = {(๐, ๐) ∈ ๐ × ๐ : ๐ − ๐ ∈ ๐− }. 2. Find the domain and range of the given relations. (a) {(0, 1), (2, 3), (4, 5), (6, 7)} (b) {((๐, ๐), 1), ((๐, ๐), 2), ((๐, ๐), 3)} (c) ๐ × ๐ (d) ∅ × ∅ (e) ๐ × ∅ (f) {(๐ฅ, ๐ฆ) : ๐ฅ, ๐ฆ ∈ [0, 1] ∧ ๐ฅ < ๐ฆ} (g) {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฆ = 3} (h) {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฆ = |๐ฅ|} (i) {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 +√ ๐ฆ2 = 4} 2 (j) {(๐ฅ, ๐ฆ) ∈ ๐ : ๐ฆ ≤ ๐ฅ ∧ ๐ฅ ≥ 0} (k) {(๐ , ๐) : ∃๐ ∈ ๐(๐ (๐ฅ) = ๐๐ฅ ∧ ๐(๐ฅ) = ๐๐ฅ)} (l) {((๐, ๐), ๐ + ๐) : ๐, ๐ ∈ ๐} 3. Write ๐ โฆ ๐ as a roster. (a) ๐ = {(1, 0), (2, 3), (4, 6)}, ๐ (b) ๐ = {(1, 3), (2, 5), (3, 1)}, ๐ (c) ๐ = {(1, 2), (3, 4), (5, 6)}, ๐ (d) ๐ = {(1, 2), (3, 4), (5, 6)}, ๐ = {(1, 2), (2, 3), (3, 4)} = {(1, 3), (3, 1), (5, 2)} = {(1, 2), (3, 4), (5, 6)} = {(2, 1), (3, 5), (5, 7)} 4. Write ๐ โฆ ๐ using abstraction. (a) ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 + ๐ฆ2 = 1} ๐ = ๐2 (b) ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 + ๐ฆ2 = 1} ๐ =๐×๐ (c) ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 + ๐ฆ2 = 1} ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : (๐ฅ − 2)2 + ๐ฆ2 = 1} (d) ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 + ๐ฆ2 = 1} ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฆ = 2๐ฅ − 1} 5. Write the inverse of each relation. Use the abstraction method where appropriate. (a) ∅ (b) ๐ผ๐ (c) {(1, 0), (2, 3), (4, 6)} 168 Chapter 4 RELATIONS AND FUNCTIONS (d) (e) (f) (g) (h) {(1, 0), (1, 1), (2, 1)} ๐×๐ {(๐ฅ, sin ๐ฅ) : ๐ฅ ∈ ๐} {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ + ๐ฆ = 1} {(๐ฅ, ๐ฆ) ∈ ๐2 : ๐ฅ2 + ๐ฆ2 = 1} 6. Let ๐ ⊆ ๐ด × ๐ต and ๐ ⊆ ๐ต × ๐ถ. Show the following. (a) ๐ −1 โฆ ๐ −1 ⊆ ๐ถ × ๐ด. (b) dom(๐ ) = ran(๐ −1 ). (c) ran(๐ ) = dom(๐ −1 ). 7. Prove that if ๐ is a binary relation, (๐ −1 )−1 = ๐ . 8. Prove that (๐ โฆ ๐ )−1 = ๐ −1 โฆ ๐ −1 if ๐ ⊆ ๐ด × ๐ต and ๐ ⊆ ๐ต × ๐ถ. 9. Let ๐ , ๐ ⊆ ๐ด × ๐ต. Prove the following. (a) If ๐ ⊆ ๐, then ๐ −1 ⊆ ๐ −1 . (b) (๐ ∪ ๐)−1 = ๐ −1 ∪ ๐ −1 . (c) (๐ ∩ ๐)−1 = ๐ −1 ∩ ๐ −1 . 10. Let ๐ ⊆ ๐ด × ๐ต. (a) Prove ๐ โฆ ๐ผ๐ด = ๐ and ๐ผ๐ต โฆ ๐ = ๐ . (b) Show that if there exists a set ๐ถ such that ๐ด and ๐ต are subsets of ๐ถ, then ๐ โฆ ๐ผ๐ถ = ๐ผ๐ถ โฆ ๐ = ๐ . 11. Let ๐ ⊆ ๐ด × ๐ต and ๐ ⊆ ๐ต × ๐ถ. Show that ๐ โฆ ๐ = ∅ if and only if dom(๐) and ran(๐ ) are disjoint. 12. For any relation ๐ , prove ๐ผran(๐ ) ⊆ ๐ โฆ ๐ −1 . 13. Let ๐ ⊆ ๐ด × ๐ต. Prove. โ (a) {๐ฅ ∈ ๐ด : (๐ฅ, ๐) ∈ ๐ } = dom(๐ ). โ๐∈๐ต (b) ๐∈๐ด {๐ฆ ∈ ๐ต : (๐, ๐ฆ) ∈ ๐ } = ran(๐ ). 4.2 EQUIVALENCE RELATIONS In practice we usually do not write relations as sets of ordered pairs. We instead write propositions like 4 = 4 or 3 < 9. To copy this, we will introduce an alternate notation. DEFINITION 4.2.1 Let ๐ be a relation on ๐ด. For all ๐, ๐ ∈ ๐ด, ๐ ๐ ๐ if and only if (๐, ๐) ∈ ๐ , and ๐๐ ฬธ ๐ if and only if (๐, ๐) ∉ ๐ . For example, the less-than relation ๐ฟ (Example 4.1.3) is usually denoted by <, and we write 2 < 3 instead of (2, 3) ∈ ๐ฟ or (2, 3) ∈ <. Section 4.2 EQUIVALENCE RELATIONS 169 EXAMPLE 4.2.2 Define the relation ๐ on ๐ by ๐ = {(๐, ๐) ∈ ๐ × ๐ : ∃๐ ∈ ๐(๐ = ๐๐ ∧ ๐ ≠ 0)}. Therefore, for all ๐, ๐ ∈ ๐, ๐ ๐ ๐ if and only if ๐ divides ๐. Therefore, 4 ๐ 8 but 8๐ ฬธ 4. Relations can have different properties depending on their definitions. Here are three important examples using relations on ๐ด = {1, 2, 3}. โ {(1, 1), (2, 2), (3, 3)} has the property that every element of ๐ด is related to itself. โ {(1, 2), (2, 1), (2, 3), (3, 2)} has the property that if ๐ is related to ๐, then ๐ is related to ๐. โ {(1, 2), (2, 3), (1, 3)} has the property that if ๐ is related to ๐ and ๐ is related to ๐, then ๐ is related to ๐. These examples lead to the following definitions. DEFINITION 4.2.3 Let ๐ be a relation on ๐ด. โ ๐ is reflexive if ๐ ๐ ๐ for all ๐ ∈ ๐ด. โ ๐ is symmetric when for all ๐, ๐ ∈ ๐ด, if ๐ ๐ ๐, then ๐ ๐ ๐. โ ๐ is transitive means that for all ๐, ๐, ๐ ∈ ๐ด, if ๐ ๐ ๐ and ๐ ๐ ๐, then ๐ ๐ ๐. Notice that the relation in Example 4.2.2 is not reflexive because 0 does not divide 0 and is not symmetric because 4 divides 8 but 8 does not divide 4, but it is transitive because if ๐ divides ๐ and ๐ divides ๐, then ๐ divides ๐. When a relation is reflexive, symmetric, and transitive, it behaves very much like an identity relation (Example 4.1.2). Such relations play an important role in mathematics, so we name them. DEFINITION 4.2.4 A relation ๐ on ๐ด is an equivalence relation if ๐ is reflexive, symmetric, and transitive. Observe that the relation in Example 4.2.2 is not an equivalence relation. However, any identity relation is an equivalence relation. We see this assumption at work in the next example. 170 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.2.5 Let ๐ be a relation on ๐ × (๐ โงต {0}) so that for all ๐, ๐ ∈ ๐ and ๐, ๐ ∈ ๐ โงต {0}, (๐, ๐) ๐ (๐, ๐) if and only if ๐๐ = ๐๐. To see that this is an equivalence relation, let (๐, ๐), (๐, ๐), and (๐, ๐ ) be elements of ๐ × (๐ โงต {0}). โ (๐, ๐) ๐ (๐, ๐) since ๐๐ = ๐๐. โ Assume (๐, ๐) ๐ (๐, ๐). Then, ๐๐ = ๐๐. This implies that ๐๐ = ๐๐, so (๐, ๐) ๐ (๐, ๐). โ Let (๐, ๐) ๐ (๐, ๐) and (๐, ๐) ๐ (๐, ๐ ). This gives ๐๐ = ๐๐ and ๐๐ = ๐๐. Therefore, (๐, ๐) ๐ (๐, ๐ ) because ๐๐ = ๐๐๐ = ๐๐. ๐ EXAMPLE 4.2.6 Take ๐ ∈ ๐+ and let ๐, ๐, and ๐ be integers. Define ๐ to be congruent to ๐ modulo ๐ and write ๐ ≡ ๐ (mod ๐) if and only if ๐ โฃ ๐ − ๐. That is, ๐ ≡ ๐ (mod ๐) if and only if ๐ = ๐ + ๐๐ for some ๐ ∈ ๐. For example, we have that 7 ≡ 1 (mod 3), 1 ≡ 13 (mod 3), and 27 ≡ 0 (mod 3), but 2 โข 9 (mod 3) and 25 โข 0 (mod 3). Congruence modulo ๐ defines the relation ๐ ๐ = {(๐, ๐) : ๐ ≡ ๐ (mod ๐)}. Observe that ๐ ๐ = {(๐, ๐) : ๐ โฃ ๐ − ๐} = {(๐, ๐) : ∃๐(๐ ∈ ๐ ∧ ๐ = ๐ + ๐๐)}. Prove that ๐ ๐ is an equivalence relation. โ (๐, ๐) ∈ ๐ ๐ because ๐ ≡ ๐ (mod ๐). โ Assume (๐, ๐) ∈ ๐ ๐ . This implies that ๐ โฃ ๐ − ๐. By Exercise 2.4.18, ๐ โฃ ๐ − ๐. Hence, (๐, ๐) ∈ ๐ ๐ . โ Let (๐, ๐), (๐, ๐) ∈ ๐ ๐ . Then ๐ = ๐ + ๐๐ and ๐ = ๐ + ๐๐ for some ๐, ๐ ∈ ๐. Substitution yields ๐ = ๐ + ๐๐ + ๐๐ = ๐ + ๐(๐ + ๐). Since the sum of two integers is an integer, (๐, ๐) ∈ ๐ ๐ . 171 Section 4.2 EQUIVALENCE RELATIONS Equivalence Classes Let ๐ be a relation on {1, 2, 3, 4} such that ๐ = {(1, 2), (1, 3), (2, 4)}. (4.1) Observe that 1 is related to 2 and 3, 2 is related to 4, and 3 and 4 are not related to any number. Combining the elements that are related to a particular element results in a set named by the next definition. DEFINITION 4.2.7 Let ๐ be a relation on ๐ด with ๐ ∈ ๐ด. The class of ๐ with respect to ๐ is the set [๐]๐ = {๐ฅ ∈ ๐ด : ๐ ๐ ๐ฅ}. If ๐ is an equivalence relation, [๐]๐ is called an equivalence class. We often denote [๐]๐ by [๐] if the relation is clear from context. Using ๐ as defined in (4.1), [1]๐ = {2, 3}, [2]๐ = {4}, and [3]๐ = [4]๐ = ∅. If ๐ had been an equivalence relation on a set ๐ด, then [๐]๐ would be nonempty for all ๐ ∈ ๐ด because ๐ would be an element of [๐]๐ (Exercise 17). EXAMPLE 4.2.8 Let ๐ be the equivalence relation from Example 4.2.5. We prove that [(1, 3)] = {(๐, 3๐) : ๐ ∈ ๐ โงต {0}}. To see this, take (๐, ๐) ∈ [(1, 3)]. This means that (1, 3) ๐ (๐, ๐), so ๐ = 3๐ and ๐ ≠ 0 because ๐ ≠ 0. Hence, (๐, ๐) = (๐, 3๐). Conversely, let ๐ ≠ 0. Then, (1, 3) ๐ (๐, 3๐) because 1 ⋅ 3๐ = 3 ⋅ ๐. Thus, (๐, 3๐) ∈ [(1, 3)]. EXAMPLE 4.2.9 Using the notation of Example 4.2.6, let ๐ ๐ be the relation defined by congruence modulo ๐. For all ๐ ∈ ๐, define [๐]๐ = ๐ ๐ (๐). Therefore, when ๐ = 5, the equivalence classes are: [0]5 [1]5 [2]5 [3]5 [4]5 = {… , −10, −5, 0, 5, 10, … }, = {… , −9, −4, 1, 6, 11, … }, = {… , −8, −3, 2, 7, 12, … }, = {… , −7, −2, 3, 8, 13, … }, = {… , −6, −1, 4, 9, 14, … }. 172 Chapter 4 RELATIONS AND FUNCTIONS In addition, [๐]5 = [๐]5 if and only if ๐ ≡ ๐ (mod 5). The collection of all equivalence classes of a relation is a set named by the next definition. DEFINITION 4.2.10 Let ๐ be an equivalence relation on ๐ด. The quotient set of ๐ด modulo ๐ is ๐ดโ๐ = {[๐]๐ : ๐ ∈ ๐ด}. Observe by Exercise 3 that it is always the case that โ ๐ด= ๐ดโ๐ . (4.2) EXAMPLE 4.2.11 Let ๐ ∈ ๐+ . The quotient set ๐โ๐ ๐ is denoted by ๐๐ . That is, ๐๐ = {[0]๐ , [1]๐ , … , [๐ − 1]๐ }. EXAMPLE 4.2.12 Define the relation ๐ on ๐2 by (๐, ๐) ๐ (๐, ๐) if and only if ๐ − ๐ = ๐ − ๐. ๐ is an equivalence relation by Exercise 2. We note that for any (๐, ๐) ∈ ๐2 , [(๐, ๐)] = {(๐ฅ, ๐ฆ) : (๐ฅ, ๐ฆ) ๐ (๐, ๐)} = {(๐ฅ, ๐ฆ) : ๐ฆ − ๐ฅ = ๐ − ๐} = {(๐ฅ, ๐ฆ) : ๐ฆ = ๐ฅ + (๐ − ๐)}. Therefore, the equivalence class of (๐, ๐) is the line with a slope of 1 and a ๐ฆintercept equal to (0, ๐ − ๐). The equivalence classes of (0, 1.5) and (0, −1) are illustrated in the graph in Figure 4.3. The quotient set ๐2 โ๐ is the collection of all such lines. Notice that โ ๐2 = ๐2 โ๐ . Partitions In Example 4.2.12, we saw that ๐2 is the union of all the lines with slope equal to 1, and since the lines are parallel, they form a pairwise disjoint set. These properties can be observed in the other equivalence relations that we have seen. Each set is equal to the union of the equivalence classes, and the quotient set is pairwise disjoint. Generalizing these two properties leads to the next definition. Section 4.2 EQUIVALENCE RELATIONS 173 R(0, 1.5) R(0, −1) Figure 4.3 Two equivalence classes in ๐2 when (๐, ๐) ๐ (๐, ๐) if and only if ๐ − ๐ = ๐ − ๐. DEFINITION 4.2.13 Let ๐ด be a nonempty set. The family ๐ซ is a partition of ๐ด if and only if โ ๐ซ ⊆ P(๐ด), โ โ ๐ซ = ๐ด, โ ๐ซ is pairwise disjoint. To illustrate the definition, let ๐ด = {1, 2, 3, 4, 5, 6, 7} and define the elements of the partition to be ๐ด0 = {1, 2, 5}, ๐ด1 = {3}, and ๐ด2 = {4, 6, 7}. The family ๐ซ = {๐ด1 , ๐ด2 , ๐ด3 } is a subset of P(๐ด), ๐ด = ๐ด0 ∪ ๐ด1 ∪ ๐ด2 , and ๐ซ is pairwise disjoint. Therefore, ๐ซ is a partition of ๐ด. This is illustrated in Figure 4.4. A1 A2 1 2 3 5 6 4 7 A3 Figure 4.4 A partition of the set ๐ด = {1, 2, 3, 4, 5, 6, 7}. 174 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.2.14 For each real number ๐ ≥ 0, define ๐ถ๐ to be the circle with radius ๐ centered at the origin. Namely, { } √ ๐ถ๐ = (๐ฅ, ๐ฆ) : ๐ฅ2 + ๐ฆ2 = ๐ . Let ๐ = {๐ถ๐ : ๐ ∈ [0, ∞)}. We claim that ๐ is a partition of ๐2 . โ ๐ ⊆ P(๐2 ) because ๐ถ๐ ⊆ ๐2 for all ๐ ≥ 0. โ โ โ To prove that ๐2 = ๐∈[0,∞) ๐ถ๐ , it suffices to show that ๐2 ⊆ ๐∈[0,∞) ๐ถ๐ , but this follows because if (๐, ๐) ∈ ๐2 , then (๐, ๐) ∈ ๐ถ√ ๐2 +๐2 . โ To see that ๐ is pairwise disjoint, let ๐, ๐ ≥ 0 and assume that (๐, ๐) is an element of ๐ถ๐ ∩ ๐ถ๐ . Then, √ ๐ = ๐2 + ๐2 = ๐ , which implies that ๐ถ๐ = ๐ถ๐ . โ The set ๐5 is a family of subsets of ๐, has the property that ๐5 = ๐, and is pairwise disjoint. Hence, ๐5 is a partition for ๐. We generalize this result to the next theorem. It uses an arbitrary equivalence relation on a given set to define a partition for that set. In this case, we say that the equivalence relation induces the partition. THEOREM 4.2.15 If ๐ is an equivalence relation on ๐ด, then ๐ดโ๐ is a partition of ๐ด. PROOF Take a set ๐ด with an equivalence relation ๐ . โ Since an equivalence class is a subset of ๐ด, we have ๐ดโ๐ ⊆ P(๐ด). โ โ ๐ดโ๐ = ๐ด is (4.2). โ Let [๐], [๐] ∈ ๐ดโ๐ and assume that there exists ๐ฆ ∈ [๐] ∩ [๐]. In other words, ๐ ๐ ๐ฆ and ๐ ๐ ๐ฆ. Now take ๐ฅ ∈ [๐]. This means that ๐ ๐ ๐ฅ. Since ๐ฅ ๐ ๐ and ๐ฆ ๐ ๐ by symmetry, we have that ๐ฅ ๐ ๐ฆ, and then ๐ฅ ๐ ๐ by transitivity. Thus, ๐ฅ ∈ [๐], which shows [๐] ⊆ [๐]. Similarly, [๐] ⊆ [๐], so [๐] = [๐]. The collection of equivalence relations forms a partition of a set. Conversely, if we have a partition of a set, the partition gives rise to an equivalence relation on the set. To see this, take any set ๐ด and a partition ๐ซ of ๐ด. For all ๐, ๐ ∈ ๐ด, define ๐ ๐ ๐ if and only if there exists ๐ถ ∈ ๐ซ such that ๐, ๐ ∈ ๐ถ. Section 4.2 EQUIVALENCE RELATIONS 175 To show that ๐ is an equivalence relation, take ๐, ๐, and ๐ in ๐ด. โ โ Since ๐ ∈ ๐ด and ๐ด = ๐ซ , there exists ๐ถ ∈ ๐ซ such that ๐ ∈ ๐ถ. Therefore, ๐ ๐ ๐. โ Assume ๐ ๐ ๐. This means that ๐, ๐ ∈ ๐ถ for some ๐ถ ∈ ๐ซ . This, of course, is the same as ๐, ๐ ∈ ๐ถ. Hence, ๐ ๐ ๐. โ Suppose ๐ ๐ ๐ and ๐ ๐ ๐. Then, there are sets ๐ถ and ๐ท in ๐ซ so that ๐, ๐ ∈ ๐ถ and ๐, ๐ ∈ ๐ท. This means that ๐ถ ∩ ๐ท ≠ ∅. Since ๐ซ is pairwise disjoint, ๐ถ = ๐ท. So, ๐ and ๐ are elements of ๐ถ, and we have ๐ ๐ ๐. This equivalence relation is said to be induced from the partition. EXAMPLE 4.2.16 The sets {… , −10, −5, 0, 5, 10, … }, {… , −9, −4, 1, 6, 11, … }, {… , −8, −3, 2, 7, 12, … }, {… , −7, −2, 3, 8, 13, … }, {… , −6, −1, 4, 9, 14, … }, form a collection that is a partition of ๐. The equivalence relation that is induced from this partition is congruence modulo 5 (Example 4.2.9). Exercises 1. For all ๐, ๐ ∈ ๐ โงต {0}, let ๐ ๐ ๐ if and only if ๐๐ > 0. (a) Show that ๐ is an equivalence relation on ๐ โงต {0}. (b) Find [1] and [−3]. 2. Define the relation ๐ on ๐2 by (๐, ๐) ๐ (๐, ๐) if and only if ๐ − ๐ = ๐ − ๐. Prove that ๐ is an equivalence relation. โ 3. Let ๐ be an equivalence relation on ๐ด. Prove that ๐ด = ๐∈๐ด [๐]. 4. Prove that if ๐ถ is an equivalence class for some equivalence relation ๐ and ๐ ∈ ๐ถ, then ๐ถ = [๐]. 5. For all ๐, ๐ ∈ ๐, let ๐ ๐ ๐ if and only if |๐| = |๐|. (a) Prove ๐ is an equivalence relation on ๐. (b) Sketch the partition of ๐ induced by this equivalence relation. 6. For all (๐, ๐), (๐, ๐) ∈ ๐ × ๐, define (๐, ๐) ๐ (๐, ๐) if and only if ๐๐ = ๐๐. (a) Show that ๐ is an equivalence relation on ๐ × ๐. (b) What is the equivalence class of (1, 2)? (c) Sketch the partition of ๐ × ๐ induced by this equivalence relation. 7. Let ๐ด be a set and ๐ ∈ ๐ด. Show that the given relations are not equivalence relations on P(๐ด). 176 Chapter 4 RELATIONS AND FUNCTIONS (a) For all ๐ถ, ๐ท ⊆ ๐ด, define ๐ถ ๐ ๐ท if and only if ๐ถ ∩ ๐ท ≠ ∅. (b) For all ๐ถ, ๐ท ⊆ ๐ด, define ๐ถ ๐ ๐ท if and only if ๐ ∈ ๐ถ ∩ ๐ท. 8. Let ๐ง, ๐ง′ ∈ ๐ and write ๐ง = ๐ + ๐๐ and ๐ง′ = ๐′ + ๐′ ๐. Define ๐ง = ๐ง′ to mean that ๐ = ๐′ and ๐ = ๐′ . Prove that ๐ is an equivalence relation. 9. Find. (a) (b) (c) (d) [3]5 [12]6 [2]5 ∪ [27]5 [4]7 ∩ [5]7 10. Let ๐ be the remainder obtained when ๐ is divided by ๐. Prove [๐]๐ = [๐]๐ . 11. Let ๐, ๐ ∈ ๐ and suppose that gcd(๐, ๐) = 1. Prove. (a) There exists ๐ such that ๐๐ ≡ 1 (mod ๐). (Notice that ๐ is that multiplicative inverse of ๐ modulo ๐.) (b) If ๐๐ ≡ ๐๐ (mod ๐), then ๐ ≡ ๐ (mod ๐). (c) Prove that the previous implication is false if gcd(๐, ๐) ≠ 1. 12. Prove that {(๐, ๐ + 1] : ๐ ∈ ๐} is a partition of ๐. 13. Prove that the following are partitions of ๐2 . (a) ๐ซ = {{(๐, ๐)} : ๐, ๐ ∈ ๐}. (b) ๐ฑ = {{(๐, ๐ฆ) : ๐ฆ ∈ ๐} : ๐ ∈ ๐}. (c) โ = {๐ × (๐, ๐ + 1] : ๐ ∈ ๐}. 14. Is {[๐, ๐ + 1] × (๐, ๐ + 1) : ๐ ∈ ๐} a partition of ๐2 ? Explain. 15. Let ๐ be a relation on ๐ด and show the following: (a) ๐ is reflexive if and only if ๐ −1 is reflexive. (b) ๐ is symmetric if and only if ๐ = ๐ −1 . (c) ๐ is symmetric if and only if (๐ด × ๐ด) โงต ๐ is symmetric. 16. Let ๐ and ๐ be equivalence relations on ๐ด. Prove or show false. (a) ๐ ∪ ๐ is an equivalence relation on ๐ด. (b) ๐ ∩ ๐ is an equivalence relation on ๐ด. 17. Prove for all relations ๐ on ๐ด. (a) ๐ is reflexive if and only if ∀๐(๐ ∈ [๐]). (b) ๐ is symmetric if and only if ∀๐∀๐(๐ ∈ [๐] ↔ ๐ ∈ [๐]). (c) ๐ is transitive if and only if ∀๐∀๐∀๐([๐ ∈ [๐] ∧ ๐ ∈ [๐]] → ๐ ∈ [๐]). (d) ๐ is an equivalence relation if and only if ∀๐∀๐((๐, ๐) ∈ ๐ ↔ [๐] = [๐]). 18. Let ๐ be a relation on ๐ด with the property that if ๐ ๐ ๐ and ๐ ๐ ๐, then ๐ ๐ ๐. Prove that if ๐ is also reflexive, ๐ is an equivalence relation. 19. Define the relation ๐ on ๐ by ๐ + ๐๐ ๐ ๐ + ๐๐ if and only if √ √ ๐2 + ๐2 = ๐ 2 + ๐ 2 . Section 4.3 PARTIAL ORDERS 177 (a) Prove ๐ is an equivalence relation on ๐. (b) Graph [1 + ๐] in the complex plane. (c) Describe the partition that ๐ induces on ๐. 20. Let ๐ and ๐ be relations on ๐ด. The symmetric closure of ๐ is ๐ if ๐ ⊆ ๐ and for all symmetric relations ๐ on ๐ด such that ๐ ⊆ ๐ , then ๐ ⊆ ๐ . Prove the following. (a) ๐ ∪ ๐ −1 is the symmetric closure of ๐ . (b) A symmetric closure is unique. 4.3 PARTIAL ORDERS While equivalence relations resemble equality, there are other common relations in mathematics that we can model. To study some of their attributes, we expand Definition 4.2.3 with three more properties. DEFINITION 4.3.1 Let ๐ be a relation on ๐ด. โ ๐ is irreflexive if ๐ ๐ ฬธ ๐ for all ๐ ∈ ๐ด. โ ๐ is asymmetric when for all ๐, ๐ ∈ ๐ด, if ๐ ๐ ๐, then ๐ ๐ ฬธ ๐. โ ๐ is antisymmetric means that for all ๐, ๐ ∈ ๐ด, if ๐ ๐ ๐ and ๐ ๐ ๐, then ๐ = ๐. Notice that a relation on a nonempty set cannot be both reflexive and irreflexive. However, many relations have neither property. For example, consider the relation ๐ = {(1, 1)} on {1, 2}. Since (1, 1) ∈ ๐ , the relation ๐ is not irreflexive, and ๐ is not reflexive because (2, 2) ∉ ๐ . Likewise, a relation on a nonempty set cannot be both symmetric and asymmetric. EXAMPLE 4.3.2 The less-than relation on ๐ is irreflexive and asymmetric. It is also antisymmetric. To see this, let ๐, ๐ ∈ ๐. Since ๐ < ๐ and ๐ < ๐ is false, the implication if ๐ < ๐ and ๐ < ๐, then ๐ = ๐ is true. The ≤ relation is also antisymmetric. However, ≤ is neither irreflexive nor asymmetric since 3 ≤ 3. EXAMPLE 4.3.3 Let ๐ = {(1, 2)} and ๐ = {(1, 2), (2, 1)}. Both are relations on {1, 2}. The first relation is asymmetric since 1 ๐ 2 but 2 ๐ ฬธ 1. It is also antisymmetric, but ๐ is not antisymmetric because 2 ๐ 1 and 1 ๐ 2. Both the relations are irreflexive. 178 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.3.4 Let ๐ be a relation on a set ๐ด. We prove that ๐ is antisymmetric if and only if ๐ ∩ ๐ −1 ⊆ ๐ผ๐ด . โ Assume that ๐ is antisymmetric and take (๐, ๐) ∈ ๐ ∩ ๐ −1 . This means that (๐, ๐) ∈ ๐ and (๐, ๐) ∈ ๐ −1 . Therefore, (๐, ๐) ∈ ๐ , and since ๐ is antisymmetric, ๐ = ๐. โ Now suppose ๐ ∩ ๐ −1 ⊆ ๐ผ๐ด . Let (๐, ๐), (๐, ๐) ∈ ๐ . We conclude that (๐, ๐) ∈ ๐ −1 , which implies that (๐, ๐) ∈ ๐ ∩ ๐ −1 . Then, (๐, ๐) ∈ ๐ผ๐ด . Hence, ๐ = ๐. As an equivalence relation is a generalization of an identity relation, the following relation is a generalization of ≤ on ๐. For this reason, instead of naming the relation ๐ , it is denoted by the symbol 4. DEFINITION 4.3.5 If a relation 4 on a set ๐ด is reflexive, antisymmetric, and transitive, 4 is a partial order on ๐ด and the ordered pair (๐ด, 4) is called a partially ordered set (or simply a poset). Furthermore, for all ๐, ๐ ∈ ๐ด, the notation ๐ โบ ๐ means ๐ 4 ๐ but ๐ ≠ ๐. For example, ≤ and = are partial orders on ๐, but < is not a partial order on ๐ because the relation < is not reflexive. Although = is a partial order on any set, in general an equivalence relation is not a partial order (Example 4.2.6). EXAMPLE 4.3.6 Divisibility (Definition 2.4.2) is a partial order on ๐+ . To prove this, let ๐, ๐, and ๐ be positive integers. โ ๐ โฃ ๐ since ๐ = ๐ ⋅ 1 and ๐ ≠ 0. โ Suppose that ๐ โฃ ๐ and ๐ โฃ ๐. This means that ๐ = ๐๐ and ๐ = ๐๐, for some ๐, ๐ ∈ ๐+ . Hence, ๐ = ๐๐๐, so ๐๐ = 1. Since ๐ and ๐ are positive integers, ๐ = ๐ = 1. That is, ๐ = ๐. โ Assume that we have ๐ โฃ ๐ and ๐ โฃ ๐. This means that ๐ = ๐๐ and ๐ = ๐๐ for some ๐, ๐ ∈ ๐+ . By substitution, ๐ = (๐๐)๐ = ๐(๐๐). Hence, ๐ โฃ ๐. EXAMPLE 4.3.7 Let ๐ธ be a collection of symbols and let ๐ธ∗ denote the set of all strings over ๐ธ (as on page 5). Use the symbol to denote the empty string, the string of length zero. As with the empty set, the empty string is always an element of ๐ธ∗ . For example, if ๐ธ = {๐, ๐, ๐}, then ๐๐๐, ๐๐๐๐๐๐, ๐, and are elements of ๐ธ∗ . Now, take ๐, ๐ ∈ ๐ธ∗ . The concatenation of ๐ and ๐ is denoted by ๐ โข ๐ and is the Section 4.3 PARTIAL ORDERS 000 001 010 00 011 100 01 101 110 10 0 179 111 11 1 โก Figure 4.5 A partial order defined on {0, 1}∗ . string consisting of the elements of ๐ followed by those of ๐. For example, if ๐ = 011 and ๐ = 1010, then ๐ โข ๐ = 0111010. Finally, for all ๐, ๐ ∈ ๐ธ∗ , define ๐ 4 ๐ if and only if there exists ๐ ∈ ๐ธ∗ such that ๐ = ๐ โข ๐. It can be shown that 4 is a partial order on ๐ธ∗ (Exercise 8) with the structure seen in Figure 4.5 for ๐ธ = {0, 1}. The partial order ≤ on ๐ has the property that for all ๐, ๐ ∈ ๐, either ๐ < ๐, ๐ < ๐, or ๐ = ๐. As we see in Figure 4.5, this is not the case for every partially ordered set. We do, however, have the following slightly weaker property. THEOREM 4.3.8 [Weak-Trichotomy Law] If 4 is a partial order on ๐ด, for all ๐, ๐ ∈ ๐ด, at most one of the following are true: ๐ โบ ๐, ๐ โบ ๐, or ๐ = ๐. PROOF Let ๐, ๐ ∈ ๐ด. We have three cases to consider. โ Suppose ๐ โบ ๐. This means that ๐ 4 ๐ and ๐ ≠ ๐. If in addition ๐ โบ ๐, by transitivity ๐ โบ ๐, which is a contradiction. โ That ๐ โบ ๐ precludes both ๐ โบ ๐ and ๐ = ๐ is proved like the first case. โ If ๐ = ๐, then by definition of โบ it is impossible for ๐ โบ ๐ or ๐ โบ ๐ to be true. Technically, subset is not a relation, but in a natural way, it can be considered as one. Let โฑ be a family of sets. Define ๐ = {(๐ด, ๐ต) : ๐ด ⊆ ๐ต ∧ ๐ด, ๐ต ∈ โฑ }. Associate ⊆ with the relation ๐. EXAMPLE 4.3.9 Let ๐ด be a nonempty set. We show that (P(๐ด), ⊆) is a partially ordered set. Let ๐ต, ๐ถ, and ๐ท be subsets of ๐ด. 180 Chapter 4 RELATIONS AND FUNCTIONS โ Since ๐ต ⊆ ๐ต, the relation ⊆ is reflexive. โ Since ๐ต ⊆ ๐ถ and ๐ถ ⊆ ๐ต implies that ๐ต = ๐ถ (Definition 3.3.7), ⊆ is antisymmetric. โ Since ๐ต ⊆ ๐ถ and ๐ถ ⊆ ๐ท implies ๐ต ⊆ ๐ท (Theorem 3.3.6), we see that ⊆ is transitive. This example is in line with what we know about subsets. For instance, if ๐ด ⊂ ๐ต, then we conclude that ๐ต ⊄ ๐ด and ๐ด ≠ ๐ต, which is what we expect from the weak-trichotomy law (4.3.8). Bounds Let 4 be a partial order on ๐ด with elements ๐ and ๐′ such that ๐ 4 ๐ and ๐ 4 ๐′ for all ๐ ∈ ๐ด. In particular, this implies that ๐ 4 ๐′ and ๐′ 4 ๐, so since 4 is antisymmetric, we conclude that ๐ = ๐′ . Similarly, if ๐ 4 ๐ and ๐′ 4 ๐ for all ๐ ∈ ๐ด, then ๐ = ๐′ . This argument justifies the use of the word the in the next definition. DEFINITION 4.3.10 Let (๐ด, 4) be a poset and ๐ ∈ ๐ด. โ ๐ is the least element of ๐ด (with respect to 4) if ๐ 4 ๐ for all ๐ ∈ ๐ด. โ ๐ is the greatest element of ๐ด (with respect to 4) if ๐ 4 ๐ for all ๐ ∈ ๐ด. There is no guarantee that a partially ordered set will have a least or a greatest element. In Example 4.3.9, the greatest element of P(๐ด) with respect to ⊆ is ๐ด and the least element is ∅. However, in Example 4.3.7 (Figure 4.5), the least element of ๐ธ∗ is , but there is no greatest element. EXAMPLE 4.3.11 If ๐ด is a finite set of real numbers, ๐ด has a least and a greatest element with respect to ≤. Under the partial order ≤, the set ๐+ also has a least element but no greatest element, and both {5๐ : ๐ ∈ ๐− } and ๐− have greatest elements but no least elements. To show that ๐ต = {5๐ : ๐ ∈ ๐+ } has a least element with respect to ≤, use the fact that the least element of ๐+ is 1. Therefore, the least element of ๐ต is 5 because 5 ∈ ๐ต and 5(1) ≤ 5๐ for all ๐ ∈ ๐+ . Some sets will not have a least or greatest element with a given partial order but there will still be elements that are considered greater or lesser than every element of the set. DEFINITION 4.3.12 Let 4 be a partial order on ๐ด and ๐ต ⊆ ๐ด. โ ๐ข ∈ ๐ด is an upper bound of ๐ต if ๐ 4 ๐ข for all ๐ ∈ ๐ต. The element ๐ข is the least upper bound of ๐ต if it is an upper bound and for all upper bounds ๐ข′ of ๐ต, ๐ข 4 ๐ข′ . 181 Section 4.3 PARTIAL ORDERS โ ๐ ∈ ๐ด is a lower bound of ๐ต if ๐ 4 ๐ for all ๐ ∈ ๐ต. The element ๐ is the greatest lower bound of ๐ต if it is a lower bound and for all lower bounds ๐′ of ๐ต, ๐′ 4 ๐. In the definition we can write the word the because the order is antisymmetric. EXAMPLE 4.3.13 The interval (3, 5) is a subset of ๐. Under the partial order ≤, both 5 and 10 are upper bounds of this interval, while 5 is a least upper bound. Also, 3 and −๐ are lower bounds, but 3 is the greatest lower bound. EXAMPLE 4.3.14 Assume โ that ๐ ⊆ P(๐). Since the elements of ๐ are subsets โ of ๐, we conclude โ that ๐ ∈ P(๐) (Definition 3.4.8). If ๐ด ∈ ๐ , then ๐ด ⊆ ๐ . Therefore, ๐ is an upper bound of ๐ with respect to ⊆. To see that it is the least upper bound, โ let ๐ be any upper bound of ๐ . Take ๐ฅ ∈ ๐ . This means that there exists ๐ท ∈ ๐ such that ๐ฅ ∈ ๐ท. Since ๐ is an upper bound of ๐ , ๐ท ⊆ ๐ . Hence, โ ๐ฅ ∈ ๐ , and we conclude that ๐ ⊆ ๐ . Comparable and Compatible Elements In Figure 4.5, we see that 01 4 010 and 01 4 011. However, 010 โ 011 and 011 โ 010. This means that in the poset of Example 4.3.7, there are pairs of elements that are related to each other and there are other pairs that are not. DEFINITION 4.3.15 Let (๐ด, 4) be a poset and ๐, ๐ ∈ ๐ด. If ๐ 4 ๐ or ๐ 4 ๐, then ๐ and ๐ are comparable with respect to 4. Elements of ๐ด are incomparable if they are not comparable. Continuing our review of the partially ordered set of Example 4.3.7, we note that the element has the property that no element is less than it, but as seen in Figure 4.5, for every element of ๐ธ∗ , there exists an element of ๐ธ∗ that is greater. However, that same relation defined on ๐ด = {๐ ∈ ๐ธ∗ : ๐ has at most 3 characters} (4.3) has the property that 000, 001, 010, 011, 100, 101, 110, 111 have no elements greater than them. This leads to the next definition. DEFINITION 4.3.16 Let (๐ด, 4) be a poset and ๐ ∈ ๐ด. โ ๐ is a minimal element of ๐ด (with respect to 4) if ๐ โ ๐ for all ๐ ∈ ๐ด. โ ๐ is a maximal element of ๐ด (with respect to 4) if ๐ โ ๐ for all ๐ ∈ ๐ด. 182 Chapter 4 RELATIONS AND FUNCTIONS Therefore, the empty string is a minimal element of ๐ด (4.3), and 000, 001, 010, 011, 100, 101, 110, 111 are maximal. Notice that every least element is minimal and every greatest element is maximal. Although not every pair of elements is comparable in the partially ordered set of Example 4.3.7, there are infinite sequences of comparable elements, such as 4 0 4 01 4 001 4 0001 4 · · · . DEFINITION 4.3.17 A subset ๐ถ of the poset (๐ด, 4) is a chain with respect to 4 if ๐ is comparable to ๐ for all ๐, ๐ ∈ ๐ถ. When ๐ is partially ordered by ≤, the sets {0, 1, 2, 3, … }, {… , −3, −2, −1, 0}, and {… , −2, 0, 2, 4, … } are chains. In P(๐), both {{1, 2}, {1, 2, 3, 4}, {1, 2, 3, 4, 5, 6}} and {∅, {0}, {0, 1}, {0, 1, 2}, … } are chains with respect to ⊆. EXAMPLE 4.3.18 To see that {๐ด๐ : ๐ ∈ ๐+ } where ๐ด๐ = {๐ฅ ∈ ๐ : (๐ฅ − 1)(๐ฅ − 2) · · · (๐ฅ − ๐) = 0} is a chain with respect to ⊆, take ๐, ๐ ∈ ๐+ . By definition, ๐ด๐ = {1, 2, … , ๐} and ๐ด๐ = {1, 2, … , ๐}. If ๐ ≤ ๐, then ๐ด๐ ⊆ ๐ด๐ , otherwise ๐ด๐ ⊆ ๐ด๐ . EXAMPLE 4.3.19 Let ๐ถ0 and ๐ถ1 be chains of ๐ด with respect to 4. Take ๐, ๐ ∈ ๐ถ0 ∩ ๐ถ1 . Then, ๐, ๐ ∈ ๐ถ0 , so ๐ 4 ๐ or ๐ 4 ๐. Therefore, ๐ถ0 ∩ ๐ถ1 is a chain. However, the union of two chains might not be a chain. For example, {{1}, {1, 2}} and {{1}, {1, 3}} are chains in (P(๐), ⊆), but {{1}, {1, 2}, {1, 3}} is not a chain because {1, 2} * {1, 3} and {1, 3} * {1, 2}. The sets ๐, ๐, and ๐ are chains of ๐ with respect to ≤. In fact, any subset of ๐ is a chain of ๐ because subsets of chains are chains. This motivates the next definition. DEFINITION 4.3.20 The poset (๐ด, 4) is a linearly ordered set and 4 is a linear order if ๐ด is a chain with respect to 4. Section 4.3 PARTIAL ORDERS 183 Since every subset ๐ด of ๐ is a chain with respect to ≤, the relation ≤ is a linear order on ๐ด. Furthermore, since every pair of elements in a linear order are comparable, Theorem 4.3.8 can be strengthened. THEOREM 4.3.21 [Trichotomy Law] If 4 is a linear order on ๐ด, for all ๐, ๐ ∈ ๐ด, exactly one of the following are true: ๐ โบ ๐, ๐ โบ ๐, or ๐ = ๐. Although it is not the case that every pair of elements of the poset defined in Example 4.3.7 is comparable, it is the case that for any given pair of elements, there exists another element that is related to the given elements. For example, for the pair 100 and 110, 1 4 100 and 1 4 110. Also, for the pair 101 and 10, 10 4 101 and 10 4 10. DEFINITION 4.3.22 Let (๐ด, 4) be a poset. The elements ๐, ๐ ∈ ๐ด are compatible if there exists ๐ ∈ ๐ด such that ๐ 4 ๐ and ๐ 4 ๐. If ๐ and ๐ are not compatible, they are incompatible and we write ๐ โ ๐. Observe that if ๐ and ๐ are comparable, they are also compatible. On the other hand, it takes some work to define a relation in which every pair of elements is incompatible. DEFINITION 4.3.23 A subset ๐ท of a poset (๐ด, 4) is an antichain with respect to 4 when for all ๐, ๐ ∈ ๐ท, if ๐ ≠ ๐, then ๐ โ ๐. The sets {{๐} : ๐ ∈ ๐} and {{1}, {2, 3}, {4, 5, 6}, {7, 8, 9, 10}, … } are antichains of P(๐) with respect to ⊆. Well-Ordered Sets The notion of a linear order incorporates many of the properties of ≤ on ๐ since ≤ is reflexive, antisymmetric, and transitive, and ๐ is a chain with respect to ≤. However, there is one important property of (๐, ≤) that is not included among those of a linear order. Because of the nature of the natural numbers, every subset of ๐ that is nonempty has a least element. For example, 5 is the least element of {5, 7, 32, 99} and 2 is the least element of {2, 4, 6, 8, … }. We want to be able to identify those partial orders that also have this property. DEFINITION 4.3.24 The linearly ordered set (๐ด, 4) is a well-ordered set and 4 is a well-order if every nonempty subset of ๐ด has a least element with respect to 4. 184 Chapter 4 RELATIONS AND FUNCTIONS According to Definition 4.3.24, (๐, ≤) is a well-ordered set, but this fact about the natural numbers cannot be proved without making an assumption. Therefore, so that we have at least one well-ordered set with which to work, we assume the following. AXIOM 4.3.25 (๐, ≤) is a well-ordered set. Coupling Axiom 4.3.25 with the next theorem will yield infinitely many well-ordered sets. THEOREM 4.3.26 If (๐ด, 4) is well-ordered and ๐ต is a nonempty subset of ๐ด, then (๐ต, 4) is wellordered. PROOF Let ๐ต ⊆ ๐ด and ๐ต ≠ ∅. To prove that ๐ต is well-ordered, let ๐ถ ⊆ ๐ต and ๐ถ ≠ ∅. Then, ๐ถ ⊆ ๐ด. Since 4 well-orders ๐ด, we know that ๐ถ has a least element with respect to 4. Because ๐ ∩ [5, ∞) ⊆ ๐+ ⊆ ๐, by Axiom 4.3.25 and Theorem 4.3.26, both (๐+ , ≤) and (๐ ∩ [5, ∞) , ≤) are well-ordered sets. EXAMPLE 4.3.27 Let ๐ด = {๐๐ : ๐ ∈ ๐}. To prove that ๐ด is well-ordered by ≤, let ๐ต ⊆ ๐ด such that ๐ต ≠ ∅. This means that there exists a nonempty subset ๐ผ of ๐ such that ๐ต = {๐๐ : ๐ ∈ ๐ผ}. Since ๐ is well-ordered, ๐ผ has least element ๐. We claim that ๐๐ is the least element of ๐ต. To see this, take ๐ ∈ ๐ต. Then, ๐ = ๐๐ for some ๐ ∈ ๐ผ. Since ๐ is the least element of ๐ผ, ๐ ≤ ๐. Therefore, ๐๐ ≤ ๐๐ = ๐. To prove that a set ๐ด is not well-ordered by 4, we must find a nonempty subset ๐ต of ๐ด that does not have a least element. This means that for every ๐ ∈ ๐ต, there exists ๐ ∈ ๐ต such that ๐ โบ ๐. That is, there are elements ๐๐ ∈ ๐ต (๐ ∈ ๐) such that · · · โบ ๐2 โบ ๐1 โบ ๐0 . This informs the next definition. DEFINITION 4.3.28 Let 4 be a partial order on ๐ด and ๐ต = {๐๐ : ๐ ∈ ๐} be a subset of ๐ด. โ ๐ต is increasing means ๐ < ๐ implies ๐๐ โบ ๐๐ for all ๐, ๐ ∈ ๐. โ ๐ต is decreasing means ๐ < ๐ implies ๐๐ โบ ๐๐ for all ๐, ๐ ∈ ๐. If a set is well-ordered, it has a least element, but the converse is not true. To see this, consider ๐ด = {0, 1โ2, 1โ3, 1โ4, … }. It has a least element, namely, 0, but ๐ด also Section 4.3 PARTIAL ORDERS contains the decreasing set 185 { } 1 1 1 1 , , , ,… . (4.4) 2 3 4 5 Therefore, ๐ด has a subset without a least element, so ๐ด is not well-ordered. We summarize this observation with the following theorem, and leave its proof to Exercise 23. THEOREM 4.3.29 (๐ด, 4) is not a well-ordered set if and only if (๐ด, 4) does not have a decreasing subset. Theorem 4.3.29 implies that any finite linear order is well-ordered. EXAMPLE 4.3.30 The decreasing sequence (4.4) with Theorem 4.3.29 shows that the sets (0, 1), [0, 1], ๐, and {1โ๐ : ๐ ∈ ๐+ } are not well-ordered by ≤. We close this section by proving two important results from number theory. Their proofs use Axiom 4.3.25. The strategy is to define a nonempty subset of a well-ordered set. Its least element ๐ will be a number that we want. This least element also needs to have a particular property, say ๐(๐). To show that it has the property, assume ¬๐(๐) and use this to find another element of the set that is less than ๐. This contradicts the minimality of ๐ allowing us to conclude ๐(๐). THEOREM 4.3.31 [Division Algorithm] If ๐, ๐ ∈ ๐ with ๐ ≠ 0, there exist unique ๐, ๐ ∈ ๐ such that ๐ < ๐ and ๐ = ๐๐ + ๐. PROOF Uniqueness is proved in Example 2.4.14. To prove existence, take ๐, ๐ ∈ ๐ and define ๐ = {๐ ∈ ๐ : ∃๐(๐ ∈ ๐ ∧ ๐ = ๐๐ + ๐)}. Notice that ๐ ∈ ๐, so ๐ ≠ ∅. Therefore, ๐ has a least element by Axiom 4.3.25. Call it ๐ and write ๐ = ๐๐ + ๐ for some natural number ๐. Assume ๐ ≥ ๐, which implies that ๐ − ๐ ≥ 0. Also, ๐ = ๐(๐ − 1) + ๐ − ๐ because ๐ − ๐ = ๐ − ๐๐ − ๐ = ๐ − ๐(๐ + 1), so ๐ − ๐ ∈ ๐. Since ๐ > ๐ − ๐ because ๐ is positive, ๐ cannot be the minimum of ๐, a contradiction. The value ๐ of the division algorithm (Theorem 4.3.31) is called the quotient and ๐ is the remainder. For example, if we divide 5 into 17, the division algorithm returns a quotient of 3 and a remainder of 2, so we can write that 17 = 5(3) + 2. Notice that 2 < 3. 186 Chapter 4 RELATIONS AND FUNCTIONS Call ๐ ∈ ๐ a linear combination of the integers ๐ and ๐ if ๐ = ๐ข๐ + ๐ฃ๐ for some ๐ข, ๐ฃ ∈ ๐. Since 37 = 5(2) + 3(9), we see that 37 is a linear combination of 2 and 9. Furthermore, if ๐ โฃ ๐ and ๐ โฃ ๐, then ๐ โฃ ๐ข๐ + ๐ฃ๐. To see this, write ๐ = ๐๐ and ๐ = ๐๐ for some ๐, ๐ ∈ ๐. Then, ๐ข๐ + ๐ฃ๐ = ๐ข๐๐ + ๐ฃ๐๐ = ๐(๐ข๐ + ๐ฃ๐), and this means ๐ โฃ ๐ข๐ + ๐ฃ๐. THEOREM 4.3.32 Let ๐, ๐ ∈ ๐ with not both equal to 0. If ๐ = gcd(๐, ๐), there exists ๐, ๐ ∈ ๐ such that ๐ = ๐๐ + ๐๐. PROOF Define ๐ = {๐ง ∈ ๐+ : ∃๐ฅ∃๐ฆ(๐ฅ, ๐ฆ ∈ ๐ ∧ ๐ง = ๐ฅ๐ + ๐ฆ๐)}. Notice that ๐ is not empty because ๐2 + ๐2 ∈ ๐ . By Axiom 4.3.25, ๐ has a least element ๐, so write ๐ = ๐๐ + ๐๐ for some ๐, ๐ ∈ ๐. โ Since ๐ > 0, the division algorithm (4.3.31) yields ๐ = ๐๐ + ๐ for some natural numbers ๐ and ๐ with ๐ < ๐. Then, ๐ = ๐ − ๐๐ = ๐ − (๐๐ − ๐๐)๐ = (1 − ๐๐)๐ + (๐๐)๐. If ๐ > 0, then ๐ ∈ ๐ , which is impossible because ๐ is the least element of ๐ . Therefore, ๐ = 0 and ๐ โฃ ๐. Similarly, ๐ โฃ ๐. โ To show that ๐ is the greatest of the common divisors, suppose ๐ โฃ ๐ and ๐ โฃ ๐ with ๐ ∈ ๐+ . By definition, ๐ = ๐ ๐ and ๐ = ๐ ๐ for some ๐, ๐ ∈ ๐. Hence, ๐ = ๐(๐ ๐) + ๐(๐ ๐) = ๐ (๐๐ − ๐๐). Thus, ๐ ≤ ๐ because ๐ is nonzero and ๐ divides ๐ (Exercise 25). Exercises 1. Is (∅, ⊆) a partial order, linear order, or well order? Explain. 2. For each relation on {1, 2}, determine if it is reflexive, irreflexive, symmetric, asymmetric, antisymmetric, or transitive. (a) {(1, 2)} (b) {(1, 2), (2, 1)} (c) {(1, 1), (1, 2), (2, 1)} (d) {(1, 1), (1, 2), (2, 2)} (e) {(1, 1), (1, 2), (2, 1), (2, 2)} (f) ∅ 3. Give an example of a relation that is neither symmetric nor asymmetric. Section 4.3 PARTIAL ORDERS 187 4. Let ๐ be a relation on ๐ด. Prove that ๐ is reflexive if and only if (๐ด × ๐ด) โงต ๐ is irreflexive. 5. Show that a relation ๐ on ๐ด is asymmetric if and only if ๐ ∩ ๐ −1 = ∅. 6. Let (๐ด, 4) and (๐ต, ≤) be posets. Define ∼ on ๐ด × ๐ต by (๐, ๐) ∼ (๐′ , ๐′ ) if and only if ๐ 4 ๐′ and ๐ ≤ ๐′ . Show that ∼ is a partial order on ๐ด × ๐ต. 7. For any alphabet ๐ธ, prove the following. (a) For all ๐, ๐, ๐ ∈ ๐ธ∗ , ๐ โข (๐ โข ๐) = (๐ โข ๐)โข ๐. (b) There exists ๐, ๐ ∈ ๐ธ∗ such that ๐ โข ๐ ≠ ๐ โข ๐. (c) ๐ธ∗ has an identity with respect to โข, but for all ๐ ∈ ๐ธ∗ , there is no inverse for ๐ if ๐ ≠ . 8. Prove that ๐ธ∗ from Example 4.3.7 is partially ordered by 4. 9. Prove that (P(๐ด), ⊆) is not a linear order if ๐ด has at least three elements. 10. Show that (๐ธ∗ , 4) is not a linear order if ๐ธ has at least two elements. 11. Prove that the following families of sets are chains with respect to ⊆. (a) {[0, ๐] : ๐ ∈ ๐+ } (b) {(2๐ )๐ : ๐ ∈ ๐} where (2๐ )๐ = {2๐ ⋅ ๐ : ๐ ∈ ๐} โ (c) {๐ต๐ : ๐ ∈ ๐} where ๐ต๐ = {๐ด๐ : ๐ ∈ ๐ ∧ ๐ ≤ ๐} and ๐ด๐ is a set for all ๐∈๐ 12. Can a chain be disjoint or pairwise disjoint? Explain. 13. Suppose that {๐ด๐ : ๐ ∈ ๐} is a chain of sets such that for all ๐ ≤ ๐, ๐ด๐ ⊆ ๐ด๐ . Prove for all ๐ ∈โ ๐. (a) {๐ด โถ ๐ ∈ ๐ ∧ ๐ ≤ ๐} = ๐ด๐ โ ๐ (b) {๐ด๐ โถ ๐ ∈ ๐ ∧ ๐ ≤ ๐} = ๐ด0 โ 14. Let {๐ด๐ : ๐ ∈ ๐} be a family of sets. For every ๐ ∈ ๐, define ๐ต๐ = ๐ ๐=0 ๐ด๐ . Show that {๐ต๐ : ๐ ∈ ๐} is a chain. โ 15. Let ๐๐ be a chain of the poset (๐ด, 4) for all ๐ ∈ ๐ผ. Prove that ๐∈๐ผ ๐๐ is a chain. โ Is ๐∈๐ผ ๐๐ necessarily a chain? Explain. 16. Let (๐ด, 4) and (๐ต, ≤) be linear orders. Define ∼ on ๐ด × ๐ต by (๐, ๐) ∼ (๐′ , ๐′ ) if and only if ๐ โบ ๐′ or ๐ = ๐′ and ๐ ≤ ๐′ . This relation is called a lexicographical order since it copies the order of a dictionary. (a) Prove that (๐ด × ๐ต, ∼) is a linear order. (b) Suppose that (๐, ๐) is a maximal element of ๐ด × ๐ต with respect to ∼. Show that ๐ is a maximal element of ๐ด with respect to 4. 188 Chapter 4 RELATIONS AND FUNCTIONS 17. Let ๐ด be a set. Prove that ๐ต ⊆ P(๐ด) โงต {∅} is an antichain with respect to ⊆ if and only if ๐ต is pairwise disjoint. 18. Prove the following true or false. (a) Every well-ordered set contains a least element. (b) Every well-ordered set contains a greatest element. (c) Every subset of a well-ordered set contains a least element. (d) Every subset of a well-ordered set contains a greatest element. (e) Every well-ordered set has a decreasing subset. (f) Every well-ordered set has a increasing subset. 19. For each of the given sets, indicate whether or not it is well-ordered by ≤. If it is, prove it. If it is not, find a decreasing sequence of elements of the set. √ (a) { 2, 5, 6, 10.56, 17, −100} (b) {2๐ : ๐ ∈ ๐} (c) {๐โ๐ : ๐ ∈ ๐+ } (d) {๐โ๐ : ๐ ∈ ๐− } (e) {−4, −3, −2, −1, … } (f) ๐ ∩ (๐, ∞) (g) ๐ ∩ (−7, ∞) 20. Prove that (๐ ∩ (๐ฅ, ∞), ≤) is a well-ordered set for all ๐ฅ ∈ ๐. 21. Show that a well-ordered set has a unique least element. 22. Let (๐ด, 4) be a well-ordered set. If ๐ต ⊆ ๐ด and there is an upper bound for ๐ต in ๐ด, then ๐ต has a greatest element. 23. Prove Theorem 4.3.29. 24. Where does the proof of Theorem 4.3.31 go wrong if ๐ = 0? 25. Prove that if ๐ โฃ ๐, then ๐ ≤ ๐ for all ๐, ๐ ∈ ๐. 26. Let ๐, ๐, ๐ ∈ ๐. Show that if gcd(๐, ๐) = gcd(๐, ๐) = 1, then gcd(๐, ๐๐) = 1. 27. Let ๐, ๐ ∈ ๐ and assume that gcd(๐, ๐) = 1. Prove. (a) If ๐ โฃ ๐ and ๐ โฃ ๐, then ๐๐ โฃ ๐. (b) gcd(๐ + ๐, ๐) = gcd(๐ + ๐, ๐) = 1. (c) gcd(๐ + ๐, ๐ − ๐) = 1 or gcd(๐ + ๐, ๐ − ๐) = 2. (d) If ๐ โฃ ๐, then gcd(๐, ๐) = 1. (e) If ๐ โฃ ๐ + ๐, then gcd(๐, ๐) = gcd(๐, ๐) = 1. (f) If ๐ โฃ ๐๐ and ๐ โฃ ๐๐, then ๐ โฃ ๐. 28. Let ๐, ๐ ∈ ๐, where at least one is nonzero. Prove that ๐ = ๐ if ๐ = {๐ฅ : ∃๐[๐ ∈ ๐ ∧ ๐ฅ = ๐ gcd(๐, ๐)]} and ๐ = {๐ฅ : ๐ฅ > 0 ∧ ∃๐ข∃๐ฃ(๐ข, ๐ฃ ∈ ๐ ∧ ๐ฅ = ๐ข๐ + ๐ฃ๐)}. Section 4.4 FUNCTIONS 189 29. Prove that if ๐ โฃ ๐ and ๐ โฃ ๐, then ๐ โฃ gcd(๐, ๐) for all ๐, ๐, ๐ ∈ ๐. 4.4 FUNCTIONS From algebra and calculus, we know what a function is. It is a rule that assigns to each possible input value a unique output value. The common picture is that of a machine that when a certain button is pushed, the same result always happens. In basic algebra a relation can be graphed in the Cartesian plane. Such a relation will be a function if and only if every vertical line intersects its graphs at most once. This is known as the vertical line test (Figure 4.6). This criteria is generalized in the next definition. DEFINITION 4.4.1 Let ๐ด and ๐ต be sets. A relation ๐ ⊆ ๐ด × ๐ต is a function means that for all (๐ฅ, ๐ฆ), (๐ฅ′ , ๐ฆ′ ) ∈ ๐ด × ๐ต, if ๐ฅ = ๐ฅ′ , then ๐ฆ = ๐ฆ′ . The function ๐ is an n-ary function if there exists sets ๐ด0 , ๐ด1 , … , ๐ด๐−1 such that ๐ด = ๐ด0 × ๐ด1 × · · · × ๐ด๐−1 . If ๐ = 1, then ๐ is a unary function, and if ๐ = 2, then ๐ is a binary function, EXAMPLE 4.4.2 The set {(1, 2), (4, 5), (6, 5)} is a function, but {(1, 2), (1, 5), (6, 5)} is not since it contains (1, 2) and (1, 5). Also, ∅ is a function (Exercise 16). Intersects at most once Figure 4.6 Passing the vertical line test. 190 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.4.3 Define ๐ = {(๐ข, 1 + 2 cos ๐๐ข) : ๐ข ∈ ๐}. Assume that both (๐ฅ, 1 + 2 cos ๐๐ฅ) and (๐ฅ′ , 1 + 2 cos ๐๐ฅ′ ) are elements of ๐ . Then, because cosine is a function, ๐ฅ = ๐ฅ′ ⇒ ๐๐ฅ = ๐๐ฅ′ ⇒ cos ๐๐ฅ = cos ๐๐ฅ′ ⇒ 2 cos ๐๐ฅ = 2 cos ๐๐ฅ′ ⇒ 1 + 2 cos ๐๐ฅ = 1 + 2 cos ๐๐ฅ′ . Therefore, ๐ is a function. EXAMPLE 4.4.4 The standard arithmetic operations are functions. For example, taking a square root is a unary function, while addition, subtraction, multiplication, and division are binary functions. To illustrate this, addition on ๐ is the set ๐ด = {((๐, ๐), ๐ + ๐) : ๐, ๐ ∈ ๐}. (4.5) Let ๐ ⊆ ๐ด × ๐ต be a function. This implies that for all ๐ ∈ ๐ด, either [๐]๐ = ∅ or [๐]๐ is a singleton (Definition 4.2.7). For example, if ๐ = {(1, 2), (4, 5), (6, 5)}, then [1]๐ = {2} and [3]๐ = ∅. Because [๐]๐ can contain at most one element, we typically simplify the notation. DEFINITION 4.4.5 Let ๐ be a function. For all ๐ ∈ ๐ด, define ๐ (๐) = ๐ if and only if [๐]๐ = {๐} and write that ๐ (๐) is undefined if [๐]๐ = ∅. For example, using (4.5), ๐ด((5, 7)) = 12, but ๐ด(5, ๐) is undefined. With ordered ๐-tuples, the outer parentheses are usually eliminated so that we write ๐ด(5, 7) = 12. Moreover, if ๐ท ⊆ ๐ด is the domain of the function ๐ , write the function notation ๐ :๐ท→๐ต and call ๐ต a codomain of ๐ (Figure 4.7). Because functions are often represented by arrows that “send” one element to another, a function can be called a map. If ๐ (๐ฅ) = ๐ฆ, we can say that “๐ maps ๐ฅ to ๐ฆ.” We can also say that ๐ฆ is the image of ๐ฅ under ๐ and ๐ฅ is a pre-image of ๐ฆ. For example, if ๐ = {(3, 1), (4, 2), (5, 2)}, Section 4.4 FUNCTIONS 191 Domain f:D→B Codomain Function name Figure 4.7 Function notation. ๐ maps 3 to 1, 2 is the image of 4, and 5 is a pre-image of 2 (Figure 4.8). If ๐ is also a function with domain ๐ท and codomain ๐ต, we can use the abbreviation ๐, ๐ : ๐ท → ๐ต to represent both functions. An alternate choice of notation involves referring to the functions as ๐ท → ๐ต. EXAMPLE 4.4.6 If ๐ (๐ฅ) = cos √๐ฅ, then ๐ maps ๐ to −1, 0 is the image of ๐โ2, and ๐โ4 is a pre-image of 2โ2. EXAMPLE 4.4.7 Let ๐ด be any set. The identity relation on ๐ด (Definition 4.1.2) is a function, so call ๐ผ๐ด the identity map and write ๐ผ๐ด (๐ฅ) = ๐ฅ. Sometimes a function ๐ is defined using a rule that pairs an element of the function’s domain with an element of its codomain. When this is done successfully, ๐ is said to be well-defined. Observe that proving that ๐ is well-defined is the same as proving it to be a function. f 3 4 5 Figure 4.8 1 2 The map ๐ = {(3, 1), (4, 2), (5, 2)}. 192 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.4.8 Let ๐ : ๐ → ๐ be defined by ๐ (๐ฅ) = 1 + 2 cos ๐๐ฅ. This is the function ๐ = {(๐ฅ, 1 + 2 cos ๐๐ฅ) : ๐ฅ ∈ ๐}. The work of Example 4.4.3 shows that ๐ is well-defined. Before we examine another example, let us set a convention on naming functions. It is partly for aesthetics, but it does help in organizing functions based on the type of elements in their domains and ranges. โ Use English letters (usually, ๐ , ๐, and โ) for naming functions that involve numbers. Typically, these will be lowercase, but there are occasions when we will choose them to be uppercase. โ Use Greek letters (often ๐ or ๐) for general functions or those with domains not consisting of numbers. They are also usually lowercase, but uppercase Greek letters like Φ and Ψ are sometimes appropriate. (See the appendix for the Greek alphabet.) EXAMPLE 4.4.9 Let ๐, ๐ ∈ ๐+ such that ๐ โฃ ๐. Define ๐([๐]๐ ) = [๐]๐ for all ๐ ∈ ๐. This means that ๐ = {([๐]๐ , [๐]๐ ) : ๐ ∈ ๐}. It is not clear that ๐ is well-defined since an equivalence class can have many representatives, so assume that [๐]๐ = [๐]๐ for ๐, ๐ ∈ ๐. Therefore, ๐ โฃ ๐ − ๐. Then, by hypothesis, ๐ โฃ ๐ − ๐, and this yields ๐([๐]๐ ) = [๐]๐ = [๐]๐ = ๐([๐]๐ ). EXAMPLE 4.4.10 Let ๐ฅ ∈ ๐. Define the greatest integer function as โฆ๐ฅโง = the greatest integer ≤ ๐ฅ. For example, โฆ5โง = 5, โฆ1.4โง = 1, and โฆ−3.4โง = −4. The greatest integer function is a function ๐ → ๐. It is well-defined because the relation ≤ wellorders ๐ (Exercise 4.3.22). If a relation on ๐ is not a function, it will fail the vertical line test (Figure 4.9). To generalize, let ๐ ⊆ ๐ด × ๐ต. To show that ๐ is not a function, we must show that there exists (๐ฅ, ๐ฆ1 ), (๐ฅ, ๐ฆ2 ) ∈ ๐ such that ๐ฆ1 ≠ ๐ฆ2 . EXAMPLE 4.4.11 The relation ๐ = ๐ผ๐ ∪ {(๐ฅ, −๐ฅ) : ๐ฅ ∈ ๐} is not a function since (4, 4) ∈ ๐ and (4, −4) ∈ ๐ . Section 4.4 FUNCTIONS y1 193 (x, y1) x y2 Figure 4.9 (x, y2) The relation is not a function. EXAMPLE 4.4.12 Define ๐ ⊆ ๐2 × ๐3 by ๐([๐]2 ) = [๐]3 for all ๐ ∈ ๐. Since 3 does not divide 2, ๐ is not well-defined. This is proved by noting that [0]2 = [2]2 but [0]3 ≠ [2]3 . There will be times when we want to examine sets of functions. If each function is to have the same domain and codomain, we use the following notation. DEFINITION 4.4.13 If ๐ด and ๐ต are sets, ๐ด ๐ต = {๐ : ๐ is a function ๐ด → ๐ต}. For instance, ๐ด ๐ต is a set of real-valued functions if ๐ด, ๐ต ⊆ ๐. EXAMPLE 4.4.14 If ๐๐ is a sequence of real numbers with ๐ = 0, 1, 2, … , the sequence ๐๐ is an element of ๐ ๐. Illustrating this, the sequence ๐๐ = (−1โ2)๐ is a function and can be graphed as in Figure 4.10. EXAMPLE 4.4.15 Let ๐ด ⊆ ๐ and fix ๐ ∈ ๐ด. The evaluation map, ๐๐ : ๐ด๐ด → ๐ด, is defined as ๐๐ (๐ ) = ๐ (๐). 194 Chapter 4 RELATIONS AND FUNCTIONS Figure 4.10 ๐๐ = (−1โ2)๐ is a function. For example, if ๐(๐ฅ) = ๐ฅ2 , then ๐3 (๐) = 9. Observe that the evaluation map is an ๐ด element of ( ๐ด)๐ด. Equality Since functions are sets, we already know that two functions are equal when they contain the same ordered pairs. However, there is a common test to determine function equality other than a direct appeal to Definition 3.3.7. THEOREM 4.4.16 Functions ๐, ๐ : ๐ด → ๐ต are equal if and only if ๐(๐ฅ) = ๐(๐ฅ) for all ๐ฅ ∈ ๐ด. PROOF Sufficiency is clear, so to prove necessity suppose that ๐(๐ฅ) = ๐(๐ฅ) for every ๐ฅ ∈ ๐ด. Take (๐, ๐) ∈ ๐. This means that ๐(๐) = ๐. By hypothesis, ๐(๐) = ๐, which implies that (๐, ๐) ∈ ๐. Hence, ๐ ⊆ ๐. The proof of ๐ ⊆ ๐ is similar, so ๐ = ๐. We use Theorem 4.4.16 in the next examples. EXAMPLE 4.4.17 Let ๐ and ๐ be functions ๐ → ๐ defined by ๐ (๐ฅ) = (๐ฅ − 3)2 + 2 and ๐(๐ฅ) = ๐ฅ2 − 6๐ฅ + 11. We show that ๐ = ๐ by taking ๐ฅ ∈ ๐ and calculating ๐ (๐ฅ) = (๐ฅ − 3)2 + 2 = (๐ฅ2 − 6๐ฅ + 9) + 2 = ๐ฅ2 − 6๐ฅ + 11 = ๐(๐ฅ). Section 4.4 FUNCTIONS 195 EXAMPLE 4.4.18 Let ๐, ๐ : ๐ → ๐6 be functions such that ๐(๐) = [๐]6 and ๐(๐) = [๐ + 12]6 . Take ๐ ∈ ๐. We show that [๐ + 12]6 = [๐]6 by proceeding as follows: ๐ฅ ∈ [๐ + 12]6 ⇔ ∃๐ [๐ ∈ ๐ ∧ ๐ฅ = ๐ + 12 + 6๐] ⇔ ∃๐ [๐ ∈ ๐ ∧ ๐ฅ = ๐ + 6(2 + ๐)] ⇔ ๐ฅ ∈ [๐]6 . Therefore, ๐ = ๐. EXAMPLE 4.4.19 Define ๐ :๐→ (๐ ) ๐ ๐ by ๐(๐ฅ) = ๐๐ฅ for all ๐ฅ ∈ ๐ (Example 4.4.15). We show that ๐ is well-defined. Take ๐, ๐ ∈ ๐ and assume that ๐ = ๐. To show that ๐(๐) = ๐(๐), we prove ๐๐ = ๐๐ . Therefore, let ๐ ∈ ๐ ๐. Since ๐ is a function and ๐, ๐ ∈ dom(๐ ), we have that ๐ (๐) = ๐ (๐). Thus, ๐๐ (๐ ) = ๐ (๐) = ๐ (๐) = ๐๐ (๐ ). By Theorem 4.4.16, we see that two functions ๐ and ๐ are not equal when either dom(๐ ) ≠ dom(๐) or ๐ (๐ฅ) ≠ ๐(๐ฅ) for some ๐ฅ in their common domain. For example, ๐ (๐ฅ) = ๐ฅ2 and ๐(๐ฅ) = 2๐ฅ are not equal because ๐ (3) = 9 and ๐(3) = 6. Although these two functions differ for every ๐ฅ ≠ 0 and ๐ฅ ≠ 2, it only takes one inequality to prove that the functions are not equal. For example, if we define { ๐ฅ2 if ๐ฅ ≠ 0, โ(๐ฅ) = 7 if ๐ฅ = 0, then ๐ ≠ โ since ๐ (0) = 0 and โ(0) = 7. Composition We now consider the composition of relations when those relations are functions. THEOREM 4.4.20 If ๐ : ๐ด → ๐ต and ๐ : ๐ถ → ๐ท are functions such that ran(๐) ⊆ ๐ถ, then ๐ โฆ ๐ is a function ๐ด → ๐ท and (๐ โฆ ๐)(๐ฅ) = ๐(๐(๐ฅ)). 196 Chapter 4 RELATIONS AND FUNCTIONS PROOF Because the range of ๐ is a subset of ๐ถ, we know that ๐ ⊆ ๐ด× ๐ถ and ๐ ⊆ ๐ถ × ๐ท. Let (๐, ๐1 ), (๐, ๐2 ) ∈ ๐ โฆ ๐. This means by Definition 4.1.9 that there exists ๐1 , ๐2 ∈ ๐ถ such that (๐, ๐1 ), (๐, ๐2 ) ∈ ๐ and (๐1 , ๐1 ), (๐2 , ๐2 ) ∈ ๐. Since ๐ is a function, ๐1 = ๐2 , and then since ๐ is a function, ๐1 = ๐2 . Therefore, ๐ โฆ ๐ is a function, which is clearly ๐ด → ๐ท. Furthermore, ๐ โฆ ๐ = {(๐ฅ, ๐ง) : ∃๐ฆ[๐ฆ ∈ ๐ถ ∧ (๐ฅ, ๐ฆ) ∈ ๐ ∧ (๐ฆ, ๐ง) ∈ ๐]} = {(๐ฅ, ๐ง) : ∃๐ฆ[๐ฆ ∈ ๐ถ ∧ ๐(๐ฅ) = ๐ฆ ∧ ๐(๐ฆ) = ๐ง]} = {(๐ฅ, ๐ง) : ๐ฅ ∈ ๐ด ∧ ๐(๐(๐ฅ)) = ๐ง} = {(๐ฅ, ๐(๐(๐ฅ))) : ๐ฅ ∈ ๐ด}. Hence, (๐ โฆ ๐)(๐ฅ) = ๐(๐(๐ฅ)). The ran(๐) ⊆ ๐ถ condition is important to√Theorem 4.4.20. For example, take the realvalued functions ๐ (๐ฅ) = ๐ฅ and ๐(๐ฅ) = ๐ฅ. Since ๐ (−1) = −1 but ๐(−1) ∉ ๐, we conclude that (๐ โฆ ๐ )(−1) is undefined. EXAMPLE 4.4.21 Define the two functions ๐ : ๐ → ๐ and ๐ : ๐ โงต {0} → ๐ by ๐ (๐ฅ) = โฆ๐ฅโง and ๐(๐ฅ) = 1โ๐ฅ. Since ran(๐ ) = ๐ * dom(๐), there are elements of ๐ for which ๐ โฆ ๐ is undefined. However, ran(๐) = ๐ โงต {0} ⊆ ๐ = dom(๐ ), so ๐ โฆ ๐ is defined and for all ๐ฅ ∈ ๐, (๐ โฆ ๐)(๐ฅ) = ๐ (๐(๐ฅ)) = ๐ (1โ๐ฅ) = โฆ1โ๐ฅโง. EXAMPLE 4.4.22 Let ๐ : ๐ ๐ → ๐ be defined by ๐(๐ ) = ๐3 (๐ ) and also let ๐ : ๐ → ๐7 be ๐(๐) = [๐]7 . Since ran(๐) ⊆ dom(๐), ๐ โฆ ๐ is defined. Thus, if ๐ : ๐ → ๐ is defined as ๐(๐) = 3๐, (๐ โฆ ๐)(๐) = ๐(๐(๐)) = ๐(๐(3)) = ๐(9) = [9]7 = [2]7 . We should note that function composition is not a binary operation unless both functions are ๐ด → ๐ด for some set ๐ด. In this case, function composition is a binary operation on ๐ด๐ด. Restrictions and Extensions There are times when a subset of a given function is required. For example, consider ๐ = {(๐ฅ, ๐ฅ2 ) : ๐ฅ ∈ ๐}. Section 4.4 FUNCTIONS 197 If only positive values of ๐ฅ are required, we can define ๐ = {(๐ฅ, ๐ฅ2 ) : ๐ฅ ∈ (0, ∞)} so that ๐ ⊆ ๐ . We have notation for this. DEFINITION 4.4.23 Let ๐ : ๐ด → ๐ต be a function and ๐ถ ⊆ ๐ด. โ The restriction of ๐ to ๐ถ is the function ๐ ๐ถ : ๐ถ → ๐ต so that (๐ ๐ถ)(๐ฅ) = ๐(๐ฅ) for all ๐ฅ ∈ ๐ถ. โ The function ๐ : ๐ท → ๐ธ is an extension of ๐ if ๐ด ⊆ ๐ท, ๐ต ⊆ ๐ธ, and ๐ ๐ด = ๐. EXAMPLE 4.4.24 Let ๐ = {(1, 2), (2, 3), (3, 4), (4, 1)} and ๐ = {(1, 2), (2, 3)}. We conclude that ๐ = ๐ {1, 2}, and ๐ is an extension of ๐. EXAMPLE 4.4.25 Let ๐ : ๐ → ๐ be a function and ๐ด, ๐ต ⊆ ๐ . We conclude that ๐ (๐ด ∪ ๐ต) = (๐ ๐ด) ∪ (๐ ๐ต) because (๐ฅ, ๐ฆ) ∈ ๐ (๐ด ∪ ๐ต) ⇔ ๐ฆ = ๐(๐ฅ) ∧ ๐ฅ ∈ ๐ด ∪ ๐ต ⇔ ๐ฆ = ๐(๐ฅ) ∧ (๐ฅ ∈ ๐ด ∨ ๐ฅ ∈ ๐ต) ⇔ ๐ฆ = ๐(๐ฅ) ∧ ๐ฅ ∈ ๐ด ∨ ๐ฆ = ๐(๐ฅ) ∧ ๐ฅ ∈ ๐ต ⇔ (๐ฅ, ๐ฆ) ∈ ๐ ๐ด ∨ (๐ฅ, ๐ฆ) ∈ ๐ ๐ต ⇔ (๐ฅ, ๐ฆ) ∈ (๐ ๐ด) ∪ (๐ ๐ต). Binary Operations Standard addition and multiplication of real numbers are functions ๐ × ๐ → ๐ (Example 4.4.4). This means two things. First, given any two real numbers, their sum or product will always be the same number. For instance, 3+5 is 8 and never another number. Second, given any two real numbers, their sum or product is also a real number. Notice that subtraction also has these two properties when it is considered an operation involving real numbers, but when we restrict substraction to ๐+ , it no longer has the second property because the difference of two positive integers might not be a positive integer. That is, subtraction is not a function ๐+ × ๐+ → ๐+ . 198 Chapter 4 RELATIONS AND FUNCTIONS DEFINITION 4.4.26 A binary operation ∗ on the nonempty set ๐ด is a function ๐ด × ๐ด → ๐ด. The symbol that represents the addition function is +. It can be viewed as a function ๐ → ๐. Therefore, using function notation, +(3, 5) = 8. However, we usually write this as 3 + 5 = 8. Similarly, since ∗ represents an operation like addition, instead of writing ∗ (๐, ๐), we usually write ๐ ∗ ๐, To prove that a relation ∗ is a binary operation on ๐ด, we must show that it satisfies Definition 4.4.26. To do this, take ๐, ๐′ , ๐, ๐′ ∈ ๐ด, and prove: โ ๐ = ๐′ and ๐ = ๐′ implies ๐ ∗ ๐ = ๐′ ∗ ๐′ , โ ๐ด is closed under ∗, that is ๐ ∗ ๐ ∈ ๐ด. EXAMPLE 4.4.27 Define ๐ฅ ∗ ๐ฆ = 2๐ฅ − ๐ฆ and take ๐, ๐′ , ๐, ๐′ ∈ ๐. โ Assume ๐ = ๐′ and ๐ = ๐′ . Then, ๐ ∗ ๐ = 2๐ − ๐ = 2๐′ − ๐′ = ๐′ ∗ ๐′ . The second equality holds because multiplication and subtraction are binary operations on ๐. โ Because the product and difference of two integers is an integer, we have that ๐ ∗ ๐ ∈ ๐, so ๐ is closed under ∗. Thus, ∗ is a binary operation on ๐. EXAMPLE 4.4.28 Let ๐ = {๐, ๐, ๐, ๐} and define ∗ by the following table: ∗ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ The table is read from left to right, so ๐ ∗ ๐ = ๐. The table makes ∗ into a binary operation since every pair of elements of ๐ is assigned a unique element of ๐. EXAMPLE 4.4.29 Fix a set ๐ด. For any ๐, ๐ ∈ P(๐ด), define ๐ ∗ ๐ = ๐ ∪ ๐ . โ Let ๐1 , ๐2 , ๐1 , ๐2 ∈ P(๐ด). If we assume that ๐1 = ๐2 and ๐1 = ๐2 , we have ๐1 ∪ ๐1 = ๐2 ∪ ๐2 . Hence, ∗ is well-defined. Section 4.4 FUNCTIONS 199 โ To show that P(๐ด) is closed under ∗, let ๐ต and ๐ถ be subsets of ๐ด. Then ๐ต ∗ ๐ถ = ๐ต ∪ ๐ถ ∈ P(๐ด) because ๐ต ∪ ๐ถ ⊆ ๐ด [Exercise 3.3.10(a)]. This shows that ∗ is a binary operation on P(๐ด). Notice that ∗ is a subset of [P(๐ด) × P(๐ด)] × P(๐ด). EXAMPLE 4.4.30 Let ๐ ∈ ๐+ and define [๐]๐ + [๐]๐ = [๐ + ๐]๐ . We show that this is a binary operation on ๐๐ . โ Let ๐1 , ๐2 , ๐1 , ๐2 ∈ ๐. Suppose [๐1 ]๐ = [๐2 ]๐ and [๐1 ]๐ = [๐2 ]๐ . This means that ๐1 = ๐2 + ๐๐ and ๐1 = ๐2 + ๐๐ for some ๐, ๐ ∈ ๐. Hence, ๐1 + ๐1 = ๐2 + ๐2 + ๐(๐ + ๐), and we have [๐1 + ๐1 ]๐ = [๐2 + ๐2 ]๐ . โ For closure, let [๐]๐ , [๐]๐ ∈ ๐๐ where ๐ and ๐ are integers. Then, we have that [๐]๐ + [๐]๐ = [๐ + ๐]๐ ∈ ๐๐ since ๐ + ๐ is an integer. Many binary operations share similar properties with the operations of + and × on ๐. The next definition gives four of these properties. DEFINITION 4.4.31 Let ∗ be a binary operation on ๐ด. โ ∗ is associative means that (๐ ∗ ๐) ∗ ๐ = ๐ ∗ (๐ ∗ ๐) for all ๐, ๐, ๐ ∈ ๐ด. โ ∗ is commutative means that ๐ ∗ ๐ = ๐ ∗ ๐ for all ๐, ๐ ∈ ๐ด. โ The element ๐ is an identity of ๐ด with respect to ∗ when ๐ ∈ ๐ด and ๐ ∗ ๐ = ๐ ∗ ๐ = ๐ for all ๐ ∈ ๐ด. โ Suppose that ๐ด has an identity ๐ with respect to ∗ and let ๐ ∈ ๐ด. The element ๐′ ∈ ๐ด is an inverse of ๐ with respect to ∗ if ๐ ∗ ๐′ = ๐′ ∗ ๐ = ๐. Notice that the identity, if it exists, must be unique. To prove this, suppose that both ๐ and ๐′ are identities. These must be equal because ๐ = ๐ ∗ ๐′ = ๐′ . So, if a set has an identity with respect to an operation, we can refer to it as the identity of the set. Similarly, we can write the inverse if it exists for associative binary operations (Exercise 20). EXAMPLE 4.4.32 We assume that + and × are both associative and commutative on ๐ and all subsets of ๐, that 0 is the identity with respect to + (the additive identity) and 1 is the identity with respect to × (the multiplicative identity), and that every complex number has an inverse with respect to + (an additive inverse) and every nonzero complex number has an inverse with respect to × (a multiplicative inverse). 200 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.4.33 The binary operation defined in Example 4.4.30 is both associative and commutative. To see that it is commutative, let ๐, ๐ ∈ ๐. Then, [๐]๐ + [๐]๐ = [๐ + ๐]๐ = [๐ + ๐]๐ = [๐]๐ + [๐]๐ , (4.6) where the second equality holds because + is commutative on ๐. Its identity is [0]๐ [let ๐ = 0 in (4.6)] and the additive inverse of [๐]๐ is [−๐]๐ . EXAMPLE 4.4.34 Since ๐ด ∪ ๐ต = ๐ต ∪ ๐ด and (๐ด ∪ ๐ต) ∪ ๐ถ = ๐ด ∪ (๐ต ∪ ๐ถ) for all sets ๐ด, ๐ต, and ๐ถ, the binary operation in Example 4.4.29 is both associative and commutative. Its identity is ∅, and only ∅ has an inverse. Exercises 1. Indicate whether each of the given relations are functions. If a relation is not a function, find an element of its domain that is paired with two elements of its range. (a) {(1, 2), (2, 3), (3, 4), (4, 5), (5, 1)} (b) {(1, 1), √ (1, 2), (1, 3), (1, 4), (1, 5)} (c) {(๐ฅ, |๐ฅ| : ๐ฅ ∈ ๐} √ (d) {(๐ฅ, ± |๐ฅ|) : ๐ฅ ∈ ๐} (e) {(๐ฅ, ๐ฅ2 ) : ๐ฅ ∈ ๐} (f) {([๐]5 , ๐) : ∃๐ ∈ ๐(๐ = ๐ + 5๐)} (g) ๐ : ๐ → ๐5 if ๐(๐) = [๐]5 2. Prove that the given relations are functions. (a) {(๐ฅ, 1โ๐ฅ) : ๐ฅ ∈ ๐ โงต {0}} (b) {(๐ฅ, ๐ฅ + 1) : ๐ฅ ∈ ๐} (c) {(๐ฅ, |๐ฅ|) √ : ๐ฅ ∈ ๐} (d) {(๐ฅ, ๐ฅ) : ๐ฅ ∈ [0, ∞)} 3. Let ๐ = {(๐ฅ, ๐ฆ) ∈ ๐2 : 2๐ฅ + ๐ฆ = 1}. Show that ๐ is a function with domain and codomain equal to the set of real numbers. 4. Let ๐ , ๐ : ๐ → ๐ be functions. Prove that ๐(๐ฅ, ๐ฆ) = (๐ (๐ฅ), ๐(๐ฆ)) is a function with domain and codomain equal to ๐ × ๐. 5. Let ๐ด be a set and define ๐(๐ด) = P(๐ด). Show that ๐ is a function. 6. Define { ๐ฅ2 ๐ (๐ฅ) = 5 if ๐ฅ ≥ 0, if ๐ฅ < 0. Show that ๐ is well-defined with domain equal to ๐. What is ran(๐ )? Section 4.4 FUNCTIONS 201 7. Let ๐ฅ be in the domain and ๐ฆ in the range of each relation. Explain why each of the given equations does not describe a function. (a) ๐ฆ = 5 ± ๐ฅ (b) ๐ฅ2 + ๐ฆ2 = 1 (c) ๐ฅ = 4๐ฆ2 − 1 (d) ๐ฆ2 − ๐ฅ2 = 9 8. Let ๐ : ๐ → ๐7 be defined by ๐(๐) = [๐]7 . Write the given images as rosters. (a) ๐(0) (b) ๐(7) (c) ๐(3) (d) ๐(−3) 9. Define ๐([๐]๐ ) = [๐]๐ for all ๐ ∈ ๐. Is ๐ being function sufficient for ๐ โฃ ๐? Explain. 10. Give an example of a function that is an element of the given sets. (a) ๐ ๐ (b) ๐ ๐ (c) ๐ ๐ (d) ๐ [0, ∞) (e) ๐ (๐5 ) 11. Evaluate the indicated expressions. (a) ๐4 (๐ ) if ๐ (๐ฅ) = 9๐ฅ + 2 (b) ๐๐ (๐) if ๐(๐) = sin ๐ 12. Since functions are sets, we can perform set operations on them. Let ๐ (๐ฅ) = ๐ฅ2 and ๐(๐ฅ) = −๐ฅ. Find the following. (a) ๐ ∪ ๐ (b) ๐ ∩ ๐ (c) ๐ โงต ๐ (d) ๐ โงต ๐ โ 13. Let ๐ be a chain of functions with respect to ⊆. Prove that ๐ is a function. 14. Let ๐ and ๐ be functions. Prove the following. (a) If ๐ and ๐ are functions, ๐ ∩ ๐ is a function. (b) ๐ ∪ ๐ is a function if and only if ๐ (๐ฅ) = ๐(๐ฅ) for all ๐ฅ ∈ dom(๐ ) ∩ dom(๐). 15. Let ๐ : ๐ด → ๐ต be a function. Define a relation ๐ on ๐ด by ๐ ๐ ๐ if and only if ๐ (๐) = ๐ (๐). (a) Show ๐ is an equivalence relation. (b) Find [3]๐ if ๐ : ๐ → ๐ is defined by ๐ (๐) = 2๐. (c) Find [2]๐ if ๐ : ๐ → ๐5 is given by ๐ (๐) = [๐]5 . 16. Show that ∅ is a function and find its domain and range. 202 Chapter 4 RELATIONS AND FUNCTIONS 17. Define ∗ by ๐ฅ ∗ ๐ฆ = ๐ฅ + ๐ฆ + 2 for all ๐ฅ, ๐ฆ ∈ ๐. (a) Show that ∗ is a binary operation on ๐. (b) Prove that −2 is the identity of ๐ with respect to ∗. (c) For every ๐ ∈ ๐, show that −๐ − 4 is the inverse of ๐ with respect to ∗. 18. Define the binary operation ∗ by ๐ฅ ∗ ๐ฆ = 2๐ฅ − ๐ฆ for all ๐ฅ, ๐ฆ ∈ ๐. (a) Is there an integer that serves as an identity with respect to ∗? (b) Does every integer have an inverse with respect to ∗? 19. Let ๐ , ๐ : ๐ → ๐ be functions. Prove that ๐ โฆ ๐ is well-defined. 20. For an associative binary operation, prove that the inverse of an element is unique if it exists. Show that this might not be the case if the binary operation is not associative. 21. Prove that the given pairs of functions are equal. (a) ๐ (๐ฅ) = (๐ฅ − 1)(๐ฅ − 2)(๐ฅ + 3) and ๐(๐ฅ) = ๐ฅ3 − 7๐ฅ + 6 where ๐ , ๐ : ๐ → ๐ (b) ๐(๐, ๐) = ๐ + ๐ and ๐(๐, ๐) = ๐ + ๐ where ๐, ๐ : ๐ × ๐ → ๐ (c) ๐(๐, ๐) = ([๐]5 , [๐ + 7]5 ) and ๐(๐, ๐) = ([๐ + 5]5 , [๐ − 3]5 ) where ๐, ๐ : ๐ × ๐ → ๐5 × ๐5 (d) ๐(๐ ) = ๐ ๐ and ๐(๐ ) = {(๐, ๐ (๐)) : ๐ ∈ ๐} where ๐, ๐ : ๐ ๐ → ๐ ๐ 22. Show that the given pairs of functions are not equal. (a) ๐ (๐ฅ) = ๐ฅ and ๐(๐ฅ) = 2๐ฅ where ๐ , ๐ : ๐ → ๐ (b) ๐ (๐ฅ) = ๐ฅ − 3 and ๐(๐ฅ) = ๐ฅ + 3 where ๐ , ๐ : ๐ → ๐ (c) ๐(๐) = [๐]5 and ๐(๐) = [๐]4 where ๐, ๐ : ๐ → ๐4 ∪ ๐5 (d) ๐(๐ด) = ๐ด โงต {0} and ๐(๐ด) = ๐ด ∩ {1, 2, 3} where ๐, ๐ : P(๐) → P(๐) 23. Let ๐ : ๐ → ๐ ๐ be defined by ๐(๐) = ๐๐ where ๐๐ is the function ๐๐ : ๐ → ๐ with ๐๐ (๐ฅ) = ๐๐ฅ. Prove that ๐ is well-defined. 24. For each pair of functions, find the indicated values when possible. (a) ๐ : ๐ → ๐ and ๐ (๐ฅ) = 2๐ฅ3 ๐ : ๐ → ๐ and ๐(๐ฅ) = ๐ฅ + 1 (๐ โฆ ๐)(2) (๐ โฆ ๐ )(0) √ (b) ๐ : [0, ∞) → ๐ and ๐ (๐ฅ) = ๐ฅ ๐ : ๐ → ๐ and ๐(๐ฅ) = |๐ฅ| − 1 (๐ โฆ ๐)(0) (๐ โฆ ๐ )(4) (c) ๐ : ๐ → ๐5 and ๐(๐) = [๐]5 ๐ : ๐ ๐ → ๐ and ๐(๐ ) = ๐ (0) (๐ โฆ ๐)(.5๐ฅ + 1) (๐ โฆ ๐)(2) Section 4.5 INJECTIONS AND SURJECTIONS 203 25. For each of the given functions, find the composition of the function with itself. For example, find ๐ โฆ ๐ for part (a). (a) ๐ : ๐ → ๐ with ๐ (๐ฅ) = ๐ฅ2 (b) ๐ : ๐ → ๐ with ๐(๐ฅ) = 3๐ฅ + 1 (c) ๐ : ๐ × ๐ → ๐ × ๐ with ๐(๐ฅ, ๐ฆ) = (2๐ฆ, 5๐ฅ − ๐ฆ) (d) ๐ : ๐๐ → ๐๐ with ๐([๐]๐ ) = [๐ + 2]๐ 26. Let ๐ : ๐ด → ๐ต be a function and ๐ = ๐ ๐ถ where ๐ถ ⊆ ๐ด. Prove that if ๐ : ๐ถ → ๐ด is defined by ๐(๐) = ๐ (known as the inclusion map), then ๐ = ๐ โฆ ๐. 27. Write the given restrictions as rosters. (a) {(1, 2), (2, 2), (3, 4), (4, 7)} {1, 3} (b) ๐ {0, 1, 2, 3} where ๐ (๐ฅ) = 7๐ฅ − 1 and dom(๐ ) = ๐ (c) (๐ + โ) {−3.3, 1.2, 7} where ๐(๐ฅ) = โฆ๐ฅโง, โ(๐ฅ) = ๐ฅ + 1, and both dom(๐) and dom(โ) equal ๐ 28. For functions ๐ and ๐ such that ๐ด, ๐ต ⊆ dom(๐ ), prove the following. (a) ๐ ๐ด = ๐ ∩ [๐ด × ran(๐ )] (b) ๐ (๐ด ∩ ๐ต) = (๐ ๐ด) ∩ (๐ ๐ต) (c) ๐ (๐ด โงต ๐ต) = (๐ ๐ด) โงต (๐ ๐ต) (d) (๐ โฆ ๐ ) ๐ด = ๐ โฆ (๐ ๐ด) 29. Let ๐ : ๐ → ๐ be a function. Prove that if ๐ด ⊆ ๐ , then ๐ ๐ด = ๐ โฆ ๐ผ๐ด . 30. Let ๐ : ๐ด ๐ถ → ๐ต๐ถ be defined by ๐(๐ ) = ๐ ๐ต. Prove that ๐ is well-defined. 31. A real-valued function ๐ is periodic if there exists ๐ > 0 so that ๐ (๐ฅ) = ๐ (๐ฅ + ๐) for all ๐ฅ ∈ dom(๐ ). Let ๐, โ : ๐ → ๐ be functions with period ๐. Prove that ๐ โฆ โ is periodic with period ๐. 32. Let (๐ด, 4) be a poset. A function ๐ : ๐ด → ๐ด is increasing means for all ๐ฅ, ๐ฆ ∈ ๐ด, if ๐ฅ โบ ๐ฆ, then ๐(๐ฅ) โบ ๐(๐ฆ). A decreasing function is defined similarly. Suppose that ๐ and ๐ are increasing. Prove that ๐ โฆ ๐ is increasing. 4.5 INJECTIONS AND SURJECTIONS When looking at relations, we studied the concept of an inverse relation. Given ๐ , obtain ๐ −1 by exchanging the ๐ฅ- and ๐ฆ-coordinates. The same can be done with functions, but the inverse might not be a function. For example, given ๐ = {(1, 2), (2, 3), (3, 2)}, its inverse is ๐ −1 = {(2, 1), (3, 2), (2, 3)}. However, if the original relation is a function, we often want the inverse also to be a function. This leads to the next definition. 204 Chapter 4 RELATIONS AND FUNCTIONS DEFINITION 4.5.1 ๐ : ๐ด → ๐ต is invertible means that ๐−1 is a function ๐ต → ๐ด. An immediate consequence of the definition is the next result. LEMMA 4.5.2 Let ๐ be invertible. Then, ๐(๐ฅ) = ๐ฆ if and only if ๐−1 (๐ฆ) = ๐ฅ for all ๐ฅ ∈ dom(๐). PROOF Suppose that ๐(๐ฅ) = ๐ฆ. This means that (๐ฅ, ๐ฆ) ∈ ๐, so (๐ฆ, ๐ฅ) ∈ ๐−1 . Since ๐ is invertible, ๐−1 is a function, so write ๐−1 (๐ฆ) = ๐ฅ. The converse is proved similarly. We use Lemma 4.5.2 in the proof of the next theorem, which gives conditions for when a function is invertible. THEOREM 4.5.3 ๐ : ๐ด → ๐ต is invertible if and only if ๐−1 โฆ ๐ = ๐ผ๐ด and ๐ โฆ ๐−1 = ๐ผ๐ต . PROOF Take a function ๐ : ๐ด → ๐ต. Then, ๐−1 ⊆ ๐ต × ๐ด. โ Assume that ๐−1 is a function ๐ต → ๐ด. Let ๐ฅ ∈ ๐ด and ๐ฆ ∈ ๐ต. By assumption, we have ๐ฅ0 ∈ ๐ด and ๐ฆ0 ∈ ๐ต such that ๐(๐ฅ) = ๐ฆ0 and ๐−1 (๐ฆ) = ๐ฅ0 . This implies that ๐−1 (๐ฆ0 ) = ๐ฅ and ๐(๐ฅ0 ) = ๐ฆ by Lemma 4.5.2. Therefore, (๐−1 โฆ ๐)(๐ฅ) = ๐−1 (๐(๐ฅ)) = ๐−1 (๐ฆ0 ) = ๐ฅ, and (๐ โฆ ๐−1 )(๐ฆ) = ๐(๐−1 (๐ฆ)) = ๐(๐ฅ0 ) = ๐ฆ. โ Now assume ๐−1 โฆ ๐ = ๐ผ๐ด and ๐ โฆ ๐−1 = ๐ผ๐ต . To show that ๐−1 is a function, take (๐ฆ, ๐ฅ), (๐ฆ, ๐ฅ′ ) ∈ ๐−1 . From this, we know that (๐ฅ, ๐ฆ) ∈ ๐. Therefore, (๐ฅ, ๐ฅ′ ) ∈ ๐−1 โฆ ๐ = ๐ผ๐ด , so ๐ฅ = ๐ฅ′ . In addition, we know that dom(๐−1 ) ⊆ ๐ต, so to prove equality, let ๐ฆ ∈ ๐ต. Then, (๐ฆ, ๐ฆ) ∈ ๐ผ๐ต = ๐ โฆ ๐−1 . Thus, there exists ๐ฅ ∈ ๐ด such that (๐ฆ, ๐ฅ) ∈ ๐−1 , so ๐ฆ ∈ dom(๐−1 ). EXAMPLE 4.5.4 โ Let ๐ : ๐ → ๐ be the function given by ๐ (๐ฅ) = ๐ฅ + 2. Its inverse is ๐(๐ฅ) = ๐ฅ − 2 by Theorem 4.5.3. This is because (๐ โฆ ๐ )(๐ฅ) = ๐(๐ฅ + 2) = (๐ฅ + 2) − 2 = ๐ฅ Section 4.5 INJECTIONS AND SURJECTIONS 205 and (๐ โฆ ๐)(๐ฅ) = ๐ (๐ฅ − 2) = (๐ฅ − 2) + 2 = ๐ฅ. โ If the function ๐ : [0, ∞) → [0, ∞) is defined by √ ๐(๐ฅ) = ๐ฅ2 , then ๐ −1 is a −1 function [0, ∞) → [0, ∞) and is defined by ๐ (๐ฅ) = ๐ฅ. โ Let โ : ๐ → (0, ∞) be defined as โ(๐ฅ) = ๐๐ฅ . By Theorem 4.5.3, we know that โ−1 (๐ฅ) = ln ๐ฅ because ๐ln ๐ฅ = ๐ฅ for all ๐ฅ ∈ [0, ∞) and ln ๐๐ฅ = ๐ฅ for all ๐ฅ ∈ ๐. Injections Theorem 4.5.3 can be improved by finding a condition for the invertibility of a function based only on the given function. Consider the following. In order for a relation to be a function, it cannot look like Figure 4.9. Since the inverse exchanges the roles of the two coordinates, in order for an inverse to be a function, the original function cannot look like the graph in Figure 4.11. In other words, if ๐ −1 is to be a function, there cannot exist ๐ฅ1 and ๐ฅ2 so that ๐ฅ1 ≠ ๐ฅ2 and ๐ (๐ฅ1 ) = ๐ (๐ฅ2 ). But then, ¬∃๐ฅ1 ∃๐ฅ2 [๐ฅ1 ≠ ๐ฅ2 ∧ ๐ (๐ฅ1 ) = ๐ (๐ฅ2 )] is equivalent to ∀๐ฅ1 ∀๐ฅ2 [๐ฅ1 = ๐ฅ2 ∨ ๐ (๐ฅ1 ) ≠ ๐ (๐ฅ2 )], which in turn is equivalent to ∀๐ฅ1 ∀๐ฅ2 [๐ (๐ฅ1 ) = ๐ (๐ฅ2 ) → ๐ฅ1 = ๐ฅ2 ]. Hence, ๐ being an invertible function implies that for every ๐ฅ1 , ๐ฅ2 ∈ dom(๐ ), ๐ฅ1 = ๐ฅ2 if and only if ๐ (๐ฅ1 ) = ๐ (๐ฅ2 ). f (x 1, y) x1 Figure 4.11 y (x 2, y) x2 The inverse of a function might not be a function. (4.7) 206 Chapter 4 RELATIONS AND FUNCTIONS ๐ A Figure 4.12 B ๐ is a one-to-one function. This means that the elements of the domain of ๐ and the elements of the range of ๐ form pairs of elements as illustrated in Figure 4.12. The sufficiency of (4.7) is the definition of a function, while necessity is the next definition. DEFINITION 4.5.5 The function ๐ : ๐ด → ๐ต is one-to-one if and only if for all ๐ฅ1 , ๐ฅ2 ∈ ๐ด, if ๐(๐ฅ1 ) = ๐(๐ฅ2 ), then ๐ฅ1 = ๐ฅ2 . A one-to-one function is sometimes called an injection. EXAMPLE 4.5.6 Define ๐ : ๐ → ๐ by ๐ (๐ฅ) = 5๐ฅ+1. To show that ๐ is one-to-one, let ๐ฅ1 , ๐ฅ2 ∈ ๐ and assume ๐ (๐ฅ1 ) = ๐ (๐ฅ2 ). Then, 5๐ฅ1 + 1 = 5๐ฅ2 + 1, 5๐ฅ1 = 5๐ฅ2 , ๐ฅ1 = ๐ฅ2 . EXAMPLE 4.5.7 Let ๐ : ๐ × ๐ → ๐ × ๐ × ๐ be the function ๐(๐, ๐) = (๐, ๐, 0). For any (๐1 , ๐1 ), (๐2 , ๐2 ) ∈ ๐ × ๐, assume ๐(๐1 , ๐1 ) = ๐(๐2 , ๐2 ). This means that (๐1 , ๐1 , 0) = (๐2 , ๐2 , 0). Hence, ๐1 = ๐2 and ๐1 = ๐2 , and this yields (๐1 , ๐1 ) = (๐2 , ๐2 ). Section 4.5 INJECTIONS AND SURJECTIONS ๐ A Figure 4.13 207 B ๐ is not a one-to-one function. If a function is not one-to-one, there must be an element of the range that has at least two pre-images (Figure 4.13). An example of a function that is not one-to-one is ๐ (๐ฅ) = ๐ฅ2 where both the domain and codomain of ๐ are ๐. This is because ๐ (2) = 4 and ๐ (−2) = 4. Another example is ๐ : ๐ → ๐ defined by ๐(๐) = cos ๐. It is not an injection because ๐(0) = ๐(2๐) = 1. Although the original function might not be one-to-one, we can always restrict the function to a subset of its domain so that the resulting function is one-to-one. This is illustrated in the next two examples. EXAMPLE 4.5.8 Let ๐ be the function {(1, 5), (2, 8), (3, 8), (4, 6)}. We observe that ๐ is not oneto-one, but both ๐ {1, 2} = {(1, 5), (2, 8)} and ๐ {3, 4} = {(3, 8), (4, 6)} are one-to-one as in Figure 4.14. A B 1 f โพA 5 2 8 3 6 4 f โพB Figure 4.14 Restrictions of ๐ to ๐ด and ๐ต are one-to-one. 208 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.5.9 Let ๐ : ๐ → ๐ be the function ๐(๐ฅ) = ๐ฅ2 . This function is not one-to-one, but ๐ [0, ∞) and ๐ (−10, −5) are one-to-one. √ Let ๐ (๐ฅ) = 3๐ฅ + 6 and ๐(๐ฅ) = 5๐ฅ − 8. Both are injections. Notice that √ (๐ โฆ ๐)(๐ฅ) = 3 5๐ฅ − 8 + 6 and (๐ โฆ ๐ )(๐ฅ) = √ 15๐ฅ + 22 are also injections. We can generalize this result to the next theorem. THEOREM 4.5.10 If ๐ : ๐ด → ๐ต and ๐ : ๐ต → ๐ถ are injections, ๐ โฆ ๐ is an injection. PROOF Assume that ๐ : ๐ด → ๐ต and ๐ : ๐ต → ๐ถ are one-to-one. Let ๐1 , ๐2 ∈ ๐ด and assume (๐ โฆ ๐)(๐1 ) = (๐ โฆ ๐)(๐2 ). Then, ๐(๐(๐1 )) = ๐(๐(๐2 )). Since ๐ is one-to-one, ๐(๐1 ) = ๐(๐2 ), and since ๐ is one-to-one, ๐1 = ๐2 . Surjections The function being an injection is not sufficient for it to be invertible since it is possible that not every element of the codomain will have a pre-image. In this case, the codomain cannot be the domain of the inverse. To prevent this situation, we will need the function to satisfy the next definition. DEFINITION 4.5.11 A function ๐ : ๐ด → ๐ต is onto if and only if for every ๐ฆ ∈ ๐ต, there exists ๐ฅ ∈ ๐ด such that ๐(๐ฅ) = ๐ฆ. An onto function is also called a surjection. This definition is related to the range (or image) of the function. The range of the function ๐ : ๐ด → ๐ต is ran(๐) = {๐ฆ : ∃๐ฅ(๐ฅ ∈ ๐ด ∧ (๐ฅ, ๐ฆ) ∈ ๐)} = {๐(๐ฅ) : ๐ฅ ∈ dom(๐)} as illustrated in Figure 4.15. Thus, ๐ is onto if and only if ran(๐) = ๐ต. EXAMPLE 4.5.12 Define ๐ : [0, ∞) → [0, ∞) by ๐ (๐ฅ) = √ ๐ฅ. Its range is also [0, ∞), so it is onto. Section 4.5 INJECTIONS AND SURJECTIONS 209 ran( ๐ ) A B x Figure 4.15 ๐ y The range of ๐ : ๐ด → ๐ต with ๐ฆ = ๐(๐ฅ). EXAMPLE 4.5.13 The ranges of the following functions are different from their codomains, so they are not onto. โ Let ๐ : ๐ → ๐ be defined by ๐(๐ฅ) = |๐ฅ|. Then, ran(๐) = [0, ∞). โ Define โ : ๐ → ๐ by โ(๐) = 2๐. Here, ran(โ) = {2๐ : ๐ ∈ ๐}. The functions illustrated in Figures 4.12, 4.13, and 4.14 are onto functions as are those in the next examples. EXAMPLE 4.5.14 Any linear function ๐ : ๐ → ๐ that is not a horizontal line is a surjection. To see this, let ๐ (๐ฅ) = ๐๐ฅ + ๐ for some ๐ ≠ 0. Take ๐ฆ ∈ ๐. We need to find ๐ฅ ∈ ๐ so that ๐๐ฅ + ๐ = ๐ฆ. Choose ๐ฆ−๐ ๐ฅ= . ๐ Then, ( ) ๐ฆ−๐ ๐ (๐ฅ) = ๐ + ๐ = ๐ฆ. ๐ The approach in the example is typical. To show that a function is onto, take an arbitrary element of the codomain and search for a candidate to serve as its pre-image. When found, check it. EXAMPLE 4.5.15 Take a positive integer ๐ and let ๐ : ๐ → ๐๐ be defined as ๐(๐) = [๐]๐ . To see that ๐ is onto, take [๐]๐ ∈ ๐๐ for some ๐ ∈ ๐. We then find that ๐(๐) = [๐]๐ . EXAMPLE 4.5.16 Let ๐, ๐ ∈ ๐ with ๐ > ๐. A function ๐ : ๐๐ → ๐๐ defined by ๐(๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 , ๐ฅ๐ , … , ๐ฅ๐−1 ) = (๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 ) 210 Chapter 4 RELATIONS AND FUNCTIONS is called a projection.. Such functions are not one-to-one, but they are onto. For instance, define ๐ : ๐ × ๐ × ๐ → ๐ × ๐ by ๐(๐ฅ, ๐ฆ, ๐ง) = (๐ฅ, ๐ฆ). It is not one-to-one because ๐(1, 2, 3) = ๐(1, 2, 4) = (1, 2). However, if we take (๐, ๐) ∈ ๐ × ๐, then ๐(๐, ๐, 0) = (๐, ๐), so ๐ is onto. If a function is not onto, it has a diagram like that of Figure 4.16. Therefore, to show that a function is not a surjection, we must find an element of the codomain that does not have a pre-image. EXAMPLE 4.5.17 Define ๐ : ๐ → ๐ by ๐ (๐) = 3๐. This function is not onto because 5 does not have a pre-image in ๐. EXAMPLE 4.5.18 The function ๐:๐×๐→๐×๐×๐ defined by ๐(๐, ๐) = (๐, ๐, 0) is not onto because (1, 1, 1) does not have a preimage in ๐ × ๐. We have the following analog of Theorem 4.5.10 for surjections. THEOREM 4.5.19 If ๐ : ๐ด → ๐ต and ๐ : ๐ต → ๐ถ are surjections, ๐ โฆ ๐ is a surjection. PROOF Assume that ๐ : ๐ด → ๐ต and ๐ : ๐ต → ๐ถ are surjections. Take ๐ ∈ ๐ถ. Then, there exists ๐ ∈ ๐ต so that ๐(๐) = ๐ and ๐ ∈ ๐ด such that ๐(๐) = ๐. Therefore, (๐ โฆ ๐)(๐) = ๐(๐(๐)) = ๐(๐) = ๐. ๐ A Figure 4.16 ๐ is not a onto function. B Section 4.5 INJECTIONS AND SURJECTIONS 211 Bijections If we have a function that is both one-to-one and onto, then it is called a bijection or a one-to-one correspondence. Observe that ∅ is a bijection. EXAMPLE 4.5.20 As illustrated in Examples 4.5.6 and 4.5.14, every linear ๐ : ๐ → ๐ with nonzero slope is a bijection. EXAMPLE 4.5.21 Both ๐ : (−๐โ2, ๐โ2) → ๐ such that ๐(๐) = tan ๐ and โ : ๐ → (0, ∞) where โ(๐ฅ) = ๐๐ฅ are bijections. We are now ready to give the standard test for invertibility. Its proof requires both Lemma 4.5.2 and Theorem 4.5.3. The benefit of this theorem is that it provides a test for invertibility in which the given function is examined instead of its inverse. THEOREM 4.5.22 A function is invertible if and only if it is a bijection. PROOF Let ๐ : ๐ด → ๐ต be a function. โ Suppose that ๐ is invertible. To show that ๐ is one-to-one, let ๐ฅ1 , ๐ฅ2 ∈ ๐ด and assume that ๐(๐ฅ1 ) = ๐(๐ฅ2 ). Then, by Theorem 4.5.3, ๐ฅ1 = ๐−1 (๐(๐ฅ1 )) = ๐−1 (๐(๐ฅ2 )) = ๐ฅ2 . To see that ๐ is onto, take ๐ฆ ∈ ๐ต. Then, there exists ๐ฅ ∈ ๐ด such that ๐−1 (๐ฆ) = ๐ฅ. Hence, ๐(๐ฅ) = ๐ฆ by Lemma 4.5.2. โ Assume that ๐ is both one-to-one and onto. To show that ๐−1 is a function, let (๐ฆ, ๐ฅ), (๐ฆ, ๐ฅ′ ) ∈ ๐−1 . This implies that ๐(๐ฅ) = ๐ฆ = ๐(๐ฅ′ ). Since ๐ is one-to-one, ๐ฅ = ๐ฅ′ . To prove that the domain of ๐−1 is ๐ต, take ๐ฆ ∈ ๐ต. Since ๐ is onto, there exists ๐ฅ ∈ ๐ด such that ๐(๐ฅ) = ๐ฆ. By Lemma 4.5.2, we have that ๐−1 (๐ฆ) = ๐ฅ, so ๐ฆ ∈ dom(๐−1 ). By Theorem 4.5.22 the functions of Examples 4.5.20 and 4.5.21 are invertible. THEOREM 4.5.23 If ๐ : ๐ด → ๐ต and ๐ : ๐ต → ๐ถ are bijections, then ๐−1 and ๐ โฆ ๐ are bijections. PROOF Suppose ๐ is a bijection. By Theorem 4.5.22, it is invertible, so ๐−1 is a function that has ๐ as its inverse. Therefore, ๐−1 is a bijection by Theorem 4.5.22. Combining the proofs of Theorems 4.5.10 and 4.5.19 show that ๐ โฆ ๐ is a bijection when ๐ is a bijection. 212 Chapter 4 RELATIONS AND FUNCTIONS Using the functions ๐ and โ from Example 4.5.21, we conclude from Theorem 4.5.23 that โ โฆ ๐ is a bijection with domain (−๐โ2, ๐โ2) and range (0, ∞). Order Isomorphims Consider the function ๐ : ๐ × {0} → {0} × ๐ (4.8) defined by ๐(๐, 0) = (0, ๐). It can be shown that ๐ is a bijection (compare Exercise 12). Define the linear orders 4 on ๐ × {0} and 4′ on {0} × ๐ by (๐, 0) 4 (๐, 0) if and only if ๐ ≤ ๐ and (0, ๐) 4′ (0, ๐) if and only if ๐ ≤ ๐. Notice that (3, 0) 4 (5, 0) and (0, 3) 4′ (0, 5) because 3 ≤ 5. We can generalize this to conclude that (๐, 0) 4 (๐, 0) if and only if (0, ๐) 4′ (0, ๐), and this implies that (๐, 0) 4 (๐, 0) if and only if ๐(๐, 0) 4′ ๐(๐, 0). This leads to the next definition. DEFINITION 4.5.24 Let ๐ be a relation on ๐ด and ๐ be a relation on ๐ต. โ ๐ : ๐ด → ๐ต is an order-preserving function if for all ๐1 , ๐2 ∈ ๐ด, (๐1 , ๐2 ) ∈ ๐ if and only if (๐(๐1 ), ๐(๐2 )) ∈ ๐ and we say that ๐ preserves ๐ with ๐. โ An order-preserving bijection is an order isomorphism. โ (๐ด, ๐ ) is order isomorphic to (๐ต, ๐) and we write (๐ด, ๐ ) ≅ (๐ต, ๐) if there exists an order isomorphism ๐ : ๐ด → ๐ต preserving ๐ with ๐. If (๐ด, ๐ ) ≅ (๐ต, ๐) and the relations ๐ and ๐ are clear from context, we can write ๐ด ≅ ๐ต. Sometimes (๐ด, ๐ ) and (๐ต, ๐) are said to have the same order type when they are order isomorphic. An isomorphism pairs elements from two sets in such a way that the orders on the two sets appear to be the same. Section 4.5 INJECTIONS AND SURJECTIONS 213 EXAMPLE 4.5.25 Define ๐ : ๐+ → ๐− by ๐ (๐ฅ) = −๐ฅ (Example 3.1.12). Clearly, ๐ is a bijection. Moreover, ๐ preserves ≤ with ≥. To prove this, let ๐ฅ1 , ๐ฅ2 ∈ ๐+ . Then, ๐ฅ1 ≤ ๐ฅ1 ⇔ −๐ฅ1 ≥ −๐ฅ2 ⇔ ๐ (๐ฅ1 ) ≥ ๐ (๐ฅ2 ). Therefore, (๐+ , ≤) ≅ (๐− , ≥). Observe that the inverse of (4.8) is the function ๐−1 : {0} × ๐ → ๐ × {0} such that ๐−1 (0, ๐) = (๐, 0). This function preserves 4′ with 4. This result is generalized and proved in the next theorem. THEOREM 4.5.26 The inverse of an order isomorphism preserving ๐ with ๐ is an order isomorphism preserving ๐ with ๐ . PROOF Let ๐ : ๐ด → ๐ต be an order isomorphism preserving ๐ with ๐. By Theorem 4.5.23, ๐−1 is a bijection. Suppose that (๐1 , ๐2 ) ∈ ๐. Since ๐ is onto, there exists ๐1 , ๐2 ∈ ๐ด such that ๐(๐1 ) = ๐1 and ๐(๐2 ) = ๐2 . This implies that (๐(๐1 ), ๐(๐2 )) ∈ ๐. Since ๐ is an isomorphism preserving ๐ with ๐, we have that (๐1 , ๐2 ) ∈ ๐ . However, ๐1 = ๐−1 (๐1 ) and ๐2 = ๐−1 (๐2 ), so (๐−1 (๐1 ), ๐−1 (๐2 )) ∈ ๐ . Therefore, ๐−1 : ๐ต → ๐ด is an isomorphism preserving ๐ with ๐ . Also, observe that ๐ : ๐ → (0, ∞) defined by ๐(๐ฅ) = ๐๐ฅ is an order isomorphism preserving ≤ with ≤. Using ๐ from Example 4.5.25, the composition ๐ โฆ ๐ is an order isomorphism ๐ → (−∞, 0) such that (๐ โฆ ๐)(๐ฅ) = −๐๐ฅ . That this happens in general is the next theorem. Its proof is left to Exercise 25. THEOREM 4.5.27 If ๐ : ๐ด → ๐ต is an isomorphism preserving ๐ with ๐ and ๐ : ๐ต → ๐ถ is an isomorphism preserving ๐ with ๐ , then ๐ โฆ ๐ : ๐ด → ๐ถ is an isomorphism preserving ๐ with ๐ . EXAMPLE 4.5.28 Let ๐ be a relation on ๐ด, ๐ be a relation on ๐ต, and ๐ be a relation on ๐ถ. โ Since the identity map is an order isomorphism, (๐ด, ๐ ) ≅ (๐ด, ๐ ). โ Suppose (๐ด, ๐ ) ≅ (๐ต, ๐). This means that there exists an order isomorphism ๐ : ๐ด → ๐ต preserving ๐ with ๐. By Theorem 4.5.26, ๐−1 is an order isomorphism preserving ๐ with ๐ . Therefore, (๐ต, ๐) ≅ (๐ด, ๐ ). 214 Chapter 4 RELATIONS AND FUNCTIONS โ Let (๐ด, ๐ ) ≅ (๐ต, ๐) and (๐ต, ๐) ≅ (๐ถ, ๐ ). By Theorem 4.5.27, we conclude that (๐ด, ๐ ) ≅ (๐ถ, ๐ ). If an order-preserving function is one-to-one, even it is not a surjection, the function still provides an order isomorphism between its domain and range. This concept is named by the next definition. DEFINITION 4.5.29 ๐ : ๐ด → ๐ต is an embedding if ๐ is an order isomorphism ๐ด → ran(๐). For example, ๐ : ๐ → ๐ such that ๐ (๐) = ๐ is an embedding preserving ≤ and ๐ : ๐2 → ๐3 such that ๐(๐ฅ, ๐ฆ) = (๐ฅ, ๐ฆ, 0) is an embedding preserving the lexicographical order (Exercise 4.3.16). Although ๐2 is not a subset of ๐3 , we view the image of ๐ as a copy of ๐2 in ๐3 that preserves the orders. Exercises 1. Show that the given pairs of functions are inverses. (a) ๐ (๐ฅ) = 3๐ฅ + 2 and ๐(๐ฅ) = 31 ๐ฅ − 23 (b) ๐(๐, ๐) = (2๐, ๐ + 2) and ๐(๐, ๐) = ( 12 ๐, ๐ − 2) (c) ๐ (๐ฅ) = ๐๐ฅ and ๐(๐ฅ) = log๐ ๐ฅ, where ๐ > 0 2. For each function, graph the indicated restriction. (a) ๐ (0, ∞), ๐ (๐ฅ) = ๐ฅ2 (b) ๐ [−5, −2], ๐(๐ฅ) = |๐ฅ| [ ] (c) โ 0, ๐โ2 , โ(๐ฅ) = cos ๐ฅ 3. Prove that the given functions are one-to-one. (a) ๐ : ๐ → ๐, ๐ (๐ฅ) = 2๐ฅ + 1 (b) ๐ : ๐2 → ๐2 , ๐(๐ฅ, ๐ฆ) = (3๐ฆ, 2๐ฅ) (c) โ : ๐ โงต {9} → ๐ โงต {0}, โ(๐ฅ) = 1โ(๐ฅ − 9) (d) ๐ : ๐ × ๐ → ๐ × (0, ∞), ๐(๐, ๐ฅ) = (3๐, ๐๐ฅ ) (e) ๐ : P(๐ด) → P(๐ต), ๐(๐ถ) = ๐ถ ∪ {๐} where ๐ด ⊂ ๐ต and ๐ ∈ ๐ต โงต ๐ด 4. Let ๐ : (๐, ๐) → (๐, ๐) be defined by ๐ (๐ฅ) = ๐−๐ (๐ฅ − ๐) + ๐. ๐−๐ Graph ๐ and show that it is a bijection. 5. Let ๐ and ๐ be functions such that ran(๐) ⊆ dom(๐ ). (a) Prove that if ๐ โฆ ๐ is one-to-one, ๐ is one-to-one. (b) Give an example of functions ๐ and ๐ such that ๐ โฆ ๐ is one-to-one, but ๐ is not one-to-one. 6. Define ๐ : ๐ → ๐๐ by ๐(๐) = [๐]๐ . Show that ๐ is not one-to-one. Section 4.5 INJECTIONS AND SURJECTIONS 215 7. Show that the given functions are not one-to-one. (a) ๐ : ๐ → ๐, ๐ (๐ฅ) = ๐ฅ4 + 3 (b) ๐ : ๐ → ๐, ๐(๐ฅ) = |๐ฅ − 2| + 4 (c) ๐ : P(๐ด) → {{๐}, ∅}, ๐(๐ต) = ๐ต ∩ {๐}, where ๐ ∈ ๐ด and ๐ด has at least two elements (d) ๐5 : ๐ ๐ → ๐, ๐5 (๐ ) = ๐ (5) 8. Let ๐ : ๐ → ๐ be periodic (Exercise 4.4.31). Prove that ๐ is not one-to-one. 9. Show that the given functions are onto. (a) ๐ : ๐ → ๐, ๐ (๐ฅ) = 2๐ฅ + 1 (b) ๐ : ๐ → (0, ∞), ๐(๐ฅ) = ๐๐ฅ (c) โ : ๐ โงต {0} → ๐ โงต {0}, โ(๐ฅ) = 1โ๐ฅ (d) ๐ : ๐ × ๐ → ๐, ๐(๐, ๐) = ๐ + ๐ (e) ๐5 : ๐ ๐ → ๐, ๐5 (๐ ) = ๐ (5) 10. Show that the given functions are not onto. (a) ๐ : ๐ → ๐, ๐ (๐ฅ) = ๐๐ฅ (b) ๐ : ๐ → ๐, ๐(๐ฅ) = |๐ฅ| (c) ๐ : ๐ × ๐ → ๐ × ๐, ๐(๐, ๐) = (3๐, ๐2 ) (d) ๐ : ๐ → ๐ ๐, ๐(๐) = ๐ , where ๐ (๐ฅ) = ๐ for all ๐ฅ ∈ ๐ 11. Let ๐ and ๐ be functions such that ran(๐) ⊆ dom(๐ ). (a) Prove that if ๐ โฆ ๐ is onto, then ๐ is onto. (b) Give an example of functions ๐ and ๐ such that ๐ โฆ ๐ is onto, but ๐ is not onto. 12. Define ๐ : ๐ × ๐ → ๐ × ๐ by ๐(๐ฅ, ๐ฆ) = (๐ฆ, ๐ฅ). Show that ๐ is a bijection. 13. Show that the function ๐พ : ๐ด × ๐ต → ๐ถ × ๐ท defined by ๐พ(๐, ๐) = (๐(๐), ๐(๐)) is a bijection if both ๐ : ๐ด → ๐ถ and ๐ : ๐ต → ๐ท are bijections. 14. Define ๐พ : ๐ด × (๐ต × ๐ถ) → (๐ด × ๐ต) × ๐ถ by ๐พ(๐, (๐, ๐)) = ((๐, ๐), ๐). Prove ๐พ is a bijection. 15. Demonstrate that the inverse of a bijection is a bijection. 16. Prove that the empty set is a bijection with domain and range equal to ∅. 17. Let ๐ด ⊆ ๐ and define ๐ : ๐ ๐ → ๐ด ๐ by ๐(๐ ) = ๐ ๐ด. Is ๐ always one-to-one? Is it always onto? Explain. 18. A function ๐ : ๐ด → ๐ต has a left inverse if there exists a function ๐ : ๐ต → ๐ด such that ๐ โฆ ๐ = ๐ผ๐ด . Prove that a function is one-to-one if and only if it has a left inverse. 216 Chapter 4 RELATIONS AND FUNCTIONS 19. A function ๐ : ๐ด → ๐ต has a right inverse if there exists a function ๐ : ๐ต → ๐ด so that ๐ โฆ ๐ = ๐ผ๐ต . Prove a function is onto if and only if it has a right inverse. 20. Let (๐ด, 4) be a poset. For every bijection ๐ : ๐ด → ๐ด, ๐ is increasing (Exercise 4.4.32) if and only if ๐−1 is increasing. 21. Show that if a real-valued function is increasing, it is one-to-one. 22. Prove or show false this modification of Theorem 4.5.23: If ๐ : ๐ด → ๐ต and ๐ : ๐ถ → ๐ท are bijections with ran(๐) ⊆ ๐ถ, then ๐ โฆ ๐ is a bijection. 23. Let ๐ด, ๐ต ⊆ ๐ be two sets ordered by ≤. Let ๐ด be well-ordered by ≤. Prove that if ๐ : ๐ด → ๐ต is an order-preserving surjection, ๐ต is well-ordered by ≤. 24. Let ๐ : ๐ด → ๐ต be an isomorphism preserving ๐ with ๐ ′ . Let ๐ถ ⊆ ๐ด and ๐ท ⊆ ๐ต. Prove the following. (a) (๐ถ, ๐ ∩ [๐ถ × ๐ถ]) ≅ (๐ [๐ถ] , ๐ ′ ∩ [๐ [๐ถ] × ๐ [๐ถ]]) (b) (๐ท, ๐ ′ ∩ [๐ท × ๐ท]) ≅ (๐−1 [๐ท] , ๐ ∩ [๐−1 [๐ท] × ๐−1 [๐ท]]) 25. Prove Theorem 4.5.27. 26. Define ๐ : ๐ → ๐ by ๐ (๐ฅ) = 2๐ฅ + 1. Prove that ๐ is an order isomorphism preserving < with <. 27. Suppose that (๐ด, ๐ ) and (๐ต, ๐) are posets. Let ๐ : ๐ด → ๐ต be an order isomorphism preserving ๐ with ๐ and ๐ถ ⊆ ๐ด. Prove that if ๐ is the least element of ๐ถ with respect to ๐ , then ๐(๐) is the least element of ๐ [๐ถ] with respect to ๐. 28. Find linear orders (๐ด, 4) and (๐ต, 4′ ) such that each is isomorphic to a subset of the other but (๐ด, 4) is not isomorphic to (๐ต, 4′ ). 29. Let (๐ด, 4) be a poset. Prove that there exists ๐ต ⊆ P(๐ด) such that (๐ด, 4) ≅ (๐ต, ⊆). 4.6 IMAGES AND INVERSE IMAGES So far we have focused on the image of single element in the domain of a function. Sometimes we will need to examine a set of images. DEFINITION 4.6.1 Let ๐ : ๐ด → ๐ต be a function and ๐ถ ⊆ ๐ด. The image of ๐ถ (under ๐) is ๐ [๐ถ] = {๐(๐ฅ) : ๐ฅ ∈ ๐ถ}. Notice that ๐ [๐ถ] ⊆ ๐ต (Figure 4.17) and ๐ [๐ด] = ran(๐). A similar definition can be made with subsets of the codomain. DEFINITION 4.6.2 Let ๐ : ๐ด → ๐ต be a function and ๐ท ⊆ ๐ต. The inverse image of ๐ท (under ๐) is ๐−1 [๐ท] = {๐ฅ ∈ ๐ด : ๐(๐ฅ) ∈ ๐ท}. Section 4.6 IMAGES AND INVERSE IMAGES 217 ๐[C] A B ๐ x y C Figure 4.17 The image of ๐ถ under ๐. D A B ๐ x y ๐−1 [D] Figure 4.18 The inverse image of ๐ท under ๐. Observe that ๐−1 [๐ท] ⊆ ๐ด (Figure 4.18) and ๐−1 [ran(๐)] = ๐ด. EXAMPLE 4.6.3 Let ๐ = {(1, 2), (2, 4), (3, 5), (4, 5)}. This set is a function. Its domain is {1, 2, 3, 4}, and its range is {2, 4, 5}. Then, ๐ [{1, 3}] = {2, 5} and ๐ −1 [{5}] = {3, 4}. EXAMPLE 4.6.4 Define ๐ : ๐ → ๐ by ๐ (๐ฅ) = ๐ฅ2 + 1. โ To prove ๐ [(1, 2)] = (2, 5), we show both inclusions. Let ๐ฆ ∈ ๐ [(1, 2)]. Then, ๐ฆ = ๐ฅ2 + 1 for some ๐ฅ ∈ (1, 2). By a little algebra, we see that 2 < ๐ฅ2 + 1 < 5. Hence, ๐ฆ ∈ (2, 5). Conversely, let ๐ฆ ∈ (2, 5). Because √ 2 < ๐ฆ < 5 ⇔ 1 < ๐ฆ − 1 < 4 ⇔ 1 < ๐ฆ − 1 < 2, 218 Chapter 4 RELATIONS AND FUNCTIONS (a) ๐ [(1, 2)] = (2, 5) Figure 4.19 √ (b) ๐ −1 [(2, 5)] = (−2, −1) ∪ (1, 2) The image and inverse image of a set under ๐ . ๐ฆ − 1 ∈ (1, 2). Furthermore, √ √ √ ๐ ( ๐ฆ − 1) = ๐ ( ๐ฆ − 1) = ( ๐ฆ − 1)2 + 1 = ๐ฆ. Therefore, ๐ฆ ∈ ๐ [(1, 2)]. This is illustrated in Figure 4.19(a). โ Simply because ๐ [(1, 2)] = (2, 5), we cannot conclude ๐ −1 [(2, 5)] equals (1, 2). Instead, ๐ −1 [(2, 5)] = (−2, −1) ∪ (1, 2), as seen in Figure 4.19(b), because ๐ฅ ∈ (−2, −1) ∪ (1, 2) ⇔ −2 < ๐ฅ < −1 or 1 < ๐ฅ < 2 ⇔ 1 < ๐ฅ2 < 4 ⇔ 2 < ๐ฅ2 + 1 < 5 ⇔ ๐ (๐ฅ) ∈ (2, 5) ⇔ ๐ฅ ∈ ๐ −1 [(2, 5)] . โ To show that ๐ −1 [(−2, −1)] is empty, take ๐ฅ ∈ ๐ −1 [(−2, −1)]. This means that −2 < ๐ (๐ฅ) < −1, but this is impossible because ๐ (๐ฅ) is positive. Let ๐ = {(1, 3), (2, 3), (3, 4), (4, 5)}. This is a function {1, 2, 3, 4} → {3, 4, 5}. Notice that ๐ [{1} ∪ {2, 3}] = {3, 4} = ๐ [{1}] ∪ ๐ [{2, 3}], ๐ [{1, 2} ∩ {3}] = ๐ [∅] = ∅ ⊆ {3} = ๐ [{1}] ∩ ๐ [{2, 3}], ๐ −1 [{3, 4} ∪ {5}] = {1, 2, 3, 4} = ๐ −1 [{3, 4}] ∪ ๐ −1 [{5}], Section 4.6 IMAGES AND INVERSE IMAGES 219 and ๐ −1 [{3, 4} ∩ {5}] = ∅ = ๐ −1 [{3, 4}] ∩ ๐ −1 [{5}]. This result concerning the interaction of images and inverse images with union and intersection is generalized in the next theorem. THEOREM 4.6.5 Let ๐ : ๐ด → ๐ต be a function with ๐ถ, ๐ท ⊆ ๐ด and ๐ธ, ๐น ⊆ ๐ต. โ ๐ [๐ถ ∪ ๐ท] = ๐ [๐ถ] ∪ ๐ [๐ท]. โ ๐ [๐ถ ∩ ๐ท] ⊆ ๐ [๐ถ] ∩ ๐ [๐ท]. โ ๐−1 [๐ธ ∪ ๐น ] = ๐−1 [๐ธ] ∪ ๐−1 [๐น ]. โ ๐−1 [๐ธ ∩ ๐น ] = ๐−1 [๐ธ] ∩ ๐−1 [๐น ]. PROOF We prove the first and third parts, leaving the others for Exercise 7. By Exercise 3.2.8(d), ๐ฆ ∈ ๐ [๐ถ ∪ ๐ท] ⇔ ∃๐ฅ(๐ฅ ∈ ๐ถ ∪ ๐ท ∧ ๐(๐ฅ) = ๐ฆ) ⇔ ∃๐ฅ(๐ฅ ∈ ๐ถ ∧ ๐(๐ฅ) = ๐ฆ) ∨ ∃๐ฅ(๐ฅ ∈ ๐ท ∧ ๐(๐ฅ) = ๐ฆ) ⇔ ๐ฆ ∈ ๐ [๐ถ] ∨ ๐ฆ ∈ ๐ [๐ท] ⇔ ๐ฆ ∈ ๐ [๐ถ] ∪ ๐ [๐ท] . In addition, ๐ฅ ∈ ๐−1 [๐ธ ∪ ๐น ] ⇔ ๐(๐ฅ) ∈ ๐ธ ∪ ๐น ⇔ ๐(๐ฅ) ∈ ๐ธ ∨ ๐(๐ฅ) ∈ ๐น ⇔ ๐ฅ ∈ ๐−1 [๐ธ] ∨ ๐ฅ ∈ ๐−1 [๐น ] ⇔ ๐ฅ ∈ ๐−1 [๐ธ] ∪ ๐−1 [๐น ] . It might seem surprising that we only have an inclusion in the second part of Theorem 4.6.5. To see that the other inclusion is false, let ๐ = {(1, 3), (2, 3)}. Then, ๐ [{1} ∩ {2}] = ๐ [∅] = ∅, but ๐ [{1}] ∩ ๐ [{2}] = {3} ∩ {3} = {3}. Hence, ๐ [{1}] ∩ ๐ [{2}] * ๐ [{1} ∩ {2}]. However, if ๐ had been a bijection, the inclusion would hold (Exercise 13). 220 Chapter 4 RELATIONS AND FUNCTIONS EXAMPLE 4.6.6 Let ๐ (๐ฅ) = ๐ฅ2 + 1. We check the union results of Theorem 4.6.5. โ We have already seen in Example 4.6.4 that ๐ [(1, 2)] = (2, 5). Since (1, 2) = (1, 1.5] ∪ [1.5, 2), apply ๐ to both of these intervals and find that ๐ [(1, 1.5]] = (2, 3.25], ๐ [[1.5, 2)] = [3.25, 5). Therefore, ๐ [(1, 2)] = ๐ [(1, 1.5]] ∪ ๐ [[1.5, 2)]. โ Also, from Example 4.6.4, ๐ −1 [(2, 5)] = (−2, −1) ∪ (1, 2). We can write (2, 5) as the union of (2, 4) and (3, 5). Since ) ( √ ) ( √ ๐ −1 [(2, 4)] = − 3, −1 ∪ 1, 3 , ( √ ) (√ ) ๐ −1 [(3, 5)] = −2, − 2 ∪ 2, 2 , we have that ๐ −1 [(2, 5)] = ๐ −1 [(2, 4)] ∪ ๐ −1 [(3, 5)]. Theorem 4.6.5 can be modified to arbitrary unions and intersections. The proof is left to Excercise 17. THEOREM 4.6.7 Let {๐ด๐ : ๐ ∈ ๐ผ} be a family of sets. [โ ] โ โ ๐ ๐ด๐ = ๐[๐ด๐ ] ๐∈๐ผ ๐∈๐ผ [โ ] โ โ ๐ ๐ด๐ ⊆ ๐[๐ด๐ ] ๐∈๐ผ โ ๐−1 [โ ๐∈๐ผ โ ๐−1 [โ ๐∈๐ผ ๐∈๐ผ ] โ ๐ด๐ = ๐−1 [๐ด๐ ] ๐∈๐ผ ] โ ๐ด๐ = ๐−1 [๐ด๐ ]. ๐∈๐ผ We know that the composition of a function and its inverse equals the identity map (Theorem 4.5.3). The last two results of the section show a similar result involving the image and inverse image using functions that are either one-to-one or onto. Section 4.6 IMAGES AND INVERSE IMAGES 221 THEOREM 4.6.8 Let ๐ : ๐ด → ๐ต be a function. Suppose ๐ถ ⊆ ๐ด and ๐ท ⊆ ๐ต. โ If ๐ is one-to-one, then ๐−1 [๐[๐ถ]] = ๐ถ. โ If ๐ is onto, then ๐[๐−1 [๐ท]] = ๐ท. PROOF We prove the first part and leave the second to Exercise 8. Suppose that ๐ is an injection. We show that ๐−1 [๐ [๐ถ]] = ๐ถ. โ Let ๐ฅ ∈ ๐−1 [๐[๐ถ]]. This means that there exists ๐ฆ ∈ ๐[๐ถ] such that ๐(๐ฅ) = ๐ฆ. Furthermore, there is a ๐ง ∈ ๐ถ so that ๐(๐ง) = ๐ฆ. Therefore, since ๐ is one-to-one, ๐ฅ = ๐ง, which means that ๐ฅ ∈ ๐ถ. โ This step will work for any function. Take ๐ฅ ∈ ๐ถ, so ๐(๐ฅ) ∈ ๐[๐ถ]. By definition, ๐−1 [๐[๐ถ]] = {๐ง ∈ ๐ด : ๐(๐ง) ∈ ๐[๐ถ]}. Since ๐ฅ is also an element of ๐ด, we conclude that ๐ฅ ∈ ๐−1 [๐[๐ถ]]. Because we used the one-to-one condition only to prove ๐−1 [๐[๐ถ]] ⊆ ๐ถ, we suspect that this inclusion is false if the function is not one-to-one. To see this, let ๐ : ๐ → ๐ be defined as ๐ (๐ฅ) = ๐ฅ2 . We know that ๐ is not one-to-one. Choose ๐ถ = {1}. Then, ๐ −1 [๐ [๐ถ]] = ๐ −1 [{1}] = {−1, 1}. Therefore, ๐ −1 [๐ [๐ถ]] * ๐ถ. When examining the example, we might conjecture that the function being one-toone is necessary for equality. That this is the case is the following theorem. THEOREM 4.6.9 Let ๐ : ๐ด → ๐ต be a function. โ If ๐−1 [๐ [๐ถ]] = ๐ถ for all ๐ถ ⊆ ๐ด, then ๐ is one-to-one. [ ] โ If ๐ ๐−1 [๐ท] = ๐ท for all ๐ท ⊆ ๐ต, then ๐ is onto. PROOF As with the previous theorem, we prove only the first part. The second part is Exercise 8. Assume ๐−1 [๐[๐ถ]] = ๐ถ for all ๐ถ ⊆ ๐ด. Take ๐ฅ1 , ๐ฅ2 ∈ ๐ด and let ๐(๐ฅ1 ) = ๐(๐ฅ2 ). Now, {๐ฅ1 } = ๐−1 [๐[{๐ฅ1 }]] = {๐ง ∈ ๐ด : ๐(๐ง) = ๐(๐ฅ1 )}. Because ๐(๐ฅ1 ) = ๐(๐ฅ2 ), we have that {๐ฅ1 , ๐ฅ2 } ⊆ {๐ง ∈ ๐ด : ๐(๐ง) = ๐(๐ฅ1 )}, so we must have ๐ฅ1 = ๐ฅ2 . 222 Chapter 4 RELATIONS AND FUNCTIONS Exercises 1. Let ๐ (a) (b) (c) (d) : ๐ → ๐ be defined by ๐ (๐ฅ) = 2๐ฅ + 1. Find the images and inverse images. ๐ [(1, 3]] ๐ [(−∞, 0)] ๐ −1 [(−1, 1)] ๐ −1 [(0, 2) ∪ (5, 8)] 2. Let ๐ (a) (b) (c) (d) : ๐ → ๐ be the function ๐(๐ฅ) = ๐ฅ4 − 1. Find the images and inverse images. ๐[{0}] ๐[๐] ๐ −1 [{0, 15}] ๐ −1 [[−9, −5] ∪ [0, 5]] 3. Define ๐ : ๐ × ๐ → ๐ by ๐(๐, ๐) = โฆ๐โง + โฆ๐โง. Find the images and inverse images. (a) ๐[{0} × ๐] (b) ๐[(0, 1) × (0, 1)] (c) ๐−1 [{2, 4}] (d) ๐−1 [๐] 4. Let ๐ (a) (b) (c) (d) : ๐ → ๐ be a function and define ๐พ : P(๐) → P(๐) by ๐พ(๐ถ) = ๐[๐ถ]. Prove that ๐พ is well-defined. Let ๐(๐) = 2๐. Find ๐พ[{1, 2, 3}], ๐พ[๐], ๐พ −1 [{1, 2, 3}], and ๐พ −1 [๐]. Under what conditions is ๐พ one-to-one? Under what conditions is ๐พ onto? 5. For any function ๐, show that ๐[∅] = ∅ and ๐ −1 [∅] = ∅. 6. Prove for every ๐ต ⊆ ๐ด, ๐ผ๐ด [๐ต] = ๐ต and (๐ผ๐ด )−1 [๐ต] = ๐ต. 7. Prove the remaining parts of Theorem 4.6.5. 8. Prove the unproven parts of Theorems 4.6.8 and 4.6.9. 9. Let ๐ : ๐ด → ๐ต be an injection and ๐ถ ⊆ ๐ด. (a) Prove ๐(๐ฅ) ∈ ๐[๐ถ] if and only if ๐ฅ ∈ ๐ถ. (b) Show that Exercise 9(a) is false if the function is not one-to-one. 10. If ๐ is a function, ๐ด ⊆ ๐ต ⊆ dom(๐), and ๐ถ ⊆ ๐ท ⊆ ran(๐), show that both ๐[๐ด] ⊆ ๐[๐ต] and ๐ −1 [๐ถ] ⊆ ๐ −1 [๐ท]. 11. Let ๐ : ๐ด → ๐ต be a function and take disjoint sets ๐ and ๐ . (a) Prove false: If ๐ , ๐ ⊆ ๐ด, then ๐[๐ ] ∩ ๐[๐ ] = ∅. (b) Prove false: If ๐ , ๐ ⊆ ๐ต, then ๐−1 [๐ ] ∩ ๐−1 [๐ ] = ∅. (c) What additional assumption is needed to prove both of the implications? 12. Assume that ๐ and ๐ are functions such that ran(๐) ⊆ dom(๐). Let ๐ด be a subset of dom(๐). Prove or show false with a counterexample: (๐ โฆ ๐)[๐ด] = ๐[๐[๐ด]]. 13. Let ๐ : ๐ด → ๐ต be one-to-one. Prove the following. Section 4.6 IMAGES AND INVERSE IMAGES 223 (a) ๐[๐ด] ∩ ๐[๐ต] ⊆ ๐[๐ด ∩ ๐ต]. (b) If ๐ถ ⊆ ๐ด and ๐ท ⊆ ๐ต, then ๐[๐ถ] = ๐ท if and only if ๐−1 [๐ท] = ๐ถ. 14. Prove that if ๐ : ๐ด → ๐ต is a bijection and ๐ถ ⊆ ๐ด, then ๐[๐ด โงต ๐ถ] = ๐ต โงต ๐[๐ถ]. 15. Find a function ๐ : ๐ด → ๐ต and a set ๐ท ⊆ ๐ต such that ๐ท * ๐[๐−1 [๐ท]]. 16. Let ๐ be an injection with ๐ด ⊆ dom(๐) and ๐ต ⊆ ran(๐). Prove that ๐ต ⊆ ๐[๐ด] if and only if ๐ −1 [๐ต] ⊆ ๐ด. 17. Prove Theorem 4.6.7. CHAPTER 5 AXIOMATIC SET THEORY 5.1 AXIOMS When we began studying set theory in Chapter 3, we made several assumptions regarding which things are sets. For example, we assumed that collections of numbers, like ๐, ๐, or (0, ∞) are sets. We supposed that operating with given sets to form new collections, as with union or intersection, resulted in sets. We also assumed that formulas could be used to describe certain sets. All of this seemed perfectly reasonable, but since all of these assumptions were made without a carefully thought-out system, we would be wise to pause and investigate if we have introduced any problems. Consider the following question. Given a formula ๐(๐ฅ), is there a set of the form {๐ฅ : ๐(๐ฅ)}? Consistent with the attitude of our previous work, we might quickly answer in the affirmative. Mathematicians, including Cantor, also initially thought that this was the case. However, it was shown independently by Bertrand Russell and Ernst Zermelo that not every formula can be used to define a set. For example, let ๐(๐ฅ) := ๐ฅ ∉ ๐ฅ and ๐ด = {๐ฅ : ๐(๐ฅ)} and consider whether ๐ด is an element of itself. If ๐ด ∈ ๐ด, then due to the definition of ๐(๐ฅ), ๐ด ∉ ๐ด, and if ๐ด ∉ ๐ด, then ๐ด ∈ ๐ด. Because of this built-in contradiction, it is impossible for ๐ด to be a set. This is known as Russell’s paradox, and it was a serious challenge to set theory. One solution would have been to dismiss 225 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 226 Chapter 5 AXIOMATIC SET THEORY set theory altogether. The problem was that this new subject combined with advances in logic appeared to promise a framework in which to study foundational questions of mathematics. David Hilbert famously supported set theory by remarking that “no one will drive us out of this paradise that Cantor has created for us,” so dismissal was not an option. In order to prevent contradictions such as Russell’s paradox from appearing, mathematicians settled on the method of Euclid as the solution, but instead of assuming geometric postulates, over a period of time, certain set-theoretic axioms were chosen. Their purpose was to define a system by which one could determine whether a given collection should be considered a set in such a manner that prevented any contradictions from arising. In this chapter we identify the axioms and then redefine ๐, ๐, and other collections so that we are confident in our previous assumptions regarding them being sets. Equality Axioms We begin with the basics. Although not officially among the set axioms, = is always assumed to satisfy the following rules. They are defined to replicate the standard reasoning of equality that we have been using in previous chapters. AXIOMS 5.1.1 [Equality] Let ๐ฅ, ๐ฆ, and ๐ง be variable symbols from theory symbols ๐ฒ. โ [E1] ๐ฅ = ๐ฅ. โ [E2] ๐ฅ = ๐ฆ ⇔ ๐ฆ = ๐ฅ. โ [E3] ๐ฅ = ๐ฆ, ๐ฆ = ๐ง ⇒ ๐ฅ = ๐ง. Let ๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 and ๐ฆ0 , ๐ฆ1 , … , ๐ฆ๐−1 be variable symbols from ๐ฒ. โ [E4] For any ๐-ary function symbol ๐ of ๐ฒ, ๐ฅ0 = ๐ฆ0 ∧ ๐ฅ1 = ๐ฆ1 ∧ · · · ∧ ๐ฅ๐−1 = ๐ฆ๐−1 ⇒ ๐ (๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 ) = ๐ (๐ฆ0 , ๐ฆ1 , … , ๐ฆ๐−1 ). โ [E5] For any ๐ฒ-formula ๐(๐ข0 , ๐ข1 , … , ๐ข๐−1 ), ๐ฅ0 = ๐ฆ0 ∧ ๐ฅ1 = ๐ฆ1 ∧ · · · ∧ ๐ฅ๐−1 = ๐ฆ๐−1 ⇒ ๐(๐ฅ0 , ๐ฅ1 , … , ๐ฅ๐−1 ) ↔ ๐(๐ฆ0 , ๐ฆ1 , … , ๐ฆ๐−1 ). Axioms E1, E2, and E3 give = the behavior of an equivalence relation (Definition 4.2.4). For example, we can use E2 to prove that for any constant symbols ๐0 and ๐1 , โข ๐0 = ๐1 ↔ ๐1 = ๐0 . Section 5.1 AXIOMS The proof goes as follows: 227 ] [ ๐ ๐1 ๐0 = ๐1 ⇔ (๐ฅ0 = ๐ฅ1 ) 0 ๐ฅ0 ๐ฅ1 [ ] ๐ ๐1 ⇔ (๐ฅ1 = ๐ฅ0 ) 0 ๐ฅ0 ๐ฅ1 ⇔ ๐1 = ๐0 . Axiom E4 allows a function symbol to be used in a proof as a function, and E5 allows equal terms to be substituted into formulas with the result being equivalent formulas. For example, given the ๐ญ๐ณ-term ๐ข + ๐ฃ, by E4, ๐ฅ0 = ๐ฆ0 ∧ ๐ฅ1 = ๐ฆ1 ⇒ ๐ฅ0 + ๐ฅ1 = ๐ฆ0 + ๐ฆ1 , and given the ๐ญ๐ณ-formula ๐ข + ๐ฃ = ๐ฃ + ๐ข, by E5, ๐ฅ0 = ๐ฆ0 ∧ ๐ฅ1 = ๐ฆ1 ⇒ ๐ฅ0 + ๐ฅ1 = ๐ฅ1 + ๐ฅ0 ↔ ๐ฆ0 + ๐ฆ1 = ๐ฆ1 + ๐ฆ0 . Formal proofs that require a deduction on an equality need to reference one of the equality axioms from Axioms 5.1.1. Existence and Uniqueness Axioms The axioms will be ๐ฒ๐ณ-formulas (Example 2.1.3), where all terms represent sets. We begin by assuming the existence of two sets. AXIOM 5.1.2 [Empty Set] ∃๐ฅ∀๐ฆ(¬ ๐ฆ ∈ ๐ฅ). In ๐ฒ๐ณ-formulas, a witness for the empty set axiom is denoted by { }, although it is usually written as ∅. AXIOM 5.1.3 [Infinity] ∃๐ฅ({ } ∈ ๐ฅ ∧ ∀๐ข[๐ข ∈ ๐ฅ → ∃๐ฆ(๐ฆ ∈ ๐ฅ ∧ ๐ข ∈ ๐ฆ ∧ ∀๐ฃ[๐ฃ ∈ ๐ข → ๐ฃ ∈ ๐ฆ])]). The standard interpretation of the infinity axiom is that there exists a set that contains ∅ and ๐ ∪ {๐} is an element of the set if ๐ is an element of the set. In Chapter 3, we noted that the elements of a set determine the set. For example, {1, 2} = {1, 2, 2}. This principle is codified by the next axiom. AXIOM 5.1.4 [Extensionality] ∀๐ฅ∀๐ฆ(∀๐ข[๐ข ∈ ๐ฅ ↔ ๐ข ∈ ๐ฆ] → ๐ฅ = ๐ฆ). Suppose ๐ด1 and ๐ด2 are witnesses to the empty set axiom (5.1.2). Since ๐ฅ ∉ ๐ด1 and ๐ฅ ∉ ๐ด2 for every ๐ฅ, we conclude that ๐ฅ ∈ ๐ด1 ↔ ๐ฅ ∈ ๐ด2 . 228 Chapter 5 AXIOMATIC SET THEORY Therefore, ๐ด1 = ๐ด2 by extensionality (Axiom 5.1.4), which means that ∅ is the witness to the empty set axiom. This uniqueness result does not appear to extend to the infinity axiom because both {∅, {∅}, {∅, {∅}}, {∅, {∅}, {∅, {∅}}}, … } and {∅, {∅}, {∅, {∅}}, {∅, {∅}, {∅, {∅}}}, … , {∅, {∅}, {∅, {∅}}, {∅, {∅}, {∅, {∅}}}}, … } are witnesses, provided that they are sets. Construction Axioms Now to build some sets. The next four axioms allow us to do this. AXIOM 5.1.5 [Pairing] ∀๐ข ∀๐ฃ ∃๐ฅ ∀๐ค (๐ค ∈ ๐ฅ ↔ ๐ค = ๐ข ∨ ๐ค = ๐ฃ). Suppose that ๐ and ๐ are sets. Since {๐, ๐} is a witness of ∃๐ฅ ∀๐ค (๐ค ∈ ๐ฅ ↔ ๐ค = ๐ ∨ ๐ค = ๐), {๐, ๐} is a set by the pairing axiom, and from this, we conclude that {๐, {๐, ๐}}, {๐, {๐, ๐}}, and {{๐, ๐}} = {{๐, ๐}, {๐, ๐}} are sets. Because {๐, ๐} equals {๐}, pairing along with extensionality prove the existence of singletons. For example, if ๐ is a witness to the Infinity Axiom, {∅, ๐ }, {∅}, and {∅, {∅, ๐ }} are sets. AXIOM 5.1.6 [Union] ∀๐ฅ∃๐ฆ∀๐ข(๐ข ∈ ๐ฆ ↔ ∃๐ฃ[๐ฃ ∈ ๐ฅ ∧ ๐ข ∈ ๐ฃ]). โ โ By the union axiom, ๐ is a set, and since ๐ ∪ ๐ = {๐, ๐}, we conclude that ๐ ∪ ๐ is a set. Furthermore, the empty set, union, and pairing axioms can be used to prove that for any ๐ ∈ ๐, there exists a set of the form {๐0 , ๐1 , … , ๐๐ } (Exercise 3). AXIOM 5.1.7 [Power Set] ∀๐ฅ∃๐ฆ∀๐ข(๐ข ∈ ๐ฆ ↔ ∀๐ฃ[๐ฃ ∈ ๐ข → ๐ฃ ∈ ๐ฅ]). Because of Definition 3.3.1, the power set axiom can be written as ∀๐ฅ∃๐ฆ∀๐ข(๐ข ∈ ๐ฆ ↔ ๐ข ⊆ ๐ฅ). We conclude that for every set ๐, P(๐) is a set by the power set axiom, and by extensionality, P(๐) is the unique set of subsets of ๐. The next axiom is actually what is called an axiom scheme, infinitely many axioms, one for every formula. They are sometimes called the separation axioms. Section 5.1 AXIOMS 229 AXIOMS 5.1.8 [Subset] For every ๐ฒ๐ณ-formula ๐(๐ข) not containing the symbol ๐ฆ, the following is an axiom: ∀๐ฅ∃๐ฆ∀๐ข [๐ข ∈ ๐ฆ ↔ ๐ข ∈ ๐ฅ ∧ ๐(๐ข)] . The formula ๐(๐ข) in the subset axioms cannot contain the symbol ๐ฆ because the axioms yield the existence of this set. If ๐ฆ was among its symbols, the existence of ๐ฆ would depend on ๐ฆ. The subset axioms yield many familiar sets. โ Let โฑ be a set. By a subset axiom, there exists a set ๐ถ such that โ ๐ฅ∈๐ถ↔๐ฅ∈ โฑ ∧ ∀๐(๐ ∈ โฑ → ๐ฅ ∈ ๐). Observe that the symbol ๐ถ does not appear in the formula ∀๐(๐ ∈ ๐ด → ๐ฅ ∈ ๐). โ Also, observe that โthe set ๐ถ is the intersection of โฑ . Hence, โฑ is a set, and since ๐ ∩ ๐ = {๐, ๐}, we conclude that ๐ ∩ ๐ is a set. โ By a subset axiom, there exists a set ๐ท such that ๐ฅ ∈ ๐ท ↔ ๐ฅ ∈ ๐ ∧ ๐ฅ ∉ ๐, so ๐ โงต ๐ is a set. Replacement Axioms Given sets ๐ด and ๐ต, the function ๐ : ๐ด → ๐ต is a set because ๐ ⊆ ๐ด × ๐ต and ๐ด × ๐ต is a set. Suppose, instead, that the function is defined using a formula ๐(๐ฅ, ๐ฆ) and that its domain is given by a set ๐ด. It cannot be concluded from Axioms 5.1.4–5.1.10 that the range {๐ฆ : ๐ฅ ∈ ๐ด ∧ ๐(๐ฅ, ๐ฆ)} is a set. However, it appears reasonable that it is. For example, define ๐(๐ฅ, ๐ฆ) := ๐ฆ ∈ ๐ ∧ ๐ฆ ≤ ๐ฅ ∧ ∀๐ง(๐ง ∈ ๐ ∧ ๐ฆ ≤ ๐ง → ๐ฅ < ๐ง). An examination of the formula shows ๐(3.4, 4) and ๐(−7.1, −8). Since ๐(๐ฅ, ๐ฆ1 ) ∧ ๐(๐ฅ, ๐ฆ2 ) → ๐ฆ1 = ๐ฆ2 , the formula ๐(๐ฅ, ๐ฆ) defines a function (Definition 4.4.10). If the domain is given to be [0, ∞), its range is the set [0, ∞) ∩ ๐. Generalizing to an arbitrary ๐(๐ฅ, ๐ฆ), it is expected that the range would be a set if the domain is a set. The next axiom scheme guarantees this. It was first found in correspondence between Cantor and Richard Dedekind (Cantor 1932) and Dmitry Mirimanoff (1917) with formal versions by Abraham Fraenkel (1922) and Thoralf Skolem (1922). 230 Chapter 5 AXIOMATIC SET THEORY AXIOMS 5.1.9 [Replacement] For every ๐ฒ๐ณ-formula ๐(๐ก, ๐ค) not containing the symbol ๐ฆ, the following is an axiom: ∀๐ฅ[∀๐ข∀๐ฃ1 ∀๐ฃ2 (๐ข ∈ ๐ฅ ∧ ๐(๐ข, ๐ฃ1 ) ∧ ๐(๐ข, ๐ฃ2 ) → ๐ฃ1 = ๐ฃ2 ) → ∃๐ฆ∀๐ค(๐ค ∈ ๐ฆ ↔ ∃๐ก[๐ก ∈ ๐ฅ ∧ ๐(๐ก, ๐ค)])]. As an example, every indexed family of sets is a set. To prove this, let ๐ผ be a set and ๐ด๐ be a set for all ๐ ∈ ๐ผ. Define ๐(๐, ๐ฆ) := ๐ฆ = ๐ด๐ . Observe that by E2 and E3 (Axioms 5.1.1), ๐ฆ1 = ๐ด๐ ∧ ๐ฆ2 = ๐ด๐ ⇒ ๐ฆ1 = ๐ฆ2 . Therefore, by a replacement axiom (5.1.9), there exists a set โฑ such that ๐ค ∈ โฑ ↔ ∃๐(๐ ∈ ๐ผ ∧ ๐ค = ๐ด๐ ), so โฑ = {๐ด๐ : ๐ ∈ ๐ผ} is a set, which implies that the union and intersection of families of sets are sets. Suppose ๐ ∈ ๐ and ๐ ∈ ๐. By the subset and pairing axioms, we conclude that {๐, ๐}, (๐, ๐) = {{๐}, {๐, ๐}} (Definition 3.2.8), and {(๐, ๐)} are sets . Therefore, fixing ๐, {{(๐, ๐)} : ๐ ∈ ๐} is a family of sets, which implies that it is a set. Likewise, {{{(๐, ๐)} : ๐ ∈ ๐} : ๐ ∈ ๐} is a set. Hence, by the union axiom (5.1.6), โโ ๐ ×๐ = {{{(๐, ๐)} : ๐ ∈ ๐} : ๐ ∈ ๐} is a set. This implies, using a subset axiom (5.1.8), that any binary relation ๐ such that dom(๐ ) ⊆ ๐ and ran(๐ ) ⊆ ๐ is a set. Axiom of Choice Suppose that we are given the pairwise disjoint family of sets โฑ = {{1, 3, 5}, {2, 9, 11}, {7, 8, 13}}. It is easy to find a set ๐ such that ๐ ∩ ๐ด is a singleton for every ๐ด ∈ โฑ . (5.1) Section 5.1 AXIOMS 231 Simply run through the elements of โฑ and choose an element from each set and put it in ๐. Since โฑ is pairwise disjoint, each choice will differ from the others. For example, it might be that ๐ = {1, 9, 13}. However, what happens if โฑ is an infinite set? If there was not a systematic way where elements could be chosen from the sets of โฑ , we would be left with making infinitely many choices, which is something that we cannot do. Nonetheless, it appears reasonable that there is a set ๐ that intersects each member of โฑ exactly once. Such a set cannot be proved to exist from Axioms 5.1.2 to 5.1.9, so we need another axiom. It is called the axiom of choice. We will have to use it every time that an infinite number of arbitrary choices need to be made. AXIOM 5.1.10 [Choice] If โฑ is a family of pairwise disjoint, nonempty sets, there exists ๐ ⊆ that ๐ ∩ ๐ด is a singleton for all ๐ด ∈ โฑ . โ โฑ such The statement of the axiom of choice can be written as an ๐ฒ๐ณ-formula (Exercise 12). Also, notice that ๐ in Axiom 5.1.10 is a function (Exercise 13). It is called a selector. The next follows quickly from the axiom of choice. In fact, the proposition is equivalent to the axiom (Exercise 15), so this corollary is often used as a replacement for it. COROLLARY 5.1.11 For every binary relation ๐ , there exists a function ๐ such that ๐ ⊆ ๐ and dom(๐) = dom(๐ ). PROOF Let ๐ ⊆ ๐ด×๐ต. Define โฑ = {{๐}×[๐]๐ : ๐ ∈ ๐ด}, which is a set by a replacement axiom (Exercise 14). Since โฑ is pairwise disjoint and the set {๐} × [๐]๐ ≠ ∅ for all ๐ ∈ dom(๐ ), the axiom of choice (5.1.10) implies that there exists a selector โ ๐ such that ๐ ⊆ โฑ and ๐ ∩ ({๐} × [๐]๐ ) is a singleton for all ๐ ∈ dom(๐ ). Thus, ๐ ⊆ ๐ , and for all ๐ ∈ dom(๐ ), there exists a unique ๐ ∈ ๐ต such that ๐ ๐ ๐. This implies that ๐ is the desired function. โ Given a family of sets โฑ , define a relation ๐ ⊆ โฑ × โฑ by ๐ = {(๐ด, ๐) : ๐ด ∈ โฑ ∧ ๐ ∈ ๐ด}. โ By Corollary 5.1.11, there exists a function ๐ : โฑ → โฑ such that ๐(๐ด) ∈ ๐ด for all ๐ด ∈ โฑ . The function ๐ is called a choice function. EXAMPLE 5.1.12 Let ๐ = {๐ด๐ : ๐ ∈ ๐ผ} be a family of nonempty sets. We want to define a family of singletons โฌ such that for all ๐ ∈ ๐ผ, if {๐๐ } ∈ โฌ, then ๐๐ ∈ ๐ด๐ . 232 Chapter 5 AXIOMATIC SET THEORY โ By Corollary 5.1.11, there exists a choice function ๐ : ๐ → ๐ . The family โฌ = {{๐(๐ด๐ )} : ๐ ∈ ๐ผ} is the desired set because ๐(๐ด๐ ) ∈ ๐ด๐ for all ๐ ∈ ๐ผ. There are many theorems equivalent to the axiom of choice (5.1.10). One such result involves families of sets. Take ๐ ∈ ๐ and define ๐๐ = {{0}, {0, 1}, … , {0, 1, … , ๐}}. Observe that ๐๐ is a chain with respect to ⊆ and contains a maximal element (Definition 4.3.16), which is {0, 1, … , ๐}. However, the chain ๐ = {{0}, {0, 1}, … , {0, 1, … , ๐}, … } has no maximal element. There are many sets that can be added to ๐ to give it a maximal element, but the natural choice is to add the union of ๐ to the family giving ๐ ′ = {{0}, {0, 1}, … , {0, 1, … , ๐}, … , ๐}. ๐ ′ has a maximal element, namely, ๐. The generalization of this result to any family of sets was first proved by Kuratowski (1922) and then independently by Zorn (1935) for whom the theorem is named despite Kuratowski’s priority. The proof given is essentially due to Zermelo (Halmos 1960). THEOREM 5.1.13 [Zorn’s Lemma] โ Let ๐ be a family of sets. If ๐ ∈ ๐ for every chain ๐ of ๐ with respect to ⊆, there exists ๐ ∈ ๐ such that ๐ ⊄ ๐ด for all ๐ด ∈ ๐ . PROOF Let ๐ : P(๐ ) โงต {∅} → ๐ be a choice function (Corollary 5.1.11). For every chain ๐ of ๐ , define ๐ฬ = {๐ด ∈ ๐ : ๐ ∪ {๐ด} is a chain}. Notice that ๐ฬ is the set of elements of ๐ that when added to ๐ yields a chain. Let ๐ข๐(๐ ) be defined as the set of all chains of ๐ . That both ๐ฬ and ๐ข๐(๐ ) are sets is left to Exercise 17(a). Define X : ๐ข๐(๐ ) → ๐ข๐(๐ ) by { ๐ ∪ {๐(๐ฬ โงต ๐ )} if ๐ฬ โงต ๐ ≠ ∅, X(๐ ) = ๐ if ๐ฬ โงต ๐ = ∅. Next, suppose that ๐โ0 is a chain of ๐ such thatโX(๐0 ) = ๐0 . By the assumption on ๐ , we have that ๐ โ0 ∈ ๐ . To prove that ๐0 is a maximal element of ๐ , take ๐ด ∈ ๐ such that ๐0 ⊂ ๐ด. Since ๐ด has an element that is not in any of the elements of ๐0 , the union ๐0 ∪ {๐ด} is a chain properly containing ๐0 , which is impossible. We conclude that the theorem is proved if ๐0 is shown to exist. Section 5.1 AXIOMS 233 To accomplish this, we begin with a definition. A subset ๐ฏ of ๐ข๐(๐ ) is called a tower when โ ∅∈๐ฏ, โ X(๐ ) ∈ ๐ฏ for all ๐ ∈ ๐ฏ , โ โ ๐ ∈ ๐ฏ for every chain ๐ ⊆ ๐ฏ . โ Define ๐ณ๐(๐ ) to be the set of all towers of ๐ . Observe that ๐ข๐(๐ ) and ๐ณ๐(๐ ) are both towers [Exercise 17(b)]. โ Take ๐ถ ∈ ๐ณ๐(๐ โ ) such that ๐ถ is comparable with respect to inclusion to every element of ๐ณ๐(๐ ). Such a set element of ๐ณ๐(๐ ) โ ๐ถ exists since ∅ is anโ and comparable to โ every element of ๐ณ๐(๐ ). Suppose ๐ด ∈ ๐ณ๐(๐ ) such that โ ๐ด ⊂ ๐ถ. Because ๐ณ๐(๐ ) is a tower, X(๐ด) ∈ ๐ณ๐(๐ ). If ๐ถ ⊂ X(๐ด), then X(๐ด) โงต ๐ด has at least two elements, which is impossible. Therefore, โ (5.2) if ๐ด ∈ ๐ณ๐(๐ ) and ๐ด ⊂ ๐ถ, then X(๐ด) ⊆ ๐ถ. Define ๐ = {๐ด ∈ โ ๐ณ๐(๐ ) : ๐ด ⊆ ๐ถ ∨ X(๐ถ) ⊆ ๐ด}. If ๐ด ⊆ ๐ถ, then ๐ด ⊆ X(๐ถ) since ๐ถ ⊆ X(๐ถ). Hence, every element of ๐ is comparable to X(๐ถ). Also, ๐ is a tower because of the following: โ โ ∅ ∈ ๐ because ∅ ∈ ๐ณ๐(๐ ) and ∅ ⊆ ๐ถ. โ โ โ Let ๐ต ∈ ๐ . Since ๐ณ๐(๐ ) is a tower, X(๐ต) ∈ ๐ณ๐(๐ ). If ๐ต ⊂ ๐ถ, then X(๐ต) ⊆ ๐ถ by (5.2). If ๐ถ ⊆ ๐ต, then X(๐ถ) ⊆ X(๐ต). In both cases, X(๐ต) ∈ ๐ . โ โ โ Let ๐ ⊆ ๐ be a chain. Then, ๐ ∈ ๐ณ๐(๐ ). Suppose there exists ๐ถ0 ∈ ๐ such that ๐ถ0 is not a subset of ๐ถ. This implies that X(๐ถ) ⊆ ๐ถ0 . โ โ Thus, X(๐ถ) ⊆ ๐ , so ๐ ∈ ๐ . โ โ โ Hence, ๐ = ๐ณ๐(๐ ) because ๐ ⊆ ๐ณ๐(๐ ).โ Therefore, ๐ณ๐(๐ ) is a chain because ๐ถ is arbitrary [Exercise 17(c)]. Since ๐ณ๐(๐ ) is a tower, โโ โ ๐ณ๐(๐ ) ∈ ๐ณ๐(๐ ) and then ) โ ๐ณ๐(๐ ) ∈ ๐ณ๐(๐ ). โโ โ Hence, since ๐ณ๐(๐ ) is an upper bound of ๐ณ๐(๐ ), (โ โ ) โโ X ๐ณ๐(๐ ) ⊆ ๐ณ๐(๐ ), X which yields X (โ โ (โ โ ) โโ ๐ณ๐(๐ ) = ๐ณ๐(๐ ). There are many equivalents to the axiom of choice (Rubin and Rubin 1985, Jech 1973). One of them is Zorn’s lemma. 234 Chapter 5 AXIOMATIC SET THEORY THEOREM 5.1.14 The axiom of choice is equivalent to Zorn’s lemma. PROOF Since we have already proved that the axiom of choice (5.1.10) implies Zorn’s lemma (Theorem 5.1.13), we only need to prove the converse. We must be careful to only use Axioms 5.1.2–5.1.9 and not Axiom 5.1.10. Assume Zorn’s lemma and let ๐ be a relation. Define ๐ = {๐ : ๐ ⊆ ๐ ∧ ๐ is a function}. Let ๐ be a chain of elements of ๐ . โ โ Take ๐ ∈ ๐ , so ๐ ∈ ๐ถ for some ๐ถ โ ∈ ๐ . This implies that ๐ถ is a subset of ๐ . Therefore, ๐ ∈ ๐ , proving that ๐ ⊆ ๐ . โ โ ๐ is a function by Exercise 4.4.13. โ We conclude that ๐ ∈ ๐ . Thus, by Zorn’s lemma, there exists a maximal element Φ ∈ ๐ . This means that Φ ⊆ ๐ and dom(Φ) ⊆ dom(๐ ). To prove that dom(๐ ) ⊆ dom(Φ), let (๐ฅ, ๐ฆ) ∈ ๐ and suppose that ๐ฅ ∉ dom(Φ). This implies that Φ ∪ {(๐ฅ, ๐ฆ)} ⊆ ๐ is a function. Hence, Φ ∪ {(๐ฅ, ๐ฆ)} ∈ ๐ , contradicting the maximality of Φ. We conclude that Φ is the desired function, and the axiom of choice follows (Exercise 15). There was a controversy regarding the axiom of choice when Zermelo first proposed it. Despite mathematicians having previously used it implicitly, some objected to its nonconstructive nature. The other axioms yield distinct results, but the axiom of choice results in a set with elements that are not clearly identified. Over time, however, most objections have faded. This is because the majority of mathematicians regard it as reasonable and generally those who question the axiom of choice realize that eliminating it would lead to serious problems because many proofs in various fields of mathematics rely on the axiom. Axiom of Regularity The ideas that led to the next axiom (also known as the axiom of foundation) can be found in Mirimanoff (1917), while the statement of the axiom is credited to Skolem (1922) and John von Neumann (1923). AXIOM 5.1.15 [Regularity] ∀๐ฅ(๐ฅ ≠ { } → ∃๐ฆ[๐ฆ ∈ ๐ฅ ∧ ¬∃๐ข(๐ข ∈ ๐ฆ ∧ ๐ข ∈ ๐ฅ)]). The main result of the regularity axiom is that it prevents sets from being elements of themselves. Suppose there exists a set ๐ด such that ๐ด ∈ ๐ด. Then, ๐ด ∩ {๐ด} ≠ ∅, but the regularity axiom implies that ๐ด ∩ {๐ด} should be empty. Section 5.1 AXIOMS 235 THEOREM 5.1.16 No set is an element of itself. If ๐ = {๐ฅ : ๐ฅ is a set} is a set, ๐ ∈ ๐ , contradicting Theorem 5.1.16. Thus, the theorem’s corollary quickly follows. COROLLARY 5.1.17 There is no set of all sets. Because of the regularity axiom (5.1.15), ๐ด ∉ ๐ด for all sets ๐ด. Therefore, {๐ฅ : ๐ฅ ∉ ๐ฅ} is not a set by, which prevents Russell’s paradox from being deduced from the axioms, provided that they are consistent (Theorem 1.5.2). The axiom of regularity is the final axiom of our chosen collection of axioms. It is believed that they do not prove any contradictions, which implies that the axioms prevent the construction of {๐ฅ : ๐ฅ ∉ ๐ฅ} as a set. Therefore, we write the following definition. The collections of axioms are named after the mathematician who was primarily responsible for their selection (Zermelo 1908). DEFINITION 5.1.18 โ Axioms 5.1.2–5.1.8 are the Zermelo axioms. This collection of sentences is denoted by ๐ญ. โ Axioms 5.1.2–5.1.8 combined with replacement and regularity (Axioms 5.1.9 and 5.1.15) are the Zermelo–Fraenkel axioms, denoted by ๐ญ๐. โ The Zermelo–Fraenkel axioms with the axiom of choice is denoted by ๐ญ๐๐. The nonempty sets that follow from ๐ญ๐๐ have the property that all of their elements are sets. Sets with this property are called hereditary or pure. Assuming ๐ญ๐๐ does not prevent us from working with different types of sets, such as sets of symbols or formulas, but we must remember that such nonhereditary sets are not products of ๐ญ๐๐, so they must be handled with care because we do not want to fall into a paradox. Exercises 1. Let ๐ฒ be a set of theory symbols. Let ๐1 , ๐2 , ๐3 , ๐4 ∈ ๐ฒ be constant symbols and ๐ ∈ ๐ฒ be a binary function symbol. Suppose that ๐(๐ฅ, ๐ฆ) is an ๐ฒ-formula. Use the Equality Axioms (5.1.1) to prove the following. (a) โข ๐1 = ๐1 (b) โข ๐1 = ๐2 ∧ ๐2 = ๐3 → ๐1 = ๐3 (c) โข ๐1 = ๐2 ∧ ๐3 = ๐4 → ๐ (๐1 , ๐3 ) = ๐ (๐2 , ๐4 ) (d) โข ๐1 = ๐2 ∧ ๐3 = ๐4 → [๐(๐ (๐1 ), ๐3 ) ↔ ๐(๐ (๐2 ), ๐4 )] 2. Prove ∀๐ฅ∀๐ฆ(๐ฅ = ๐ฆ → ∀๐ข[๐ข ∈ ๐ฅ ↔ ๐ข ∈ ๐ฆ]). 3. For any ๐ ∈ ๐ with ๐ > 0, prove that there exists a set of the form {๐0 , ๐1 , … , ๐๐−1 }. โโ 4. Let โฑ be a family of sets. Prove that P( โฑ ) is a set. 236 Chapter 5 AXIOMATIC SET THEORY 5. Use a subset axiom (5.1.8) to prove that there is no set of all sets. This proves that Russell’s paradox does not follow from the axiom of ๐ญ. 6. Prove that there is no set that has every singleton as an element. 7. Let ๐ด and ๐ต be sets. Define the symmetric difference of ๐ด and ๐ต by ๐ด M ๐ต = ๐ด โงต ๐ต ∪ ๐ต โงต ๐ด. Prove that ๐ด M ๐ต is a set. 8. Prove the given equations for all sets ๐ด, ๐ต, and ๐ถ. (a) ๐ด M ∅ = ๐ด (b) ๐ด M ๐ = ๐ด (c) ๐ด M ๐ต = ๐ต M ๐ด (d) ๐ด M (๐ต M ๐ถ) = (๐ด M ๐ต) M ๐ถ (e) ๐ด M ๐ต = (๐ด ∪ ๐ต) โงต (๐ต ∪ ๐ด) (f) (๐ด M ๐ต) ∩ ๐ถ = (๐ด ∩ ๐ถ) M (๐ต ∩ ๐ถ) โ โ 9. Given sets ๐ผ and ๐ด๐ for all ๐ ∈ ๐ผ, show that ๐∈๐ผ ๐ด๐ and ๐∈๐ผ ๐ด๐ are sets. 10. Prove that ๐ด0 × ๐ด1 × · · · × ๐ด๐−1 is a set if ๐ด0 , ๐ด1 , … , ๐ด๐−1 are sets. 11. Show that the Cartesian product of a nonempty family of nonempty sets is not empty. 12. Write the Axiom of Choice (5.1.10) as an ๐ฒ๐ณ-formula. 13. Demonstrate that a selector is a function. 14. In the proof of Corollary 5.1.11, prove that {{๐} × [๐]๐ : ๐ ∈ ๐ด} is a set. 15. Prove that Corollary 5.1.11 implies Axiom 5.1.10. 16. Let ๐ = ๐ × {0, 1}. Find a function ๐น : ๐ → {0, 1} such that ๐น (๐) = 0 for all ๐ ∈ ๐. 17. Prove the following parts of the proof of Zorn’s lemma (5.1.13). (a) ๐ฬ and ๐ข๐(๐ ) are sets. (b) ๐ข๐(๐ ) and ∩๐ณ๐(๐ ) are towers. โ (c) ๐ณ๐(๐ ) is a chain. 18. Prove that ๐ญ๐ and Zorn’s lemma imply the axiom of choice. 19. Use Zorn’s lemma to prove that for every function ๐ : ๐ด → ๐ต, there exists a maximal ๐ถ ⊆ ๐ด such that ๐ ๐ถ is one-to-one. โ 20. Prove that if ๐ is a collection of sets such that ๐ ∈ ๐ for every chain ๐ ⊆ ๐ , then ๐ contains a maximal element. 21. Let (๐ด, ๐ ) be a partial order. Prove that there exists an order ๐ on ๐ด such that ๐ ⊆ ๐ and ๐ is linear. Section 5.2 NATURAL NUMBERS 237 22. Use the regularity axiom (5.1.15) to prove that if {๐, {๐, ๐}} = {๐, {๐, ๐}, then ๐ = ๐ and ๐ = ๐. This gives an alternative to Kuratowski’s definition of an ordered pair (Definition 3.2.8). 23. Prove that there does not exist an infinite sequence of sets ๐ด0 , ๐ด1 , ๐ด2 , … such that ๐ด๐+1 ∈ ๐ด๐ for all ๐ = 0, 1, 2, … . 24. Prove that the empty set axiom (5.1.2) can be proved from the other axioms of ๐ญ๐๐. 25. Use Axiom 5.1.9 to prove that the subset axioms (5.1.8) can be proved from the other axioms of ๐ญ๐๐. 26. Let (๐ด, ๐ ) be a poset. Prove that every chain of ๐ด is contained in a maximal chain with respect to ⊆. This is called the Hausdorff maximal princple. 27. Prove that the Hausdorff maximal principle implies Zorn’s lemma, so is equivalent to the axiom of choice. 5.2 NATURAL NUMBERS In order to study mathematics itself, as opposed to studying the contents of mathematics as when we study calculus or Euclidean geometry, we need to first develop a system in which all of mathematics can be interpreted. In such a system, we should be able to precisely define mathematical concepts, like functions and relations; construct examples of them; and write statements about them using a very precise language. ๐ญ๐๐ with first-order logic seems like a natural choice for such an endeavor. However, in order to be a success, this system must have the ability to represent the most basic objects of mathematical study. Namely, it must be able to model numbers. Therefore, within ๐ญ๐๐ families of sets that copy the properties of ๐, ๐, ๐, ๐, and ๐ will be constructed. Since we can do this, we conclude that ๐, ๐, ๐, ๐, and ๐ are themselves sets, and we also conclude that what we discover about their analogs are true about them. We begin with the natural numbers. DEFINITION 5.2.1 For every set ๐, the successor of ๐ is the set ๐+ = ๐ ∪ {๐}. If ๐ is the successor of ๐, then ๐ is the predecessor of ๐ and we write ๐ = ๐− . For example, we have that {3, 5, 7}+ = {3, 5, 7, {3, 5, 7}} and ∅+ = {∅}. For convenience, write ๐++ for (๐+ )+ , so ∅++ = {∅, {∅}}. Also, the predecessor of the set {∅, {∅}, {∅, {∅}}} is {∅, {∅}}, so write {∅, {∅}, {∅, {∅}}}− = {∅, {∅}}. Furthermore, since ∅ contains no elements, we have the following. THEOREM 5.2.2 ∅ does not have a predecessor. 238 Chapter 5 AXIOMATIC SET THEORY Although every set has a successor, we are primarily concerned with certain sets that have a particular property. DEFINITION 5.2.3 The set ๐ด is inductive if ∅ ∈ ๐ด and ๐+ ∈ ๐ด for all ๐ ∈ ๐ด. Definition 5.2.3 implies that if ๐ด is inductive, ๐ด contains the sets ∅, {∅}, {∅, {∅}}, {∅, {∅}, {∅, {∅}}}, … because ∅+ {∅}+ {∅, {∅}}+ {∅, {∅}, {∅, {∅}}}+ = {∅}, = {∅, {∅}}, = {∅, {∅}, {∅, {∅}}}, = {∅, {∅}, {∅, {∅}}, {∅, {∅}, {∅, {∅}}}}, โฎ Of course, Definition 5.2.3 does not guarantee the existence of an inductive set. The infinity axiom (5.1.3) does that. Then, by a subset axiom (5.1.8), ∀๐ฅ∃๐ฆ∀๐ข(๐ข ∈ ๐ฆ ↔ ๐ข ∈ ๐ฅ ∧ ๐ฅ is inductive ∧ ∀๐ค[๐ค is inductive → ๐ข ∈ ๐ค]), where w is inductive can be written as ∅ ∈ ๐ค ∧ ∀๐(๐ ∈ ๐ค → ๐+ ∈ ๐ค). Therefore, the collection that contains the elements that are common to all inductive sets is a set, and we can make the next definition (von Neumann 1923). DEFINITION 5.2.4 An element that is a member of every inductive set is called a natural number. Let ๐ denote the set of natural numbers. That is, ๐ = {∅, {∅}, {∅, {∅}}, {∅, {∅}, {∅, {∅}}}, … }. Definition 5.2.4 suggests that the elements of ๐ will be interpreted to represent the elements of ๐, so represent each natural number with the appropriate element of ๐: 0 = ∅, 1 = {∅}, 2 = {∅, {∅}}, 3 = {∅, {∅}, {∅, {∅}}}, 4 = {∅, {∅}, {∅, {∅}}, {∅, {∅}, {∅, {∅}}}}, โฎ Section 5.2 NATURAL NUMBERS 239 Although the choice of which sets should represent the numbers of ๐ is arbitrary, our choice does have some fortunate properties. For example, the number of elements in each natural number and the number of ๐ that it represents are the same. Moreover, notice that each natural number can be written as 0 = { }, 1 = {0}, 2 = {0, 1}, 3 = {0, 1, 2}, 4 = {0, 1, 2, 3}, โฎ and also note that โ โ โ โ 1 = 0, 2 = 1, 3 = 2, 4 = 3, โฎ The empty set is an element of ๐ by definition, so suppose ๐ ∈ ๐ and take ๐ด to be an inductive set. Since ๐ is a natural number, ๐ ∈ ๐ด, and since ๐ด is inductive, ๐+ ∈ ๐ด. Because ๐ด is arbitrary, we conclude that ๐+ is an element of every inductive set, so ๐+ ∈ ๐ (Definition 5.2.4). This proves the next theorem. THEOREM 5.2.5 ๐ is inductive. The proof of the next theorem is left to Exercise 5. THEOREM 5.2.6 If ๐ด is an inductive set and ๐ด ⊆ ๐, then ๐ด = ๐. Order Because we want to interpret ๐ so that it represents ๐, we should be able to order ๐ as ๐ is ordered by ≤. That is, we need to find a partial order on ๐ that is also a well-order. To do this, we start with a definition. DEFINITION 5.2.7 A set ๐ด is transitive means for all ๐ and ๐, if ๐ ∈ ๐ and ๐ ∈ ๐ด, then ๐ ∈ ๐ด. Observe that ∅ is transitive because ๐ ∈ ∅ is always false. More transitive sets are found in the next theorem. 240 Chapter 5 AXIOMATIC SET THEORY THEOREM 5.2.8 โ Every natural number is transitive. โ ๐ is transitive. PROOF The proof of the second part is left to Exercise 4. We prove the first by defining ๐ด = {๐ ∈ ๐ : ๐ is transitive}. We have already noted that 0 ∈ ๐ด, so assume that ๐ ∈ ๐ด and let ๐ ∈ ๐ and ๐ ∈ ๐ ∪ {๐}. If ๐ ∈ {๐}, then ๐ ∈ ๐. If ๐ ∈ ๐, then by hypothesis, ๐ ∈ ๐. In either case, ๐ ∈ ๐+ since ๐ ⊆ ๐+ , so ๐+ ∈ ๐ด. Hence, ๐ด is inductive, so ๐ด = ๐ by Theorem 5.2.6. In addition to being transitive, each element of ๐ has another useful property that provides another reason why their choice was a wise one. LEMMA 5.2.9 If ๐, ๐ ∈ ๐, then ๐ ⊂ ๐ if and only if ๐ ∈ ๐. PROOF Take ๐, ๐ ∈ ๐. If ๐ ∈ ๐, then ๐ ⊂ ๐ since ๐ is transitive (Exercise 3). To prove the converse, define ๐ด = {๐ ∈ ๐ : ∀๐(๐ ∈ ๐ ∧ [๐ ⊂ ๐ → ๐ ∈ ๐])}. We show that ๐ด is inductive. Trivially, 0 ∈ ๐ด. Now take ๐ ∈ ๐ด and let ๐ ∈ ๐ such that ๐ ⊂ ๐+ . We have two cases to consider. โ Suppose ๐ ∉ ๐. Then, ๐ ⊆ ๐. If ๐ ⊂ ๐, then ๐ ∈ ๐ by hypothesis, which implies that ๐ ∈ ๐+ . If ๐ = ๐, then ๐ ∈ ๐+ . โ Next assume that ๐ ∈ ๐. Take ๐ฅ ∈ ๐. Since ๐ is transitive by Theorem 5.2.8, we have that ๐ฅ ∈ ๐. This implies that ๐ ∪ {๐} ⊆ ๐, but this is impossible since ๐ ⊂ ๐+ , so ๐ ∉ ๐. For example, we see that 2 ∈ 3 and 2 ⊂ 3 because {∅, {∅}} ∈ {∅, {∅}, {∅, {∅}}} and {∅, {∅}} ⊂ {∅, {∅}, {∅, {∅}}}. Now use Lemma 5.2.9 to define an order on ๐. Instead of using 4, we use ≤ to copy the order on ๐. Section 5.2 NATURAL NUMBERS 241 DEFINITION 5.2.10 For all ๐, ๐ ∈ ๐, let ๐ ≤ ๐ if and only if ๐ ⊆ ๐. Define ๐ < ๐ to mean ๐ ≤ ๐ but ๐ ≠ ๐. Lemma 5.2.9 in conjunction with Definition 5.2.10 implies that for all ๐, ๐ ∈ ๐, we have that ๐ < ๐ if and only if ๐ ⊂ ๐ if and only if ๐ ∈ ๐. The order of Definition 5.2.10 makes ๐ a chain with ∅ as its least element. THEOREM 5.2.11 (๐, ≤) is a linear order. PROOF That (๐, ≤) is a poset follows as in Example 4.3.9. To show that ๐ is a chain under ≤, define ๐ด = {๐ ∈ ๐ : ∀๐(๐ ∈ ๐ ∧ [๐ ≤ ๐ ∨ ๐ ≤ ๐])}. We prove that ๐ด is inductive. โ Since ∅ is a subset of every set, 0 ∈ ๐ด. โ Suppose that ๐ ∈ ๐ด and let ๐ ∈ ๐. We have two cases to check. First, assume that ๐ ≤ ๐. If ๐ < ๐, then ๐+ ≤ ๐, while if ๐ = ๐, then ๐ < ๐+ . Now suppose that ๐ ≤ ๐, but this implies that ๐ < ๐+ . In either case, ๐+ ∈ ๐ด. We can now quickly prove the following. COROLLARY 5.2.12 For all ๐, ๐ ∈ ๐, if ๐+ = ๐+ , then ๐ = ๐. PROOF Let ๐ and ๐ be natural numbers and assume that ๐ ∪ {๐} = ๐ ∪ {๐}. (5.3) Take ๐ฅ ∈ ๐. Then, ๐ฅ ∈ ๐ or ๐ฅ = ๐. If ๐ฅ ∈ ๐, then ๐ ⊆ ๐, so suppose that ๐ฅ = ๐. This implies that ๐ ∈ ๐, so ๐ ≠ ๐ by Theorem 5.1.16. However, we then have ๐ ∈ ๐ by (5.3), which contradicts the trichotomy law (Theorem 4.3.21). Similarly, we can prove that ๐ ⊆ ๐. Since the successor defines a function ๐ : ๐ → ๐ where ๐(๐) = ๐+ , Corollary 5.2.12 implies that ๐ is one-to-one. Thereom 5.2.11 shows that (๐, ≤) is a linear order as (๐, ≤) is a linear order. The next theorem shows that the similarity goes further. 242 Chapter 5 AXIOMATIC SET THEORY THEOREM 5.2.13 (๐, ≤) is a well-ordered set. PROOF By Theorem 5.2.11, (๐, ≤) is a linear order. To prove that it is well-ordered, suppose that ๐ด ⊆ ๐ such that ๐ด does not have a least element. Define ๐ต = {๐ ∈ ๐ : {0, 1, … , ๐} ∩ ๐ด = ∅}. We prove that ๐ต is inductive. โ If 0 ∈ ๐ด, then ๐ด has a least element because 0 is the least element of ๐. Hence, {0} ∩ ๐ด = ∅, so 0 ∈ ๐ต. โ Let ๐ ∈ ๐ต. This implies that {0, 1, … , ๐} ∩ ๐ด is empty. Thus, ๐+ cannot be an element of ๐ด for then it would be the least element of ๐ด. Hence, {0, 1, … , ๐, ๐+ } ∩ ๐ด = ∅ proving that ๐+ ∈ ๐ต. Therefore, ๐ต = ๐, which implies that ๐ด is empty. Recursion The familiar factorial is defined recursively as 0! = 1, (๐ + 1)! = (๐ + 1)๐! (๐ ∈ ๐). (5.4) (5.5) A recursive definition is one that is given in terms of itself. This is illustrated in (5.5) where the factorial is defined using the factorial. It appears that the factorial is a function ๐ → ๐, but a function is a set, so why do (5.4) and (5.5) define a set? Such a definition is not found among the axioms of Section 5.1 or the methods of Section 3.1. That they do define a function requires an important theorem. THEOREM 5.2.14 [Recursion] Let ๐ด be a set and ๐ ∈ ๐ด. If ๐ is a function ๐ด → ๐ด, there exists a unique function ๐ : ๐ → ๐ด such that โ ๐ (0) = ๐, โ ๐ (๐+ ) = ๐(๐ (๐)) for all ๐ ∈ ๐. PROOF Let ๐ : ๐ด → ๐ด be a function. Define โฑ = {โ : โ ⊆ ๐ × ๐ด ∧ (0, ๐) ∈ โ ∧ ∀๐∀๐ฆ[(๐, ๐ฆ) ∈ โ → (๐+ , ๐(๐ฆ)) ∈ โ])}. (5.6) Note that โฑ is a set by a subset axiom (5.1.8) and โฑ is nonempty because ๐×๐ด ∈ โฑ . Let โ ๐= โฑ. Section 5.2 NATURAL NUMBERS 243 Observe that ๐ ∈ โฑ (Exercise 8). Define ๐ท = {๐ ∈ ๐ : ∃๐ง[(๐, ๐ง) ∈ ๐ ] ∧ ∀๐ฆ∀๐ฆ′ [(๐, ๐ฆ) ∈ ๐ ∧ (๐, ๐ฆ′ ) ∈ ๐ → ๐ฆ = ๐ฆ′ ]}. We prove that ๐ท is inductive. โ Since โฑ ≠ ∅, we know that (0, ๐) ∈ ๐ by (5.6), so let (0, ๐) also be an element of ๐ . If we assume that ๐ ≠ ๐, then โ ๐ โงต {(0, ๐)} ∈ โฑ , which is impossible because it implies that (0, ๐) ∉ โฑ . Hence, 0 ∈ ๐ท. โ Suppose that ๐ ∈ ๐ท. This means that (๐, ๐ง) ∈ ๐ for some ๐ง ∈ ๐ด. Thus, (๐+ , ๐(๐ง)) ∈ ๐ by (5.6). Assume that (๐+ , ๐ฆ) ∈ ๐ . If ๐ฆ ≠ ๐(๐ง),โthen ๐ โงต {(๐+ , ๐ฆ)} ∈ โฑ , which again leads to the contradictory (๐+ , ๐ฆ) ∉ โฑ . Hence, ๐ is a function ๐ → ๐ด (Theorem 5.2.6). We confirm that ๐ has the desired properties. โ โ ๐ (0) = ๐ because (0, ๐) ∈ โฑ . โ Take ๐ ∈ ๐ and write ๐ฆ = ๐ (๐). This implies that (๐, ๐ฆ) ∈ ๐ , so we have that (๐+ , ๐(๐ฆ)) ∈ ๐ . Therefore, ๐ (๐+ ) = ๐(๐ฆ) = ๐(๐ (๐)). To prove that ๐ is unique, let ๐ ′ : ๐ → ๐ด be a function such that ๐ ′ (0) = ๐ and ๐ ′ (๐+ ) = ๐(๐ ′ (๐)) for all ๐ ∈ ๐. Let ๐ธ = {๐ ∈ ๐ : ๐ (๐) = ๐ ′ (๐)}. The set ๐ธ is inductive because ๐ (0) = ๐ = ๐ ′ (0) and assuming ๐ ∈ ๐ we have that ๐ (๐+ ) = ๐(๐ (๐)) = ๐(๐ ′ (๐)) = ๐ ′ (๐+ ). The factorial function has domain ๐ but the recursion theorem (5.2.14) gives a function with domain ๐ and uses the successor of Definition 5.2.1. We need a connection between ๐ and ๐. We do this by defining two operations on ๐ that we designate by + and ⋅ and then showing that the basic properties of ๐ under these two operations are the same as the basic properties of ๐ under standard addition and multiplication. Arithmetic We begin with addition. Let ๐ : ๐ → ๐ be defined by ๐(๐) = ๐+ . For every ๐ ∈ ๐, by Theorem 5.2.14, there exists a unique function ๐๐ : ๐ → ๐ such that ๐๐ (0) = ๐ and for all ๐ ∈ ๐, Define ๐๐ (๐+ ) = ๐(๐๐ (๐)) = [๐๐ (๐)]+ . ๐ = {((๐, ๐), ๐๐ (๐)) : ๐, ๐ ∈ ๐}. Since ๐๐ is a function for every ๐ ∈ ๐, the set ๐ is a binary operation (Definition 4.4.26). Observe that for all ๐, ๐ ∈ ๐, 244 Chapter 5 AXIOMATIC SET THEORY โ ๐(๐, 0) = ๐, โ ๐(๐, ๐+ ) = ๐(๐, ๐)+ . We know that for every ๐, ๐ ∈ ๐, we have that ๐ + 0 = ๐ and to add ๐ + ๐, one simply adds 1 a total of ๐ times to ๐. This is essentially what ๐ does to the natural numbers. For example, for 1, 3, 4 ∈ ๐, 1 + 3 = ([(1 + 1) + 1] + 1) = 4 and for 1, 2, 3, 4 ∈ ๐ (page 238), ๐(1, 3) = ๐(1, 2+ ) = ๐(1, 2)+ = ๐(1, 1+ )+ = ๐(1, 1)++ = ๐(1, 0+ )++ = ๐(1, 0)+++ = 1+++ = 2++ = 3+ = 4. Therefore, we choose ๐ to be addition on ๐. To define the addition, we only need to cite the two properties given in Theorem 5.2.14. Since each ๐๐ is unique, there is no other function to which the definition could be referring. Therefore, we can define addition recursively. DEFINITION 5.2.15 For all ๐, ๐ ∈ ๐, โ ๐ + 0 = ๐, โ ๐ + ๐+ = (๐ + ๐)+ . Notice that ๐+ = (๐ + 0)+ = ๐ + 0+ = ๐ + 1. Furthermore, using the notation from page 238, we conclude that 1 + 3 = 4 because ๐(1, 3) = 4. The following lemma shows that the addition given in Definition 5.2.15 has a property similar to that of commutativity. LEMMA 5.2.16 For all ๐, ๐ ∈ ๐, โ 0 + ๐ = ๐. โ ๐+ + ๐ = (๐ + ๐)+ . Section 5.2 NATURAL NUMBERS 245 PROOF Let ๐, ๐ ∈ ๐. To prove that 0 + ๐ = ๐, we show that ๐ด = {๐ ∈ ๐ : 0 + ๐ = ๐} is inductive. โ 0 ∈ ๐ด because 0 + 0 = 0. โ Let ๐ ∈ ๐ด. Then, 0 + ๐+ = (0 + ๐)+ = ๐+ , so ๐+ ∈ ๐ด. To prove that ๐+ + ๐ = (๐ + ๐)+ , we show that ๐ต = {๐ ∈ ๐ : ๐+ + ๐ = (๐ + ๐)+ } is inductive. โ Again, 0 ∈ ๐ต because 0 + 0 = 0. โ Suppose that ๐ ∈ ๐ต. We have that ๐+ + ๐+ = (๐+ + ๐)+ = (๐ + ๐)++ = (๐ + ๐+ )+ , where the first and third equality follow by Definition 5.2.15 and the second follows because ๐ ∈ ๐ต. Thus, ๐+ ∈ ๐ต. Now to see that + behaves on ๐ as + behaves on ๐, we use Definition 4.4.31. THEOREM 5.2.17 โ The binary operation + on ๐ is associative and commutative. โ 0 is the additive identity for ๐. PROOF 0 is the additive identity by Definition 5.2.15 and Lemma 5.2.16, and that + is associative on ๐ is Exercise 14. To show that + is commutative, let ๐ ∈ ๐ and define ๐ด = {๐ ∈ ๐ : ๐ + ๐ = ๐ + ๐}. As has been our strategy, we show that ๐ด is inductive. โ ๐ + 0 = ๐ by Definition 5.2.15, and 0 + ๐ = ๐ by Lemma 5.2.16, so 0 ∈ ๐ด. โ Let ๐ ∈ ๐ด. Therefore, ๐ + ๐ = ๐ + ๐, which implies that ๐ + ๐+ = (๐ + ๐)+ = (๐ + ๐)+ = ๐+ + ๐. Hence, ๐+ ∈ ๐ด. Multiplication on ๐ can be viewed as iterated addition. For example, 3 ⋅ 4 = 3 + 3 + 3 + 3, so we define multiplication recursively along these lines. As with addition, the result is a binary operation by Theorem 5.2.14. 246 Chapter 5 AXIOMATIC SET THEORY DEFINITION 5.2.18 For all ๐, ๐ ∈ ๐, โ ๐ ⋅ 0 = 0, โ ๐ ⋅ ๐+ = ๐ ⋅ ๐ + ๐. For example, 3 ⋅ 4 = 12 because 3⋅4=3⋅3+3 = (3 ⋅ 2 + 3) + 3 = ([3 ⋅ 1 + 3] + 3) + 3 = ([(3 ⋅ 0 + 3) + 3] + 3) + 3 = ([3 + 3] + 3) + 3 = 12, where ([3 + 3] + 3) + 3 = 12 is left to Exercise 13. The next result is analogous to Lemma 5.2.16. Its proof is left to Exercise 9. LEMMA 5.2.19 For all ๐, ๐ ∈ ๐, โ 0 ⋅ ๐ = 0, โ ๐+ ⋅ ๐ = ๐ ⋅ ๐ + ๐. We now prove that ⋅ on ๐ behaves as ⋅ on ๐ and that + and ⋅ on ๐ interact with each other via the distributive law as the two operations on ๐. For the proof, we introduce two common conventions for these two operations. โ So that we lessen the use of parentheses, define ⋅ to have precedence over +, and read from left to right. That is, ๐ ⋅ ๐ + ๐ = (๐ ⋅ ๐) + ๐ and ๐ + ๐ ⋅ ๐ = ๐ + (๐ ⋅ ๐), and ๐ + ๐ + ๐ = (๐ + ๐) + ๐ and ๐ ⋅ ๐ ⋅ ๐ = (๐ ⋅ ๐) ⋅ ๐. โ Define ๐๐ = ๐ ⋅ ๐. THEOREM 5.2.20 โ The binary operation ⋅ on ๐ is associative and commutative. โ 1 is the multiplicative identity for ๐. โ The distributive law holds for ๐. This means that for all ๐, ๐, ๐ ∈ ๐, ๐(๐ + ๐) = ๐๐ + ๐๐. Section 5.2 NATURAL NUMBERS 247 PROOF That the associative and commutative properties hold is Exercise 12. To prove the other parts of the theorem, let ๐, ๐, ๐ ∈ ๐. Since 0+ = 1, by Definition 5.2.18 and Lemma 5.2.16, ๐1 = ๐0+ = ๐0 + ๐ = 0 + ๐ = ๐, and by Lemma 5.2.19 and Definition 5.2.15, we have that 1๐ = ๐. To show that the distributive law holds, define ๐ด = {๐ ∈ ๐ : ๐(๐ + ๐) = ๐๐ + ๐๐}. Since 0(๐ + ๐) = 0 and 0๐ + 0๐ = 0 + 0 = 0, we have that 0 ∈ ๐ด. Now suppose that ๐ ∈ ๐ด. That is, ๐(๐ + ๐) = ๐๐ + ๐๐. Therefore, ๐+ ∈ ๐ด because ๐+ (๐ + ๐) = ๐(๐ + ๐) + (๐ + ๐) = (๐๐ + ๐๐) + (๐ + ๐) = (๐๐ + ๐) + (๐๐ + ๐) = ๐+ ๐ + ๐+ ๐. EXAMPLE 5.2.21 We revisit the factorial function. Since addition and multiplication are now defined on ๐, let ๐ : ๐ × ๐ → ๐ × ๐ be the function ๐(๐, ๐) = (๐ + 1, (๐ + 1)๐). Theorem 5.2.14 implies that there exists a unique function ๐ : ๐ → ๐ × ๐ such that ๐ (0) = (0, 1) and ๐ (๐ + 1) = ๐(๐ (๐)). Let ๐ be the projection map ๐(๐, ๐) = ๐. By Theorem 5.2.6 and Exercise 10, we conclude that for all ๐ ∈ ๐, ๐ (๐ + 1) = (๐ + 1, (๐ + 1)(๐ โฆ ๐ )(๐)). Therefore, the factorial function ๐!, which is defined recursively by 0! = 1, (๐ + 1)! = (๐ + 1)๐! (๐ ∈ ๐), is ๐ โฆ ๐ by the uniqueness of ๐ . (5.7) 248 Chapter 5 AXIOMATIC SET THEORY To solve an equation like 6 + 2๐ฅ = 14 where the coefficients are from ๐, we can use the cancellation law and write 6 + 2๐ฅ = 14, 6 + 2๐ฅ = 6 + 8, 2๐ฅ = 8, 2๐ฅ = 2 ⋅ 4, ๐ฅ = 4. For equations with coefficients in ๐, we need a similar law. THEOREM 5.2.22 [Cancellation] Let ๐, ๐, ๐ ∈ ๐. โ If ๐ + ๐ = ๐ + ๐, then ๐ = ๐. โ If ๐๐ = ๐๐ and ๐ ≠ 0, then ๐ = ๐. PROOF The proof for multiplication is Exercise 15. For addition, define ๐ด = {๐ ∈ ๐ : ∀๐∀๐(๐ ∈ ๐ ∧ ๐ ∈ ๐ ∧ [๐ + ๐ = ๐ + ๐ → ๐ = ๐])}. Clearly, 0 ∈ ๐ด, so assume ๐ ∈ ๐ด. To prove that ๐+ ∈ ๐ด, let ๐, ๐ ∈ ๐ and suppose that ๐+ + ๐ = ๐+ + ๐. Then, (๐ + ๐)+ = (๐ + ๐)+ by Lemma 5.2.16, so ๐ + ๐ = ๐ + ๐ by Corollary 5.2.12. Hence, ๐ = ๐ because ๐ ∈ ๐ด. Exercises 1. For every set ๐ด, show that ๐ด+ is a set. 2. For every ๐ ∈ ๐, show that ๐+++++ = ๐ + 5. 3. Show that the following are equivalent. โ ๐ด is transitive. โ If ๐ ∈ ๐ด, then ๐ ⊂ ๐ด. โ ๐ด ⊆ P(๐ด). โ โ ๐ด ⊆ ๐ด. โ + โ (๐ด ) = ๐ด. 4. Prove that ๐ is transitive. 5. Prove Theorem 5.2.6. 6. Let ๐ด ⊆ ๐. Show that if โ ๐ด = ๐ด, then ๐ด = ๐. 7. Take ๐ข, ๐ฃ, ๐ฅ, ๐ฆ ∈ ๐ and assume that ๐ข + ๐ฅ = ๐ฃ + ๐ฆ. Prove that ๐ข ∈ ๐ฃ if and only if ๐ฆ ∈ ๐ฅ. Section 5.3 INTEGERS AND RATIONAL NUMBERS 249 8. Prove that ๐ ∈ โฑ in the proof of Theorem 5.2.14. 9. Prove Lemma 5.2.19. 10. Prove (5.7) from Example 5.2.21. 11. Let ๐ด be a set and ๐ : ๐ด → ๐ด be a one-to-one function. Take ๐ ∈ ๐ด โงต ran(๐). Recursively define ๐ : ๐ → ๐ด such that ๐ (0) = ๐, ๐ (๐+ ) = ๐(๐ (๐)). Prove that ๐ is one-to-one. 12. Let ๐, ๐, ๐ ∈ ๐. Prove the given equations. (a) ๐๐ = ๐๐. (b) ๐(๐ + ๐) = ๐๐ + ๐๐. 13. Show that ([3 + 3] + 3) + 3 = 12. 14. Prove that addition on ๐ is associative. 15. For all ๐, ๐, ๐ ∈ ๐ with ๐ ≠ 0, prove that if ๐๐ = ๐๐, then ๐ = ๐. 16. Show that for all ๐, ๐ ∈ ๐, if ๐๐ = 0, then ๐ = 0 or ๐ = 0. 17. We define exponentiation on ๐. For all ๐ ∈ ๐, ๐0 = 1, + ๐๐ = ๐๐ ⋅ ๐. (a) Use the recursion (Theorem 5.2.14) to prove that this defines a function ๐ × ๐ → ๐. (b) Show that exponentiation on ๐ is one-to-one. 18. Let ๐ฅ, ๐ฆ, ๐ฅ ∈ ๐. Use the definition of Exercise 17 to prove the given equations. (a) ๐ฅ๐ฆ+๐ง = ๐ฅ๐ฆ ๐ฅ๐ง . (b) (๐ฅ๐ฆ)๐ง = ๐ฅ๐ง ๐ฆ๐ง . (c) (๐ฅ๐ฆ )๐ง = ๐ฅ๐ฆ๐ง . 19. Let ๐, ๐, ๐ ∈ ๐. Assume that ๐ ≤ ๐ and 0 ≤ ๐. Demonstrate the following. (a) ๐ + ๐ ≤ ๐ + ๐. (b) ๐๐ ≤ ๐๐. (c) ๐๐ ≤ ๐๐ . 5.3 INTEGERS AND RATIONAL NUMBERS Now that we have defined within set theory the set of natural numbers and confirmed its basic properties, we wish to continue this with other sets of numbers. We begin with the integers. 250 Chapter 5 AXIOMATIC SET THEORY Integers We build the integers using the natural numbers. The problem is how to define the negative integers. We need to decide how to represent the adjoining of the negative sign to a natural number. One option that might work is to use ordered pairs. These are always good options when extra information needs to be included with each element of a set. For example, (4, 0) could represent 4 because 4 − 0 = 4, and (0, 4) could represent −4 because 0 − 4 = −4. However, this is a problem because we did not define subtraction on ๐, so we need another solution. Our decision is to generalize this idea of subtracting coordinates to the set ๐ × ๐, but use addition to do it. Since there are infinitely many pairs (๐, ๐) such that ๐ − ๐ = 4, we equate them by using the equivalence relation of Exercise 4.2.2. DEFINITION 5.3.1 Let ๐ be the equivalence relation on ๐ × ๐ defined by (๐, ๐) ๐ (๐′ , ๐′ ) if and only if ๐ + ๐′ = ๐′ + ๐. Define โค = (๐ × ๐)โ๐ to be the set of integers. Using Definition 5.3.1 we associate elements of ๐ with elements of โค: ๐ โฎ −2 −1 0 1 2 โฎ โค โฎ [(0, 2)] [(0, 1)] [(0, 0)] [(1, 0)] [(2, 0)] โฎ Notice that because 0 and 2 are elements of ๐, the equivalence class [(0, 2)] is the name for [(∅, {∅, {∅}})]. Also, [(0, 2)] = [(1, 3)] = [(2, 4)] = · · · . The ordering of โค is defined in terms of the ordering on ๐. Be careful to note that the symbol ≤ will represent two different orders, one on โค and one on ๐ (Definition 5.2.10). This overuse of the symbol will not lead to confusion because the order will be clear from context. For example, in the next definition, the first ≤ is the order on โค, and the second ≤ is the order on ๐. DEFINITION 5.3.2 For all [(๐, ๐)], [(๐′ , ๐′ )] ∈ โค, define [(๐, ๐)] ≤ [(๐′ , ๐′ )] if and only if ๐ + ๐′ ≤ ๐′ + ๐ and Section 5.3 INTEGERS AND RATIONAL NUMBERS 251 [(๐, ๐)] < [(๐′ , ๐′ )] if and only if ๐ + ๐′ < ๐′ + ๐. Since −4 = [(0, 4)] and 3 = [(5, 2)], we have that −4 < 3 because 0 + 2 < 5 + 4. Since (โค, ≤) has no least element (Exercise 3), it is not well-ordered. However, it is a chain. Note that in the proof the symbol ≤ is again overused. THEOREM 5.3.3 (โค, ≤) is a linear order. PROOF We use the fact that the order on ๐ is a linear order (Theorem 5.2.11). Let ๐, ๐ ∈ โค, so ๐ = (๐, ๐) and ๐ = (๐′ , ๐′ ) for some ๐, ๐, ๐′ , ๐′ ∈ ๐. โ Because ๐ + ๐ ≤ ๐ + ๐, we have ๐ ≤ ๐. โ Suppose that ๐ ≤ ๐ and ๐ ≤ ๐. This implies that ๐ + ๐′ ≤ ๐′ + ๐ and ๐′ + ๐ ≤ ๐ + ๐′ . Since ≤ is antisymmetric on ๐, ๐ + ๐′ = ๐′ + ๐, which implies that ๐ = ๐, so ≤ is antisymmetric on โค. โ Using a similar strategy, it can be shown that ≤ is transitive on โค (Exercise 4). Thus, (โค, ≤) is a poset. โ Since (๐, ≤) is a linear order, ๐ + ๐′ ≤ ๐′ + ๐ or ๐′ + ๐ ≤ ๐ + ๐′ . This implies that ๐ ≤ ๐ or ๐ ≤ ๐, so โค is a chain under ≤. Although ๐ is not a subset of โค like ๐ is a subset of ๐, the set ๐ can be embedded in โค (Definition 4.5.29). This is shown by the function ๐ : ๐ → โค defined by ๐(๐) = [(๐, 0)]. (5.8) We see that ๐ is one-to-one because [(๐, 0)] = [(๐, 0)] implies that ๐ = ๐. The function ๐ is then an order isomorphism using the relation from Definition 5.3.2 (Exercise 1). As with ๐, we define what it means to add and multiply two integers. DEFINITION 5.3.4 Let [(๐, ๐)], [(๐ข, ๐ฃ)] ∈ โค. โ [(๐, ๐)] + [(๐ข, ๐ฃ)] = [(๐ + ๐ข, ๐ + ๐ฃ)]. โ [(๐, ๐)] ⋅ [(๐ข, ๐ฃ)] = [(๐๐ข + ๐๐ฃ, ๐๐ฃ + ๐ข๐)]. For example, the equation in โค, [(5, 2)] + [(7, 1)] = [(12, 3)], corresponds to the equation in ๐, 3 + 6 = 9, 252 Chapter 5 AXIOMATIC SET THEORY and the equation [(5, 2)] ⋅ [(7, 1)] = [(35 + 2, 5 + 14)] = [(37, 19)] corresponds to the equation 3 ⋅ 6 = 18. Before we check the properties of + and ⋅ on โค, we should confirm that these are well-defined. Exercise 5 is for addition. To prove that multiplication is well-defined, let [(๐, ๐)] = [(๐′ , ๐′ )] and [(๐ข, ๐ฃ)] = [(๐ข′ , ๐ฃ′ )]. Then, ๐ + ๐′ = ๐′ + ๐ and ๐ข + ๐ฃ′ = ๐ข′ + ๐ฃ. Hence, by Theorem 5.2.20 in ๐ we have that ๐๐ข + ๐′ ๐ข = ๐′ ๐ข + ๐๐ข, ๐๐ฃ + ๐′ ๐ฃ = ๐′ ๐ฃ + ๐๐ฃ, ๐′ ๐ข + ๐′ ๐ฃ′ = ๐′ ๐ข′ + ๐′ ๐ฃ, ๐′ ๐ข + ๐′ ๐ฃ′ = ๐′ ๐ข′ + ๐′ ๐ฃ. Therefore, ๐๐ข + ๐′ ๐ข + ๐′ ๐ฃ + ๐๐ฃ + ๐′ ๐ข + ๐′ ๐ฃ′ + ๐′ ๐ข′ + ๐′ ๐ฃ equals ๐′ ๐ข + ๐๐ข + ๐๐ฃ + ๐′ ๐ฃ + ๐′ ๐ข′ + ๐′ ๐ฃ + ๐′ ๐ข + ๐′ ๐ฃ′ , so by Cancellation (Theorem 5.2.22), ๐๐ข + ๐๐ฃ + ๐′ ๐ฃ′ + ๐′ ๐ข′ = ๐๐ฃ + ๐๐ข + ๐′ ๐ข′ + ๐′ ๐ฃ′ . This implies by Definition 5.3.1 that [(๐๐ข + ๐๐ฃ, ๐๐ฃ + ๐๐ข)] = [(๐′ ๐ข′ + ๐′ ๐ฃ′ , ๐′ ๐ฃ′ + ๐′ ๐ข′ )], and this by Definition 5.3.4 implies that [(๐, ๐)] ⋅ [(๐ข, ๐ฃ)] = [(๐′ , ๐′ )] ⋅ [(๐ข′ , ๐ฃ′ )]. We follow the same order of operations with + and ⋅ on โค as on ๐ and also write ๐๐ for ๐ ⋅ ๐. THEOREM 5.3.5 โ The binary operations + and ⋅ on โค are associative and commutative. โ [(0, 0)] is the additive identity for โค, and [(1, 0)] is its multiplicative identity. โ For every ๐ ∈ โค, there exists an additive inverse of ๐. โ For all ๐, ๐, ๐ ∈ โค, the distributive law holds. Section 5.3 INTEGERS AND RATIONAL NUMBERS 253 PROOF We will prove parts of the first and third properties and leave the remaining properties to Exercise 6. Let ๐, ๐, ๐ ∈ โค. This means that there exist natural numbers ๐, ๐ , ๐, ๐ , ๐ข, and ๐ฃ such that ๐ = [(๐, ๐)], ๐ = [(๐, ๐ )], and ๐ = [(๐ข, ๐ฃ)]. Then, ๐ + ๐ + ๐ = [(๐, ๐)] + [(๐, ๐ )] + [(๐ข, ๐ฃ)] = [(๐ + ๐, ๐ + ๐ )] + [(๐ข, ๐ฃ)] = [(๐ + ๐ + ๐ข, ๐ + ๐ + ๐ฃ)] = [(๐ + [๐ + ๐ข] , ๐ + [๐ + ๐ฃ])] = [(๐, ๐)] + [(๐ + ๐ข, ๐ + ๐ฃ)] = [(๐, ๐)] + ([(๐, ๐ )] + [(๐ข, ๐ฃ)]) = ๐ + (๐ + ๐) and ๐๐๐ = [(๐, ๐)] ⋅ [(๐, ๐ )] ⋅ [(๐ข, ๐ฃ)] = [(๐๐ + ๐๐ , ๐๐ + ๐๐)] ⋅ [(๐ข, ๐ฃ)] = [(๐ข [๐๐ + ๐๐ ] + ๐ฃ [๐๐ + ๐๐] , ๐ฃ [๐๐ + ๐๐ ] + ๐ข [๐๐ + ๐๐])] = [(๐ข๐๐ + ๐ข๐๐ + ๐ฃ๐๐ + ๐ฃ๐๐, ๐ฃ๐๐ + ๐ฃ๐๐ + ๐ข๐๐ + ๐ข๐๐)] = [(๐ [๐๐ข + ๐ ๐ฃ] + ๐ [๐ ๐ข + ๐๐ฃ] , ๐ [๐ ๐ข + ๐๐ฃ] + ๐ [๐๐ข + ๐ ๐ฃ])] = [(๐, ๐)] ⋅ [(๐๐ข + ๐ ๐ฃ, ๐ ๐ข + ๐๐ฃ)] = [(๐, ๐)] ⋅ ([(๐, ๐ )] ⋅ [(๐ข, ๐ฃ)]) = ๐(๐๐). To prove that every element of โค has an additive inverse, notice that [(๐, ๐)] + [(๐, ๐)] = [(๐ + ๐, ๐ + ๐)] = [(0, 0)]. Therefore, if ๐ = [(๐, ๐)], the additive inverse of ๐ is [(๐, ๐)]. For all ๐ ∈ โค, denote the additive inverse of ๐ by −๐, and for all ๐, ๐, ๐, ๐ ∈ ๐, define [(๐, ๐)] − [(๐, ๐ )] = [(๐, ๐)] + [(๐ , ๐)]. Rational Numbers As the integers were built using ๐, so the set of rational numbers will be built using โค. Its definition is motivated by the behavior of fractions in ๐. For instance, 2 8 = 3 12 because 2 ⋅ 12 = 3 ⋅ 8. Imagining that the ordered pair (๐, ๐) represents the fraction ๐โ๐, we define an equivalence relation. Notice that this is essentially the relation from Example 4.2.5. Notice that โค is defined using addition (Definition 5.3.1) while the rational numbers are defined using multiplication. 254 Chapter 5 AXIOMATIC SET THEORY DEFINITION 5.3.6 Let ๐ be the equivalence relation on โค × (โค โงต {0}) defined by (๐, ๐) ๐ (๐′ , ๐′ ) if and only if ๐๐′ = ๐′ ๐. Define โ = [โค × (โค โงต {0})]โ๐ to be the set of rational numbers. Using Definition 5.3.6, we associate elements of ๐ with elements of โ: ๐ โฎ −2 −1 0 1 2 โฎ ๐ โฎ 1โ2 2โ3 3โ4 4โ5 5โ6 โฎ โ โฎ [(−2, 1)] [(−1, 1)] [(0, 1)] [(1, 1)] [(2, 1)] โฎ โ โฎ [(1, 2)] [(2, 3)] [(3, 4)] [(4, 5)] [(5, 6)] โฎ Notice that since 1, 2 ∈ โค, the equivalence class [(1, 2)] is the name for [([(1, 0)]๐ , [(2, 0)]๐ ]๐ = [[({∅}, ∅)]๐ , [({∅, {∅}}, ∅)]๐ ]๐ , where ๐ is the relation of Definition 5.3.1. Also, [(1, 2)] = [(2, 4)] = [(3, 6)] = · · · . We next define a partial order on โ. DEFINITION 5.3.7 For all [(๐, ๐)], [(๐′ , ๐′ )] ∈ โ, define [(๐, ๐)] ≤ [(๐′ , ๐′ )] if and only if ๐๐′ ≤ ๐๐′ and [(๐, ๐)] < [(๐′ , ๐′ )] if and only if ๐๐′ < ๐๐′ . When working with โ, we often denote [(๐, ๐)] by ๐โ๐ or ๐๐ and [(๐, 1)] by ๐. Thus, since 2โ3 = [(2, 3)] and 7โ8 = [(7, 8)], we conclude that 2โ3 < 7โ8 because 2⋅8 < 3⋅7. As ๐ can be embedded in โค, so โค can be embedded in โ. The function that can be used is [ ] ๐ : โค → โค × (โค โงต {0}) โ๐ defined by ๐(๐) = [(๐, 1)]. (5.9) (See Exercise 2) Using the function ๐ (5.8), we see that ๐ can be embedded in โ via the order isomorphism ๐ โฆ ๐. Since (โ, ≤) has no least element (Exercise 3), it is not well-ordered. However, it is a chain (Exercise 11). Section 5.3 INTEGERS AND RATIONAL NUMBERS 255 THEOREM 5.3.8 (โ, ≤) is a linear order. Lastly, we define two operations on โ that represent the standard operations of + and ⋅ on ๐. DEFINITION 5.3.9 Let [(๐, ๐)], [(๐′ , ๐′ )] ∈ โ. โ [(๐, ๐)] + [(๐′ , ๐′ )] = [(๐๐′ + ๐′ ๐, ๐๐′ )]. โ [(๐, ๐)] ⋅ [(๐′ , ๐′ )] = [(๐๐′ , ๐๐′ )]. That + and ⋅ as given in Definition 5.3.9 are binary operations is left to Exercise 13. As examples of these two operations, the equation in โ [(1, 3)] + [(3, 4)] = [(1 ⋅ 4 + 3 ⋅ 3, 3 ⋅ 4)] = [(13, 12)] corresponds to the equation in ๐ 1 3 4 9 13 + = + = 3 4 12 12 12 and the equation in โ [(1, 3)] ⋅ [(3, 4)] = [(1 ⋅ 3, 3 ⋅ 4)] = [(3, 12)] = [(1, 4)] corresponds to the equation in ๐ 3 1 1 3 ⋅ = = . 3 4 12 4 We follow the same order of operations with + and ⋅ on โ as on โค and also write ๐๐ for ๐ ⋅ ๐. THEOREM 5.3.10 โ The binary operations + and ⋅ on โ are associative and commutative. โ [(0, 1)] is the additive identity, and [(1, 1)] is the multiplicative identity. โ Every rational number has an additive inverse, and every element of โโงต{[(0, 1)]} has a multiplicative inverse. โ The distributive law holds. 256 Chapter 5 AXIOMATIC SET THEORY PROOF Since [(๐, ๐)] ⋅ [(1, 1)] = [(๐ ⋅ 1, ๐ ⋅ 1)] = [(๐, ๐)] for all [(๐, ๐)] ∈ โ and multiplication is commutative, the multiplicative identity of โ is [(1, 1)]. Also, because ๐ ≠ 0 and ๐ ≠ 0 implies that [(๐, ๐)] ⋅ [(๐, ๐)] = [(๐๐, ๐๐)] = [(1, 1)], every element of โ โงต [(0, 1)] has a multiplicative inverse. The other properties are left to Exercise 14. For all ๐, ๐ ∈ โ with ๐ ≠ 0, denote the additive inverse of ๐ by −๐ and the multiplicative inverse of ๐ by ๐−1 . Actual Numbers We now use the elements of ๐, โค, and โ as if they were the actual elements of ๐, ๐, and ๐. For example, we understand the formula ๐ ∈ โค to mean that ๐ is an integer of Definition 5.3.1 with all of the properties of ๐ ∈ ๐ญ. Also, when we write that the formula ๐(๐) is satisfied by some rational number ๐, we interpret this to mean that ๐(๐) with ๐ ∈ โ where ๐ has all of the properties of ๐ ∈ ๐. This allows us to use the set properties of a natural number, integer, or rational number when needed yet also use the results we know concerning the actual numbers. That the partial orders and binary operations defined on ๐, โค, and โ have essentially the same properties as those on ๐, ๐, and ๐ allows for this association of properties to be legitimate. Exercises 1. Prove that the function ๐ (5.8) is an order isomorphism using the order of Definition 5.3.2. 2. Prove that the function ๐ (5.9) is an order isomorphism using the order of Definition 5.3.7. 3. Show that (โค, ≤) and (โ, ≤) do not have least elements. 4. Prove that ≤ is transitive on โค. 5. Prove that + is well-defined on โค. 6. Complete the remaining proofs of the properties of Theorem 5.3.5. 7. Show that every nonempty subset of โค− = {๐ : ๐ ∈ โค ∧ ๐ < 0} has a greatest element. 8. Let ๐, ๐ ∈ โค and ๐ ∈ โค− (Exercise 7) Prove that if ๐ ≤ ๐, then ๐๐ ≤ ๐๐. 9. Prove that for every ๐, ๐ ∈ โค, if ๐๐ = 0, then ๐ = 0 or ๐ = 0. Show that this same result also holds for โ. 10. The cancellation law for the integers states that for all ๐, ๐, ๐ ∈ โค, โ if ๐ + ๐ = ๐ + ๐, then ๐ = ๐, โ if ๐๐ = ๐๐ and ๐ ≠ 0, then ๐ = ๐. Section 5.4 MATHEMATICAL INDUCTION 257 Prove that the cancellation law holds for โค. Prove that a similar law holds for โ. 11. Prove that (โ, ≤) is a chain. 12. Demonstrate that every nonempty set of integers with a lower bound with respect to ≤ has a least element. 13. Show that + and ⋅ as given in Definition 5.3.9 are binary operations on โ. 14. Prove the remaining parts of Theorem 5.3.10. 15. Let ๐, ๐, ๐, ๐ ∈ โ. Prove. (a) If 0 ≤ ๐ ≤ ๐ and 0 ≤ ๐ ≤ ๐, then ๐๐ ≤ ๐๐. (b) If 0 ≤ ๐ < ๐ and 0 ≤ ๐ < ๐, then ๐๐ < ๐๐. 16. Let ๐, ๐ ∈ โ. Prove that if ๐ < 0, then ๐ + ๐ < ๐. 17. Let ๐, ๐ ∈ โ. Prove that if ๐ ≥ 0 and ๐ < 1, then ๐๐ ≤ ๐. 18. Prove that between any two rational numbers is a rational number. This means that โ is dense. Show that this is not the case for โค. 19. Generalize the definition of exponentiation given in Exercise 5.2.17 to โค. Prove that this defines a one-to-one function โค × (โค+ ∪ {0}) → โค. Does the proof require recursion (Theorem 5.2.14)? 20. Let ๐, ๐, ๐ ∈ โค. Assume that ๐ ≤ ๐. Demonstrate the following. (a) ๐ + ๐ ≤ ๐ + ๐. (b) ๐๐ ≤ ๐๐ if 0 ≤ ๐. (c) ๐๐ ≤ ๐๐ if ๐ < 0. (d) ๐๐ ≤ ๐๐ if ๐ ≥ 0. 21. Let ๐ : ๐ → โค be the embedding defined by (5.8). Define ๐ : ๐ × ๐ → ๐ by ๐ (๐, ๐) = ๐๐ and ๐ : โค × (โค+ ∪ {0}) → โค by ๐(๐ข, ๐ฃ) = ๐ข๐ฃ . Prove that for all ๐, ๐ ∈ ๐, we have that ๐(๐(๐), ๐(๐)) = ๐(๐ (๐, ๐)). Explain the significance of this result. 5.4 MATHEMATICAL INDUCTION Suppose that we want to prove ๐(๐) for all integers ๐ greater than or equal to some ๐0 . Our previous method (Section 2.4) is to take an arbitrary ๐ ≥ ๐0 and try to prove ๐(๐). If this is not possible, we might be tempted to try proving ๐(๐) for each ๐ individually. This is impossible because it would take infinitely many steps. Instead, we combine the results of Sections 5.2 and 5.3 and use the next theorem. THEOREM 5.4.1 [Mathematical Induction 1] Let ๐(๐) be a formula. For any ๐0 ∈ โค, if ๐(๐0 ) ∧ ∀๐ ∈ โค[๐ ≥ ๐0 ∧ ๐(๐) → ๐(๐ + 1)], then ∀๐ ∈ โค[๐ ≥ ๐0 → ๐(๐)]. 258 Chapter 5 AXIOMATIC SET THEORY PROOF Assume ๐(๐0 ) and that ๐(๐) implies ๐(๐ + 1) for all integers ๐ ≥ ๐0 . Define ๐ด = {๐ ∈ ๐ : ๐(๐0 + ๐)}. Notice that 0 ∈ ๐ด ⇔ ๐(๐0 ), 1 ∈ ๐ด ⇔ ๐(๐0 + 1), and so on. To prove ๐(๐) for all integers ๐ ≥ ๐0 , we show that ๐ด is inductive. โ By hypothesis, 0 ∈ ๐ด because ๐(๐0 ). โ Assume ๐ ∈ ๐ด. This implies that ๐(๐0 + ๐). Therefore, ๐(๐0 + ๐ + 1), so ๐ + 1 ∈ ๐ด. Theorem 5.4.1 gives rise to a standard proof technique known as mathematical induction. First, prove ๐(๐0 ). Then, show that ๐(๐) implies ๐(๐ + 1) for every integer ๐ ≥ ๐0 . Often an analogy of dominoes is used to explain this. Proving ๐(๐0 ) is like tipping over the first domino, and then proving the implication shows that the dominoes have been set properly. This means that by modus ponens, if ๐(๐0 + 1) is true, then ๐(๐0 + 2) is true, and so forth, each falling like dominoes: ๐(๐0 ) ๐(๐0 ) → ๐(๐0 + 1) ∴ ๐(๐0 + 1) ๐(๐0 + 1) → ๐(๐0 + 2) ∴ ๐(๐0 + 2) · · · This two-step process is characteristic of proofs by mathematical induction. So much so, the two stages have their own terminology. โ Proving ๐(๐0 ) is called the basis case. It is typically the easiest part of the proof but should, nonetheless, be explicitly shown. โ Proving that ๐(๐) implies ๐(๐ + 1) is the induction step. For this, we typically use direct proof, assuming ๐(๐) to show ๐(๐ + 1). The assumption is called the induction hypothesis. Often induction is performed to prove a formula for all positive integers, so to represent this set, define โค+ = {1, 2, 3, … }. EXAMPLE 5.4.2 Prove ๐(๐) for all ๐ ∈ โค+ , where ๐(๐) := 12 + 22 + · · · + ๐2 = ๐(๐ + 1)(2๐ + 1) . 6 Proceed by mathematical induction. โ The basis case ๐(1) holds because 1(1 + 1)(2 ⋅ 1 + 1) 1(2)(3) = = 12 . 6 6 Section 5.4 MATHEMATICAL INDUCTION 259 โ Now for the induction step, assume 12 + 22 + · · · + ๐2 = ๐(๐ + 1)(2๐ + 1) . 6 (5.10) This is the induction hypothesis. We must show that the equation holds for ๐ + 1. Adding (๐ + 1)2 to both sides of (5.10) gives 12 + 22 + · · · + ๐2 + (๐ + 1)2 = = = = = ๐(๐ + 1)(2๐ + 1) + (๐ + 1)2 6 (๐ + 1) [๐(2๐ + 1) + 6(๐ + 1)] 6 (๐ + 1)(2๐2 + 7๐ + 6) 6 (๐ + 1)(๐ + 2)(2๐ + 3) 6 (๐ + 1)([๐ + 1] + 1)(2[๐ + 1] + 1) . 6 EXAMPLE 5.4.3 Let ๐ฅ be a positive rational number. To prove that for any ๐ ∈ โค+ , (๐ฅ + 1)๐ ≥ ๐ฅ๐ + 1, we proceed by mathematical induction. โ (๐ฅ + 1)1 = ๐ฅ + 1 = ๐ฅ1 + 1. โ Let ๐ ∈ โค+ and assume that (๐ฅ + 1)๐ ≥ ๐ฅ๐ + 1. By multiplying both sides of the given inequality by ๐ฅ + 1, we have (๐ฅ + 1)๐ (๐ฅ + 1) ≥ (๐ฅ๐ + 1)(๐ฅ + 1) = ๐ฅ๐+1 + ๐ฅ๐ + ๐ฅ + 1 ≥ ๐ฅ๐+1 + 1. The first inequality is true by induction (that is, by appealing to the induction hypothesis) and since ๐ฅ + 1 is positive, and the last one holds because ๐ฅ ≥ 0. EXAMPLE 5.4.4 Recursively define a sequence of numbers, ๐1 = 3, ๐๐ = 2๐๐−1 for all ๐ ∈ โค such that ๐ > 1. 260 Chapter 5 AXIOMATIC SET THEORY The sequence is ๐1 = 3, ๐2 = 6, ๐3 = 12, ๐4 = 24, ๐5 = 48, … , and we conjecture that ๐๐ = 3 ⋅ 2๐−1 for all positive integers ๐. We prove this by mathematical induction. โ For the basis case, ๐1 = 3 ⋅ 20 = 3. โ Let ๐ > 1 and assume ๐๐ = 3 ⋅ 2๐−1 . Then, ๐๐+1 = 2๐๐ = 2 ⋅ 3 ⋅ 2๐−1 = 3 ⋅ 2๐ . EXAMPLE 5.4.5 Use mathematical induction to prove that ๐3 < ๐! for all integers, ๐ ≥ 6. โ First, show that the inequality holds for ๐ = 6: 63 = 216 < 720 = 6!. โ Assume ๐3 < ๐! with ๐ ≥ 6. The induction hypothesis yields three inequalities. Namely, 3๐2 < ๐ ⋅ ๐2 ≤ ๐!, 3๐ < ๐ ⋅ ๐ < ๐3 ≤ ๐!, and 1 < ๐!. Therefore, (๐ + 1)3 = ๐3 + 3๐2 + 3๐ + 1 < ๐! + ๐! + ๐! + ๐! = 4๐! < (๐ + 1)๐! = (๐ + 1)!. Combinatorics We now use mathematical induction to prove some basic results from two areas of mathematics. The first is combinatorics, the study of the properties that sets have based purely on their size. A permutation of a given set is an arrangement of the elements of the set. For example, the number of permutations of {๐, ๐, ๐, ๐, ๐, ๐ } is 720. If we were to write all of the permutations in a list, it would look like the following: Section 5.4 MATHEMATICAL INDUCTION ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ 261 โฎ We observe that 6! = 720 and hypothesize that the number of permutations of a set with ๐ elements is ๐!. To prove it, we use mathematical induction. โ There is only one way to write the elements of a singleton. Since 1! = 1, we have proved the basis case. โ Assume that the number of permutations of a set with ๐ ≥ 1 elements is ๐!. Let ๐ด = {๐1 , ๐2 , … , ๐๐+1 } be a set with ๐ + 1 elements. By induction, there are ๐! permutations of the set {๐1 , ๐2 , … , ๐๐ }. After writing the permutations in a list, notice that there are ๐ + 1 columns before, between, and after each element of the permutations: ๐1 ๐1 โฎ ๐๐ … … ๐2 ๐2 โฎ ๐๐−1 … ๐๐ ๐๐−1 โฎ ๐1 To form the permutations of ๐ด, place ๐๐+1 into the positions of each empty column. For example, if ๐๐+1 is put into the first column, the following permutations are obtained: ๐๐+1 ๐๐+1 โฎ ๐๐+1 ๐1 ๐1 โฎ ๐๐ ๐2 ๐2 โฎ ๐๐−1 … … … ๐๐ ๐๐−1 โฎ ๐1 Since there are ๐! rows with ๐ + 1 ways to add ๐๐+1 to each row, we conclude that there are (๐ + 1)๐! = (๐ + 1)! permutations of ๐ด. This argument proves the first theorem. THEOREM 5.4.6 Let ๐ ∈ โค+ . The number of permutations of a set with ๐ elements is ๐!. Suppose that we do not want to rearrange the entire set but only subsets of it. For example, let ๐ด = {๐, ๐, ๐, ๐, ๐}. To see all three-element permutations of ๐ด, look at the 262 Chapter 5 AXIOMATIC SET THEORY following list: ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ There are 60 arrangements because there are 5 choices for the first entry. Once that is chosen, there are only 4 left for the second, and then 3 for the last. We calculate that as 60 = 5 ⋅ 4 ⋅ 3 = 5! 5⋅4⋅3⋅2⋅1 = . 2⋅1 (5 − 3)! Generalizing, we define for all ๐, ๐ ∈ ๐, ๐ ๐๐ = ๐! , (๐ − ๐)! and we conclude the following theorem. THEOREM 5.4.7 Let ๐, ๐ ∈ โค+ . The number of permutations of ๐ elements from a set with ๐ elements is ๐ ๐๐ . Now suppose that we only want to count subsets. For example, ๐ด = {๐, ๐, ๐, ๐, ๐} has 10 subsets of three elements. They are the following: {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} {๐, ๐, ๐} The number of subsets can be calculated by considering the next grid. ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ ๐๐๐ โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ 3! columns โซ โช โช โช โช โช โฌ 10 rows โช โช โช โช โช โญ Section 5.4 MATHEMATICAL INDUCTION 263 There are 5 ๐3 permutations with three elements from ๐ด. They are found as the entries in the grid. However, since we are looking at subsets, we do not want to count ๐๐๐ as different from ๐๐๐ because {๐, ๐, ๐} = {๐, ๐, ๐}. For this reason, all elements in any given row of the grid are considered as one subset. Each row has 6 = 3! entries because that is the number of permutations of a set with three elements. Hence, multiplying the number of rows by the number of columns gives 5 ๐3 = 10(3!). Therefore, 5 ๐3 3! = 5! 5! = = 10. 3!(5 − 3)! 3!2! A generalization of this calculation leads to the formula for the arbitrary binomial coefficient, ( ) ๐ ๐! = , ๐ ๐!(๐ − ๐)! () where ๐, ๐ ∈ ๐. Read ๐๐ as “๐ choose ๐.” A generalization of the argument leads to the next theorem. THEOREM 5.4.8 Let( ๐, ๐ ∈ โค+ . The number of subsets of ๐ elements from a set with ๐ elements ๐) is ๐ . When we expand (๐ฅ + 1)3 , we find that (๐ฅ + 1)3 = ๐ฅ3 + 3๐ฅ2 + 3๐ฅ2 + 1 = 3 ( ) ∑ 3 3−๐ ๐ ๐ฅ 1. ๐ ๐=0 To prove this for any binomial (๐ฅ + ๐ฆ)๐ , we need the following equation. It was proved by Blaise Pascal (1653). The proof is Exercise 7. LEMMA 5.4.9 [Pascal’s Identity] If ๐, ๐ ∈ ๐ so that ๐ ≥ ๐, ( ) ( ) ( ) ๐ ๐ ๐+1 + = . ๐ ๐−1 ๐ THEOREM 5.4.10 [Binomial Theorem] Let ๐ ∈ โค+ . Then, ๐ ( ) ∑ ๐ ๐−๐ ๐ (๐ฅ + ๐ฆ) = ๐ฅ ๐ฆ. ๐ ๐=0 ๐ 264 Chapter 5 AXIOMATIC SET THEORY PROOF โ Since ( 1) 0 = (1) 1 = 1, ( ) ( ) 1 ( ) ∑ 1 1−๐ ๐ 1 1 ๐ฅ ๐ฆ. ๐ฆ= ๐ฅ+ (๐ฅ + ๐ฆ) = ๐ 1 0 ๐=0 1 โ Assume for ๐ ∈ โค+ , ๐ ( ) ∑ ๐ ๐−๐ ๐ ๐ฅ ๐ฆ. (๐ฅ + ๐ฆ) = ๐ ๐=0 ๐ Then, (๐ฅ + ๐ฆ)๐+1 = (๐ฅ + ๐ฆ)(๐ฅ + ๐ฆ)๐ = (๐ฅ + ๐ฆ) ๐ ( ) ∑ ๐ ๐−๐ ๐ ๐ฅ ๐ฆ. ๐ ๐=0 Multiplying the (๐ฅ + ๐ฆ) term through the summation yields ๐ ( ) ๐ ( ) ∑ ๐ ๐−๐+1 ๐ ∑ ๐ ๐−๐ ๐+1 ๐ฅ ๐ฆ + ๐ฅ ๐ฆ . ๐ ๐ ๐=0 ๐=0 Taking out the (๐ + 1)-degree terms and shifting the index on the second summation gives ๐ฅ ๐+1 ) ๐ ( ) ๐ ( ∑ ๐ ๐−๐+1 ๐ ∑ ๐ + ๐ฅ ๐ฆ + ๐ฅ๐−๐+1 ๐ฆ๐ + ๐ฆ๐+1 , ๐ ๐ − 1 ๐=1 ๐=1 which using Pascal’s identity (Lemma 5.4.9) equals ๐ฅ๐+1 + and this is ) ๐ ( ∑ ๐ + 1 ๐−๐+1 ๐ ๐ฅ ๐ฆ + ๐ฆ๐+1 , ๐ ๐=1 ( ๐+1 ∑ ๐=0 ) ๐ + 1 ๐+1−๐ ๐ ๐ฅ ๐ฆ. ๐ Euclid’s Lemma Our second application of mathematical induction comes from number theory. It is the study of the greatest common divisor (Definition 3.3.11). We begin with a lemma. LEMMA 5.4.11 Let ๐, ๐, ๐ ∈ โค such that ๐ ≠ 0 or ๐ ≠ 0. If ๐ โฃ ๐๐ and gcd(๐, ๐) = 1, then ๐ โฃ ๐. Section 5.4 MATHEMATICAL INDUCTION 265 PROOF Assume ๐ โฃ ๐๐ and gcd(๐, ๐) = 1. Then, ๐๐ = ๐๐ for some ๐ ∈ โค. By Theorem 4.3.32, there exist ๐, ๐ ∈ โค such that 1 = ๐๐ + ๐๐. Therefore, ๐ โฃ ๐ because ๐ = ๐๐๐ + ๐๐๐ = ๐๐๐ + ๐๐๐ = ๐(๐๐ + ๐๐). Suppose that ๐ ∈ โค is a prime (Example 2.4.18) that does not divide ๐. We show that gcd(๐, ๐) = 1. Take ๐ > 0 and assume ๐ โฃ ๐ and ๐ โฃ ๐. Since ๐ is prime, ๐ = 1 or ๐ = ๐. Since ๐ - ๐, we conclude that ๐ must equal 1, which means gcd(๐, ๐) = 1. Use this to prove the next result attributed to Euclid (Elements VII.30). THEOREM 5.4.12 [Euclid’s Lemma] An integer ๐ > 1 is prime if and only if ๐ โฃ ๐๐ implies ๐ โฃ ๐ or ๐ โฃ ๐ for all ๐, ๐ ∈ โค. PROOF โ Let ๐ be prime. Suppose ๐ โฃ ๐๐ but ๐ - ๐. Then, gcd(๐, ๐) = 1. Therefore, ๐ โฃ ๐ by Theorem 5.4.11. โ Let ๐ > 1. Suppose ๐ satisfies the condition, ∀๐∀๐(๐ โฃ ๐๐ → ๐ โฃ ๐ ∨ ๐ โฃ ๐). Assume ๐ is not prime. This means that there are integers ๐ and ๐ so that ๐ = ๐๐ with 1 < ๐ ≤ ๐ < ๐. Hence, ๐ โฃ ๐๐. By hypothesis, ๐ โฃ ๐ or ๐ โฃ ๐. However, since ๐, ๐ < ๐, ๐ can divide neither ๐ nor ๐. This is a contradiction. Hence, ๐ must be prime. Since 6 divides 3 ⋅ 4 but 6 - 3 and 6 - 4, the lemma tells us that 6 is not prime. On the other hand, if ๐ is a prime that divides 12, then ๐ divides 4 or 3. This means that ๐ = 2 or ๐ = 3. The next theorem is a generalization of Euclid’s lemma. Its proof uses mathematical induction. THEOREM 5.4.13 Let ๐ be prime and ๐๐ ∈ โค for ๐ = 0, 1, … , ๐ − 1. If ๐ โฃ ๐0 ๐1 · · · ๐๐−1 , then ๐ โฃ ๐๐ for some ๐ = 0, 1, … , ๐ − 1. PROOF โ The case when ๐ = 1 is trivial because ๐ divides ๐0 by definition of the product. โ Assume if ๐ โฃ ๐0 ๐1 · · · ๐๐−1 , then ๐ โฃ ๐๐ for some ๐ = 0, 1, … , ๐ − 1. Suppose ๐ โฃ ๐0 ๐1 · · · ๐๐ . Then, by Lemma 5.4.12, ๐ โฃ ๐0 ๐1 · · · ๐๐−1 or ๐ โฃ ๐๐ . 266 Chapter 5 AXIOMATIC SET THEORY If ๐ โฃ ๐๐ , we are done. Otherwise, ๐ divides ๐0 ๐1 · · · ๐๐−1 . Hence, ๐ divides one of the ๐๐ by induction. Exercises 1. Let ๐ ∈ โค+ . Prove. ๐(๐ + 1) 2 (b) 1 + 3 + 5 + · · · + (2๐ − 1) = ๐2 (a) 1 + 2 + 3 + · · · + ๐ = ๐(2๐ − 1)(2๐ + 1) (c) 12 + 32 + 52 + · · · + (2๐ − 1)2 = 3 [ ]2 ๐(๐ + 1) 3 3 3 3 (d) 1 + 2 + 3 + · · · + ๐ = 2 1 − ๐๐+1 (๐ ≠ 1) 1−๐ (f) 1 ⋅ 1! + 2 ⋅ 2! + · · · + ๐ ⋅ ๐! = (๐ + 1)! − 1 2 ๐ 1 1 (g) + +···+ =1− 2! 3! (๐ + 1)! (๐ + 1)! (2๐)! (h) 2 ⋅ 6 ⋅ 10 ⋅ 14 ⋅ · · · ⋅ (4๐ − 2) = ๐! 2. Prove for all positive integers ๐. ๐ ∑ ๐(๐ + 1)(๐ + 2) (a) ๐(๐ + 1) = 3 ๐=1 (e) 1 + ๐ + ๐2 + · · · + ๐๐ = (b) ๐ ∑ ๐=1 1 ๐ = (2๐ − 1)(2๐ + 1) 2๐ + 1 3. Let ๐ ∈ โค+ . Prove. (a) ๐ < 2๐ (b) ๐! ≤ ๐๐ ๐ ∑ 1 1 (c) ≤2− 2 ๐ ๐ ๐=1 (d) 1 2 3 ๐ ๐ + + +···+ ๐ ≤2− ๐ 2 22 23 2 2 4. Let ๐ ∈ โค. Prove. (a) ๐2 < 2๐ for all ๐ ≥ 5. (b) 2๐ < ๐! for all ๐ ≥ 4. (c) ๐2 < ๐! for all ๐ ≥ 4. 5. For ๐ ∈ โค+ , prove that if ๐ด has ๐ elements, P(๐ด) has 2๐ elements. 6. Prove that the number of lines in a truth table with ๐ propositional variables is 2๐ . 7. Demonstrate Pascal’s identity (Lemma 5.4.9). Section 5.4 MATHEMATICAL INDUCTION 267 8. For all integers ๐ ≥ ๐ ≥ 0, prove the given equations. ( ) ( ) ๐ ๐ (a) = 1. = ๐ 0 ) ( ) ( ๐ ๐ (b) . = ๐−๐ ๐ 9. Let ๐, ๐ ∈ โค+ with ๐ ≥ ๐. Prove. ( ) ( ) ( ) ( ) ๐ ๐+1 ๐ ๐+1 (a) + +···+ = ๐ ๐ ๐ ๐+1 ( ) 2๐ + 1 2 2 2 2 (b) 1 + 3 + 5 + · · · + (2๐ − 1) = 3 10. Let ๐ ≥ 2 be an integer and prove the given equations. ๐ ( ) ∑ ๐ (a) ๐ = ๐2๐−1 ๐ ๐=1 ( ) ๐ ∑ ๐ ๐−1 (b) (−1) ๐ =0 ๐ ๐=1 11. Let ๐ and ๐ be positive integers and ๐ ≥ ๐. Use induction to show the given equations. ( ) ( ) ( ) ( ) ๐ ๐+1 ๐ ๐+1 (a) + +···+ = ๐ ๐ ๐ ๐+1 ( ) 2๐ + 1 2 2 2 2 (b) 1 + 3 + 5 + · · · + (2๐ − 1) = 3 12. Prove the following for all ๐ ∈ ๐. (a) 5 โฃ ๐5 − ๐ (b) 9 โฃ ๐3 + (๐ + 1)3 + (๐ + 2)3 (c) 8 โฃ 52๐ + 7 (d) 5 โฃ 33๐+1 + 2๐+1 13. If ๐ is prime and ๐ and ๐ are positive integers such that ๐ + ๐ = ๐, prove that gcd(๐, ๐) = 1. 14. Prove for all ๐ ∈ โค+ , there exist ๐ consecutive composite integers (Example 2.4.18) by showing that (๐ + 1)! + 2, (๐ + 1)! + 3, … , (๐ + 1)! + ๐ + 1 are composite. 15. Prove. (a) If ๐ ≠ 0, then ๐ ⋅ gcd(๐, ๐) = gcd(๐๐, ๐๐). (b) Prove if gcd(๐๐ , ๐) = 1 for ๐ = 1, … , ๐, then gcd(๐1 ⋅ ๐2 · · · ๐๐ , ๐) = 1). 16. For ๐ ∈ โค+ , let ๐0 , ๐1 , … , ๐๐−1 ∈ ๐, not all equal to zero. Define ๐ = gcd(๐0 , ๐1 , … , ๐๐−1 ) to mean that ๐ is the greatest integer such that ๐ โฃ ๐๐ for all ๐ = 0, 1, … , ๐−1. Assuming ๐ ≥ 3, prove the given equations. 268 Chapter 5 AXIOMATIC SET THEORY (a) gcd(๐0 , ๐1 , … , ๐๐−1 ) = gcd(๐0 , ๐1 , … , ๐๐−3 , gcd(๐๐−2 , ๐๐−1 )) (b) gcd(๐๐0 , ๐๐1 , … , ๐๐๐−1 ) = ๐ gcd(๐0 , ๐1 , … , ๐๐−1 ) for all integers ๐ ≠ 0 5.5 STRONG INDUCTION Suppose we want to find an equation for the terms of a sequence defined recursively in which each term is based on two or more previous terms. To prove that such an equation is correct, we modify mathematical induction. Remember the domino picture that we used to explain how mathematical induction works (page 258). The first domino is tipped causing the second to fall, which in turn causes the third to fall. By the time the sequence of falls reaches the ๐ + 1 domino, ๐ dominoes have fallen. This means that sentences ๐(1) through ๐(๐) have been proved true. It is at this point that ๐(๐ + 1) is proved. This is the intuition behind the next theorem. It is sometimes called strong induction. THEOREM 5.5.1 [Mathematical Induction 2] Let ๐(๐) be a formula. For any ๐0 ∈ โค, if ๐(๐0 ) ∧ ∀๐(๐ ∈ ๐ ∧ [∀๐(๐ ∈ ๐ ∧ [๐ ≤ ๐ → ๐(๐0 + ๐)]) → ๐(๐0 + ๐ + 1)]), then ∀๐[๐ ∈ โค ∧ ๐ ≥ ๐0 → ๐(๐)]. PROOF Assume ๐(๐0 ) and ๐(๐0 ) ∧ ๐(๐0 + 1) ∧ · · · ∧ ๐(๐0 + ๐) → ๐(๐0 + ๐ + 1) (5.11) for ๐ ∈ ๐. Define ๐(๐) := ๐(๐0 ) ∧ ๐(๐0 + 1) ∧ · · · ∧ ๐(๐0 + ๐). We proceed with the induction. โ Since ๐(๐0 ) holds, we have ๐(0). โ Assume ๐(๐) with ๐ ≥ 0. By definition of ๐(๐), we have ๐(๐0 ) through ๐(๐0 + ๐). Thus, ๐(๐0 + ๐ + 1) by (5.11) from which ๐(๐ + 1) follows. Therefore, ๐(๐) is true for all ๐ ∈ ๐ (Theorem 5.4.1). Hence, ๐(๐) for all integers ๐ ≥ ๐0 . Fibonacci Sequence Leonardo of Pisa (known as Fibonacci) in his 1202 work Liber abaci posed a problem about how a certain population of rabbits increases with time (Fibonacci and Sigler 2002). Each rabbit that is at least 2 months old is considered an adult. It is a young Section 5.5 STRONG INDUCTION 269 rabbit if it is a month old. Otherwise, it is a baby. The rules that govern the population are as follows: โ No rabbits die. โ The population starts with a pair of adult rabbits. โ Each pair of adult rabbits will bear a new pair each month. The population then grows according to the following table: Month 1 2 3 4 5 6 Adult Pairs 1 1 2 3 5 8 Young Pairs 0 1 1 2 3 5 Baby Pairs 1 1 2 3 5 8 It appears that the number of adult (or baby) pairs at month ๐ is given by the sequence, 1, 1, 2, 3, 5, 8, 13, 21, 34, … . This is known as the Fibonacci sequence, and each term of the sequence is called a Fibonacci number. Let ๐น๐ denote the ๐th term of the sequence. So ๐น1 = 1, ๐น2 = 1, ๐น3 = 2, ๐น4 = 3, ๐น5 = 5, ๐น6 = 8, … . Each term of the sequence can be calculated recursively by ๐น1 = 1, ๐น2 = 1, ๐น๐ = ๐น๐−1 + ๐น๐−2 (๐ > 2). (5.12) Since we have only checked a few terms, we have not proved that ๐น๐ is equal to the number of adult pairs in the ๐th month. To show this, we use strong induction. Since the recursive definition starts by explicitly defining ๐น1 and ๐น2 , the basis case for the induction will prove that the formula holds for ๐ = 1 and ๐ = 2. โ From the table, in each of the first 2 months, there is exactly one adult pair of rabbits. This coincides with ๐น1 = 1 and ๐น2 = 1 in (5.12). โ Let ๐ > 2, and assume that ๐น๐ equals the number of adult pairs in the ๐th month for all ๐ ≤ ๐. Because of the third rule, the number of pairs of adults in any month is the same as the number of adult pairs in the previous month plus the 270 Chapter 5 AXIOMATIC SET THEORY number of baby pairs 2 months prior. Therefore, # of adult pairs in month ๐ + 1 = # of adult pairs in month ๐ + # of young pairs in month ๐ = # of adult pairs in month ๐ + # of baby pairs in month ๐ − 1 = # of adult pairs in month ๐ + # of adult pairs in month ๐ − 1 = ๐น๐ + ๐น๐−1 = ๐น๐+1 . It turns out that the Fibonacci sequence is closely related to another famous object of study in the history of mathematics. Letting ๐ ≥ 1, define ๐๐ = ๐น๐+1 . ๐น๐ The first seven terms of this sequence are ๐1 ๐2 ๐3 ๐4 ๐5 ๐6 ๐7 = 1โ1 = 1, = 2โ1 = 2, = 3โ2 = 1.5, = 5โ3 ≈ 1.667, = 8โ5 = 1.6, = 13โ8 = 1.625, = 21โ13 ≈ 1.615. This sequence has a limit that we call ๐. To find this limit, notice that ๐น๐+1 ๐น + ๐น๐−1 ๐น = ๐ = 1 + ๐−1 . ๐น๐ ๐น๐ ๐น๐ Because ๐๐−1 = ๐น๐ โ๐น๐−1 when ๐ > 1, ๐๐ = 1 + and therefore, ๐๐ − 1 − 1 , ๐๐−1 1 = 0. ๐๐−1 Because lim ๐ ๐→∞ ๐ we conclude that = lim ๐๐−1 = ๐, ๐→∞ ๐ 2 − ๐ − 1 = 0. (5.13) Section 5.5 STRONG INDUCTION 271 √ Therefore, (1 ± 5)โ2 are the solutions to (5.13), but since ๐น๐+1 โ๐น๐ > 0, we take the positive value and find that √ 1+ 5 ๐= . 2 The number ๐ is called the golden ratio. It was considered by the ancient Greeks to represent the ratio of the sides of the most beautiful rectangle. EXAMPLE 5.5.2 Prove ๐น๐ ≤ ๐ ๐−1 when ๐ ≥ 2 using strong induction. โ Since ๐น2 = 1 < ๐ 1 ≈ 1.618, the inequality holds for ๐ = 2. โ Let ๐ ≥ 3, and √ assume that ๐น๐ ≤ ๐ ๐−1 for√all ๐ such that 2 ≤ ๐ ≤ ๐. −1 Because ๐ = ( 5 − 1)โ2 and ๐ −2 = (3 − 5)โ2, ๐ −1 + ๐ −2 = 1. Therefore, the induction hypothesis gives ๐น๐+1 = ๐น๐ + ๐น๐−1 ≤ ๐ ๐−1 + ๐ ๐−2 = ๐ ๐ (๐ −1 + ๐ −2 ) = ๐ ๐ . Unique Factorization Theorem 5.4.13 states that if a prime divides an integer, it divides one of the factors of the integer. It appears reasonable that any integer can then be written as a product that includes all of its prime divisors. For example, we can write 126 = 2 ⋅ 3 ⋅ 3 ⋅ 7, and this is essentially the only way in which we can write 126 as a product of primes. All of this is summarized in the next theorem. It is also known as the fundamental theorem of arithmetic. It is the reason the primes are important. They are the building blocks of the integers. THEOREM 5.5.3 [Unique Factorization] If ๐ > 1, there exists a unique sequence of primes ๐0 ≤ ๐1 ≤ · · · ≤ ๐๐ (๐ ∈ ๐) such that ๐ = ๐0 ๐1 · · · ๐๐ . PROOF Prove existence with strong induction on ๐. โ When ๐ = 2, we are done since 2 is prime. โ Assume that ๐ can be written as the product of primes as described above for all ๐ such that 2 ≤ ๐ < ๐. If ๐ is prime, we are done as in the basis case. So suppose ๐ is composite. Then, there exist integers ๐ and ๐ such that ๐ = ๐๐ and 1 < ๐ ≤ ๐ < ๐. By the induction hypothesis, we can write ๐ = ๐0 ๐1 · · · ๐๐ข 272 Chapter 5 AXIOMATIC SET THEORY and ๐ = ๐0 ๐1 · · · ๐๐ฃ , where the ๐๐ and ๐๐ are primes. Now place these primes together in increasing order and relabel them as ๐0 ≤ ๐1 ≤ · · · ≤ ๐๐ with ๐ = ๐ข + ๐ฃ. Then, ๐ = ๐0 ๐1 · · · ๐๐ as desired. For uniqueness, suppose that there are two sets of primes ๐0 ≤ ๐1 ≤ · · · ≤ ๐๐ and ๐0 ≤ ๐1 ≤ · · · ≤ ๐๐ so that ๐ = ๐0 ๐1 · · · ๐๐ = ๐0 ๐1 · · · ๐๐ . By canceling, if necessary, we can assume the sides have no common primes. If the cancellation yields 1 = 1, the sets of primes are the same. In order to obtain a contradiction, assume that there is at least one prime remaining on the left-hand side. Suppose it is ๐0 . If the product on the right equals 1, then ๐0 โฃ 1, which is impossible. If there are primes remaining on the right, ๐0 divides one of them by Lemma 5.4.12. This is also a contradiction, since the sides have no common prime factors because of the cancellation. Hence, the two sequences must be the same. Unique Factorization allows us to make the following definition. DEFINITION 5.5.4 Let ๐ ∈ โค+ . If ๐0 , ๐1 , … , ๐๐−1 are distinct primes and ๐0 , ๐1 , … , ๐๐−1 are natural numbers such that ๐๐−1 ๐ ๐ ๐ = ๐00 ๐11 · · · ๐๐−1 , ๐ ๐ ๐ ๐−1 is called a prime power decomposition of ๐. then ๐00 ๐11 · · · ๐๐−1 EXAMPLE 5.5.5 Consider the integer 360. It has 23 ⋅32 ⋅51 as a prime power decomposition. If the exponents are limited to positive integers, the expression is unique. In this sense, we can say that 23 ⋅ 32 ⋅ 51 is the prime power decomposition of 360. However, there are times when primes need to be included in the product that are not factors of the integer. By setting the exponent to zero, these primes can be included. For example, we can also write 360 as 23 ⋅ 32 ⋅ 51 ⋅ 70 . EXAMPLE 5.5.6 Suppose ๐ ∈ โค such that ๐ > 1. Use unique factorization (Theorem 5.5.3) to prove that ๐ is a perfect square if and only if all powers in a prime power decomposition of ๐ are even. Section 5.5 STRONG INDUCTION 273 โ Let ๐ be a perfect square. This means that ๐ = ๐2 for some integer ๐ > 1. Write a prime power decomposition of ๐, ๐ ๐ ๐ ๐−1 ๐ = ๐00 ๐11 · · · ๐๐−1 . Therefore, 2๐ 2๐ 2๐ ๐ = ๐2 = ๐0 0 ๐1 1 · · · ๐๐−1๐−1 . โ Assume all the powers are even in a prime power decomposition of ๐. Namely, ๐๐−1 ๐ ๐ ๐ = ๐00 ๐11 · · · ๐๐−1 , where there exists ๐ข๐ ∈ โค so that ๐๐ = 2๐ข๐ for ๐ = 0, 1, … , ๐ − 1. Thus, ( ๐ข ๐ข 2๐ข ๐ข๐−1 )2 2๐ข 2๐ข , ๐ = ๐0 0 ๐1 1 · · · ๐๐−1๐−1 = ๐00 ๐11 · · · ๐๐−1 a perfect square. Exercises 1. Given each recursive definition, prove the formula for ๐๐ holds for all positive integers ๐. (a) If ๐1 = −1 and ๐๐ = −๐๐−1 , then ๐๐ = (−1)๐ . (b) If ๐1 = 1 and ๐๐ = 1โ3๐๐−1 , then ๐๐ = (1โ3)๐−1 . (c) If ๐1 = 0, ๐2 = −6, and ๐๐ = 5๐๐−1 − 6๐๐−2 , then ๐๐ = 3 ⋅ 2๐ − 2 ⋅ 3๐ . (d) If ๐1 = 4, ๐2 = 12, and ๐๐ = 4๐๐−1 − 2๐๐−2 , then √ √ ๐๐ = (2 + 2)๐ + (2 − 2)๐ . (e) If ๐1 = 1, ๐2 = 5, and ๐๐1 = ๐๐ + 2๐๐−1 for all ๐ > 2, then ๐๐ = 2๐ + (−1)๐ . (f) If ๐1 = 3, ๐2 = −3, ๐3 = 9, and ๐๐ = ๐๐−1 + 4๐๐−2 − 4๐๐−3 , then ๐๐ = 1 − (−2)๐ . (g) If ๐1 = 3, ๐2 = 10, ๐3 = 21, and ๐๐ = 3๐๐−1 − 3๐๐−2 + ๐๐−3 , then ๐๐ = ๐ + 2๐2 . 2. Let ๐1 = ๐, ๐2 = ๐, and ๐๐ = ๐๐−1 + ๐๐−2 for all ๐ > 2. This sequence is called the generalized Fibonacci sequence. Show that ๐๐ = ๐๐๐−2 + ๐๐๐−1 for all ๐ > 2. 274 Chapter 5 AXIOMATIC SET THEORY 3. Let ๐ > 0 be an integer. Prove. (a) ๐น๐+2 > ๐ ๐ ๐ ∑ (b) ๐น๐ = ๐น๐+2 − 1 ๐=1 4. Prove that Theorem 5.5.1 implies Theorem 5.4.1. √ ๐ ๐ − ๐๐ 5. Let ๐ = (1 − 5)โ2 and demonstrate that ๐น๐ = √ . 5 6. Let ๐ ≥ 1 and ๐ ∈ โค. Prove. (a) ๐๐+1 − 1 = (๐ + 1)(๐๐ − 1) − ๐(๐๐−1 − 1). (b) ๐๐ − 1 = (๐ − 1)(๐๐−1 + ๐๐−2 + · · · + ๐ + 1). 7. For all ๐ ∈ ๐, prove that 12 divides ๐4 − ๐2 (Definition 2.4.2). 8. Assume ๐ โฃ ๐ and ๐ โฃ ๐. Write prime power decompositions for ๐ and ๐: ๐ ๐ ๐ ๐ ๐ ๐ ๐−1 ๐ = ๐00 ๐11 · · · ๐๐−1 and ๐−1 . ๐ = ๐00 ๐11 · · · ๐๐−1 Prove that there exist ๐ก0 , ๐ก1 , … ๐ก๐−1 ∈ ๐ such that ๐ก ๐ก ๐ก ๐−1 , ๐ = ๐00 ๐11 · · · ๐๐−1 ๐ก๐ ≤ ๐๐ , and ๐ก๐ ≤ ๐ ๐ for all ๐ = 0, 1, … , ๐ − 1. 9. Prove that ๐3 โฃ ๐2 implies ๐ โฃ ๐ for all ๐, ๐ ∈ โค. 10. Let ๐ ∈ โค+ . Let ๐ have the property that for all primes ๐, if ๐ โฃ ๐, then ๐2 โฃ ๐. Prove that ๐ is the product of a perfect square and a perfect cube. 11. Prove that gcd(๐น๐ , ๐น๐+2 ) = 1 for all ๐ ∈ โค+ . 5.6 REAL NUMBERS As โค is defined using ๐ and โ is defined using โค, the set analog to ๐ is defined using โ. We start with a definition. DEFINITION 5.6.1 Let (๐ด, 4) be a poset. The set ๐ต is an initial segment of ๐ด when ๐ต ⊆ ๐ด and for all ๐, ๐ ∈ ๐ด, if ๐ 4 ๐ and ๐ ∈ ๐ต, then ๐ ∈ ๐ต. (5.14) An initial segment ๐ต of ๐ด is proper if ๐ต ≠ ๐ด. The condition (5.14) is called downward closed. Notice that for all ๐ ∈ ๐, both (−∞, ๐) and (−∞, ๐] are initial segments of (๐, ≤). A poset is an initial segment of itself, but it is not proper. Section 5.6 REAL NUMBERS 275 DEFINITION 5.6.2 Let (๐ด, 4) be a poset with ๐ ∈ ๐ด. Define ๐๐พ๐4 (๐ด, ๐) = {๐ ∈ ๐ด : ๐ โบ ๐}. For example, ๐๐พ๐≤ (๐, 5) = (−∞, 5) in (๐, ≤), and ๐๐พ๐≤ (โค, 5) = {… , 0, 1, 2, 3, 4} in (โค, ≤). Both of these are proper initial segments. Notice that every initial segment of โค is of the form ๐๐พ๐≤ (โค, ๐) for some ๐ ∈ โค. Neither โค nor โ are well-ordered by ≤. If a poset is well-ordered, its initial segments have a particular form. LEMMA 5.6.3 If (๐ด, 4) is a well-ordered set and ๐ต is a proper initial segment of ๐ด, there exists a unique ๐ ∈ ๐ด such that ๐ต = ๐๐พ๐4 (๐ด, ๐). PROOF Suppose that (๐ด, 4) is well-ordered and ๐ต ⊆ ๐ด is a proper initial segment. First, note that ๐ด โงต ๐ต is not empty since ๐ต is proper. Thus, ๐ด โงต ๐ต contains a least element ๐ because 4 well-orders ๐ด. โ Let ๐ ∈ ๐ต, which implies that ๐ โบ ๐. Otherwise, ๐ would be an element of ๐ต because ๐ต is downward closed. Hence, ๐ ∈ ๐๐พ๐4 (๐ด, ๐), which implies that ๐ต ⊆ ๐๐พ๐4 (๐ด, ๐). โ Conversely, take ๐ ∈ ๐๐พ๐4 (๐ด, ๐), which means ๐ โบ ๐. If ๐ ∈ ๐ด โงต ๐ต, then ๐ 4 ๐ because ๐ is the least element of ๐ด โงต ๐ต, so ๐ must be an element of ๐ต by the trichotomy law (Theorem 4.3.21). Thus, ๐๐พ๐4 (๐ด, ๐ต) ⊆ ๐ต. To prove uniqueness, let ๐′ ∈ ๐ด such that ๐ ≠ ๐′ and ๐ต = ๐๐พ๐4 (๐ด, ๐′ ). If ๐ โบ ๐′ , then ๐ ∈ ๐๐พ๐4 (๐ด, ๐′ ), and if ๐′ โบ ๐, then ๐′ ∈ ๐ต = ๐๐พ๐4 (๐ด, ๐). Both cases are impossible. Dedekind Cuts A basic property of ๐ is that it is complete. This means that every nonempty set of real numbers with an upper bound has a real least upper bound (Definition 4.3.12). For example, the set { } 1 ๐ด = 1 − : ๐ ∈ ๐+ ๐ 276 Chapter 5 AXIOMATIC SET THEORY is bounded from above, and its least upper bound is 1. Also, ๐ต = {3, 3.1, 3.14, 3.141, 3.1415, 3.14159, … } is bounded from above, and its least upper bound is ๐. Observe that both ๐ด and ๐ต are sets of rational numbers. The set ๐ด has a rational least upper bound, but ๐ต does not. This shows that the rational numbers are not complete. Intuitively, the picture is that of a number line. If ๐ is graphed, there are no holes because of completeness, but if ๐ is graphed, there are holes. These holes represent the irrational numbers that when filled, complete the rational numbers resulting in the set of reals. We use this idea to construct a model of the real numbers from โ (Dedekind 1901). DEFINITION 5.6.4 A set x of rational numbers is a Dedekind cut (or a real number) if x is a subset of (โ, ≤) such that โ x is nonempty, โ x is a proper initial segment of โ , โ x does not have a greatest element. Denote the set of Dedekind cuts by โ. By Lemma 5.6.3, some Dedekind cuts are of the form ๐๐พ๐≤ (โ, ๐) for some ๐ ∈ โ. In this case, write a = ๐๐พ๐≤ (โ, ๐). Therefore, โ can be embedded in โ using the function ๐ : โ → โ defined by ๐ (๐) = a (Exercise 14). The elements of โ โงต ran(๐ ) are the irrational numbers. EXAMPLE 5.6.5 โ The Dedekind cut that corresponds to the integer 7 is 7 = ๐๐พ๐≤ (โ, 7). Note that there is no gap between 7 and โ โงต 7 = {๐ ∈ โ : ๐ ≥ 7}. โ The Dedekind cut x that corresponds to ๐ includes {3, 3.1, 3.14, 3.141, 3.1415, 3.14159, … } as a subset. Notice that ๐ ∉ x and ๐ ∉ โ โงต x because ๐ is not rational. This means that we imagine a gap between x and โโงตx. This gap is where ๐ is located. Imagine that only the rational numbers have been placed on the number line (Section 5.3). For every Dedekind cut x, call the point on the number line where x and โ โงต x meet a cut. Following our intuition, if the least upper bound of x is an element of โ โงต x, there is a point at the cut [Figure 5.1(a)]. This means that x represents a rational number, a point already on the number line. However, if the least upper bound of x is Section 5.6 REAL NUMBERS x โ๎x โ x 277 โ๎x โ The cut The cut (a) A rational number Figure 5.1 (b) An irrational number The cuts of two types of numbers. not an element of โ โงต x, there is no point at the cut [Figure 5.1(b)]. This means that x represents an irrational number. To obtain all real numbers, a point must be placed at each cut without a point, filling the entire number line. Therefore, the first step in showing that โ is a suitable model for ๐ is to prove that every set of Dedekind cuts with an upper bound must must have a least upper bound that is a Dedekind cut. That is, โ must be shown to be complete. To accomplish this, we first define on order on โ. DEFINITION 5.6.6 Let x, y ∈ โ. Define x ≤ y if and only if x ⊆ y, and define x < y to mean x ≤ y and x ≠ y. For example, 3 ≤ 4 in โ because 3 = ๐๐พ๐≤ (โ, 3) and 4 = ๐๐พ๐≤ (โ, 4). Because the order on โ is linear (Theorem 5.3.8), it is left to Exercise 5 to prove that we have defined a linear order on โ. THEOREM 5.6.7 (โ, ≤) is a linear order but it is not a well-order. Now to prove that โ is complete using the order of Definition 5.6.6. THEOREM 5.6.8 Every nonempty subset of โ with an upper bound has a real least upper bound. PROOF Let โฑ ≠ ∅ and โฑ ⊆ โ. Let m ∈ โ be an upper bound โ โ of โฑ . By Example 4.3.14, โฑ is the least upper bound of โฑ . We show that โฑ ∈ โ. โ Takeโx ∈ โฑ . Since Dedekind cuts are nonempty, x ≠ ∅, which implies that โฑ ≠ ∅. โ By hypothesis, x ⊆ m for all x ∈ โฑ . Hence, โ have that m ≠ โ, so โฑ ⊂ โ. โ โฑ ⊆ m. Since m ∈ โ, we โ โ Let ๐ฅ ∈ โฑ and ๐ฆ ≤ ๐ฅ. Thus, ๐ฅ ∈ a for โ some Dedekind cut a ∈ โฑ . Sinceโ a is downward closed, ๐ฆ ∈ a, so ๐ฆ ∈ โฑ . Hence, with the previous part, โฑ is a proper initial segment of โ. 278 Chapter 5 AXIOMATIC SET THEORY โ Let ๐ฅ ∈ a ∈ โฑ . Since a is a Dedekind cut, it has no greatest element. โ Thus, there exists ๐ฆ ∈ a such that ๐ฅ < ๐ฆ. Because ๐ฆ ∈ โฑ , we see that โ โฑ has no greatest element. Since every nonempty bounded subset of real numbers has a least upper bound in โ, the set of Dedekind cuts does not have the same issue with gaps as โ does. For example, the least upper bound of the set of real numbers {2, 2.7, 2.71, 2.718, 2.7182, 2.71828, … } is the Dedekind cut that corresponds to ๐ ∈ ๐. Arithmetic As with the other sets of numbers that have been constructed using the axioms of ๐ญ๐๐, we now define addition and multiplication on โ. First, define the following Dedekind cuts: โ 0 = ๐๐พ๐≤ (โ, 0) โ 1 = ๐๐พ๐≤ (โ, 1). Let x, y ∈ โ. That ๐ = {๐ + ๐ : ๐ ∈ x ∧ ๐ ∈ y} is a Dedekind cut is Exercise 3. Assume that 0 ≤ x and 0 ≤ y. We claim that the set ๐1 = {๐๐ : 0 ≤ ๐ ∈ x ∧ 0 ≤ ๐ ∈ y} ∪ 0 is a Dedekind cut. โ ๐1 ≠ ∅ because 0 ≠ ∅. โ Let ๐ข ∈ โ โงต x and ๐ฃ ∈ โ โงต y. Let { ๐ข if ๐ข ≥ ๐ฃ, ๐= ๐ฃ if ๐ฃ > ๐ข. Then, ๐2 ∉ ๐1 , so ๐1 ≠ โ. โ Let 0 ≤ ๐ ∈ x and 0 ≤ ๐ ∈ y, and suppose that ๐ค ∈ โ such that ๐ค < ๐๐. Hence, ๐ค๐−1 < ๐, which implies that ๐ค๐−1 ∈ x since x is downward closed. Therefore, ๐ค ∈ ๐1 because ๐ค = (๐ค๐−1 )๐. โ Again, let 0 ≤ ๐ ∈ x and 0 ≤ ๐ ∈ y. Since x and y do not have greatest elements, there exists ๐ข ∈ x and ๐ฃ ∈ y such that ๐ < ๐ข and ๐ < ๐ฃ. Then, by Exercise 5.3.15(b) ๐๐ < ๐ข๐ฃ ∈ ๐1 . 279 Section 5.6 REAL NUMBERS Furthermore, if x < 0 and y < 0, define ๐2 = {๐๐ : ๐ ∈ x ∧ ๐ ∈ y ∧ 0 ≤ −๐ ∧ 0 ≤ −๐} ∪ 0, (5.15) if x < 0 and 0 ≤ y, define ๐3 = {๐๐ : ๐ ∈ x ∧ 0 ≤ ๐ ∈ y}, (5.16) ๐4 = {๐๐ : 0 ≤ ๐ ∈ x ∧ ๐ ∈ y}, (5.17) or 0 ≤ x and y < 0, define Since ๐1 , ๐2 , ๐3 , ๐4 , and ๐ are Dedekind cuts (Exercise 4), we can use them to define the two standard operations on โ. DEFINITION 5.6.9 Let x, y ∈ โ. Define x + y = {๐ + ๐ : ๐ ∈ x ∧ ๐ ∈ y} and โง{๐๐ : 0 ≤ ๐ ∈ x ∧ 0 ≤ ๐ ∈ y} ∪ 0 โช โช{๐๐ : ๐ ∈ x ∧ ๐ ∈ y ∧ 0 ≤ −๐ ∧ 0 ≤ −๐} ∪ 0 x⋅y=โจ โช{๐๐ : ๐ ∈ x ∧ 0 ≤ ๐ ∈ y} โช{๐๐ : 0 ≤ ๐ ∈ x ∧ ๐ ∈ y} โฉ if 0 ≤ x ∧ 0 ≤ y, if x < 0 ∧ y < 0, if x < 0 ∧ 0 ≤ y, if 0 ≤ x ∧ y < 0. Since addition and multiplication on โ are associative and commutative, addition and multiplication are associative and commutative on โ. THEOREM 5.6.10 Addition and multiplication of real numbers are associative and commutative. PROOF Let x, y, z ∈ โ. We prove that addition is associative, leaving the rest to Exercise 7. x + (y + z) = x + {๐ + ๐ : ๐ ∈ y ∧ ๐ ∈ z} = {๐ + ๐ฃ : ๐ ∈ x ∧ ๐ฃ ∈ {๐ + ๐ : ๐ ∈ y ∧ ๐ ∈ z}} = {๐ + (๐ + ๐) : ๐ ∈ x ∧ (๐ ∈ y ∧ ๐ ∈ z)} = {(๐ + ๐) + ๐ : (๐ ∈ x ∧ ๐ ∈ y) ∧ ๐ ∈ z} = {๐ข + ๐ : ๐ข ∈ {๐ + ๐ : ๐ ∈ x ∧ ๐ ∈ y} ∧ ๐ ∈ z} = {๐ + ๐ : ๐ ∈ x ∧ ๐ ∈ y} + z = (x + y) + z. 280 Chapter 5 AXIOMATIC SET THEORY The Dedekind cuts 0 and 1 behave as expected. For example, 0 + 4 = {๐ + ๐ : ๐ ∈ 0 ∧ ๐ ∈ 4} = ๐๐พ๐≤ (โ, 4) = 4 and 1 ⋅ 4 = {๐๐ : 0 ≤ ๐ ∈ 1 ∧ 0 ≤ ๐ ∈ 4} ∪ 0 = ๐๐พ๐≤ (โ, 4) = 4. These equations suggest the following. THEOREM 5.6.11 โ has additive and multiplicative identities. PROOF Let x ∈ โ. We first show that x = {๐ + ๐ : ๐ ∈ x ∧ ๐ ∈ 0} = x + 0. Take ๐ข ∈ x. Since x has no greatest element, there exists ๐ฃ ∈ x such that ๐ข < ๐ฃ. Write ๐ข = ๐ฃ + (๐ข − ๐ฃ). Since ๐ข − ๐ฃ < 0, we have that ๐ข ∈ x + 0. Conversely, let ๐ ∈ 0. Since ๐ ∈ 0 implies that ๐ < 0, we have that ๐ข + ๐ < ๐ข (Exercise 5.3.16), and because x is downward closed, ๐ข + ๐ ∈ x. We next show that x = x ⋅ 1. We have two cases to consider. โ Let 0 ≤ x. By Definition 5.6.9, x ⋅ 1 = {๐ ⋅ ๐ : 0 ≤ ๐ ∈ x ∧ 0 ≤ ๐ ∈ 1} ∪ 0. Let 0 ≤ ๐ ∈ x and 0 ≤ ๐ < 1. Then, ๐๐ ≤ ๐ (Exercise 5.3.17), so ๐๐ ∈ x since x is downward closed. Conversely, take ๐ ∈ x. If ๐ < 0, then ๐ ∈ 0, and if ๐ = 0, then ๐ = 0 ⋅ 0, so suppose ๐ > 0. Since x has no greatest element, there exists ๐ข ∈ x such that ๐ < ๐ข. This implies that ๐๐ข−1 < 1, so ๐ ∈ x ⋅ 1 because ๐ = ๐ข(๐๐ข−1 ). โ Let x < 0 and proceed like in the previous case. We leave the proof of the last result to Exercise 10. THEOREM 5.6.12 โ Every element of โ has an additive inverse. โ Every nonzero element of โ has a multiplicative inverse. โ The distributive law holds for โ. Complex Numbers The last set of numbers that we define are the complex numbers. Section 5.6 REAL NUMBERS 281 DEFINITION 5.6.13 Define โ = โ × โ to be the set of complex numbers. Denote (a, b) ∈ โ by a + b๐. Observe that the standard embedding ๐ : โ → โ defined by ๐ (x) = (x, 0) allows us to consider โ as a subset of โ. We will not define an order on โ but will define the standard two operations. DEFINITION 5.6.14 Let a + b๐, c + d๐ ∈ โ. โ (a + b๐) + (c + d๐) = (a + c) + (b + d)๐. โ (a + b๐) ⋅ (c + d๐) = (ac − bd) + (ad + bc)๐. We leave the proof of the following to Exercise 11. THEOREM 5.6.15 โ ๐2 = –1 + 0๐. โ Addition and multiplication are associative and commutative. โ โ has additive and multiplicative identities. โ Every element of โ has an additive inverse. โ Every nonzero element of โ has a multiplicative inverse. โ The distributive law holds in โ. Exercises 1. Find the given initial segments in the indicate posets. (a) ๐๐พ๐โฃ (โค, 50) from Example 4.3.6 (b) ๐๐พ๐4 ({0, 1}∗ , 101010) from Example 4.3.7 (c) ๐๐พ๐⊆ (P(โค), {1, 2, 3, 4}) from Example 4.3.9 2. Let (๐ด, 4) be a poset. Assume that ๐ต and ๐ถ are initial segments of ๐ด. Prove that ๐ต ∩ ๐ถ is an initial segment of ๐ด. Is ๐ต ∪ ๐ถ also an initial segment of ๐ด? 3. Let x, y ∈ โ. Prove that ๐ = {๐ + ๐ : ๐ ∈ x ∧ ๐ ∈ y} is a Dedekind cut. 4. Prove that ๐2 (5.15), ๐3 (5.16), and ๐4 (5.17) are Dedekind cuts. 5. Prove Theorem 5.6.7. 6. Show that every Dedekind cut has an upper bound in โ. 7. Finish the proof of Theorem 5.6.10. 282 Chapter 5 AXIOMATIC SET THEORY 8. Prove that 0 and 1 are Dedekind cuts. 9. Let ๐, ๐ ∈ โ and ๐๐ = 0. Prove that ๐ = 0 or ๐ = 0. 10. Prove Theorem 5.6.12. 11. Prove Theorem 5.6.15. 12. Prove that between any two real numbers is another real number. 13. Prove that between any two real numbers is a rational number. 14. Show that โ can be embedded in โ by proving that ๐ : โ → โ defined by ๐ (๐) = ๐๐พ๐≤ (โ, ๐) is an order isomorphism preserving ≤ with ≤. Furthermore, show that both ๐ and โค can be embedded into โ. 15. Let ๐ be the function defined in Exercise 14. Show that ๐ (๐ + 1) is an upper bound of ๐ (๐) for all ๐ ∈ โ. 16. The absolute value function (Exercise 2.4.23) can be defined so that for all x ∈ โ, |x| = x ∪ −x, where −x refers to the additive inverse of x (Theorem 5.6.12). Let a be a positive real number. Prove the following for every x, y ∈ โ. (a) | − x| = |x|. (b) |x2 | = |x|2 . (c) x ≤ |x|. (d) |xy| = |x| |y|. (e) |x| < a if and only if −a < x < a. (f) a < |x| if and only if a < x or x < −a. CHAPTER 6 ORDINALS AND CARDINALS 6.1 ORDINAL NUMBERS In Chapter 5, we defined certain sets to represent collections of numbers. Despite being sets themselves, the elements of those sets were called numbers. We continue this association with sets as numbers but for a different purpose. While before we defined ๐, โค, โ, โ, and โ to represent ๐, ๐, ๐, ๐, and ๐, the definitions of this chapter are intended to be a means by which all sets can be classified according to a particular criterion. Specifically, in the later part of the chapter, we will define sets for the purpose of identifying the size of a given set, and we begin the chapter by defining sets that are used to identify whether two well-ordered sets have the same order type (Definition 4.5.24). A crucial tool in this pursuit is the following generalization of Theorem 5.5.1 to wellordered infinite sets. THEOREM 6.1.1 [Transfinite Induction 1] Let (๐ด, 4) be a well-ordered set. If ๐ต ⊆ ๐ด and ๐๐พ๐4 (๐ด, ๐ฅ) ⊆ ๐ต implies ๐ฅ ∈ ๐ต for all ๐ฅ ∈ ๐ด, then ๐ด = ๐ต. 283 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 284 Chapter 6 ORDINALS AND CARDINALS PROOF To show that ๐ด is a subset of ๐ต, suppose that ๐ด โงต ๐ต is nonempty. Since ๐ด is well-ordered by 4, let ๐ be the least element of ๐ด โงต ๐ต. This implies that ๐๐พ๐4 (๐ด, ๐) ⊆ ๐ต, so ๐ ∈ ๐ต by hypothesis, a contradiction. Note that transfinite induction restricted to ๐ is simply strong induction (Theorem 5.5.1). To see this, let the well-ordered set (๐ด, 4) of Theorem 6.1.1 be (๐, ≤). Define the set ๐ต = {๐ : ๐(๐)} ⊆ ๐ for some formula ๐(๐). The conditional ๐๐พ๐≤ (๐, ๐) ⊆ ๐ต → ๐ ∈ ๐ต implies ๐(0) when ๐ = 0 because ๐๐พ๐≤ (๐, 0) = ∅ and implies ๐(0) ∧ ๐(1) ∧ · · · ∧ ๐(๐ − 1) → ๐(๐) when ๐ > 0 because ๐๐พ๐≤ (๐, ๐) = ๐. Our first use of transfinite induction is the following lemma. It uses the terminology of Exercise 4.4.32 and is the first of a sequence of lemmas that will play a critical role. LEMMA 6.1.2 Let (๐ด, 4) be well-ordered. If ๐ : ๐ด → ๐ด is increasing, then ๐ 4 ๐(๐) for all ๐ ∈ ๐ด. PROOF Define ๐ต = {๐ฅ ∈ ๐ด : ๐ฅ 4 ๐(๐ฅ)}, where ๐ is an increasing function ๐ด → ๐ด. Let ๐๐พ๐4 (๐ด, ๐) ⊆ ๐ต. We note that ๐ is the least element of ๐ด โงต ๐๐พ๐4 (๐ด, ๐). Let ๐ฆ ∈ ๐๐พ๐4 (๐ด, ๐). This implies that ๐ฆ 4 ๐(๐ฆ) โบ ๐(๐) by definition of ๐ต and because ๐ฆ โบ ๐. Hence, ๐(๐) ∈ ๐ด โงต ๐๐พ๐4 (๐ด, ๐). Thus, ๐ 4 ๐(๐) and ๐ด = ๐ต by transfinite induction (Theorem 6.1.1). LEMMA 6.1.3 For all well-ordered sets (๐ด, 4) and (๐ด′ , 4′ ), there exists at most one order isomorphism ๐ : ๐ด → ๐ด′ . PROOF Let ๐ : ๐ด → ๐ด′ and ๐ : ๐ด → ๐ด′ be order isomorphisms. Since both ๐−1 and ๐ −1 are order isomorphisms ๐ด′ → ๐ด (Theorem 4.5.26), ๐ −1 โฆ ๐ and ๐−1 โฆ ๐ are order isomorphisms ๐ด → ๐ด (Theorem 4.5.27). We note that for every ๐, ๐ ∈ ๐ด, if ๐ โบ ๐, then ๐(๐) โบ′ ๐(๐) and then ๐ −1 (๐(๐)) โบ ๐ −1 (๐(๐)). This means that ๐ −1 โฆ ๐ is increasing. A similar argument proves that ๐−1 โฆ ๐ is increasing. To show that ๐ = ๐, let ๐ ∈ ๐ด. By Lemma 6.1.2, ๐ 4 (๐ −1 โฆ ๐)(๐) and ๐ 4 (๐−1 โฆ ๐)(๐). Therefore, ๐(๐) 4′ ๐(๐) and ๐(๐) 4′ ๐(๐). Since 4′ is antisymmetric, we have that ๐(๐) = ๐(๐). Section 6.1 ORDINAL NUMBERS 285 LEMMA 6.1.4 No well-ordered set (๐ด, 4) is order isomorphic to any of its proper initial segments. PROOF Let (๐ด, 4) be a well-ordered set. Suppose that ๐ is a proper initial segment of ๐ด. In order to obtain a contradiction, assume that ๐ : ๐ด → ๐ is an order isomorphism. Take ๐ ∈ ๐ด โงต ๐. Since ๐(๐) ∈ ๐ and ๐ is increasing, we have that ๐ 4 ๐(๐) โบ ๐ by Lemma 6.1.2. The next result follows from Lemma 6.1.4 (Exercise 1). LEMMA 6.1.5 Distinct initial segments of a well-ordered set are not order isomorphic. The lemmas lead to the following theorem. THEOREM 6.1.6 If (๐ด, 4) and (๐ต, 4′ ) are well-ordered sets, there exists an order isomorphism such that exactly one of the following holds. โ ๐ด ≅ ๐ต. โ ๐ด is order isomorphic to a proper initial segment of ๐ต. โ ๐ต is order isomorphic to a proper initial segment of ๐ด. PROOF Let (๐ด, 4) and (๐ต, 4′ ) be well-ordered sets. Appealing to Lemma 6.1.5, if ๐ฅ ∈ ๐ด, there is at most one ๐ฆ ∈ ๐ต such that ๐๐พ๐4 (๐ด, ๐ฅ) ≅ ๐๐พ๐4′ (๐ต, ๐ฆ), so define the function ๐ = {(๐ฅ, ๐ฆ) ∈ ๐ด × ๐ต : ๐๐พ๐4 (๐ด, ๐ฅ) ≅ ๐๐พ๐4′ (๐ต, ๐ฆ)}. We have a number of facts to prove. โ Let ๐ฆ1 , ๐ฆ2 ∈ ran(๐) such that ๐ฆ1 = ๐ฆ2 . Take ๐ฅ1 , ๐ฅ2 ∈ ๐ด such that ๐๐พ๐4 (๐ด, ๐ฅ1 ) ≅ ๐๐พ๐4′ (๐ต, ๐ฆ1 ) and ๐๐พ๐4 (๐ด, ๐ฅ2 ) ≅ ๐๐พ๐4′ (๐ต, ๐ฆ2 ). Then, we have ๐๐พ๐4 (๐ด, ๐ฅ1 ) ≅ ๐๐พ๐4 (๐ด, ๐ฅ2 ), and ๐ฅ1 = ๐ฅ2 by Lemma 6.1.5. Therefore, ๐ is one-to-one. โ Take ๐ฅ1 , ๐ฅ2 ∈ dom(๐) and assume that ๐ฅ1 4 ๐ฅ2 . This implies that ๐๐พ๐4 (๐ด, ๐ฅ1 ) ⊆ ๐๐พ๐4 (๐ด, ๐ฅ2 ). 286 Chapter 6 ORDINALS AND CARDINALS Then, by definition of ๐, we have ๐๐พ๐4 (๐ด, ๐ฅ1 ) ≅ ๐๐พ๐4′ (๐ต, ๐(๐ฅ1 )) and ๐๐พ๐4 (๐ด, ๐ฅ2 ) ≅ ๐๐พ๐4′ (๐ต, ๐(๐ฅ2 )). Hence, ๐๐พ๐4′ (๐ต, ๐(๐ฅ1 )) is order isomorphic to an initial segment ๐ of ๐๐พ๐4′ (๐ต, ๐(๐ฅ2 )) (Exercise 17). If ๐ ≠ ๐๐พ๐4′ (๐ต, ๐(๐ฅ1 )), then ๐ต has two distinct isomorphic initial segments, contradicting Lemma 6.1.5. This implies that ๐(๐ฅ1 ) 4′ ๐(๐ฅ2 ), so ๐ is order-preserving. โ Let ๐ฅ1 , ๐ฅ2 ∈ ๐ด. Suppose that ๐ฅ1 4 ๐ฅ2 and ๐ฅ2 ∈ dom(๐). This means that there exists ๐ฆ2 ∈ ๐ต such that ๐๐พ๐4 (๐ด, ๐ฅ2 ) ≅ ๐๐พ๐4′ (๐ต, ๐ฆ2 ). If ๐ฅ1 = ๐ฅ2 , then ๐ฅ1 ∈ dom(๐), so assume that ๐ฅ1 ≠ ๐ฅ2 . Since ๐ฅ1 โบ ๐ฅ2 , we have that ๐ฅ1 ∈ ๐๐พ๐4 (๐ด, ๐ฅ2 ). Because ๐ is order-preserving, ๐๐พ๐4 (๐ด, ๐ฅ1 ) ≅ ๐๐พ๐4′ (๐ต, ๐ฆ1 ) for some ๐ฆ1 ∈ ๐๐พ๐4′ (๐ต, ๐ฆ2 ) (Exercise 17). Therefore, (๐ฅ1 , ๐ฆ1 ) ∈ ๐, so ๐ฅ1 ∈ dom(๐), proving that the domain of ๐ is an initial segment of ๐ด. โ That the range of ๐ is an initial segment of ๐ต is proved like the previous case. If ๐ is a surjection and dom(๐) = ๐ด, then ๐ is an order isomorphism ๐ด → ๐ต, else ๐−1 [๐ต] is a proper initial segment of ๐ด. If ๐ is not a surjection, ๐[๐ด] is a proper initial segment of ๐ต. Ordinals Theorem 6.1.6 is a sort of trichotomy law for well-ordered sets. Two well-ordered sets look alike, or one has a copy of itself in the other. This suggests that we should be able to choose certain well-ordered sets to serve as representatives of all the different types of well-ordered sets. No two of the chosen sets should be order isomorphic, but it should be the case that every well-ordered set is order isomorphic to exactly one of them. That is, we should be able to classify all of the well-ordered sets. This will be our immediate goal and is the purpose behind the next definition. DEFINITION 6.1.7 The set ๐ผ is an ordinal number (or simply an ordinal) if (๐ผ, ⊆) is a well-ordered set and ๐ฝ = ๐๐พ๐⊆ (๐ผ, ๐ฝ) for all ๐ฝ ∈ ๐ผ. For ordinals, define ๐๐พ๐(๐ผ, ๐ฝ) = ๐๐พ๐⊆ (๐ผ, ๐ฝ). Section 6.1 ORDINAL NUMBERS 287 Definition 6.1.7 implies that ๐ and every natural number is an ordinal because they are well-ordered by ⊆ and for all ๐ ∈ ๐ โงต {0}, ๐ = {0, 1, 2, … , ๐ − 1} = ๐๐พ๐(๐, ๐), and for all ๐ ∈ ๐, ๐ = {0, 1, 2, … , ๐ − 1} = ๐๐พ๐(๐, ๐). For example, 5 ∈ 7 and 5 = {0, 1, 2, 3, 4} = ๐๐พ๐(7, 5). We now prove a sequence of basic results about ordinals. The first is similar to Theorem 5.2.8, so its proof is left to Exercise 5. THEOREM 6.1.8 Ordinals are transitive sets. THEOREM 6.1.9 The elements of ordinals are transitive sets. PROOF Let ๐ผ be an ordinal and ๐ฝ ∈ ๐ผ. Take ๐พ ∈ ๐ฝ and ๐ฟ ∈ ๐พ. Since ๐ผ is transitive (Theorem 6.1.8), we have that ๐พ ∈ ๐ผ. Therefore, ๐พ = ๐๐พ๐(๐ผ, ๐พ) and ๐ฝ = ๐๐พ๐(๐ผ, ๐ฝ), so ๐ฟ ∈ ๐๐พ๐(๐ผ, ๐พ) ⊆ ๐๐พ๐(๐ผ, ๐ฝ) = ๐ฝ, which implies that ๐ฝ is transitive (Definition 5.2.7). THEOREM 6.1.10 Every element of an ordinal is an ordinal. PROOF Let ๐ผ be an ordinal and ๐ฝ ∈ ๐ผ. Notice that this implies that ๐ฝ is transitive (Theorem 6.1.9). Since ๐ฝ ⊆ ๐ผ, we have that (๐ฝ, ⊆) is a well-ordered set by a subset axiom (5.1.8) and Theorem 4.3.26. Now take ๐ฟ ∈ ๐ฝ. Since ๐ฟ ∈ ๐ผ, we have that ๐ฟ is transitive. Therefore, by Exercise 5.2.3, ๐ฟ = {๐พ : ๐พ ∈ ๐ฟ} = {๐พ : ๐พ ∈ ๐ฝ ∧ ๐พ ∈ ๐ฟ} ⊆ {๐พ : ๐พ ∈ ๐ฝ ∧ ๐พ ⊂ ๐ฟ} = ๐๐พ๐(๐ฝ, ๐ฟ) ⊆ ๐๐พ๐(๐ผ, ๐ฟ) = ๐ฟ. From this, we conclude that ๐ฟ = ๐๐พ๐(๐ฝ, ๐ฟ). 288 Chapter 6 ORDINALS AND CARDINALS THEOREM 6.1.11 Let ๐ผ and ๐ฝ be ordinals. Then, ๐ผ ⊂ ๐ฝ if and only if ๐ผ ∈ ๐ฝ. PROOF If ๐ผ ∈ ๐ฝ, then ๐ผ ⊂ ๐ฝ because ๐ฝ is transitive (Theorem 6.1.8 and Exercise 5.2.3). Conversely, suppose that ๐ผ ⊂ ๐ฝ. Let ๐พ ∈ ๐ผ and ๐ฟ ⊂ ๐พ with ๐ฟ ∈ ๐ฝ. Since ๐ฝ is an ordinal, ๐ฟ = ๐๐พ๐(๐ฝ, ๐ฟ). Hence, ๐ฟ = ๐๐พ๐(๐พ, ๐ฟ), which implies that ๐ฟ ∈ ๐พ because ๐พ is an ordinal by Theorem 6.1.10. Therefore, ๐ฟ ∈ ๐ผ because ๐ผ is transitive (Theorem 6.1.8). This shows that ๐ผ is a proper initial segment of ๐ฝ with respect to ⊆ (Definition 5.6.1). From this, it follows by Lemma 5.6.3 that ๐ผ = ๐๐พ๐(๐ฝ, ๐) for some ๐ ∈ ๐ฝ. Hence, ๐ผ ∈ ๐ฝ. THEOREM 6.1.12 Every ordinal is well-ordered by ∈. The next theorem is an important part of the process of showing that the ordinals are the sets that classify all well-ordered sets according to their order types. It states that distinct ordinals are not order isomorphic with respect to ⊆. THEOREM 6.1.13 For all ordinals ๐ผ and ๐ฝ, if (๐ผ, ⊆) ≅ (๐ฝ, ⊆), then ๐ผ = ๐ฝ. PROOF Let ๐ : ๐ผ → ๐ฝ be an order isomorphism preserving ⊆. Define ๐ด = {๐พ ∈ ๐ผ : ๐(๐พ) = ๐พ}. Take ๐ฟ ∈ ๐ผ and assume that ๐๐พ๐(๐ผ, ๐ฟ) ⊆ ๐ด. Then, ๐(๐ฟ) = ๐๐พ๐(๐ฝ, ๐(๐ฟ)) = ๐[๐๐พ๐(๐ผ, ๐ฟ)] = {๐(๐พ) : ๐พ ∈ ๐ผ ∧ ๐พ ⊂ ๐ฟ} = {๐พ : ๐พ ⊂ ๐ฟ} = ๐ฟ. The first equality follows because ๐(๐ฟ) is an ordinal in ๐ฝ, the second follows because ๐ is an order isomorphism, and the fourth equation follows by the assumption. Therefore, by transfinite induction (Theorem 6.1.1), ๐ด = ๐ผ, so ๐ is the identity map and ๐ผ = ๐ฝ. Because of Theorem 6.1.13, we are able to prove that there is a trichotomy law for the ordinals with respect to ⊆. THEOREM 6.1.14 [Trichotomy] For all ordinals ๐ผ and ๐ฝ, exactly one of the following holds: ๐ผ = ๐ฝ, ๐ผ ⊂ ๐ฝ, or ๐ผ ⊂ ๐ฝ. Section 6.1 ORDINAL NUMBERS 289 PROOF Since (๐ผ, ⊆) and (๐ฝ, ⊆) are well-ordered, by Theorem 6.1.6, exactly one of the following holds. โ ๐ผ ≅ ๐ฝ, which implies that ๐ผ = ๐ฝ by Theorem 6.1.13. โ There exists ๐ฟ ∈ ๐ฝ such that ๐ผ ≅ ๐๐พ๐(๐ฝ, ๐ฟ). Since ๐๐พ๐(๐ฝ, ๐ฟ) is an ordinal (Theorem 6.1.10), ๐ผ = ๐๐พ๐(๐ฝ, ๐ฟ), again by Theorem 6.1.13. Therefore, ๐ผ ⊂ ๐ฝ. โ There exists ๐พ ∈ ๐ผ such that ๐ฝ ≅ ๐๐พ๐(๐ผ, ๐พ). As in the previous case, we have that ๐ฝ ⊂ ๐ผ. Because of Theorem 6.1.11, we can quickly conclude the following. COROLLARY 6.1.15 For all ordinals ๐ผ and ๐ฝ, exactly one of the following holds: ๐ผ = ๐ฝ, ๐ผ ∈ ๐ฝ, or ๐ผ ∈ ๐ฝ. In addition to the ordinals having a trichotomy law, the least upper bound with respect to ⊆ of a set of ordinals is also an ordinal (compare Example 4.3.14). THEOREM 6.1.16 If โฑ is a set of ordinals, โ โฑ is an ordinal. PROOF โ We show that โฑ satisfies โ the conditions of Definition 6.1.7. Since the elements of ordinals are ordinals, โฑ is a set of ordinals,โ and by Theorem 6.1.14, we see โ that ( โฑ , ⊆) is a linearly ordered set. Let ๐ต ⊆ โฑ and take ๐ผ ∈ ๐ต. We have two cases to consider. โ Suppose ๐ผ ∩ ๐ต = ∅. Let ๐ฝ ∈ ๐ต. Then, ๐ฝ ∉ ๐ผ, so by Theorem 6.1.11 and Theorem 6.1.14, ๐ผ ⊆ ๐ฝ. Hence, ๐ผ is the least element of ๐ต. โ Let ๐ผ ∩ ๐ต be nonempty. Since ๐ผ is an ordinal, there exists an ordinal ๐ฟ that is the least element of ๐ผ ∩ ๐ต with respect to ⊆. Let ๐ฝ ∈ ๐ต. If ๐ฝ ⊂ ๐ผ, then ๐ฝ ∈ ๐ผ ∩ ๐ต, which implies that ๐ฟ ⊆ ๐ฝ. Also, if ๐ผ ⊆ ๐ฝ, then ๐ฟ ⊂ ๐ฝ. Since these are the only two options (Theorem 6.1.14), this implies that ๐ฟ is the least element of ๐ต. โ We conclude that ( โฑ , ⊆) is a well-ordered set. โ Next, let ๐ฝ ∈ โฑ. โ This means that there exists an ordinal ๐ผ ∈ ๐ด such that ๐ฝ ∈ ๐ผ. Since ๐๐พ๐( โฑ , ๐ฝ) ⊆ ๐ฝ by definition, โ take ๐ฟ ∈ ๐ฝ. Since ๐ผ is transitiveโ(Theorem 6.1.8), ๐ฟ ∈ ๐ผ. Therefore, ๐ฟ ∈ โฑ , which implies that ๐ฝ ⊆ ๐๐พ๐( โฑ , ๐ฝ). 290 Chapter 6 ORDINALS AND CARDINALS Classification Let ๐ผ be an ordinal number. We check the two conditions of Definition 6.1.7 to show that ๐ผ + is an ordinal. โ Let ๐ต be a nonempty subset of ๐ผ ∪ {๐ผ}. If ๐ต ∩ ๐ผ ≠ ∅, then ๐ต has a least element with respect to ⊆ since (๐ผ, ⊆) is well-ordered. If ๐ต = {๐ผ}, then ๐ผ is the least element of ๐ต. โ Let ๐ฝ ∈ ๐ผ ∪ {๐ผ}. If ๐ฝ ∈ ๐ผ, then ๐ฝ = ๐๐พ๐(๐ผ, ๐ฝ) = ๐๐พ๐(๐ผ + , ๐ฝ) because ๐ผ is an ordinal number. Otherwise, ๐ฝ = ๐ผ = ๐๐พ๐(๐ผ + , ๐ผ). The ordinal ๐ผ + is called a successor ordinal because it has a predecessor. For example, every positive natural number is a successor ordinal. Now assume that ๐ผ ≠ ∅ and ๐ผ is an ordinal that is not a successor. โ Let ๐ฟโ∈ ๐ฝ ∈ ๐ผ. Since ๐ผ is transitive (Theorem 6.1.8), ๐ฟ ∈ ๐ผ. Thus, we conclude that {๐ฝ : ๐ฝ ∈ ๐ผ} ⊆ ๐ผ. โ Now take ๐ฟ ∈ ๐ผ. This implies that ๐ฟ is an ordinal (Theorem 6.1.10), so ๐ฟ ⊂ ๐ผ by Theorem 6.1.11. Therefore, ๐ฟ + ⊆ ๐ผ, so ๐ฟ + ⊂ ๐ผ since ๐ผ is not a successor. Thus, ๐ฟ + ∈โ๐ผ, again by appealing to Theorem 6.1.11. Because ๐ฟ ∈ ๐ฟ + , we have that ๐ผ ⊆ {๐ฝ : ๐ฝ ∈ ๐ผ}. We conclude that for every nonempty ordinal ๐ผ that is not a successor, โ ๐ผ= ๐ฝ. ๐ฝ∈๐ผ Such an ordinal number is called a limit ordinal. For example, since every natural numโ ber is an ordinal, ๐ = {๐ : ๐ ∈ ๐} is a limit ordinal. Therefore, ๐+ , ๐++ , ๐+++ , … are also ordinals, but they are successors. All of this proves the following. THEOREM 6.1.17 A nonempty ordinal is either a successor or a limit ordinal. Therefore, by Theorem 6.1.14 and Corollary 6.1.15, we can view the ordinals as sorted by ⊂ giving 0 ⊂ 1 ⊂ 2 ⊂ · · · ⊂ ๐ ⊂ ๐+ ⊂ ๐++ ⊂ ๐+++ ⊂ · · · and as sorted by ∈ giving 0 ∈ 1 ∈ 2 ∈ · · · ∈ ๐ ∈ ๐+ ∈ ๐++ ∈ ๐+++ ∈ · · · . Characterizing every ordinal as being equal to 0, a successor ordinal, or a limit ordinal allows us to restate Theorem 6.1.1. The form of the theorem generalizes Theorem 5.4.1 to infinite ordinals. Its proof is left to Exercise 8. Section 6.1 ORDINAL NUMBERS 291 THEOREM 6.1.18 [Transfinite Induction 2] If ๐ผ is an ordinal and ๐ด ⊆ ๐ผ, then ๐ด = ๐ผ if the following hold: โ 0 ∈ ๐ด. โ If ๐ฝ ∈ ๐ด, then ๐ฝ + ∈ ๐ด. โ If ๐ฝ is a limit ordinal such that ๐ฟ ∈ ๐ด for all ๐ฟ ∈ ๐ฝ, then ๐ฝ ∈ ๐ด. We use this second form of transfinite induction to prove the sought-after classification theorem for well-ordered sets. THEOREM 6.1.19 Let (๐ด, 4) be a well-ordered set. Then, (๐ด, 4) ≅ (๐ผ, ⊆) for some ordinal ๐ผ. PROOF Define ๐(๐ฅ, ๐ฆ) := ๐ฆ is an ordinal ∧ ๐๐พ๐4 (๐ด, ๐ฅ) ≅ ๐ฆ. Let ๐ต = {๐ฆ : ∃๐ฅ [๐ฅ ∈ ๐ด ∧ ๐(๐ฅ, ๐ฆ)]}. By Theorem 6.1.13, ๐(๐ฅ, ๐ฆ) defines a function, so by a replacement axiom (5.1.9), we conclude that ๐ต is a set. We have a number of items to prove. Let ๐ท ⊆ ๐ต and ๐ท ≠ ∅. Let ๐ถ = {๐ ∈ ๐ด : ∃๐ผ[๐ผ ∈ ๐ท ∧ ๐(๐, ๐ผ)]}. Observe that ๐ถ is not empty. Therefore, there exists a least element ๐ ∈ ๐ถ with respect to 4. Take an ordinal ๐ฟ0 ∈ ๐ท such that ๐๐พ๐4 (๐ด, ๐) ≅ ๐ฟ0 . Let ๐ฟ ∈ ๐ท. This means that ๐ฟ is an ordinal and ๐๐พ๐4 (๐ด, ๐) ≅ ๐ฟ for some ๐ ∈ ๐ถ. Since ๐ 4 ๐, we have that ๐๐พ๐4 (๐ด, ๐) ⊆ ๐๐พ๐4 (๐ด, ๐). Hence, ๐ฟ0 is isomorphic to a subset of ๐ฟ, which implies that ๐ฟ0 ⊆ ๐ฟ (Theorem 6.1.13). We conclude that (๐ต, ⊆) is a well-ordered set. Let ๐ธ = {๐ฝ ∈ ๐ต : ๐๐พ๐(๐ต, ๐ฝ) = ๐ฝ}. Let ๐๐พ๐(๐ต, ๐) ⊆ ๐ธ for ๐ ∈ ๐ต. โ First, suppose that ๐ = ๐พ + for some ordinal ๐พ. Then, ๐๐พ๐(๐ต, ๐พ) = ๐พ, so ๐๐พ๐(๐ต, ๐พ) ∪ {๐พ} = ๐พ + . Also, ๐พ + ≅ ๐๐พ๐4 (๐ด, ๐) for some ๐ ∈ ๐ด. Let ๐ be the greatest element of ๐๐พ๐4 (๐ด, ๐) (Exercise 9). This implies that ๐พ ≅ ๐๐พ๐4 (๐ด, ๐)โงต{๐}, so ๐พ ∈ ๐ต. Hence, ๐๐พ๐(๐ต, ๐พ) ∪ {๐พ} = ๐๐พ๐(๐ต, ๐พ + ), 292 Chapter 6 ORDINALS AND CARDINALS and we have ๐ ∈ ๐ธ. โ โ Second, let ๐ = {๐พ : ๐พ ∈ ๐}. This means that ๐๐พ๐(๐ต, ๐พ) = ๐พ for all ๐พ ∈ ๐. Therefore, โ โ ๐๐พ๐(๐ต, ๐) = ๐๐พ๐(๐ต, ๐พ) = ๐พ = ๐, ๐พ∈๐ ๐พ∈๐ and ๐ is again an element of ๐ธ. By transfinite induction (Theorem 6.1.18), ๐ธ = ๐ต. This combined with (๐ต, ⊆) being a well-ordered set means that ๐ต is an ordinal. Define ๐ : ๐ด → ๐ต by ๐(๐ฅ) = ๐ฆ ⇔ ๐(๐ฅ, ๐ฆ). Since ๐ is an order isomorphism (Exercise 10), ๐ต is an ordinal that is order isomorphic to (๐ด, 4), and because of Theorem 6.1.13, it is the only one. For any well-ordered set (๐ด, โชฏ), the unique ordinal ๐ผ such that ๐ด ≅ ๐ผ guaranteed by Theorem 6.1.19 is called the order type of ๐ด. Compare this definition with Definition 4.5.24. For example, the order type of ({2๐ : ๐ > 5 ∧ ๐ ∈ โค}, ≤) is ๐. Burali-Forti and Hartogs โ Suppose ๐ = {0, 4, 6, 9}. Then, ๐ equals the ordinal โ 9, which is the least upper bound of ๐ . Also, assume that โฌ = {5, 100, ๐}. Then, โฌ = ๐. However, the least upper bound of ๐ = {๐ ∈ ๐ :โ∃๐(๐ ∈ ๐ ∧ ๐ = 2๐)} is not an element of ๐. Instead, the least upper bound of ๐ is ๐ = ๐. Moreover, notice that ๐ ⊆ 10, โฌ ⊆ ๐+ , and ๐ ⊆ ๐+ . We generalize this to the next theorem. THEOREM 6.1.20 If โฑ is a set of ordinals, there exists an ordinal ๐ผ such that โฑ ⊆ ๐ผ. PROOF โ โ Take โฑ to be a set of ordinals and โ let ๐ผ ∈ โฑ . Then, โ ๐ผ ⊆ โฑ and โฑ is an ordinal by Theorem 6.1.16. If ๐ผ ⊂ โฑ , then ๐ผ ∈ โฑ by Theorem 6.1.11. If (โ )+ โ โ ๐ผ = โฑ , then ๐ผ ∈ { โฑ }. Thus, โฑ ⊆ โฑ . Although every set of ordinals is a subset of an ordinal, there is no set of all ordinals, otherwise a contradiction would arise, as was first discovered by Cesare Burali-Forti (1897). This is why when we noted that ⊆ gives the ordinals a linear order, we did not claim that ⊆ is used to define a linearly ordered set containing all ordinals. THEOREM 6.1.21 [Burali-Forti] There is no set that has every ordinal as an element. PROOF โ Suppose โฑ = {๐ผ : ๐ผ is an ordinal} is a set. This implies that โฑ is an ordinal by Theorem 6.1.16.โHowever, for ∈ ๐ผ + ∈ ๐ด, โ every ๐ผ ∈ โฑ , we haveโthat ๐ผ โ showing that โฑ ⊆ โฑ . Since โฑ ∈ โฑ , we also have โฑ ∈ โฑ , which contradicts Theorem 5.1.16. Section 6.1 ORDINAL NUMBERS 293 The Burali-Forti theorem places a limit on what can be done with ordinals. One such example is a theorem of Friedrich Hartogs. THEOREM 6.1.22 [Hartogs] For every set ๐ด, there exists an ordinal ๐ผ such that there are no injections of ๐ผ into ๐ด. PROOF Let ๐ด be a set. Define โฐ = {๐ผ : ๐ผ is an ordinal ∧ ∃๐(๐ is an injection ๐ผ → ๐ด)}. Notice that for every ๐ผ ∈ โฐ , there exists a bijection ๐๐ผ such that ๐๐ผ : ๐ผ → ๐ต๐ผ for some ๐ต๐ผ ⊆ ๐ด. Define a well-order 4๐ผ on ๐ต๐ผ by ๐๐ผ (๐ฝ1 ) 4๐ผ ๐๐ผ (๐ฝ2 ) if and only if ๐ฝ1 ⊆ ๐ฝ2 for all ๐ฝ1 , ๐ฝ2 ∈ ๐ผ. Then, ๐๐ผ is an order isomorphism preserving ⊆ with 4๐ผ . Next, define โฑ = {(๐ต, ≤) : ๐ต ⊆ ๐ด ∧ ≤ is a well-ordering of ๐ต}. Since โฑ ⊆ P(๐ด) × P(๐ด × ๐ด), we have that โฑ is a set by the Power Set Axiom (5.1.7) and a Subset Axiom (5.1.8). Let ๐(๐ฅ, ๐ฆ) := ๐ฅ ∈ โฑ ∧ ∃๐พ(๐พ is an order isomorphism ๐ฅ → ๐ฆ preserving the order on ๐ฅ with ⊆}. Suppose that ๐((๐ต, ≤), ๐ผ1 ) and ๐((๐ต, ≤), ๐ผ2 ). By Theorem 6.1.13, we have that ๐ผ1 = ๐ผ2 , so ๐(๐ฅ, ๐ฆ) defines a function with domain โฑ . Moreover, โฐ is a subset of the range of this function because ๐((๐ต๐ผ , 4๐ผ ), ๐ผ) due to ๐๐ผ . Therefore, โฐ is a set by a replacement axiom (5.1.9) and a subset axiom, and โฐ cannot contain all ordinals by the Burali-Forti theorem (6.1.21). Transfinite Recursion Theorem 6.1.19 only applies to well-ordered sets, so, for example, it does not apply to (โค, ≤) or (โ, ≤). However, if we change the order on โค from the standard ≤ to 4 defined so as to put โค into this order, 0, 1, −1, 2, −2, 3, −3, … , then (โค, 4) is a well-ordered set of order type ๐. That this can be done even with sets like โ is due to a theorem first proved by Zermelo, which is often called the wellordering theorem. Its proof requires some preliminary work. Let ๐ด be a set and ๐ผ an ordinal. By Definition 4.4.13, ๐ผ๐ด is the set of all functions ๐ผ → ๐ด. Along these lines, define ๐ด = {๐ : ∃๐ฝ(๐ฝ ∈ ๐ผ ∧ ๐ is a function ๐ฝ → ๐ด)}. <๐ผ 294 Chapter 6 ORDINALS AND CARDINALS For example, ๐ , ๐ ∈ <5 โค, where ๐ = {(0, 1), (1, 2), (2, 3), (3, 4)} and ๐ = {(0, −4), (1, 14)}. Also, ๐ , ๐ ∈ <๐ โค, but the identity function on ๐ is not an element of <๐ โค because its domain is ๐. We should also note that for any set ๐ด, <∅ ๐ด = ∅. We use this notation in the following generalization of recursion to infinite ordinals. THEOREM 6.1.23 [Transfinite Recursion] Let ๐ผ be an ordinal. For every function ๐ : function ๐ : ๐ผ → ๐ด such that for every ๐ฝ ∈ ๐ผ, <๐ผ๐ด → ๐ด, there exists a unique ๐(๐ฝ) = ๐(๐ ๐ฝ). PROOF To prove uniqueness, in addition to ๐, let ๐′ be a function ๐ผ → ๐ด such that for all ๐ฝ ∈ ๐ผ, ๐′ (๐ฝ) = ๐(๐′ ๐ฝ). Define ๐ต = {๐ฝ ∈ ๐ผ : ๐(๐ฝ) = ๐′ (๐ฝ)}. We use transfinite induction (Theorem 6.1.1) to show that ๐ต = ๐ผ. Suppose seg(๐ผ, ๐ฟ) ⊆ ๐ต with ๐ฟ ∈ ๐ผ. That is, ∀๐ฝ[๐ฝ ∈ ๐ฟ → ๐(๐ฝ) = ๐′ (๐ฝ)]. This implies that ๐ ๐ฟ = ๐′ ๐ฟ. Therefore, ๐(๐ฟ) = ๐(๐ ๐ฟ) = ๐(๐′ ๐ฟ) = ๐′ (๐ฟ), so ๐ฟ ∈ ๐ต, and we conclude that ๐ = ๐′ . We prove existence indirectly. Suppose that ๐ผ is the least ordinal (Exercise 2) such that there exists a function ๐0 : <๐ผ๐ด → ๐ด such that for every ๐ : ๐ผ → ๐ด, there exists ๐ฝ ∈ ๐ผ such that ๐(๐ฝ) ≠ ๐0 (๐ ๐ฝ). Since the theorem is trivially true for ๐ผ = 0, we have two cases to consider. โ Let ๐ผ = ๐ฟ + for some ordinal ๐ฟ. By minimality of ๐ผ, we have a function ๐๐ฟ : ๐ฟ → ๐ด such that for all ๐ฝ ∈ ๐ฟ, ๐๐ฟ (๐ฝ) = ๐0 (๐๐ฟ ๐ฝ). Section 6.1 ORDINAL NUMBERS 295 Extend ๐๐ฟ to ๐ฬ ๐ฟ : ๐ผ → ๐ด by defining ๐ฬ ๐ฟ (๐ฟ) = ๐0 (๐๐ฟ ), so ๐ฬ ๐ฟ (๐ฟ) = ๐0 (๐ฬ ๐ฟ ๐ฟ). This contradicts the minimality of ๐ผ. โ Let ๐ผ be a limit ordinal. For each ๐ฟ ∈ ๐ผ, there exists a unique ๐๐ฟ : ๐ฟ → ๐ด such that ๐๐ฟ (๐ฟ) = ๐(๐๐ฟ ๐ฟ). Notice that ๐ฟ ∈ ๐พ implies that ๐๐พ is an extension of ๐๐ฟ , otherwise ๐๐พ ๐ฟ would have the property (๐๐พ ๐ฟ)(๐ฝ) = ๐0 ([๐๐พ ๐ฟ] ๐ฝ) for all ๐ฝ ∈ ๐ฟ yet ๐๐ฟ ≠ ๐๐พ ๐ฟ. This contradicts the uniqueness of ๐๐ฟ . Therefore, {๐๐ฟ : ๐ฟ ∈ ๐ผ} is a chain, so, as in Exercise 4.4.13, define the function ๐ : ๐ผ → ๐ด by โ ๐๐ฟ . ๐= ๐ฟ∈๐ผ To check that ๐ is the function given by the theorem, take ๐ฝ ∈ ๐ผ. Since ๐ผ is a limit ordinal, ๐ฝ + ∈ ๐ผ and ๐(๐ฝ) = ๐๐ฝ + (๐ฝ) = ๐0 (๐๐ฝ + ๐ฝ) = ๐0 (๐ ๐ฝ), again contradicting the minimality of ๐ผ. Theorem 6.1.23 has a corollary that can be viewed as an extension of Theorem 5.2.14. Its proof is left to Exercise 18. COROLLARY 6.1.24 Let ๐ด be a set and ๐ ∈ ๐ด. For every ordinal ๐ผ, if ๐ is a function ๐ด → ๐ด, there exists a unique function ๐ : ๐ผ → ๐ด such that โ ๐(0) = ๐, โ ๐(๐ฝ + ) = ๐(๐(๐ผ)) for all ๐ฝ ∈ ๐ผ, โ โ ๐(๐พ) = {๐(๐ฝ) : ๐ฝ ∈ ๐พ} for all limit ordinals ๐พ ∈ ๐ผ. We are now ready to prove that every set can be well-ordered. The following theorem is equivalent to the axiom of choice (Exercise 20). THEOREM 6.1.25 [Zermelo] For any set ๐ด, there exists a relation ๐ on ๐ด such that (๐ด, ๐ ) is a well-ordered set. 296 Chapter 6 ORDINALS AND CARDINALS PROOF Take a set ๐ด and let ๐ : P(๐ด) → ๐ด be a choice function (Corollary 5.1.11). By Theorem 6.1.22, there is an ordinal ๐ผ such that no injection ๐ผ → ๐ด exists. Hence, we have ๐ : ๐ด → ๐ผ that is one-to-one. Let ๐ต ⊆ ๐ด. Since every element of ๐ผ is an ordinal, there exists an ordinal ๐ฟ ⊆ ๐ผ such that ๐ [๐ต] = ๐ฟ (Theorem 6.1.16). Define ๐๐ต = ๐ ๐ต. Then, ๐๐ต−1 ∈ <๐ผ ๐ด. Let ๐ = {๐๐ต−1 : ๐ต ∈ P(๐ด)}, and define โ1 : ๐ → P(๐ด) by โ1 (๐๐ต−1 ) = ๐ต. Also, define โ2 : ๐ → ๐ด by โ2 = ๐ โฆ โ1 . Since ๐ ⊆ <๐ผ ๐ด, extend โ2 to some โ : <๐ผ ๐ด → ๐ด such that โ ๐ = โ2 . By transfinite recursion (Theorem 6.1.23), there exists a function ๐ : ๐ผ → ๐ด such that for all ๐ฝ ∈ ๐ผ, ๐ (๐ฝ) = โ(๐ ๐ฝ). Define Φ so that for all ๐ฝ ∈ ๐ผ, { โ(๐ด โงต ๐ [๐ฝ]) if ๐ด โงต ๐ [๐ฝ] ≠ ∅, Φ(๐ฝ) = ๐ด if ๐ด โงต ๐ [๐ฝ] = ∅. Notice that ๐ด ∉ ๐ด by Theorem 5.1.16. Let ๐ฝ0 be the least ordinal such that Φ(๐ฝ0 ) = ๐ด. Then, Φ ๐ฝ0 is a bijection ๐ฝ0 → ๐ด [Exercise 19(a)]. Lastly, define the relation ๐ on ๐ด by ๐ ๐ ๐ if and only if Φ−1 (๐) ⊆ Φ−1 (๐), for all ๐, ๐ ∈ ๐ด. Since ⊆ is a well-order on ๐ฝ0 , ๐ is a well-order on ๐ด [Exercise 19(b)]. Exercises 1. Prove Lemma 6.1.5. โ โ 2. Does ๐๐พ๐( ๐ด, ๐ฝ) = ๐ผ∈๐ด ๐๐พ๐(๐ผ, ๐ฝ) for all sets of ordinals ๐ด with ๐ฝ ∈ ๐ด? Explain. 3. Explain why {0, 2, 3, 4, 5} is not an ordinal. 4. Prove that ∅ is an element of every ordinal. Section 6.1 ORDINAL NUMBERS 297 5. Prove that an ordinal is a transitive set (Theorem 6.1.8). 6. Let ๐ด be a set of ordinals. Prove. โ (a) ๐ด is an ordinal. (b) ๐ด has a least element with respect to ⊆. 7. Let ๐ต be a nonempty subset of the ordinal ๐ผ. Prove that there exists ๐ฝ ∈ ๐ต such that ๐ฝ and ๐ต are disjoint. 8. Prove Theorem 6.1.18. 9. From the proof of Theorem 6.1.19, prove that ๐๐พ๐4 (๐ด, ๐) has a greatest element. 10. Prove that ๐ด is order isomorphic to ๐ต in the proof of Theorem 6.1.19. 11. The proof of Theorem 6.1.19 contains many isomorphisms without explicitly identifying the isomorphism. Find these functions and prove that they are order isomorphisms. 12. Let ๐ be a well-ordering on ๐ด and suppose that ๐ด has no greatest element. Show the the order type of (๐ด, ๐ ) is a limit ordinal. 13. Find a transitive set that is not an ordinal. 14. Theorem 6.1.21 comes from the Burali-Forti paradox. Like Russell’s paradox (page 225), it arises when any formula is allowed to define a set. In this case, suppose that ๐ด = {๐ผ : ๐ผ is an ordinal} and assume that ๐ด is a set. Prove that ๐ด is an ordinal that must include all ordinals as its elements. 15. Prove that there exists a function ๐น such that ๐น (๐) is the ๐th Fibonacci number. 16. Prove that for every function โ : <๐ ๐ด → ๐ด, there is a unique function ๐ : ๐ → ๐ด such that for all ๐ ∈ ๐, ๐ (๐) = โ(๐ ๐). 17. Let (๐ด, 4) and (๐ต, 4′ ) be well-ordered sets and ๐ : ๐ด → ๐ต be an order-preserving surjection. Prove that for every ๐ ∈ ๐ด, there exists ๐ ∈ ๐ต such that ๐[๐๐พ๐4 (๐ด, ๐)] = ๐๐พ๐4′ (๐ต, ๐). 18. Prove Corollary 6.1.24. 19. Prove the following from the proof of Zermelo’s theorem (6.1.25). (a) Φ ๐ฝ0 is a bijection. (b) ๐ is a well-order on ๐ด. 20. Prove that Theorem 6.1.25 implies Axiom 5.1.10. 21. Prove that Zorn’s lemma (5.1.13) implies Theorem 6.1.25 without using the axiom of choice. 298 6.2 Chapter 6 ORDINALS AND CARDINALS EQUINUMEROSITY How can we determine whether two sets are of the same size? One possibility is to count their elements. What happens, however, if the sets are infinite? We need another method. Suppose ๐ด = {12, 47, 84} and ๐ต = {17, 101, 200}. We can see that these two sets are the same size without counting. Define a function ๐ : ๐ด → ๐ต so that ๐ (12) = 17, ๐ (47) = 101, and ๐ (84) = 200. This function is a bijection. Since each element is paired with exactly one element of the opposite set, ๐ด and ๐ต must be the same size. This is the motivation behind our first definition. DEFINITION 6.2.1 The sets ๐ด and ๐ต are equinumerous (written as ๐ด ≈ ๐ต) if there exists a bijection ๐ : ๐ด → ๐ต. If ๐ด and ๐ต are not equinumerous, write ๐ด โ ๐ต. EXAMPLE 6.2.2 Take ๐ ∈ โค such that ๐ ≠ 0 and define ๐โค = {๐๐ : ๐ ∈ โค}. We prove that โค ≈ ๐โค. To show this, we must find a bijection ๐ : โค → ๐โค. Define ๐ (๐) = ๐๐. โ Assume ๐ฅ1 , ๐ฅ2 ∈ โค, and let ๐ (๐ฅ1 ) = ๐ (๐ฅ2 ). Then ๐๐ฅ1 = ๐๐ฅ2 , which yields ๐ฅ1 = ๐ฅ2 since ๐ ≠ 0. Thus, ๐ is one-to-one. โ Let ๐ฆ ∈ ๐โค. This means that ๐ฆ = ๐๐ for some ๐ ∈ โค, so ๐ฆ = ๐ (๐). This shows that ๐ is onto and, hence, a bijection. EXAMPLE 6.2.3 To see โค+ ≈ โค, define a one-to-one correspondence so that each even integer is paired with a nonnegative integer and every odd integer is paired with a negative integer (Figure 6.1). Let ๐ : โค+ → โค be defined by { ๐−1 ๐(๐) = −๐ if ๐ = 2๐ for some ๐ ∈ โค+ , if ๐ = 2๐ − 1 for some ๐ ∈ โค+ . Notice that ๐(4) = 1 since 4 = 2(2), and ๐(5) = −3 because 5 = 2(3) − 1. This function is a bijection (Exercise 12). Equinumerosity plays a role similar to that of equality of integers. This is seen in the next theorem. In fact, the theorem resembles Definition 4.2.4. Despite this, it does not demonstrate the existence of an equivalence relation. This is because an equivalence relation is a relation on a set, so, to define an equivalence relation, the next result would require a set of all sets, contradicting Corollary 5.1.17. Section 6.2 EQUINUMEROSITY ... −4 1 2 3 4 5 6 7 −3 −2 −1 0 1 2 3 Figure 6.1 8 299 ... ... โค ≈ โค+ . THEOREM 6.2.4 Let ๐ด, ๐ต, and ๐ถ be sets. โ ๐ด ≈ ๐ด. (Reflexive) โ If ๐ด ≈ ๐ต, then ๐ต ≈ ๐ด. (Symmetric) โ If ๐ด ≈ ๐ต and ๐ต ≈ ๐ถ, then ๐ด ≈ ๐ถ. (Transitive) PROOF โ ๐ด ≈ ๐ด since the identity map is a bijection. โ Assume ๐ด ≈ ๐ต. Then, there exists a bijection ๐ : ๐ด → ๐ต. Therefore, ๐−1 is a bijection. Hence, ๐ต ≈ ๐ด. โ By Theorem 4.5.23, the composition of two bijections is a bijection. Therefore, ๐ด ≈ ๐ต and ๐ต ≈ ๐ถ implies ๐ด ≈ ๐ถ. The symmetric property allows us to conclude that ๐โค ≈ โค and โค ≈ โค+ from Examples 6.2.2 and 6.2.3. The transitivity part of Theorem 6.2.4 allows us to conclude from this that ๐โค ≈ โค+ . EXAMPLE 6.2.5 Show (0, 1) ≈ โ. We do this in two parts. First, let ๐ : (0, 1) → (−๐โ2, ๐โ2) be defined by ๐ (๐ฅ) = ๐๐ฅ − ๐โ2. This function is a one-to-one correspondence since its graph is a nonvertical, nonhorizontal line (Exercise 4.5.4). Second, define ๐ : (−๐โ2, ๐โ2) → โ to be the function ๐(๐ฅ) = tan ๐ฅ. From trigonometry, we know that tangent is a bijection on (−๐โ2, ๐โ2). Hence, (0, 1) ≈ (−๐โ2, ๐โ2) and (−๐โ2, ๐โ2) ≈ โ. Therefore, (0, 1) ≈ โ by Theorem 6.2.4. 300 Chapter 6 ORDINALS AND CARDINALS Order If ≈ resembles equality, the following resembles the ≤ relation. DEFINITION 6.2.6 The set ๐ต dominates the set ๐ด (written as ๐ด โชฏ ๐ต) if there exists an injection ๐ : ๐ด → ๐ต. If ๐ต does not dominate ๐ด, write ๐ด ฬธโชฏ ๐ต. Furthermore, define ๐ด โบ ๐ต to mean ๐ด โชฏ ๐ต but ๐ด โ ๐ต. EXAMPLE 6.2.7 If ๐ด ⊆ ๐ต, then ๐ด โชฏ ๐ต. This is proved using the inclusion map (Exercise 4.4.26). For instance, let ๐ : โค+ → โค be the inclusion map and ๐ : โค → โ be defined as ๐ (๐) = ๐๐พ๐≤ (โ, ๐). Then, โค+ โชฏ โ because ๐ โฆ ๐ is an injection. Similarly, โ โชฏ โ. However, ๐ด ⊂ ๐ต does not imply ๐ด โ ๐ต. As an example, ๐โค ≈ โค, but ๐โค ⊂ โค when ๐ ≠ ±1. Another method used to prove that ๐ด โชฏ ๐ต is to find a surjection ๐ต → ๐ด. Consider the sets ๐ด = {1, 2} and ๐ต = {3, 4, 5}. Define ๐ : ๐ต → ๐ด to be the surjection given by ๐ (3) = 1, ๐ (4) = 2, and ๐ (5) = 2. This is the inverse of the relation ๐ in Figure 4.1. To show that ๐ต dominates ๐ด, we must find an injection ๐ด → ๐ต. To do this, modify ๐ −1 by deleting (2, 4) and call the resulting function ๐. Observe that ๐(1) = 3 and ๐(2) = 5, which is an injection, so ๐ด โชฏ ๐ต. THEOREM 6.2.8 If there exists a surjection ๐ : ๐ด → ๐ต, then ๐ต โชฏ ๐ด. PROOF Let ๐ : ๐ด → ๐ต be onto. Define a relation ๐ ⊆ ๐ต × ๐ด by ๐ = {(๐, ๐) : ๐(๐) = ๐}. Since ๐ is onto, dom(๐ ) = ran(๐) = ๐ต. Corollary 5.1.11 yields a function ๐ so that dom(๐ ) = dom(๐ ) and ๐ ⊆ ๐ . We claim that ๐ is one-to-one. Indeed, let ๐1 , ๐2 ∈ ๐ต. Assume that we have ๐ (๐1 ) = ๐ (๐2 ). Let ๐1 = ๐ (๐1 ) and ๐2 = ๐ (๐2 ) where ๐1 , ๐2 ∈ ๐ด. This means ๐1 = ๐2 . Also, ๐(๐1 ) = ๐1 and ๐(๐2 ) = ๐2 because ๐ ⊆ ๐ . Since ๐ is a function, ๐1 = ๐2 . EXAMPLE 6.2.9 Let ๐ be an equivalence relation on a set ๐ด. The map ๐ : ๐ด → ๐ดโ๐ defined by ๐(๐) = [๐]๐ is a surjection. Therefore, ๐ดโ๐ โชฏ ๐ด. Section 6.2 EQUINUMEROSITY 301 EXAMPLE 6.2.10 We know that โค+ โชฏ โ by Example 6.2.7. We can also prove this by using the function ๐ : โ → โค+ defined by ๐ (๐ฅ) = |โฆ๐ฅโง| + 1 and appealing to Theorem 6.2.8. The next theorem states that โชฏ closely resembles an antisymmetric relation (Definition 4.3.1). Cantor was the first to publish a statement of it (1888). He proved it using the axiom of choice, but it was later shown that it can be proved in ๐ญ๐. It was proved independently by Ernst Schröder and Felix Bernstein around 1890. The proof given here follows that of Julius König (1906). THEOREM 6.2.11 [Cantor–Schröder–Bernstein] If ๐ด โชฏ ๐ต and ๐ต โชฏ ๐ด, then ๐ด ≈ ๐ต. PROOF Let ๐ : ๐ด → ๐ต and ๐ : ๐ต → ๐ด be injections. To prove that ๐ด is equinumerous to ๐ต, we define a one-to-one correspondence โ : ๐ด → ๐ต. To do this, we recursively define two sequences of sets by first letting ๐ถ0 = ๐ด โงต ran(๐) and ๐ท0 = ๐ [๐ถ0 ]. Then, for ๐ ∈ ๐, ๐ถ๐+1 = ๐[๐ท๐ ] and ๐ท๐ = ๐ [๐ถ๐ ]. This is illustrated in Figure 6.2. Note that both {๐ถ๐ : ๐ ∈ ๐} and {๐ท๐ : ๐ ∈ ๐} are pairwise disjoint because ๐ and ๐ are one-to-one (Exercise 11). Define โ by ( ) ( ) โ โ −1 โ=๐ ๐ถ๐ ∪ ๐ ran(๐) โงต ๐ถ๐ . ๐∈๐ ๐∈๐ We show that โ is the desired function. โ Let ๐ฅ1 , ๐ฅ2 ∈ ๐ด such that ๐ฅ1 ≠ ๐ฅ2 . Since both ๐ and ๐ are one-to-one, we only need to check the case when ๐ฅ1 ∈ ๐ถ๐ for some ๐ ∈ ๐ and ๐ฅ2 ∉ ๐ถ๐ for all ๐ ∈ ๐. Then, ๐ (๐ฅ1 ) ∈ ๐ท๐ but ๐ −1 (๐ฅ2 ) ∉ ๐ท๐ , so ๐ (๐ฅ1 ) ≠ ๐ −1 (๐ฅ2 ). That is, โ(๐ฅ1 ) ≠ โ(๐ฅ2 ). โ Take ๐ฆ ∈ ๐ต. If ๐ฆ ∈ ๐ท๐ for some ๐ ∈ ๐, then โ ๐ฆ = ๐ (๐ฅ) for some ๐ฅ ∈ ๐ถ๐ . That is, ๐ฆ = โ(๐ฅ). Now suppose ๐ฆ ∉ ๐∈๐ ๐ท๐ . Clearly, ๐(๐ฆ) ∉ ๐ถ0 . If ๐(๐ฆ) ∈ ๐ถ๐ for some ๐ > 0, thenโ๐ฆ ∈ ๐ท๐−1 , a contradiction. Hence, ๐(๐ฆ) = ๐ฅ for some ๐ฅ ∈ ran(๐) โงต ๐∈๐ ๐ถ๐ . This implies that we have โ(๐ฅ) = ๐ −1 (๐ฅ) = ๐ฆ. 302 Chapter 6 ORDINALS AND CARDINALS ๎ A ๎ ran(g) ๎ A B f โพC0 C0 D0 gโพD0 C1 f โพC1 D1 gโพD1 C2 ran(g) . . . . . . ran(g) ๎ ๎ Cn n∈๐ Figure 6.2 ๎ g −1โพ ran(g) ๎ ๎ Cn ๎ n∈๐ The Cantor–Schröder–Bernstein theorem. B๎ ๎ Dn n∈๐ Section 6.2 EQUINUMEROSITY 303 EXAMPLE 6.2.12 Because (0, 1) ⊆ [0, 1], we have that (0, 1) โชฏ [0, 1], and because the function ๐ : [0, 1] → (0, 1) defined by ๐ (๐ฅ) = 1 1 ๐ฅ+ 2 4 is an injection, we have that [0, 1] โชฏ (0, 1), so we conclude by the Cantor– Schröder–Bernstein theorem (6.2.11) that (0, 1) ≈ [0, 1]. Diagonalization The strict inequality ๐ด โบ ๐ต is sometimes difficult to prove because we must show that there does not exist a bijection from ๐ด onto ๐ต. The next method was developed by Cantor (1891) to accomplish this for infinite sets. It is called diagonalization. Let ๐ be the set of all functions ๐ : ๐ → {๐, ๐ค}, where ๐ ≠ ๐ค. To show that ๐ โบ ๐, we prove two facts: โ There exists an injection ๐ → ๐. โ There is no one-to-one correspondence between ๐ and ๐. Cantor’s method does both of these at once. Let ๐ : ๐ → ๐ be a function. Writing the functions of the range of ๐ as infinite tuples, let ๐๐ = (๐๐0 , ๐๐1 , ๐๐2 , … , ๐๐๐ , … ), where ๐๐๐ ∈ {๐, ๐ค} for all ๐, ๐ ∈ ๐. For example, ๐(4)(3) = ๐4 (3) = ๐4,3 . Now, write the functions in order: ๐0 = (a00 , ๐01 , ๐02 , … , ๐0๐ , … ), ๐1 = (๐10 , a11 , ๐12 , … , ๐1๐ , … ), ๐2 = (๐20 , ๐21 , a22 , … , ๐2๐ , … ), โฎ ๐๐ = (๐๐0 , ๐๐1 , ๐๐2 , … , aii , … , ๐๐๐ , … ), โฎ From this, a function ๐ ∈ ๐ that is not in the list can be found by identifying the elements on the diagonal and defining ๐ (๐) to be the opposite of ๐๐๐ . In other words, define for all ๐ ∈ ๐, { ๐ if ๐๐๐ = ๐ค, ๐๐ = ๐ค if ๐๐๐ = ๐, 304 Chapter 6 ORDINALS AND CARDINALS and ๐ (๐) = ๐๐ is an element of ๐ not in the list because for all ๐ ∈ ๐, ๐ (๐) ≠ ๐๐๐ . Since the function ๐ mapping ๐ to ๐ is arbitrary, there are injections ๐ → ๐ but none of them are onto. Therefore, ๐ โบ ๐. Furthermore, note that the elements of [0, 1] can be uniquely represented as binary numbers of the form 0 . ๐0 ๐1 ๐2 … ๐๐ … , where ๐๐ ∈ {0, 1} for each ๐ ∈ ๐. For example 1 = 0.1111111 … and 1โ2 = 0.1000000 … . Therefore, ๐ ≈ [0, 1] . Hence, we can conclude like Cantor that since [0, 1] ≈ โ (Examples 6.2.5 and 6.2.12), ๐ โบ โ. Cantor’s diagonalization argument can be generalized, but we first need a definition. Let ๐ด be a set and ๐ต ⊆ ๐ด. The function ๐๐ต : ๐ด → {0, 1} is called a characteristic function and is defined by { 1 if ๐ ∈ ๐ต, ๐๐ต (๐) = 0 if ๐ ∉ ๐ต. For example, if ๐ด = โค and ๐ต = {0, 1, 3, 5}, then ๐๐ต (1) = 1 but ๐๐ต (2) = 0. Moreover, for every set ๐ด, ๐ด 2 = {๐๐ต : ๐ต ⊆ ๐ด}. The characteristic function plays an important role in the proof of the next theorem. THEOREM 6.2.13 If ๐ด is a set, ๐ด โบ ๐ด 2. Section 6.2 EQUINUMEROSITY 305 PROOF Since the function ๐ : ๐ด → ๐ด 2 defined by ๐(๐) = ๐{๐} is an injection, ๐ด 2 dominates ๐ด. To show that ๐ด is not equinumerous with ๐ด 2, we show that there is no surjection ๐ด → ๐ด 2. Let ๐ โถ ๐ด → ๐ด 2 be a function, and for all ๐ ∈ ๐ด, write ๐(๐) = ๐๐ต๐ for some ๐ต๐ ⊆ ๐ด. Define ๐ so that { 1 if ๐๐ต๐ (๐) = 0, ๐(๐) = 0 if ๐๐ต๐ (๐) = 1. Therefore, ๐ ∉ ran(๐) because ๐๐ต๐ (๐) ≠ ๐(๐) for all ๐ ∈ ๐ด. However, ๐ ∈ ๐ด 2. To prove this, define ๐ต = {๐ ∈ ๐ด : ๐๐ต๐ (๐) = 0}. We conclude that ๐(๐) = ๐๐ต (๐) because if ๐๐ต๐ (๐) = 0, then ๐ ∈ ๐ต, so ๐(๐) = 1 and ๐๐ต (๐) = 1, but if ๐๐ต๐ (๐) = 1, then ๐ ∉ ๐ต, so ๐(๐) = 0 and ๐๐ต (๐) = 0. Therefore, ๐ = ๐๐ต , and ๐ is not onto. By Exercise 14, P(๐ด) ≈ ๐ด 2. This result combined with Theorem 6.2.13 quickly yield the following. COROLLARY 6.2.14 If ๐ด is a set, ๐ด โบ P(๐ด). From the theorem, we conclude that there exists a sequence of sets ๐ โบ P(๐) โบ P(P(๐)) โบ P(P(P(๐))) โบ · · · . Thus, there are larger and larger magnitudes of infinity. Exercises 1. Given ๐๐ต : โค → {0, 1} with ๐ต = {2, 5, 19, 23}, find (a) ๐๐ต (1) (b) ๐๐ต (2) (c) ๐๐ต (−10) (d) ๐๐ต (19) 2. Find a surjection ๐ : ๐ × ๐ → ๐ showing that ๐ โชฏ ๐ × ๐. 3. Prove the given equations. (a) ๐๐ด∪๐ต = ๐๐ด + ๐๐ต − ๐๐ด ๐๐ต (b) ๐๐ด∩๐ต = ๐๐ด ๐๐ต 4. Prove that there exists a bijection between the given pairs of sets. 306 Chapter 6 ORDINALS AND CARDINALS (a) (b) (c) (d) (e) (f) (g) (h) ๐] and [−1, 1] [[0, ] −๐โ2, ๐โ2 and [−1, 1] (0, ∞) and โ ๐ and โค โค+ and โค− {(๐ฅ, 0) : ๐ฅ ∈ โ} and โ โค and โค × โค {(๐ฅ, ๐ฆ) ∈ โ × โ : ๐ฆ = 2๐ฅ + 4} and โ 5. Let ๐ < ๐ and ๐ < ๐, where ๐, ๐, ๐, ๐ ∈ โ. Prove. (a) (๐, ๐) ≈ (๐, ๐) (b) [๐, ๐] ≈ [๐, ๐] (c) (๐, ๐) ≈ [๐, ๐] (d) (๐, ๐) ≈ (๐, ๐] 6. Prove โ ≈ โ. 7. Let ๐ด, ๐ต, ๐ถ, and ๐ท be nonempty sets. Prove. (a) โ โชฏ โค− (b) ๐ด ∩ ๐ต โชฏ P(๐ด) (c) ๐ด โชฏ ๐ด × ๐ต (d) [0, 2] โชฏ [5, 7] (e) ๐ด ∩ ๐ต โชฏ ๐ด (f) ๐ด๐ต โชฏ ๐ด × ๐ต (g) ๐ด × {0} โชฏ (๐ด ∪ ๐ต) × {1} (h) ๐ด × ๐ต × ๐ถ โชฏ ๐ด × ๐ต × ๐ถ × ๐ท (i) ๐ด โงต ๐ต โชฏ ๐ถ × ๐ด 8. Prove. (a) If ๐ด โชฏ ๐ต and ๐ต ≈ ๐ถ, then ๐ด โชฏ ๐ถ. (b) If ๐ด ≈ ๐ต and ๐ต โชฏ ๐ถ, then ๐ด โชฏ ๐ถ. (c) If ๐ด โชฏ ๐ต and ๐ต โชฏ ๐ถ, then ๐ด โชฏ ๐ถ. (d) If ๐ด โชฏ ๐ต and ๐ถ โชฏ ๐ท, then ๐ด ๐ถ โชฏ ๐ต ๐ท. (e) If ๐ด ≈ ๐ต, then P(๐ด) ≈ P(๐ต). (f) If ๐ด ≈ ๐ต and ๐ถ ≈ ๐ท, then ๐ด × ๐ถ ≈ ๐ต × ๐ท. (g) If ๐ด ≈ ๐ต, ๐ ∈ ๐ด, and ๐ ∈ ๐ต, then ๐ด โงต {๐} ≈ ๐ต โงต {๐}. (h) If ๐ด โงต ๐ต ≈ ๐ต โงต ๐ด, then ๐ด ≈ ๐ต. (i) If ๐ด ⊆ ๐ต and ๐ด ≈ ๐ด ∪ ๐ถ, then ๐ต ≈ ๐ด ∪ ๐ต ∪ ๐ถ. (j) If ๐ถ ⊆ ๐ด, ๐ต ⊆ ๐ท, and ๐ด ∪ ๐ต ≈ ๐ต, then ๐ถ ∪ ๐ท ≈ ๐ท. 9. Given the function ๐ : ๐ด → ๐ต, prove that ๐ ≈ ๐ด and ๐ โชฏ ran(๐). 10. Without appealing to the Cantor–Schröder–Bernstein theorem (6.2.11), prove that (0, 1) ≈ [0, 1]. Section 6.3 CARDINAL NUMBERS 307 11. Prove that the sets {๐ถ๐ : ๐ ∈ ๐} and {๐ท๐ : ๐ ∈ ๐} from the proof of Theorem 6.2.11 are pairwise disjoint. 12. Show that ๐ is a bijection, where ๐ : โค+ → โค is defined by { ๐−1 ๐(๐) = −๐ if ๐ = 2๐ for some ๐ ∈ ๐, if ๐ = 2๐ + 1 for some ๐ ∈ ๐. 13. Let ๐ด be an infinite set and {๐1 , ๐2 , … } be a set of distinct elements from ๐ด. Prove that ๐ด → ๐ด โงต {๐1 } is a bijection, where { ๐๐+1 ๐(๐ฅ) = ๐ฅ if ๐ฅ = ๐๐ , otherwise. 14. For any set ๐ด, prove that P(๐ด) ≈ ๐ด 2. 15. Prove that if ๐ : ๐ด → ๐ต is a surjection, there exists a function ๐ : ๐ต → ๐ด such that ๐ โฆ ๐ = ๐ผ๐ต . 16. Use the power set to prove that there is no set of all sets. 6.3 CARDINAL NUMBERS Let ๐ด be a set. Define ๐ต = {๐ฝ : ๐ฝ is an ordinal ∧ ๐ฝ ≈ ๐ด}. By Zermelo’s theorem (6.1.25), ๐ด can be well-ordered, so by Theorem 6.1.19, ๐ด is order isomorphic to some ordinal, so ๐ต is nonempty. Moreover, ๐ต is subset of an ordinal (Theorem 6.1.20). Therefore, (๐ต, ⊆) has a least element, which has the property that it is not equinumerous to any of its elements. This allows us to define the second of our new types of number (page 283). This type will be used to denote the size of a set. DEFINITION 6.3.1 An ordinal ๐ is a cardinal number (or simply a cardinal) if ๐ผ โ ๐ for every ๐ผ ∈ ๐ . Observe that every infinite cardinal is a limit ordinal. This is because ๐ผ ≈ ๐ผ + for every infinite ordinal ๐ผ. However, a limit ordinal might not be a cardinal. Let ๐ and ๐ be cardinals. Suppose that ๐ด ≈ ๐ and ๐ด ≈ ๐. By Theorem 6.2.4, we have that ๐ ≈ ๐. If ๐ ∈ ๐ or ๐ ∈ ๐ , this would contradict the definition of a cardinal number. Therefore, ๐ = ๐ (Corollary 6.1.15), and we conclude that every set is equinumerous to exactly one cardinal. 308 Chapter 6 ORDINALS AND CARDINALS DEFINITION 6.3.2 The cardinality of a set ๐ด is denoted by |๐ด| and defined as the unique cardinal equinumerous to ๐ด. Observe that the cardinality of a cardinal ๐ is ๐ . EXAMPLE 6.3.3 โ Let ๐ be a set of cardinals. By Theorem 6.1.16, we know that ๐ is an ordinal. Now we show that it is also a cardinal. Suppose that ๐ผ is an ordinal such that โ โ ๐ผ ≈ ๐ . By Definition โ 6.3.1, we must show that ๐ ⊆ ๐ผ. Suppose there exists an ordinal ๐ฝ ∈ ๐ such that ๐ฝ ∉ ๐ผ. This means that there exists a cardinal ๐ ∈ ๐ such that ๐ฝ ∈ ๐ . This is impossible because โ ๐ผโชฏ๐ฝโบ๐ โชฏ ๐ ≈ ๐ผ. Finite Sets Intuitively, we know what a finite set is. Both of the sets ๐ด = {0, 2, 3, 5, 8, 10} and ๐ต = {๐ ∈ โค : (๐ − 1)(๐ + 3) = 0} are examples because we can count all of their elements and find that there is only one ordinal equinumerous to ๐ด and only one ordinal equinumerous to ๐ต. That is, |๐ด| = 6 and |๐ต| = 2. This suggests the following definition. DEFINITION 6.3.4 For every set ๐ด, if there exists ๐ ∈ ๐ such that ๐ด ≈ ๐, then ๐ด is finite. If ๐ด is not finite, it is infinite. As we will see, finite sets are fundamentally different from infinite sets. There are properties that finite sets have in addition to the number of their elements that infinite sets do not have. Let us consider some of those properties of finite sets. LEMMA 6.3.5 Let ๐ be a positive natural number. If ๐ฆ ∈ ๐, then ๐ โงต {๐ฆ} ≈ ๐− . PROOF We proceed by mathematical induction. โ When ๐ = 1, it must be the case that ๐ฆ = 0, so ๐ โงต {๐ฆ} = 0 = 1− . โ Take ๐ฆ ∈ ๐ + 1. If ๐ฆ = ๐, then (๐ + 1) โงต {๐} = ๐, so suppose that ๐ฆ < ๐. By induction, there exists a bijection ๐ : ๐ โงต {๐ฆ} → ๐− . Then, define ๐ : (๐ + 1) โงต {๐ฆ} → ๐ by ๐ (๐) = ๐(๐) for all ๐ < ๐ and ๐ (๐) = ๐. The function ๐ is a bijection (Exercise 3). Lemma 6.3.5 is used to prove a characteristic property of finite cardinals. Section 6.3 CARDINAL NUMBERS 309 THEOREM 6.3.6 No natural number is equinumerous to a proper subset of itself. PROOF Let ๐ ∈ ๐ be minimal such that there exists ๐ด ⊂ ๐ and ๐ ≈ ๐ด. Since ∅ has no proper subsets and the only proper subset of 1 is 0, we can assume that ๐ ≥ 2. Let ๐ : ๐ด → ๐ be a bijection and ๐ฅ ∈ ๐ด and ๐ฆ ∈ ๐ โงต ๐ด such that ๐ (๐ฅ) = ๐ฆ. We check the following. โ Let ๐ ∈ ๐ด โงต {๐ฅ}. If ๐ (๐) = ๐ฆ, then we contradict the hypothesis that ๐ is one-to-one because ๐ (๐ฅ) = ๐ฆ and ๐ฅ ≠ ๐. Thus, ๐ (๐) ∈ ๐ โงต {๐ฆ}. โ Take ๐ ∈ ๐ โงต {๐ฆ}. Since ๐ is a surjection, there exists ๐ ∈ ๐ด such that ๐ (๐) = ๐. If ๐ = ๐ฅ, then {๐, ๐ฆ} ⊆ [๐ฅ]๐ (Definition 4.2.7), which is impossible because ๐ is a function. Thus, ๐ ∈ ๐ด โงต {๐ฅ}, and we conclude that ๐ (๐ด โงต {๐ฅ}) is onto ๐ โงต {๐ฆ}. โ Since the restriction of a one-to-one function is one-to-one, ๐ (๐ด โงต {๐ฅ}) is one-to-one. Hence, ๐0 = ๐ (๐ด โงต {๐ฅ}) is a bijection with range ๐ โงต {๐ฆ}. We have two cases to consider. โ Suppose ๐− ∉ ๐ด. This implies that ๐ด โงต {๐ฅ} ⊆ ๐− . Since ๐ฅ ∈ ๐ด, we have that ๐ฅ ≠ ๐− , so ๐ฅ ∈ ๐− . Thus, ๐ด โงต {๐ฅ} ⊂ ๐− . By Lemma 6.3.5, there exists a bijection ๐ : ๐ โงต {๐ฆ} → ๐− , so we have ๐ด โงต {๐ฅ} ≈ ๐− because ๐ โฆ ๐0 is a bijection, contradicting the minimality of ๐. โ Assume ๐− ∈ ๐ด. Define ๐ด′ = ๐ด โงต {๐− } ∪ {๐ฆ}. Since ๐ฆ ∉ ๐ด, ๐ด′ ≈ ๐ด, which implies that ๐ด′ ≈ ๐. Replace ๐ด with ๐ด′ in the previous argument and use ๐ (๐ด′ โงต {๐ฆ}) to contradict the minimality of ๐. COROLLARY 6.3.7 Every finite set is equinumerous to exactly one natural number. PROOF Let ๐ด be finite. This means that ๐ด ≈ ๐ for some ๐ ∈ ๐. Let ๐ ∈ ๐ also have the property that ๐ด ≈ ๐. This implies that ๐ ≈ ๐. Hence, by Theorem 6.3.6, we conclude that ๐ = ๐ because ๐ ⊆ ๐ or ๐ ⊆ ๐. There are many results that follow directly from Theorem 6.3.6. The following six corollaries are among them. COROLLARY 6.3.8 [Pigeonhole Principle] Let ๐ด and ๐ต be finite sets with ๐ต โบ ๐ด. There is no one-to-one function ๐ด → ๐ต. 310 Chapter 6 ORDINALS AND CARDINALS PROOF There exists unique ๐, ๐ ∈ ๐ such that ๐ด ≈ ๐ and ๐ต ≈ ๐ by Corollary 6.3.7. Assume that ๐ : ๐ด → ๐ต is one-to-one. Then, ๐ ≈ ๐ด โชฏ ๐ต ≈ ๐, so ๐ is equinumerous to a subset of ๐. This implies that ๐ ⊆ ๐. However, ๐ ⊂ ๐ because ๐ต โบ ๐ด, which contradicts Theorem 6.1.14. COROLLARY 6.3.9 No finite set is equinumerous to a proper subset of itself. COROLLARY 6.3.10 A set equinumerous to a proper subset of itself is infinite. Because ๐ : ๐ → ๐ โงต {0} defined by ๐ (๐) = ๐ + 1 is a bijection, we have the next result by Corollary 6.3.10. COROLLARY 6.3.11 ๐ is infinite. The proofs of the last two corollaries are left to Exercise 5. COROLLARY 6.3.12 If ๐ด is a proper subset of a natural number ๐, there exists ๐ < ๐ such that ๐ด ≈ ๐. COROLLARY 6.3.13 Let ๐ด ⊆ ๐ต. If ๐ต is finite, ๐ด is finite, and if ๐ด is infinite, ๐ต is infinite. Countable Sets Since ๐ is the first infinite ordinal, it is also a cardinal. Therefore, |๐ด| = ๐ if and only if ๐ด ≈ ๐. As sets go, finite sets and those equinumerous with ๐ are small, so we classify them together using the next definition. DEFINITION 6.3.14 A set ๐ด is countable if ๐ด โชฏ ๐. Sometimes countable sets are called discrete or denumerable. For example, the bijection ๐ : โค+ → ๐ defined by ๐ (๐) = ๐− shows that โค+ is countable. Moreover, a nonempty finite set is countable and can be written as {๐0 , ๐1 , … , ๐๐−1 }, Section 6.3 CARDINAL NUMBERS 311 for some positive integer ๐. A countably infinite set can be written as {๐0 , ๐1 , ๐2 , … }, where there are infinitely many distinct elements of the set. EXAMPLE 6.3.15 The set of rational numbers is a countable set. To prove this, define a bijection ๐ : ๐ → โ by first mapping the even natural numbers to the nonnegative rational numbers. The function is defined along the path indicated in Figure 6.3. When a rational number that previously has been used is encountered, it is skipped. To complete the definition, associate the odd naturals with the negative rational numbers using a path as in the diagram. This function is a bijection, so we conclude that โ is countable. We have defined countability in terms of bijections. Now let us identify a condition for countability using surjections. THEOREM 6.3.16 A set ๐ด is countable if and only if there exists a function from ๐ onto ๐ด. PROOF If ๐ : ๐ → ๐ด is a surjection, by Theorem 6.2.8, ๐ด โชฏ ๐. Conversely, suppose ๐ด is countable. We have two cases to check. โ Suppose ๐ด ≈ ๐. Then, there is a surjection from the set of natural numbers to ๐ด. f(6) = 2 1 f(4) = 1 1 f(0) = 0 1 Figure 6.3 2 2 f(2) = 0 2 1 2 f(8) = 2 3 f(10) = 1 3 0 3 The rational numbers are countable. ... 312 Chapter 6 ORDINALS AND CARDINALS โ Now let ๐ด ≈ ๐ for some ๐ ∈ ๐. If ๐ด = ∅, then ∅ is a surjection ๐ → ๐ด. Thus, assume ๐ด ≠ ∅. This means that we can write ๐ด = {๐0 , ๐1 , … , ๐๐−1 }. Define ๐ : ๐ → ๐ด by { ๐๐ ๐(๐) = ๐๐−1 if ๐ = 0, 1, … , ๐ − 1, otherwise. This function is certainly onto. If ๐ด๐ is countable for all ๐ = 0, 1, … , ๐ − 1, then ๐ด0 × ๐ด1 × · · · × ๐ด๐−1 is countable (Exercise 15). In particular, ๐ × ๐ and โค × โค are countable. We also have the next theorem. THEOREM 6.3.17 The union of a countable family of countable sets is countable. PROOF Let {๐ด โ ๐ผ : ๐ผ ∈ ๐ผ} be a family of countable sets with ๐ผ countable. Since we have that ∅ = ∅ is countable (Example 3.4.12), we can assume that ๐ผ is nonempty. For each ๐ผ ∈ ๐ผ, there exists a surjection in ๐ (๐ด๐ผ ) by Theorem 6.3.16. Therefore, by Corollary 5.1.11, there exists ๐ : ๐ผ → ๐ (๐ด๐ผ ) such that ๐ (๐ผ) is a surjection ๐ → ๐ด๐ผ for all ๐ผ ∈ ๐ผ. Because ๐ผ is countable, we have a surjection ๐ : ๐ → ๐ผ, and since ๐ × ๐ is countable, we have another surjection โ : ๐ → ๐ × ๐. We now define โ ๐ :๐×๐→ ๐ด๐ผ ๐ผ∈๐ผ by ๐(๐, ๐) = ๐ (๐(๐))(๐) and let ๐:๐→ โ ๐ด๐ผ ๐ผ∈๐ผ be defined by ๐ = ๐ โฆ โ. To check that ๐ is onto, let ๐ ∈ ๐ด๐ผ , some ๐ผ ∈ ๐ผ. Since ๐ is onto, there exists ๐ ∈ ๐ so that ๐(๐) = ๐ผ. Furthermore, since ๐ (๐ผ) is onto, we have ๐ ∈ ๐ such that ๐ (๐ผ)(๐) = ๐, and since โ is onto, there exists ๐ ∈ ๐ so that โ(๐) = (๐, ๐). Therefore, ๐(๐) = (๐ โฆ โ)(๐) = ๐(๐, ๐) = ๐ (๐(๐))(๐) = ๐ (๐ผ)(๐) = ๐. For example, โ โ {๐โค : ๐ ∈ ๐} and {โ × {๐} : ๐ ∈ ๐} are countable. Section 6.3 CARDINAL NUMBERS 313 Alephs Cantor denoted the first infinite cardinal ๐ by ℵ0 . The symbol ℵ (aleph) is the first letter of the Hebrew alphabet. The next magnitude of infinity is ℵ1 , which seems to exist by Theorem 6.2.13. This continues and gives an increasing sequence of infinite cardinals, and since natural numbers must be less than any infinite cardinal, we have 0 โบ 1 โบ 2 โบ · · · โบ ℵ0 โบ ℵ1 โบ ℵ2 โบ · · · . For instance, 4 โบ ℵ1 , ℵ0 โชฏ ℵ0 , and ℵ3 โบ ℵ7 . EXAMPLE 6.3.18 Although he was unable to prove it, Cantor suspected that ℵ1 = |โ|. This conjecture is called the continuum hypothesis (๐ข๐ง). However, it is possible that ℵ1 โบ |โ|. It is also possible that ℵ1 = |โ|. Cantor was unable to prove ๐ข๐ง because it is undecidable assuming the axioms of ๐ญ๐๐. In other words, it is an ๐ฒ๐ณ-sentence that can be neither proved nor disproved from ๐ญ๐๐. Now to define the other alephs. Pick an ordinal ๐ผ. Define the function โ by โ(๐) = least infinite cardinal not in ran(๐), where ๐ is a function with dom(๐) ∈ ๐ผ. For example, if ๐ผ = 5 and ๐ = {(0, ๐ 0 ), (1, ๐ 1 ), (2, ๐ 2 ), (3, ๐ 3 ), (4, ๐ 4 )}, then โ(๐) is the least infinite cardinal not in {๐ 0 , ๐ 1 , ๐ 2 , ๐ 3 , ๐ 4 }. By Transfinite Recursion (6.1.23), there exists a unique function ๐ with domain ๐ผ and ๐ (๐ฝ) = least infinite cardinal not in ran(๐ ๐ฝ) for all ๐ฝ ∈ ๐ผ. Define ℵ๐ฝ = ๐ (๐ฝ). Since โ(0) = ๐, we have that ℵ0 = ๐. Moreover, the definition of ๐ implies that ℵ1 = least infinite cardinal not in {ℵ0 }, ℵ2 = least infinite cardinal not in {ℵ0 , ℵ1 }, ℵ3 = least infinite cardinal not in {ℵ0 , ℵ1 , ℵ2 }, โฎ ℵ๐ = least infinite cardinal not in {ℵ๐ : ๐ ∈ ๐}, โฎ ℵ๐ผ = least infinite cardinal not in {ℵ๐ฝ : ๐ฝ ∈ ๐ผ} โฎ The question at this point is whether the alephs name all of the infinite cardinals. The next theorem answers the question. 314 Chapter 6 ORDINALS AND CARDINALS THEOREM 6.3.19 For every infinite cardinal ๐ , there exists an ordinal ๐ผ such that ๐ = ℵ๐ผ . PROOF Suppose ๐ ≠ ℵ๐ผ for every ordinal ๐ผ. We can assume that ๐ is the minimal such cardinal. Thus, for all cardinals ๐ ∈ ๐ , there exists an ordinal ๐ฝ๐ such that ๐ = ℵ๐ฝ๐ . Therefore, the least infinite cardinal not an element of {ℵ๐ฝ๐ : ๐ ∈ ๐ ∧ ๐ is a cardinal} is the next aleph, but this is ๐ . EXAMPLE 6.3.20 The generalized continuum hypothesis (๐ฆ๐ข๐ง) states that for every ordinal ๐ผ, ℵ๐ผ+ = |P(ℵ๐ผ )|. When ๐ผ = 0, ๐ฆ๐ข๐ง implies that ℵ1 = |P(ℵ0 )| = |โ|, which is ๐ข๐ง (Example 6.3.18). Like ๐ข๐ง, ๐ฆ๐ข๐ง is undecidable in ๐ญ๐๐. Like the ordinals, the cardinals can be divided into two classes. DEFINITION 6.3.21 Let ๐ be a nonzero cardinal number. If ๐ ∈ ๐ or there exists an ordinal ๐ผ such that ๐ = ℵ๐ผ+ , then ๐ is a successor cardinal. Otherwise, ๐ is a limit cardinal. For example, the positive natural numbers and ℵ1 are successor cardinals, while ℵ0 and ℵ๐ are limit cardinals. Notice that if ๐ is a limit cardinal, โ ๐ = {๐ : ๐ ∈ ๐ ∧ ๐ is a cardinal}. Exercises 1. Prove that |๐ผ| = |๐ผ + | for every ordinal ๐ผ. 2. Let ๐(๐ฅ) be a formula. Prove that if ๐(๐ผ) is false for some ordinal ๐ผ, then there exists a least ordinal ๐ฝ such that ๐(๐ฝ) is false. 3. Prove that the function ๐ in the proof of Lemma 6.3.5 is a bijection. 4. Show that the following attempted generalization of Lemma 6.3.5 is false: Let ๐ผ be an ordinal and ๐ ∈ ๐ด. If |๐ด| = ℵ๐ผ + , then |๐ด โงต {๐}| = ℵ๐ผ . 5. Demonstrate Corollaries 6.3.12 and 6.3.13. 6. Let ๐ด and ๐ต be finite sets. Prove the following. Section 6.3 CARDINAL NUMBERS 315 (a) |๐ด ∪ ๐ต| = |๐ด| + |๐ต| − |๐ด ∩ ๐ต| (b) |๐ด ∩ ๐ต| = |๐ด| − |๐ด โงต ๐ต| (c) |๐ด × ๐ต| = |๐ด| ⋅ |๐ต| 7. Prove that the intersection and union of finite sets is finite is finite. 8. Prove that every finite set has a choice function without using the axiom of choice. 9. Let ๐ and ๐ −1 be well-orderings of a set ๐ด. Prove that ๐ด is finite. 10. Show that โค is countable. 11. Let ๐ด be infinite. Find infinite sets ๐ต and ๐ถ such that ๐ด = ๐ต ∪ ๐ถ. 12. If ๐ต is countable, prove that |๐ด × ๐ต| = |๐ด|. 13. Let ๐ด and ๐ต be sets and ๐ด is countable. Prove that ๐ต is countable when ๐ด ≈ ๐ต. 14. Let ๐ด and ๐ต be countable sets. Show that the given sets are countable. (a) ๐ด ∪ ๐ต (b) ๐ด ∩ ๐ต (c) ๐ด × ๐ต (d) ๐ด โงต ๐ต 15. Let ๐ด0 , ๐ด1 , … , ๐ด๐−1 be countable sets. Prove that the given sets are countable. (a) ๐ด0 ∪ ๐ด1 ∪ · · · ∪ ๐ด๐−1 (b) ๐ด0 ∩ ๐ด1 ∩ · · · ∩ ๐ด๐−1 (c) ๐ด0 × ๐ด1 × · · · × ๐ด๐−1 16. Prove. (a) If ๐ด ∪ ๐ต is countable, ๐ด and ๐ต are countable. (b) If ๐ด is countable, 2 ๐ ≈ ๐ด ๐ โ 17. Let โฑ be a set of cardinals. Prove that โฑ is a cardinal. 18. A real number is algebraic if it is a root of a nonzero polynomial with integer coefficients. A real number that is not algebraic is transcendental. Prove that the set of algebraic numbers is countable and the set of transcendental numbers is uncountable. 19. Take a set ๐ด and define ๐ต = {๐ผ : ๐ผ is an ordinal ∧ ๐ผ โชฏ ๐ด}. [See Hartogs’ theorem (6.1.22).] Prove the following. (a) ๐ต is a cardinal. (b) |๐ด| โบ ๐ต. (c) ๐ต is the least cardinal such that |๐ด| โบ ๐ต. 20. Assuming ๐ฆ๐ข๐ง, find |P(P(P(P(P(โ)))))|. 21. Prove that for all ordinals ๐ผ and ๐ฝ, if ๐ผ ∈ ๐ฝ, then ℵ๐ผ ∈ ℵ๐ฝ . 316 Chapter 6 ORDINALS AND CARDINALS 22. Recursively define the following function using i (beth), the second letter of the Hebrew alphabet. Let ๐ผ be an ordinal. i0 = ℵ0 , i๐ผ+ = 2i๐ผ , โ i๐ผ = i๐ฝ if ๐ผ is a limit. ๐ฝ∈๐ผ (a) Use i to restate ๐ฆ๐ข๐ง. (b) Use transfinite recursion (Corollary 6.1.24) to prove that i defines a function. 6.4 ARITHMETIC Since every natural number is both an ordinal and a cardinal, we want to extend the operations on ๐ to all of the ordinals and all of the cardinals. Since the purpose of the ordinals is to characterize well-ordered sets but the purpose of the cardinals is to count, we expect the two extensions to be different. Ordinals Definitions 5.2.15 and 5.2.18 define what it means to add and multiply finite ordinals. When generalizing these two definitions to the infinite ordinals, we must take care because addition and multiplication should be binary operations, but if we defined these operations on all ordinals, their domains would not be sets by the Burali-Forti Theroem (6.1.21), resulting in the operations not being sets. Therefore, we choose an ordinal and define addition and multiplication on it. Since 1 + 1 ∉ 2, the ordinal must be a limit ordinal. DEFINITION 6.4.1 Let ๐ be a limit ordinal. For all ๐ผ, ๐ฝ ∈ ๐, โ ๐ผ + 0 = ๐ผ, โ ๐ผ + ๐ฝ + = (๐ผ + ๐ฝ)+ , โ โ ๐ผ + ๐ฝ = {๐ผ + ๐ฟ : ๐ฟ ∈ ๐ฝ} if ๐ฝ is a limit ordinal. As with addition of natural numbers (Definition 5.2.15), to prove that Definition 6.4.1 gives a binary operation, let ๐ : ๐ → ๐ be the successor function. By transfinite recursion (Corollary 6.1.24), there exist unique functions ๐๐ผ : ๐ผ → ๐ for all ๐ผ ∈ ๐ such that โ ๐๐ผ (0) = ๐ผ โ ๐๐ผ (๐ฝ + ) = ๐(๐๐ผ (๐ฝ)) = ๐๐ผ (๐ฝ)+ Section 6.4 ARITHMETIC โ for all limit ordinals ๐ฝ ∈ ๐, ๐๐ผ (๐ฝ) = โ 317 ๐๐ผ (๐ฟ). ๐ฟ∈๐ฝ Define A : ๐ × ๐ → ๐ by A(๐ผ, ๐ฝ) = ๐๐ผ (๐ฝ). By the uniqueness of each ๐๐ผ , the binary operation A is the function of Definition 6.4.1, so ๐ผ + ๐ฝ = A(๐ผ, ๐ฝ). Furthermore, take ๐ ′ to be a limit ordinal such that ๐ ⊆ ๐ ′ and define ๐ ′ : ๐ ′ → ๐ ′ to be the successor function. As above, there are unique functions ๐′๐ผ for all ๐ผ ∈ ๐ ′ with the same properties as ๐๐ผ . Notice that ๐๐ผ = ๐′๐ผ ๐ . Otherwise, ๐′๐ผ ๐ would have the same properties as ๐๐ผ yet be a different function, contradicting the uniqueness given by transfinite recursion. Next, define the binary operation A′ : ๐ ′ × ๐ ′ → ๐ ′ by A′ (๐ผ, ๐ฝ) = ๐′๐ผ (๐ฝ). Therefore, A = A′ (๐ × ๐). This implies that although addition is not defined as a binary operation on all ordinals, we can add any two ordinals and obtain the same sum independent of the ordinal on which the addition is defined. Consider ๐ ∈ ๐. Since ๐ is a limit ordinal, โ ๐+๐= {๐ + ๐ : ๐ ∈ ๐} = ๐. However, ๐ + 1 = ๐ + 0+ = (๐ + 0)+ = ๐+ = ๐ ∪ {๐}, and ๐ + 2 = ๐ + 1+ = (๐ + 1)+ = (๐ ∪ {๐})+ = ๐ ∪ {๐} ∪ {๐ ∪ {๐}} = ๐++ . Therefore, addition of infinite ordinals is not commutative. Moreover, an order isomorphism can be defined between ๐ + ๐ and ({0} × ๐) ∪ ({1} × ๐) ordered lexicographically (Exercise 4.3.16) as 0 โ (0, 0) 1 โ (0, 1) 2 โ (0, 2) … … ๐ โ (0, ๐) … … ๐ โ (1, 0) ๐+ โ (1, 1) ๐++ โ (1, 2) … … This means that ๐ + ๐ looks like ๐ followed by a copy of ๐. Generalizing, the ordinal ๐ + ๐ looks like ๐ followed by a copy of ๐. In particular, โ ๐+๐= ๐ + ๐, ๐∈๐ which means that the proof its existence requires a replacement axiom (5.1.9). All of this suggests the next result. 318 Chapter 6 ORDINALS AND CARDINALS LEMMA 6.4.2 Let ๐ be a limit ordinal and ๐ผ, ๐ฝ ∈ ๐. If ๐ฅ ∈ ๐ผ + ๐ฝ, then ๐ฅ ∈ ๐ผ or ๐ฅ ∈ ๐ผ + ๐ for some ๐ ∈ ๐ฝ. PROOF Define ๐ด = {๐ง ∈ ๐ : ∀๐ฅ(๐ฅ ∈ ๐ผ + ๐ง → ๐ฅ ∈ ๐ผ ∨ ∃๐ ∈ ๐ง[๐ฅ ∈ ๐ผ + ๐])}. Clearly, 0 ∈ ๐ด, so take ๐พ ∈ ๐ such that ๐๐พ๐(๐ , ๐พ) ⊆ ๐ด. โ Let ๐พ = ๐ฟ + for some ordinal ๐ฟ. Take ๐ฅ ∈ ๐ผ + ๐ฟ + , which implies that ๐ฅ ∈ (๐ผ + ๐ฟ)+ . This means that ๐ฅ ∈ ๐ผ + ๐ฟ or ๐ฅ = ๐ผ + ๐ฟ. If ๐ฅ ∈ ๐ผ + ๐ฟ, then we are done. If ๐ฅ = ๐ผ + ๐ฟ, then ๐ฅ ∈ (๐ผ + ๐ฟ)+ = ๐ผ + ๐ฟ + . โ Let ๐พ be a limit ordinal. Take ๐ฅ ∈ ๐ผ + ๐พ. This means that ๐ฅ ∈ ๐ผ + ๐ฟ for some ๐ฟ ∈ ๐พ. Therefore, ๐ฅ ∈ ๐ผ or ๐ฅ ∈ ๐ผ + ๐ for some ๐ ∈ ๐ฟ ⊂ ๐พ. Using Lemma 6.4.2, we can prove the next useful result. LEMMA 6.4.3 Let ๐ be a limit ordinal. If ๐ผ, ๐ฝ ∈ ๐, then ๐ผ + ๐ฝ = ๐ผ ∪ {๐ผ + ๐ : ๐ ∈ ๐ฝ}. PROOF Define ๐ด = {๐ง ∈ ๐ : ๐ผ + ๐ง = ๐ผ ∪ {๐ผ + ๐ : ๐ ∈ ๐ง}}. We have that 0 ∈ ๐ด, so assume ๐๐พ๐(๐ , ๐พ) ⊆ ๐ด, where ๐พ ∈ ๐. โ Suppose ๐พ = ๐ฟ + . Then, ๐ผ ∪ {๐ผ + ๐ : ๐ ∈ ๐ฟ + } = ๐ผ ∪ {๐ผ + ๐ : ๐ ∈ ๐ฟ} ∪ {๐ผ + ๐ฟ} = (๐ผ + ๐ฟ) ∪ {๐ผ + ๐ฟ} = (๐ผ + ๐ฟ)+ = ๐ผ + ๐ฟ+. โ Let ๐พ be a limit ordinal. Take ๐ฅ ∈ ๐ผ + ๐พ. By Lemma 6.4.2, we have ๐ฅ ∈ ๐ผ or ๐ฅ ∈ ๐ผ + ๐ for some ๐ ∈ ๐พ. If the former, we are done, so suppose the latter. In this case, the assumption gives ๐ผ + ๐ = ๐ผ ∪ {๐ผ + ๐ ′ : ๐ ′ ∈ ๐}. Thus, ๐ฅ = ๐ผ +๐ ′ for some ๐ ′ ∈ ๐ ∈ ๐พ. Conversely, if ๐ฅ ∈ ๐ผ, then ๐ฅ ∈ ๐ผ +0, so ๐ฅ ∈ ๐ผ + ๐พ. For the other case, let ๐ฅ ∈ {๐ผ + ๐ : ๐ ∈ ๐พ}. This implies that ๐ฅ ∈ (๐ผ + ๐)+ = ๐ผ + ๐ + for some ๐ ∈ ๐พ. Since ๐ + ∈ ๐พ, โ ๐ฅ∈ {๐ผ + ๐ ′ : ๐ ′ ∈ ๐พ} = ๐ผ + ๐พ. Although ordinal addition is not commutative, it does have other familiar properties as noted in the next result. Section 6.4 ARITHMETIC 319 THEOREM 6.4.4 Addition of ordinals is associative, and 0 is the additive identity. PROOF Let ๐ be a limit ordinal. Define ๐ด = {๐ฝ ∈ ๐ : 0 + ๐ฝ = ๐ฝ}. Suppose ๐พ is an ordinal such that ๐๐พ๐(๐, ๐พ) ⊆ ๐ด. We have two cases to check. โ Let ๐พ = ๐ฟ + for some ordinal ๐ฟ. Then, 0 + ๐พ = 0 + ๐ฟ + = (0 + ๐ฟ)+ = ๐ฟ + = ๐พ. โ Let ๐พ be a limit ordinal. We then have โ โ 0+๐พ = (0 + ๐ฟ) = ๐ฟ = ๐พ. ๐ฟ∈๐พ ๐ฟ∈๐พ In both cases, ๐พ ∈ ๐ด, so by transfinite induction, ๐ด = ๐. Since we can also prove that ๐ = {๐ฝ ∈ ๐ : ๐ฝ + 0 = ๐ฝ}, we conclude that 0 is the additive identity. To prove that ordinal addition is associative, we proceed by transfinite induction. Let ๐ผ, ๐ฝ, ๐ฟ ∈ ๐. Define ๐ต = {๐ง ∈ ๐ : ๐ผ + (๐ฝ + ๐ง) = (๐ผ + ๐ฝ) + ๐ง}. Assume that ๐๐พ๐(๐, ๐พ) ⊆ ๐ต. Then, ๐พ ∈ ๐ต because by Lemma 6.4.3, we have โ ๐ผ + (๐ฝ + ๐พ) = ๐ผ ∪ {๐ผ + ๐ฅ : ๐ฅ ∈ ๐ฝ + ๐พ} โ = ๐ผ ∪ {๐ผ + ๐ฅ : ๐ฅ ∈ ๐ฝ ∨ ∃๐(๐ ∈ ๐พ → ๐ฅ = ๐ฝ + ๐)} โ = ๐ผ ∪ ({๐ผ + ๐ฅ : ๐ฅ ∈ ๐ฝ} ∪ {๐ผ + (๐ฝ + ๐) : ๐ ∈ ๐พ}) โ โ = ๐ผ ∪ {๐ผ + ๐ฅ : ๐ฅ ∈ ๐ฝ} ∪ {(๐ผ + ๐ฝ) + ๐ : ๐ ∈ ๐พ} โ = (๐ผ ∪ ๐ฝ) ∪ {๐ผ + ๐ฝ + ๐ : ๐ ∈ ๐พ} = (๐ผ + ๐ฝ) + ๐พ. Note that Exercise 3.4.28(e) is used on the fourth equality. DEFINITION 6.4.5 Let ๐ be a limit ordinal. For all ๐ผ, ๐ฝ ∈ ๐, โ ๐ผ ⋅ 0 = 0, 320 Chapter 6 ORDINALS AND CARDINALS โ ๐ผ ⋅ ๐ฝ + = ๐ผ ⋅ ๐ฝ + ๐ผ, โ โ ๐ผ ⋅ ๐ฝ = {๐ผ ⋅ ๐ฟ : ๐ฟ ∈ ๐ฝ} if ๐ฝ is a limit ordinal. As with ordinal addition, ordinal multiplication is well-defined by transfinite recursion (Exercise 3). Also, as with addition of ordinals, certain expected properties hold, while others do not. The next two results are the analog of Lemma 6.4.3 and Theorem 6.4.4. Their proofs are Exercise 4. LEMMA 6.4.6 If ๐ผ and ๐ฝ are ordinals, ๐ผ ⋅ ๐ฝ = {๐ผ ⋅ ๐ + ๐ : ๐ ∈ ๐ฝ ∧ ๐ ∈ ๐ผ}. THEOREM 6.4.7 Multiplication of ordinals is associative, and 1 is the multiplicative identity. Observe that 0⋅๐= โ {0 ⋅ ๐ : ๐ ∈ ๐} = 0. Also, ๐ ⋅ 1 = 1 ⋅ ๐ = ๐ (Theorem 6.4.7), so multiplication on the right by a natural number behaves as we would expect in that ๐⋅2=๐⋅1+๐=๐+๐ (6.1) and ๐ ⋅ 3 = ๐ ⋅ 2 + ๐ = ๐ + ๐ + ๐, but 2⋅๐= โ {2 ⋅ ๐ : ๐ ∈ ๐} = ๐ 3⋅๐= โ {3 ⋅ ๐ : ๐ ∈ ๐} = ๐. and Hence, multiplication of ordinals is not commutative. Because of this, it is not surprising that there are issues with the distributive law. For ordinals, there is a left distributive law but not a right distributive law (Exercise 9). THEOREM 6.4.8 [Left Distributive Law] ๐ผ ⋅ (๐ฝ + ๐ฟ) = ๐ผ ⋅ ๐ฝ + ๐ผ ⋅ ๐ฟ for all ordinals ๐ผ, ๐ฝ, and ๐ฟ. Since addition of ordinals is an operation on a limit ordinal ๐, we know that for all ordinals ๐ผ, ๐ฝ, ๐ฟ ∈ ๐, ๐ผ =๐ฝ ⇒๐ผ+๐ฟ =๐ฝ+๐ฟ and ๐ผ = ๐ฝ ⇒ ๐ผ ⋅ ๐ฟ = ๐ฝ ⋅ ๐ฟ. The next result gives information regarding how ordinal multiplication behaves with an inequality. Section 6.4 ARITHMETIC 321 THEOREM 6.4.9 Let ๐ผ, ๐ฝ, and ๐ฟ be ordinals. โ If ๐ผ ⊂ ๐ฝ, then ๐ฟ + ๐ผ ⊂ ๐ฟ + ๐ฝ. โ If ๐ผ ⊆ ๐ฝ, then ๐ผ + ๐ฟ ⊆ ๐ฝ + ๐ฟ. โ If ๐ผ ⊂ ๐ฝ and ๐ฟ ≠ 0, then ๐ฟ ⋅ ๐ผ ⊂ ๐ฟ ⋅ ๐ฝ. โ If ๐ผ ⊆ ๐ฝ, then ๐ผ ⋅ ๐ฟ ⊆ ๐ฝ ⋅ ๐ฟ. PROOF We prove the third part, leaving the others to Exercise 7. Let ๐ผ ⊂ ๐ฝ and ๐ฟ ≠ 0. Then, by Lemma 6.4.6, ๐ฟ ⋅ ๐ผ = {๐ฟ ⋅ ๐ + ๐ : ๐ ∈ ๐ผ ∧ ๐ ∈ ๐ฟ} ⊂ {๐ฟ ⋅ ๐ + ๐ : ๐ ∈ ๐ฝ ∧ ๐ ∈ ๐ฟ} = ๐ฟ ⋅ ๐ฝ. Finally, we define exponentiation so that it generalizes exponentiation on ๐ (Exercise 5.2.17). It is a binary operation (Exercise 3). DEFINITION 6.4.10 Let ๐ be a limit ordinal and ๐ผ, ๐ฝ ∈ ๐. โ ๐ผ0 = 1 + โ ๐ผ๐ฝ = ๐ผ๐ฝ ⋅ ๐ผ โ ๐ผ๐ฝ = โ ๐ฟ {๐ผ : ๐ฟ ∈ ๐ฝ} if ๐ฝ is a limit ordinal. For example, + ๐1 = ๐0 = ๐0 ⋅ ๐ = 1 ⋅ ๐ = ๐, and + ๐2 = ๐1 = ๐1 ⋅ ๐ = ๐ ⋅ ๐, so raising an ordinal to a natural number appears to behave as expected. Also, 1๐ = โ โ {1๐ : ๐ ∈ ๐} = {1} = 1 and 2๐ = โ {2๐ : ๐ ∈ ๐} = ๐. We leave the proof of the following properties of ordinal exponentiation to Exercise 11. 322 Chapter 6 ORDINALS AND CARDINALS THEOREM 6.4.11 Let ๐ผ, ๐ฝ, ๐ฟ be ordinals. โ ๐ผ ๐ฝ+๐ฟ = ๐ผ ๐ฝ ⋅ ๐ผ ๐ฟ . โ (๐ผ ๐ฝ )๐ฟ = ๐ผ ๐ฝ⋅๐ฟ . Cardinals Even though the finite cardinals are the same sets as the finite ordinals and every infinite cardinal is a limit ordinal, the arithmetic defined on the cardinals will only apply when the given sets are viewed as cardinals. The definitions for addition, multiplication, and exponentiation for cardinals are not given recursively. DEFINITION 6.4.12 Let ๐ and ๐ be cardinals. โ ๐ + ๐ = |(๐ × {0}) ∪ (๐ × {1})| โ ๐ ⋅ ๐ = |๐ × ๐| โ ๐ ๐ = |๐ ๐ |. Since ordinal arithmetic was simply a generalization of the arithmetic of natural numbers, that the addition worked in the finite case was not checked. Here this is not the case, so let us check 2 + 3 = |(2 × {0}) ∪ (3 × {1})| = |{(0, 0), (1, 0), (0, 1), (1, 1), (2, 1)}| = 5. Also, 3 + 2 = |(3 × {0}) ∪ (2 × {1})| = |{(0, 0), (1, 0), (2, 0), (0, 1), (1, 1)}| = 5. This suggests that cardinal addition is commutative. This and other basic results are given in the next theorem. Some details of the proof are left to Exercise 14. THEOREM 6.4.13 Addition of cardinals is associative and commutative, and 0 is the additive identity. PROOF Let ๐ , ๐, and ๐ be cardinals. Addition is associative because (๐ × {0}) ∪ [([๐ × {0}] ∪ [๐ × {1}]) × {1}] is equinumerous to [([๐ × {0}] ∪ [๐ × {1}]) × {0}] ∪ (๐ × {1}), Section 6.4 ARITHMETIC 323 it is commutative because (๐ × {0}) ∪ (๐ × {1}) ≈ (๐ × {1}) ∪ (๐ × {0}), and 0 is the additive identity because (๐ × {0}) ∪ (0 × {1}) = (๐ × {0}). Now let us multiply 2 ⋅ 3 = |{0, 1} × {0, 1, 2}| = |{(0, 0), (0, 1), (0, 2), (1, 0), (1, 1), (1, 2)}| = 6 and 3 ⋅ 2 = |{0, 1, 2} × {0, 1}| = |{(0, 0), (0, 1), (1, 0), (1, 1), (2, 0), (2, 1)}| = 6. As with cardinal addition, it seems that cardinal multiplication is commutative. This and other results are stated in the next theorem. Its proof is left to Exercise 15. THEOREM 6.4.14 Multiplication of cardinals is associative and commutative, and 1 is the multiplicative identity. As with ordinal arithmetic (Theorem 6.4.8), cardinal arithmetic has a left distribution law, but since cardinal multiplication is commutative, cardinal arithmetic also has a right distribution law. THEOREM 6.4.15 [Distributive Law] Let ๐ , ๐, and ๐ be cardinals. โ ๐ ⋅ (๐ + ๐) = ๐ ⋅ ๐ + ๐ ⋅ ๐. โ (๐ + ๐) ⋅ ๐ = ๐ ⋅ ๐ + ๐ ⋅ ๐. PROOF The left distribution law holds because ๐ × ([๐ × {0}] ∪ [๐ × {1}]) ≈ ([๐ × ๐] × {0}) ∪ ([๐ × ๐] × {1}). The remaining details of the proof are left to Exercise 16. The last of the operations of Definition 6.4.12 is exponentiation. Let ๐ be a cardinal. Observe that since there is exactly one function 0 → ๐ (Exercise 4.4.16), ๐ 0 = |0 ๐ | = 1 and if ๐ ≠ 0, 0๐ = |๐ 0| = 0 because there are no functions ๐ → 0, and by Theorem 6.2.13, ๐ โบ 2๐ . In addition, cardinal exponentiation follows other expected rules. 324 Chapter 6 ORDINALS AND CARDINALS THEOREM 6.4.16 Let ๐ , ๐, and ๐ be cardinals. โ ๐ ๐+๐ = ๐ ๐ ⋅ ๐ ๐ . โ (๐ ⋅ ๐)๐ = ๐ ๐ ⋅ ๐๐ . โ (๐ ๐ )๐ = ๐ ๐⋅๐ . PROOF We prove the last part and leave the rest to Exercise 17. Define ( ) ๐ : ๐ ๐ ๐ → ๐×๐ ๐ such that for all ๐ ∈ ( ๐ ๐๐ ) , ๐ผ ∈ ๐ and ๐ฝ ∈ ๐, ๐(๐)(๐ผ, ๐ฝ) = ๐(๐ผ)(๐ฝ). We claim that ๐ is a bijection. ( ) โ Let ๐1 , ๐2 ∈ ๐ ๐ ๐ . Assume that ๐(๐1 ) = ๐(๐2 ). Take ๐ผ ∈ ๐ and ๐ฝ ∈ ๐. Then, ๐1 (๐ผ)(๐ฝ) = ๐(๐1 )(๐ผ, ๐ฝ) = ๐(๐2 )(๐ผ, ๐ฝ) = ๐2 (๐ผ)(๐ฝ). Therefore, ๐1 = ๐2 , and ๐ is one-to-one. โ Let ๐ ∈ ๐×๐ ๐ . For ๐ผ ∈ ๐ and ๐ฝ ∈ ๐, define ๐ ′ (๐ผ)(๐ฝ) = ๐(๐ผ, ๐ฝ). This implies that ๐(๐ ′ ) = ๐, so ๐ is onto. Since every infinite cardinal number can be represented by an ℵ, let us determine how to calculate using this notation. We begin with a lemma. LEMMA 6.4.17 If ๐ ∈ ๐ and ๐ an infinite cardinal, ๐ + ๐ = ๐ ⋅ ๐ = ๐ . PROOF Let ๐ be a natural number. Define ๐ : (๐ × {0}) ∪ (๐ × {1}) → ๐ by ๐(๐, 0) = ๐ for all ๐ < ๐, ๐(๐ผ, 1) = ๐ + ๐ผ for all ๐ผ ∈ ๐ . For example, if ๐ = 5, then ๐(4, 0) = 4, ๐(0, 1) = 5, ๐(6, 1) = 11, ๐(๐, 1) = ๐, and ๐(๐ + 1, 1) = ๐ + 1. Therefore, ๐ + ๐ = ๐ because ๐ is a bijection. That ๐ ⋅ ๐ = ๐ is left to Exercise 18. Lemma 6.4.17 allows us to compute with alephs. Section 6.4 ARITHMETIC 325 THEOREM 6.4.18 Let ๐ผ and ๐ฝ be ordinals and ๐ ∈ ๐. โ ๐ + ℵ๐ผ = ๐ ⋅ ℵ๐ผ = ℵ๐ผ . { ℵ๐ผ โ ℵ๐ผ + ℵ๐ฝ = ℵ๐ผ ⋅ ℵ๐ฝ = ℵ๐ฝ if ๐ผ ≥ ๐ฝ, otherwise. PROOF The first part follows by Lemma 6.4.17. To prove the addition equation from the second part, let ๐ผ and ๐ฝ be ordinals. Without loss of generality assume that ๐ผ ⊆ ๐ฝ. By Definition 6.4.12, ℵ๐ผ + ℵ๐ฝ = |(ℵ๐ผ × {0}) ∪ (ℵ๐ฝ × {1})|. Since ℵ๐ฝ = |ℵ๐ฝ × {1}|, ℵ๐ฝ โชฏ |(ℵ๐ผ × {0}) ∪ (ℵ๐ฝ × {1})|. Furthermore, because ℵ๐ผ ⊆ ℵ๐ฝ , |(ℵ๐ผ × {0}) ∪ (ℵ๐ฝ × {1})| โชฏ |(ℵ๐ฝ × {0}) ∪ (ℵ๐ฝ × {1})| = |ℵ๐ฝ × {0, 1}| = ℵ๐ฝ . Because of Lemma 6.4.17, the last equality holds since ℵ๐ฝ is infinite and {0, 1} is finite. Hence, ℵ๐ผ + ℵ๐ฝ ≈ ℵ๐ฝ by the Cantor–Schröder–Bernstein theorem (6.2.11). Since both ℵ๐ผ + ℵ๐ฝ and ℵ๐ฝ are cardinals, ℵ๐ผ + ℵ๐ฝ = ℵ๐ฝ . For example, ℵ5 + ℵ9 = ℵ5 ⋅ ℵ9 = ℵ9 . More generally, we quickly have the following corollary by Theorem 6.3.19. COROLLARY 6.4.19 For every infinite cardinal ๐ , both ๐ + ๐ = ๐ and ๐ ⋅ ๐ = ๐ . Exercises 1. Let ๐ = {๐ด0 , ๐ด1 , … , ๐ด๐−1 } be a pairwise disjoint family of sets. Assuming that โ the sets are distinct, prove that the cardinality of ๐ is equal to the sum |๐ด0 | + |๐ด1 | + · · · + |๐ด๐−1 |. 2. Let ๐ผ and ๐ฝ be ordinals. Let ๐ : ๐ผ → ๐ฝ be a function such that ๐(๐ฟ) ∈ ๐(๐พ) for all ๐ฟ ∈ ๐พ ∈ ๐ผ (Compare Lemma 6.1.2). Prove the following. 326 Chapter 6 ORDINALS AND CARDINALS (a) ๐ผ ⊆ ๐ฝ. (b) ๐ฟ ⊆ ๐(๐ฟ) for all ๐ฟ ∈ ๐ผ. 3. Let ๐ be a limit ordinal. Use transfinite recursion to prove that ordinal multiplication (Definition 6.4.5) and ordinal exponentiation (Definition 6.4.10) are binary operations on ๐. 4. Prove Lemma 6.4.6 and Theorem 6.4.7. 5. For every ordinal ๐ผ, prove that 0 ⋅ ๐ผ = 0. 6. Prove that for all ๐ ∈ ๐, ๐ + ๐ = ๐ ⋅ ๐ as ordinals. Can this be generalized to ๐ผ + ๐ฝ = ๐ผ ⋅ ๐ฝ for ordinals ๐ผ ∈ ๐ฝ with ๐ฝ being infinite? If so, is the ๐ผ ∈ ๐ฝ required? 7. Prove the remaining parts of Theorem 6.4.9. 8. Find ordinals ๐ผ, ๐ฝ, and ๐ฟ such that the following properties hold. (a) ๐ผ ⊂ ๐ฝ but ๐ฝ + ๐ฟ ⊆ ๐ผ + ๐ฟ. (b) ๐ผ ⊂ ๐ฝ but ๐ฝ ⋅ ๐ฟ ⊆ ๐ผ ⋅ ๐ฟ. 9. Let ๐ผ, ๐ฝ, and ๐ฟ be ordinals. (a) Prove that ๐ผ ⋅ (๐ฝ + ๐ฟ) = ๐ผ ⋅ ๐ฝ + ๐ผ ⋅ ๐ฟ. (b) Show that it might be the case that (๐ผ + ๐ฝ) ⋅ ๐ฟ ฬธ≠ ๐ผ ⋅ ๐ฟ + ๐ฝ ⋅ ๐ฟ. (c) For which ordinals does the right distribution law hold? 10. Let ๐ผ, ๐ฝ, and ๐ฟ be ordinals. Prove the following. (a) ๐ผ + ๐ฝ ∈ ๐ผ ⋅ ๐ฟ if and only if ๐ฝ ∈ ๐ฟ. (b) ๐ผ + ๐ฝ = ๐ผ + ๐ฟ if and only if ๐ฝ = ๐ฟ. (c) If ๐ผ + ๐ฟ ∈ ๐ฝ + ๐ฟ, then ๐ผ ∈ ๐ฝ. (d) ๐ผ ⋅ ๐ฝ ∈ ๐ผ ⋅ ๐ฟ if and only if ๐ฝ ∈ ๐ฟ and ๐ผ ≠ 0. (e) If ๐ผ ⋅ ๐ฝ = ๐ผ ⋅ ๐ฟ, then ๐ฝ = ๐ฟ or ๐ผ = 0. 11. Prove Theorem 6.4.11. 12. Let ๐ผ, ๐ฝ, and ๐ฟ be ordinals. Prove the following. (a) ๐ผ ๐ฝ ∈ ๐ผ ๐ฟ if and only if ๐ฝ ∈ ๐ผ and 1 ∈ ๐ผ. (b) If ๐ผ ∈ ๐ฝ, then ๐ผ ๐ฟ ⊆ ๐ฝ ๐ฟ . (c) If ๐ผ ๐ฟ ∈ ๐ฝ ๐ฟ , then ๐ผ ∈ ๐ฝ. (d) If 1 ∈ ๐ผ, then ๐ฝ ⊆ ๐ผ ๐ฝ . (e) If ๐ผ ∈ ๐ฝ, there exists a unique ordinal ๐พ such that ๐ผ + ๐พ = ๐ฝ. 13. Prove that the ordinal ๐ + ๐ is not a cardinal. 14. Provide the details for the proof of Theorem 6.4.13. 15. Prove Theorem 6.4.14. 16. Provide the details to the proof of Theorem 6.4.15. 17. Prove the remaining parts of Theorem 6.4.16. 18. Prove that for any natural number ๐ and infinite cardinal ๐ , ๐ ⋅ ๐ = ๐ . 327 Section 6.5 LARGE CARDINALS 19. Prove that if ๐ is inifinte, (๐ + )๐ = 2๐ . 20. Is there a cancellation law for ordinals or for cardinals? 21. Prove that ๐ +๐ = ๐ ⋅๐ = ๐, given that ๐ is a countable cardinal and ๐ is an infinite cardinal. 22. Generalize Exercise 21 by showing that if ๐ and ๐ are cardinals with ๐ infinite such that ๐ โชฏ ๐, then ๐ + ๐ = ๐ ⋅ ๐ = ๐. 23. Let ๐ and ๐ be cardinals with ℵ0 โชฏ ๐. Show that if 2 โชฏ ๐ โบ ๐, then ๐ ๐ = 2๐ . 24. Prove for all ordinals ๐ผ that |๐ผ| โบ ℵ๐ผ . 25. For all ordinals ๐ผ, define Hartogs’ function by Γ(๐ผ) = {๐ฝ : ๐ฝ is an ordinal ∧ ๐ฝ โชฏ ๐ผ}. Prove that Γ(๐ผ) is an ordinal and ๐ผ โบ Γ(๐ผ) for all ordinals ๐ผ. 26. Using Exercise 25, define an initial number ๐๐ผ as follows. ๐0 = ๐, ๐๐พ + = Γ(๐๐พ ), โ ๐๐พ = ๐๐ฟ if ๐พ is a limit ordinal. ๐ฟ∈๐พ Prove that initial numbers are limit ordinals and ๐1 is the first uncountable ordinal. 27. Prove that there is no greatest initial number. 28. For all ordinals ๐ผ, show that ℵ๐ผ = |๐๐ผ |. Can we write ℵ๐ผ = ๐๐ผ ? 29. For all countable ordinals ๐ผ, show that 2ℵ๐ผ = ℵ๐ผ+ implies that 2ℵ๐1 = ℵ๐1 +1 . 6.5 LARGE CARDINALS Since every cardinal is a limit ordinal, every cardinal ๐ can be written in the form โ ๐ = {๐ผ : ๐ผ ∈ ๐ }. In particular, for the limit cardinal ℵ๐+๐ , we have that โ ℵ๐+๐ = {๐ผ : ๐ผ ∈ ℵ๐+๐ }. (6.2) Notice that (6.2) is the union of a set with ℵ๐+๐ elements. However, we also have โ ℵ๐+๐ = {ℵ๐ผ : ๐ผ ∈ ๐ + ๐}, (6.3) and ℵ๐+๐ = โ {ℵ๐+๐ : ๐ ∈ ๐}. (6.4) Both (6.3) and (6.4) are unions of sets with ℵ0 elements. The next definition is introduced to handle these differences. Since infinite cardinals are limit ordinals, the definition is given for limit ordinals. 328 Chapter 6 ORDINALS AND CARDINALS DEFINITION 6.5.1 The cofinality of a limit ordinal ๐ผ is denoted by ๐ผ๐ฟ (๐ผ) and defined as the least โ cardinal ๐ such that there exists โฑ ⊆ ๐ผ with |โฑ | = ๐ and ๐ผ = โฑ . โ Observe ๐ผ. There can be other sets ๐ต such that โ that ๐ผ๐ฟ(๐ผ) โชฏ |๐ผ| because ๐ผ = ๐ผ = ๐ต, and they might have different cardinalities. Any set of ordinals ๐ต with this property is said to be cofinal in ๐ผ. Moreover, we can write ๐ต = {๐ฝ๐ฟ : ๐ฟ ∈ ๐ } for some cardinal ๐ , so โ ๐ผ= ๐ฝ๐ฟ , ๐ฟ∈๐ and when ๐ = ๐ผ๐ฟ (๐ผ), โ ๐ผ= ๐ฝ๐ฟ . ๐ฟ∈๐ผ๐ฟ(๐ผ) EXAMPLE 6.5.2 Since the finite union of a finite set is finite, the cofinality of any infinite set must be infinite. Therefore, because โ ๐= ๐, ๐∈๐ we see that ๐ผ๐ฟ (๐) = ℵ0 . Also, since ℵ๐ = โ ℵ๐ , ๐∈๐ we conclude that ๐ผ๐ฟ (ℵ๐ ) = ℵ0 and {ℵ๐ : ๐ ∈ ๐} is cofinal in ℵ๐ . However, ๐ผ๐ฟ (ℵ1 ) = ℵ1 because the countable union of countable sets is countable (Theorem 6.3.17). Regular and Singular Cardinals Since infinite cardinals are limit ordinals, we can classify the cardinals based on their cofinalities. We make the following definition. DEFINITION 6.5.3 A cardinal ๐ is regular if ๐ = ๐ผ๐ฟ (๐ ), else it is singular. Notice that Example 6.5.2 shows that ℵ0 and ℵ1 are regular but ℵ๐ is singular. This implies a direction to follow to characterize the cardinal numbers. We begin with the successors. THEOREM 6.5.4 Successor cardinals are regular. Section 6.5 LARGE CARDINALS PROOF Let ๐ผ be an ordinal and let โฑ ⊆ ℵ๐ผ+1 such that ℵ๐ผ+1 = |๐ฝ| โชฏ ℵ๐ผ for all ๐ฝ ∈ โฑ . Thus, โ โฑ โชฏ |โฑ | ⋅ ℵ๐ผ . โ 329 โฑ . This implies that By Theorem 6.4.18, we conclude that ℵ๐ผ+1 โชฏ |โฑ |, so ๐ผ๐ฟ (ℵ๐ผ+1 ) = ℵ๐ผ+1 by the Cantor–Schröder–Bernstein theorem (6.2.11). Since ℵ0 is regular, Theorem 6.5.4 tells us where to find the singular cardinals. COROLLARY 6.5.5 A singular cardinal is an uncountable limit cardinal. Because we have not proved the converse of Corollary 6.5.5, we investigate the cofinality of certain limit cardinals in an attempt to determine which limit cardinals are singular. THEOREM 6.5.6 If ๐ผ is a limit ordinal, ๐ผ๐ฟ (ℵ๐ผ ) = ๐ผ๐ฟ(๐ผ). PROOF The proof uses the Cantor–Schröder–Bernstein theorem (6.2.11). Let ๐ผ be a limit ordinal. โ We first show that ๐ผ๐ฟ (ℵ๐ผ ) โชฏ ๐ผ๐ฟ(๐ผ). Let ๐ด be a cofinal subset of ๐ผ such that |๐ด| = ๐ผ๐ฟ(๐ผ). Notice that if ๐ฝ ∈ ๐ด, then ℵ๐ฝ ∈ ℵ๐ผ . On the other hand, take ๐ฟ ∈ ℵ๐ผ , which implies that |๐ฟ| โบ ℵ๐ผ . Therefore, there exists ๐พ ∈ ๐ผ such that |๐ฟ| = ℵ๐พ . Since ๐ด is cofinal in ๐ผ, there โ exists ๐ ∈ ๐ด such that ๐ฟ ∈ ๐. Hence, ๐ฟ ∈ ℵ๐ , which implies that ๐ฟ ∈ {ℵ๐ฝ : ๐ฝ ∈ ๐ด}. Therefore, โ ℵ๐ผ = {ℵ๐ฝ : ๐ฝ ∈ ๐ด}, from which follows, ๐ผ๐ฟ(ℵ๐ผ ) โชฏ |๐ด| = ๐ผ๐ฟ(๐ผ). โ We now show ๐ผ๐ฟ (๐ผ) โชฏ ๐ผ๐ฟ (ℵ๐ผ ). Let ๐ด ⊆ ℵ๐ผ so that ℵ๐ผ = ๐ผ๐ฟ(ℵ๐ผ ). Define โ โ ๐ด and |๐ด| = โฑ = {๐ฟ ∈ ๐ผ : ∃๐พ(๐พ ∈ ๐ด ∧ |๐พ| = ℵ๐ฟ )}. Then, ๐ฝ = โฑ is an ordinal by Theorem 6.1.16. For all ๐ ∈ ๐ด, we have that ๐ ∈ ℵ๐ฝ+1 because |๐| โชฏ ℵ๐ฝ . Hence, โ ๐ด ⊆ ℵ๐ฝ , โ from which follows that ๐ผ ∈ ๐ฝ, which means that ๐ผ ⊆ โฑ . Therefore, โ since the elements of โฑ are ordinals of ๐ผ, we have that ๐ผ = โฑ , so ๐ผ๐ฟ(๐ผ) โชฏ |โฑ | = |๐ด| = ๐ผ๐ฟ (ℵ๐ผ ). 330 Chapter 6 ORDINALS AND CARDINALS Theorem 6.5.6 confirms the result of Example 6.5.2 because ๐ผ๐ฟ(ℵ๐ ) = ๐ผ๐ฟ(๐) = ℵ0 . (6.5) ๐ผ๐ฟ (ℵ๐+๐ ) = ๐ผ๐ฟ(๐ + ๐) = ℵ0 , (6.6) Also, so ℵ๐+๐ is singular. Observe that by (6.5) and (6.6), ๐ผ๐ฟ (๐ผ๐ฟ(ℵ๐ )) = ๐ผ๐ฟ (๐ผ๐ฟ(๐)) = ๐ผ๐ฟ (ℵ0 ) = ℵ0 and ๐ผ๐ฟ (๐ผ๐ฟ(ℵ๐+๐ )) = ๐ผ๐ฟ (๐ผ๐ฟ(๐ + ๐)) = ๐ผ๐ฟ(ℵ0 ) = ℵ0 . The next result generalizes this and proves that ๐ผ๐ฟ(๐ผ๐ฟ (๐ผ)) = ๐ผ๐ฟ (๐ผ) for every limit ordinal ๐ผ. THEOREM 6.5.7 For any limit ordinal ๐ผ, ๐ผ๐ฟ (๐ผ) is a regular cardinal. PROOF โ Let ๐ผ be a limit ordinal โ and write ๐ผ = {๐ด๐พ : ๐พ ∈ ๐ผ๐ฟ (๐ผ)}. For every ordinal ๐พ ∈ ๐ผ๐ฟ (๐ผ), define ๐ผ๐พ = ๐ฟ∈๐พ ๐ด๐ฟ . Then, {๐ผ๐พ : ๐พ ∈ ๐ผ๐ฟ (๐ผ)} is a chain of ordinals (Theorem 6.1.16) and โ ๐ผ= {๐ผ๐พ : ๐พ ∈ ๐ผ๐ฟ (๐ผ)}. Now write ๐ผ๐ฟ (๐ผ) = Define โ {๐ฝ๐พ : ๐พ ∈ ๐ผ๐ฟ(๐ผ๐ฟ(๐ผ))}. ๐ต = {๐ผ๐ฝ๐พ : ๐พ ∈ ๐ผ๐ฟ (๐ผ๐ฟ(๐ผ))}. Let ๐ ∈ ๐ผ. This implies that ๐ ∈ ๐ผ๐พ0 for some ๐พ0 ∈ ๐ผ๐ฟ (๐ผ). Then, ๐พ0 ∈ ๐ฝ๐พ1 for some ๐พ1 ∈ ๐ผ๐ฟ (๐ผ๐ฟ(๐ผ)). Hence, ๐ ∈ ๐ผ๐พ0 ⊆ ๐ผ๐ฝ๐พ ∈ ๐ต, 1 โ โ so ๐ ∈ ๐ต. Therefore, ๐ผ = ๐ต, and this implies that ๐ผ๐ฟ(๐ผ) โชฏ ๐ผ๐ฟ (๐ผ๐ฟ(๐ผ)). Since the opposite inequality always holds, by the Cantor–Schröder–Bernstein theorem (6.2.11), ๐ผ๐ฟ (๐ผ) = ๐ผ๐ฟ (๐ผ๐ฟ(๐ผ)), which means that ๐ผ๐ฟ (๐ผ) is regular. Although the Continuum Hypothesis cannot be proved, it is possible to discover some information about the value of 2ℵ0 . Notice how its proof resembles Cantor’s diagonalization (page 303). It is due to König (1905). THEOREM 6.5.8 [König] If ๐ is an infinite cardinal and ๐ผ๐ฟ (๐ ) โชฏ ๐, then ๐ โบ ๐ ๐ . Section 6.5 LARGE CARDINALS 331 PROOF Suppose that ๐ is am infinite cardinal number and ๐ผ๐ฟ (๐ ) = ๐. Write โ ๐ = {๐ฟ๐ผ : ๐ผ ∈ ๐}. Let โฑ = {๐๐ฝ : ๐ฝ ∈ ๐ } be a subset of ๐ ๐ . Define ๐ : ๐ → ๐ such that ๐(๐ผ) = least element of ๐ โงต {๐๐ฝ (๐ฟ๐ผ ) : ๐ฝ ∈ ๐ฟ๐ผ }. For any ๐ผ ∈ ๐, ๐(๐ผ) ≠ ๐๐ฝ (๐ฟ๐ผ ) for all ๐ฝ ∈ ๐ฟ๐ผ . Therefore, ๐ ≠ ๐๐ฝ for all ๐ฝ ∈ ๐ฟ๐ผ . (6.7) Since (6.7) is true for all ๐ผ ∈ ๐ and {๐ฟ๐ผ : ๐ผ ∈ ๐} is cofinal in ๐ , we conclude that ๐ ≠ ๐๐ฝ for all ๐ฝ ∈ ๐ . Therefore, ๐ ∉ โฑ , so ๐ โบ ๐ ๐ . Note that the same argument leads to this conclusion if ๐ผ๐ฟ (๐ ) โบ ๐ (Exercise 4). COROLLARY 6.5.9 Let ๐ be an infinite cardinal. Then, ๐ โบ ๐ผ๐ฟ(2๐ ). PROOF Suppose that ๐ผ๐ฟ(2๐ ) โชฏ ๐ . By König’s theorem (6.5.8), Theorem 6.4.16, and Corollary 6.4.19, ๐ 2๐ โบ (2๐ )๐ผ๐ฟ(2 ) โชฏ (2๐ )๐ = 2๐ ⋅๐ = 2๐ . By Corollary 6.5.9, ℵ0 โบ ๐ผ๐ฟ(2ℵ0 ), but by Example 6.5.2, we know that ๐ผ๐ฟ (ℵ๐ ) = ℵ0 , so ๐ผ๐ฟ(ℵ๐ ) โบ ๐ผ๐ฟ(2ℵ0 ). Hence, even though we cannot prove what the cardinality of 2ℵ0 is, we do know that it is not ℵ๐ . Inaccessible Cardinals As we have noted, ℵ0 is both a regular and a limit cardinal. Are there any others with this property? DEFINITION 6.5.10 A regular limit cardinal that is uncountable is called weakly inaccessible. It is not possible using the axioms of ๐ญ๐๐ to prove the existence of a weakly inaccessible cardinal. Here is another class of cardinals “beyond” the weakly inaccessible cardinals. 332 Chapter 6 ORDINALS AND CARDINALS DEFINITION 6.5.11 The cardinal ๐ is strongly inaccessible if it is an uncountable regular cardinal such that 2๐ โบ ๐ for all ๐ โบ ๐ . Since every strongly inaccessible cardinal is weakly inaccessible (Exercise 1), it is not possible to prove from the axioms of ๐ญ๐๐ that a strongly inaccessible cardinal exists. However, it is apparent that assuming ๐ฆ๐ข๐ง, ๐ is a weakly inaccessible cardinal if and only if ๐ is a strongly inaccessible cardinal (Exercise 10). Cardinal numbers such as these are known as large cardinals because assumptions beyond the axioms of ๐ญ๐๐ are required to “reach” them. Exercises 1. Prove that every strongly inaccessible cardinal is weakly inaccessible. 2. Let ๐ ∈ ๐. Show that ๐ผ๐ฟ(ℵ๐ ) = ℵ๐ . 3. For all limit ordinals ๐ผ, ๐ฝ, and ๐ฟ, show that if ๐ผ is cofinal in ๐ฝ and ๐ฝ is cofinal in ๐ฟ, then ๐ผ is cofinal in ๐ฟ. 4. Rewrite the proof of König’s theorem (6.5.8) assuming that ๐ผ๐ฟ (๐ ) โบ ๐. 5. Let ๐ผ and ๐ฝ be limit ordinals. Prove that ๐ผ๐ฟ (๐ผ) = ๐ผ๐ฟ (๐ผ) if and only if (๐ผ, ⊆) and (๐ฝ, ⊆) have order isomorphic cofinal subsets. 6. Let ๐ผ be a countable limit ordinal. Show that ๐ผ๐ฟ (๐ผ) = ℵ0 . 7. Let ๐ and ๐ be cardinals such that ๐ is infinite and 2 โชฏ ๐. Show the following. (a) ๐ โบ ๐ผ๐ฟ(๐๐ ). (b) ๐ โบ ๐ ๐ผ๐ฟ(๐ ) . 8. Assume ๐ฆ๐ข๐ง and let ๐ผ and ๐ฝ be ordinals. Prove. ℵ (a) If ℵ๐ฝ โบ ๐ผ๐ฟ(ℵ๐ผ ), then ℵ๐ผ ๐ฝ = ℵ๐ผ . ℵ (b) If ๐ผ๐ฟ(ℵ๐ผ ) โชฏ ℵ๐ฝ โบ ℵ๐ผ , then ℵ๐ผ ๐ฝ = ℵ๐ผ+ . 9. Let ๐ผ be a limit ordinal. Show that ๐ผ๐ฟ (i๐ผ ) = ๐ผ๐ฟ (๐ผ). (See Exercise 6.3.22.) 10. Prove that ๐ฆ๐ข๐ง implies that a cardinal is weakly inaccessible if and only if it is strongly inaccessible. 11. Let ๐ be a cardinal. Prove the following biconditionals. (a) ๐ is weakly inaccessible if and only if ๐ is regular and ℵ๐ = ๐ . (b) ๐ is strongly inaccessible if and only if ๐ is regular and i๐ = ๐ . CHAPTER 7 MODELS 7.1 FIRST-ORDER SEMANTICS We now return to logic. In Section 1.5, we proved that propositional logic is both sound and complete (Theorems 1.5.9 and 1.5.15). We now do the same for first-order logic. We have an added complication in that this logic involves formulas with variables. Sometimes the variables are all bound resulting in a sentence (Definition 2.2.14), but other times the formula will have free occurrences. We need additional machinery to handle this. Throughout this chapter, let ๐ be a first-order alphabet and ๐ฒ its set of theory symbols. We start with the fundamental definition (compare Definition 4.1.1). DEFINITION 7.1.1 The pair ๐ = (๐ด, ๐) is an ๐ฆ-structure if ๐ด ≠ ∅ and ๐ is a function with domain ๐ฒ such that โ ๐(๐) is an element of ๐ด for every constant ๐ ∈ ๐ฒ, โ ๐(๐ ) is an ๐-ary relation on ๐ด for every ๐-ary relation symbol ๐ ∈ ๐ฒ, โ ๐(๐ ) is an ๐-ary function on ๐ด for every ๐-ary function symbol ๐ ∈ ๐ฒ. 333 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 334 Chapter 7 MODELS The set ๐ด is the domain of ๐ and is denoted by dom(๐). The domain of the function ๐ is the signature of the structure. If ๐ฒ = {๐ 0 , ๐ 1 , ๐ 2 , … }, we often write the structure as (๐ด, ๐(๐ 0 ), ๐(๐ 1 ), ๐(๐ 2 ), … ) or (๐ด, ๐ ๐ , ๐ ๐ , ๐ ๐ , … ) 0 1 2 The font used to identify a structure and its function is the traditional one. It is called Fraktur and can be found in the appendix. The purpose of the function ๐ is to associate a symbol with a particular object in the domain of the structure. For this reason, if ๐ is an element of the signature of the structure, the object ๐(๐ ) is called the meaning of ๐ and the symbol ๐ is the name of ๐(๐ ). EXAMPLE 7.1.2 We are familiar with the constant, function, and relation symbols of ๐ญ๐ณ (Example 2.1.4). We define an ๐ญ๐ณ-structure ๐ with domain ๐. To do this, specify the function ๐: ๐(0) = ∅, ๐(1) = {∅}, ๐(+) = {((๐, ๐), ๐ + ๐) : ๐, ๐ ∈ ๐}, ๐(⋅) = {((๐, ๐), ๐ ⋅ ๐) : ๐, ๐ ∈ ๐}. Notice that ∅ is the meaning of 0 and 1 is the name of {∅}. Also, observe that ๐(+) and ๐(⋅) are the addition and multiplication functions on ๐ (Definitions 5.2.15 and 5.2.18). These operations are usually represented by + and ⋅, but these symbols already appear in ๐ญ๐ณ. Therefore, for the structure ๐, 0๐ = ๐(0), 1๐ = ๐(1), +๐ = ๐(+), ⋅๐ = ๐(⋅), so ๐ = (๐, ๐) is an ๐ญ๐ณ-structure with signature {0, 1, +, ⋅} that can be written as (๐, ∅, {∅}, {((๐, ๐), ๐ + ๐) : ๐, ๐ ∈ ๐}, {((๐, ๐), ๐ ⋅ ๐) : ๐, ๐ ∈ ๐}) or, more compactly, (๐, 0๐ , 1๐ , +๐ , ⋅๐ ). Section 7.1 FIRST-ORDER SEMANTICS 335 EXAMPLE 7.1.3 When the symbols of a structure’s signature are not needed to represent the meanings of the symbols, the notation of Example 7.1.2 is not needed. This is common for ๐ฆ๐ฑ-structures. For example, if ๐(๐) = 0 and ๐(โฆ) = +, then (โค, ๐) = (โค, 0, +) is a ๐ฆ๐ฑ-structure with signature {๐, โฆ} (Example 2.1.5), and if ๐ (๐) = 1 and ๐ (โฆ) = ⋅, then (โ โงต {0}, ๐ ) = (โ โงต {0}, 1, ⋅) is also a ๐ฆ๐ฑ-structure with signature {๐, โฆ}. Satisfaction The purpose of a structure is to serve as a universe for a given language. Recall that the terms of a language represent objects and the formulas of a language describe the properties of objects (Figure 2.3). With this in mind, we now make sense of the terms and formulas within structures. The first step in doing this is to define how to give meaning to terms. This is done so that the terms represent elements of the domain of a structure. The second step is to develop a method by which it can be determined which formulas hold true in a structure and which do not. We begin by defining the interpretation of terms. DEFINITION 7.1.4 Let ๐ = (๐ด, ๐) be an ๐ฒ-structure. Define an ๐ฆ-interpretation of ๐ to be a function ๐ผ : ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ ) → ๐ด that has the following properties: โ If ๐ฅ is a variable symbol, then ๐ผ(๐ฅ) ∈ ๐ด, thus assigning a value to ๐ฅ. โ If ๐ is a constant symbol, ๐ผ(๐) = ๐(๐). โ If ๐ is a function symbol and ๐ก0 , ๐ก1 , … , ๐ก๐−1 are ๐ฒ-terms, ๐ผ(๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )) = ๐(๐ )(๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 )). Definition 7.1.4 is an example of a definition by induction on terms. This means that first the definition was made for variable and constant symbols, and then assuming that it was made for terms in general, the definition is made for functions applied to terms. Induction on terms simply follows Definition 2.1.7. A proof done by induction on terms is one that uses the same process to prove a result about terms. EXAMPLE 7.1.5 Let (๐, 0๐ , 1๐ , +๐ , ⋅๐ ) be the ๐ญ๐ณ-structure from Example 7.1.2. Let ๐ผ be an ๐ญ๐ณ-interpretation such that 336 Chapter 7 MODELS ๐ผ(๐ฅ) = 5, ๐ผ(๐ฆ) = 7, ๐ผ(0) = 0๐ , ๐ผ(1) = 1๐ . Then, ๐ผ(๐ฅ + ๐ฆ) = 12 is the interpretation of ๐ฅ + ๐ฆ because ๐ผ(๐ฅ + ๐ฆ) = ๐(+)(๐ผ(๐ฅ), ๐ผ(๐ฆ)) = ๐(+)(5, 7) = 5 +๐ 7, and ๐ผ(0 ⋅ 1) = 0๐ because ๐ผ(0 ⋅ 1) = ๐(⋅)(๐ผ(0), ๐ผ(1)) = ๐(⋅)(0, 1) = 0๐ ⋅๐ 1๐ . We are now ready for the main definition. It describes what it means for a formula to be interpreted as true. The definition, which is foundational to model theory, is generally attributed to Alfred Tarski. This contribution is found in two papers, “Der wahrheitsbegriff in den formalisierten sprachen” (1935) and “Arithmetical extensions of relational systems” with Robert Vaught (1957). DEFINITION 7.1.6 Let ๐ = (๐ด, ๐) be an ๐ฒ-structure and ๐ผ be an ๐ฒ-interpretation of ๐. Assume that ๐ and ๐ are ๐ฒ-formulas and ๐ก0 , ๐ก1 , … , ๐ก๐−1 are ๐ฒ-terms. Define โจ as follows: โ ๐ โจ ๐ก0 = ๐ก1 [๐ผ] ⇔ ๐ผ(๐ก0 ) = ๐ผ(๐ก1 ). โ ๐ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) [๐ผ] ⇔ (๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 )) ∈ ๐(๐ ). โ ๐ โจ ¬๐ [๐ผ] ⇔ (not ๐ โจ ๐ [๐ผ]). โ ๐ โจ ๐ → ๐ [๐ผ] ⇔ (if ๐ โจ ๐ [๐ผ] then ๐ โจ ๐ [๐ผ]). โ ๐ โจ ∃๐ฅ๐ [๐ผ] ⇔ (๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด), where for every ๐ข ∈ ๐ด, the function ๐ผ๐ฅ๐ข is the ๐ฒ-interpretation of ๐ such that if ๐ฆ is a variable symbol, { ๐ข if ๐ฆ = ๐ฅ, ๐ข ๐ผ๐ฅ (๐ฆ) = ๐ผ(๐ฆ) if ๐ฆ ≠ ๐ฅ, and if ๐ is a constant symbol, ๐ผ๐ฅ๐ (๐) = ๐ผ(๐). Definition 7.1.6 is an example of a definition by induction on formulas. This means that first the definition was made for the basic formulas involving equality and relation symbols, and then assuming that it was made for formulas in general, the definition is made for formulas written using connectives or quantifiers. Induction on formulas simply follows Definition 2.1.9. A proof done by induction on formulas is one that uses the same process to prove a result about terms. Definition 7.1.6 can be extended to the other connectives and the existential quantifier using the next theorem. Section 7.1 FIRST-ORDER SEMANTICS 337 THEOREM 7.1.7 Let ๐ = (๐ด, ๐) be an ๐ฒ-structure and ๐ผ be an ๐ฒ-interpretation of ๐. Assume that ๐ and ๐ are ๐ฒ-formulas. โ ๐ โจ ๐ ∧ ๐ [๐ผ] ⇔ (๐ โจ ๐ [๐ผ] and ๐ โจ ๐ [๐ผ]). โ ๐ โจ ๐ ∨ ๐ [๐ผ] ⇔ (๐ โจ ๐ [๐ผ] or ๐ โจ ๐ [๐ผ]). โ ๐ โจ ๐ [๐ผ] ↔ ๐ ⇔ (๐ โจ ๐ [๐ผ] if and only if ๐ โจ ๐ [๐ผ]). โ ๐ โจ ∀๐ฅ๐ [๐ผ] ⇔ (๐ โจ ๐ [๐ผ๐ฅ๐ ] for all ๐ ∈ ๐ด). PROOF Since ๐ ∨ ๐ ⇔ ¬๐ → ๐ (page 59), we have the following: ๐ โจ ๐ ∨ ๐ [๐ผ] ⇔ ๐ โจ ¬๐ → ๐ [๐ผ] ⇔ if ๐ โจ ¬๐ [๐ผ], then ๐ โจ ๐ [๐ผ] ⇔ if not ๐ โจ ๐ [๐ผ], then ๐ โจ ๐ [๐ผ] ⇔ ๐ โจ ๐ [๐ผ] or ๐ โจ ๐ [๐ผ]. Since ∀๐ฅ๐ ⇔ ¬∃๐ฅ¬๐ by QN, we have the following: ๐ โจ ∀๐ฅ๐ [๐ผ] ⇔ ๐ โจ ¬∃๐ฅ¬๐ [๐ผ] ⇔ not ๐ โจ ∃๐ฅ¬๐ [๐ผ] ⇔ not (๐ โจ ¬๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด) ⇔ not (not ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด) ⇔ ๐ โจ ๐ [๐ผ๐ฅ๐ ] for all ๐ ∈ ๐ด. The remaining parts are left to Exercise 1. EXAMPLE 7.1.8 Let the ๐ฒ๐ณ-structure ๐ be defined as (๐, ∈). Since ๐ฒ๐ณ has only relation symbols, ๐ is called a purely relational structure. Although we usually make a notational distinction between a name and its meaning, we do not do this with the ∈ symbol. Likewise, the equality symbol = can be used in a formula and during any interpretation of the formula. To see how this works, define ๐ := ∀๐ฅ∀๐ฆ∀๐ง(๐ฅ ∈ ๐ฆ ∧ ๐ฆ ∈ ๐ง → ๐ฅ ∈ ๐ง). We show that ๐ โจ ๐ for any ๐ฒ๐ณ-interpretation ๐ผ. To determine what needs to be done to accomplish this, we work backwards using Definition 7.1.6 and Theorem 7.1.7. Let ๐ผ be an ๐ฒ๐ณ-interpretation and assume that ๐ โจ ∀๐ฅ∀๐ฆ∀๐ง(๐ฅ ∈ ๐ฆ ∧ ๐ฆ ∈ ๐ง → ๐ฅ ∈ ๐ง) [๐ผ]. This implies that 338 Chapter 7 MODELS ๐ โจ ๐ฅ ∈ ๐ฆ ∧ ๐ฆ ∈ ๐ง → ๐ฅ ∈ ๐ง [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ] for all ๐, ๐, ๐ ∈ ๐. How this formula is interpreted is still based on the order of connectives found in Definition 1.1.5. Therefore, since the conjunction has precedence, we apply Definition 7.1.6 to find that for all ๐, ๐, ๐ ∈ ๐, if ๐ โจ ๐ฅ ∈ ๐ฆ ∧ ๐ฆ ∈ ๐ง [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ], then ๐ โจ ๐ฅ ∈ ๐ง [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ]. Hence, by Theorem 7.1.7, for all ๐, ๐, ๐ ∈ ๐, if ๐ โจ ๐ฅ ∈ ๐ฆ [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ] and ๐ โจ ๐ฆ ∈ ๐ง [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ], then ๐ โจ ๐ฅ ∈ ๐ง [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ]. We now apply the interpretation using Definition 7.1.4 to find that for all ๐, ๐, ๐ ∈ ๐, if ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฅ) ∈ ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฆ) and ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฆ) ∈ ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ง), then ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฅ) ∈ ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ง). Therefore, for all ๐, ๐, ๐ ∈ ๐, if ๐ ∈ ๐ and ๐ ∈ ๐, then ๐ ∈ ๐, which follows by Theorem 5.2.8. This means that we can back through the steps to prove that ๐ โจ ๐ [๐ผ]. Typically, formulas cannot be understood as true or false on their own. They have to be examined against a given universe. The structure is that universe, and the examining is done by the interpretation. For this reason, the structure-interpretation pair forms the basis for our work in first-order logic, so we give it a name. DEFINITION 7.1.9 Let ๐ผ be an ๐ฒ-interpretation of the ๐ฒ-structure ๐. Let ๐ be an ๐ฒ-formula. The pair (๐, ๐ผ) is called an ๐ฆ-model. Additionally, if ๐ โจ ๐ [๐ผ], then (๐, ๐ผ) is a model of ๐ and satisfies ๐. If (๐, ๐ผ) is not a model of ๐, write ๐ ฬธโจ ๐ [๐ผ]. EXAMPLE 7.1.10 Let ๐ be the ๐ญ๐ณ-structure with domain ๐ and signature {0, 1, +, ⋅} from Example 7.1.2. Let ๐ผ be an ๐ญ๐ณ-interpretation of ๐ such that ๐ผ(๐ฅ) = 0๐ , ๐ผ(๐ฅ๐ ) = ๐ +๐ 1๐ , and ๐ผ(1) = 1๐ . โ (๐, ๐ผ) is a model of ๐ฅ4 + ๐ฅ7 = ๐ฅ7 + ๐ฅ4 because by Theorem 5.2.17, ๐ผ(๐ฅ4 + ๐ฅ7 ) = ๐ผ(๐ฅ4 ) +๐ ๐ผ(๐ฅ7 ) = 5 +๐ 8 = 8 +๐ 5 = ๐ผ(๐ฅ7 ) +๐ ๐ผ(๐ฅ4 ) = ๐ผ(๐ฅ7 + ๐ฅ4 ). Section 7.1 FIRST-ORDER SEMANTICS 339 โ (๐, ๐ผ) is a model of ∀๐ฅ(๐ฅ = ๐ฅ2 → ๐ฅ + 1 = ๐ฅ2 + 1). To see this, take ๐ ∈ ๐ and assume that ๐ โจ ๐ฅ = ๐ฅ2 [๐ผ๐ฅ๐ ]. This implies that ๐ = ๐ผ(๐ฅ2 ). Hence, ๐ +๐ 0๐ = ๐ผ(๐ฅ2 ) +๐ 0๐ , ๐ +๐ ๐ผ(1) = ๐ผ(๐ฅ2 ) +๐ ๐ผ(1), ๐ผ๐ฅ๐ (๐ฅ) +๐ ๐ผ๐ฅ๐ (1) = ๐ผ๐ฅ๐ (๐ฅ2 ) +๐ ๐ผ๐ฅ๐ (1), ๐ผ๐ฅ๐ (๐ฅ + 1) = ๐ผ๐ฅ๐ (๐ฅ2 + 1). Therefore, ๐ โจ ๐ฅ + 1 = ๐ฅ2 + 1 [๐ผ๐ฅ๐ ]. We conclude that for all ๐ ∈ ๐, if ๐ โจ ๐ฅ = ๐ฅ2 [๐ผ๐ฅ๐ ], then ๐ โจ ๐ฅ + 1 = ๐ฅ2 + 1 [๐ผ๐ฅ๐ ], so for all ๐ ∈ ๐, ๐ โจ ๐ฅ = ๐ฅ2 → ๐ฅ + 1 = ๐ฅ2 + 1 [๐ผ๐ฅ๐ ]. This implies that ๐ โจ ∀๐ฅ(๐ฅ = ๐ฅ2 → ๐ฅ + 1 = ๐ฅ2 + 1) [๐ผ]. Definition 7.1.9 can be generalized to sets of formulas. DEFINITION 7.1.11 If (๐, ๐ผ) is an ๐ฒ-model and โฑ is a set of ๐ฒ-formulas, ๐ โจ โฑ [๐ผ] if and only if ๐ โจ ๐ [๐ผ] for all ๐ ∈ โฑ . If ๐ โจ โฑ [๐ผ], then (๐, ๐ผ) is a model of โฑ and satisfies โฑ . For example, using the ๐ญ๐ณ-model of Example 7.1.10, we see that ๐ โจ {๐ฅ4 + ๐ฅ7 = ๐ฅ7 + ๐ฅ4 , ∀๐ฅ(๐ฅ = ๐ฅ2 → ๐ฅ + 1 = ๐ฅ2 + 1)} [๐ผ], (7.1) so (๐, ๐ผ) is a model of {๐ฅ4 + ๐ฅ7 = ๐ฅ7 + ๐ฅ4 , ∀๐ฅ(๐ฅ = ๐ฅ2 → ๐ฅ + 1 = ๐ฅ2 + 1)}. Here is a more involved example. EXAMPLE 7.1.12 Let ๐ be the ๐ฆ๐ฑ-structure (โค, ๐) so that ๐ is defined by ๐(๐) = 0 and ๐(โฆ) = +. Let the interpretation ๐ฝ have the property that { ๐โ2 if ๐ is even, ๐ฝ (๐ฅ๐ ) = (๐ + 1)โ2 if ๐ is odd. โ ๐ โจ ∃๐ฅ(๐ฅ7 โฆ ๐ฅ = ๐) [๐ฝ ]. This is because ๐ โจ ๐ฅ7 โฆ ๐ฅ = ๐ [๐ฝ๐ฅ−4 ], 340 Chapter 7 MODELS and this holds since 4 + (−4) = 0. โ ๐ โจ ∀๐ฅ∃๐ฆ(๐ฅ โฆ ๐ฆ = ๐) [๐ฝ ]. To prove this, take ๐ ∈ โค. Then, ๐ โจ ∃๐ฆ(๐ฅ โฆ ๐ฆ = ๐) [๐ฝ๐ฅ๐ ] because ๐ โจ ๐ฅ โฆ ๐ฆ = ๐ [(๐ฝ๐ฅ๐ )−๐ ๐ฆ ]. This last satisfaction holds because −๐ ∈ โค and ๐ + −๐ = 0. โ The previous two satisfactions demonstrate that ๐ โจ {∃๐ฅ(๐ฅ7 โฆ ๐ฅ = ๐), ∀๐ฅ∃๐ฆ(๐ฅ โฆ ๐ฆ = ๐)} [๐ฝ ]. (7.2) The previous work with models leads to the following definition. DEFINITION 7.1.13 The ๐ฒ-formula ๐ is ๐ฆ-satisfiable (denoted by ๐ฒ๐บ๐๐ฆ ๐) if there is an ๐ฒ-structure ๐ and ๐ฒ-interpretation ๐ผ such that (๐, ๐ผ) is a model for ๐. The set of ๐ฒ-formulas โฑ is ๐ฆ-satisfiable (denoted by ๐ฒ๐บ๐๐ฆ โฑ ) if there is a model for โฑ . By (7.1), we have that ๐ฒ๐บ๐๐ก๐ง {๐ฅ4 + ๐ฅ7 = ๐ฅ7 + ๐ฅ4 , ∀๐ฅ[(๐ฅ + ๐ฅ5 ) + ๐ฅ8 = ๐ฅ + (๐ฅ5 + ๐ฅ8 )]}, and by (7.2), we have that ๐ฒ๐บ๐๐๐ฅ {∃๐ฅ(๐ฅ7 โฆ ๐ฅ = ๐), ∀๐ฅ∃๐ฆ(๐ฅ โฆ ๐ฆ = ๐)}. Groups As noted in Example 2.1.5, the ๐ฆ๐ฑ symbols are intended for the study of a set with a single binary operation defined on it. The basic example is โค with addition. Although other properties will be added later, the language developed for this is designed to handle just the basic properties of this pair. Namely, the operation should be associative, the set should have an identity, and every element should have an inverse. This means that the axioms that will define this theory will be ๐ฆ๐ฑ-sentences, and we need only three. AXIOMS 7.1.14 [Group] โ G1. ∀๐ฅ∀๐ฆ∀๐ง [๐ฅ โฆ (๐ฆ โฆ ๐ง) = (๐ฅ โฆ ๐ฆ) โฆ ๐ง]. โ G2. ∀๐ฅ(๐ โฆ ๐ฅ = ๐ฅ ∧ ๐ฅ โฆ ๐ = ๐ฅ). โ G3. ∀๐ฅ∃๐ฆ(๐ฅ โฆ ๐ฆ = ๐ ∧ ๐ฆ โฆ ๐ฅ = ๐). Section 7.1 FIRST-ORDER SEMANTICS 341 EXAMPLE 7.1.15 Define a ๐ฆ๐ฑ-structure ๐ = (๐บ, ๐ค) by letting the domain ๐บ = {0}, ๐ค(๐) = 0, and ๐ค(โฆ) = +. The binary operation + is addition of integers and 0 ∈ โค (Section 5.3). Let ๐ผ be an interpretation of ๐. We show that ๐ โจ {G1, G2, G3} [๐ผ]. โ Let ๐, ๐, ๐ ∈ ๐บ. Since 0 is the only element of ๐บ, observe that by Definition 7.1.4, ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฅ โฆ [๐ฆ โฆ ๐ง]) = ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฅ) + ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฆ โฆ ๐ง) (7.3) = ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฅ) + [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฆ) + ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ง)] (7.4) = 0 + (0 + 0) =0+0+0 (7.5) (7.6) = [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฅ) + ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฆ)] + ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ง) (7.7) = = ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ฅ โฆ ๐ฆ) + ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง (๐ง) ((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ([๐ฅ โฆ ๐ฆ] โฆ ๐ง). (7.8) (7.9) We see that (7.6) follows from (7.5) by Theorem 5.3.5. Therefore, by Theorem 7.1.7, ๐ โจ ๐ฅ โฆ (๐ฆ โฆ ๐ง) = (๐ฅ โฆ ๐ฆ) โฆ ๐ง [((๐ผ๐ฅ๐ )๐๐ฆ )๐๐ง ] for all ๐, ๐, ๐ ∈ ๐บ, which implies that ๐ โจ ∀๐ง[๐ฅ โฆ (๐ฆ โฆ ๐ง) = (๐ฅ โฆ ๐ฆ) โฆ ๐ง] [(๐ผ๐ฅ๐ )๐๐ฆ ] for all ๐, ๐ ∈ ๐บ. Therefore, ๐ โจ ∀๐ฆ∀๐ง[๐ฅ โฆ (๐ฆ โฆ ๐ง) = (๐ฅ โฆ ๐ฆ) โฆ ๐ง] [๐ผ๐ฅ๐ ] for all ๐ ∈ ๐บ, so ๐ โจ ∀๐ฅ∀๐ฆ∀๐ง[๐ฅ โฆ (๐ฆ โฆ ๐ง) = (๐ฅ โฆ ๐ฆ) โฆ ๐ง] [๐ผ]. โ Let ๐ ∈ ๐บ. Again, by Definition 7.1.4, ๐ผ๐ฅ๐ (๐ โฆ ๐ฅ) = ๐ผ๐ฅ๐ (๐) + ๐ผ๐ฅ๐ (๐ฅ) = 0 + 0 = 0 = ๐ผ๐ฅ๐ (๐ฅ), so by Definition 7.1.6, ๐ โจ ๐ โฆ ๐ฅ = ๐ฅ [๐ผ๐ฅ๐ ]. Also, so ๐ผ๐ฅ๐ (๐ฅ โฆ ๐) = ๐ผ๐ฅ๐ (๐ฅ) + ๐ผ๐ฅ๐ (๐) = 0 + 0 = 0 = ๐ผ๐ฅ๐ (๐ฅ), ๐ โจ ๐ฅ โฆ ๐ = ๐ฅ [๐ผ๐ฅ๐ ]. 342 Chapter 7 MODELS Therefore, by Theorem 7.1.7, ๐ โจ ๐ โฆ ๐ฅ = ๐ฅ ∧ ๐ฅ โฆ ๐ = ๐ฅ [๐ผ๐ฅ๐ ]. Since ๐ was arbitrarily chosen, ๐ โจ ∀๐ฅ(๐ โฆ ๐ฅ = ๐ฅ ∧ ๐ฅ โฆ ๐ = ๐ฅ) [๐ผ]. โ Take ๐ ∈ ๐บ. Because (๐ผ๐ฅ๐ )0๐ฆ (๐ฅ โฆ ๐ฆ) = (๐ผ๐ฅ๐ )0๐ฆ (๐ฅ) + (๐ผ๐ฅ๐ )0๐ฆ (๐ฆ) = 0 + 0 = 0 = (๐ผ๐ฅ๐ )0๐ฆ (๐), we have ๐ โจ ๐ฅ โฆ ๐ฆ = ๐ [(๐ผ๐ฅ๐ )0๐ฆ ]. Similarly, ๐ โจ ๐ฆ โฆ ๐ฅ = ๐ [(๐ผ๐ฅ๐ )0๐ฆ ], so ๐ โจ ๐ฅ โฆ ๐ฆ = ๐ ∧ ๐ฆ โฆ ๐ฅ = ๐ [(๐ผ๐ฅ๐ )0๐ฆ ]. Therefore, since 0 ∈ ๐บ, there exists ๐ ∈ ๐บ such that ๐ โจ ๐ฅ โฆ ๐ฆ = ๐ ∧ ๐ฆ โฆ ๐ฅ = ๐ [(๐ผ๐ฅ๐ )๐๐ฆ ], and since ๐ was arbitrarily chosen, we have that for all ๐ ∈ ๐บ, there exists ๐ ∈ ๐บ such that ๐ โจ ๐ฅ โฆ ๐ฆ = ๐ ∧ ๐ฆ โฆ ๐ฅ = ๐ [(๐ผ๐ฅ๐ )๐๐ฆ ]. Then, by Definition 7.1.6, we conclude that for all ๐ ∈ ๐บ, ๐ โจ ∃๐ฆ(๐ฅ โฆ ๐ฆ = ๐ ∧ ๐ฆ โฆ ๐ฅ = ๐) [๐ผ๐ฅ๐ ], so by Theorem 7.1.7, ๐ โจ ∀๐ฅ∃๐ฆ(๐ฅ โฆ ๐ฆ = ๐ ∧ ๐ฆ โฆ ๐ฅ = ๐) [๐ผ]. Based on Example 7.1.15, we conclude that {G1, G2, G3} is ๐ฆ๐ฑ-satisfiable (Definition 7.1.13). That is, there exists a model in which G1, G2, and G3 are interpreted as true. The name of the model was first used by Évariste Galois in the early 1830s. DEFINITION 7.1.16 A ๐ฆ๐ฑ-structure that models the group axioms is called a group. A group with a commutative binary operation, one that satisfies ∀๐ฅ∀๐ฆ(๐ฅ โฆ ๐ฆ = ๐ฆ โฆ ๐ฅ), is known as an abelian group. It is named after the Norwegian mathematician Niels Abel. Using the ๐ฆ๐ฑ-structure ๐ and interpretation ๐ผ of Example 7.1.15, the set of ๐ฆ๐ฑ-sentences {G1, G2, G3, ∀๐ฅ∀๐ฆ(๐ฅ โฆ ๐ฆ = ๐ฆ โฆ ๐ฅ)} is shown to be ๐ฆ๐ฑ-satisfiable. Section 7.1 FIRST-ORDER SEMANTICS 343 EXAMPLE 7.1.17 Using definitions of Examples 4.2.6 and 4.2.9 for โค, define โค๐ = {[๐]๐ : ๐ ∈ โค}, and on this set, specify + by [๐]๐ + [๐]๐ = [๐ + ๐]๐ . As we have seen, the meaning of the symbol + is determined by context. The + on the left is the new definition, but the + on the right is standard addition (Definition 5.3.4). With this definition, a generalization of Example 4.4.30 shows that + is a binary operation. We check that ๐ = (โค๐ , [0]๐ , +) is a group. Let ๐ผ be a ๐ฆ๐ฑ-interpretation of ๐ such that ๐ผ(๐) = [0]๐ . We have three axioms to check. โ Let [๐]๐ , [๐]๐ , [๐]๐ ∈ โค๐ , where ๐, ๐, ๐ ∈ โค. Observe that by Definition 7.1.4, [๐] [๐] [๐] ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฅ โฆ [๐ฆ โฆ ๐ง]) [๐] [๐] [๐] [๐] [๐] [๐] (7.10) = ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฅ) + ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฆ โฆ ๐ง) [๐] [๐] [๐] [๐] [๐] [๐] [๐] [๐] [๐] = ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฅ) + [((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฆ) + ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ง)] (7.11) = [๐]๐ + ([๐]๐ + [๐]๐ ) = [๐]๐ + ([๐ + ๐]๐ ) = [๐ + (๐ + ๐)]๐ = [๐ + ๐ + ๐]๐ = [๐ + ๐]๐ + [๐]๐ = ([๐]๐ + [๐]๐ ) + [๐]๐ (7.12) (7.13) [๐] [๐] [๐] [๐] [๐] [๐] [๐] [๐] [๐] = [((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฅ) + ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฆ)] + ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ง) = = [๐] [๐] [๐] [๐] [๐] [๐] ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฅ โฆ ๐ฆ) + ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ง) [๐] [๐] [๐] ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ ([๐ฅ โฆ ๐ฆ] โฆ ๐ง). (7.14) (7.15) (7.16) Therefore, [๐] [๐] [๐] [๐] [๐] [๐] ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ (๐ฅ โฆ [๐ฆ โฆ ๐ง]) = ((๐ผ๐ฅ ๐ )๐ฆ ๐ )๐ง ๐ ([๐ฅ โฆ ๐ฆ] โฆ ๐ง) for all [๐]๐ , [๐]๐ , [๐]๐ ∈ โค๐ , which implies that ๐ โจ ∀๐ฅ∀๐ฆ∀๐ง [๐ฅ โฆ (๐ฆ โฆ ๐ง) = (๐ฅ โฆ ๐ฆ) โฆ ๐ง] [๐ผ]. Notice that (7.13) follows from (7.12) by Theorem 5.3.5. More importantly, notice that (7.10), (7.11), (7.14), (7.15), and (7.16) mimic (7.3), (7.4), (7.7), (7.8), and (7.9) of Example 7.1.15. The other equalities are 344 Chapter 7 MODELS ∗ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ Figure 7.1 ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ The Klein-4 group. specific to this example. We conclude that in order to prove that ๐ is a model of G1, we only need to show the work specific to the structure ๐. We use this to prove ๐ โจ {G2, G3} [๐ผ]. โ Let ๐ ∈ โค. Then, [0]๐ + [๐]๐ = [0 + ๐]๐ = [๐]๐ , and [๐]๐ + [0]๐ = [๐ + 0]๐ = [๐]๐ . Therefore, ๐ โจ G2 [๐ผ]. โ Let ๐ ∈ โค. Then, [๐]๐ + [−๐]๐ = [๐ + (−๐)]๐ = [0]๐ and [−๐]๐ + [๐]๐ = [(−๐) + ๐]๐ = [0]๐ . Hence, ๐ โจ G3 [๐ผ]. Therefore, ๐ is a group. Moreover, because for all ๐, ๐ ∈ โค, [๐]๐ + [๐]๐ = [๐ + ๐]๐ = [๐ + ๐]๐ = [๐]๐ + [๐]๐ , ๐ is an abelian group. EXAMPLE 7.1.18 The structures (๐+ , ∅, +), (โค, 1, ⋅), and (โ, 1, ⋅) are not groups, but (โค, 0, +), (โ, 0, +), (โ, 0, +), (โ โงต {0}, 1, ⋅), and (โ โงต {0}, 1, ⋅) are. These are examples of infinite groups. When the group’s set is finite, we use the term order to refer to its cardinality. The group ๐ = ({๐}, ๐, ∗), where ∗ = {((๐, ๐), ๐)}, is the only group of order 1 (Example 7.1.15). This means that any other group with one element, such as ({0}, 0, +) or ({1}, 1, ⋅), has the same structure as ๐. We say that these three groups are isomorphic. Any two groups of order 2 will be isomorphic, and any groups of order 3 will also be isomorphic. There are essentially two groups of order 4, one being the Klein-4 group (Figure 7.1) and the other being the group of Example 7.1.17 when ๐ = 4. Section 7.1 FIRST-ORDER SEMANTICS 345 EXAMPLE 7.1.19 All of the examples of groups given so far have been abelian. Here is one that is not. For all ๐ ∈ โค+ , define M๐ (โ) as the set of ๐ × ๐ matrices with real entries. In other words, each matrix has ๐ rows, ๐ columns, and looks like โก ๐1,1 โข โฎ โข โฃ๐๐,1 ๐1,๐ โค โฎ โฅ, โฅ ๐๐,๐ โฆ ··· โฑ ··· where ๐๐,๐ ∈ โ for ๐ = 1, … , ๐ and ๐ = 1, … , ๐. As an example, โก1 โข4 โข โฃ7 3โค 6โฅ ∈ M3 (โ). โฅ 9โฆ 2 5 8 Define matrix multiplication for 2 × 2 matrices by ] [ ] [ [ ๐ ๐ + ๐1,2 ๐2,1 ๐ ๐1,2 ๐1,1 ๐1,2 = 1,1 1,1 ⋅ 1,1 ๐2,1 ๐1,1 + ๐2,2 ๐2,1 ๐2,1 ๐2,2 ๐2,1 ๐2,2 For example, [ 1 3 ] [ 2 0 ⋅ 4 2 ] [ −1 4 = 1 8 This multiplication is not commutative because [ ] [ ] [ 0 −1 1 2 −3 ⋅ = 2 1 3 4 5 ] ๐1,1 ๐1,2 + ๐1,2 ๐2,2 . ๐2,1 ๐1,2 + ๐2,2 ๐2,2 ] 1 . 1 (7.17) ] −4 , 8 (7.18) but it is associative [Exercise 12(a)]. The identity matrix for M2 (โ) is [ ] 1 0 I2 = . 0 1 Notice that I2 is the multiplicative identity. If ๐ด ∈ M๐ (โ), then ๐ด is invertible if there exists ๐ต ∈ M๐ (โ) such that ๐ด๐ต = ๐ต๐ด = I๐ . For ๐ = 2, [ ] 2 1 0 −3 is invertible, but [ 1 5 ] 0 0 is not. All of this can be generalized to any ๐ × ๐ matrix. Finally, define M∗๐ (โ) = {๐ด ∈ M๐ (โ) : ๐ด is invertible}. Let GL(๐, โ) denote the group (M∗๐ (โ), I๐ , ⋅). This is called the general linear group of degree ๐. 346 Chapter 7 MODELS Consequence We now generalize the notion of logical implication (Definition 1.2.2) to the theory of models. DEFINITION 7.1.20 Let โฑ be a set of ๐ฒ-formulas. An ๐ฒ-formula ๐ is an ๐ฆ-consequence of โฑ (written as โฑ โจ ๐) when ๐ โจ โฑ [๐ผ] implies ๐ โจ ๐ [๐ผ] for any ๐ฒ-model (๐, ๐ผ). If {๐} โจ ๐, simply write ๐ โจ ๐. Definition 7.1.20 implies that if the ๐ฒ-formula ๐ is not an ๐ฒ-consequence of โฑ , there exists an ๐ฒ-structure ๐ with interpretation ๐ผ such that ๐ โจ โฑ [๐ผ] but ๐ ฬธโจ ๐ [๐ผ]. For example, define ๐ := ∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = ๐ฆ + ๐ฅ). We know that GL(๐, โ) is a group, but under any ๐ฆ๐ฑ-interpretation ๐ผ of GL(๐, โ), GL(๐, โ) ฬธโจ ๐ [๐ผ] because of (7.17) and (7.18). Therefore, ๐ is not an ๐ฒ-consequence of the group axioms (7.1.14). In other words, not all groups are abelian groups. EXAMPLE 7.1.21 Suppose that โฑ = {∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = ๐ฆ + ๐ฅ), ∀๐ฅ∃๐ฆ(๐ฅ + ๐ฆ = 0)}. Let (๐, ๐ผ) be an ๐ญ๐ณ-model of โฑ . Let ๐ด = dom(๐). Then, ๐ โจ ∀๐ฅ∃๐ฆ(๐ฅ + ๐ฆ = 0) [๐ผ], so by Theorem 7.1.7, for all ๐ข ∈ ๐ด, ๐ โจ ∃๐ฆ(๐ฅ + ๐ฆ = 0) [๐ผ๐ฅ๐ข ], which by Definition 7.1.6 implies that there exists ๐ฃ ∈ ๐ด such that for all ๐ข ∈ ๐ด, ๐ โจ ๐ฅ + ๐ฆ = 0 [(๐ผ๐ฅ๐ข )๐ฃ๐ฆ ]. Hence, for arbitrary ๐ (UI) and particular ๐ (EI) in ๐ด, (๐ผ๐ฅ๐ )๐๐ฆ (+)((๐ผ๐ฅ๐ )๐๐ฆ (๐ฅ), (๐ผ๐ฅ๐ )๐๐ฆ (๐ฆ)) = (๐ผ๐ฅ๐ )๐๐ฆ (0). However, since ๐ โจ ∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = ๐ฆ + ๐ฅ) [๐ผ], we find that (๐ผ๐ฅ๐ )๐๐ฆ (+)((๐ผ๐ฅ๐ )๐๐ฆ (๐ฅ), (๐ผ๐ฅ๐ )๐๐ฆ (๐ฆ)) = (๐ผ๐ฅ๐ )๐๐ฆ (+)((๐ผ๐ฅ๐ )๐๐ฆ (๐ฆ), (๐ผ๐ฅ๐ )๐๐ฆ (๐ฅ)), Section 7.1 FIRST-ORDER SEMANTICS 347 so, (๐ผ๐ฅ๐ )๐๐ฆ (+)((๐ผ๐ฅ๐ )๐๐ฆ (๐ฆ), (๐ผ๐ฅ๐ )๐๐ฆ (๐ฅ)) = (๐ผ๐ฅ๐ )๐๐ฆ (0). Therefore, by EG and UG, there exists ๐ฃ ∈ ๐ด such that for all ๐ข ∈ ๐ด, ๐ผ(+)(๐ผ(๐ฃ), ๐ผ(๐ข)) = ๐ผ(0), and we can reverse the steps above to find that ๐ โจ ∃๐ฅ∀๐ฆ(๐ฆ + ๐ฅ = 0) [๐ผ], so we conclude that โฑ โจ ∃๐ฅ∀๐ฆ(๐ฆ + ๐ฅ = 0). We say that an ๐ฒ-sentence ๐ is valid if ∅ โจ ๐ and write โจ ๐. This means that if an ๐ฒsentence ๐ is valid, every ๐ฒ-structure is a model of ๐ since every ๐ฒ-structure is a model of the empty set using any ๐ฒ-interpretation (Exercise 7). For example, ∀๐ฅ(๐ฅ = ๐ฅ) and ๐ ∨ ¬๐ are valid. We now connect the notions of consequence and satisfaction. THEOREM 7.1.22 Let โฑ be a set of ๐ฒ-formulas and ๐ be an ๐ฒ-formula. Then, โฑ โจ ๐ if and only if not ๐ฒ๐บ๐๐ฆ โฑ ∪ {¬๐}. PROOF The following are equivalent: โ โฑ โจ ๐. โ For every ๐ฒ-model (๐, ๐ผ), if ๐ โจ โฑ [๐ผ], then ๐ โจ ๐ [๐ผ]. โ There does not exist an ๐ฒ-model (๐, ๐ผ) so that ๐ โจ โฑ [๐ผ] and ๐ ฬธโจ ๐ [๐ผ]. โ There does not exist an ๐ฒ-model (๐, ๐ผ) so that ๐ โจ โฑ [๐ผ] and ๐ โจ ¬๐ [๐ผ]. โ There does not exist an ๐ฒ-model (๐, ๐ผ) such that ๐ โจ โฑ ∪ {¬๐} [๐ผ]. โ Not ๐ฒ๐บ๐๐ฆ โฑ ∪ {¬๐}. EXAMPLE 7.1.23 Since Zorn’s lemma was proved from the axioms of ๐ญ๐๐ (Theorem 5.1.13), as in Example 7.1.17, we can use the work specific to the proof of Zorn’s lemma to conclude that ๐ญ๐๐ โจ Zorn’s lemma, so by Theorem 7.1.22, there is no ๐ฒ๐ณ-model that satisfies ๐ญ๐๐ and the negation of Zorn’s lemma. 348 Chapter 7 MODELS EXAMPLE 7.1.24 The ๐ฒ-formula ๐ is valid if ¬๐ is not ๐ฒ-satisfiable. To see this, suppose that ๐ is not valid. This means that there is an ๐ฒ-model (๐, ๐ผ) so that ๐ ฬธโจ ๐ [๐ผ]. Hence, by Definition 7.1.6, ๐ โจ ¬๐ [๐ผ]. Therefore, ๐ฒ๐บ๐{¬๐}. That the converse is true is Exercise 19. Compare the next definition with Definition 1.3.1. DEFINITION 7.1.25 Let ๐ and ๐ be ๐ฒ-formulas. Then, ๐ is logically equivalent to ๐ means ๐ โจ ๐ if and only if ๐ โจ ๐. Notice Definition 7.1.25 implies that the formulas ๐ and ๐ are logically equivalent if and only if โจ (๐ ↔ ๐). For example, by De Morgan’s law, ¬(๐ ∧ ๐) is logically equivalent to ¬๐ ∨ ¬๐, and by QN, we conclude that ¬∀๐ฅ๐(๐ฅ) is logically equivalent to ∃๐ฅ¬๐(๐ฅ). Coincidence Let ๐ = (โค, ๐) and ๐ = (โค, ๐) be ๐ฆ๐ฑ-structures such that ๐(๐) = ๐(๐) = 0, ๐(โฆ) = ๐(โฆ) = +. Let ๐ผ be an ๐ฆ๐ฑ-interpretation of ๐ and ๐ฝ be a ๐ฆ๐ฑ-interpretation of ๐ such that ๐ผ(๐ฅ) = ๐ฝ (๐ฅ) = 3. Other assignments of these functions are not identified. Consider the following deduction: −3 + 3 = 0. ๐ + 3 = 0 for some ๐ ∈ โค. ๐ผ๐ฆ๐ (๐ฆ) + ๐ผ๐ฆ๐ (๐ฅ) = ๐ผ๐ฆ๐ (๐) for some ๐ ∈ โค. ๐ผ๐ฆ๐ (๐ฆ โฆ ๐ฅ) = ๐ผ๐ฆ๐ (๐) for some ๐ ∈ โค. ๐ โจ ๐ฆ โฆ ๐ฅ = ๐ [๐ผ๐ฆ๐ ] for some ๐ ∈ โค. Therefore, ๐ โจ ∃๐ฆ(๐ฆ โฆ ๐ฅ = ๐) [๐ผ]. By replacing ๐ผ with ๐ฝ and ๐ with ๐ in the deduction, we conclude that ๐ โจ ∃๐ฆ(๐ฆ โฆ ๐ฅ = ๐) [๐ฝ ]. Since ๐ผ and ๐ฝ agree on their interpretations of ๐, โฆ, and ๐ฅ, it is not surprising that they should agree on their interpretation of any {๐, โฆ}-formula with ๐ฅ as its only free variable. The generalization of this to terms and formulas is the next two results. Section 7.1 FIRST-ORDER SEMANTICS 349 LEMMA 7.1.26 [Coincidence for Terms] Let ๐ฒ and ๐ณ be sets of theory symbols. Let ๐ = (๐ด, ๐) be an ๐ฒ-structure and ๐ = (๐ต, ๐) be a ๐ณ-structure such that ๐ด = ๐ต. Let ๐ผ be an interpretation of ๐ and ๐ฝ be an interpretation of ๐ . If ๐ผ ๐ต๐ ๐ฑ = ๐ฝ ๐ต๐ ๐ฑ and ๐(๐ข) = ๐(๐ข) for all ๐ข ∈ ๐ฒ ∩ ๐ณ, then ๐ผ(๐ก) = ๐ฝ (๐ก) for every (๐ฒ ∩ ๐ณ)-term ๐ก. PROOF By induction on (๐ฒ ∩ ๐ณ)-terms. โ Let ๐ฅ be a variable symbol. Then, ๐ผ(๐ฅ) = ๐ฝ (๐ฅ) by hypothesis. โ Let ๐ be a constant symbol in ๐ฒ ∩ ๐ณ. We have ๐ผ(๐) = ๐(๐) = ๐(๐) = ๐ฝ (๐). โ Suppose ๐ผ(๐ก๐ ) = ๐ฝ (๐ก๐ ) for all (๐ฒ ∩ ๐ณ)-terms ๐ก๐ with ๐ = 0, 1, … , ๐ − 1. Then, ๐ผ(๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )) = ๐(๐ )(๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 )) = ๐(๐ )(๐ฝ (๐ก0 ), ๐ฝ (๐ก1 ), … , ๐ฝ (๐ก๐−1 )) = ๐(๐ )(๐ฝ (๐ก0 ), ๐ฝ (๐ก1 ), … , ๐ฝ (๐ก๐−1 )) = ๐ฝ (๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )). LEMMA 7.1.27 [Coincidence for Formulas] Let ๐ฒ and ๐ณ be sets of theory symbols. Let ๐ = (๐ด, ๐) be an ๐ฒ-structure and ๐ = (๐ต, ๐) be a ๐ณ-structure such that ๐ด = ๐ต. Let ๐ผ be an interpretation of ๐ and ๐ฝ be an interpretation of ๐ . If ๐ผ ๐ต๐ ๐ฑ = ๐ฝ ๐ต๐ ๐ฑ and ๐(๐ข) = ๐(๐ข) for every ๐ข ∈ ๐ฒ ∩ ๐ณ, then ๐ โจ ๐ [๐ผ] if and only if ๐ โจ ๐ [๐ฝ ] for all (๐ฒ ∩ ๐ณ)-formulas ๐. PROOF By induction on (๐ฒ ∩ ๐ณ)-formulas. โ Let ๐ก0 and ๐ก1 be (๐ฒ ∩ ๐ณ)-terms. Then, by Lemma 7.1.26, ๐ โจ ๐ก0 = ๐ก1 [๐ผ] ⇔ ๐ผ(๐ก0 ) = ๐ผ(๐ก1 ) ⇔ ๐ฝ (๐ก0 ) = ๐ฝ (๐ก1 ) ⇔ ๐ โจ ๐ก0 = ๐ก1 [๐ฝ ]. โ Let ๐ก0 , ๐ก1 , … , ๐ก๐−1 be (๐ฒ ∩ ๐ณ)-terms and ๐ a relation symbol of ๐ฒ ∩ ๐ณ. Then, by Lemma 7.1.26, ๐ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) [๐ผ] ⇔ (๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 )) ∈ ๐(๐ ) ⇔ (๐ฝ (๐ก0 ), ๐ฝ (๐ก1 ), … , ๐ฝ (๐ก๐−1 )) ∈ ๐(๐ ) ⇔ (๐ฝ (๐ก0 ), ๐ฝ (๐ก1 ), … , ๐ฝ (๐ก๐−1 )) ∈ ๐(๐ ) ⇔ ๐ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) [๐ฝ ]. Now let ๐ be an (๐ฒ ∩ ๐ณ)-formula. โ ๐ โจ ¬๐ [๐ผ] ⇔ ๐ ฬธโจ ๐ [๐ผ] ⇔ ๐ ฬธโจ ๐ [๐ฝ ] ⇔ ๐ โจ ¬๐ [๐ฝ ]. 350 Chapter 7 MODELS โ Assume that ๐ โจ ๐ [๐ผ] implies ๐ โจ ๐ [๐ผ]. Also, suppose that ๐ โจ ๐ [๐ฝ ]. Then, ๐ โจ ๐ [๐ผ] by induction, so ๐ โจ ๐ [๐ผ]. Thus, ๐ โจ ๐ [๐ฝ ] by induction. The converse is proved similarly, so we have ๐ โจ ๐ → ๐ [๐ผ] ⇔ if ๐ โจ ๐ [๐ผ] then ๐ โจ ๐ [๐ผ] ⇔ if ๐ โจ ๐ [๐ฝ ] then ๐ โจ ๐ [๐ฝ ] ⇔ ๐ โจ ๐ → ๐ [๐ฝ ]. โ Note that for all ๐ ∈ ๐ด, ๐ผ๐ฅ๐ ๐ต๐ ๐ฑ = ๐ฝ๐ฅ๐ ๐ต๐ ๐ฑ because and if ๐ฆ ≠ ๐ฅ, ๐ผ๐ฅ๐ (๐ฅ) = ๐ข = ๐ฝ๐ฅ๐ (๐ฅ) ๐ผ๐ฅ๐ (๐ฆ) = ๐ผ(๐ฆ) = ๐ฝ (๐ฆ) = ๐ฝ๐ฅ๐ (๐ฆ). Therefore, by induction and since ๐ด = ๐ต, ๐ โจ ∃๐ฅ๐ [๐ผ] ⇔ ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด ⇔ ๐ โจ ๐ [๐ฝ๐ฅ๐ ] for some ๐ ∈ ๐ต ⇔ ๐ โจ ∃๐ฅ๐ [๐ฝ ]. EXAMPLE 7.1.28 Define the sets of theory symbols ๐ฒ = {0, 1, +, ⋅, ≤} and ๐ณ = {0, 1, +, ∗, ≥}. Let ๐ = (๐, ๐) be an ๐ฒ-structure and ๐ = (๐, ๐) be a ๐ณ-structure where ๐(0) = ๐(0) = ∅, ๐(1) = ๐(1) = {∅}, ๐(+) = ๐(+). Notice, for example, that under the right interpretation, ๐ could be a model of ∀๐ฅ(๐ฅ ⋅ 1 = ๐ฅ) since it is an ๐ฒ-formula, but it does not make sense for ๐ to be a model of the same sentence because ∀๐ฅ(๐ฅ ⋅ 1 = ๐ฅ) is not a ๐ณ-formula. Now, let ๐ผ be an ๐ฒ-interpretation of ๐ and ๐ฝ be a ๐ณ-interpretation of ๐ such that they agree on all variable symbols. Since we have the hypotheses of Lemma 7.1.27 satisfied, let us confirm the lemma. Consider the (๐ฒ ∩ ๐ณ)-formula ∀๐ฅ(๐ฅ + 0 = ๐ฅ). Assume ๐ โจ ∀๐ฅ(๐ฅ + 0 = ๐ฅ) [๐ผ], so ๐ โจ (๐ฅ + 0 = ๐ฅ) [๐ผ๐ฅ๐ ] for all ๐ ∈ ๐. This implies that Section 7.1 FIRST-ORDER SEMANTICS 351 ๐(+)(๐ผ๐ฅ๐ (๐ฅ), ๐ผ๐ฅ๐ (0)) = ๐ผ๐ฅ๐ (๐ฅ) for all ๐ ∈ ๐. Since ๐ผ and ๐ฝ agree on all variable symbols, ๐(0) = ๐(0), and ๐(+) = ๐(+), ๐(+)(๐ฝ๐ฅ๐ (๐ฅ), ๐ฝ๐ฅ๐ (0)) = ๐ฝ๐ฅ๐ (๐ฅ) for all ๐ ∈ ๐. Therefore, ๐ โจ (๐ + 0 = ๐) [๐ฝ๐ฅ๐ ] for all ๐ ∈ ๐, which gives ๐ โจ ∀๐ฅ(๐ฅ + 0 = ๐ฅ) [๐ฝ ]. The purpose of the coincidence lemmas (7.1.26 and 7.1.27) is to minimize the use of interpretation functions, especially when modeling sentences. LEMMA 7.1.29 Let ๐ be an ๐ฒ-structure. If ๐ is an ๐ฒ-sentence, ๐ โจ ๐ [๐ผ] if and only if ๐ โจ ๐ [๐ฝ ] for all ๐ฒ-interpretations ๐ผ and ๐ฝ of ๐. PROOF Suppose that ๐ผ and ๐ฝ are ๐ฒ-interpretations of ๐. Let ๐ โจ ๐ [๐ผ]. Since ๐ has no free variables, ๐ โจ ๐ [๐ฝ ] by the proof of Lemma 7.1.27. Lemma 7.1.29 implies that any interpretation will do when modeling sentences. Therefore, we make the next definition. DEFINITION 7.1.30 For any ๐ฒ-sentence ๐ and ๐ฒ-structure ๐ = (๐ด, ๐), write ๐ โจ ๐ if ๐ โจ ๐ [๐ผ] for all ๐ฒ-interpretations ๐ผ of ๐. Lemma 7.1.29 can be used to quickly prove the next result. THEOREM 7.1.31 For any ๐ฒ-structure ๐ and ๐ฒ-sentence ๐, ๐ โจ ๐ if and only if ๐ โจ ๐ [๐ผ] for some ๐ฒ-interpretation ๐ผ of ๐. Therefore, letting ๐ be the ๐ฆ๐ฑ-structure of Example 7.1.12, by (7.2), we have that ๐ โจ ∀๐ฅ∃๐ฆ(๐ฅ โฆ ๐ฆ = 0). The coincidence lemmas (7.1.26 and 7.1.27) also minimize the use of the sets of theory symbols. Consider the following. 352 Chapter 7 MODELS THEOREM 7.1.32 Let ๐ฒ ⊆ ๐ณ be theory symbol sets. If โฑ is a set of ๐ฒ-formulas, โฑ is ๐ฒ-satisfiable if and only if โฑ is ๐ณ-satisfiable. PROOF Let โฑ be a set of of ๐ฒ-formulas. First, suppose that (๐ด, ๐) โจ โฑ [๐ผ], where dom(๐) = ๐ฒ and dom(๐ผ) = ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ฒ). Let ๐′ be an extension of ๐ to ๐ณ and ๐ผ ′ be an extension of ๐ผ such that dom(๐ผ ′ ) = ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ณ). Notice that this implies that ๐ and ๐′ agree on ๐ฒ = ๐ฒ ∩ ๐ณ. Therefore, (๐ด, ๐′ ) โจ โฑ [๐ผ ′ ] by Lemma 7.1.27. Conversely, assume that (๐ด, ๐) โจ โฑ [๐ผ] such that both dom(๐) = ๐ณ and dom(๐ผ) = ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ณ). Let ๐′ = ๐ ๐ฒ and ๐ผ ′ = ๐ผ ∩ ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ฒ). This implies that (๐ด, ๐′ ) โจ โฑ [๐ผ ′ ] by Lemma 7.1.27. There is terminology to name the relationship between the structures found in the proof of Theorem 7.1.32. Let the theory symbols ๐ฒ be a subset of the theory symbols ๐ณ. Let ๐ = (๐ด, ๐) be an ๐ฒ-structure and ๐′ (๐ด′ , ๐′ ) be an ๐ณ-structure. If ๐ด = ๐ด′ and ๐ = ๐′ ๐ฒ, we call ๐′ an expansion of ๐ and ๐ a reduct of ๐′ . Hence, in the first part of the proof, we started with a structure and then moved to an expansion, and in the second part, we started with a structure and then moved to a reduct. Theorems 7.1.31 and 7.1.32 motivate the next two definitions. DEFINITION 7.1.33 Let ๐ฒ ⊆ ๐ณ be sets of theory symbols. An ๐ฒ-formula ๐ is satisfiable (denoted by ๐ฒ๐บ๐ ๐) if there exists an ๐ณ-structure that is a model for ๐. The set of ๐ฒ-formulas โฑ is satisfiable (denoted by ๐ฒ๐บ๐ โฑ ) if there exists an ๐ณ-structure that is a model for โฑ . DEFINITION 7.1.34 Let ๐ฒ ⊆ ๐ณ be sets of theory symbols. Assume that ๐ฏ is a set of ๐ฒ-sentences. An ๐ฒ-sentence ๐ is a consequence of ๐ฏ (denoted by ๐ฏ โจ ๐) when ๐ โจ ๐ฏ implies ๐ โจ ๐ for every ๐ณ-structure ๐. EXAMPLE 7.1.35 The group axioms state that in a group there is an identity and there are inverses. Based on what we know about the integers, we should be able to prove more about these elements. For example, we expect that in a group, both there is exactly one identity and every element has a unique inverse are true. The uniqueness of the identity is left to Exercise 20. To show the uniqueness of inverses, let ๐ = (๐บ, ๐, โฆ) be a group, and take ๐ ∈ ๐บ. Suppose that Section 7.1 FIRST-ORDER SEMANTICS 353 ๐′ , ๐′′ ∈ ๐บ and are inverses of ๐. Then, ๐′ = ๐′ โฆ ๐ = ๐′ โฆ (๐ โฆ ๐′′ ) = (๐′ โฆ ๐) โฆ ๐′′ = ๐ โฆ ๐′′ = ๐′′ . Therefore, the uniqueness of inverses is a consequence (Definition 7.1.34) of the group axioms. Rings Consider the equation 2๐ฅ + 1 = 0. The exact steps needed to find its solution are (2๐ฅ + 1) + −1 = 0 + −1, 2๐ฅ + (1 + −1) = 0 + −1, 2๐ฅ + 0 = 0 + −1, 2๐ฅ = −1, 1โ2(2๐ฅ) = 1โ2(−1), (1โ2 ⋅ 2)๐ฅ = −1โ2, 1๐ฅ = −1โ2, ๐ฅ = −1โ2. Now examine the steps. There are two operations, addition and multiplication. We used inverses and identities. The associative law was also used. When studying these steps, we realize that they cannot be performed within (โค, 0, +) even though the initial equation had only integer coefficients. This means that the group idea needs to be expanded. This is done by including two symbols to represent addition and multiplication. Since these two operations can have their own identities, replace ๐ with โ to represent the additive identity. The ideas behind the group axioms are then extended using ๐ฑ๐จ-sentences. AXIOMS 7.1.36 [Ring] โ R1. ∀๐ฅ∀๐ฆ∀๐ง [๐ฅ ⊕ (๐ฆ ⊕ ๐ง) = (๐ฅ ⊕ ๐ฆ) ⊕ ๐ง] ∀๐ฅ∀๐ฆ∀๐ง [๐ฅ ⊗ (๐ฆ ⊗ ๐ง) = (๐ฅ ⊗ ๐ฆ) ⊗ ๐ง] โ R2. ∀๐ฅ∀๐ฆ(๐ฅ ⊕ ๐ฆ = ๐ฆ ⊕ ๐ฅ) โ R3. ∀๐ฅ(0 ⊕ ๐ฅ = ๐ฅ) โ R4. ∀๐ฅ∃๐ฆ(๐ฅ ⊕ ๐ฆ = โ) โ R5. ∀๐ฅ∀๐ฆ∀๐ง [๐ฅ ⊗ (๐ฆ ⊕ ๐ง) = ๐ฅ ⊗ ๐ฆ ⊕ ๐ฅ ⊗ ๐ง] ∀๐ฅ∀๐ฆ∀๐ง [(๐ฅ ⊕ ๐ฆ) ⊗ ๐ง = ๐ฅ ⊗ ๐ง ⊕ ๐ฆ ⊗ ๐ง] 354 Chapter 7 MODELS DEFINITION 7.1.37 An ๐ฑ๐จ-structure ℜ = (๐ , 0, +, ⋅) that models the ring axioms is called a ring. If there exists an multiplicative identity in ๐ , then ℜ is a ring with unity. The additive inverse of ๐ is −๐, and the multiplicative inverse of ๐ is ๐−1 assuming that ๐ ≠ 0. We usually write ๐−๐ instead of ๐+(−๐). Notice that if ℜ = (๐ , 0, +, ⋅) is a ring, its reduct (๐ , 0, +) is a group. Also, letting + and ⋅ denote addition and multiplication on โค, โจ = (โค, 0, +, ⋅) is a ring with unity. Also, โ, โ, and โ are the domains of rings with unity using the typical operations of addition and multiplication. EXAMPLE 7.1.38 Axioms 7.1.36 do not require that the ring multiplication be commutative. Let ℜ = (๐ , 0, +, ⋅). Then, ℜ is a commutative ring. if ℜ โจ ∀๐ฅ∀๐ฆ(๐ฅ ⊗ ๐ฆ = ๐ฆ ⊗ ๐ฅ). โ Let + and ⋅ denote standard addition and multiplication on โค. Let ๐ ∈ โค. We conclude that ๐ = (๐โค, 0, +, ⋅) is a commutative ring. It is without unity if ๐ ≠ ±1. โ Take [๐]๐ , [๐]๐ ∈ โค๐ and define + as in Example 7.1.17 and multiplication defined by [๐]๐ ⋅ [๐]๐ = [๐๐]๐ . Then, ๐ = (โค๐ , [0]๐ , +, ⋅) is a commutative ring (Exercise 29). Axioms 7.1.36 also do not state that when the additive identity is multiplied by any element of the ring, the result is the additive identity. It is not among the axioms because it can be proved. Take ๐ ∈ ๐ . By R3, 0 + 0 = 0, so by R5, 0 ⋅ ๐ = (0 + 0) ⋅ ๐ = 0 ⋅ ๐ + 0 ⋅ ๐. By R4 and since + is a binary operation, 0 ⋅ ๐ + −(0 ⋅ ๐) = (0 ⋅ ๐ + 0 ⋅ ๐) + −(0 ⋅ ๐). Because of R1, 0 ⋅ ๐ + −(0 ⋅ ๐) = 0 ⋅ ๐ + [0 ⋅ ๐ + −(0 ⋅ ๐)]. Hence, 0 = 0 ⋅ ๐ + 0, which implies that 0 = 0 ⋅ ๐. Therefore, ℜ โจ ∀๐ฅ(โ = โ ⊗ ๐ฅ), and ∀๐ฅ(โ = โ ⊗ ๐ฅ) is a consequence (Definition 7.1.34) of the ring axioms. Section 7.1 FIRST-ORDER SEMANTICS 355 EXAMPLE 7.1.39 Let ๐ ∈ โค+ , define matrix addition on M๐ (โ) entrywise. For instance, โก1 โข3 โข โฃ5 2 4 6 1 โค โก1 −4โฅ + โข2 โฅ โข 0 โฆ โฃ0 0 8โค โก2 −5 0โฅ = โข5 โฅ โข −2 3โฆ โฃ5 2 −1 4 9โค −4โฅ . โฅ 3โฆ Let 0๐ be the zero matrix. It is the ๐ × ๐ matrix with all of its entries equal to 0. As in Example 7.1.19, let ⋅ represent matrix multiplication and ๐ผ๐ the identity matrix. Prove that ๐๐ (โ) = (M๐ (โ), 0๐ , +, ⋅) is a ring. โ To see that matrix addition is associative, we rely on the fact that standard addition of real numbers is associative. Take three matrices from M2 (โ) and add: ]) ] [ ] ([ [ ๐ ๐ ๐1,1 ๐1,2 ๐1,1 ๐1,2 + 1,1 1,2 + ๐2,1 ๐2,2 ๐2,1 ๐2,2 ๐2,1 ๐2,2 ] ] [ [ ๐1,1 + ๐1,1 ๐1,2 + ๐1,2 ๐1,1 ๐1,2 + = ๐2,1 + ๐2,1 ๐2,2 + ๐2,2 ๐2,1 ๐2,2 [ ] ๐1,1 + (๐1,1 + ๐1,1 ) ๐1,2 + (๐1,2 + ๐1,2 ) = ๐2,1 + (๐2,1 + ๐2,1 ) ๐2,2 + (๐2,2 + ๐2,2 ) ] [ (๐1,1 + ๐1,1 ) + ๐1,1 (๐1,2 + ๐1,2 ) + ๐1,2 = (๐2,1 + ๐2,1 ) + ๐2,1 (๐2,2 + ๐2,2 ) + ๐2,2 ] ] [ [ ๐1,1 ๐1,2 ๐1,1 + ๐1,1 ๐1,2 + ๐1,2 + = ๐2,1 ๐2,2 ๐2,1 + ๐2,1 ๐2,2 + ๐2,2 ] [ ]) [ ] ([ ๐1,1 ๐1,2 ๐1,1 ๐1,2 ๐1,1 ๐1,2 . + + = ๐2,1 ๐2,2 ๐2,1 ๐2,2 ๐2,1 ๐2,2 โ Since [ 0 0 ] [ 0 ๐ + 1,1 0 ๐2,1 ] [ ] ๐1,2 0 + ๐1,1 0 + ๐1,2 = ๐2,2 0 + ๐2,1 0 + ๐2,2 ] [ ๐ ๐1,2 , = 1,1 ๐2,1 ๐2,2 the zero matrix is the additive identity. โ To prove that every element of M2 (โ) has an additive inverse, take [ ๐ ๐ด = 1,1 ๐2,1 ] ๐1,2 ∈ M2 (โ). ๐2,2 356 Chapter 7 MODELS Then, [ −๐1,1 −๐ด = −๐2,1 −๐1,2 −๐2,2 ] because ๐ด + (−๐ด) = 02 . Generalizing, conclude that ๐๐ (โ) โจ {G1, G2, G3}, making the ๐ฆ๐ฑ-structure (M๐ (โ), 0๐ , +) a group. โ Matrix addition is commutative because addition on โ is commutative. Therefore, (M๐ (โ), 0๐ , +) is an abelian group. โ Matrix multiplication is associative. โ Lastly, to show that the operations are distributive, we must show for all ๐ด, ๐ต, ๐ถ ∈ M2 (โ), ๐ด(๐ต + ๐ถ) = ๐ด๐ต + ๐ด๐ถ and (๐ด + ๐ต)๐ถ = ๐ด๐ถ + ๐ต๐ถ. Therefore, ๐๐ (โ) โจ {R1, R2, R3, R4, R5}, so ๐๐ (โ) is a ring. โ Since I๐ is the multiplicative identity, ℜ is a ring with unity, and since matrix multiplication is not commutative, ℜ is a noncommutative ring. This proves that ∀๐ฅ∀๐ฆ(๐ฅ ⊗ ๐ฆ = ๐ฆ ⊗ ๐ฅ) is not a consequence of the ring axioms. EXAMPLE 7.1.40 If ๐ is the domain of a ring, ๐, ๐ ∈ ๐ โงต {0} are zero divisors of the ring means that ๐ ⋅ ๐ = 0. Defining addition and multiplication coordinatewise (Exercise 18), the ring (โค × โค, (0, 0), +, ⋅) has zero divisors such as (1, 0) ⋅ (0, 1) = (0, 0). Other examples can be found in M2 (โ) where [ ] [ ] [ 1 1 1 1 0 ⋅ = 0 0 −1 −1 0 However, [ 1 −1 ] [ 1 1 ⋅ −1 0 ] [ 1 1 = 0 −1 ] 0 . 0 ] 1 , −1 showing that an element can be a left zero divisor but not a right zero divisor. This situation is common for rings where multiplication is not commutative. We do, however, have many rings that do not have zero divisors. An integral domain is a commutative ring with unity that does not have zero divisors. The rings (โค, 0, +, ⋅), (โ, 0, +, ⋅), (โ, 0, +, ⋅), and (โ, 0, +, ⋅) are integral domains. Section 7.1 FIRST-ORDER SEMANTICS 357 The equation 2๐ฅ + 1 = 0 is written with elements of โค and the operations of regular addition and multiplication. Although โค has no zero divisors, there is no integer that is a solution to this equation. To solve the equation, we need the existence of multiplicative inverses. Let (๐ , 0, +, ⋅) be a ring with unity. If ๐ข ∈ ๐ has the property that there exists ๐ฃ ∈ ๐ such that ๐ข ⋅ ๐ฃ = ๐ฃ ⋅ ๐ข = 1, then ๐ข is called a unit. Notice that units are multiplicative inverses of each other. With this terminology, we make the next definition. DEFINITION 7.1.41 Let (๐ , 0, +, ⋅) be a ring with unity. โ If all nonzero elements of ๐ are units, ๐ is called a division ring or sometimes a skew field. โ A commutative division ring is called a field. The reason that the equation on page 353 can be solved the way it was is that โ with addition and multiplication form a field. EXAMPLE 7.1.42 While โค is not a field with standard addition and multiplication, โ, โ, and โ are. A more interesting structure is (โค๐ , [0]๐ , +, ⋅) when ๐ is a prime. To prove that it is a field, let [๐]๐ ∈ โค๐ so that [๐]๐ ≠ [0]๐ . We must find an element of โค๐ so that when it is multiplied with [๐]๐ the result is [1]๐ . Since [๐]๐ ≠ [0]๐ , ๐ does not divide ๐. Hence, ๐ and ๐ are relatively prime, so there are integers ๐ข and ๐ฃ such that ๐ข๐ + ๐ฃ๐ = 1. We are then able to calculate: [๐ข]๐ ⋅ [๐]๐ = [๐ข๐]๐ = [1 − ๐ฃ๐]๐ = [1]๐ + [−๐ฃ๐]๐ = [1]๐ + [0]๐ = [1]๐ . EXAMPLE 7.1.43 Let ℜ be a division ring and take ๐ข and ๐ฃ to be elements of the domain of ℜ. Let 1 be unity. Assume ๐ข๐ฃ = 0 and ๐ข ≠ 0. Then, ๐ข−1 exists, and we can calculate ๐ข−1 (๐ข๐ฃ) = ๐ข−1 0, (๐ข−1 ๐ข)๐ฃ = 0, 1๐ฃ = 0, ๐ฃ = 0. Therefore, ℜ has no zero divisors. 358 Chapter 7 MODELS Exercises 1. Prove the remaining parts of Theorem 7.1.7. 2. Let 4 be a linear order on a nonempty set ๐ด. Let ๐ be a binary relation symbol. Define the {๐ }-structure ๐ = (๐ด, 4). Let ๐ผ be an ๐ฒ-interpretation of ๐ such that ๐ผ(๐ ) = 4. Prove the following. (a) ๐ โจ ∃๐ฅ(๐ฅ๐ ๐ฅ) [๐ผ]. (b) ๐ โจ ∀๐ฅ∀๐ฆ(๐ฅ๐ ๐ฆ ∨ ๐ฆ๐ ๐ฅ) [๐ผ]. (c) ๐ โจ ∀๐ฅ∀๐ฆ(๐ฅ๐ ๐ฆ ∧ ๐ฆ๐ ๐ฅ → ๐ฅ = ๐ฆ) [๐ผ]. 3. Let ๐ be the ๐ญ๐ณ-structure (โค, 0๐ , 1๐ , +๐ , ⋅๐ ), where 0๐ and 1๐ are the numbers 0 and 1 in โค while +๐ and ⋅๐ are the standard operations of addition and multiplication of integers. Let ๐ผ be a ๐ญ๐ณ-interpretation such that ๐ผ(๐ฅ) = 2 and ๐ผ(๐ฆ) = −2. Prove the following. (a) ๐ โจ ๐ฅ + ๐ฆ = 0 [๐ผ]. (b) ๐ โจ ∃๐ฅ([๐ฅ + 1] + 1 = 0) [๐ผ]. (c) ๐ โจ ∃๐ฅ∀๐ฆ(๐ฅ ⋅ ๐ฆ = ๐ฆ) [๐ผ]. (d) ๐ โจ ∀๐ฅ∀๐ฆ∀๐ง(¬๐ง = 0 ∧ ๐ฅ ⋅ ๐ง = ๐ฆ ⋅ ๐ง → ๐ฅ = ๐ฆ) [๐ผ]. 4. Show that ๐ โจ ∀๐ฅ[(๐ฅ + ๐ฅ5 ) + ๐ฅ8 = ๐ฅ + (๐ฅ5 + ๐ฅ8 )], where ๐ is the ๐ญ๐ณ-structure of Example 7.1.10. 5. Find a set of theory symbols ๐ฒ, an ๐ฒ-structure ๐, and an ๐ฒ-interpretation ๐ผ such that ๐ โจ ๐ [๐ผ] for each given formula ๐. (a) ๐ฅ + ๐ฆ = ([1 + 1] + 1) + 1 (b) ๐ฅโ๐ฆ + ๐ง = 10 (c) ∃๐ฅ∃๐ฆ(๐ฅ < ๐ฆ ∧ ๐ฅ + 1 = ๐ฆ) (d) ∀๐ฅ∀๐ฆ∀๐ง(๐ฅ๐ ๐ฆ ∧ ๐ฆ๐ ๐ง → ๐ง๐ ๐ฅ) (e) ∀๐ฅ∀๐ฆ(๐ฅ ⋅ ๐ฆ = 0 → ๐ฅ = 0 ∨ ๐ฆ = 0) 6. For each formula in Exercise 5, find a model (๐, ๐ผ) such that ๐ ฬธโจ ๐ [๐ผ] for each given formula ๐. 7. Prove that every ๐ฒ-structure is a model of the empty set. 8. Let ๐ด be a set. Is (P(๐ด), ∅, ∩) a group? Explain. 9. Explain why (โค+ , 0, +), (โค, 1, ⋅), and (โ, 1, ⋅) are not groups, where the operations are the standard ones. 10. Suppose that ∗ is an operation on โค defined by ๐ฅ ∗ ๐ฆ = ๐ฅ + ๐ฆ + 2. (a) Identify the identity ๐ and the inverses with respect to ∗. (b) Prove that (โค, ๐, ∗) is a group, where ๐ is the identity found in Exercise 10(a). (c) Solve 8 ∗ ๐ฅ = 10. 11. Let 0 represent the zero function โ → โ and + be function addition. That is, For all ๐ฅ ∈ โ, (๐ + ๐)(๐ฅ) = ๐ (๐ฅ) + ๐(๐ฅ). Section 7.1 FIRST-ORDER SEMANTICS 359 (a) Prove that (โ โ, 0, +) is a group. (b) Is โ โ the domain of a group where the binary operation is function division? If so, what is its identity? (c) Is โ โ the domain of a group where the binary operation is composition? If so, what is identity? 12. Let ๐ be a positive integer. (a) Prove that matrix multiplication is associative. (b) Solve the equation in ๐2 (โ): [ ] [ ] [ ] 1 4 ๐ ๐ −3 8 + = . −3 0 ๐ ๐ 0 −6 (c) Show that ๐๐ (โ) is not the domain of a group under matrix multiplication. 13. Let (๐บ, ๐, ∗) and (๐บ′ , ๐ ′ , ∗′ ) be two groups. For all ๐, ๐ ∈ ๐บ and ๐′ , ๐′ ∈ ๐บ′ define (๐, ๐′ ) ⋅ (๐, ๐′ ) = (๐ ∗ ๐, ๐′ ∗′ ๐′ ). (a) Confirm that ⋅ is a binary operation on ๐บ × ๐บ′ . (b) Show that (๐บ × ๐บ′ , (๐, ๐ ′ ), ⋅) is a group. Prove that it is abelian if and only if both of the given groups are abelian. 14. Let ๐ be an integer. Prove that (๐โค, 0, +, ⋅) is a commutative ring. 15. Why is {[ ๐ ๐ ๐ ๐ ] : ๐, ๐, ๐, ๐, ∈ โค + } not the domain of a ring under the standard matrix operations? 16. Prove that the set {[ ] } ๐ 0 : ๐, ๐ ∈ โ 0 ๐ is the domain of a ring with the standard matrix operations. 17. Both + (function addition) and โฆ (composition) are binary operations on โ โ, but (โ โ, 0, +, โฆ) is not a ring. Identify which ring axioms fail. 18. Let (๐ , 0, +, ⋅) and (๐ ′ , 0′ , +′ , ⋅′ ) be rings. Define addition and multiplication on ๐ × ๐ ′ so that for all (๐, ๐), (๐, ๐) ∈ ๐ × ๐ ′ , (๐, ๐) + (๐, ๐) = (๐ + ๐, ๐ +′ ๐), and (๐, ๐) ⋅ (๐, ๐) = (๐ ⋅ ๐, ๐ ⋅′ ๐). Prove that (๐ × ๐ ′ , (0, 0), +, ⋅) is a ring. 19. Prove the converse of Example 7.1.24. 20. Prove that {G1, G2, G3} โจ ∀๐ฅ∀๐ฆ[∀๐ง(๐ฅ โฆ ๐ง = ๐ง ∧ ๐ง โฆ ๐ฅ = ๐ฅ) ∧ ∀๐ง(๐ฆ โฆ ๐ง = ๐ง ∧ ๐ง โฆ ๐ฆ = ๐ง) → ๐ฅ = ๐ฆ]. 360 Chapter 7 MODELS 21. Let ๐ = (๐บ, ๐, ∗) be a group so that ๐ โจ ∀๐ฅ∀๐ฆ[(๐ โฆ ๐) โฆ (๐ โฆ ๐) = ๐ โฆ ๐ โฆ ๐ โฆ ๐]. Show that ๐ is abelian. 22. Prove that ∀๐ฅ(โ ⊗ ๐ฅ = โ) is a consequence of the ring axioms. 23. Let − be a unary function symbol. Define ๐ฑ๐จ′ = ๐ฑ๐จ ∪ {−}. Show that the given sentences are consequences of the ring axioms and ∀๐ฅ(−๐ฅ ⊕ ๐ฅ = โ). (a) ∀๐ฅ∀๐ฆ[−(๐ฅ ⊗ ๐ฆ) = −๐ฅ ⊗ ๐ฆ ∧ −๐ฅ ⊗ ๐ฆ = ๐ฅ ⊗ −๐ฆ] (b) ∀๐ฅ∀๐ฆ(−๐ ⊗ −๐ = ๐ ⊗ ๐) (c) ∀๐ฅ∀[−(๐ ⊕ ๐) = −๐ ⊕ −๐] (d) −โ = โ 24. This exercise uses the notation of Exercise 23. Let ℜ be a ring with unity. Let ℜ′ be the expansion of ℜ to ๐ฑ๐จ′ = ๐ฑ๐จ ∪ {−}. Assume that for all ๐ ∈ dom(ℜ′ ), −ℜ (๐) ⊕ℜ ๐ = โℜ . Prove that ℜ′ โจ ∀๐ฅ[∀๐ฆ(๐ฅ ⊗ ๐ฆ = ๐ฆ ∧ ๐ฆ ⊗ ๐ฅ = ๐ฆ) → ∀๐ง(−๐ง = −๐ฅ ⊗ ๐ง)]. 25. Let ๐ and ๐ be ๐ฒ-formulas. Prove that the given ๐ฒ-sentences are valid. (a) ๐ ∨ ¬๐ (b) ๐ → ๐ ↔ ¬๐ ∨ ๐ (c) ∃๐ฅ(๐ ∨ ๐) ↔ ∃๐ฅ๐ ∨ ∃๐ฅ๐ (d) ∀๐ฅ(๐ ∧ ๐) ↔ ∀๐ฅ๐ ∧ ∀๐ฅ๐ 26. Let ๐ be a binary relation symbols and ๐ be a binary function symbol. Show that the given sentences are satisfiable. (a) ∃๐ฅ(๐ฅ = ๐ฅ) (b) ∃๐ฅ∃๐ฆ∃๐ง(¬๐ฅ = ๐ฆ ∧ ¬๐ฅ = ๐ง ∧ ¬๐ฆ = ๐ง) (c) ∃๐ฅ∀๐ฆ(๐ ๐ฅ๐ฆ ∨ ๐ฅ = ๐ฆ) (d) ∀๐ฅ∀๐ฆ(๐ ๐ฅ๐ฆ = 1) (e) ∀๐ฅ∀๐ฆ[๐ ๐ฅ๐ฆ → ∃๐ง(๐ ๐ฅ๐ง ∧ ๐ ๐ง๐ฆ)] 27. Let ๐ฒ and ๐ณ be sets of theory symbols such that ๐ฒ ⊆ ๐ณ. Let ๐ be a reduct to ๐ฒ of the ๐ณ-structure ๐ . Prove that ๐ โจ ๐ if and only if ๐ โจ ๐ for all ๐ฒ-sentences ๐. 28. Suppose that ๐0 , ๐1 , … , ๐๐−1 are ๐ฒ-sentences. For every ๐ฒ-structure ๐, prove that ๐ โจ ๐0 ∧ ๐1 ∧ · · · ∧ ๐๐−1 if and only if ๐ โจ ๐๐ for all ๐ = 0, 1, … , ๐ − 1. 29. Answer the following about (โค๐ , [0]๐ , +, ⋅): (a) Prove that addition and multiplication of congruence classes is well-defined. (b) Show that the additive identity is [0]๐ . (c) For all ๐ ∈ โค, show that − [๐]๐ = [๐ − ๐]๐ . (d) Show that [1]๐ is the multiplicative identity. (e) Prove that (โค๐ , [0]๐ , +, ⋅) is a commutative ring. (f) Prove that the ring contains zero divisors when ๐ is not prime. Section 7.2 SUBSTRUCTURES 361 30. Prove that ∀๐ฅ∀๐ฆ∀๐ง(๐ฅ ⊕ ๐ฆ = ๐ฅ ⊕ ๐ง → ๐ฆ = ๐ง) is a consequence of the ring axioms. 31. Let ℜ be an integral domain. Prove the following. (a) ℜ โจ ∀๐ฅ∀๐ฆ(๐ฅ ⊗ ๐ฆ = โ → ๐ฅ = โ ∨ ๐ฆ = 0). (b) ℜ โจ ∀๐ฅ∀๐ฆ∀๐ง(๐ฅ ⊗ ๐ฆ = ๐ฅ ⊗ ๐ง ∧ ๐ฅ ≠ โ → ๐ฆ = ๐ง). 32. Suppose that ℜ = (๐ , 0, +, ⋅) is a commutative ring with unity. Show that if ℜ โจ ∀๐ฅ∀๐ฆ∃๐ง(๐ฅ ⊗ ๐ง ⊕ ๐ฆ = โ), then ℜ a field. 33. Is ({0}, 0, +, ⋅) a field? Explain. 7.2 SUBSTRUCTURES When looking for examples of groups, the ๐ฆ๐ฑ-structure (โค, 0, +) is often the first to come to mind. The benefit of this example is that not only are we familiar with the integers but it has the property that many of its subsets also form groups. Let ๐ ∈ โค. Addition on โค restricted to ๐โค × ๐โค is an associative binary operation on ๐โค, every element of ๐โค has an additive inverse in ๐โค, and 0 ∈ ๐โค, so the ๐ฆ๐ฑ-structure (๐โค, 0, +) is a group. Since ๐ ≠ ±1 implies that ๐โค ⊂ โค, there are infinitely many different examples of ๐ฆ๐ฑ-structures, all within (โค, 0, +). We generalize this idea to arbitrary structures. DEFINITION 7.2.1 If ๐ = (๐ด, ๐) and ๐ = (๐ต, ๐) are ๐ฒ-structures, ๐ is a substructure of ๐ (written as ๐ ⊆ ๐ ) means that ๐ด ⊆ ๐ต and the following properties hold. โ ๐(๐) = ๐(๐) for all constant symbols ๐. โ ๐(๐ ) = ๐(๐ ) ∩ ๐ด๐ for every ๐-ary relation symbol ๐ . โ ๐(๐ ) = ๐(๐ ) ๐ด๐ for every ๐-ary function symbol ๐ . If ๐ is a substructure of ๐ , then ๐ is an extension of ๐. Note the difference between a substructure and a reduct and between an extension and an expansion (page 352). For all ๐ ∈ โค, the group (๐โค, 0, +) is a substructure of (โค, 0, +), and (โค, 0, +) is an extension of (๐โค, 0, +). Here both structures have the same set of theory symbols, and the domain of one is a subset of the other. However, (โค, 0, +) is a reduct of (โค, 0, 1, +, ⋅), and (โค, 0, 1, +, ⋅) is an expansion of (โค, 0, +). In this case, the domains are the same, but the theory symbol set of the one is a subset of the theory symbol set of the other. EXAMPLE 7.2.2 Let ๐ be a binary relation symbol. Let ๐ = ([0, 1] , ๐) and ๐ = ([0, 2] , ๐) be {๐ }-structures such that ๐(๐ ) and ๐(๐ ) are both standard less-than. That is, ๐(๐ ) = {(๐ฅ, ๐ฆ) ∈ โ × โ : 0 ≤ ๐ฅ < ๐ฆ ≤ 1} 362 Chapter 7 MODELS and ๐(๐ ) = {(๐ฅ, ๐ฆ) ∈ โ × โ : 0 ≤ ๐ฅ < ๐ฆ ≤ 2}. We conclude that ๐ ⊆ ๐ because of the following: โ [0, 1] ⊆ [0, 2]. โ There are no constant symbols. โ ๐(๐ ) = ๐(๐ ) ∩ ([0, 1] × [0, 1]). โ There are no function symbols. EXAMPLE 7.2.3 Let ๐ ∈ โค. Define the ๐ญ๐ณ-structure ๐ = (โค, ๐), where ๐(0) is the additive identity of โค, ๐(1) is the multiplicative identity of โค, ๐(+) is standard addition on โค, and ๐(⋅) is standard multiplication on โค. Let ๐๐ = (๐โค, ๐) such that ๐(0) = ๐(0), ๐(1) = ๐(1), ๐(+) = ๐(+) (๐โค × ๐โค), ๐(⋅) = ๐(⋅) (๐โค × ๐โค). Then, ๐ is a substructure of ๐ . In particular, Example 7.2.3 gives ๐8 ⊆ ๐4 ⊆ ๐2 , which implies that ๐8 ⊆ ๐2 . This is a special case of the next theorem. THEOREM 7.2.4 Let ๐, ๐ , and โญ be ๐ฒ-structures. โ ๐ ⊆ ๐. โ If ๐ ⊆ ๐ and ๐ ⊆ โญ, then ๐ ⊆ โญ. PROOF That ๐ is a substructure of itself is clear, so suppose that ๐ is a substructure of ๐ and ๐ is a substructure of โญ. Write ๐ = (๐ด, ๐), ๐ = (๐ต, ๐), and โญ = (๐ถ, ๐ ). Then, for all constant symbols ๐, ๐(๐) = ๐(๐) = ๐ (๐). Since ๐ ⊆ ๐ , ๐(๐ ) = ๐(๐ ) ∩ ๐ด๐ , and since ๐ ⊆ โญ, ๐(๐ ) = ๐ (๐ ) ∩ ๐ต ๐ for every ๐-ary relation symbol ๐ , so ๐(๐ ) = ๐ (๐ ) ∩ ๐ต ๐ ∩ ๐ด๐ = ๐ (๐ ) ∩ ๐ด๐ . Section 7.2 SUBSTRUCTURES 363 Also, ๐(๐ ) = ๐(๐ ) ๐ด๐ and ๐(๐ ) = ๐ (๐ ) ๐ต ๐ for all ๐-ary function symbols ๐ , so ๐(๐ ) = (๐ (๐ ) ๐ต ๐ ) ๐ด๐ = ๐ (๐ ) ๐ด๐ . Therefore, ๐ is a substructure of โญ. Subgroups Let ๐ be an element of a group (๐บ, ๐, ∗). For all positive integers ๐, define ๐๐ to be the result of operating ๐ with itself ๐ times. That is, ๐1 = ๐, ๐2 = ๐ ∗ ๐, ๐3 = ๐ ∗ ๐ ∗ ๐, … and ๐๐ ∗ ๐๐ = ๐๐+๐ . Further, define ๐0 = ๐ and ๐−1 to be the inverse of ๐. With this notation, we observe that (๐ ∗ ๐)−1 = ๐−1 ∗ ๐−1 and ๐−๐ = (๐๐ )−1 = (๐−1 )๐ . We then gather all of these elements into a set, โจ๐โฉ = {๐๐ : ๐ ∈ โค}, and define the following. DEFINITION 7.2.5 A group ๐ is cyclic if there exists ๐ ∈ dom(๐) such that dom(๐) = โจ๐โฉ. The element ๐ is called a generator of ๐. For example, (โค, 0, +) is a cyclic group. Both 1 and −1 are generators. However, โ and โ paired with addition do not form cyclic groups. As for finite groups, each โค๐ is cyclic, generated by [1]๐ , but the Klein-4 group (Example 7.1.18) is not cyclic because ๐2 = ๐ for all ๐ in the group. An element ๐ of a group might not generate the entire group, but since ๐ ∈ โจ๐โฉ and both ๐๐ and ๐−๐ are elements of โจ๐โฉ, the set generated by ๐ forms a group using the operation from ๐. DEFINITION 7.2.6 A substructure โ of a group ๐ that is a group is called a subgroup of ๐. Every group with at least two elements has at least two subgroups, itself (the improper subgroup) and the subgroup with domain {๐} (the trivial subgroup). A group that has at most these two subgroups is called simple. For example, (โค2 , 0, +) and (โค3 , 0, +) form simple groups, but (โค4 , 0, +) does not because it has a subgroup with domain {[0]4 , [2]4 }. Other examples of nonsimple groups are (โ, 0, +) because (โค, 0, +) 364 Chapter 7 MODELS is one of its subgroups and (โจ2โฉ, 0, +) because (โจ6โฉ, 0, +) is one of its subgroups. These subgroups that are not improper are called proper. It is tempting to define a subgroup simply as a substructure of a group, but this would not work if the subgroup is to be a group. For example, viewing ๐ as a subset of โค via (5.8) allows (๐, 0, +) to be a ๐ฆ๐ฑ-substructure of (โค, 0, +), but (๐, 0, +) โจ {G1, G2} yet (๐, 0, +) ฬธโจ G3. This example suggests the following. THEOREM 7.2.7 A substructure โ of a group ๐ is a subgroup of ๐ if and only if โ โจ G3. PROOF Write ๐ = (๐บ, ๐ค) and โ = (๐ป, ๐ฅ) and let โ ⊆ ๐. If โ is a subgroup, then โ โจ G3. To prove the converse, assume โ โจ G3. โ Let ๐ฅ, ๐ฆ, ๐ง ∈ ๐ป. Since โ(โฆ) = ๐(โฆ) (๐ป × ๐ป), ๐ฅ(โฆ)(๐ฅ, ๐ฅ(โฆ)(๐ฆ, ๐ง)) = ๐ค(โฆ)(๐ฅ, ๐ค(โฆ)(๐ฆ, ๐ง)) = ๐ค(โฆ)(๐ค(โฆ)(๐ฅ, ๐ฆ), ๐ง) = ๐ฅ(โฆ)(๐ฅ(โฆ)(๐ฅ, ๐ฆ), ๐ง). The second equality holds because the interpretation of โฆ in ๐ is associative. โ Let ๐ฅ ∈ ๐ป. Because ๐ฅ(๐) = ๐ค(๐), ๐ฅ(โฆ)(๐ฅ(๐), ๐ฅ) = ๐ค(โฆ)(๐ค(๐), ๐ฅ) = ๐ฅ and ๐ฅ(โฆ)(๐ฅ, ๐ฅ(๐)) = ๐ค(โฆ)(๐ฅ, ๐ค(๐)) = ๐ฅ. Therefore, โ is a group and, thus, a subgroup of ๐. The standard way to show that a subset of a group forms a subgroup is not to show directly that the set satisfies the three group axioms or to appeal to Theorem 7.2.7. Instead, what is typically done in algebra is to check that the conditions of the next theorem are satisfied by the set. THEOREM 7.2.8 If ๐ = (๐บ, ๐, ∗) is a group and ๐ป ⊆ ๐บ, there exists a subgroup of ๐ with domain ๐ป if โ ๐ป is closed under ∗, โ ๐ ∈ ๐ป, โ ๐−1 ∈ ๐ป for all ๐ ∈ ๐ป. Section 7.2 SUBSTRUCTURES 365 PROOF Suppose that the three hypotheses of the theorem hold. โ Let ๐, ๐ ∈ ๐ป. By the first hypothesis, ๐ ∗ ๐ ∈ ๐ป. Therefore, ∗ (๐ป × ๐ป) is a binary operation on ๐ป. โ Since ∗ is associative on ๐บ, the restriction of ∗ to ๐ป must be associative. โ The second hypothesis gives ๐ป an identity element. โ Every element of ๐ป has its inverse in ๐ป by the third hypothesis. Therefore, (๐ป, ๐, ∗ [๐ป × ๐ป]) is a group. Since ๐ป ⊆ dom(๐), we conclude that (๐ป, ๐, ∗ [๐ป × ๐ป]) is a subgroup of ๐. EXAMPLE 7.2.9 To illustrate the theorem, take a group ๐ = (๐บ, ๐, ∗) and a family of subgroups (๐ป๐ , ๐, ∗ [๐ป๐ × ๐ป๐ ]) for all ๐ ∈ ๐ผ. Although the union of subgroups might not be a subgroup (Exercise 9), we can show that (โ ๐ป๐ , ๐, ∗ ๐∈๐ผ [โ ๐ป๐ × ๐∈๐ผ โ ๐ป๐ ]) ๐∈๐ผ is a subgroup of ๐. โ By Exercise 3.4.22(b), โ ๐∈๐ผ ๐ป๐ ⊆ ๐บ. โ โ Let ๐, ๐ ∈ ๐∈๐ผ ๐ป๐ . This means that ๐, ๐ ∈ ๐ป๐ for all ๐ ∈ ๐ผ. Since each ๐ป๐ is closed under the operation of ๐, ๐ ∗ ๐ ∈ ๐ป๐ for all ๐ ∈ ๐ผ. Hence, ๐∗๐∈ โ ๐ป๐ . ๐∈๐ผ โ Since ๐ ∈ ๐ป๐ for every ๐ ∈ ๐ผ, we must have ๐ ∈ โ Take ๐ to be an element of โ ๐−1 ∈ ๐∈๐ผ ๐ป๐ . โ ๐∈๐ผ Now we return to cyclic groups. THEOREM 7.2.10 A subgroup of a cyclic group is cyclic. โ ๐∈๐ผ ๐ป๐ . ๐ป๐ . Then, ๐−1 ∈ ๐ป๐ for all ๐ ∈ ๐ผ, so 366 Chapter 7 MODELS PROOF Let ๐ = (๐บ, ๐, ∗) be a cyclic group with generator ๐. Let โ = (๐ป, ๐, ∗) be a subgroup of ๐. If โ is the trivial subgroup, the subgroup is cyclic with generator ๐. So suppose that โ is not the trivial subgroup. Because ๐ is cyclic, there exists a least natural number ๐ > 0 such that ๐๐ ∈ ๐ป (Theorem 5.2.13). Suppose that ๐๐ is not a generator of โ. This means that there exists ๐ ∈ ๐ with ๐ > ๐ such that ๐๐ ∈ ๐ป but ๐๐ ∉ โจ๐๐ โฉ. This combined with the division algorithm (Theorem 4.3.31) yields unique natural numbers ๐ and ๐ such that ๐ = ๐๐ + ๐ and 0 < ๐ < ๐. Therefore, ๐๐ = ๐๐๐+๐ = ๐๐๐ ∗ ๐๐ , and from this, we conclude that ๐๐ = ๐−๐๐ ∗ ๐๐ . Since ๐−๐๐ , ๐๐ ∈ ๐ป, ๐๐ is an element of ๐ป. This contradicts the minimality of ๐ because ๐ < ๐. Thus, ๐๐ is a generator of โ. Subrings Some of the examples of rings had domains that were subsets of other rings. For example, ๐โค is a subset of โค, and โ is a subset of โ. Generalizing leads to the next definition. DEFINITION 7.2.11 A substructure ๐ of a ring ℜ that is a ring is called a subring of ℜ. A subring of ℜ such that its domain is a proper subset of the domain of ℜ is called a proper subring. The ring itself is called the improper subring. The subring with domain {0} is the trivial subring. EXAMPLE 7.2.12 โ ({[0]9 , [3]9 , [6]9 }, [0]9 , +, ⋅) is a subring of (โค9 , [0]9 , +, ⋅). โ (โค, 0, +, ⋅) is a subring of (โ, 0, +, ⋅). โ (M2 (โ), 02 , +, ⋅) is a subring of (M2 (โ), 02 , +, ⋅). As with subgroups, a substructure of a ring is not necessarily a subring, but we do have results similar to those for groups found in Theorems 7.2.8 and 7.2.7. They are stated without proof since they follow quickly from Definition 7.2.11. THEOREM 7.2.13 A substructure ๐ of a ring ℜ is a subring of ℜ if and only if ๐ โจ R4. We follow the convention that if ℜ represents an arbitrary ring, ℜ = (๐ , 0, +, ⋅) and if ℜ′ also represents an arbitrary ring, ℜ′ = (๐ ′ , 0′ , +′ , ⋅′ ). This will help us with our notation. Section 7.2 SUBSTRUCTURES 367 THEOREM 7.2.14 If ℜ is a ring and ๐ ⊆ ๐ , there exists a subring of ℜ with domain ๐ if โ ๐ is closed under + and ⋅, โ 0 ∈ ๐, โ −๐ ∈ ๐ for all ๐ ∈ ๐ . The subring found while proving Theorem 7.2.14 is (๐, 0, + [๐ × ๐], ⋅ [๐ × ๐]). EXAMPLE 7.2.15 We use Theorem 7.2.14 to show that ๐ = (๐, 02 , + [๐ × ๐], ⋅ [๐ × ๐]) is a subring of ๐2 (โ) (Example 7.1.39), where {[ ๐= ] } ๐ 0 : ๐, ๐ ∈ โ . 0 ๐ โ Let ๐, ๐, ๐′ , ๐′ ∈ โ, and assume that ๐ด= Then, [ ] [ ′ ๐ 0 ๐ and ๐ต = 0 ๐ 0 [ ๐ + ๐′ ๐ด+๐ต = 0 and ๐ด๐ต = [ ′ ๐๐ 0 0 ๐ + ๐′ ] 0 . ๐๐′ These are elements of ๐. โ Clearly, the zero matrix is in ๐. (Let ๐ = ๐ = 0.) โ Take ๐, ๐ ∈ โ and write Hence, is an element of ๐. [ ] ๐ 0 ๐ด= . 0 ๐ [ −๐ −๐ด = 0 ] 0 −๐ ] 0 . ๐′ ] 368 Chapter 7 MODELS EXAMPLE 7.2.16 Let ๐ and ๐ be subrings of a ring ℜ. Let ๐ be the domain of ๐ and ๐ be the domain of ๐. Check the conditions of Theorem 7.2.14 to show that there exists a subring of ℜ with domain ๐ ∩ ๐ . โ To prove closure, let ๐ฅ, ๐ฆ ∈ ๐ ∩๐ . This means that ๐ฅ+๐ฆ ∈ ๐ and ๐ฅ+๐ฆ ∈ ๐ . Hence, ๐ฅ + ๐ฆ ∈ ๐ ∩ ๐ . Similarly, ๐ฅ๐ฆ ∈ ๐ ∩ ๐ . โ Since 0 ∈ ๐ and 0 ∈ ๐ , 0 ∈ ๐ ∩ ๐ . โ Suppose ๐ฅ ∈ ๐ ∩ ๐ . Then ๐ฅ ∈ ๐ and ๐ฅ ∈ ๐ . Since these are subrings, −๐ฅ ∈ ๐ and −๐ฅ ∈ ๐ . Thus, −๐ฅ ∈ ๐ ∩ ๐ . Ideals The subring ๐ of the ring ℜ in Example 7.2.15 lacks a property that is often desirable to have in a subring. Observe that [ ] [ ] [ ] 1 0 1 2 1 2 ⋅ = ∉ ๐, 0 1 3 4 3 4 so in general it is false that ๐ด๐ต ∈ ๐ and ๐ต๐ด ∈ ๐ for all ๐ด ∈ ๐ and ๐ต ∈ M2 (โ). DEFINITION 7.2.17 Let ℜ be a ring with domain ๐ and ℑ be a subring of ℜ with domain ๐ผ. โ If ๐๐ ∈ ๐ผ and ๐๐ ∈ ๐ผ for all ๐ ∈ ๐ and ๐ ∈ ๐ผ, then ℑ is an ideal of ℜ. โ If ๐๐ ∈ ๐ผ for all ๐ ∈ ๐ and ๐ ∈ ๐ผ, then ℑ is a left ideal. โ If ๐๐ ∈ ๐ผ for all ๐ ∈ ๐ and ๐ ∈ ๐ผ, then ℑ is a right ideal. A ring ℜ is an ideal of itself, called the improper ideal of ℜ. All other ideals of ℜ are proper, including the ideal formed by {0}. Furthermore, in a commutative ring, there is no difference between a left and right ideal. However, if the ring is not commutative, a left ideal might not be a right ideal. EXAMPLE 7.2.18 Define {[ ] } ๐ 0 ๐ผ= : ๐, ๐ ∈ โ . ๐ 0 Using matrix multiplication, [ ] [ ] [ ] ๐ฅ ๐ฆ ๐ 0 ๐ฅ๐ + ๐ฆ๐ 0 ⋅ = ∈ ๐ผ, ๐ง ๐ค ๐ 0 ๐ง๐ + ๐ค๐ 0 Section 7.2 SUBSTRUCTURES but ] [ [ ] [ 1 1 0 1 1 = ⋅ 1 1 1 1 0 369 ] 1 ∉ ๐ผ. 1 Therefore, the subring (๐ผ, 02 , + [๐ผ × ๐ผ], ⋅ [๐ผ × ๐ผ]) of ๐2 (โ) is a left ideal but not a right ideal. EXAMPLE 7.2.19 Let ๐ผ = {[0]4 , [2]4 }. Then, ℑ = (๐ผ, [0]4 , + [๐ผ × ๐ผ], ⋅ [๐ผ × ๐ผ]) is a subring of the ring ℜ = (โค4 [0]4 , +, ⋅). It also forms an ideal. To see this, check the calculations: [0]4 ⋅ [0]4 [1]4 ⋅ [0]4 [2]4 ⋅ [0]4 [3]4 ⋅ [0]4 [0]4 ⋅ [2]4 [1]4 ⋅ [2]4 [2]4 ⋅ [2]4 [3]4 ⋅ [2]4 = [0]4 = [0]4 = [0]4 = [0]4 = [0]4 = [2]4 = [0]4 = [2]4 [0]4 ⋅ [0]4 [0]4 ⋅ [1]4 [0]4 ⋅ [2]4 [0]4 ⋅ [3]4 [2]4 ⋅ [0]4 [2]4 ⋅ [1]4 [2]4 ⋅ [2]4 [2]4 ⋅ [3]4 = [0]4 , = [0]4 , = [0]4 , = [0]4 , = [0]4 , = [2]4 , = [0]4 , = [2]4 . When we multiply any element of โค4 by an element of ๐ผ on either side, the result is an element of ๐ผ. EXAMPLE 7.2.20 Let ℜ be a ring. It is left to Exercise 18 to show that the ring ๐ = (๐ × {0}, (0, 0), +′ , ⋅′ ) is a subring of ๐ = (๐ × ๐ , (0, 0), +, ⋅) with + and ⋅ defined coordinatewise and +′ and ⋅′ being the restrictions of + and ⋅ to ๐ ×{0}. To show that ๐ is an ideal of ๐, let (๐, ๐ ) ∈ ๐ ×๐ and (๐, 0) ∈ ๐ ×{0}. We then calculate (๐, ๐ ) ⋅ (๐, 0) = (๐๐, 0) ∈ ๐ × {0} and (๐, 0) ⋅ (๐, ๐ ) = (๐๐, 0) ∈ ๐ × {0}. EXAMPLE 7.2.21 Let ℜ be a ring. Let ๐ and ๐ be subsets of ๐ such that ๐ = (๐, 0, + [๐ × ๐], ⋅ [๐ × ๐]) 370 Chapter 7 MODELS and ๐ = (๐ , 0, + [๐ × ๐ ], ⋅ [๐ × ๐ ]) are ideals of ℜ. Define ๐ + ๐ = {๐ + ๐ก : ๐ ∈ ๐ and ๐ก ∈ ๐ }. We prove that ๐ + ๐ is the domain of an ideal of ℜ. โ Take ๐ฅ, ๐ฆ ∈ ๐ + ๐ . This means that ๐ฅ = ๐ + ๐ก and ๐ฆ = ๐ ′ + ๐ก′ for some ๐ , ๐ ′ ∈ ๐ and ๐ก, ๐ก′ ∈ ๐ . Then, ๐ฅ + ๐ฆ = (๐ + ๐ก) + (๐ ′ + ๐ก′ ) = ๐ + (๐ก + ๐ ′ ) + ๐ก′ = ๐ + (๐ ′ + ๐ก) + ๐ก′ = (๐ + ๐ ′ ) + (๐ก + ๐ก′ ). Thus, ๐ฅ + ๐ฆ ∈ ๐ + ๐ since ๐ + ๐ ′ ∈ ๐ and ๐ก + ๐ก′ ∈ ๐ . Also, ๐ฅ๐ฆ = (๐ + ๐ก)(๐ ′ + ๐ก′ ) = (๐ + ๐ก)๐ ′ + (๐ + ๐ก)๐ก′ . Since ๐ + ๐ก ∈ ๐ and ๐ is an ideal, (๐ + ๐ก)๐ ′ ∈ ๐. Likewise, (๐ + ๐ก)๐ก′ is an element of ๐ . Hence, ๐ฅ๐ฆ ∈ ๐ + ๐ . โ We know that 0 ∈ ๐ + ๐ because 0 = 0 + 0 and 0 ∈ ๐ and 0 ∈ ๐ . โ Let ๐ ∈ ๐ and ๐ก ∈ ๐ . Then, since −๐ ∈ ๐ and −๐ก ∈ ๐ก, −(๐ + ๐ก) = −๐ + −๐ก ∈ ๐ + ๐ . โ Let ๐ ∈ ๐ , ๐ ∈ ๐, and ๐ก ∈ ๐ . Since ๐๐ ∈ ๐ and ๐๐ก ∈ ๐ , ๐(๐ + ๐ก) = ๐๐ + ๐๐ก ∈ ๐ + ๐ , and since ๐ ๐ ∈ ๐ and ๐ก๐ ∈ ๐ , (๐ + ๐ก)๐ = ๐ ๐ + ๐ก๐ ∈ ๐ + ๐ . Cyclic subgroups (Definition 7.2.5) are generated by a single element. The corresponding notion in rings is the following definition. DEFINITION 7.2.22 Let ℜ be a ring. For every ๐ ∈ ๐ , define โจ๐โฉ = {๐๐ : ๐ ∈ ๐ }. If ℑ is a left ideal of ℜ such that dom(ℑ) = โจ๐โฉ for some ๐ ∈ ๐ , then ℑ is called a principal ideal left ideal and ๐ is a generator of ℑ. If ℜ is commutative, ℑ is an ideal of ℜ called a principal ideal. The set ๐โค is the domain of an ideal of the ring (โค, 0, +, ⋅). It is a principal ideal because ๐โค = โจ๐โฉ. (See Exercises 16 and 21.) In fact, every element of a ring generates a left ideal of the ring. Section 7.2 SUBSTRUCTURES 371 THEOREM 7.2.23 For every ring ℜ and ๐ ∈ ๐ , the ring ℑ = (โจ๐โฉ, 0, + [โจ๐โฉ × โจ๐โฉ], ⋅ [โจ๐โฉ × โจ๐โฉ]) is a left ideal of ℜ. PROOF First, show that ℑ is a subring. โ Let ๐, ๐ ∈ ๐ . Then, because ๐ + ๐ and (๐๐)๐ are elements of dom(๐ ), ๐๐ + ๐ ๐ = (๐ + ๐ )๐ ∈ โจ๐โฉ and (๐๐)(๐ ๐) = [(๐๐)๐ ] ๐ ∈ โจ๐โฉ. โ Since 0๐ = 0 [Exercise 7.1.22], we have that 0 ∈ โจ๐โฉ. โ If ๐ ∈ ๐ , then (−๐)๐ ∈ โจ๐โฉ and −๐๐ + ๐๐ = (−๐ + ๐)๐ = 0๐ = 0. To prove that ℑ is a left ideal of ℜ, take ๐, ๐ ∈ ๐ . Then, ๐(๐ ๐) ∈ โจ๐โฉ because ๐๐ ∈ ๐ and ๐(๐ ๐) = (๐๐ )๐. Notice that a principal left ideal might not be a two-sided ideal. Example 7.2.18 combined with the next example illustrates this fact. EXAMPLE 7.2.24 Let ℑ = (๐ผ, 02 , + [๐ผ × ๐ผ], ⋅ [๐ผ × ๐ผ]) be a left ideal of ๐2 (โ), where ๐ผ= {[ ] } ๐ 0 : ๐, ๐ ∈ โ . ๐ 0 By Theorem 7.2.23, it is a principal left ideal of ๐2 (โ) because โจ[ ๐ผ= 1 0 ]โฉ 0 . 0 To prove this, it suffices to take ๐, ๐ ∈ โ and observe that [ ] [ ] [ ๐ 0 ๐ 0 1 = ⋅ ๐ 0 ๐ 0 0 implies [ ] โจ[ ๐ 0 1 ∈ ๐ 0 0 ] 0 0 ]โฉ 0 . 0 372 Chapter 7 MODELS EXAMPLE 7.2.25 Every ideal in the ring โจ = (โค, 0, +, ⋅) is principal. To see this, let ℑ be an ideal of โจ. We must find an element of โค that generates ๐ผ = dom(ℑ). We have two cases to consider: โ If ๐ผ = {0}, then ๐ผ = โจ0โฉ. โ Suppose ๐ผ ≠ {0}. This means that ๐ผ ∩ โค+ ≠ ∅. By Theorem 5.2.13 and Exercise 5.3.1, ๐ผ must contain a minimal positive integer. Call it ๐. We claim that ๐ผ = โจ๐โฉ. It is clear that โจ๐โฉ ⊆ ๐ผ, so take ๐ ∈ ๐ผ and divide it by ๐. The division algorithm (Theorem 4.3.31) gives ๐, ๐ ∈ ๐ so that ๐ = ๐๐ + ๐ with 0 ≤ ๐ < ๐. Then, ๐ ∈ ๐ผ because ๐ = ๐ − ๐๐ and ๐, ๐๐ ∈ ๐ผ. If ๐ > 0, then we have a contradiction of the fact that ๐ is the smallest positive integer in ๐ผ. Hence, ๐ = 0 and ๐ = ๐๐. This means that ๐ ∈ โจ๐โฉ. โจ is an example of a principal ideal domain, an integral domain in which every ideal is principal. Exercises 1. Let ๐ be a binary relation symbol. Define the {๐ }-structures ๐ = (โ, <โ ) and ๐ = (โ, <โ ), where <โ refers to standard less-than on โ and <โ refers to standard less-than on โ. Prove that ๐ is a substructure of ๐ . 2. Let ๐ = (๐ด, ๐), ๐ = (๐ต, ๐), and โญ = (๐ถ, ๐ ) be ๐ฒ-structures such that ๐ ⊆ ๐ and โญ ⊆ ๐. Define ๐ = (๐ท, ๐ก) such that ๐ท = ๐ต ∩ ๐ถ, ๐ก(๐) = ๐(๐) for all constant symbols ๐ ∈ ๐ฒ, ๐ก(๐ ) = ๐(๐ ) ∩ ๐ (๐ ) for all relation symbols ๐ ∈ ๐ฒ, and ๐ก(๐ ) = ๐(๐ ) ∩ ๐ (๐ ) for all function symbols ๐ ∈ ๐ฒ. Prove that ๐ ⊆ ๐. 3. Let ๐ be a cardinal. Define ๐๐ผ = (๐ด๐ผ , ๐๐ผ ) to be an ๐ฒ-structure for all ๐ผ ∈ ๐ . The family {๐๐พ : ๐พ ∈ ๐ } is called a chain of ๐ฒ-structures if for all ๐ผ ∈ ๐ฝ ∈ ๐ , we have that โ โ ๐๐ผ ⊆ ๐๐ฝ . Define the ๐ฒ-structure ๐พ∈๐ ๐๐พ = ( ๐พ∈๐ ๐ด๐พ , ๐) so that for every relation symbol ๐ ∈ ๐ฒ, โ ๐(๐ ) = ๐๐ผ (๐ ), ๐พ∈๐ and for every function symbol ๐ ∈ ๐ฒ, ๐(๐ ) = โ ๐๐ผ (๐ ). ๐พ∈๐ Prove the following. (a) {๐๐พ (๐ ) : ๐พ ∈ ๐ } is a chain with respect to ⊆ for all relation symbols ๐ ∈ ๐ฒ. โ (b) ๐พ∈๐ ๐๐พ (๐ ) is a function. โ (c) ๐๐ผ ⊆ ๐พ∈๐ ๐๐พ for all ๐ผ ∈ ๐ . Section 7.2 SUBSTRUCTURES 373 4. Let ๐ฒ ⊆ ๐ณ be sets of subject symbols. Let ๐ and ๐ be ๐ณ-structures. Prove that if ๐ ⊆ ๐ , then the reduct of ๐ to ๐ฒ is a substructure of the reduct of ๐ to ๐ฒ. 5. Find ๐ฒ-substructures of the given ๐ฒ-structures. If possible, find a ๐ฒ-sentence that is true in the given structure but not true in the substructure. (a) (โค ∪ P(โค), ∈), ๐ฒ = ๐ฒ๐ณ (b) (๐, ∅, {∅}, +, ⋅), ๐ฒ = ๐ญ๐ณ (c) (โ, 0, +), ๐ฒ = ๐ฆ๐ฑ (d) (โ, 0, +, ⋅, <), ๐ฒ = ๐ฎ๐ฅ 6. Let ๐ be a group with domain ๐บ. Find all subgroups of ๐ given the following and assuming the standard operations for each set. (a) ๐บ is the Klein-4 group (b) ๐บ = โค5 (c) ๐บ = โค8 (d) ๐บ = โค2 × โค3 (e) ๐บ = โค2 × โค6 7. Show that {๐ ∈ (โ โ, 0, +). โโ : ๐ (0) = 0} is the domain of a subgroup of the group 8. Define ๐ป = {๐ด ∈ M๐ (โ) : ๐1,1 + ๐2,2 + · · · + ๐๐,๐ = 0}. Prove that (๐ป, 0๐ , +) is a subgroup of the group (M๐ (โ), 0๐ , +). 9. Let โ {โ๐ : ๐ ∈ ๐ผ} be a family of subgroups of the group ๐. Give an example to show that ๐∈๐ผ โ๐ is not necessarily the domain of a subgroup of ๐. 10. Let (๐ป, ๐, ∗ [๐ป × ๐ป]) and (๐พ, ๐, ∗ [๐พ × ๐พ]) be two subgroups of an abelian group ๐ = (๐บ, ๐, ∗). Define ๐ป๐พ = {๐ ∗ ๐ : ๐ ∈ ๐ป ∧ ๐ ∈ ๐พ}. Prove that (๐ป๐พ, ๐, ∗ [๐ป๐พ × ๐ป๐พ]) is a subgroup of ๐. 11. For any group ๐ = (๐บ, ๐, ∗), let ๐ ⊆ ๐บ and define ๐ป = {๐ ∈ ๐บ : ๐ ∗ ๐ = ๐ ∗ ๐ for all ๐ ∈ ๐}. Prove that (๐ป, ๐, ∗ [๐ป × ๐ป]) is a subgroup of ๐. 12. Demonstrate that simple groups are cyclic. 13. Prove Theorem 7.2.14. 14. Prove that the subrings of a ring with unity are rings with unity. 15. Let ℜ be a ring with domain ๐ . Find all ideals of ℜ given the following and assuming the standard operations for each set. (a) ๐ = โค2 (b) ๐ = โค6 (c) ๐ = โค7 374 Chapter 7 MODELS (d) ๐ = โค12 16. Let ๐ ∈ โค. Prove that ๐โค is the domain of an ideal of the ring (โค, 0, +, ⋅). 17. Let ℜ = (โค × โค, (0, 0), +, ⋅). (a) Prove that {(2๐, 2๐) : ๐, ๐ ∈ โค} is the domain of an ideal of ℜ. (b) Prove that {(2๐, 2๐) : ๐ ∈ โค} is not the domain of an ideal of ℜ. (c) Prove that โค × 3โค is the domain of a principal ideal of ℜ. (d) Is {(2๐, 2๐) : ๐, ๐ ∈ โค} the domain of a principal ideal of ℜ? 18. Prove the ๐ is a subring of ๐ in Example 7.2.20. 19. Let ℜ be a ring with unity and ℑ an ideal of ℜ. Let ๐ be the domain of ℜ and ๐ผ be the domain of ℑ. Prove that if ๐ข is a unit and ๐ข ∈ ๐ผ, then ๐ผ = ๐ . 20. Prove that a field has no proper, nontrivial ideals. 21. Prove that ๐โค is the domain of a principal ideal of the ring (โค, 0, +, ⋅) for any integer ๐. 22. Show that for any ideal ℑ, if ๐ ∈ dom(ℑ), then โจ๐โฉ ⊆ dom(ℑ). 23. Take a ring ℜ and let ๐ ∈ dom(ℜ). Show โจ๐โฉ is the domain of a left ideal of ℜ but not necessarily a right ideal if ℜ is not commutative. 24. Let ๐ข be a unit of a ring ℜ. Show for all ๐ ∈ dom(ℜ), โจ๐โฉ = โจ๐ข๐โฉ. 25. An ideal ๐ = (๐ , 0, +, ⋅) of a commutative ring ℜ = (๐ , 0, +, ⋅) is prime means for all ๐, ๐ ∈ ๐ , if ๐๐ ∈ ๐ , then ๐ ∈ ๐ or ๐ ∈ ๐ . Let ๐ ∈ โค+ be a prime number (Example 2.4.18). Prove that (๐โค, 0, + ๐โค, ⋅ ๐โค) is a prime ideal of (โค, 0, +, ⋅). 26. Prove that the trivial subring of an integral domain is a prime ideal. 27. Let ℜ be a commutative ring and ๐ a proper ideal of ℜ. If no proper ideal of ℜ has ๐ as a proper ideal, ๐ is a maximal ideal of ℜ. Assume that ๐ is a prime number. Prove that ๐โค is the domain of a maximal ideal of (โค, 0, +, ⋅). 28. Prove that the trivial ideal is the maximal ideal of a field. 29. Let ๐ ∈ โค be prime. Prove that {(๐๐, ๐) : ๐, ๐ ∈ โค} is the domain of a maximal ideal of the ring with domain โค × โค (Example 7.2.20). 30. Use Zorn’s lemma (Theorem 5.1.13) to prove that every commutative ring with unity has a maximal ideal. 7.3 HOMOMORPHISMS The function ๐ : [0, ∞) → (−∞, 0] defined by ๐ (๐ฅ) = −๐ฅ preserves ≤ with ≥ (Example 4.5.25). Define the {4}-structures ๐ = ([0, ∞) , ๐) and ๐ = ((−∞, 0] , ๐), where ๐(4) = ≤ and ๐(4) = ≥. When ๐ฅ0 , ๐ฅ1 ∈ [0, ∞), observe that (๐ฅ0 , ๐ฅ1 ) ∈ ๐(4) if and only if (๐ (๐ฅ0 ), ๐ (๐ฅ1 )) ∈ ๐(4). Section 7.3 HOMOMORPHISMS 375 because ๐ฅ0 ≤ ๐ฅ1 if and only if −๐ฅ0 ≥ −๐ฅ1 . Therefore, we know that 4 behaves in ๐ as it behaves in ๐ because of ๐ . We generalize this notion to all ๐ฒ-structures in the next definition. DEFINITION 7.3.1 Let ๐ = (๐ด, ๐) and ๐ = (๐ต, ๐) be ๐ฒ-structures. A function ๐ : ๐ด → ๐ต is a homomorphism ๐ → ๐ if it preserves the structure of ๐ in ๐ . This means that ๐ satisfies the following conditions. โ ๐(๐(๐)) = ๐(๐) for all constant symbols ๐ ∈ ๐ฒ. โ If ๐ is an ๐-ary relation symbol in ๐ฒ, then for all ๐0 , ๐1 , … , ๐๐−1 ∈ ๐ด, (๐0 , ๐1 , … , ๐๐−1 ) ∈ ๐(๐ ) if and only if (๐(๐0 ), ๐(๐1 ), … , ๐(๐๐−1 )) ∈ ๐(๐ ). โ If ๐ is an ๐-ary function symbol in ๐ฒ, for all ๐0 , ๐1 , … , ๐๐−1 ∈ ๐ด, ๐(๐(๐ )(๐0 , ๐1 , … , ๐๐−1 )) = ๐(๐ )(๐(๐0 ), ๐(๐1 ), … , ๐(๐๐−1 )). When ๐ : ๐ด → ๐ต is a homomorphism, write ๐ : ๐ → ๐ . There are many important examples of homomorphisms that can be found in algebra. For instance, let ℜ and ℜ′ be rings and ๐ : ℜ → ℜ′ be a homomorphism. By Definition 7.3.1, this means that ๐(0) = 0′ . Because ⊕ and ⊗ are the only function symbols in ๐ฑ๐จ, for all ๐, ๐ ∈ ๐ , ๐(๐ + ๐) = ๐(๐) +′ ๐(๐) and ๐(๐ ⋅ ๐) = ๐(๐) ⋅′ ๐(๐). This together with a similar analysis of homomorphisms between groups motivates the next definition. DEFINITION 7.3.2 โ A group homomorphism is a homomorphism ๐ → ๐′ , where ๐ and ๐′ are groups. โ A ring homomorphism is an homomorphism ℜ → ℜ′ , where ℜ and ℜ′ are rings. Throughout this section the focus will be on ring homomorphisms. 376 Chapter 7 MODELS EXAMPLE 7.3.3 Let ℜ and ℜ′ be rings. The function ๐ : ๐ → ๐ ′ so that ๐(๐) = 0′ for all ๐ ∈ ๐ is a ring homomorphism. This function is called a zero map. EXAMPLE 7.3.4 Define the function ๐ : โค → โค × โค by ๐(๐) = (๐, 0). Assume that + and ⋅ are the standard operations on โค while +′ and ⋅′ are the coordinatewise operations on โค × โค. Let ๐, ๐ ∈ โค. Then, โ ๐(0) = (0, 0) โ ๐(๐ + ๐) = (๐ + ๐, 0) = (๐, 0) +′ (๐, 0) = ๐(๐) +′ ๐(๐) โ ๐(๐๐) = (๐๐, 0) = (๐, 0) ⋅′ (๐, 0) = ๐(๐) ⋅′ ๐(๐). Therefore, ๐ is a ring homomorphism (โค, 0, +, ⋅) → (โค × โค, (0, 0), +′ , ⋅′ ). The homomorphism of Example 7.3.4 provides a good opportunity to clarify what a ring homomorphism does. Take the integers 1 and 4. Adding them together in โค yields 5. The images of these integers under ๐ are ๐(1) = (1, 0), ๐(4) = (4, 0), and ๐(5) = (5, 0). Observe that (1, 0) + (4, 0) = (5, 0) in โค × โค, illustrating that the ring homomorphism ๐ preserves the addition structure of โค in โค × โค. That is, with respect to addition, both sets behave the same way. Multiplication also has the property. For example, when 3 is multiplied with 6 the result is 18 in โค, and their images yield ๐(3)๐(6) = ๐(18) in โค × โค. This is illustrated in Figure 7.2. Although ring homomorphisms will always preserve the additive identity by definition, this is not a condition that needs to be checked. ๐ โค 1 (1, 0) 4 (4, 0) 5 โค 3 (5, 0) ๐ 6 18 Figure 7.2 โค×โค (3, 0) โค×โค (6, 0) (18, 0) The ring homomorphism ๐ : โค → โค × โค defined by ๐(๐) = (๐, 0). Section 7.3 HOMOMORPHISMS 377 THEOREM 7.3.5 Let ℜ and ℜ′ be rings. Then, ๐ : ๐ → ๐ ′ is a ring homomorphism if and only if for all ๐ฅ, ๐ฆ ∈ ๐ , โ ๐(๐ฅ + ๐ฆ) = ๐(๐ฅ) +′ ๐(๐ฆ), and โ ๐(๐ฅ ⋅ ๐ฆ) = ๐(๐ฅ) ⋅′ ๐(๐ฆ). PROOF Since sufficiency is clear, assume that ๐ preserves addition and multiplication. Then, ๐(0) +′ ๐(0) = ๐(0 + 0) = ๐(0). Adding −๐(0) to both sides yields ๐(0) = 0′ . In addition to preserving operations and the additive identity, ring homomorphisms also preserve inverses. Remember, if ๐ : ๐ → ๐ ′ is a function and ๐ ∈ ๐ , then −๐ is the additive inverse of ๐ in ๐ and −๐(๐) is the additive inverse of ๐(๐) inside of ๐ ′ . THEOREM 7.3.6 Let ℜ and ℜ′ be rings. If ๐ : ℜ → ℜ′ is a ring homomorphism, for all ๐ ∈ ๐ , ๐(−๐) = −๐(๐). PROOF If ๐ : ℜ → ℜ′ is a ring homomorphism, then for all ๐ฅ ∈ ๐ , ๐(๐ฅ) +′ ๐(−๐ฅ) = ๐(๐ฅ + −๐ฅ) = ๐(0) = 0′ . EXAMPLE 7.3.7 Check Theorem 7.3.6 using the function ๐ as defined in the Example 7.3.4. Since 5 and −5 are additive inverses in โค, we have that (5, 0) + (−5, 0) = (0, 0), so ๐(5) and ๐(−5) are additive inverses in โค × โค. Ring homomorphisms also preserve subrings and ideals. THEOREM 7.3.8 Let ℜ and ℜ′ be rings and ๐ : ℜ → ℜ′ be a ring homomorphism. Let ℑ be an ideal of ℜ with domain ๐ผ and ℑ′ be an ideal of ℜ′ with domain ๐ผ ′ . โ If ๐ is onto, (๐[๐ผ], 0′ , +′ ๐[๐ผ], ⋅′ ๐[๐ผ]) is an ideal of ℜ′ . โ ๐ = (๐−1 [๐ผ ′ ], 0, + ๐−1 [๐ผ ′ ], ⋅ ๐−1 [๐ผ ′ ]) is an ideal of ℜ. 378 Chapter 7 MODELS PROOF The first part is left to Exercise 6. To prove the second, first prove that ๐ is a subring of ℜ. โ To show closure, let ๐ฅ1 , ๐ฅ2 ∈ ๐−1 [๐ผ ′ ]. This means that ๐(๐ฅ1 ) and ๐(๐ฅ2 ) are elements of ๐ผ ′ . Hence, ๐(๐ฅ1 + ๐ฅ2 ) = ๐(๐ฅ1 ) +′ ๐(๐ฅ2 ) ∈ ๐ผ ′ and ๐(๐ฅ1 ⋅ ๐ฅ2 ) = ๐(๐ฅ1 ) ⋅′ ๐(๐ฅ2 ) ∈ ๐ผ ′ . Therefore, ๐ฅ1 + ๐ฅ2 , ๐ฅ1 ⋅ ๐ฅ2 ∈ ๐−1 [๐ผ ′ ]. โ Since 0′ ∈ ๐ผ ′ and ๐(0) = 0′ , it follows that 0 ∈ ๐−1 [๐ผ ′ ]. โ Let ๐ฅ ∈ ๐−1 [๐ผ ′ ]. Then, ๐(๐ฅ) ∈ ๐ผ ′ , and we find that ๐(−๐ฅ) = −๐(๐ฅ) ∈ ๐ผ ′ . Thus, −๐ฅ ∈ ๐−1 [๐ผ ′ ]. To see that ๐ is an ideal of ℜ, take ๐ ∈ ๐ and ๐ ∈ ๐−1 [๐ผ ′ ]. This means that ๐(๐) ∈ ๐ผ ′ . Then, ๐(๐๐) = ๐(๐)๐(๐) ∈ ๐ผ ′ since ℑ′ is an ideal. Thus, ๐๐ ∈ ๐−1 [๐ผ ′ ]. Similarly, ๐๐ ∈ ๐−1 [๐ผ ′ ]. EXAMPLE 7.3.9 The function ๐ : โค → โคโ6โค defined by ๐(๐) = ๐ + 6โค is an onto homomorphism. (See Exercise 7.) The image of 2โค under this map is ๐[2โค] = {๐ + 6โค : ๐ ∈ 2โค} = {0 + 6โค, 2 + 6โค, 4 + 6โค}. The pre-image of ๐ผ = {0 + 6โค, 3 + 6โค} is ๐−1 [๐ผ] = {๐ ∈ โค : ∃๐ ∈ โค(๐ = 6๐ ∨ ๐ = 3 + 6๐)} = 3โค. Notice that both ๐[2โค] and ๐−1 [๐ผ] are domains of ideals of their respective rings. Take a ring ℜ and let ๐ be its domain such that |๐ | ≥ 2. Let 0 be the additive identity of ℜ. We specify two functions. First, define ๐ : ๐ × ๐ × ๐ → ๐ × ๐ by ๐(๐ฅ, ๐ฆ, ๐ง) = (๐ฅ, ๐ฆ). This is a function similar to that found in Example 4.5.16. Notice that ran(๐) = ๐ × ๐ , which means that ๐ is onto, and ๐ด = {(๐ฅ, ๐ฆ, ๐ง) : ๐(๐ฅ, ๐ฆ, ๐ง) = (0, 0)} = {(0, 0, ๐ง) : ๐ง ∈ ๐ }. Because |๐ด| > 1, we conclude that ๐ is not one-to-one. Second, define the function ๐ : ๐ × ๐ → ๐ × ๐ × ๐ so that ๐(๐ฅ, ๐ฆ) = (๐ฅ, ๐ฆ, 0). Compare ๐ with Example 4.5.7. Section 7.3 HOMOMORPHISMS 379 This function is not onto because ran(๐) = ๐ × ๐ × {0}, but it does appear to be one-to-one because ๐ต = {(๐ฅ, ๐ฆ) : ๐(๐ฅ, ๐ฆ) = (0, 0, 0)} = {(0, 0)}. The sets ๐ด and ๐ต that contain the elements of the domain that are mapped to the identity of the range appear important to determining whether a function is one-to-one. For this reason, such sets are named. DEFINITION 7.3.10 Let ℜ and ℜ′ be rings. Let ๐ : ๐ → ๐ ′ be a function. The kernel of ๐ is ker(๐) = {๐ฅ ∈ ๐ : ๐(๐ฅ) = 0′ }. Notice that Definition 7.3.10 implies that ker(๐) = ๐−1 [{0′ }]. Therefore, ker(๐) is an ideal of ℜ (Theorem 7.3.8). Similarly, because ran(๐) = ๐[๐ ], we conclude that ran(๐) is an ideal of ℜ′ . EXAMPLE 7.3.11 Let ๐ : โค → โค5 defined by ๐(๐) = [๐]5 . โ To find the kernel, assume ๐(๐) = [0]5 . By the assumption, [๐]5 = [0]5 . So, ๐ ∈ [0]5 , which means 5 โฃ ๐. Hence, ker(๐) ⊆ 5โค. Since the steps are reversible, ker(๐) = 5โค. โ Because ran(๐) = {[๐]5 โถ ๐ ∈ โค} encompasses all congruence classes modulo 5, ๐ is onto and ran(๐) = โค5 . We now prove that the kernel does provide a test for whether a ring homomorphism is an injection. Since any ring homomorphism ๐ : ℜ → ℜ′ maps the additive identity of ℜ to the additive identity of ℜ′ , it is always the case that 0 ∈ ker(๐). Therefore, to show that ker(๐) = {0}, we only need to prove ker(๐) ⊆ {0}. THEOREM 7.3.12 Let ℜ and ℜ′ be rings such that 0 is the additive identity of ℜ. Suppose that the function ๐ : ℜ → ℜ′ is a ring homomorphism. Then, ๐ is one-to-one if and only if ker(๐) = {0}. PROOF โ Suppose that ๐ is one-to-one. Take ๐ฅ ∈ ker(๐). This means that ๐(๐ฅ) = 0′ , where 0′ is the additive identity of ℜ′ . Since ๐ is an injection, ๐ฅ = 0. This implies that ker(๐) = {0}. 380 Chapter 7 MODELS โ Now let ker(๐) = {0} and assume ๐(๐ฅ1 ) = ๐(๐ฅ2 ), where ๐ฅ1 , ๐ฅ2 ∈ dom(ℜ). We then have ๐(๐ฅ1 ) − ๐(๐ฅ2 ) = 0. Since ๐ is a homomorphism, we have ๐(๐ฅ1 − ๐ฅ2 ) = ๐(๐ฅ1 ) − ๐(๐ฅ2 ) by Theorem 7.3.6. Hence, ๐ฅ1 − ๐ฅ2 ∈ ker(๐), which means ๐ฅ1 − ๐ฅ2 = 0 by hypothesis. Therefore, ๐ฅ1 = ๐ฅ2 . A similar proof can be used to demonstrate that same result for group homomorphisms. THEOREM 7.3.13 Let ๐ and ๐′ be groups such that ๐ is the additive identity of ๐. Suppose that the function ๐ : ๐ → ๐′ is a group homomorphism. Then, ๐ is one-to-one if and only if ker(๐) = {๐}. Isomorphisms Define ๐ : โค5 × โค6 → โค6 × โค5 by ๐([๐]5 , [๐]6 ) = ([๐]6 , [๐]5 ). Let + be coordinatewise addition on โค5 × โค6 and +′ be coordinatewise addition on โค6 × โค5 . We prove that this function preserves addition and is a bijection. โ Let ๐, ๐, ๐, ๐ ∈ โค. Then, ๐(([๐]5 , [๐]6 ) + ([๐]5 , [๐]6 )) = ๐([๐]5 + [๐]5 , [๐]6 + [๐]6 ) = ๐([๐ + ๐]5 , [๐ + ๐]6 ) = ([๐ + ๐]6 , [๐ + ๐]5 ) = ([๐]6 + [๐]6 , [๐]5 + [๐]5 ) = ([๐]6 , [๐]5 ) +′ ([๐]6 , [๐]5 ) = ๐([๐]5 , [๐]6 ) +′ ๐([๐]5 , [๐]6 ). โ Let ([๐]5 , [๐]6 ) ∈ ker(๐). In other words, ([๐]6 , [๐]5 ) = ([0]6 , [0]5 ). Hence, 5 โฃ ๐ and 6 โฃ ๐, which implies that [๐]5 = [0]5 and [๐]6 = [0]6 (Example 4.2.6). Thus, ๐ is an injection by Theorem 7.3.13. โ To see that ๐ is onto, take ([๐]6 , [๐]5 ) ∈ โค6 × โค5 . Then, ๐([๐]5 , [๐]6 ) = ([๐]6 , [๐]5 ). The function ๐ generalizes to the next definition. Note the similarity between this definition and that of an order isomorphism (Definition 4.5.24). DEFINITION 7.3.14 Let ๐ and ๐ be ๐ฒ-structures. An isomorphism ๐ → ๐ is a homomorphism ๐ → ๐ that is a bijection. An isomorphism ๐ → ๐ is called an automorphism. If there is an isomorphism ๐ : ๐ → ๐ , the ๐ฒ-structures are isomorphic and we write ๐ ≅ ๐ . Section 7.3 HOMOMORPHISMS 381 If ๐ = (โค5 × โค6 , ([0]5 , [0]6 ), +) and ๐ = (โค6 × โค5 , ([0]6 , [0]5 ), +′ ), then ๐ shows that ๐ ≅ ๐ . Like order isomorphisms (Example 4.5.28), there are three basic isomorphism results. The proof of this is left to Exercise 8. THEOREM 7.3.15 Let ๐, ๐ , and โญ be ๐ฒ-structures. โ ๐ ≅ ๐. โ If ๐ ≅ ๐ , then ๐ ≅ ๐. โ If ๐ ≅ ๐ and ๐ ≅ โญ, then ๐ ≅ โญ. EXAMPLE 7.3.16 Let ๐ฒ = {0, <, ⋅}. Let ๐ = (2 ๐, ๐), where ๐(0)(0) = 0 and ๐(0)(1) = 0, and for all ๐ , ๐ ∈ 2 ๐, (๐ , ๐) ∈ ๐(<) if and only if ๐ (0) < ๐(0) ∨ [๐ (0) = ๐(0) ∧ ๐ (1) < ๐(1)], that is, < is interpreted as a lexicographical order (compare Exercise 4.3.16), and ๐(⋅) is function multiplication. This means that ๐(⋅)(๐ , ๐)(๐ฅ) = ๐ (๐ฅ)๐(๐ฅ). Also, let ๐ = (๐ × ๐ × {0}, ๐), where ๐(0) = (0, 0, 0), and for all ๐, ๐, ๐′ , ๐′ ∈ ๐, ((๐, ๐, 0), (๐′ , ๐′ , 0)) ∈ ๐(<) if and only if ๐ < ๐′ ∨ [๐ = ๐′ ∧ ๐ < ๐′ ], and ๐(⋅) is coordinate-wise multiplication. That is, ๐(⋅)((๐, ๐, 0), (๐′ , ๐′ , 0)) = (๐๐′ , ๐๐′ , 0). Show that ๐ ≅ ๐ by showing that ๐ : ๐ → ๐ defined by ๐(๐) = (๐(1), ๐(0), 0) is an ๐ฒ-isomorphism. โ ๐(๐(0)) = (๐(0)(1), ๐(0)(0), 0) = (0, 0, 0) = ๐(0). 382 Chapter 7 MODELS โ Let ๐ , ๐ ∈ 2 ๐. Then, (๐ , ๐) ∈ ๐(<) ⇔ ๐ (0) < ๐(0) ∨ [๐ (0) = ๐(0) ∧ ๐ (1) < ๐(1)] ⇔ ((๐ (1), ๐ (0), 0), (๐(1), ๐(0), 0)) ∈ ๐(<) ⇔ (๐(๐ ), ๐(๐)) ∈ ๐(<). โ Let ๐ , ๐ ∈ 2 ๐. Then, ๐(๐(⋅)(๐ , ๐)) = ๐({(0, ๐ (0)๐(0)), (1, ๐ (1)๐(1))}) = (๐ (1)๐(1), ๐ (0)๐(0), 0) = ๐(⋅)((๐ (1), ๐ (0), 0), (๐(1), ๐(0), 0)) = ๐(⋅)(๐(๐ ), ๐(๐)). Therefore, ๐ is an homomorphism. That ๐ is a bijection is Exercise 9. Hence, ๐ ≅ ๐ , and by Theorem 7.3.15, we have that ๐ ≅ ๐. Because there is a bijection between โค5 × โค6 and โค6 × โค5 , they have the same cardinality. Since the bijection that is typically chosen is also a homomorphism, the algebraic structure of the two rings are the same. Putting this together, we conclude that the two rings “look” the same as rings. The only difference is in the labeling of their elements. To formalize this notion, we make the next definition. DEFINITION 7.3.17 โ A group homomorphism that is a bijection is a group isomorphism. โ A ring homomorphism that is a bijection is a ring isomorphism. We say that two rings ๐ and ๐ ′ are isomorphic if there is an isomorphism ๐ : ๐ → ๐ ′ . If two rings are isomorphic, write ๐ ≅ ๐ ′ . For example, we saw that โค5 × โค6 ≅ โค6 × โค5 . The next example illustrates what we mean by “looking” the same. EXAMPLE 7.3.18 Suppose that ๐ : ℜ → ℜ′ is a ring isomorphism. โ If ℜ is an integral domain, ℜ′ is an integral domain. โ If ℜ is a division ring, ℜ′ is a division ring. โ If ℜ is a field, ℜ′ is a field. The proofs of the first two are left to Exercise 13. For example, suppose that ℜ is a field. We show that ℜ′ is also a field. โ Let 1 be unity from ℜ. Then, ๐(1) is unity from ℜ′ (Exercise 14). Section 7.3 HOMOMORPHISMS 383 โ Let ๐ฆ0 , ๐ฆ1 ∈ ๐ ′ . Since ๐ is onto, there exists ๐ฅ0 , ๐ฅ1 ∈ ๐ so that ๐(๐ฅ0 ) = ๐ฆ0 and ๐(๐ฅ1 ) = ๐ฆ1 . Hence, ๐ฆ0 ⋅′ ๐ฆ1 = ๐(๐ฅ0 ) ⋅′ ๐(๐ฅ1 ) = ๐(๐ฅ0 ⋅ ๐ฅ1 ) = ๐(๐ฅ1 ⋅ ๐ฅ0 ) = ๐(๐ฅ1 ) ⋅′ ๐(๐ฅ0 ) = ๐ฆ1 ⋅′ ๐ฆ0 . โ Let ๐ฆ0 ∈ ๐ ′ โงต {0′ }. There exists ๐ฅ0 ∈ ๐ such that ๐(๐ฅ0 ) = ๐ฆ0 . Since ๐ is one-to-one, ๐ฅ0 ≠ 0. Thus, because ℜ is a field, there exists ๐ฅ1 ∈ ๐ โงต {0} such that ๐ฅ0 ⋅ ๐ฅ1 = 1, so ๐ฆ0 ⋅′ ๐(๐ฅ1 ) = ๐(๐ฅ0 ) ⋅′ ๐(๐ฅ1 ) = ๐(๐ฅ0 ⋅ ๐ฅ1 ) = ๐(1). The result of the next example is known as the fundamental homomorphism theorem. EXAMPLE 7.3.19 Let ℜ and ℜ′ be rings. Let ๐ : ℜ → ℜ′ be a surjective ring homomorphism. Define ๐ : ๐ โ ker(๐) → ๐ ′ by ๐(๐ + ker(๐)) = ๐(๐). To prove that ๐ is well-defined, let ๐, ๐ ∈ ๐ such that ๐ + ker(๐) = ๐ + ker(๐). This implies that ๐ − ๐ ∈ ker(๐), so 0 = ๐(๐ − ๐) = ๐(๐) −′ ๐(๐). That is, ๐(๐) = ๐(๐). Now to show that ๐ is a ring homomorphism. โ Take ๐, ๐ ∈ ๐ . Then, ๐(๐ + ๐ + ker(๐)) = ๐(๐ + ๐) = ๐(๐) +′ ๐(๐) = ๐(๐ + ker(๐)) +′ ๐(๐ + ker(๐)), and ๐(๐ ⋅ ๐ + ker(๐)) = ๐(๐ ⋅ ๐) = ๐(๐) ⋅′ ๐(๐) = ๐(๐ + ker(๐)) ⋅′ ๐(๐ + ker(๐)). โ To prove that ๐ is onto, take ๐ ∈ ๐ ′ . Since ๐ is onto, there exists ๐ ∈ ๐ such that ๐(๐) = ๐. Therefore, ๐(๐ + ker(๐)) = ๐(๐) = ๐. 384 Chapter 7 MODELS โ To prove that ๐ is one-to-one, let ๐, ๐ ∈ ๐ and assume that ๐(๐) = ๐(๐). Since ๐ is a ring homomorphism, ๐(๐ − ๐) = ๐(๐) −′ ๐(๐) = 0. Hence, ๐ − ๐ ∈ ker(๐), which implies that ๐ + ker(๐) = ๐ + ker(๐). Elementary Equivalence Let ๐ be a first-order language with theory symbols ๐ฒ. Suppose that ๐ = (๐ด, ๐) and ๐ = (๐ต, ๐) are ๐ฒ-structures and ๐ผ is an interpretation of ๐. Let ๐ : ๐ → ๐ be an isomorphism. Define ๐ผ๐ : ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ ) → ๐ต by ๐ผ๐ (๐ก) = (๐ โฆ ๐ผ)(๐ก) for all ๐ก ∈ ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ ). We confirm that ๐ผ๐ is an interpretation of ๐ . โ If ๐ฅ is a variable symbol, ๐ผ๐ (๐ฅ) = (๐ โฆ ๐ผ)(๐ฅ) ∈ ๐ต. โ Let ๐ be a constant symbol. Then, ๐ผ๐ (๐) = (๐ โฆ ๐ผ)(๐) = ๐(๐(๐)) = ๐(๐). โ Let ๐ be an ๐-ary function symbol and ๐ก0 , ๐ก1 , … , ๐ก๐−1 ∈ ๐ณ๐ค๐ฑ๐ฌ๐ฒ(๐ ). Then, ๐ผ๐ (๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )) = (๐ โฆ ๐ผ)(๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )) = ๐(๐(๐ )(๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 ))) = ๐(๐ )(๐(๐ผ(๐ก0 )), ๐(๐ผ(๐ก1 )), … , ๐(๐ผ(๐ก๐−1 ))) = ๐(๐ )(๐ผ๐ (๐ก0 ), ๐ผ๐ (๐ก1 ), … , ๐ผ๐ (๐ก๐−1 )). An example will clarify the use of the interpretation ๐ผ๐ . EXAMPLE 7.3.20 Let ๐0 = (โค, 0, +) and ๐1 = (2โค, 0, + 2โค). Assume that ๐ผ is an interpretation of ๐0 . Define the group isomorphism ๐ : โค → 2โค by ๐ (๐) = 2๐. We consider the ๐ฆ๐ฑ-formula ๐ฆ โฆ ๐ฅ = ๐. ๐ โจ ∃๐ฅ(๐ฆ โฆ ๐ฅ = ๐) [๐ผ] ⇔ ๐ โจ (๐ฆ โฆ ๐ฅ = ๐) [๐ผ๐ฅ๐ ] for some ๐ ∈ โค ⇔ ๐ผ๐ฅ๐ (๐ฆ โฆ ๐ฅ) = ๐ผ๐ฅ๐ (๐) for some ๐ ∈ โค ⇔ ๐ (๐ผ๐ฅ๐ (๐ฆ โฆ ๐ฅ)) = ๐ (๐ผ๐ฅ๐ (๐)) for some ๐ ∈ โค ⇔ ๐ (๐ผ๐ฅ๐ (๐ฆ)) + ๐ (๐ผ๐ฅ๐ (๐ฅ)) = ๐ (๐ผ๐ฅ๐ (๐)) for some ๐ ∈ โค ⇔ ๐ (๐ผ๐ฅ๐ (๐ฆ)) + ๐ (๐) = ๐ (๐ผ๐ฅ๐ (๐)) for some ๐ ∈ โค ⇔ ๐ (๐ผ๐ฅ๐ (๐ฆ)) + ๐ = ๐ (๐ผ๐ฅ๐ (๐)) for some ๐ ∈ 2โค ⇔ ๐ (๐ผ๐ฅ๐ (๐ฆ)) + ๐ (๐ผ๐ฅ๐ (๐ฅ)) = ๐ (๐ผ๐ฅ๐ (๐)) for some ๐ ∈ 2โค ⇔ ๐ (๐ผ๐ฅ๐ (๐ฆ โฆ ๐ฅ)) = ๐ (๐ผ๐ฅ๐ (๐)) for some ๐ ∈ 2โค ⇔ ๐ โจ ๐ฆ โฆ ๐ฅ = ๐ [(๐ผ๐ )๐๐ฅ ] for some ๐ ∈ 2โค ⇔ ๐ โจ ∃๐ฅ(๐ฆ โฆ ๐ฅ = ๐) [๐ผ๐ ]. Section 7.3 HOMOMORPHISMS 385 Using ๐ , we see that the interpretation of ๐ gives rise to an interpretation of ๐ . For example, notice that if ๐ผ(๐ฆ) = 3, then ๐ (๐ผ(๐ฆ)) = 6. Hence, in ๐, ∃๐ฅ(๐ฆ โฆ ๐ฅ = ๐) is interpreted as 3 + ๐ฅ = 0 for some ๐ฅ ∈ ๐ด, and the witness of ∃๐ฅ(๐ฆ โฆ ๐ฅ = ๐) is −3. In ๐ , ∃๐ฅ(๐ฆ โฆ ๐ฅ = ๐) is interpreted as 6 + ๐ฅ = 0 for some ๐ฅ ∈ ๐ต and the witness of ∃๐ฅ(๐ฆ โฆ ๐ฅ = ๐) is −6. LEMMA 7.3.21 Let ๐ = (๐ด, ๐) and ๐ = (๐ต, ๐) be ๐ฒ-structures. Assume that ๐ : ๐ → ๐ is an for all isomorphism. Let ๐ผ be an ๐ฒ-interpretation of ๐. Then, (๐ผ๐ฅ๐ )๐ = (๐ผ๐ )๐(๐) ๐ฅ ๐ ∈ ๐ด. PROOF We proceed by induction on terms, relying on Definitions 7.1.4 and 7.3.1. โ Let ๐ฅ and ๐ฆ be variable symbols. Then, (๐ผ๐ฅ๐ )๐ (๐ฅ) = ๐(๐ผ๐ฅ๐ (๐ฅ)) = ๐(๐) = (๐ผ๐ )๐(๐) ๐ฅ (๐ฅ), and if ๐ฆ ≠ ๐ฅ, (๐ผ๐ฅ๐ )๐ (๐ฆ) = ๐(๐ผ๐ฅ๐ (๐ฆ)) = ๐(๐ผ(๐ฆ)) = (๐ผ๐ )๐(๐) ๐ฅ (๐ฆ). โ If ๐ is a constant symbol, then (๐ผ๐ฅ๐ )๐ (๐) = ๐(๐ผ๐ฅ๐ (๐)) = ๐(๐ผ(๐)) = (๐ผ๐ )๐(๐) ๐ฅ (๐). โ Let ๐ be an ๐-ary function symbol and ๐ก0 , ๐ก1 , … , ๐ก๐−1 be ๐ฒ-terms. Take ๐ ∈ ๐ด. Then, (๐ผ๐ฅ๐ )๐ (๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )) = ๐(๐ผ๐ฅ๐ (๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ))) = ๐(๐(๐ )(๐ผ๐ฅ๐ (๐ก0 ), ๐ผ๐ฅ๐ (๐ก1 ), … , ๐ผ๐ฅ๐ (๐ก๐−1 ))) = ๐(๐ )(๐(๐ผ๐ฅ๐ (๐ก0 )), ๐(๐ผ๐ฅ๐ (๐ก1 )), … , ๐(๐ผ๐ฅ๐ (๐ก๐−1 ))) = ๐(๐ )((๐ผ๐ฅ๐ )๐ (๐ก0 ), (๐ผ๐ฅ๐ )๐ (๐ก1 ), … , (๐ผ๐ฅ๐ )๐ (๐ก๐−1 )) ๐(๐) ๐(๐) = ๐(๐ )(((๐ผ๐ )๐(๐) ๐ฅ )(๐ก0 ), ((๐ผ๐ )๐ฅ )(๐ก1 ), … , ((๐ผ๐ )๐ฅ )(๐ก๐−1 )) = (๐ผ๐ )๐(๐) ๐ฅ (๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )). The fifth equality follows by induction. If ๐ : ๐ → ๐ is an isomorphism, the structures look the same. Thus, formulas should be interpreted in ๐ essentially in the same way that they are interpreted in ๐. In other words, given an interpretation ๐ผ of ๐, there should be an interpretation of ๐ that is like ๐ผ. This interpretation is ๐ผ๐ . 386 Chapter 7 MODELS LEMMA 7.3.22 Let ๐ and ๐ be ๐ฒ-structures with isomorphism ๐ : ๐ → ๐ . Assume that ๐ผ is an ๐ฒ-interpretation of ๐. Then, for every ๐ฒ-formula ๐, ๐ โจ ๐ [๐ผ] if and only if ๐ โจ ๐ [๐ผ๐ ]. PROOF We proceed by induction on formulas, relying on Theorem 7.1.7. โ Suppose that ๐ก0 and ๐ก1 are ๐ฒ-terms. Since ๐ is one-to-one, ๐ โจ ๐ก0 = ๐ก1 [๐ผ] ⇔ ๐ผ(๐ก0 ) = ๐ผ(๐ก1 ) ⇔ ๐(๐ผ(๐ก0 )) = ๐(๐ผ(๐ก1 )) ⇔ ๐ผ๐ (๐ก0 ) = ๐ผ๐ (๐ก1 ) ⇔ ๐ โจ ๐ก0 = ๐ก1 [๐ผ๐ ]. โ Let ๐ be an ๐-ary relation symbol and ๐ก0 , ๐ก1 , … , ๐ก๐−1 be ๐ฒ-terms. Since ๐ is an isomorphism, ๐ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) [๐ผ] ⇔ (๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 )) ∈ ๐ผ(๐ ) ⇔ (๐(๐ผ(๐ก0 )), ๐(๐ผ(๐ก1 )), … , ๐(๐ผ(๐ก๐−1 ))) ∈ ๐(๐ผ(๐ )) ⇔ (๐ผ๐ (๐ก0 ), ๐ผ๐ (๐ก1 ), … , ๐ผ๐ (๐ก๐−1 )) ∈ ๐ผ๐ (๐ ) ⇔ ๐ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) [๐ผ๐ ]. โ Assume that ๐ is an ๐ฒ-formula. Then, ๐ โจ ¬๐ [๐ผ] ⇔ ๐ ฬธโจ ๐ [๐ผ] ⇔ ๐ ฬธโจ ๐ [๐ผ๐ ] ⇔ ๐ โจ ¬๐ [๐ผ๐ ], where the middle equivalence holds by induction. โ Let ๐ and ๐ be ๐ฒ-formulas. Suppose that ๐ โจ ๐ → ๐ [๐ผ]. This means that ๐ โจ ๐ [๐ผ] implies ๐ โจ ๐ [๐ผ]. Assume ๐ โจ ๐ [๐ผ๐ ]. By induction, we have that ๐ โจ ๐ [๐ผ], whence ๐ โจ ๐ [๐ผ]. Again, by induction, ๐ โจ ๐ [๐ผ๐ ]. Therefore, ๐ โจ ๐ → ๐ [๐ผ๐ ]. The converse is proved similarly. โ Let ๐ be an ๐ฒ-formula. Then, by Lemma 7.3.21 and since ๐ is an isomorphism, ๐ โจ ∃๐ฅ๐ [๐ผ] ⇔ ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด ⇔ ๐ โจ ๐ [(๐ผ๐ฅ๐ )๐ ] for some ๐ ∈ ๐ด ⇔ ๐ โจ ๐ [(๐ผ๐ )๐(๐) ๐ฅ ] for some ๐ ∈ ๐ด ⇔ ๐ โจ ๐ [(๐ผ๐ )๐๐ฅ ] for some ๐ ∈ ๐ต ⇔ ๐ โจ ∃๐ฅ๐ [๐ผ๐ ]. Section 7.3 HOMOMORPHISMS 387 Let ๐1 = (2โค, 0, +) and ๐2 = (3โค, 0, +). These isomorphic groups are infinite and cyclic, each with exactly two generators. The group ๐1 is generated by 2 and −2, and ๐2 is generated by 3 and −3. When Lemma 7.3.22 is restricted to sentences, we conclude that isomorphic structures model the same sentences. Hence, the ๐ฆ๐ฑsentences satisfied by ๐1 are exactly those ๐ฆ๐ฑ-sentences satisfied by ๐2 . For example, ๐1 โจ ∀๐ฅ∀๐ฆ(๐ฅ โฆ ๐ฆ = ๐ฆ โฆ ๐ฅ), and ๐2 โจ ∀๐ฅ∀๐ฆ(๐ฅ โฆ ๐ฆ = ๐ฆ โฆ ๐ฅ). Also, ๐1 โจ ∃๐ฅ∀๐ฆ∃๐ง(๐ฆ = ๐ฅ โฆ ๐ง), and ๐2 โจ ∃๐ฅ∀๐ฆ∃๐ง(๐ฆ = ๐ฅ โฆ ๐ง). Therefore, we make the next definition and follow it immediately with the theorem that follows from Lemma 7.3.22 and Theorem 7.1.31. DEFINITION 7.3.23 The ๐ฒ-structures ๐ and ๐ are elementary equivalent (denoted by ๐ ≡ ๐ ) if for all ๐ฒ-sentences ๐, ๐ โจ ๐ if and only if ๐ โจ ๐. THEOREM 7.3.24 For all ๐ฒ-structures ๐ and ๐ , if ๐ ≅ ๐ , then ๐ ≡ ๐ . Theorem 7.3.24 implies that if structures are isomorphic, there is no first-order sentence that can be used to distinguish between the two. That is, there is no sentence ๐ such that ๐ โจ ๐ but ๐ ฬธโจ ๐ when ๐ ≅ ๐ . Conversely, if there is a sentence that is satisfied by one structure but not the other, the structures cannot be isomorphic. EXAMPLE 7.3.25 Let ๐ and ๐ be ๐ฒ๐ณ-structures such that ๐ = (๐+ , ∈) and ๐ = (๐, ∈). Observe that ๐ โจ ∃๐ฅ∀๐ฆ(๐ฆ ∈ ๐ฅ ∨ ๐ฆ = ๐ฅ) but ๐ ฬธโจ ∃๐ฅ∀๐ฆ(๐ฆ ∈ ๐ฅ ∨ ๐ฆ = ๐ฅ). Therefore, by Theorem 7.3.24, we have that ๐ is not isomorphic to ๐ . EXAMPLE 7.3.26 Let ๐ = (๐ด, ๐) and ๐ = (๐ต, ๐) be {๐ }-structures, where ๐ is a binary relation symbol. Let ๐ด = {๐0 , ๐1 } with ๐0 ≠ ๐1 . Assume that the structures are not isomorphic yet ๐ ≡ ๐ . Define ๐ := ∃๐ฅ(๐ฅ = ๐ฅ), 388 Chapter 7 MODELS ๐ := ∀๐ฅ∃๐ฆ(๐ฅ ≠ ๐ฆ), and ๐ := ∀๐ฅ∀๐ฆ∀๐ง(๐ฅ ≠ ๐ฆ ∧ ๐ฅ ≠ ๐ง → ๐ฆ = ๐ง). Then, ๐ โจ ๐ because ๐ด is nonempty, ๐ โจ ๐ because ๐ด has at least two elements, and ๐ โจ ๐ because ๐ด has at most two elements. Hence, ๐ โจ ๐ ∧ ๐ ∧ ๐. Since ๐ ≡ ๐ , we have that ๐ โจ ๐ ∧ ๐ ∧ ๐, so |๐ต| = 2. Write ๐ต = {๐0 , ๐1 } with ๐0 ≠ ๐1 . This implies that there are only two bijections ๐ด → ๐ต. Call them ๐0 and ๐1 , and write ๐0 = {(๐0 , ๐0 ), (๐1 , ๐1 )} and ๐1 = {(๐0 , ๐1 ), (๐1 , ๐0 )}. By assumption, neither of these functions are {๐ }-homomorphisms, so they fail to preserve ๐ . Suppose that (๐0 , ๐1 ) ∈ ๐(๐ ) is the particular element that causes the second condition of Definition 7.3.1 to fail. Therefore, ๐ โจ ∃๐ฅ∃๐ฆ[๐ฅ ≠ ๐ฆ ∧ ๐ (๐ฅ, ๐ฆ)]. Since ๐0 does not preserve ๐ , (๐0 , ๐1 ) ∉ ๐(๐ ), and since ๐1 does not preserve ๐ , (๐1 , ๐0 ) ∉ ๐(๐ ). Hence, ๐ ฬธโจ ∃๐ฅ∃๐ฆ[๐ฅ ≠ ๐ฆ ∧ ๐ (๐ฅ, ๐ฆ)], which is impossible because ๐ ≡ ๐ , so ๐0 or ๐1 must be an {๐ }-isomorphism. The argument generalizes for all structures with a domain of cardinality 2 (Exercise 17) and then to all structures with a finite domain (Exercise 18). Therefore, the converse of Theorem 7.3.24 holds, provided that the domains of the structures are finite. Elementary Substructures Let ๐ = (๐ด, ๐) and ๐ = (๐ต, ๐) be ๐ฒ-structures such that ๐ ⊆ ๐ . Let ๐ผ be an interpretation of ๐. By Definition 7.1.4, we have the following: โ For all variable symbols ๐ฅ, ๐ผ(๐ฅ) ∈ ๐ด ⊆ ๐ต. โ For all constant symbols ๐, ๐ผ(๐) ∈ ๐ด ⊆ ๐ต. โ For all ๐-ary function symbols ๐ and ๐ฒ-terms ๐ก0 , ๐ก1 , … , ๐ก๐−1 , because ๐ผ(๐ก๐ ) ∈ ๐ด for ๐ = 0, 1, … , ๐ − 1 and ๐(๐ ) = ๐(๐ ) ๐ด๐ , ๐ผ(๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 )) = ๐(๐ )(๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 )) = ๐(๐ )(๐ผ(๐ก0 ), ๐ผ(๐ก1 ), … , ๐ผ(๐ก๐−1 )). This implies that ๐ผ is also an interpretation of ๐ . Moreover, when ๐ is a quantifier-free ๐ฒ-formula, the interpretation of ๐ in ๐ requires no element of ๐ต โงต ๐ด. This implies the next theorem. Its proof is left to Exercise 21. Section 7.3 HOMOMORPHISMS 389 THEOREM 7.3.27 Let ๐ and ๐ be ๐ฒ-structures such that ๐ ⊆ ๐ . For all ๐ฒ-interpretations ๐ผ of ๐ and all quantifier-free ๐ฒ-formulas ๐, ๐ โจ ๐ [๐ผ] if and only if ๐ โจ ๐ [๐ผ]. Next, consider ๐ โจ ∃๐ฅ๐ [๐ผ]. This means that ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ต. If ๐ is also in ๐ด, then ๐ โจ ๐ [๐ผ๐ฅ๐ ], but this conclusion is not guaranteed if ๐ ∈ ๐ต โงต ๐ด. For example, let ๐ = (โค, <) and ๐ = (โ, <) be {๐ }-structures, where ๐ is a binary relation symbol. Since โค ⊆ โ, we have that ๐ is a substructure of ๐ . However, because of differences between โ and โค, ๐ โจ ∀๐ฅ1 ∀๐ฅ2 ∃๐ฆ(๐ฅ1 < ๐ฆ < ๐ฅ2 ) but ๐ ฬธโจ ∀๐ฅ1 ∀๐ฅ2 ∃๐ฆ(๐ฅ1 < ๐ฆ < ๐ฅ2 ). Hence, in order for ๐ โจ ๐ [๐ผ] if and only if ๐ โจ ๐ [๐ผ] for even quantified formulas ๐, a stronger condition is required. DEFINITION 7.3.28 ๐ is an elementary substructure of ๐ (written as ๐ โชฏ ๐ ) if ๐ ⊆ ๐ and for all interpretations ๐ผ of ๐ and ๐ฒ-formulas ๐, ๐ โจ ๐ [๐ผ] if and only if ๐ โจ ๐ [๐ผ]. If ๐ โชฏ ๐ , then ๐ is an elementary extension of ๐. Suppose that ๐ and ๐ are ๐ฒ-structures such that ๐ is an elementary substructure of ๐ . Let ๐ be an ๐ฒ-sentence. Since ๐ is also an ๐ฒ-formula, ๐ โจ ๐ if and only if ๐ โจ ๐ by Definition 7.3.28. Therefore, ๐ and ๐ are elementary equivalent, proving the next result. THEOREM 7.3.29 Let ๐ and ๐ be ๐ฒ-structures. If ๐ โชฏ ๐ , then ๐ ≡ ๐ . However, the converse of Theorem 7.3.29 does not hold. It is possible for ๐ ⊆ ๐ and ๐ ≡ ๐ yet ๐ not be an elementary substructure of ๐ . If this is to be the case, there must be a formula ๐ with at least one free variable that is satisfied by one of the structures but not the other. EXAMPLE 7.3.30 Let ๐ = ([0, 1] , ๐) and ๐ = ([0, 2] , ๐) be the {๐ }-structures of Example 7.2.2, where ๐(๐ ) is standard < on [0, 1] and ๐(๐ ) is standard < on [0, 2]. Recall that ๐ is a substructure of ๐ . Since ๐ defined by ๐ (๐ฅ) = 2๐ฅ is an {๐ }-isomorphism [0, 1] → [0, 2], we conclude that ๐ ≡ ๐ (Theorem 7.3.24). However, define the formula ๐(๐ฅ) := ∃๐ฆ๐ (๐ฅ, ๐ฆ) 390 Chapter 7 MODELS and let ๐ผ be an interpretation such that ๐ผ(๐ฅ) = 1. Then, ๐ ฬธโจ ๐(๐ฅ) [๐ผ] and ๐ โจ ๐(๐ฅ) [๐ผ]. Therefore, ๐ is not an elementary substructure of ๐ . Because of Theorem 7.3.27, proving that one structure is an elementary substructure of another reduces to a check of one particular condition. This condition is found in the next theorem, which is due to Tarski and Vaught (1957). THEOREM 7.3.31 [Tarski–Vaught] Let ๐ and ๐ be ๐ฒ-structures such that ๐ ⊆ ๐ . Let ๐ด be the domain of ๐. The following are equivalent. โ ๐ is an elementary substructure of ๐ . โ For every ๐ฒ-formula ๐ and ๐ฒ-interpretation ๐ผ of ๐, if ๐ โจ ∃๐ฅ๐ [๐ผ] then ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด. (7.19) PROOF Suppose that ๐ โชฏ ๐ . Let ๐ be an ๐ฒ-formula and ๐ผ an ๐ฒ-interpretation of ๐. Let ๐ โจ ∃๐ฅ๐ [๐ผ]. By hypothesis, we have that ๐ โจ ∃๐ฅ๐ [๐ผ]. Thus, ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด, so again by hypothesis, ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด. Conversely, let ๐ be an ๐ฒ-formula and ๐ผ an ๐ฒ-interpretation of ๐. We proceed by induction on formulas to show that ๐ โชฏ ๐ . โ If ๐ is ๐ก0 = ๐ก1 or ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) for ๐ฒ-terms ๐ก0 , ๐ก1 , … , ๐ก๐−1 , then ๐ has no quantifiers, and the result follows by Theorem 7.3.27. โ Let ๐ be ¬๐. Then, ๐ โจ ๐ [๐ผ] ⇔ ๐ โจ ¬๐ [๐ผ] ⇔ ๐ ฬธโจ ๐ [๐ผ] ⇔ ๐ ฬธโจ ๐ [๐ผ] ⇔ ๐ โจ ¬๐ [๐ผ] ⇔ ๐ โจ ๐ [๐ผ]. โ Suppose ๐ is ๐ → ๐. Assume that ๐ โจ ๐ → ๐ [๐ผ] and let ๐ โจ ๐ [๐ผ]. Then, ๐ โจ ๐ [๐ผ] by induction, so ๐ โจ ๐ [๐ผ]. Again, by induction, ๐ โจ ๐ [๐ผ]. Therefore, ๐ โจ ๐ → ๐ [๐ผ]. The converse is proved similarly. โ Now let ๐ be ∃๐ฅ๐. First, because ๐ ⊆ ๐ , ๐ โจ ∃๐ฅ๐ [๐ผ] ⇒ ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด ⇒ ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ต ⇒ ๐ โจ ∃๐ฅ๐ [๐ผ]. Section 7.3 HOMOMORPHISMS 391 Second, by (7.19), ๐ โจ ∃๐ฅ๐ [๐ผ] ⇒ ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด ⇒ ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด ⇒ ๐ โจ ∃๐ฅ๐ [๐ผ]. EXAMPLE 7.3.32 Prove that ๐ = (โ, ≤) is an elementary substructure of ๐ = (โ, ≤). To do this, follow the test given by Theorem 7.3.31. Let ๐ be an ๐ฒ-formula and ๐ผ be an ๐ฒ-interpretation of ๐. Assume that ๐ โจ ∃๐ฅ๐ [๐ผ]. This implies that ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ โ. Take ๐0 , ๐1 , ๐ ∈ โ such that ๐, ๐ ∈ (๐0 , ๐1 ) and define the functions ๐0 ๐1 ๐2 ๐3 : (−∞, ๐0 ] → (−∞, ๐0 ], : (๐0 , ๐) → (๐0 , ๐), : [๐, ๐1 ) → [๐, ๐1 ), : [๐1 , ∞) → [๐1 , ∞), so that ๐0 is the identity on (−∞, ๐0 ], ๐3 is the identity on [๐1 , ∞), and ๐1 and ๐2 are order isomorphisms (Exercise 22). Let ๐ = ๐0 ∪ ๐1 ∪ ๐2 ∪ ๐3 . Then, ๐ is an order isomorphism โ → โ such that ๐ (๐) = ๐, ๐ (๐0 ) = ๐ (๐0 ), and ๐ (๐1 ) = ๐ (๐1 ). Hence, ๐ is an automorphism (Exercise 23). Therefore, by Lemmas 7.3.21 and 7.3.22, ๐ โจ ๐ [๐ผ๐ฅ๐ ] ⇔ ๐ โจ ๐ [(๐ผ๐ฅ๐ )๐ ] ⇔ ๐ โจ ๐ [(๐ผ๐ )๐๐ฅ (๐) ], which implies that ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ โ. Thus, ๐ โชฏ ๐ , which also implies that ๐ ≡ ๐ (Theorem 7.3.29). Notice that this example implies that there is no first-order sentence that can distinguish between (โ, <) and (โ, <) even though they are not isomorphic. Also, this example shows that the converse of Theorem 7.3.24 is false if the domains of the structures are infinite. Certainly, for all ๐ฒ-structures ๐, ๐ , and โญ, we have the following by Exercise 19: โ ๐ ≡ ๐. โ If ๐ ≡ ๐ , then ๐ ≡ ๐. โ If ๐ ≡ ๐ and ๐ ≡ โญ, then ๐ ≡ โญ. Similar to this and Theorem 7.2.4, we have the following result for elementary substructures. 392 Chapter 7 MODELS THEOREM 7.3.33 Let ๐, ๐ , and โญ be ๐ฒ-structures. โ ๐ โชฏ ๐. โ If ๐ โชฏ ๐ and ๐ โชฏ โญ, then ๐ โชฏ โญ. โ If ๐ โชฏ โญ, ๐ โชฏ โญ, and ๐ ⊆ ๐ , then ๐ โชฏ ๐ . PROOF We prove the second part and leave the others to Exercise 25. Let ๐ โชฏ ๐ and ๐ โชฏ โญ. This first means that ๐ ⊆ ๐ and ๐ ⊆ โญ. Thus, by Theorem 7.2.4, ๐ ⊆ โญ. Let ๐ผ be an interpretation of ๐. The proof is completed for quantifierfree formulas using Theorem 7.3.27 (Exercise 24), so let ๐ be an ๐ฒ-formula with a quantifier. โ Suppose ๐ โจ ∃๐ฅ๐ [๐ผ]. This implies that ๐ โจ ๐ [๐ผ๐ฅ๐ ] for some ๐ ∈ ๐ด. Then, ๐ โจ ๐ [๐ผ๐ฅ๐ ] from which it follows that โญ โจ ๐ [๐ผ๐ฅ๐ ]. Since ๐ด ⊆ ๐ถ, we conclude that โญ โจ ∃๐ฅ๐ [๐ผ]. โ Conversely, let โญ โจ ∃๐ฅ๐ [๐ผ]. By Theorem 7.3.31, there exists ๐ ∈ ๐ต such that โญ โจ ๐ [๐ผ๐ฅ๐ ]. By hypothesis, ๐ โจ ๐ [๐ผ๐ฅ๐ ], so ๐ โจ ∃๐ฅ๐ [๐ผ]. Again, by Theorem 7.3.31, there exists ๐ ∈ ๐ด such that ๐ โจ ๐ [๐ผ๐ฅ๐ ]. From this follows that ๐ โจ ๐ [๐ผ๐ฅ๐ ], whence ๐ โจ ∃๐ฅ๐ [๐ผ]. Exercises 1. Let ๐, ๐ , and โญ be ๐ฒ-structures such that ๐ ⊆ ๐ . Let ๐ : ๐ → โญ be a homomorphism. Prove that ๐[dom(๐)] is the domain of a substructure of โญ. 2. Prove that the following are group homomorphisms. Assume in each instance that โฆ is interpreted as the standard addition on the set. (a) ๐ : โค × โค → โค where ๐(๐, ๐) = ๐ (b) ๐ : โค12 → โค6 where ๐([๐]12 ) = [๐]6 (c) ๐ : โค × โค → M2, 2 (โ) where [ ] ๐ 0 ๐(๐, ๐) = 0 ๐ 3. Prove that the zero map is a group homomorphism. 4. Let โจ = (โค, +) and โจ๐ = (โค๐ , +) with ๐ ∈ โค+ . Prove that ๐ : โค → โค๐ defined by ๐(๐ฅ) = [๐ฅ]๐ is a ring homomorphism โจ → โจ๐ . 5. Let โญ = (โ, 0 + 0๐, +, ⋅). Define ๐ : โ → โ by ๐(๐ + ๐๐) = ๐ − ๐๐ for all ๐, ๐ ∈ โ. Prove the following. (a) ๐ is a ring homomorphism โญ → โญ. (b) ๐ is one-to-one and onto. Section 7.3 HOMOMORPHISMS 393 6. Prove the first part of Theorem 7.3.8. 7. Let ℜ = (๐ , 0, +, ⋅) be a ring and ℑ be an ideal of ℜ with domain ๐ผ. Let the function ๐ : ๐ → ๐ โ๐ผ be defined as ๐(๐) = ๐ + ๐ผ for all ๐ ∈ ๐ . (a) Prove that ๐ is a homomorphism. (b) Show that ker(๐) = ๐ผ. (c) Prove that ๐ is onto. 8. Prove Theorem 7.3.15. 9. Prove that the function ๐ of Example 7.3.16 is a bijection. 10. Let ๐ and ๐ be ๐ฒ-structures. Write ๐ - ๐ if there exists a one-to-one homomorphism ๐ : ๐ → ๐ . Prove the following. (a) ๐ - ๐ (b) If ๐ - ๐ and ๐ - โญ, then ๐ - โญ. (c) It is possible for ๐ - ๐ and ๐ - โญ yet ๐ and ๐ are different ๐ฒ-structures. 11. Let ๐ and ๐′ be {๐ }-structures with equal domains that are finite, where ๐ is a binary relation symbol. Assume that ๐ โจ X has a least element with respect to R and ๐′ โจ X has a least element with respect to R for every ๐ ⊆ dom(๐). Prove that ๐ ≅ ๐′ . 12. Assume that ๐ฒ is finite and that the domain of the ๐ฒ-structure ๐ is finite. Find a ๐ฒ-sentence ๐ such that ๐ โจ ๐ if and only if ๐ ≅ ๐, for all ๐ฒ-structures ๐ . 13. Suppose that ๐ : ℜ → ℜ′ is a ring isomorphism as in Example 7.3.18. Prove. (a) If ℜ is an integral domain, ℜ′ is an integral domain. (b) If ℜ is a division ring, ℜ′ is a division ring. 14. Let ๐ : ℜ → ℜ′ be a ring isomorphism. Prove that 1 is unity from ℜ if and only if ๐(1) is unity from ℜ′ . 15. Prove or show false the given elementary equivalences. (a) (โ, 0, +) ≡ (โ, 0, +). (b) (โค, 0, +) ≡ (โ, 0, +). (c) (โค, ≤) ≡ (โ, ≤). (d) (๐, ≤) ≡ (โค+ , ≤). (e) (๐, ≤) ≡ (โค− , ≤). 16. Let ๐ด and ๐ต be sets and assume that ๐ฒ = ∅. Prove that (๐ด) ≡ (๐ต). 17. Generalize the argument of Example 7.3.26 to prove that ๐ ≡ ๐ implies that ๐ ≅ ๐ if the cardinality of the domain of ๐ equals 2. 18. Generalize the argument of Example 7.3.26 to prove that ๐ ≡ ๐ implies that ๐ ≅ ๐ if the domain of ๐ is finite. 394 Chapter 7 MODELS 19. Let ๐, ๐ , and โญ be ๐ฒ-structures. Prove the following. (a) ๐ ≡ ๐. (b) If ๐ ≡ ๐ , then ๐ ≡ ๐. (c) If ๐ ≡ ๐ and ๐ ≡ โญ, then ๐ ≡ โญ. 20. Let ๐ and ๐ be elementary equivalent ๐ฒ-structures. Prove that there exists an ๐ฒ-structure โญ such that ๐ โชฏ โญ and ๐ โชฏ โญ. 21. Prove Theorem 7.3.27. 22. Find the order isomorphisms ๐1 and ๐2 in Example 7.3.32. 23. Prove that the function ๐ from Example 7.3.32 is a {≤}-isomorphism. 24. Assume that for all ๐ฒ-structures ๐, ๐ , and โญ, if ๐ โชฏ ๐ and ๐ โชฏ โญ, then ๐ โจ ๐ if and only if โญ โจ ๐ for all quantifier-free ๐. 25. Prove the first and third parts of Theorem 7.3.33. 26. Let โฑ = {๐๐พ : ๐พ ∈ ๐ } be a chain of ๐ฒ-structures for some cardinal ๐ (Exercise 7.2.3). Assume that ๐๐ผ โชฏ ๐๐ฝ when ๐ผ ∈ ๐ฝ ∈ ๐ , making โฑ an elementary chain. โ Prove that ๐๐ผ โชฏ ๐พ∈๐ ๐๐พ for all ๐ผ ∈ ๐ . 27. Find a set of subject symbols ๐ฒ and a chain of ๐ฒ-structures {๐๐ : ๐ ∈ ๐} such that โ ๐๐ ≡ ๐๐ for all ๐, ๐ ∈ ๐ but ๐0 โข ๐∈๐ ๐๐ . 7.4 THE THREE PROPERTIES REVISITED This section is an extension of Section 1.5 to first-order logic. First, we define what it means to be consistent with the ultimate goal to show that any consistent system has a model. Next, we show that first-order logic is sound in that every sentence that can be proved is true, provided we have the correct understanding of what it means to prove and what it means to be true. Lastly, we show that first-order logic is complete in that every true sentence can be proved. Consistency Consider the following propositions representing Euclid’s axioms for his geometry (Euclid 1925, I Postulates). โ Eu1. A line can be drawn through two distinct points. โ Eu2. A line segment can be drawn between two distinct points. โ Eu3. A circle can be drawn given any center and radius. โ Eu4. All right angles are congruent to each another. โ Eu5. If a line falling on two straight lines make the interior angles on the same side less than two right angles, the two lines intersect on the side on which the angles are less than two right angles. Section 7.4 THE THREE PROPERTIES REVISITED 395 There are three sets of proposition here. First, is the list itself, โฐ = {Eu1, Eu2, Eu3, Eu4, Eu5}. Second, is the set of consequences (Definition 7.1.20) of โฐ , โฐ1 = {๐ : โฐ โจ ๐}. Third, is the set of propositions provable (Definition 1.2.13) from โฐ , โฐ2 = {๐ : โฐ โข ๐}. For Euclidean geometry, โฐ1 = โฐ2 . This means that both โฐ1 and โฐ2 can be said to describe all of the propositions that are in the geometry. It is the job of the geometer to discover what those propositions are. We want to similarly analyze first-order logic. Given the rules of logic and a set of theory symbols, we want the set of consequences to be equal to the set of provable sentences. One way to have this happen is for a contradiction to follow from the axioms (1.2.8). Then, by Theorem 1.5.2, every propositional form will have a proof and, in turn, will also be a consequence, but we do not want such a system. Therefore, for all of this to work effectively, it is a requirement that first-order logic is without contradiction. To deal with this concept, we start with a definition. DEFINITION 7.4.1 An ๐ฆ-theory is a set of ๐ฒ-sentences. The set of all sentences provable in first-order logic given a set of theory symbols ๐ฒ is an example of an ๐ฒ-theory. The theories that we study should have a familiar property (compare Definition 1.5.1). DEFINITION 7.4.2 An ๐ฒ-theory ๐ฏ is consistent [denoted by ๐ข๐๐(๐น )] if ๐ฏ โฌ ๐ ∧ ¬๐ for every ๐ฒ-sentence ๐. Otherwise, ๐ฏ is inconsistent. We next generalize Theorem 1.5.2 to first-order logic. Its proof is left to Exercise 2. THEOREM 7.4.3 Let ๐ฏ be an ๐ฒ-theory. The following are equivalent. โ ๐ฏ is consistent. โ Every finite subset of ๐ฏ is consistent. โ There is an ๐ฒ-sentence that is not provable from ๐ฏ . As a consequence of Theorem 7.4.3, a theory ๐ฏ is inconsistent if either it has a finite subset that is inconsistent or it is able to prove all sentences. This identifies the major 396 Chapter 7 MODELS weakness of an inconsistent theory. Not only can a contradiction be proved, but any sentence can also be proved. Such a system is certainly worthless. If a theory is consistent, it can be shown to be a subset of a theory that is the greatest possible consistent theory. We generalize Definition 1.5.3 to name this theory. DEFINITION 7.4.4 The ๐ฒ-theory ๐ฏ is maximally consistent if ๐ฏ is consistent and ๐ข๐๐(๐ฏ ∪ {๐}) implies ๐ ∈ ๐ฏ for all ๐ฒ-sentences ๐. The next lemma gives some basic properties of maximally consistent theories. LEMMA 7.4.5 Let ๐ฏ be a maximally consistent ๐ฒ-theory. Let ๐ and ๐ be ๐ฒ-sentences. โ If ๐ฏ โข ๐, then ๐ ∈ ๐ฏ . โ If โข ๐, then ๐ ∈ ๐ฏ . โ ๐ ∈ ๐ฏ or ¬๐ ∈ ๐ฏ , but not both. โ If ๐ → ๐, ๐ ∈ ๐ฏ , then ๐ ∈ ๐ฏ . PROOF โ Let ๐ฏ โข ๐. Suppose that ๐ฏ ∪ {๐} โข ๐ ∧ ¬๐ for some ๐ฒ-sentence ๐. This means that there exists ๐ฒ-sentences ๐0 , ๐1 , … , ๐๐−1 such that ๐, ๐0 , ๐1 , … , ๐๐−1 , ๐ ∧ ¬๐ is a proof of ๐ ∧ ¬๐ from ๐ฏ ∪ {๐}. Since ๐ฏ proves ๐, there are ๐ฒ-sentences ๐0 , ๐1 , … , ๐๐−1 such that ๐0 , ๐1 , … , ๐๐−1 , ๐ is a proof of ๐ from ๐ฏ . Therefore, ๐0 , ๐1 , … , ๐๐−1 , ๐, ๐0 , … , ๐๐−1 , ๐ ∧ ¬๐ is a proof of ๐ ∧ ¬๐ from ๐ฏ , a contradiction. This implies that ๐ฏ ∪ {๐} is consistent, so since ๐ฏ is maximally consistent, ๐ ∈ ๐ฏ . โ If โข ๐, then ๐ฏ โข ๐, so ๐ ∈ ๐ฏ by the first property. โ Since ๐ฏ is consistent, ๐ข๐๐(๐ฏ ∪ {๐}) or ๐ข๐๐(๐ฏ ∪ {¬๐}). Because ๐ฏ is maximally consistent, ๐ ∈ ๐ฏ or ¬๐ ∈ ๐ฏ , but not both because ๐ข๐๐(๐ฏ ). โ Let ๐ → ๐ and ๐ be members of ๐ฏ . By MP, we have that ๐ฏ โข ๐, so again by the first part, ๐ ∈ ๐ฏ . Given a consistent theory, the construction of the maximally consistent theory that contains it requires the use of a chain. Section 7.4 THE THREE PROPERTIES REVISITED 397 LEMMA 7.4.6 If (๐ , ⊆) is a chain of consistent ๐ฒ-theories, โ ๐ is consistent. PROOF Let ๐ be a chain of consistent sets of ๐ฒ-sentences with respect to ⊆. Suppose โ โ ๐ โข ๐ ∧ ¬๐ for some sentence ๐. Hence, there exists ๐0 , ๐1 , … , ๐๐−1 ∈ ๐ such that ๐0 , ๐1 , … , ๐๐−1 โข ๐ ∧ ¬๐. This implies that for each ๐ = 0, 1, … , ๐ − 1, there exists ๐ถ๐ ∈ ๐ such that ๐๐ ∈ ๐ถ๐ . Since {๐ถ๐ : ๐ = 0, 1, … , ๐ − 1} is a finite chain, arrange them so that ๐ถ๐ is the greatest element with respect to ⊆. This gives {๐0 , ๐1 , … , ๐๐−1 } ⊆ ๐ถ๐ , which implies that ๐ถ๐ โข ๐ ∧ ¬๐, contradicting the consistency of ๐ถ๐ . The next result proves the existence of a greatest consistent theory by generalizing Theorem 1.5.4 to an arbitrary set of sentences. THEOREM 7.4.7 [Lindenbaum] Every consistent ๐ฒ-theory is a subset of a maximally consistent ๐ฒ-theory. PROOF Let ๐ฏ be a consistent set of ๐ฒ-sentences. Define ๐ = {๐ธ : ๐ฏ ⊆ ๐ธ and ๐ธ is a consistent set of ๐ฒ-formulas}. Note that ๐ ≠ ∅ since ๐ฏ ∈ ๐ . Let ๐ be a chain in ๐ . Then, โ โ ๐ฏ ⊆ ๐. โ โ ๐ ∈ ๐ because โ ๐ is consistent because each element of the chain ๐ is consistent by Lemma 7.4.6. We conclude by Zorn’s lemma (5.1.13) that there is a greatest element ๐ ∈ ๐ with respect to ⊆. By definition, ๐ฏ ⊆ ๐ and ๐ข๐๐(๐). Also, let ๐ be a formula such that ๐ข๐๐(๐ ∪ {๐}). Then, ๐ ∪ {๐} ∈ ๐ , but since ๐ is maximal, we conclude that ๐ ∪ {๐} = ๐. Thus, ๐ ∈ ๐, showing that ๐ is maximally consistent. Soundness We previously defined a logic to be sound if every theorem is a tautology and complete if every tautology is a theorem (Definition 1.5.5). We now give the corresponding definition for first-order logic (Figure 7.3). 398 Chapter 7 MODELS Sound The sentence is valid. The sentence is a theorem. Complete Figure 7.3 Sound and complete logics. DEFINITION 7.4.8 โ A logic is sound if every theorem is valid. โ A logic is complete if every valid sentence is a theorem. A propositional form ๐ is a tautology if ๐(๐) = T for every valuation ๐ (Definition 1.1.14). This means that any interpretation of ๐ will always yield a true proposition. This, in turn, implies that ๐ will hold in every model. That is, tautologies are valid. Therefore, it is evident that Definition 7.4.8 is a generalization of Definition 1.5.5 to first-order logic. As in Section 1.5, we begin with soundness. THEOREM 7.4.9 [Soundness] Let ๐ฏ be a set of ๐ฒ-sentences. For any ๐ฒ-sentence ๐, if ๐ฏ โข ๐, then ๐ฏ โจ ๐. PROOF Using strong induction, we prove that for all ๐ ∈ โค+ , if there exists a proof of ๐ from ๐ฏ consisting of ๐ sentences, then ๐ฏ โจ ๐. (7.20) In the case that ๐ = 1, we see that โข ๐. This implies that ๐ is an axiom or ๐ ∈ ๐ฏ (Definition 1.2.13), so we have that ๐ฏ โจ ๐. Now suppose that ๐ > 1 and assume that (7.20) holds for all ๐ ≤ ๐. This means that there exists sentences ๐0 , ๐1 , … , ๐๐−1 provable from ๐ฏ such that ๐0 , ๐1 , … , ๐๐−1 , ๐ is a proof of ๐. Let ๐ be any model of ๐ฏ . We have three cases to consider (Theorem 1.4.2). โ ๐ ∈ ๐ฏ , which implies that ๐ โจ ๐. Section 7.4 THE THREE PROPERTIES REVISITED 399 โ ๐ is derived using MP. This implies there exists ๐ฒ-sentences ๐ and ๐ such that ๐, ๐ → ๐ ∈ {๐0 , ๐1 , … , ๐๐−1 , ๐}. Since a sentence of a proof is proved from previous sentences of the proof, by induction, ๐ โจ ๐ → ๐ and ๐ โจ ๐. Therefore, ๐ โจ ๐. โ ๐ is logically equivalent to ๐๐ for some ๐ = 0, 1, … , ๐ − 1. This means that ๐ ↔ ๐๐ is a tautology, so ๐ โจ ๐ ↔ ๐๐ . By Theorem 7.1.7, we conclude that ๐ โจ ๐ if and only if ๐ โจ ๐๐ , but by induction, ๐ โจ ๐๐ , so ๐ โจ ๐. If ๐ฏ = ∅, we can apply Theorem 7.4.9 to conclude the corollary. COROLLARY 7.4.10 First-order logic is sound. As with Corollary 1.5.11 for propositional logic, we can also prove the next result. COROLLARY 7.4.11 First-order logic is consistent. Completeness It is now time to prove the completeness of first-order predicate logic. This result is due to Kurt Gödel (1929) but the proof given here is due to Leon Henkin (1949). As opposed to the soundness theorem (7.4.9), the proof is rather involved. We begin with a definition (compare with EI, Theorem 2.3.14). DEFINITION 7.4.12 Let ๐ฏ be a set of ๐ฒ-sentences and ๐ถ a set of constant symbols of ๐ฒ. Define ๐ถ to be a witness set for ๐ฏ if for all ๐ฒ-formulas ๐ = ๐(๐ฅ), there exists ๐ ∈ ๐ถ such that ๐ ๐ฏ โข ∃๐ฅ๐ → ๐ . ๐ฅ For example, let ๐ฏ = {๐ฅ + 1 = 0, ๐ฅ + (1 + 1) = 0, (๐ฅ + ๐ฆ) + 1 = 1} be a set of ๐ญ๐ณ-sentences. Extend ๐ญ๐ณ to ๐ฒ = ๐ญ๐ณ ∪ ๐ถ, where ๐ถ = {−1, −2}. Then, ๐ถ is a witness set for ๐ฏ if −1 ๐ฏ โข ∃๐ฅ(๐ฅ + 1 = 0) → (๐ฅ + 1 = 0) ๐ฅ and ๐ฏ โข ∃๐ฅ(๐ฅ + (1 + 1) = 0) → (๐ฅ + (1 + 1) = 0) −2 . ๐ฅ Since (๐ฅ + ๐ฆ) + 1 = 1 has two free variables, the witness set does not apply to it. Recall that for a set of theory symbols ๐ฒ, the notation ๐ซ(๐ฒ) refers to the set of all ๐ฒ-formulas (Definition 2.1.13). 400 Chapter 7 MODELS LEMMA 7.4.13 Let ๐ฏ be a consistent set of ๐ฒ-sentences. Let ๐ถ be a set of constant symbols not in ๐ฒ such that |๐ถ| = |๐ซ(๐ฒ)|. Then, ๐ฏ can be extended to a consistent set of (๐ฒ ∪ ๐ถ)-sentences that has ๐ถ as a witness set. PROOF Let ๐ = |๐ซ(๐ฒ)| and suppose that ๐ถ = {๐๐ผ : ๐ผ ∈ ๐ } is a set of new constant symbols. Assume that ๐๐ผ ≠ ๐๐ฝ for all ๐ผ ∈ ๐ฝ ∈ ๐ . Since |๐ฟ(๐ฒ ∪ ๐ถ)| = ๐ , we identify the formulas of ๐ฟ(๐ฒ ∪ ๐ถ) with at most one free variable as ๐๐ผ = ๐๐ผ (๐ฅ๐ผ ) with ๐ผ ∈ ๐ . Now define a chain ๐ = {๐ฏ๐พ : ๐พ ∈ ๐ } and a set {๐๐พ : ๐พ ∈ ๐ } by transfinite recursion (Corollary 6.1.24) such that for all ๐ฝ ∈ ๐ , โ ๐ฏ๐ฝ is consistent, โ ๐ฏ๐ฝ is a (๐ฒ ∪ ๐ถ)-theory, โ |๐ฏ๐ฝ+1 โงต ๐ฏ๐ฝ | = 1, โ ๐๐ฝ is not among the symbols of the sentences in ๐ฏ๐ฝ . Let ๐ฏ0 = ๐ฏ and suppose that ๐ฏ๐ฟ and ๐๐ฟ has been defined for all ๐ฟ ∈ ๐ผ with the indicated properties. โ Let ๐ผ = ๐ฝ + 1. Choose ๐๐ฝ from ๐ถ such that ๐๐ฝ is not among the symbols of the formulas in ๐ฏ๐ฝ ∪ {๐๐ฝ }. This can be done since there are less than ๐ theory symbols in ๐ฏ๐ฝ ∪ {๐๐ฝ }. Now define ๐ฏ๐ฝ+1 } { ๐๐ฝ . = ๐ฏ๐ฝ ∪ ∃๐ฅ๐ฝ ๐๐ฝ → ๐๐ฝ ๐ฅ๐ฝ (7.21) In order to obtain a contradiction, suppose that ๐ฏ๐ฝ+1 is inconsistent. Since ๐ฏ๐ฝ is consistent, it must be the case that ) ( ๐๐ฝ . ๐ฏ๐ฝ โข ¬ ∃๐ฅ๐ฝ ๐๐ฝ → ๐๐ฝ ๐ฅ๐ฝ This implies that ๐ฏ๐ฝ โข ∃๐ฅ๐ฝ ๐๐ฝ ∧ ¬๐๐ฝ By Com and Simp, ๐ฏ๐ฝ โข ¬๐๐ฝ ๐๐ฝ ๐ฅ๐ฝ ๐๐ฝ ๐ฅ๐ฝ . . Since ๐๐ฝ is not among the symbols of ๐๐ฝ or ๐ฏ๐ฝ , it is arbitrary, so by UG, ๐ฏ๐ฝ โข ∀๐ฅ๐ฝ ¬๐๐ฝ . 401 Section 7.4 THE THREE PROPERTIES REVISITED That is, ๐ฏ๐ฝ โข ¬∃๐ฅ๐ฝ ๐๐ฝ . Using Simp and Conj, we conclude that ๐ฏ๐ฝ โข ∃๐ฅ๐ฝ ๐๐ฝ ∧ ¬∃๐ฅ๐ฝ ๐๐ฝ , contradicting the consistency of ๐ฏ๐ฝ . โ If ๐ผ is a limit ordinal, define ๐ฏ๐ผ = โ ๐ฏ๐ฝ , ๐ฝ∈๐ผ which is consistent by Lemma 7.4.6. We claim that the (๐ฒ ∪ ๐ถ)-theory โ ๐ฏ๐ฝ ๐ฝ∈๐ is the desired extension of ๐ฏ . To see this, first note that this union is consistent by Lemma 7.4.6. Also, let ๐(๐ฅ) be an ๐ฒ ∪ ๐ถ-formula with at most one free variable. This implies that ๐ = ๐๐ผ and ๐ฅ = ๐ฅ๐ผ for some ๐ผ ∈ ๐ . Therefore, ∃๐ฅ๐ผ ๐๐ผ → ๐๐ผ so โ ๐พ∈๐ ๐๐ผ ∈ ๐ฏ๐ผ+1 , ๐ฅ๐ผ ๐ฏ๐พ โข ∃๐ฅ๐ผ ๐๐ผ → ๐๐ผ ๐๐ผ . ๐ฅ๐ผ Let ๐ฒ be a set of theory symbols and ๐ถ be a set of constants from ๐ฒ. Let ๐ฏ be a consistent ๐ฒ-theory. We want to define an ๐ฒ-structure that has a chance of being a model of ๐ฏ . One of the issues that must be overcome is whether any pair of constants should be interpreted as being equal. If so, we want to view those constants as equivalent. To do this, define a relation ∼ on ๐ถ by ๐0 ∼ ๐1 if and only if ๐0 = ๐1 ∈ ๐ฏ (7.22) for all ๐0 , ๐1 ∈ ๐ถ. This defines an equivalence relation under the right conditions. LEMMA 7.4.14 Let ๐ถ be a set of constants from ๐ฒ. If ๐ฏ is a maximally consistent ๐ฒ-theory, then ∼ is an equivalence relation on ๐ถ. 402 Chapter 7 MODELS PROOF Let ๐0 , ๐1 , ๐2 ∈ ๐ถ. โ Since ๐ฏ ∪ {๐0 = ๐0 } is consistent by E1 (Axioms 5.1.1), by the maximal consistency of ๐ฏ , we have that ๐0 = ๐0 ∈ ๐ฏ . โ Suppose ๐0 = ๐1 ∈ ๐ฏ . Then, ๐ฏ โข ๐0 = ๐1 , so ๐ฏ โข ๐1 = ๐0 by E2. Thus, ๐1 = ๐0 ∈ ๐ฏ . โ Let ๐0 = ๐1 ∈ ๐ฏ and ๐1 = ๐2 ∈ ๐ฏ . Hence, we have ๐ฏ โข ๐0 = ๐1 and ๐ฏ โข ๐1 = ๐2 . Then, ๐ฏ โข ๐0 = ๐2 by E3, which yields ๐0 = ๐2 ∈ ๐ฏ . We define the domain of the desired structure to be ๐ถโ∼, the set of equivalence classes modulo ∼ (Definition 4.2.10). Before we define the function ๐, we need some actual functions and relations on ๐ถโ∼. Use the following lemma to define them. LEMMA 7.4.15 Let ๐ฏ be a maximally consistent ๐ฒ-theory and ๐ถ be a set of constant symbols from ๐ฒ. Let ๐ก0 , ๐ก1 , … , ๐ก๐−1 , ๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 be ๐ฒ-terms such that ๐ก0 ∼ ๐ก′0 , ๐ก1 ∼ ๐ก′1 , … , ๐ก๐−1 ∼ ๐ก′๐−1 . โ For every ๐-ary function symbol ๐ , ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ∼ ๐ (๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 ). โ For every ๐-ary relation symbol ๐ , ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ∈ ๐ฏ ⇔ ๐ (๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 ) ∈ ๐ฏ . PROOF By (7.22) we have that ๐ฏ โข ๐ก0 = ๐ก′0 , ๐ฏ โข ๐ก1 = ๐ก′1 , … , ๐ฏ โข ๐ก๐−1 = ๐ก′๐−1 . Let ๐ be an ๐-ary function symbol. Then, ๐ฏ โข ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) = ๐ (๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 ) by E4 (Axioms 5.1.1), so ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) = ๐ (๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 ) ∈ ๐ฏ by Lemma 7.4.5, which implies that ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ∼ ๐ (๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 ). Next, let ๐ be an ๐-ary relation symbol such that ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ∈ ๐ฏ . This implies that ๐ฏ โข ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ), so we have that ๐ฏ โข ๐ (๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 ) by E5. Again, by Lemma 7.4.5, ๐ (๐ก′0 , ๐ก′1 , … , ๐ก′๐−1 ) ∈ ๐ฏ . Section 7.4 THE THREE PROPERTIES REVISITED 403 We now define the functions and relations on the domain ๐ถโ∼ with ๐ฏ being maximally consistent. โ Let ๐ be an ๐-ary function symbol from ๐ฒ. Let ๐0 , ๐1 , … , ๐๐−1 ∈ ๐ถ. Define the ๐-ary function ๐∼ : ๐ถโ∼ → ๐ถโ∼ by ๐∼ ([๐0 ], [๐1 ], … , [๐๐−1 ]) = [๐] ⇔ ๐ (๐0 , ๐1 , … , ๐๐−1 ) = ๐ ∈ ๐ฏ . (7.23) Since ๐∼ is defined on a set of equivalence classes, we must confirm that ๐∼ is well-defined. Let ๐0 , ๐1 , … , ๐๐−1 ∈ ๐ถ such that ๐0 ∼ ๐0 , ๐1 ∼ ๐1 , … , ๐๐−1 ∼ ๐๐−1 . Then, [๐] = [๐] because by Lemma 7.4.15, ๐ (๐0 , ๐1 , … , ๐๐−1 ) ∼ ๐ (๐0 , ๐1 , … , ๐๐−1 ). โ Let ๐ be an ๐-ary relation symbol from ๐ฒ and take ๐0 , ๐1 , … , ๐๐−1 ∈ ๐ถ. Define the ๐-ary relation ๐ ∼ on ๐ถโ∼ by ([๐0 ], [๐1 ], … , [๐๐−1 ]) ∈ ๐ ∼ ⇔ ๐ (๐0 , ๐1 , … , ๐๐−1 ) ∈ ๐ฏ . (7.24) We must check that the relation holds if different constant symbols are used to represent the classes [๐0 ], [๐1 ], … , [๐๐−1 ]. To do this, let ๐0 , ๐1 , … , ๐๐−1 ∈ ๐ถ such that ๐0 ∼ ๐0 , ๐1 ∼ ๐1 , … , ๐๐−1 ∼ ๐๐−1 . Then, we have that ([๐0 ], [๐1 ], … , [๐๐−1 ]) ∈ ๐ ∼ since by Lemma 7.4.15, ๐ (๐0 , ๐1 , … , ๐๐−1 ) ∈ ๐ฏ . The previous work makes the interpretation of the function and relation symbols of ๐ฒ obvious. However, what about the constant symbols? To find out, take a constant symbol ๐ of ๐ฒ. If ๐ ∈ ๐ถ, then ๐ should be interpreted as [๐], so suppose that ๐ ∉ ๐ถ. Because โข ∃๐ฅ(๐ฅ = ๐) and ๐ฏ is maximally consistent, ∃๐ฅ(๐ฅ = ๐) ∈ ๐ฏ . At this point make the further assumption that ๐ถ is a witness set of ๐ฏ . Then, there exists ๐ ∈ ๐ถ such that ๐ =๐ ∈๐ฏ. This implies that [๐] = [๐]. Therefore, for every ๐ ∈ ๐ฒ, there exists ๐∼ ∈ ๐ถ such that [๐] = [๐∼ ]. We can now define the structure. 404 Chapter 7 MODELS DEFINITION 7.4.16 Let ๐ฏ be a maximally consistent set of ๐ฒ-sentences with witness set ๐ถ. Let ∼ be the relation (7.22). Define ๐∼ to be the ๐ฒ-structure with domain ๐ถโ∼ and function ๐ such that โ ๐(๐) = [๐∼ ] for all constant symbols ๐ ∈ ๐ฒ, โ ๐(๐ ) = ๐ ∼ for all ๐-ary relation symbols ๐ ∈ ๐ฒ, โ ๐(๐ ) = ๐∼ for all ๐-ary function symbols ๐ ∈ ๐ฒ. Since we are investigating consistent sets of sentences, we do not need to involve a particular interpretation for the structures in the proofs of the following results (Theorem 7.1.31). Using Definition 7.4.16 will suffice. LEMMA 7.4.17 Suppose that ๐ฏ is a maximally consistent set of ๐ฒ-sentences with ๐ถ as a witness set. Then, ๐∼ โจ ๐ก = ๐ if and only if ๐ก = ๐ ∈ ๐ฏ , for every constant symbol ๐ ∈ ๐ฒ and ๐ฒ-term ๐ก. PROOF Assume that ๐ ∈ ๐ฒ is a constant symbol and let ๐ก be an ๐ฒ-term. Since ๐ฏ is a set of sentences, ๐ก cannot have any free variables. โ Let ๐ก = ๐ for some constant symbol ๐ from ๐ฒ. Then, ๐∼ โจ ๐ = ๐ ⇔ [๐] = [๐] ⇔ ๐ ∼ ๐ ⇔ ๐ = ๐ ∈ ๐ฏ . โ Let ๐ก = ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) for some ๐-ary function symbol ๐ and ๐ฒ-terms ๐ก0 , ๐ก1 , … , ๐ก๐−1 containing no free variables. Assume that ๐∼ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) = ๐. Since ๐ฏ is maximally consistent, ∃๐ฅ(๐ก๐ = ๐ฅ) ∈ ๐ฏ for ๐ = 0, 1, … , ๐ − 1. Since ๐ถ is a witness set, there exists ๐๐ ∈ ๐ถ such that ๐ก๐ = ๐๐ ∈ ๐ฏ , which is equivalent to ๐ก๐ ∼ ๐๐ . Therefore, by induction, ๐∼ โจ ๐ก๐ = ๐๐ . Hence, ๐∼ โจ ๐ (๐0 , ๐1 , … , ๐๐−1 ) = ๐. (7.25) Section 7.4 THE THREE PROPERTIES REVISITED 405 This implies that ๐∼ ([๐0 ], [๐1 ], … , [๐๐−1 ]) = [๐], so by (7.23), ๐ (๐0 , ๐1 , … , ๐๐−1 ) = ๐ ∈ ๐ฏ . Therefore, by Lemma 7.4.15 and (7.25), ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) = ๐ ∈ ๐ฏ . The converse is left to Exercise 4. LEMMA 7.4.18 If ๐ฏ is a consistent set of ๐ฒ-sentences with ๐ถ as a witness set, then ๐ ∈ ๐ฏ if and only if ๐∼ โจ ๐ [๐ผ] for every ๐ฒ-sentence ๐. PROOF We can assume that ๐ฏ is maximally consistent (Exercise 5). Let ๐ถ be a witness set of ๐ฏ . We prove the theorem by induction on formulas. โ Let ๐ก0 = ๐ก1 ∈ ๐ฏ for ๐ฒ-terms ๐ก0 and ๐ก1 with no free variables. Since ๐ฏ is maximally consistent, ∃๐ฅ(๐ก0 = ๐ฅ) ∈ ๐ฏ and ∃๐ฅ(๐ก1 = ๐ฅ) ∈ ๐ฏ . Because ๐ถ is a witness set for ๐ฏ , there exists ๐, ๐ ∈ ๐ถ such that ๐ก0 = ๐ ∈ ๐ฏ and ๐ก1 = ๐ ∈ ๐ฏ . This implies that ๐ก0 ∼ ๐ and ๐ก1 ∼ ๐, so because ๐ก0 ∼ ๐ก1 , [๐] = [๐]. Therefore, ๐∼ โจ ๐ = ๐, but because ๐∼ โจ ๐ก0 = ๐ and ๐∼ โจ ๐ก1 = ๐ (Lemma 7.4.17), ๐∼ โจ ๐ก0 = ๐ก1 . The proof of the converse is Exercise 7(a). โ Let ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ∈ ๐ฏ for some relation symbol ๐ in ๐ฒ and ๐ฒ-terms ๐ก0 , ๐ก1 , … , ๐ก๐−1 with no free variables. As in the first part of this proof, there exist constants ๐๐ ∈ ๐ถ (๐ = 0, 1, … , ๐ − 1) such that ๐ก๐ ∼ ๐๐ . Thus, by Lemma 7.4.15, ๐ (๐0 , ๐1 , … , ๐๐−1 ) ∈ ๐ฏ , and then by the definition of ๐ ∼ , we have that ([๐0 ], [๐1 ], … , [๐๐−1 ]) ∈ ๐ ∼ . Therefore, ๐∼ โจ ๐ (๐0 , ๐1 , … , ๐๐−1 ), 406 Chapter 7 MODELS so ๐∼ โจ ๐ (๐ก0 , ๐ก1 … , ๐ก๐−1 ) [๐ผ] as in first part. The proof of the converse is Exercise 7(b). โ Let ๐ be an ๐ฒ-sentence. Then, by induction and the maximal consistency of ๐ฏ , ¬๐ ∈ ๐ฏ ⇔ ๐ ∉ ๐ฏ ⇔ ๐∼ ฬธโจ ๐ ⇔ ๐∼ โจ ¬๐ [๐ผ]. โ Let ๐ and ๐ be ๐ฒ-sentences. Assume ๐ → ๐ ∈ ๐ฏ and ๐∼ โจ ๐ [๐ผ]. By induction, ๐ ∈ ๐ฏ , so by Lemma 7.4.5 and MP, ๐ ∈ ๐ฏ . Again by induction, ๐∼ โจ ๐ [๐ผ], so ๐∼ โจ ๐ → ๐ [๐ผ]. See Exercise 7(c) for the proof of the converse. โ Let ๐ be an ๐ฒ-formula with at most one free variable ๐ฅ. By induction and the maximal consistency of ๐ฏ , ๐∼ โจ ∃๐ฅ๐ ⇔ ๐ ๐ ∈ ๐ฏ for some ๐ ∈ ๐ถ ⇔ ∃๐ฅ๐ ∈ ๐ฏ . ๐ฅ The hard work of this section leads to the fundamental theorem of model theory. THEOREM 7.4.19 [Henkin] An ๐ฒ-theory ๐ฏ is consistent if and only if ๐ฏ has a model. PROOF Let ๐ฏ be a set of ๐ฒ-sentences. Suppose ๐ฏ is consistent and let ๐ถ be a set of constant symbols not found in ๐ฒ such that |๐ถ| = |๐ซ(๐ฒ)|. By Lemma 7.4.13, there exists a consistent extension ๐ฏ of ๐ฏ such that the elements of ๐ฏ are (๐ฒ ∪ ๐ถ)sentences and ๐ถ is a witness set for ๐ฏ . By Lemma 7.4.18, there exists a model ๐ of ๐ฏ . Since the sentences of ๐ฏ do not contain any of the constants of ๐ถ, the reduct of ๐ to ๐ฒ is a model of ๐ฏ . To see this, let ๐ be the indicated reduct and proceed by induction on formulas. โ Let ๐ก0 and ๐ก1 be ๐ฒ-terms such that (๐ก0 = ๐ก1 ) ∈ ๐ฏ . These terms are also (๐ฒ ∪ ๐ถ)-terms, so since ๐ฏ ⊆ ๐ฏ , we have that ๐ โจ ๐ก1 = ๐ก2 . Thus, there is an (๐ฒ∪๐ถ)-interpretation ๐ผ such that ๐ โจ ๐ก1 = ๐ก2 [๐ผ]. Hence, ๐ผ(๐ก1 ) = ๐ผ(๐ก2 ). Let ๐ผ ′ be the restriction of ๐ผ to ๐ฒ. Since ๐ก1 and ๐ก2 contain no symbols from ๐ถ, ๐ผ ′ (๐ก1 ) = ๐ผ ′ (๐ก2 ), so ๐ โจ ๐ก1 = ๐ก2 [๐ผ ′ ] because ๐ is the reduct of ๐ to ๐ฒ. By Theorem 7.1.31, we have that ๐ โจ ๐ก1 = ๐ก2 . โ That ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) ∈ ๐ฏ implies ๐ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) for any ๐-ary relation symbol ๐ ∈ ๐ฒ and ๐ฒ-terms ๐ก0 , ๐ก1 , … , ๐ก๐−1 is Exercise 8(a). โ Let ¬๐ ∈ ๐ฏ . This implies that ๐ โจ ¬๐. That is, not ๐ โจ ๐. Since ๐ is the reduct of ๐ to ๐ฒ, not ๐ โจ ๐. In other words, ๐ โจ ¬๐. โ That (๐ → ๐) ∈ ๐ฏ implies ๐ โจ ๐ → ๐ for ๐ฒ-sentences ๐ and ๐ is Exercise 8(b). Section 7.4 THE THREE PROPERTIES REVISITED 407 โ Suppose that ∃๐ฅ๐ ∈ ๐ฏ , where ๐ = ๐(๐ฅ) is an ๐ฒ-formula. Since ๐ถ is a ๐ witness set and ๐ฏ is maximally consistent, ๐ ∈ ๐ฏ for some ๐ ∈ ๐ถ. By ๐ฅ induction, ๐ ๐โจ๐ . ๐ฅ Let ๐ผ be an interpretation of ๐. Then, ๐ โจ ๐ [๐ผ๐ฅ๐(๐) ], so we conclude that ๐ โจ ∃๐ฅ๐. To prove the converse, suppose that ๐ฏ โข ๐ ∧ ¬๐ for some ๐ฒ-sentence ๐. By the soundness theorem (7.4.9), ๐ฏ โจ ๐ ∧ ¬๐. Hence, if ๐ is an ๐ฒ-structure such that ๐ โจ ๐ฏ , then ๐ โจ ๐ ∧ ¬๐, which implies that ๐ โจ ๐ and not ๐ โจ ๐. Hence, ๐ฏ does not have a model. Henkin’s theorem (7.4.19) is used to prove the converse to the soundness theorem (7.4.9). COROLLARY 7.4.20 [Completeness] Let ๐ฏ be a consistent set of ๐ฒ-sentences. For any ๐ฒ-sentence ๐, if ๐ฏ โจ ๐, then ๐ฏ โข ๐. PROOF Suppose that ๐ฏ โฌ ๐. This implies that ๐ฏ ∪ {¬๐} is consistent, so by Henkin’s theorem (7.4.19), there exists an ๐ฒ-structure ๐ such that ๐ โจ ๐ฏ ∪ {¬๐}. Hence, ๐ฏ ฬธโจ ๐. COROLLARY 7.4.21 First-order logic is complete. The next corollary was first proved by Kurt Gödel (1929). It was his doctoral dissertation. COROLLARY 7.4.22 [Gödel’s Completeness Theorem] An ๐ฒ-sentence is a theorem if and only if it is valid. COROLLARY 7.4.23 For all sets of ๐ฒ-sentences ๐ฏ and ๐ฒ-sentences ๐, ๐ฏ โข ๐ if and only if ๐ฏ โจ ๐. The use of models to show consistency is now a common technique in mathematical logic. An early example was Hilbert’s use of โ × โ to show that Euclidean geometry is consistent (Hilbert 1899). That is, โ × โ serves as the domain of a structure that models the postulates of Euclidean geometry. The proof relies on the consistency of the properties of โ × โ, a fact that is left unproved. A later usage is the model used by Gödel that shows that both ๐ข๐ง and the axiom of choice (5.1.10) are consistent with ๐ญ๐ (Gödel 1940). Both are examples of relative consistency proofs. Gödel assumed the consistency of ๐ญ๐, so he had a model for it. From this he defined another model that 408 Chapter 7 MODELS satisfies ๐ข๐ง and, in addition, the axiom of choice. We represent this by writing ๐ข๐๐(๐ญ๐) → ๐ข๐๐(๐ญ๐๐ + ๐ข๐ง). This is a partial answer to one of the ten then unsolved problems that Hilbert presented at the International Congress of Mathematicians at Paris in 1900. An expanded list was published that year in Germany with an English translation released two years later (Hilbert 1902). Hilbert’s intention was to outline the important problems that mathematicians should strive to prove in the twentieth century. The first problem involved the continuum hypothesis. Namely, it should be determined whether as regards equivalence there are . . . only two assemblages of numbers, the countable assemblage and the continuum. Hilbert’s second problem dealt with the axioms of arithmetic. Specifically, he thought that any of the chosen axioms should not be provable from the others. That is, they should be independent. More importantly, mathematicians should seek [t]o prove that they [the axioms] are not contradictory, that is, that a finite number of logical steps based upon them can never lead to contradictory results. In this problem we see the notions of finite proof (Definition 1.2.13) and consistency (Definitions 1.5.1 and 7.4.2), which would influence to the development of model theory some 20–30 years after Hilbert’s talk and lead to some interesting results. Exercises 1. Is ∅ consistent? Explain. 2. Prove Theorem 7.4.3. 3. Prove that the group axioms (7.1.14) and the ring axioms (Axioms 7.1.36) are consistent. 4. From the proof of Lemma 7.4.17, prove that if ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) = ๐ ∈ ๐ฏ , then ๐∼ โจ ๐ (๐ก0 , ๐ก1 , … , ๐ก๐−1 ) = ๐ for all constant symbols ๐, ๐-ary function symbols ๐ , and ๐ฒ-terms ๐ก0 , ๐ก1 , … , ๐ก๐−1 . 5. In Lemma 7.4.18, why can ๐ฏ be assumed to be maximally consistent? 6. Let ๐ be an ๐ฒ-structure. Define ๐ณ๐(๐) to be the set of ๐ฒ-sentences that are satisfied by ๐. Prove that ๐ณ๐(๐) is maximally consistent. 7. Prove the given results from the proof of Lemma 7.4.18. (a) If ๐∼ โจ ๐ก0 = ๐ก1 , then ๐ก0 = ๐ก1 ∈ ๐ฏ for all ๐ฒ-terms ๐ก0 and ๐ก1 with no free variables. (b) For all ๐ฒ-terms ๐ก0 , ๐ก1 … , ๐ก๐−1 with no free variables, ๐ (๐ก0 , ๐ก1 … , ๐ก๐−1 ) ∈ ๐ฏ if ๐∼ โจ ๐ (๐ก0 , ๐ก1 … , ๐ก๐−1 ). (c) ๐∼ โจ ๐ → ๐ implies that ๐ → ๐ ∈ ๐ฏ for all ๐ฒ-sentences ๐ and ๐. 8. Show the following from the proof of Henkin’s theorem (7.4.19). (a) If ๐ (๐ก1 , … , ๐ก๐ ) ∈ ๐ฏ , then ๐ โจ ๐ (๐ก1 , … , ๐ก๐ ) for any relation symbol ๐ ∈ ๐ฒ and ๐ฒ-terms ๐ก1 , … , ๐ก๐ . Section 7.5 MODELS OF DIFFERENT CARDINALITIES 409 (b) If ๐ → ๐ ∈ ๐ฏ , then ๐ โจ ๐ → ๐ for all ๐ฒ-sentences ๐ and ๐. 9. Prove Corollary 7.4.22. 10. Let ๐ฏ and ๐ฐ be consistent sets of ๐ฒ-sentences. Prove that if ๐ฏ ∪๐ฐ is inconsistent, then there exists an ๐ฒ-sentence ๐ such that ๐ฏ โจ ๐ but ๐ฐ ฬธโจ ๐. 11. We say that an ๐ฒ-theory ๐ฏ is closed under deductions if for all ๐ฒ-sentences ๐, if ๐ฏ โข ๐, then ๐ ∈ ๐ฏ . (a) If a theory is closed under deductions, is the theory maximally consistent? (b) Let ๐ฝ be an ordinal and suppose that {๐ฏ๐ผ : ๐ผ ∈ ๐ฝ} is a family of ๐ฒ-theories such that for all ๐พ, ๐ฟ ∈ ๐ฝ, ifโ๐พ ∈ ๐ฟ, then ๐ฏ๐พ ⊆ ๐ฏ๐ฟ . Prove that if ๐ฏ๐ผ is consistent for all ๐ผ ∈ ๐ฝ, then ๐ผ∈๐ฝ ๐ฏ๐ผ is consistent. 12. A theory ๐ฏ is finitely axiomatizable if there exists an ๐ฒ-sentence ๐ such that for all ๐ฒ-sentences ๐, ๐ฏ โจ ๐ if and only if ๐ โจ ๐. Let {๐ฏ๐ : ๐ ∈ ๐} be a chain of finitely axiomatizable ๐ฒ-theories such that ๐ ≤ ๐ implies that ๐ฏ๐ ⊆ ๐ฏ๐ . Suppose that for all โ ๐ ∈ ๐, there exists a model of ๐ฏ๐ that is not a model of ๐ฏ๐+1 . Prove that ๐∈๐ ๐ฏ๐ is not finitely axiomatizable. 13. Suppose that ๐ฏ1 and ๐ฏ2 are ๐ฒ-theories. Let ๐ and ๐ be ๐ฒ-structures. Prove the following. (a) If ๐ฏ1 ⊆ ๐ฏ2 , then every model of ๐ฏ2 is a model of ๐ฏ1 . (b) If ๐ ⊆ ๐ , then ๐ณ๐(๐ ) ⊆ ๐ณ๐(๐). (c) If ๐ โจ ๐ฏ1 ∪ ๐ฏ2 , then ๐ โจ ๐ฏ1 and ๐ โจ ๐ฏ2 . (d) ๐ โจ ๐ฏ1 if and only if ๐ฏ1 ⊆ ๐ณ๐(๐). 7.5 MODELS OF DIFFERENT CARDINALITIES The natural numbers and their basic operations of addition and subtraction can be defined using the axioms of ๐ญ๐๐ (Section 5.2). It can then be proved that ๐ has basically the same algebraic properties as ๐. Another approach to studying the natural numbers is to break down the subject to its most basic parts. Think about addition of natural numbers and how it is first explained. Adding 4 to 3 to obtain 7, for example, means starting at 3 and adding 1 four times in sequence: 3 + 1 = 4, 4 + 1 = 5, 5 + 1 = 6, 6 + 1 = 7. Adding 1 simply means moving to the next natural number, so to add any two natural numbers, all one needs is to know the numerals and have the ability to count (compare Definition 5.2.15). Although not efficient, this will do. Multiplication, the other operation, is based on addition. Multiplying 4 by 3 means writing 4 down 3 times and then adding: 4 + 4 + 4 = 12. 410 Chapter 7 MODELS This means that multiplication is also based on the ability to count (compare Definition 5.2.18). This ability is represented by the successor function, ๐๐ = ๐ + 1, which is the basis of arithmetic (compare Definition 5.2.1). Thus, to understand the natural numbers and how they work, we simply need a successor function that satisfies the right rules. Peano Arithmetic Using the theory symbols ๐ ๐ฑ (Example 2.1.4), we make the following axioms. They are a minor modification of the axioms for the system of arithmetic found in Arithmetices principia by Giuseppe Peano (1889). AXIOMS 7.5.1 โ P1. ∀๐ฅ(¬ ๐๐ฅ = 0) โ P2. ∀๐ฅ∀๐ฆ(๐๐ฅ = ๐๐ฆ → ๐ฅ = ๐ฆ) โ P3. For every ๐ ๐ฑ-formula ๐ with free variable ๐ฅ, ๐(0) ∧ ∀๐ฅ[๐(๐ฅ) → ๐(๐๐ฅ)] → ∀๐ฅ๐(๐ฅ). Denote {P1, P2, P3} by ๐ฃ. If we define ๐ญ(๐) = + , ๐ญ(0) = ∅, (7.26) then Theorem 5.2.2, Theorem 5.2.6, and Corollary 5.2.12 imply that ๐ = (๐, ๐ญ) is a model of ๐ฃ. Because of the existence of a model for these axioms, which was constructed using ๐ญ๐๐, we conclude the following (Theorem 7.4.19). THEOREM 7.5.2 The consistency of ๐ฃ is a consequence of ๐ญ๐๐. Since the Peano axioms are intended to be the assumptions of number theory, ๐ ๐ฑ is often extended to ๐ ๐ฑ′ (Example 2.1.4) and the sentences of Axioms 7.5.1 broadened to include axioms involving +, ⋅, and <. These axioms serve as the foundation of number theory. AXIOMS 7.5.3 [Peano] โ PA1. ∀๐ฅ(¬ ๐๐ฅ = 0) โ PA2. ∀๐ฅ∀๐ฆ(๐๐ฅ = ๐๐ฆ → ๐ฅ = ๐ฆ) โ PA3. ∀๐ฅ(๐ฅ + 0 = ๐ฅ) Section 7.5 MODELS OF DIFFERENT CARDINALITIES 411 โ PA4. ∀๐ฅ∀๐ฆ[๐ฅ + ๐๐ฆ = ๐(๐ฅ + ๐ฆ)] โ PA5. ∀๐ฅ(๐ฅ ⋅ 0 = 0) โ PA6. ∀๐ฅ∀๐ฆ(๐ฅ ⋅ ๐๐ฆ = ๐ฅ ⋅ ๐ฆ + ๐ฅ) โ PA7. ∀๐ฅ(¬ ๐ฅ < 0) โ PA8. ∀๐ฅ∀๐ฆ[๐ฅ < ๐๐ฆ ↔ (๐ฅ < ๐ฆ ∨ ๐ฅ = ๐ฆ)] โ PA9. ∀๐ฅ∀๐ฆ(๐ฅ < ๐ฆ ∨ ๐ฅ = ๐ฆ ∨ ๐ฆ < ๐ฅ) โ PA10. For every ๐ ๐ฑ′ -formula ๐ with free variable ๐ฅ, ๐(0) ∧ ∀๐ฅ[๐(๐ฅ) → ๐(๐๐ฅ)] → ∀๐ฅ๐(๐ฅ). Let ๐ฃ๐ denote the Peano axioms (7.5.3). We call the set of consequences of the Peano axioms Peano arithmetic. To find a model for these axioms, extend the function ๐ญ (7.26) to ๐ญ′ so that ๐ญ′ (+) is the addition of Definition 5.2.15, ๐ญ′ (⋅) is the multiplication of Definition 5.2.19, and using the order of Definition 5.2.10, ๐ญ′ (<) = {(๐, ๐) ∈ ๐ × ๐ : ๐ ∈ ๐}. That ๐ญ′ (<) satisfies PA7–PA9 is left to Exercise 2. Then, as before, ๐′ = (๐, ๐ญ′ ) is a model of the Peano axioms, which is an expansion of the model ๐, and we can again apply Theorem 7.4.19 to obtain a consistency result. THEOREM 7.5.4 The consistency of ๐ฃ๐ is a consequence of ๐ญ๐๐. We call ๐′ and any ๐ ๐ฑ′ -structure isomorphic to it a standard model of Peano arithmetic. The the elements of the domain of any standard model are called standard numbers. Any model of Peano arithmetic that is not isomorphic to ๐′ is called a nonstandard model of Peano arithmetic. Observe that ๐ฃ and ๐ฃ๐ are sets of axioms separate from ๐ญ๐๐. However, because of the work of Section 5.2, both ๐ฃ and ๐ฃ๐ can be viewed as consequences of ๐ญ๐๐. Specifically, if P1 is replaced by P2 is replaced by ∀๐ฅ(๐ฅ+ ≠ ∅), (7.27) ∀๐ฅ∀๐ฆ(๐ฅ+ = ๐ฆ+ → ๐ฅ = ๐ฆ), (7.28) ๐(∅) ∧ ∀๐ฅ[๐(๐ฅ) → ๐(๐ฅ+ )] → ∀๐ฅ๐(๐ฅ) (7.29) and P3 is replaced by for every ๐ฒ๐ณ-formula ๐ with free variable ๐ฅ, then ๐ญ๐๐ โจ {(7.27), (7.28), (7.27)}. 412 Chapter 7 MODELS That is, a copy of ๐ฃ is found among the consequences of ๐ญ๐๐. Also, using Definitions 5.2.15 and 5.2.18, it can be shown that a copy of Peano arithmetic is found among the consequences of ๐ญ๐๐. We will not repeat all our number theory work in Peano arithmetic, although it is possible to do so. Instead, we give a sample of some basic results to illustrate that the two systems are the same. Because of the work in Section 5.2, the addition and multiplication of Peano arithmetic are commutative, associative, satisfy the distributive and cancellation laws, and have identities in the standard model. That these properties hold in every model of Peano arithmetic is a separate issue, yet their proofs are similar to the work of Section 5.2. THEOREM 7.5.5 The following ๐ ๐ฑ′ -sentences are theorems of ๐ฃ๐. โ Associative Laws ∀๐ฅ∀๐ฆ∀๐ง[๐ฅ + (๐ฆ + ๐ง) = (๐ฅ + ๐ฆ) + ๐ง] ∀๐ฅ∀๐ฆ∀๐ง[๐ฅ ⋅ (๐ฆ ⋅ ๐ง) = (๐ฅ ⋅ ๐ฆ) ⋅ ๐ง] โ Commutative Laws ∀๐ฅ∀๐ฆ(๐ฅ + ๐ฆ = ๐ฆ + ๐ฅ) ∀๐ฅ∀๐ฆ(๐ฅ ⋅ ๐ฆ = ๐ฆ ⋅ ๐ฅ) โ Additive Identity ∀๐ฅ(0 + ๐ฅ = ๐ฅ) โ Multiplicative Identity ∀๐ฅ(๐0 ⋅ ๐ฅ = ๐ฅ) โ Distributive Law ∀๐ฅ∀๐ฆ∀๐ง[๐ฅ ⋅ (๐ฆ + ๐ง) = ๐ฅ ⋅ ๐ฆ + ๐ฅ ⋅ ๐ง] PROOF We prove the first associative law and the multiplicative identity property, leaving the others to Exercise 9. โ Define ๐(๐ฅ) := ∀๐ง[๐ฅ + (๐ฆ + ๐ง) = (๐ฅ + ๐ฆ) + ๐ง]. By PA3, we have ๐(0) because ๐ฅ + (๐ฆ + 0) = (๐ฅ + ๐ฆ) + 0. Next, assume ๐(๐). Since ๐ is a function symbol, PA4 yields ๐ [๐ฅ + (๐ฆ + ๐)] = ๐ [(๐ฅ + ๐ฆ) + ๐] , ๐ฅ + ๐(๐ฆ + ๐) = (๐ฅ + ๐ฆ) + ๐๐, ๐ฅ + (๐ฆ + ๐๐) = (๐ฅ + ๐ฆ) + ๐๐. Section 7.5 MODELS OF DIFFERENT CARDINALITIES 413 Therefore, ๐(๐๐), so the associative law for multiplication holds by PA10. โ Assuming that the commutative laws have already been proved, PA6, PA5, and PA3 imply that ๐0 ⋅ ๐ฅ = ๐ฅ ⋅ ๐0 = ๐ฅ ⋅ 0 + ๐ฅ = 0 + ๐ฅ = ๐ฅ + 0 = ๐ฅ, so ๐0 ⋅ ๐ฅ = ๐ฅ by E3 (Axioms 5.1.1). To ease our notation when dealing with numbers other than 1 = ๐0, define ๐ 0๐ฅ = ๐ฅ and for all ๐ ∈ ๐ โงต {0}, ๐ ๐ = ๐๐ … ๐ ๐ฅ. โโโโโ ๐ times We will use this notation in the next example. (7.30) (7.31) EXAMPLE 7.5.6 In the formula, (3 + 2) + 1 = (1 + 3) + 2, (7.32) two properties from Theorem 7.5.5 are being applied. First, we conclude that (3 + 2) + 1 = 1 + (3 + 2) by the commutative law. The associative law is then used to draw the final conclusion. Notice that (7.32) is the standard interpretation of ++๐๐๐0๐๐0๐0 = +๐0๐๐๐0 + ๐๐0, which, using (7.31), is equivalent to ++๐ 3 0๐ 2 0๐ 1 0 = +๐ 1 0๐ 3 0 + ๐ 2 0. Also, the distributive law is applied in the formula 2 ⋅ ๐ฅ + 3 ⋅ ๐ฅ = 5 ⋅ ๐ฅ, because 2 ⋅ ๐ฅ + 3 ⋅ ๐ฅ = ๐ฅ ⋅ 2 + ๐ฅ ⋅ 3 = ๐ฅ ⋅ (2 + 3) = ๐ฅ ⋅ 5 = 5 ⋅ ๐ฅ. Given an equation like 5 ⋅ ๐ฅ + 1 = 11, (7.33) the standard routine is to solve it like this: 5 ⋅ ๐ฅ + 1 = 11, 5 ⋅ ๐ฅ = 10, ๐ฅ = 2. In the first step, −1 was added to both sides, and in the second, 1โ5 was multiplied by both sides. However, only 0 has an additive inverse in Peano arithmetic, and only 1 has a multiplicative inverse. The way around this problem are cancellation laws. 414 Chapter 7 MODELS THEOREM 7.5.7 [Cancellation] The following ๐ ๐ฑ′ -sentences are theorems of ๐ฃ๐. โ ∀๐ฅ∀๐ฆ∀๐ง[(๐ฅ + ๐ง = ๐ฆ + ๐ง) → ๐ฅ = ๐ฆ] โ ∀๐ฅ∀๐ฆ∀๐ง[(๐ฅ ⋅ ๐ง = ๐ฆ ⋅ ๐ง ∧ ¬๐ง = 0) → ๐ฅ = ๐ฆ]. PROOF The proof of the cancellation law for multiplication is Exercise 10. For addition, define ๐(๐ง) := ∀๐ฅ∀๐ฆ(๐ฅ + ๐ง = ๐ฆ + ๐ง → ๐ฅ = ๐ฆ). First, notice that if ๐ฅ + 0 = ๐ฆ + 0, ๐ฅ = ๐ฅ + 0 = ๐ฆ + 0 = ๐ฆ, where the first and last equality hold by PA3. Therefore, ๐(0). For the induction step, suppose ๐(๐). Then, by PA4 and PA2, ๐ฅ + ๐๐ = ๐ฆ + ๐๐, ๐(๐ฅ + ๐) = ๐(๐ฆ + ๐), ๐ฅ + ๐ = ๐ฆ + ๐, ๐ฅ = ๐ฆ, where the last step follows by ๐(๐). Therefore, ∀๐ง๐(๐ง) by PA10. Therefore, in Peano arithmetic, solve (7.33) like this: 5 ⋅ ๐ฅ + 1 = 11, 5 ⋅ ๐ฅ + 1 = 10 + 1, 5 ⋅ ๐ฅ = 10, 5 ⋅ ๐ฅ = 5 ⋅ 2, ๐ฅ = 2. The third and last equations follow by cancellation (Theorem 7.5.7). Compactness Theorem Having a model that satisfies ๐ฃ or ๐ฃ๐ means that basic arithmetic can be done in this model. We know that this happens in the standard model (assuming ๐ญ๐๐), so we now want to know whether there are any nonstandard models of Peano arithmetic. To be successful in our search, we turn to some theorems that follow from Henkin’s theorem (7.4.19). The first states that the existence of a model rests on the finite. It was first proved by Gödel for countable first-order theories (1930) and by Anatolij Mal’tsev for arbitrary first-order theories (1936). Section 7.5 MODELS OF DIFFERENT CARDINALITIES 415 THEOREM 7.5.8 [Compactness] An ๐ฒ-theory ๐ฏ has a model if and only if every finite subset of ๐ฏ has a model. PROOF Since sufficiency is clear, let ๐ฏ have the property that every finite subset has a model. This implies that every finite subset of ๐ฏ is consistent by Henkin’s theorem. Therefore, ๐ฏ is consistent by Theorem 7.4.3, so ๐ฏ also has a model by Henkin’s Theorem. We apply the compactness theorem to find another model of ๐ฃ๐. Let ๐ be a constant symbol other than 0. For every natural number ๐, use (7.30) and (7.31) to define the set of (๐ ๐ฑ′ ∪ {๐})-sentences โฑ๐ = ๐ฃ๐ ∪ {๐ ๐ 0 < ๐ : ๐ ∈ ๐ ∧ ๐ ≤ ๐}, and let ๐๐ = ({0, 1, … , ๐ + 1}, ๐), where ๐ ๐ ๐ฑ′ = ๐ญ′ and ๐(๐) = ๐ + 1. Observe that ๐๐ โจ โฑ๐ , so โฑ๐ is consistent for all ๐ ∈ ๐, which implies that it has a model by Henkin’s Theorem (7.4.19). Therefore, โ โฑ = โฑ๐ ๐∈๐ has a model by the compactness theorem (7.5.8). Let ๐ be the reduct of this model to ๐ ๐ฑ′ . Notice that ๐ โจ ๐ฃ๐, but because it has an element that is interpreted to be greater than every element of its domain, ๐ is a nonstandard model of Peano arithmetic (Theorem 7.3.24). Löwenheim–Skolem Theorems Now that we know that a nonstandard model of Peano arithmetic exists under ๐ญ๐๐, we want to know if there are others. For this, we need a definition. The power of an ๐ฒ-structure ๐ is denoted by |๐| and refers to the cardinality of the domain of ๐. This means that a model is countable if and only if its power is countable. We introduce a sequence of theorems related to this. They are due to Skolem (1922), Tarski and Vaught (1957), and Leopold Löwenheim (1915). THEOREM 7.5.9 [Downward Löwenheim–Skolem] Every consistent ๐ฒ-theory has a model with power of at most |๐ซ(๐ฒ)|. 416 Chapter 7 MODELS PROOF Recall these facts from the proof of Henkin’s theorem (7.4.19). โ ๐ถ is a set of new constant symbols such that |๐ถ| = |๐ซ(๐ฒ)|. โ ๐ is a (๐ฒ ∪ ๐ถ)-structure. โ ๐ is the reduct of ๐ to ๐ฒ. We can assume that ๐ has the property that every element of the domain of ๐ is the interpretation of a constant of ๐ถ. Then, since ๐ and ๐ have the same power, |๐ | ≤ |๐ซ(๐ฒ ∪ ๐ถ)| = |๐ซ(๐ฒ)|. Since the witness set of a single sentence is finite and there are only finitely many symbols in a sentence, ๐ฒ ∪ ๐ถ can be assumed to be finite in the proof of the downward Löwenheim–Skolem theorem (7.5.9). If this is the case, |๐ซ(๐ฒ ∪ ๐ถ)| = |๐ซ(๐ฒ)| = ℵ0 , and we have the next corollary. COROLLARY 7.5.10 [Löwenheim] If an ๐ฒ-sentence has a model, it has a countable model. Since ๐ซ(๐ ๐ฑ′ ) is countable, Theorem 7.5.9 also implies the following. COROLLARY 7.5.11 [Skolem] ๐ญ๐๐ implies that there is a countable nonstandard model of Peano arithmetic. In fact, although we do not prove it here, ๐ญ๐๐ implies that there are 2ℵ0 countable nonstandard models of Peano arithmetic. The title of Theorem 7.5.9 suggests the existence of the following theorem. THEOREM 7.5.12 [Upward Löwenheim–Skolem] If an ๐ฒ-theory has an infinite model, it has a model of cardinality ๐ for every ๐ ≥ |๐ซ(๐ฒ)|. PROOF Let ๐ฏ be an ๐ฒ-theory with an infinite model ๐ = (๐ด, ๐). Take ๐ ≥ |๐ซ(๐ฒ)|. Choose a set ๐ถ = {๐๐ผ : ๐ผ ∈ ๐ } of distinct constant symbols not found in ๐ฒ. Extend ๐ฏ to the (๐ฒ ∪ ๐ถ)-theory ๐ฏ by defining ๐ฏ = ๐ฏ ∪ {๐๐ผ ≠ ๐๐ : ๐ผ ∈ ๐ฝ ∈ ๐ }. Let ๐ฎ be a finite subset of ๐ฏ . This implies that there exists ๐ถ ′ = {๐๐ผ0 , ๐๐ผ1 , … , ๐๐ผ๐−1 } ⊆ ๐ถ such that the constants of the sentences of ๐ฎ are among the constants of ๐ถ ′ . Expand the ๐ฒ-structure ๐ to the (๐ฒ ∪ {๐๐ผ0 , ๐๐ผ1 , … , ๐๐ผ๐−1 })-structure ๐′ = (๐ด, ๐′ ), Section 7.5 MODELS OF DIFFERENT CARDINALITIES where ๐′ ๐ฒ = ๐ and 417 ๐′ (๐๐ผ๐ ) ∈ ๐ด for all ๐ = 0, 1, … , ๐ − 1. Since ๐ด is infinite, we further assume that ๐′ (๐๐ผ0 ), ๐′ (๐๐ผ1 ), … , ๐′ (๐๐ผ๐−1 ) are distinct. Therefore, ๐′ โจ ๐ฎ , so by the compactness theorem (7.5.8), there exists a model ๐ of ๐ฏ , and due to the interpretation of the new constants, the power of ๐ is ๐ . Thus, the reduct of ๐ to ๐ฒ is a model of ๐ฏ and has power ๐ . The upward Löwenheim–Skolem theorem with ๐ญ๐๐ implies that the Peano axioms have models of all infinite cardinalities, which are nonstandard by definition. The von Neumann Hierarchy We constructed a model of ๐ฃ๐ using ๐ญ๐๐. If we can find models of ๐ญ๐๐, we would have other models of ๐ฃ๐, plus prove the consistency of ๐ญ๐๐. In order to find models of ๐ญ๐๐, we begin by searching for models of individual axioms using a definition due to von Neumann (1929). The objective of the definition is to construct a sequence of sets that have the property that every set is in one of the stages of the sequence. However, since von Neumann’s definition used functions instead of sets, it was Zermelo (1930) who gave it its more recognizable form. While von Neumann left his base stage empty, Zermelo allowed the first set of the sequence to contain objects, which are called urelements, that were not sets yet were allowed to be elements of sets (Exercise 20). These two approaches are combined in the following definition, which is named after von Neumann. DEFINITION 7.5.13 Let ๐0 = ∅. Let ๐ผ be an ordinal. โ ๐๐ผ+1 = P(๐๐ผ ). โ โ ๐๐ผ = ๐ฝ<๐ผ ๐๐ฝ if ๐ผ is a limit ordinal. This is called the von Neumann hierarchy. As proved in Exercise 22, ๐๐ผ is a set for every ordinal ๐ผ by the empty set, union, power set, and replacement axioms (5.1.2, 5.1.6, 5.1.7, 5.1.9). Thus, every element of ๐๐ผ is a set. For example, ๐1 = {∅}, ๐2 = {∅, {∅}}, ๐3 = {∅, {∅}, {{∅}}, {∅, {∅}}} โฎ The sets in ๐๐ are finite and called the hereditarily finite sets. There are countably many of these sets because ๐๐ is countable for each ๐ ∈ ๐ (Theorem 6.3.17), while 418 Chapter 7 MODELS |๐๐+1 | = 2ℵ0 . Observe that the cardinality of ๐๐ for each ๐ ∈ ๐ is finite but grows very quickly. |๐0 | = 0, |๐1 | = 1, |๐2 | = 2, |๐3 | = 22 , 2 |๐4 | = 22 , 22 |๐5 | = 22 , 2 22 |๐6 | = 22 โฎ Before we can use individual stages of von Neumann’s hierarchy, we need to know some of their key properties. For this, we start with some lemmas. LEMMA 7.5.14 Let ๐ผ and ๐ฝ be ordinals. โ ๐๐ผ is a transitive set. โ If ๐ผ ⊆ ๐ฝ, then ๐๐ผ ⊆ ๐๐ฝ . PROOF Let ๐ be a limit ordinal containing ๐ผ and ๐ฝ. Define ๐ด = {๐ ∈ ๐ : ๐๐ is transitive}. Assume that ๐๐พ๐(๐ด, ๐ฟ) ⊆ ๐ด for the ordinal ๐ฟ ∈ ๐. Let ๐ต ∈ ๐๐ฟ . โ ๐0 is transitive because ๐0 = ∅. โ Suppose that ๐ฟ = ๐พ + for some ordinal ๐พ. By definition, ๐ต ∈ P(๐๐พ ), so ๐ต ⊆ ๐๐พ . Take ๐ฅ ∈ ๐ต. This implies that ๐ฅ ∈ ๐๐พ . Because ๐๐พ is transitive, we have that ๐ฅ ⊆ ๐๐พ . Hence, ๐ฅ ∈ P(๐๐พ ) = ๐๐ฟ , so ๐ต ⊆ ๐๐ฟ . โ Let ๐ฟ be a limit ordinal. Then, there exists ๐พ ∈ ๐ฟ such that ๐ต ∈ ๐๐พ . Since ๐๐พ is transitive by hypothesis, ๐ต ⊆ ๐๐พ ⊆ ๐๐ฟ . We conclude that ๐ฟ ∈ ๐ด. Thus, by transfinite induction (Theorem 6.1.18), ๐ด = ๐ and ๐๐ผ is transitive. Now, suppose ๐ผ ⊆ ๐ฝ and take ๐ฅ ∈ ๐๐ผ . Since ๐๐ผ ∈ P(๐๐ผ ) ⊆ ๐๐ฝ , ๐๐ผ ∈ ๐๐ฝ . However, the first part shows that ๐๐ฝ is transitive, so ๐๐ผ ⊆ ๐๐ฝ . Section 7.5 MODELS OF DIFFERENT CARDINALITIES 419 LEMMA 7.5.15 If ๐ผ is an ordinal, then ๐ผ ∈ ๐๐ผ+ . PROOF Let ๐ผ be an ordinal and take ๐ to be a limit ordinal such that ๐ผ ∈ ๐ and proceed by transfinite induction. โ Since ๐0 = ∅, we have that ∅ ∈ ๐1 . โ Suppose that ๐ผ ∈ ๐๐ผ+ . Since ๐๐ผ+ is transitive (Lemma 7.5.14), ๐ผ ⊂ ๐๐ผ+ . Also, {๐ผ} ⊂ ๐๐ผ+ . Therefore, ๐ผ ∪ {๐ผ} ⊂ ๐๐ผ+ , so ๐ผ + ∈ ๐๐ผ ++ . โ Let ๐ผ be a limit ordinal and assume that ๐ฝ ∈ ๐๐ฝ + for all ๐ฝ ∈ ๐ผ. That is, ๐ฝ ⊂ ๐๐ฝ + for all ๐ฝ ∈ ๐ผ since each ๐๐ฝ + is transitive. Hence, ๐ผ= โ ๐ฝ∈๐ผ ๐ฝ⊆ โ ๐๐ฝ + = ๐๐ผ , ๐ฝ∈๐ผ which implies that ๐ผ ∈ ๐๐ผ+ . The definition of the von Neumann hierarchy along with the fact that every ordinal belongs to a member of the hierarchy suggests that many, if not all, sets also belong to the hierarchy. For this reason, we define the following. DEFINITION 7.5.16 Let ๐ฉ denote the collection of all sets ๐ด such that ๐ด ∈ ๐๐ผ for some ordinal ๐ผ. It is the case that all sets belong to the hierarchy. This is the next theorem. Its proof is aided by the use of two terms and a lemma. โ A set ๐ด is grounded if there exists an ordinal ๐ผ such that ๐ด ⊆ ๐๐ผ . โ The transitive closure of ๐ด is TC(๐ด) = {๐ข : ∀๐ฃ(๐ด ∈ ๐ฃ ∧ ๐ฃ is transitive → ๐ข ∈ ๐ฃ)}. As the name implies, TC(๐ด) is a transitive set (Exercise 24). The proof of the next lemma is left to Exercise 21. LEMMA 7.5.17 Every element of ๐ด is grounded if and only if ๐ด is grounded. THEOREM 7.5.18 For every set ๐ด, there exists an ordinal ๐ผ such that ๐ด ∈ ๐๐ผ . 420 Chapter 7 MODELS PROOF Suppose that ๐ด is a set such that ๐ด ∉ ๐๐ผ for all ordinals ๐ผ. If ๐ด ⊆ ๐๐ฝ for some ordinal ๐ฝ, then ๐ด ∈ ๐๐ฝ+ , which contradicts the hypothesis. Hence, ๐ด is not grounded, which implies that {๐ด} is not grounded (Lemma 7.5.17). Define ๐ต = {๐ข ∈ TC({๐ด}) : ๐ข is not grounded}. Since ๐ต ≠ ∅, the regularity axiom (5.1.15) implies that there exists ๐ถ ∈ ๐ต such that ๐ถ ∩ ๐ต = ∅. Let ๐ฅ ∈ ๐ถ. Since transitive closures are transitive, ๐ฅ ∈ TC(๐ด). However, ๐ฅ ∉ ๐ต, so ๐ฅ is grounded. From this we conclude that ๐ถ is grounded by Lemma 7.5.17, a contradiction. Therefore, every set is in ๐ฉ, but we know by Corollary 5.1.17 that ๐ฉ is not a set in that it cannot be built using ๐ญ๐๐. However, we sometimes want to refer to such collections even though they are not sets. For this reason the term class was introduced, so we call ๐ฉ the class of all sets. We are now ready to use sets from the von Neumann hierarchy to serve as models for axioms from ๐ญ๐๐ (Section 5.1). DEFINITION 7.5.19 Let ๐ผ be an ordinal. Define the ๐ฒ๐ณ-structure ๐๐ผ = (๐๐ผ , ∈). Consider ๐3 = ({∅, {∅}, {{∅}}, {∅, {∅}}}, ∈). (7.34) The elements of ๐3 are the sets of the model. These elements are equal exactly when they share the same elements (Definition 3.3.7), so ๐3 โจ extensionality axiom. Because the union of any two elements of ๐3 is an element of ๐3 , such as ∅ ∪ {∅} = {∅} and {{∅}} ∪ {∅, {∅}} = {∅, {∅}}, we see that ๐3 โจ union axiom. If we take a ๐ฒ๐ณ-formula ๐(๐ฅ) and ๐ด ∈ ๐3 , then {๐ฅ : ๐ฅ ∈ ๐ด ∧ ๐(๐ฅ)} ∈ ๐3 . Hence, ๐3 โจ subset axioms. Because ๐3 is finite, and we have that ๐3 โจ axiom of choice, ๐3 โจ axiom of regularity since, for example, {∅} ∩ {{∅}} = ∅. These results are particular examples of the next general theorem. Section 7.5 MODELS OF DIFFERENT CARDINALITIES 421 THEOREM 7.5.20 If ๐ผ is an ordinal, โ ๐๐ผ โจ extensionality axiom, โ ๐๐ผ โจ union axiom, โ ๐๐ผ โจ subset axioms, โ ๐๐ผ โจ axiom of choice, โ ๐๐ผ โจ axiom of regularity. PROOF Let ๐ผ be an ordinal and ๐ผ be an ๐ฒ๐ณ-interpretation of ๐๐ผ . We check that the extensionality axiom (5.1.4) holds in the model and leave the other parts of the proof to Exercise 25. Take ๐ด, ๐ต ∈ ๐๐ผ . Assume that ๐๐ผ โจ ∀๐ข(๐ข ∈ ๐ฅ ↔ ๐ข ∈ ๐ฆ) [(๐ผ๐ฅ๐ด )๐ต ๐ฆ ]. That is, ๐ด ๐ต ๐ ๐ for all ๐ ∈ ๐๐ผ , ๐๐ผ โจ ๐ข ∈ ๐ฅ [((๐ผ๐ฅ๐ด )๐ต ๐ฆ )๐ข ] if and only if ๐๐ผ โจ ๐ข ∈ ๐ฆ [((๐ผ๐ฅ )๐ฆ )๐ข ]. We want to show that ๐ด = ๐ต, so let ๐ ∈ ๐ด. Since ๐๐ผ is transitive (Lemma 7.5.14), ๐ ∈ ๐๐ผ , which implies that ๐ ๐๐ผ โจ ๐ข ∈ ๐ฅ [((๐ผ๐ฅ๐ด )๐ต ๐ฆ )๐ข ]. Therefore, ๐ ๐๐ผ โจ ๐ข ∈ ๐ฆ [((๐ผ๐ฅ๐ด )๐ต ๐ฆ )๐ข ], which implies that ๐ ∈ ๐ต. This proves that ๐ด ⊆ ๐ต. A similar proof shows that ๐ต ⊆ ๐ด. Thus, ๐ด = ๐ต, from which follows that ๐๐ผ โจ ๐ฅ = ๐ฆ [(๐ผ๐ฅ๐ด )๐ต ๐ฆ ]. Therefore, ๐ด ๐ต if ๐๐ผ โจ ∀๐ข(๐ข ∈ ๐ฅ ↔ ๐ข ∈ ๐ฆ) [(๐ผ๐ฅ๐ด )๐ต ๐ฆ ], then ๐๐ผ โจ ๐ฅ = ๐ฆ [(๐ผ๐ฅ )๐ฆ ]. In other words, ๐๐ผ โจ ∀๐ข(๐ข ∈ ๐ฅ ↔ ๐ข ∈ ๐ฆ) → ๐ฅ = ๐ฆ [(๐ผ๐ฅ๐ด )๐ต ๐ฆ ]. Since ๐ด and ๐ต were arbitrarily chosen, ๐๐ผ โจ ∀๐ฅ∀๐ฆ(∀๐ข[๐ข ∈ ๐ฅ ↔ ๐ข ∈ ๐ฆ] → ๐ฅ = ๐ฆ). Again, using the ๐ฒ๐ณ-structure ๐3 (7.34), we see that the result of pairing two arbitrarily chosen elements of ๐3 into a single set might not be an element of ๐3 . For example, {{∅}, {{∅}}} ∉ ๐3 . 422 Chapter 7 MODELS The same is true regarding power sets. For example, P({{∅}}) = {∅, {{∅}}} ∉ ๐3 . However, both of these sets are elements of ๐๐ , which suggests that for the pairing (5.1.5) and power set (5.1.7) axioms to hold in stage ๐๐ผ of the von Neumann hierarchy, ๐ผ needs to be a limit ordinal. THEOREM 7.5.21 If ๐ผ is a limit ordinal, โ ๐๐ผ โจ pairing axiom, โ ๐๐ผ โจ power set axiom. PROOF Let ๐ผ be a limit ordinal. Take ๐, ๐ ∈ ๐๐ผ . Then, there exists ๐ฝ1 , ๐ฝ2 ∈ ๐ผ such that ๐ ∈ ๐๐ฝ1 and ๐ ∈ ๐๐ฝ2 . Without loss of generality, we can assume that ๐ฝ1 ⊆ ๐ฝ2 , which implies that ๐ ∈ ๐๐ฝ2 by Lemma 7.5.14. Therefore, {๐, ๐} ⊆ ๐๐ฝ2 , so {๐, ๐} ∈ P(๐๐ฝ2 ) = ๐๐ฝ + ⊆ ๐๐ผ . 2 Hence, ๐๐ผ โจ ∀๐ข ∀๐ฃ ∃๐ฅ ∀๐ค (๐ค ∈ ๐ฅ ↔ ๐ค = ๐ข ∨ ๐ค = ๐ฃ). The proof of the second part of the theorem is left to Exercise 26. Certainly, the empty set will be an element of ๐๐ผ provided that ๐ผ is not empty, so the proof of the next theorem is left to Exercise 27. In addition, ๐๐ผ needs to contain ๐ to satisfy the infinity axiom. THEOREM 7.5.22 If ๐ผ is a nonempty ordinal, ๐๐ผ โจ empty set axiom. THEOREM 7.5.23 If ๐ผ is an ordinal such that ๐ ∈ ๐ผ, then ๐๐ผ โจ infinity axiom. PROOF Since Lemma 7.5.15 implies that ๐ ∈ ๐๐+ , by Lemma 7.5.14, we have that ๐๐ผ โจ ∃๐ฅ({ } ∈ ๐ฅ ∧ ∀๐ข[๐ข ∈ ๐ฅ → ∃๐ฆ(๐ฆ ∈ ๐ฅ ∧ ๐ข ∈ ๐ฆ ∧ ∀๐ฃ[๐ฃ ∈ ๐ข → ๐ฃ ∈ ๐ฆ])]). Since ๐ ⋅ 2 = ๐ + ๐ is a limit ordinal greater than ๐, we conclude the following. THEOREM 7.5.24 ๐๐⋅2 โจ ๐ญ. The ordinal ๐ + ๐ requires one of the replacement axioms (5.1.9) to prove its existence (page 317). This means that the proof of the consistency of ๐ญ relies on axioms not in Section 7.5 MODELS OF DIFFERENT CARDINALITIES 423 ๐ญ, so let us continue to examine the von Neumann hierarchy for a model of ๐ญ๐๐. It will not be ๐๐⋅2 because not all replacement axioms are true in ๐๐⋅2 . This is because ๐๐⋅2 does not satisfy Theorem 6.1.19. To use a stage of the von Neumann hierarchy to serve as a model for ๐ญ๐๐, we need a strongly inaccessible cardinal (Definition 6.5.11), which is the next theorem. We state it without proof. THEOREM 7.5.25 ๐๐ โจ ๐ญ๐๐ if and only if ๐ is a strongly inaccessible cardinal. However, the sentence there exists a strongly inaccessible cardinal (7.35) cannot be proved or disproved using ๐ญ๐๐. This means that (7.35) is independent of the axioms of set theory. It also means that we are ready for the next definition. Do not confuse it with the notion of a complete logic (Definition 7.4.8). DEFINITION 7.5.26 An ๐ฒ-theory ๐ฏ is complete if ๐ฏ โข ๐ or ๐ฏ โข ¬๐ for all ๐ฒ-sentences ๐, else ๐ฏ is incomplete. The definition means that ๐ญ๐๐ is not complete. It turns out that the underlying issue is that ๐ญ๐๐ satisfies the Peano axioms (7.5.3). The ability to do basic arithmetic guarantees that there exists a sentence that is independent of ๐ญ๐๐. This result generalizes to the incompleteness theorems due to Gödel (1931). THEOREM 7.5.27 [Gödel’s First Incompleteness Theorem] If the Peano axioms are provable from a consistent theory, the theory is incomplete. Since ๐ญ๐๐ is assumed to be consistent and it can be used to deduce the Peano axioms, we conclude that ๐ญ๐๐ is incomplete. Now suppose that instead of assuming the consistency of ๐ญ๐๐, we try to prove that ๐ญ๐๐ is consistent using its own axioms. Gödel’s next theorem proves that we cannot do this, except under one condition. THEOREM 7.5.28 [Gödel’s Second Incompleteness Theorem] If a theory proves the Peano axioms and its own consistency, the theory is inconsistent. Therefore, if ๐ญ๐๐ could be used to prove (7.35), ๐ญ๐๐ would prove that it has a model, which would imply that ๐ญ๐๐ is consistent by Henkin’s theorem (7.4.19). Since we believe that ๐ญ๐๐ is consistent, we conclude that ๐ญ๐๐ cannot prove the existence of a strongly inaccessible cardinal. If ๐ญ๐๐ was extended with an axiom that would allow such a proof, there would be another issue with the extension that would prevent it from proving its own consistency, provided that the new theory was consistent. 424 Chapter 7 MODELS The statement of the first incompleteness theorem does not explicitly give a mathematical statement that cannot be proved. It was left to later mathematicians to find some. For example, when combined with Gödel’s proof of the relative consistency of ๐ข๐ง, the proof of Paul Cohen (1963) of ๐ข๐๐(๐ญ๐๐) → ๐ข๐๐(๐ญ๐๐ + ¬๐ข๐ง), shows that both ๐ข๐ง and its negation cannot be proved from ๐ญ๐๐. This means that the continuum hypothesis is independent of ๐ญ๐๐. In general, to prove that a sentence ๐ is independent of a theory ๐ฏ , do two things. โ Find a model ๐ such that ๐ โจ ๐ฏ ∪ {๐}. Then, ๐ฏ ฬธโจ ¬๐ by Definition 7.1.20. This implies that ๐ฏ โฌ ¬๐ by Corollary 7.4.23. โ Find another model ๐ such that ๐ โจ ๐ฏ ∪ {¬๐}. From this conclude that ๐ฏ ฬธโจ ๐, which implies ๐ฏ โฌ ๐. Using this strategy, other statements have been discovered to be independent of ๐ญ๐๐. Some of these are technical set theoretic statements such as Martin’s axiom (Martin and Solovay 1970) or the diamond principle (Jensen 1972). There are independent statements in other branches of mathematics as well. An example from group theory is the independence of the Whitehead problem (Shelah 1974), named after the mathematician John H. C. Whitehead (1950). Exercises 1. Write an ๐ ๐ฑ′ -formula equivalent to the given English sentences. (a) ๐ฅ is an even number. (b) ๐ฅ is an odd number. (c) ๐ฅ is a prime. (d) ๐ฅ divides ๐ฆ. 2. Prove that the order of Definition 5.2.10 proves the given ๐ ๐ฑ′ -sentences. (a) ∀๐ฅ(¬ ๐ฅ < 0) (b) ∀๐ฅ∀๐ฆ[๐ฅ < ๐๐ฆ ↔ (๐ฅ < ๐ฆ ∨ ๐ฅ = ๐ฆ)] (c) ∀๐ฅ∀๐ฆ(๐ฅ < ๐ฆ ∨ ๐ฅ = ๐ฆ ∨ ๐ฆ < ๐ฅ). 3. Prove that the given ๐ ๐ฑ′ -formulas can be proved from Axioms 7.5.3. (a) ∀๐ฅ(0 < ๐ฅ ∨ ๐ฅ = 0) (b) ∀๐ฅ[๐ฅ = 0 ∨ ∃๐ฆ(๐ฅ = ๐๐ฆ)] (c) ∀๐ฅ∀๐ฆ(๐ฅ ⋅ ๐ฆ = 0 → ๐ฅ = 0 ∨ ๐ฆ = 0) (d) ∀๐ฅ∀๐ฆ(๐ฅ < ๐ฆ ↔ ๐๐ฅ < ๐๐ข) (e) ∀๐ฅ∀๐ฆ(๐ฅ < ๐ฆ ∨ ๐ฅ = ๐ฆ ∨ ๐ฆ < ๐ฅ) 4. Prove ๐ฃ๐ โจ (๐ฅ + 2) ⋅ (๐ฅ + 3) = (๐ฅ ⋅ ๐ฅ + 5 ⋅ ๐ฅ) + 6, where 2, 3, 5, and 6 are understood to mean the appropriate successors of 0. Section 7.5 MODELS OF DIFFERENT CARDINALITIES 425 5. Find examples of the following if possible. (a) An ๐ฒ-theory ๐ฏ such that every two finite models of ๐ฏ are isomorphic, but there exists two models of ๐ฏ that are infinite and not isomorphic. (b) An ๐ฒ-theory ๐ฏ such that every two countable models of ๐ฏ are isomorphic, but there exists two models of ๐ฏ that are uncountable and not isomorphic. 6. Let ๐ฏ be an ๐ฒ-theory and ๐ an ๐ฒ-sentence where ๐ฏ โข ๐. Prove that there exists a finite ๐ฐ ⊆ ๐ฏ such that ๐ฐ โข ๐. 7. Let ๐ฏ be an ๐ฒ-theory and ๐ an ๐ฒ-sentence such that ๐ฏ โจ ๐ and ๐ โจ ๐ฏ . Show that there exists a finite subset ๐ฐ of ๐ฏ such that ๐ฐ โจ ๐ฏ . 8. Prove that the following are equivalent for an ๐ฒ-theory ๐ฏ . โ ๐ฏ is consistent. โ ๐ฏ has a model. โ ๐ฏ has a countable model. โ Every finite subset of ๐ฏ is consistent. โ Every finite subset of ๐ฏ has a model. โ Every finite subset of ๐ฏ has a countable model. 9. Finish the proof of Theorem 7.5.5. 10. Prove that the cancellation law holds for multiplication in Peano arithmetic. See Exercise 7.5.7. 11. If possible, find an ๐ ๐ฑ′ -sentence ๐ ∈ ๐ณ๐(๐′ ) that is not a consequence of ๐ฃ๐ (Exercise 7.4.6). 12. Demonstrate that there is no finite model of ๐ฃ or ๐ฃ๐. 13. Prove that there exists a model of ๐ฃ that is not isomorphic to ๐. 14. Prove that there is a countable nonstandard model of Peano arithmetic. 15. The axioms for an ordered field are the ring axioms (7.1.36) plus the following ๐ฎ๐ฅ-sentences: โ ∀๐ฅ∀๐ฆ(๐ฅ ⊗ ๐ฆ = ๐ฆ ⊗ ๐ฅ), โ ∃๐ฅ∀๐ฆ(๐ฅ ⊗ ๐ฆ = ๐ฆ), โ ∀๐ฅ[¬๐ฅ = 0 → ∃๐ฆ(๐ฅ ⊗ ๐ฆ = 1)], โ ∀๐ฅ∀๐ฆ(๐ฅ < ๐ฆ ∨ ๐ฅ = ๐ฆ ∨ ๐ฆ < ๐ฅ). Find a model for these axioms. 16. Show that there exists a model of the axioms for an ordered field (Exercise 15) such that ๐ is a subset of the domain of the model and there exists ๐ in the domain so that ๐ < ๐ for every natural number ๐. In addition, prove that there are infinitely many such ๐. What does 1โ๐ look like? 17. Let ℜ = (โ, 0, +, ⋅, <) be a model of the axioms for an ordered field. Prove that there is a countable model of ๐ณ๐(ℜ). What is the significance of this model? 426 Chapter 7 MODELS 18. Expand ๐ฎ๐ฅ to ๐ฎ๐ฅ ∪ {๐ธ}, where ๐ธ is a binary function symbol. Find a model of the axioms for an ordered field including the following (๐ฎ๐ฅ ∪ {๐ธ})-sentences: โ ∀๐ฅ(๐ฅ๐ธ0 = ๐0) โ ∀๐ฅ∀๐ฆ(๐ฅ๐ธ(๐๐ฆ) = (๐ฅ๐ธ๐ฆ)๐ฅ 19. Let ๐ฏ be a theory such that for every ๐ ∈ ๐, there exists ๐ ∈ ๐ such that ๐ > ๐ and ๐ฏ has a model of power ๐. Prove that ๐ฏ has an infinite model. 20. This hierarchy is due to Zermelo. Let ๐0 be a set of atoms.โFor every ordinal ๐ผ, define ๐๐ผ+ = ๐๐ผ ∪ P(๐๐ผ ), and if ๐ผ is a limit ordinal, define ๐๐ผ = ๐ฝ<๐ผ ๐๐ฝ . Find ๐1 and ๐2 assuming the given sets of atoms. (a) ๐0 = ∅ (b) ๐0 = {๐} (c) ๐0 = {0, 1, 2, 3} 21. Prove Lemma 7.5.17. 22. Prove that ๐๐ผ is a set for every ordinal ๐ผ. 23. Using Exercise 6.3.22, show that |๐๐+๐ผ | = i๐ผ for every ordinal ๐ผ. 24. Let ๐ด be a set. Prove that TC(๐ด) is a transitive set. 25. Prove the remaining parts of Theorem 7.5.20. 26. Let ๐ผ be a limit ordinal. Prove that ๐๐ผ โจ ∀๐ฅ∃๐ฆ∀๐ข(๐ข ∈ ๐ฆ ↔ ∀๐ฃ[๐ฃ ∈ ๐ข → ๐ฃ ∈ ๐ฅ]). 27. Prove that ๐๐ผ โจ ∃๐ฅ∀๐ฆ¬(๐ฆ ∈ ๐ฅ) if ๐ผ ≠ ∅. 28. Let ๐ be an ๐ฒ-structure. Prove that ๐ณ๐(๐) is complete. 29. Let ๐ be a cardinal and {๐ฏ๐ผ : ๐ผ ∈ ๐ } be a chain of complete ๐ฒ-theories such that โ for all ๐พ ∈ ๐ฟ ∈ ๐ we have that ๐ฏ๐พ ⊆ ๐ฏ๐ฟ . Show that ๐ผ∈๐ ๐ฏ๐ผ is complete. 30. Let ๐ฎ ⊆ ๐ฏ be ๐ฒ-theories. Prove or show false: If ๐ฏ is complete, then ๐ฎ is complete. APPENDIX ALPHABETS Greek Alphabet Upper A B Γ Δ E Z H Θ I K Λ M Lower ๐ผ ๐ฝ ๐พ ๐ฟ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ Name alpha beta gamma delta epsilon zeta eta theta iota kappa lambda mu Upper N Ξ O Π P Σ T Υ Φ X Ψ Ω Lower ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ ๐ Name nu xi omicron pi rho sigma tau upsilon phi chi psi omega 427 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 428 ALPHABETS English Alphabet in the Fraktur Font Upper ๐ ๐ โญ ๐ ๐ ๐ ๐ โ ℑ ๐ ๐ ๐ ๐ Lower ๐ ๐ ๐ ๐ก ๐ข ๐ฃ ๐ค ๐ฅ ๐ฆ ๐ง ๐จ ๐ฉ ๐ช Name A B C D E F G H I J K L M Upper ๐ ๐ ๐ ๐ ℜ ๐ ๐ ๐ ๐ ๐ ๐ ๐ โจ Lower ๐ซ ๐ฌ ๐ญ ๐ฎ ๐ฏ ๐ฐ ๐ฑ ๐ฒ ๐ณ ๐ด ๐ต ๐ถ ๐ท Name N O P Q R S T U V W X Y Z REFERENCES Aristotle (1984). The Complete Works of Aristotle, ed. J. Barnes. Princeton, NJ: Princeton University Press. Boole, G. (1847). The Mathematical Analysis of Logic : Being an Essay Towards a Calculus of Deductive Reasoning. Cambridge, UK: Macmillan, Barclay, & Macmillan. Boole, G. (1854). An Investigation of the Laws of Thought, on Which Are Founded the Mathematical Theories of Logic and Probabilities. London: Walton and Maberly. Boyer, C. B. and U. C. Merzbach (1991). A History of Mathematics (2nd ed.). New York: John Wiley & Sons, Inc. Burali-Forti, C. (1897). Una questione sui numeri transfiniti. Rendiconti del Circolo Matematico di Palermo 11(1), 154–164. Cantor, G. (1874). Ueber eine Eigenschaft des Inbegriffs aller reellen algebraischen Zahlen. Journal Fur Die Reine Und Angewandte Mathematik 77, 258–262. Cantor, G. (1888). Mitteilungen zur Lehre vom Transfiniten. Zeitschrift für Philosophie und philosophische Kritik 91, 81–125. Cantor, G. (1891). Über eine elementare Frage def Mannigfaltigkeitslehre. Jahresbericht der Deutschen Mathematiker-Vereinigung 1, 75–78. Cantor, G. (1932). Gesammelte Abhandlungen mathematischen und philosophischen Inhalts. Berlin: Springer-Verlag. 429 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 430 REFERENCES Chang, C. C. and H. J. Keisler (1990). Model Theory (3rd ed.). Studies in Logic and the Foundations of Mathematics. Amsterdam: North Holland. Church, A. (1956). Introduction to Mathematical Logic. Princeton, NJ: Princeton University Press. Ciesielski, K. (1997). Set Theory for the Working Mathematician. London Mathematical Society Student Texts. Cambridge, UK: Cambridge University Press. Cohen, P. J. (1963). The independence of the continuum hypothesis. Proceedings of the National Academy of Sciences of the United States of America 50(6), 1143–1148. Copi, I. M. (1979). Symbolic Logic (5th ed.). New York: Macmillan Publishing. Dauben, J. W. (1979). George Cantor: His Mathematics and Philosophy of the Infinite. Princeton, NJ: Princeton University Press. De Morgan, A. (1847). Formal Logic: or, the Calculus of Inference, Necessary and Probable. London: Taylor and Walton. Dedekind, R. (1893). Was sind und was sollen die Zahlen? Braunschweig: F. Vieweg. Dedekind, R. (1901). Essays on the Theory of Numbers, trans. W. W. Beman. Chicago: Open Court. Descartes, R. (1985). The Philosophical Writings of Descartes, eds. J. Cottingham, R. Stoothoff, and D. Murdoch. Cambridge, UK: Cambridge University Press. Doets, K. (1996). Basic Model Theory. Stanford: CSLI Publications. Drake, F. R. (1974). Set Theory: An Introduction to Large Cardinals, Volume 76 of Studies in Logic and Foundations of Mathematics. Amsterdam: North-Holland. Ebbinghaus, H., J. Flum, and W. Thomas (1984). Mathematical Logic. Undergraduate Texts in Mathematics. New York: Springer-Verlag. Ebbinghaus, H. and V. Peckhaus (2007). Ernst Zermelo: An Approach to His Life and Work. Berlin: Springer. Eklof, P. C. (1976). Whitehead’s problem is undecidable. American Mathematical Monthly 83, 775–788. Eklof, P. C. and A. H. Mekler (2002). Almost Free Modules: Set-theoretic Methods (revised ed.). Amsterdam: North-Holland. Enderton, H. B. (1977). Elements of Set Theory. San Diego: Academic Press. Euclid (1925). The Elements, ed. T. Heath. Reprint, New York: Dover, 1956. Ewald, W. B. (2007). From Kant to Hilbert: A Source Book in the Foundations of Mathematics. Oxford: Oxford University Press. Fibonacci and L. E. Sigler (2002). Fibonacci’s Liber Abaci: A Translation into Modern English of Leonardo Pisano’s Book of Calculuation. Sources and Studies in the History of Mathematics and Physical Sciences. New York: Springer. Fraenkel, A. A. (1922). Zu den Grundlagen der Cantor-Zermeloschen Mengenlehre. Mathematische Annalen 86, 230–237. Fraleigh, J. B. (1999). A First Course in Abstract Algebra (6th ed.). Reading, MA: AddisonWesley. Frege, G. (1879). Begriffsschrift, Eine Der Arithmetischen Nachgebildete Formelsprache Des Reinen Denkens. Halle a/S. REFERENCES 431 Frege, G. (1884). Die Grundlagen Der Arithmetik: Eine Logisch Mathematische Untersuchung Über Den Begriff Der Zahl. Breslau: W. Koebner. Frege, G. (1893). Grundgesetze Der Arithmetik : Begriffsschriftlich Abgeleitet. Jena: H. Pohle. Gödel, K. (1929). Über die Vollständigkeit des Logikkalküls. Ph. D. thesis, University of Vienna. Gödel, K. (1930). Die Vollständigkeit der Axiome des logischen Functionenkalküls. Monatshefte für Mathematik und Physik 37, 349–360. Gödel, K. (1931). Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme, I. Monatshefte für Mathematik und Physik 38, 173–198. Gödel, K. (1940). Consistency of the Continuum Hypothesis. Princeton, NJ: Princeton University Press. Halmos, P. R. (1960). Naive Set Theory. New York: Springer-Verlag. Henkin, L. (1949). The Completeness of Formal Systems. Ph. D. thesis, Princeton University, Princeton, NJ. Herrlich, H. (2006). Axiom of Choice. Lecture Notes in Mathematics. Berlin: Springer. Hilbert, D. (1899). Grundlagen der Geometrie. Leipzig: Teubner. Hilbert, D. (1902). Mathematical problems, trans. M. F. W. Newson. Bulletin of the American Mathematical Society 8, 437–479. Hodges, W. (1993). Model Theory. Encyclopedia of Mathematics and its Applications. Cambridge, U.K.: Cambridge University Press. Hofstadter, D. R. (1989). Gödel, Escher, Bach: an Eternal Golden Braid. Reprint, New York: Vintage Books. Jech, T. (1973). The Axiom of Choice, Volume 75 of Studies in Logic and the Foundations of Mathematics. Amsterdam: North-Holland. Jech, T. (2003). Set Theory: The Third Millennium Edition. Springer Monographs in Mathematics. Berlin: Springer. Jensen, R. B. (1972). The fine structure of the constructible hierarchy. Annals of Mathematical Logic 4(3), 229–308. Kaye, R. (1991). Models of Peano Arithmetic, eds. D. S. Angus Macintyre, John Shepherdson. Oxford, UK: Clarendon Press. Kline, M. (1972). Mathematical Thought from Ancient to Modern Times. New York: Oxford University Press. Kneale, W. and M. Kneale (1964). The Development of Logic. Oxford, UK: Clarendon Press. König, J. (1905). Zum Kontinuum-problem. Mathematische Annalen 60(2), 177–180. König, J. (1906). Sur la théorie des ensembles. Comptes rendus hebdomadaires des séances de l’Académie des sciences 143, 110–112. Kunen, K. (1990). Set Theory: An Introduction to Independence Proofs, Volume 102 of Studies in Logic and the Foundations of Mathematics. Amsterdam: North-Holland. Kuratowski, K. (1921). Sur la notion de l’order dans la théorie des ensembles. Fundamenta Mathematicae 2, 161–171. Kuratowski, K. (1922). Sur l’opération a de l’analysis situs. Fundamenta Mathematicae 3, 182– 199. 432 REFERENCES Leary, C. C. (2000). A Friendly Introduction to Mathematical Logic. Upper Saddle River, NJ: Prentice Hall. Leibniz, G. W. (1666). Dissertatio de arte combinatoria, in qua ex arithmeticae fundamentis complicationum ac transpositionum doctrina novis praeceptis extruitur, & usus ambarum per universum scientiarum orbem ostenditur; nova etiam artis meditandi, seu logicae inventionis semina sparguntur. Lipsiae, apud Joh. Simon Fickium et Joh. Polycarp. Seuboldum, Literis Spörelianis. Levy, A. (1979). Basic Set Theory. Perspectives in Mathematical Logic. Berlin: Springer-Verlag. Löwenheim, L. (1915). Über Möglichkeiten im Relativkalkül. Mathematische Annalen 76(4), 447–470. ลukasiewicz, J. (1930). Elementy logiki matematycznej. Warsaw: s.n. ลukasiewicz, J. (1951). Aristotle’s Syllogistic From the Standpoint of Modern Formal Logic. Oxford, UK: Clarendon Press. Mal’tsev, A. I. (1936). Untersuchungen aus dem Gebiete der mathematischen Logik. Matematicheskii Sbornik 1(43), 323–336. Martin, D. A. and R. M. Solovay (1970). Internal Cohen extensions. Annals of Mathematical Logic 2(2), 143–178. Mirimanoff, D. (1917). Les antinomies de Russel et de Burali-Forti et le problème fondamental de la théorie des ensembles. L’Enseignement Mathématique 19, 37–52. Pascal, B. (1665). Traité du triangle arithmetique, avec quelques autres petits traitez sur la mesme matrière. Paris: chez G. Desprez. Peano, G. (1889). Arithmetices principia: nova methodo. Augustae Taurinorum [Torino]: Fratres Bocca. Reid, C. (1996). Hilbert. New York: Copernicus. Rosen, K. H. (1993). Elementary Number Theory and Its Applications (3rd ed.). Reading, MA: Addison-Wesley. Rubin, H. and J. E. Rubin (1985). Equivalents of the Axiom of Choice, II, Volume 116 of Studies in Logic and the Foundations of Mathematics. Amsterdam: North-Holland. Rubin, J. E. (1973). The compactness theorem in mathematical logic. Mathematics Magazine 46(5), 261–265. Shelah, S. (1974). Infinite abelian groups, Whitehead problem and some constructions. Isreal Journal of Mathematics 18(3), 243–256. Skolem, T. (1922). Einige Bemerkungen zur axiomatischen Begründung der Mengenlehre. In Proceedings of the 5th Scandinavian Mathematicians’ Congress in Helsinki, pp. 217–32. Suppes, P. (1972). Axiomatic Set Theory. New York: Dover Publications. Tarski, A. (1935). Der wahrheitsbegriff in den formalisierten sprachen. Studia Philosophica 1, 261–405. Tarski, A. (1983). Logic, Semantics, Metamathematics: Papers from 1923 to 1938, trans. J. H. Woodger. Indianapolis: Hackett Publishing Company. Tarski, A. and R. L. Vaught (1957). Arithmetical extensions of relational systems. Compositio Mathematica 13, 81–102. van Dalen, D. (1994). Logic and Structure (3rd ed.). Berlin: Springer-Verlag. REFERENCES 433 van Dalen, D., H. C. Doets, and H. de Swart (1978). Sets: Naive, Axiomatic and Applied, Volume 106 of International Series in Pure and Applied Mathematics. Oxford, UK: Pergamon Press. Van Heijenoort, J. (1971). Frege and Gödel: Two Fundamental Texts in Mathematical Logic. Cambridge MA: Harvard University Press. Van Heijenoort, J. (1977). From Frege to Gödel: A Source Book in Mathematical Logic, 1879– 1931. Source Books in History of Sciences. Cambridge, MA: Harvard University Press. Venn, J. (1894). Symbolic Logic (2nd ed.). London: Macmillan. von Neumann, J. (1923). Zur Einführung der transfiniten Zahlen. Acta literarum ac scientiarum Regiae Universitatis Hungaricae Francisco-Josephinae, Sectio scientiarum mathematicarum 1, 199–208. von Neumann, J. (1928). Über die Definition durch transfinite Induktion und verwandte Fragen der allgemeinen Mengenlehre. Mathematische Annalen 99, 373–391. von Neumann, J. (1929). Über eine Widerspruchsfreiheitsfrage der axiomatischen Mengenlehre. Journal für die reine und angewandte Mathematik 160, 227–241. Whitehead, A. N. and B. Russell (1910). Principia Mathematica (2nd ed.). Cambridge, UK: Cambridge University Press. Whitehead, J. H. C. (1950). Simple homotopy types. American Journal of Mathematics 72(1), 1–57. Wussing, H. (1984). The Genesis of the Abstract Group Concept, ed. H. Grant, trans. A. Shenitzer. Cambridge, MA: MIT Press. Zermelo, E. (1908). Untersuchungen über die grundlagen der mengenlehre. Mathematische Annalen 65, 261–281. Zermelo, E. (1930). Über Grenzzahlen und Mengenbereiche. Neue Untersuchungen über die Grundlagen der Mengenlehre. Fundamenta mathematicae 16, 29–47. Zorn, M. (1935). A remark on method in transfinite algebra. Bulletin of the American Mathematical Society 41, 667–670. INDEX Abel, Niels, 342 Abelian group, 342 Absolute value, 116 Abstraction, 122 Addition, inference rule, 25 Addition, matrix, 355 Additive identity, 199, 412 Additive inverse, 199 Aleph, 313 Algebraic, 315 Alphabet, 5 first-order, 68 second-order, 73 And, 4 Antecedent, 4 Antichain, 183 Antisymmetric, 177 Arbitrary, 88 Argument form, 21 Assignment, 7 Associative, 34, 139, 199, 412 Assumption, 46 Asymmetric, 177 Atom, 3, 6 Automorphism, 380 Axiom scheme, 228 Axiom(s), 24 choice, 231, 235 empty set, 227 equality, 226 extensionality, 227 foundation, 234 Frege–ลukasiewicz, 24 group, 340 paring, 228 power set, 228 regularity, 234 replacement, 230 ring, 353 separation, 228 subset, 229 union, 228 Zermelo, 231 Axiomatizable, finitely, 409 Basis case, 258 Bernstein, Felix, 301 Beth, 316 Biconditional, 5 Biconditional proof, 107 435 A First Course in Mathematical Logic and Set Theory, First Edition. Michael L. O’Leary. © 2016 John Wiley & Sons, Inc. Published 2016 by John Wiley & Sons, Inc. 436 INDEX Bijection, 211 Binary, 68 function, 189 operation, 198 relation, 161 Binomial coefficient, 263 Binomial theorem, 263 Boole, George, 117 Bound lower, 181 upper, 180 Bound occurrence, 77 Burali-Forti paradox, 297 Burali-Forti theorem, 292 Cancellation, 248, 256, 414 Candidate, 99 Cantor–Schröder–Bernstein theorem, 301 Cantor, Georg, 117, 226, 229, 303, 313 Cardinal, 307 large, 332 limit, 314 regular, 328 singular, 328 strongly inaccessible, 332 successor, 314 weakly inaccessible, 331 Cardinality, 308 Cartesian ๐-space, 131 Cartesian plane, 130 Cartesian product, 130 Cases, 112 Chain, 182 elementary, 394 of structures, 372 Characteristic function, 304 Choice axiom, 231, 235 Choice function, 231 Class, 420 Class (relation), 171 Closed, 198 Closed interval, 120 Closed under deductions, 409 Codomain, 190 Cofinal, 328 Cofinality, 328 Cohen, Paul, 424 Coincidence, 349 Combinatorics, 260 Common divisor, 140 Commutative, 34, 139, 199, 412 Commutative ring, 354 Compactness theorem, 52, 415 Comparable, 181 Compatible, 183 Complement, 128 Complete, 56, 398 Complete theory, 423 Completeness theorem, 61, 407 Gödel’s, 407 Completeness, real numbers, 275 Complex number, 281 Composite number, 107 Composition, 163, 195 Compound, 4, 6 Concatenation, 178 Conclusion, 22, 26 Conditional, 4 Conditional proof, 45 Congruent, 170 Conjunct, 4 Conjunction, 4 Conjunction, inference rule, 25 Connective, 6 Consequence, 22, 346, 352 Consequent, 4 Consistency, relative, 407 Consistent, 52, 395 maximally, 53, 396 Constant symbol, 68 Constructive dilemma, 25 Contingency, 16 Continuous, 94 Continuum hypothesis, 313 generalized, 314 Contradiction, 16 Contradiction, proof by, 47 Contrapositive, 32, 103 Contrapositive law, 34 Converse, 32 Coordinates, 130 Coordinatewise, 356 Copy, 214 Corollary, 20 Corresponding occurrences, 77 Countable, 310 Counterexample, 102 Cyclic group, 363, 365 De Morgan, Augustus, 117 De Morgan’s laws, 35, 137, 139 Decreasing, 184, 203 Dedekind cut, 276 Dedekind, Richard, 229, 276 Deduce, 26 Deduction, 20 Deduction theorem, 42 Deductive logic, 2 Dense, 257 Denumerable, 310 INDEX Descartes, René, 117 Destructive dilemma, 25 Diagonalization, 303 Diamond principle, 424 Direct existential proof, 99 Direct proof, 45 Discrete, 310 Disjoint, 127, 155 pairwise, 155 Disjunct, 4 Disjunction, 4 Disjunctive normal form, 38 Disjunctive syllogism, 25 Distributive, 34, 139, 246, 323, 412 left, 320 Divides, 98 Divisible, 98 Division algorithm, 185 polynomial, 110 Division ring, 357 Divisor, 98 common, 140 zero, 356 Domain, 162, 334 Dominate, 300 Double negation, 35 Downward closed, 274 Downward Löwenheim–Skolem theorem, 415 Element, 117 Elementary chain, 394 Elementary equivalent, 387 Elementary extension, 389 Elementary substructure, 389 Embedding, 214 Empty set, 118, 141, 227 Empty set axiom, 227 Empty string, 6 Endpoint, 120 Equal, 118 Equality, 194 Equality axioms, 226 Equality symbol, 68 Equinumerous, 298 Equivalence class, 171 Equivalence relation, 169 induced, 175 Equivalence rule, 110 Equivalence, logical, 31, 348 Equivalent elementary, 387 pairwise, 109 Euclid, 19, 107, 265 Euclid’s lemma, 265 Evaluation map, 193 Even integer, 98 Excluded middle, 37 Exclusive or, 11 Existence, 104 Existential formula, 72 Existential generalization, 91 Existential instantiation, 91 Existential proof, 99, 106 Existential proposition, 65 Existential quantifier, 65 Expansion, 352 Exponentiation, 249, 321, 322 Exportation, 35 Extension, 197, 361 elementary, 389 Extensionality axiom, 227 Factor, 98, 110 Factorial, 242 False, 3 Family of sets, 148 Fibonacci, 268 Fibonacci number, 269 Fibonacci sequence, 269 generalized, 273 Field, 357 ordered, 69, 425 Field theory, 69 Finite, 308 hereditarily, 417 Finitely axiomatizable, 409 First-order alphabet, 68 formula, 73 language, 73 logic, 96 Formal proof, 26 Formation sequence, 7 Formula, 71, 72 first-order, 73 second-order, 73 Foundation axiom, 234 Fraenkel, Abraham, 229 Fraktur, 334, 428 Free occurrence, 77 Free variable, 78 Frege, Gottlob, 24, 117 Function, 189 bijection, 211 binary, 189 characteristic, 304 choice, 231 continuous, 94 decreasing, 203 embedding, 214 437 438 INDEX evaluation, 193 greatest integer, 192 homomorphism, 375 identity, 191 inclusion, 203 increasing, 203 injection, 206 inverse, 204 invertible, 204 isomorphism, 382 one-to-one, 206 one-to-one correspondence, 211 onto, 208 order-preserving, 212 periodic, 203 projection, 210 real-valued, 193 surjection, 208 unary, 189 uniformly continuous, 94 zero, 376 Function equality, 194 Function notation, 190 Function symbol, 68 Fundamental homomorphism theorem, 383 Fundamental theorem of arithmetic, 271 Galois, Évariste, 342 General linear group, 345 Generalization, 85 existential, 91 universal, 88 Generalized continuum hypothesis, 314 Generalized Fibonacci sequence, 273 Generator, 363, 370 Gödel, Kurt, 399, 407, 414, 423 Gödel’s completeness theorem, 407 Gödel’s incompleteness theorems, 423 Golden ratio, 271 Grammar, 6 Greatest common divisor, 140 Greatest element, 180 Greatest integer function, 192 Greatest lower bound, 181 Grounded, 419 Group, 342 abelian, 342 cyclic, 363 general linear, 345 Klein-4, 344 simple, 363 Group axioms, 340 Group theory, 69 Grouping symbol, 6 Half-open interval, 120 Hartogs’ function, 327 Hartogs’ theorem, 293 Hausdorff maximal principle, 237 Henkin, Leon, 399 Henkin’s theorem, 406 Hereditarily finite sets, 417 Hereditary set, 235 Hierarchy, von Neumann, 417 Hilbert, David, 226, 408 Hilbert’s problems, 408 Homomorphism, 375 group, 375 ring, 375 Hypothetical syllogism, 25 Ideal, 368 improper, 368 left, 368 maximal, 374 prime, 374 principal, 370 principal left, 370 proper, 368 right, 368 Idempotent laws, 139 Identity, 162, 199 additive, 199 multiplicative, 199 Identity map, 191 Image, 190, 208, 216 Implication, 4 Improper ideal, 368 Improper subgroup, 363 Improper subring, 366 Improper subset, 136 Inaccessible cardinal, 331 Inclusion map, 203 Inclusive or, 11 Incomparable, 181 Incompatible, 183 Incomplete theory, 423 Incompleteness theorems, 423 Inconsistent, 52, 395 Increasing, 184, 203 Independent, 408 Index, 148 Index set, 148 Indexed, 149 Indirect existential proof, 106 Indirect proof, 47 Induced equivalence relation, 175 Induced partition, 174 Induction mathematical, 257 INDEX on formulas, 336 on propositional forms, 59 on terms, 335 strong, 268 transfinite, 283, 291 Induction hypothesis, 59, 258 Induction step, 258 Inductive logic, 1 Inductive set, 238 Infer, 24 Inference, 24 Inference rule, 25 Infinite, 308, 344 Infinity axiom, 227 Infinity symbol, 120 Initial number, 327 Initial segment, 274 proper, 274 Injection, 206 Instantiation, 85 existential, 91 universal, 87 Integers, 250 Integral domain, 356 International Congress of Mathematicians, 408 Interpretation, 335 Intersection, 126, 152 Interval, 120 Interval notation, 120 Introduction, 97 Invalid semantically, 21 syntactically, 21 Inverse, 166, 199 additive, 199 function, 204 image, 216 left, 215 multiplicative, 199 right, 216 Inverse image, 216 Inverse relation, 166 Inverse statement, 37 Invertible function, 204 Invertible matrix, 345 Irrational number, 128, 276 Irreflexive, 177 Isomorphic, 212, 344, 380, 382 Isomorphism, 212, 380 group, 382 ring, 382 Kernel, 379 Klein-4 group, 344, 363 König, Julius, 301, 330 König’s theorem, 330 Kuratowski, Kazimierz, 130, 232, 237 Language, 73 first order, 73 Large cardinal, 332 Law of noncontradiction, 37 Law of the excluded middle, 37 Least element, 180 Least upper bound, 180 Left distributive, 320 Left ideal, 368 Left inverse, 215 Left zero divisor, 356 Leibniz, Gottfried, 117 Lemma, 20 Leonardo of Pisa, 268 Lexicographical order, 187 Liber abaci, 268 Limit cardinal, 314 Limit ordinal, 290 Lindenbaum’s theorem, 397 Linear combination, 186 Linear order, 182 Linearly ordered set, 182 Logic, 1 deductive, 2 first-order, 96 inductive, 1 mathematical, 2 propositional, 20 second-order, 96 Logic symbol, 67 Logical implication, 21 Logical system, 20 Logically equivalent, 31, 348 Löwenheim, Leopold, 415 Löwenheim’s theorem, 416 Löwenheim–Skolem theorem downward, 415 upward, 416 Lower bound, 181 greatest, 181 ลukasiewicz, Jan, 24, 71 Mal’tsev, Anatolij, 414 Map, 190 Martin’s axiom, 424 Material equivalence, 13, 35 Material implication, 12, 35 Mathematical induction, 257 strong, 268 Mathematical logic, 2 Mathematics, 1 Mathesis universalis, 117 439 440 INDEX Matrix, 345 identity, 345 invertible, 345 zero, 355 Matrix addition, 355 Matrix multiplication, 345 Maximal element, 181 Maximal ideal, 374 Maximally consistent, 53, 396 Meaning, 334 Metatheory, 96 Minimal element, 181 Mirimanoff, Dmitry, 229, 234 Model, 338, 339 Modern square of opposition, 86 Modulo, 170, 172 Modus ponens, 25 Modus tolens, 25 Multiple, 98 Multiplication, matrix, 345 Multiplicative identity, 199, 412 Multiplicative inverse, 176, 199 Mutually exclusive, 127 ๐-ary, 68, 161, 189 Name, 334 Natural number, 238 Necessary, 4, 109 Negated, 4 Negation, 4 Niece, 64, 161 Noncontradiction law, 37 Nonstandard model, 411 Norway, 342 Number, 256 standard, 411 Number theory, 69 Occurrence, 65 Odd integer, 98 One-to-one correspondence, 211 One-to-one function, 206 Onto function, 208 Open interval, 120 Operation, binary, 198 associative, 199 commutative, 199 Operation, set, 126 Or, 4 exclusive, 11 inclusive, 11 Order, 344 increasing, 184 lexicographical, 187 linear, 182 partial, 178 well, 183 Order isomorphism, 212 Order of connectives, 8, 9 Order of operations, 132 Order type, 212, 292 Order-preserving, 212 Ordered ๐-tuple, 131 Ordered field, 69, 425 Ordered pair, 130 Ordinal, 286 limit, 290 successor, 290 Ordinal number, 286 Pairing axiom, 228 Pairwise disjoint, 155 Pairwise equivalent, 109 Paragraph proof, 97 Parsing tree, 6 Partial order, 178 Partially ordered set, 178 Particular, 88 Partition, 173 induced, 174 Pascal’s identity, 263 Peano arithmetic, 69, 411 Peano axioms, 410 Peano, Giuseppe, 410 Periodic, 203 Permutation, 260 Pigeonhole principle, 309 Polynomial division algorithm, 110 Poset, 178 Positive form, 86 Postulate, 20 Power, 415 Power set, 151 Power set axiom, 228 Pre-image, 190 Predecessor, 237 Predicate, 64, 65 Prefix notation, 71 Premise, 22, 26 Prenex normal form, 96 Preserves, 212, 375 Prime ideal, 374 Prime number, 107 Prime power decomposition, 272 Principal ideal, 370 left, 370 Principal ideal domain, 372 Projection, 210 Proof methods biconditional, 107 INDEX biconditional, short rule, 108 cases, 112 conditional, 45 contradiction, 47 counterexample, 102 direct, 45 disjunctions, 111 equivalence rule, 110 existence, 104 existential, direct, 99 existential, indirect, 106 indirect, 47 reductio ad absurdum, 47 relative consistency, 407 uniqueness, 104 universal, 98 Proof, formal, 26 Proof, paragraph, 97 Proof, two-column, 27 Proper ideal, 368 Proper initial segment, 274 Proper subgroup, 364 Proper subring, 366 Proper subset, 136 Proposition, 3 compound, 4 universal, 66 Proposition alphabet, 5 Propositional form, 6 Propositional logic, 20 Propositional variable, 6 Prove, 26 Pure set, 235 Purely relational, 337 Quantifier, 65, 66 Quantifier negation, 86 Quantifier symbol, 68 Quotient, 110, 185 Quotient set, 172 Range, 162, 208 Rational numbers, 254 Ray, 120 Real number, 276 Real-valued function, 193 Recursion, 242 transfinite, 294 Recursive definition, 6, 242 Reduct, 352 Reductio ad absurdum, 47 Reflexive, 169 Regular cardinal, 328 Regularity axiom, 234 Relation, 161 antisymmetric, 177 asymmetric, 177 binary, 161 inverse, 166 irreflexive, 177 reflexive, 169 symmetric, 169 transitive, 169 unary, 161 Relation on, 161 Relation symbol, 68 Relative consistency, 407 Remainder, 110, 185 Replacement axiom, 230 Replacement rule, 34 Restriction, 197 Right ideal, 368 Right inverse, 216 Right zero divisor, 356 Ring, 354 commutative, 354 division, 357 field, 357 integral domain, 356 principal ideal domain, 372 skew field, 357 with unity, 354 Ring axioms, 353 Ring theory, 69 Roster, 118 Roster method, 118 Russell’s paradox, 225 Russell, Bertrand, 225 Satisfiable, 340, 352 Satisfy, 65, 338, 339 Scheme, axiom, 228 Schröder, Ernst, 301 Scope, 77 Second-order alphabet, 73 formula, 73 Second-order logic, 96 Selector, 231 Self-evident, 24 Semantically invalid, 21 Semantically valid, 21, 22 Semantics, 21 Sentence, 83 Separation axioms, 228 Set, 117 hereditary, 235 Set difference, 127 Set operation, 126 Set theory, 68 441 442 INDEX Short rule of biconditional proof, 108 Signature, 334 Simple group, 363 Simplification, 25 Simultaneous substitution, 81 Singleton, 118 Singular cardinal, 328 Skew field, 357 Skolem’s theorem, 416 Skolem, Thoralf, 229, 234, 415 Sound, 56, 57, 398 Soundness theorem, 57, 398 Square of opposition, modern, 86 Standard model, 411 Standard number, 411 String, 5 Strong induction, 268 Strongly inaccessible, 332 Structure, 333 pureley relational, 337 Subgroup, 363 improper, 363 proper, 364 trivial, 363 Subject, 64 Subproof, 46 Subring, 366 improper, 366 proper, 366 trivial, 366 Subset, 135 Subset axioms, 229 Substitution, 64, 75, 78 simultaneous, 81 Substructure, 361 elementary, 389 Successor, 237 Successor cardinal, 314 Successor ordinal, 290 Sufficient, 4, 109 Surjection, 208 Symmetric, 169 Symmetric closure, 177 Symmetric difference, 236 Syntactically invalid, 21 Syntactically valid, 21, 27 Syntax, 23 Tarski, Alfred, 336, 415 Tarski–Vaught theorem, 390 Tautology, 16 Tautology rule, 35 Term, 70 Theorem, 20, 27 Theory, 395 Theory symbol, 68 Tower, 233 Transcendental, 315 Transfinite induction, 283, 291 Transfinite recursion, 294 Transitive, 169 Transitive closure, 419 Transitive set, 239 Trichotomy, 183, 288 weak, 179 Trichotomy law, 183 Trivial subgroup, 363 Trivial subring, 366 True, 3 Truth table, 11 Truth value, 3 Two-column proof, 27 Unary, 68 function, 189 relation, 161 Undecidable, 313 Undefined, 190 Uniformly continuous, 94 Union, 126, 151 Union axiom, 228 Unique, 104 Unique factorization theorem, 271 Unit, 357 Unity, 354 Universal formula, 72 Universal generalization, 88 Universal instantiation, 87 Universal proof, 98 Universal proposition, 66 Universal quantifier, 66 Universe, 97, 141 Upper bound, 180 least, 180 Upward Löwenheim–Skolem theorem, 416 Urelement, 417 Valid, 347 semantically, 21, 22 syntactically, 21, 27 Valuation, 10, 13 Valuation patterns, 15 Variable, 67 propositional, 6 Variable symbol, 67 Vaught, Robert, 336 Venn diagram, 127 Venn, John, 127 Vertical line test, 189 Von Neumann hierarchy, 417 INDEX Von Neumann, John, 234, 238, 417 Weak-trichotomy law, 179 Weakly inaccessible, 331 Well-defined, 191 Well-ordered set, 183 Well-ordering theorem, 293 Whitehead, John H. C., 424 Whitehead problem, 424 Witness, 91 Witness set, 399 Write, 23 Zermelo axioms, 235 Zermelo’s axiom, 231 Zermelo’s theorem, 295 Zermelo, Ernst, 225, 234, 293, 417 Zermelo–Fraenkel axioms, 235 Zero, 102, 110 Zero divisor, 356 left, 356 right, 356 Zero map, 376 Zero matrix, 355 Zorn’s lemma, 232 Zorn, Max, 232 443 WILEY END USER LICENSE AGREEMENT Go to www.wiley.com/go/eula to access Wiley’s ebook EULA.