Standard Properties (Axioms) of the number system Natural numbers The basis of all number systems is the natural numbers Þ = {1, 2, 3, 4, ... }. This is also the simplest example of an infinite set. The formal construction of this set according to axioms of set theory is the province of mathematics and logic (but note we need a special axiom to guarantee the existence of an infinite set). Once we have the set sensibly defined it is quite easy to construct the usual operations: +, * and verify they satisfy the standard properties: a+b=b+a a*b = b*a æ a, b 0 Þ. (commutative) æ a, b 0 Þ. (commutative) (a + b) + c = a + (b + c) (a*b)*c = a*(b*c) æ a, b, c 0 Þ. (associative) æ a, b, c 0 Þ. (associative) a*(b + c) = a*b + a*c æ a, b, c 0 Þ. (distributive) Ordering. There exists an order relation < on Þ which satisfies x < y or x = y or x> y æ x, y 0 Þ One important property of natural numbers which can be derived from the axioms of set theory is Well ordering. Every non-empty subset S of Þ has a minimum element which is in S. This may seem fairly obvious but it is not true for the rationals for example. The well ordering property is closely related to the Principle of induction. If P is a proposition about natural numbers such that P(1) is true and P(k) implies P(k+1) for all k 0 Þ then P(n) is true æ n 0 Þ. Example. Let P(n) be the proposition that the sum of the first n natural numbers is ½n(n+1). Then 1. The proposition is true for n = 1, because 1 = ½.1.2. 2. If 1 + 2 + ...+ k = ½k(k+1) then (by adding k+1 to both sides) 1 1 + 2 + ... k + (k+1) = ½k(k+1) + (k+1) and ½k(k+1) + (k+1) = ½(k+1)(k+2), which is the formula with n = k+1. Hence P(k) imples P(k+1). Hence, by the principle of induction, P(n) is true æ n 0 Þ. Exercise (Binomial theorem). Using the result n r % n r&1 ' n%1 r prove by induction on n that n n&1 a b % 1 (a % b)n ' a n % % n n&2 2 a b % ... % b n 2 for any real a, b and natural number n. Properties of natural numbers The additive identity 1 can be used to generate Þ additively, 1 + 1 = 2, 2 + 1 = 3, 3 + 1 = 4, etc. But note that we can't do this for multiplication. Subtraction. If a, b 0 Þ and a > b then ý a unique x 0 Þ such that b + x = a, but no such x exists if a # b. Divisibilty. We say a divides b (often written a * b) if ý c 0 Þ such that b = a*c. Note. 1 divides every a 0 Þ since one can show from the axioms that a = 1*a æ a 0 Þ. For the same reason every natural number divides itself. Primes. A natural number p 0 Þ, p > 1, is called prime if the only natural numbers dividing p are 1 and p. E.g. 2, 3, 5, 7, 11, 13, 17, 19, ... are primes. Theorem. Every natural number n 0 Þ is uniquely representable as a product of primes e e n ' p1 1 p2 2 ... ph eh (3) The remainder theorem We mention in passing the following useful theorem regarding natural numbers (and more generally integers) Theorem. For any pair of natural numbers x, y with 1 # x # *y* there exist a unique pair (m, r) such that y = m x + r, and 0 # r < x. 2 We can use the remainder theorem to construct a simple algorithm for constructing the highest common factor of two integers (Euclid’s algorithm). [See the Mathematica™ file euclid.ma] Demonstration Euclid’s algorithm To find the HCF of 10946 and 6765. 10946 = 1*6765 + 4181 6765 = 1*4181 + 2584 4181 = 1*2584 + 1597 2584 = 1*1597 + 987 1597 = 1*987 + 610 987 = 1*610 + 377 610 = 1*377 + 233 377 = 1*233 + 144 233 = 1*144 + 89 144 = 1*89 + 55 89 = 1*55 + 34 55 = 1*34 + 21 34 = 1*21 + 13 21 = 1*13 + 8 13 = 1*8 + 5 8 = 1*5 + 3 5 = 1*3 + 2 3 = 1*2 + 1 2 = 2*1 + 0 Conclusion: the HCF is 1. Integers The integers Z = { ..., -3, -2, -1, 0, 1, 2, 3, ...} are constructed as an extension of the natural numbers Þ so that: æ a, b 0 Z ý x 0 Z such that b + x = a Note. Given the natural numbers the formal construction of the integers is quite straight forward. This idea of embedding one number system into a bigger one which has nicer properties is one we shall see again when embedding the real numbers within the complex numbers. However, it can only be taken so far. The invention of zero (i.e. that 0 should be considered as a number) was a huge conceptual leap. It is called the additive identity because 0 + a = a æ a 0 Z. We identify that subset of the constructed set of integers which corresponds to the natural numbers with the natural numbers. So that it is OK to write ÞfZ 3 Ordering. We can do this extension from Þ to Z in such a way that the natural ordering on Þ extends nicely to Z. So, as you might expect ...-2 < -1 < 0 < 1 < 2 etc. Identities. In Z we have addition and multiplication. We also have 0 + a = a æ a 0 Z and 1*a = a æ a 0 Z Thus Z has both an additive identity and a multiplicative identity. Rings The set of integers form a simple example of a ring. A ring R is an algebraic structure which has: Commutative and associative addition +. Commutative and associative multiplication *. Multiplication is distributive over addition. ý 0 0 R such that a + 0 = a æ a 0 R. ý 1 0 R such that a*1 = a æ a 0 R. æ a 0 R ý a’ 0 R such that a + a’ = 0. This says there always exists an additive inverse. Note. In a ring there is no guarantee of a multiplicative inverse. i.e. we cannot always solve a*x = b for some x 0 R. Representation of integers. We generally use the decimal system to represent integers (and numbers in general) so for example 345 = 3*100 + 4*10 + 5 = 3*102 + 4*101 + 5*100 The reason we use 10 as the base to which our numbers are represented is probably because we have 10 fingers, but we can represent numbers using any natural number base. In particular for computers it is particularly convenient to represent integers (numbers) to base 2, i.e. in binary. 0, 1, 10, 11, 100, 101, ... Note. An integer has a unique logical identity which is the same no matter how it is represented. Thus the set Þ = {1, 2, 3, ... } = {1, 10, 11, ...} where it is understood that the first description is in decimal and the second is in binary but that the object being described is the same in both cases. 4 Rational numbers. It would be nice to have multiplicative inverses. We extend the ring Z to a set of numbers which allows us to solve q*x = p, when q 3 0. The idea is to give a solution x = p/q. The technical problem is that p/q does not yet have a meaning. We define Q (the set of rationals) as the set of all ordered pairs (p, q) with q 0 Þ (notice that this means q > 0) and p 0 Z. Technically this is not difficult, and so we arrive at the rational numbers (fractions) which form a field, which is a ring with the additional property that every element other than the additive identity (zero) has a unique multiplicative inverse, i.e. q*x = p is soluble in Q for any q 3 0. Equality. We define (p, q) = (r, s) if and only if p*s = r*q 0 Z. Ordering. We define (p, q) < (r, s) if and only if p*s < q*r. Addition. We define addition in Q by (p, q) £ (r, s) = (p*s + r*q, q*s) 0 Q Multiplication. We define multiplication in Q by (p, q) ¡ (r, s) = (p*r, q*s) Question. What are the additive and multiplicative identities in Q? Properties of rational numbers. With these definitions we can then check: Exercise. Show that addition £ and multiplication ¡ in Q are commutative, associative and that ¡ is distributive over £. Once these definitions have been checked we can make a natural embedding of a 0 Z <-> (a, 1) 0 Q. The ordering on Q naturally extends the ordering of Z. If we then identify Z with its identical image Z´ = {(a, 1) 0 Q : a 0 Z} we can then sensibly abuse language and write Z f Q. So that ÞfZfQ Once all these details are taken care of we can drop the circles around addition, multiplication and equals. Cardinality One way to tell if two sets have the same number of elements is to ‘pair elements off’ i.e. set up a 1-1 correspondence. If we can do this we say the two sets have the same cardinality. In fact this is the most primitive way to count. 5 If two finite sets A, B have the same cardinality then they have the same number of elements, n say. We can write Card(A) = Card(B) = n 0 Þ. Theorem. If A and B are finite sets, if A f B and Card(A) = Card(B) then A = B. Note. It is a rather counter intuitive fact that this theorem is false for infinite sets. Definition. If there is a 1-1 correspondence between a set S and Þ we say that S has cardinality )0 (aleph zero, aleph is the first letter of the Hebrew alphabet). If S has cardinality )0 we say S is countable. E.g. The most primitive of all infinite sets is Þ. Now consider S = {a2 : a 0 Þ} = {1, 4, 9, 16, ...}. This is an infinite set and because we can set up the 1-1 correspondence a2 0 S <-> a 0 Þ it follows Card(S) = Card(Þ) = )0. But obviously S 3 Þ. Theorem. Card(Z) = )0, i.e. the integers are countable. Proof. Consider the 1-1 correspondence ... &4, &3, &2, &1, 0, 1, 2, 3, 4, ... ... 2, 4, 6, 8, ... 9, 7, 5, 3, 1, which establishes the assertion. It is a most remarkable and counter intuitive fact that Theorem (Cantor). Card(Q) = )0, i.e. the rationals are countable. Proof. Exercise. It begins to look as if all infinite sets have cardinality )0, i.e. are countable. In fact this is not true (as was also proved by Cantor); not all infinities are the same! We shall see why later. Fields The set of rationals form a simple example of a field. There are other systems which form rings or fields. Example. The ring of integers (mod n). Write Zn = {0, 1, 2, 3, ... , n-1}. Any integer x when divided by n leaves one of these k (say) numbers as a remainder. Under these circumstances we write x / k (mod n). We can define addition £ in Zn by writing a £ b = c whenever a, b 0 Zn and a + b / c (mod n), where c 0 Zn. Of course we need to check commutativity and associativity, but this turns out to 6 be easy because these properties are inherited from Z. Similarly, we can define a ¡ b = c to mean a * b / c (mod n), where c 0 Zn and again check that the new multiplication is commutative and associative. We finally have to check that ¡ is distributive over £. Again this poses no problem. Thus Zn forms a ring. It is an interesting question to ask: When does Zn form a field? I.e. When do we also have multiplicative inverses for all non-zero elements? Example. Consider Z5 which we shall write as F5 = {0, 1, 2, 3, 4} In F5 we have 3 £ 4 = 2 because 3 + 4 = 7 and 7 / 2 (mod 5). Similarly, 3 ¡ 4 = 2 because 3 * 4 = 12 and 12 / 2 (mod 5). Notice that 2 ¡ 3 = 1, 3 ¡ 2 = 1, and 4 ¡ 4 = 1 in Z5 so that every non-zero element has a multiplicative inverse. I.e. F5 is a field. [See the Mathematica™ file finflds.ma] Finite Fields. Theorem. For any prime p 0 Þ we have Fp = Zp = {0, 1, 2, ..., p-1} is a field. In particular when p = 2 we have the important field F2 = {0, 1}, with multiplication and addition tables Table 1 Addition in F2. Table 2 Multi-plication in F2. £ 0 1 ¡ 0 1 0 0 1 0 0 0 1 1 0 1 0 1 7 Giving a mathematical structure to the set {0, 1} allows us to develop useful ideas in Computer Science. For example, in order to detect errors in data stored on magnetic media we might use n + 1 bits to store n bits with the extra bit bn+1 defined in terms of b1, ..., bn as bn+1 = b1 £ b2 £ ... £ bn E.g n = 7 = 1110011 could be stored as 11100111 This enables a 1-bit error check on any data. Real numbers So far we have encountered sets Þ f Z f Q. It was known to the Greeks that whilst a square of area 2 was a perfectly reasonable object it’s side length was not rational. Theorem. %2 is not rational. Proof. Suppose that %2 is rational, i.e. suppose 2 ' p q where p and q > 0 have no common factor other than 1. Then 2 ' p2 q2 i.e. p 2 ' 2(q 2 But this means p2 is even. Hence p is even (why?). Thus we can write p = 2*r, where r 0 Þ. Substituting gives 4r 2 ' 2q 2 i.e. 2r 2 ' q 2 Hence q2 is even, so that q is even. But this is a contradiction against the fact that p and q have no common factor. Hence the original hypothesis that %2 is rational must be false. %2 is an example of a real number which is not rational but this begs the question: what is a real number? Up to now we have extended the number system by essentially saying something like.. "Within this number system we cannot solve a certain type of equation, so let us try to extend the system to a bigger one which includes the solution to the equations that are not soluble within the present number system." This approach is called building an algebraic extension. Thus we might try to add the ‘solution’ to x2 = 2 to the rationals and get an extension, say Q(%2) which covered this case. But there are several problems with this approach. 8 E.g. %3, %5, ... are not rational and are not in Q(%2) (whatever that is exactly). We should need to do this kind of extension an infinite number of times! Even if we did that unfortunately, we still cannot get the reals from the rationals by using an approach like this. Dedekind was the first to perform a careful construction of the reals from the rationals (he was teaching an introductory undergraduate course and realised that no one had ever done it convincingly!).The construction is not terribly difficult but it is a bit finicky and more the kind of thing that Mathematics students have to do for the good of their soul. The basic idea is to define any ‘point gap’ in the ‘real line’ to be an equivalence class of sequences of rationals which ‘pile up’ at the gap. r1 r2 r3 r4 r5 r6 q6 q5 q4 q3 q2 q1 ε Figure 1 Defining real numbers in terms of sequences of rationals. The essential requirement for two sequences of rationals r1, r2, ... and q1, q2, ... to be equivalent is that given any number > 0 we can make the difference *rn - qm* < simply by making both n and m sufficiently large. We denote the set of real numbers by ß. The glb/lub principle for real numbers The essential property possessed by the reals (but not by the rationals) can be summed up as follows: Let S be a non-empty bounded subset of the reals. We define the greatest lower bound (glb) of S to be the real number L such that (i) L # x æ x 0 S (L is a lower bound) (ii) If y > L then ý x 0 S such that x < y (L is the greatest possible lower bound). In a similar way we can define the least upper bound (lub). Theorem. For every non-empty set S of real numbers lub(S) and glb(S) are real numbers. This theorem (which we shall not attempt to prove) says essentially that ‘the reals haven't got any holes’. Since that was how we were trying to construct the reals this is a reassuring result. This is a very useful property which one can use in proving other important theorems. 9 With some careful construction we can define reals with addition, multiplication and ordering so that: Addition and multiplication are commutative and associative. Multiplication is distributive over addition. The reals form a field. There is a natural embedding of Q in ß, which is consistent with addition and multiplication in Q, and the ordering of ß preserves the ordering of Q. Thus we can sensibly abuse language by writing ÞfZfQfß Notice that, unlike the algebraic constructions of the integers and the rationals, the construction of the reals involved a notion of nearness (a so called topological notion). The power of the formal approach can be used to prove things from the axioms that would otherwise have to be taken as obvious - sometimes that which may seem obvious is actually wrong! Theorem. 0*x = 0 æ x 0 ß. Proof. By definition of zero 0 + x = 0 for all x. In particular 0 + 0 = 0. Now multiplying both sides by x we have 0*x + 0*x = 0*x and 0(x % 0(x ' 1((0(x) % 1((0(x) (multiplicative identity) ' (1 % 1)((0(x) (distributive) ' 2((0(x) (definition of 2) Thus 0*x = 2*(0*x) and so adding the additive inverse of 0*x to both sides we obtain 0 = 0*x. Theorem. (-1)*(-1) = 1. Proof. We start from the observation that because -1 is by definition the additive inverse of 1 (which is the multiplicative identity) we have 1 + (-1) = 0. Now we have for any x 0 ß 0(x ' (1 % (&1))(x (definition of 0) ' x((1 % (&1)) (multiplication is commutative) ' x(1 % x((&1) (distributive) ' x % x((&1) ' 0 (previous theorem) Hence x*(-1) must be the additive inverse of x i.e. x*(-1) = -x for any x 0 ß. 10 But -(-x) + (-x) = 0 by definition of additive inverse. Hence (adding x to both sides) -(-x) = x æ 0 ß. Thus (putting x = 1) we have -(-1) = 1. Now putting x = -1 in x*(-1) = -x we get (-1)*(-1) = -(-1) = 1, as required. Representation of real numbers The thing to grasp about real numbers is that in general any one of them requires an infinite number of natural numbers (or integers, or rationals) to specify it. In fact )0 bits are required. The most convenient way to represent a real is by using an infinite sequence of rationals. We can do this in many ways but to avoid confusion it is best to fix on a standard way. Thus we usually represent a real in [0, 1] by (0 # ai # 9) 0.a1a2a3... which means a1 101 % a2 102 % a3 103 % ... although notice that this might involve an infinite sum(or, as it is called later, an infinite series) and thus poses a question of interpretation - which we deal with much later. This notation is an extension (to the right of the decimal point) of the decimal representation for integers. Of course we can also use base 2 to represent reals. Rational numbers have a decimal expansion which either terminates or recurs. E.g. 1/2 = 0.5 or 0.499999... 1/3 = 0.333333.... Facts about real numbers. The definition of a real number involves an infinite process (of successive approximation). The set of real numbers is not just all the solutions of various algebraic equations such as anx n % an&1x n&1 % ... % a0 ' 0 where the ai 0 Z or Q. Any number which satisfies such an equation is called an algebraic number. Examples of algebraic numbers are %2 (which is real) and %(-1) (which is not real). There are also numbers (a huge infinity of them) which are real but not algebraic. Examples of such numbers are = 3.14159... and e = 2.7182818... (it is actually quite hard to prove that any particular number is not algebraic). 11 Theorem. The algebraic numbers are countable. Real numbers which are not algebraic are called transcendental. Transcendental and algebraic numbers have infinite decimal expansions which do not terminate and do not recur. E.g. = 3.14159... or %2 = 1.4142135... Since there is generally no simple way to predict the decimal expansion of a given real number, if we want to know one to a given precision we have to develop ways of calculating it (i.e. algorithms). The algorithm terminates when we have computed the number to a specified precision. E.g e ' 1 % 1 1 1 % % % ... 1! 2! 3! where n! = n*(n-1)*(n-2)*...1 for n 0 Þ We have the following remarkable theorem. Theorem. (Cantor). The reals are not countable, i.e. cannot be put into 1-1 correspondence with the natural numbers. Proof. Suppose the reals between [0, 1] were countable. Let an enumeration of all of them in decimal notation be 0.a11a12a13a14a15 0.a21a22a23a24a25 0.a31a32a33a34a35 0.a41a42a43a44a45 ... ... ... ... ... where the convention is that any decimal ending in an infinite string of 9’s is replaced by one ending in an infinite string of 0’s (this is just to avoid ambiguity). Now consider the number x = 0.b1b2b3...where bi is defined by bi ' 1, if aii ' 0, 0, if 1 # aii # 9 Now we can be sure of three things: 12 x is a real number in [0, 1] x does not end in a string of 9’s x is not contained in the enumeration But this is a contradiction against the fact that the enumeration was supposed to include all reals in [0, 1] whose decimal expansion does not end in an infinite string of 9’s. Hence there can be no such enumeration. Hence the reals are not countable. We denote Card(ß) by c. We have established that in some sense )0 < c. Thus not all infinities are the same. In fact there are infinitely many different kinds! Powers We make the following definition for powers Definition. For real a > 0 and x 0 ß we write a x ' Exp[xLoge[a]] This all looks a bit dodgy because we have not yet defined Exp, Log or e. We shall see later that Exp is defined by an infinite series, Log is defined as the inverse function of Exp and e is just Exp[1] . 2.7182818284... To arrive at the ordinary laws of powers, such as a x ( a y ' ax % y we need the basic properties of Exp, in particular we need Exp[a]*Exp[b] = Exp[a + b], so that a x ( a y ' Exp[x(Log[a]] ( Exp[y(Log[a]] ' Exp[x(Log[a] % y(Log[a]] ' Exp[(x % y)(Log[a]] ' ax % y How these basic properties of the exponential function follow from its definition as a power series will be sketched later in Analytic Math II. At this point we note that there is no problem with positive or negative integral powers - we can just define these in the ordinary way, e.g. a2 = a*a. (So we shall continue to use these kinds of powers without comment.) For these cases the ordinary laws of powers obviously hold, and we note that a0 = 1 for any a > 0. 13