SCHAUM’S OUTLlNE OF THEORY AND PROBLEMS OF FINITE MATHE RIATIC S 0 BY SEYMOUR LIPSCHUTZ, Ph.D. Associate Professor of Mathematics Temple University 0 SC€IAUMfS OUTLINE SERIES McGRAW-HILL BOOK COMPANY New York, St. Louis. San Francisco, Toronto, Sydne) Copyright @ 1966 b j McGraw-Hill, Inc. All Rights Reserved. Printed in the United States of America. No p a r t of this publication may be reproduced. stored in a rerrieval sy%tem, or transmitted, in any form or by any means. electronic, mechanical, photocopying, recording. or otherwise, without the prior written permissioii of the imblisher. 579hT 6 7 8 9 1 0 SHSH 7 5 4 3 2 1 0 Finite mathematics has in recent years become an integral part of the mathematical background necessary for such diverse fields as biology, chemistry, economics, psychology, sociology, education, political science, business and engineering. This book, in presenting the more essential material, is designed for use as a supplement to all current standard texts or as a textbook for a formal course in finite mathematics. The material has been divided into twenty-five chapters, since the logical arrangement is thereby not disturbed while the usefulness as a text and reference book on any of several levels is greatly increased. The basic areas covered are: logic; set theory; vectors and matrices: counting - permutations, combinations and partitions; probability and Markov chains; linear programming and game theory. The area on vectors and matrices includes a chapter on systems of linear equations; it is in this context that the important concept of linear dependence and independence is introduced. The area on linear programming and game theory includes a chapter on inequalities and one on points, lines and hyperplanes; this is done to make this section self-contained. Furthermore, the simplex method is given for solving linear programming problems with more than two unknowns and fo r solving relatively large games. In using the book it is possible to change the order of many later chapters or even t o omit certain chapters without difficulty and without loss of continuity. Each chapter begins with a clear statement of pertinent definitions, principles and theorems together with illustrative and other descriptive material. This is followed by graded sets of solved and supplementary problems. The solved problems serve to illustrate and amplify the theory, bring into sharp focus those fine points without which the student continually feels himself on unsafe ground, and provide the repetition of basic principles so vital t o effective learning. Proofs of theorems and derivations of basic results are included among the solved problems. The supplementary problems serve as a complete review of the material in each chapter. More material has been included here than can be covered in most first courses. This has been done t o make the book more flexible, to provide a more useful book of reference and to stimulate further interest in the topics. I wish to thank many of my friends and colleagues, especially P. Hagis, J. Landman, B. Lide and T. Slook, for invaluable suggestions and critical review of the manuscript. I also wish to express my gratitude t o the staff of the Schaum Publishing Company, particularly to N. Monti, for their unfailing cooperation. S. LIPSCHUTZ Temple University June, 1966 CONTENTS Page Chapter 1 PROPOSITIONS AND TRUTH T A B L E S . . .................... Statements. Compound statements. Conjunction, p Negation, -p. Propositions and truth tables. Chapter 2 A q. 1 Disjunction, p v q. ALGEBRA OF PROPOSITIONS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Tautologies and contradictions. Logical equivalence. Algebra of propositions. Chapter 3 CONDITIONAL STATEMENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Conditional, p Chapter 4 + q. 18 Biconditional, p ++ q. Conditional statements and variations. ARGUMENTS, LOGICAL IMPLICATION ..................... 26 Arguments. Arguments and statements. Logical implication. Chapter 5 S E T THEORY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 Sets and elements. Finite and infinite sets. Subsets. Universal and null sets. Set operations. Arguments and Venn diagrams. Chapter 6 PRODUCT SETS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 Ordered pairs. Product sets. Product sets in general. Truth sets of propositions. Chapter 7 RELATIONS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58 Relations. Relations a s sets of ordered pairs. Inverse relation. Equivalence relations. Partitions. Equivalence relations and partitions. Chapter 8 FUNCTIONS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 Definition of a function. Graph of a function. Composition function. One-one and onto functions. Inverse and identity functions. Chapter 9 VECTORS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 Column vectors. Vector addition. Scalar multiplication. Row vectors. Multiplication of a row vector and a column vector. Chapter 10 MATRICES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Matrices. Matrix addition. Scalar multiplication. Matrix multiplication. Square matrices. Algebra of square matrices. Transpose. 90 CONTENTS Page Chapter II LINEAR EQUATIONS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Linear equation in two unknowns. Two linear equations in two unknowns. General linear equation. General system of linear equations. Homogeneous systems of linear equations. Chapter 12 DETERMINANTS OF ORDER TWO AND THREE. . . . . . . . . . . . 127 Introduction. Determinants of order one. Determinants of order two. Linear equations in two unknowns and determinants. Determinants of order three. Linear equations in three unknowns and determinants. Invertible matrices. Invertible matrices and determinants. Chapter 13 THE BINOMIAL COEFFICIENTS AND THEOREM . . . . . . . . . . 141 Factorial notation. Binomial coefficients. angle. Multinomial coefficients. Chapter 14 Binomial theorem. Pascal's tri- PERMUTATIONS, ORDERED SAMPLES .................... 152 Fundamental principle of counting. Permutations. Permutations with repetitions. Ordered samples. Chapter 15 COMBINATIONS, ORDERED PARTITIONS . . . . . . . . . . . . . . . . . 161 Combinations. Chapter 16 Partitions and cross-partitions. Ordered partitions. TREE DIAGRAMS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176 Tree diagrams. Chapter 17 PROBABILITY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184 Introduction. Sample space and events. Finite probability spaces. Equiprobable spaces. Theorems on finite probability spaces. Classical birthday problem. Chapter 18 CONDITIONAL PROBABILITY. INDEPENDENCE . . . . . . . . . 199 Conditional probability. Multiplication theorem for conditional probability. Finite stochastic processes and tree diagrams. Independence. Chapter 19 INDEPENDENT TRIALS, RANDOM VARIABLES. . . . . . . . . . . 216 Independent or repeated trials. Repeated trials with two outcomes. Random variables. Probability space and distribution of a random variable. Expected value. Gambling games. Chapter 20 MARKOV CHAINS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233 Probability vectors. Stochastic and regular stochastic matrices. Fixed points of square matrices. Fixed points and .regular stochastic matrices. Markov chains. Higher transition probabilities. Stationary distribution of regular Markov chains. Absorbing states. Chapter 21 INEQUALITIES .............................................. The real line. Positive numbers. Order. (Finite) Intervals. Infinite intervals. Linear inequalities in one unknown. Absolute value. 257 CONTENTS Page Chapter 22 POINTS, LINES AND HYPERPLANES.. . . . . . . . . . . . . . . . . . . . . . 266 Cartesian plane. Distance between points. Inclination and slope of a line. Lines and linear equations. Point-slope form. Slope-intercept form. Parallel lines. Distance between a point and a line. Euclidean nz-space. Bounded sets. Hyperplanes. Parallel hyperplanes. Distance between a point and a hyperplane. Chapter 23 CONVEX SETS AND LINEAR INEQUALITIES. . . . . . . . . . . . . . 281 Line segments and convex sets. Linear inequalities. Polyhedral convex sets and extreme points. Polygonal convex sets and convex polygons. Linear functions on polyhedral convex sets. Chapter 24 LINEAR PROGRAMJIIKG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 300 Linear programming problems. Dual problems. Matrix notation. Introduction t o the simplex method. Initial simplex tableau. Pivot entry of a simplex tableau. Calculating the new simplex tableau. Interpreting the terminal tableau. Algorithm of the simplex method. Chapter 25 THEORY OF GAMES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 321 Introduction to matrix games. Strategies. Optimum strategies and the value of a game. Strictly determined games. 2 X 2 matrix games. Recessive rows and columns. Solution of a matrix game by the simplex method. 2 X m and i i i x 2 matrix games. Summary. INDEX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 337 Chapter I Propositions and Truth Tables STATEMENTS A statement (or verbal assertion) is any collection of symbols (or sounds) which is either true or false, but not both. Statements will usually be denoted by the letters P, (1, r, ... The truth or falsity of a statement is called its truth value. Example 1.1: Consider t h e following expressions: (i) P ar i s is in England. (ii) 2 +2 = 4 (iii) Where a r e you going? (iv) P u t the honiework on the blackboard. 6) are statements; the first is false and the second is true. The expressions (i) and The expressions (iii) and [iv) a r e not statements since neither is either true o r false. COMPOUND STATEMENTS Some statements are composite, that is, composed of substatements and various logical connectives which we discuss subsequently. Such composite statements are called compound statements. Example 2.1: “Roses a r e red and violets a r e blue” is a compound statement with substatements “Roses a r e red” and “Violets a r e blue”. Example 2.2: “He is intelligent o r studies every night” is, implicitly, a compound statement with substatements “He is intelligent” and “He studies every night”. The fundamental property of a compound statement is that its truth value is completely determined by the truth values of its substatements together with the way in which they are connected to form the compound statement. We begin with a study of some of these connectives. CONJUNCTION, p A Q Any two statements can be combined by the word “and” to form a compound statement called the conjunction of the original statements. Symbolically, P A 9 denotes the conjunction of the statements p and q, read “ p and q”. Example 3.1: Let p be “It is raining” and let q be “The sun is shining”. Then p A q denotes the statement “It is raining and the sun is shining”. The truth value of the compound statement p ~ satisfies q the following property: [TI] If p is true and Q is true, then p A q is true; otherwise, 11 A q is false. In other words, the conjunction of two statements is true only in the case when each substatement is true. 1 PROPOSITIONS AND TRUTH T A B L E S 2 Example 3.2: [CHAP. 1 Consider the following four statements: (i) +2 2 +2 2 +2 2 Paris is in France and (ii) P a r i s is in France and (iii) P a r i s is in England and (iv) Paris is in England and 2 = 4. = 5. = 4. - 2 = 5. By property [Ti], only the first statement is true. Each of the other statements is false since at least one of its substatements is false. A convenient way to state property [Ti] is by means of a table as follows: Here, the first line is a short way of saying that if p is true and q is true then p A 4 is true. The other lines have analogous meaning. We regard this table as defining precisely the truth value of the compound statement p A q as a function of the truth values of p and of q. DISJUNCTION, 11 q Any two statements can be combined by the word “or” (in the sense of “and/or”) to form a new statement which is called the disjunction of the original two statements. Symbolically, P”4 denotes the disjunction of the statements p and q and is read ‘ip or q” Example 4.1: Let p be “Marc studied French at the university”, and let q be “Marc lived in France”. Then p v q is the statement “Marc studied French at t h e university o r (Marc) lived in France”. The truth value of the compound statement p v q satisfies the following property: IT,] If p is true or q is true o r both p and q a r e true, then q is true; otherwise p v q 13 is false. Accordingly, the disjunction of two statements is false only when both substatements a r e false. The property LT,] can also be written in the form of the table below, which we regard as defining 11 v q: Example 4.2: Consider the following four statements: +2 +2 2 +2 2+2 P a r i s is in France o r 2 = 4. (ii) P a r i s is in France o r 2 = 5. ii) iiii) P a r i s is in England or iiv) P a r i s is in England o r = 4. = 5. By property [T2], only (iv) is false. Each of the other statements is t r u e since a t least one of its substatements is true. PROPOSITIONS A N D TRUTH T A B L E S CHAP. 11 Remark: 3 The English word “or” is commonly used in two distinct ways. Sometimes it is used in the sense of “ p o r (I or both”, i.e. a t least one of the two alternates occurs, as above, and sometimes it is used in the sense of “ p o r 4 but not both”, i.e. exactly one of the two alternatives occurs. For example, the sentence “He ill go to Harvard o r to Yale” uses “or” in the latter sense, called the e,cclusice disjunction. Unless otherwise stated, “01”’ shall be used in the former sense. This discussion points out the precision we gain from o u r symbolic language: p b (I is defined by its truth table and always means “p andlor q”. NEGATION, -11 Given any statement p , another statement, called the negatioii of 11, can be formed by writing “It is false t h a t . . .” before p or, if possible, by inserting in p the word “not”. Symbolically, -1, denotes the negation of p (read “not 71”). Example 5.1 : Consider the following three statements: ( i ) P a r i s is in France. (ii, I t is false t h a t P a r i s is in France. (iii) P a r i s is not in France. Then (ii) and (iii) a r e each the negation of (i). Example 5.2: Consider the following statements: (i) 2 - 2 = 5 (ii) It is false t h a t 2 - 2 = 5 . + (iii) 2 2 # 5 Then (ii) and (iii) a r e each the negation of (i). The truth value of the negation of a statement satisfies the following property: [T,] If p is true, then - p is false; if 11 is false, then -p is true. Thus the truth value of the negation of any statement is always the opposite of the truth value of the original statement. The defining property IT,] of the connective can also be written in the form of a table: P F I -P T Example 5.3: Consider t h e statements in Example 5.1. Obserre t h a t ti) is t r u e and (ii) and [iii), each its negation, a r e false. Example 5.4: Consider the statements in Example 5.2. Observe t h a t ri) is false and (ii) and (iiii, each its negation, a r e true. PROPOSITIONS AND TRUTH TABLES By repetitive use of the logical connectives (4,v, and others discussed subsequently), we can construct compound statements that are more involved. In the case where the substatements p , Q, . . . of a compound statement P ( p , q, . . .) are variables, we call the compound statement a proposition. Now the truth value of a proposition depends exclusively upon the tru th values of its variables, that is, the truth value of a proposition is known once the tru th values of its variables are known. A simple concise way to show this relationship is through a truth table. The truth table, for example, of the proposition - ( p A -q) is constructed as follows: - [CHAP.1 PROPOSITIONS AND T R U T H TABLES 4 P Q T T T F F I T - (P A - 4) F / F Step F 1’ 4 T T T F F T F F Step - - 4) P F T F F F T F T T F T F T T F F 3 2 1 (P A T T F F 1 F F F T F T F Step T F T T 4 F - 4 F T F F F T F T T F T F 3 2 1 (11 A T T F F 1 5 PROPOSITIONS A N D T R U T H TABLES CHAP. 11 Solved Problems STATEMENTS 1.1. Let p be “It is cold” and let q be “It is raining.” Give a simple verbal sentence which describes each of the following statements: (1) -21, ( 2 ) P A Q, (3) P ” 4, (4)cl ” -21, ( 5 ) ”P A -4, (6) Q. -- - In each case, translate A , v and to read “and”, “or” and “ I t is false t h a t ” o r “not”, respectively, and then simplify the English sentence. (1) It is not cold. (41 It is raining or i t is not cold. (2) It is cold and raining. ( 5 ) I t is not cold and i t is not raining. (3) It is cold or it is raining. ( G I I t is not true t h a t i t is not raining. Let p be “He is tall” and let Q be “He is handsome.” statements in symbolic form using 11 and q . Write each of the following (1) He is tall and handsome. (2) He is tall but not handsome. (3) It is false that he is short or handsome. (4) He is neither tall nor handsome. (5) He is tall, or he is short and handsome. (6) It is not true that he is short or not handsome. (Assume t h a t “He is short” means “He is not tall”, i.e. -11.) (1) P A q (3) -(-P (2) P A -!I (4) -P A v 4) ( 5 ) 1’ @q (6) - ( - P V (“P A 4) ” -q) TRUTH VALUES OF COMPOUND STATEMENTS 1.3. Determine the truth value of each of the following statements. (i) 3 + 2 = 7 and 4 f 4 = 8. (ii) 2 + 1 = 3 and 7 + 2 = 9. (iii) 6 + 4 = 10 and 1 + 1 = 3. By property [TI], the compound statement “ p and q” is t r u e only when p and q a r e both true. Hence: ti, False, iii) True, (iii) False. 1.4. Determine the truth value of each of the following statements. + + + (i) Paris is in England or 3 4 = 7. (ii) Paris is in France or 2 1 = 6. (iii) London is in France or 5 2 = 3 . By property [ T L ] ,the compound statement “ p o r q” is false only when p and q a r e both false. Hence: li) True, (ii) True, (iii) False. 1.5. Determine the truth value of each of the following statements. (i) It is not true that London is in France. (ii) It is not true that London is in England. By property IT,], the t r u t h value of the negation of p is the opposite of the t r u t h value of p . Hence: Ii) True, rii) False. 6 1.6. [CHAP. 1 PROPOSITIONS AND T R U T H T A B L E S Determine the truth value of each of the following statements. (i) It is false that 2 + 2 = 4 and 1 + 1 = 5 . (ii) Copenhagen is in Denmark, and 1 + 1 = 5 or 2 f 2 = 4. (iii) It is false that 2 + 2 = 4 or London is in France. (i) + + + + The conjunctive statement “2 2 = 4 and 1 1 = 5” is false since one of i t s substatements “ I 1 = 5” is false. Accordingly its negation, the given statement, is true. + (ii) The disjunctive statement “1 1 = 5 or 2 2 = 4” is t r u e since one of i t s substatements “2 2 = 4” is true. Hence the given statement is t r u e since it is the conjunction of two t r u e statements, “Copenhagen is in Denmark” and “1 1 = 5 o r 2 2 = 4”. + + + + (iii) The disjunctive statement “ 2 2 = 4 o r London is in France” is t r u e since one of its substatements “2 2 = 4” is true. Accordingly its negation, the given statement, is false. + TRUTH TABLES OF PROPOSITIONS 1.7. Find the truth table of -13 ,. q . 9 - P a - T F T F T T F F T F T F F F T T T T F F F F T F T F T F 2 1 3 1 Step P A P Method 1 1.8. Find the truth table of -(]I v 4). T F T T F F F T F F T F Step Method 2 Method 1 1.9. Find the truth table of - ( p P T T F F I a I -9 I T F T F F T F T PV -a \d) 1 -q). -9) -(?I” T T F F F T T p l q I - (P v - a) F Method 1 1.10. Find the truth table of the following: (i) p A (4 v r ) , (ii) ( p A 4) v ( p A T Since there a r e three variables, we will need 23 = 8 rows in the t r u t h table. ) ~ C'HAP. I] PROPOSITIONS AND TRUTH TABLES P q T T T T I F F T F F T T T F F T ' F Fl q r T T T T T F T F F , T F T F ~ F F T F T F T F F T F P PA(q"r) 7. 7 PA?. i T T T F F F F T T T T F F F F F T T F T T T F MISCELLANEOUS PROBLEMS 1.11. Let A p q denote p / q and let Sp denote -p. using A and N instead of A and -. (9 P A -q (iii) - p (4 -(-P (i) p (ii) -(-p (iii) - p (iv) - ( p = p -q A 4 A A A -q) A 1.) A ~F i F Rewrite the following propositions (-q A (iv) - ( p Q) T F T F F F A A -q) A 1) (-(I A -7,) iYq = A p N q q ) = -(.Vp (-q A T T F F F F F F (-q \ q) = -(ASpq) = NANpq A = Np (Nq A r) = S p A -Y) = -(ApXq) A A ( A N q r ) = ANpA'VqT (ANqSr) = (NApNq) A ( A S q S r ) = ANApNqANqXr Observe t h a t there a r e no parentheses in the final answer when A and .V a r e used instead of A and -. In fact, i t has been proved t h a t parentheses a r e never needed in any proposition using A and N . 1.12. Rewrite the following propositions using (i) A and (i) N A m (iii) ApNq (ii) A N p q (iv) ApAqr NApq = S ( p A q) = - ( p (ii) A N p q = A ( - p ) q = - p A A q) q - instead of A and N. (v) Iv'AANpql. (vi) A S p A q N r (iii) ApNq = A p ( - q ) = p (iv) A p A q r = A p ( q A 7,) A -q = p A (q A r) (v) N A A S p q r = S A A ( - p ) q r = N A ( - p fi q ) r = Nr(-p A q ) / r1 = -:(-p A q ) A r ] (vi) A N p A q S r = A N p A q ( - r ) = A N p ( q A -7) = A ( - p ) ( q A - 1 . ) = -lJ A ( Q A - r ) Notice t h a t the propositions involving A and N a r e unraveled from right to left. Supplementary Problems STATEMENTS 1.13. Let p be "Marc is rich" and let q be "Marc is happy". Write each of the following in symbolic form. (i) Marc is poor but happy. (ii) Marc is neither rich nor happy. (iii) Marc is either rich o r unhappy. (iv) Marc is poor o r else he is both rich and unhappy. 8 1.14. PROPOSITIONS AND TRUTH TABLES [CHAP. 1 Let p be “E r i k reads Newsiccek”, let Q be “Erik reads Life” and le t r be “Erik reads Time”. Write each of the following in symbolic form. (I) iii) Erik reads Newsweek or L i f e , but not Tima. Erik reads Newsweek and Lit’c, o r he does not read N e w s w e e k and T i m e . ( i i i ) I t is not t r u e t h a t Erik reads Newsweek but not Time. (ivi I t is not t r u e t h a t Er i k reads Time or Life but not N e w s w e e k . 1.15. 1.16. Let p be “Audrey speaks French” and let q be “Audrey speaks Danish”. Give a simple verbal sentence which describes each of the following. (i) P v 4 (iii) p (ii) 11 A q (iv) - p v -q A (v) - - p -q (vi) -(-P A -4 Determine the t r u t h value of each of the following statements. (i, 3 f 3 = 6 and 1 + 2 = 5. iiii I t is not t r u e t h a t 3 iiii) It is true t h a t 2 +3 = 6 o r 1 f 2 = 3. + 2 # 4 and 1 + 2 = 3. + 3 # 6 o r 1+ 2 # 5. iiv) It is not t r u e t h a t 3 TRUTH TABLES OF PROPOSITIONS 1.17. Find the t r u t h table of each of the following. ii) p 1.18. iii) - p -q, v A -q, iiii) - ( - p A q ) , (iv) -(-p v -q) Find the t r u t h table of each of the following. (i) ( p A - q ) v r , (ii) - p v ( q A (iii) ( p v -v) -T), A (q v -T), (iv) - ( p v -9) A (-p v 7). MISCELL-4NEOUS PROBLEMS 1.19. Let A p q denote P A q and let S p denote -p. using A and N instead of n and -. (i) -p 1.20. A q, (ii) - p A -q, iiii) -(p A -q), Rewrite the following propositions using (i) N A p N q , (ii) A N A p q N y , (See Problem 1.11.) Rewrite the following propositions (iv) ( - p and A A q) A -T. - instead of A and N . (iii) A A p N r A q N p , (iv) A S A N p A N q r N p . Answers to Supplementary Problems 1.13. [i) - p 1.14. (i) (p v q ) 1.15. (i) A q, (ii) - p A -7, A -q, (iii) p v - q , (ii) ( p A q ) v - ( p A T), (iv) - p v ( p A - q ) (iii) - ( p A -T), (iv) -[(Y v q) A -p] Audrey speaks French o r Danish. (ii) Audrey speaks French and Danish. (iii) Audrey speaks French but not Danish. (iv) Audrey does not speak French or she does not speak Danish. ( v ) I t is not t r u e t h a t Audrey does not speak French. (vi\ I t is not t r u e t h a t Audrey speaks neither French nor Danish. 1.16. (i) F, ( i i ) F, (iiii F, (iv) F -p 1.18. 9 PROPOSITIONS AND TRUTH TABLES CHAP. 11 F F F T F p q )* (i) (ii) T T T T F F F F T T F F T T F F T F T F T F T F T F T T T F T F F T F F T T T T F T A -(-p -q F T A 4) -(-pv F T F F (iii) (ivi T T F T F T F T F F F F T T F F 1.19. (i) Ah'pq, (iii A.YpNq, (iii) N A p N q , (ivi A A S p q N r 1.20. (i) - ( p A -q), (ii) - ( p A -q) q) A -Y, (iii) ( p A -T) A (q A -p), (iv) - [ - p n ( - p A T)] A -p Chapter 2 Algebra of Propositions TAUTOLOGIES AND CONTRADICTIONS Some propositions P ( p , q , . . . ) ca,ntain only T in the last column of their truth tables, i.e. a r e true for any truth values of their variables. Such propositions are called tautologies. Similarly, a proposition P ( p ,(1,. . .) is called a cotitmdiction if it contains only F in the last column of its truth table, i.e. is false for any truth values of its variables. Example 1.1: ' The proposition " p o r not p", i.e. p v -11, is a tautology and the proposition " p and not p", i.e. p -11, is a contradiction. This is verified by constructing their t r u t h tables. 1' 1 -11 P " -p I i l l P;-P ," 1 T F F T ~ TT ~ F ~ T IF Since a tautology is always true, the negation of a tautology is always false, i.e. is a contradiction, and vice versa. That is, Theorem 2.1: If P ( p , q , . . .) is a tautology then -Pip, conversely. (I, , . . ) is a contradiction, and Now let P ( p , (I, . . .) be a tautology, and let P , ( p , q, . . . ), P,(p, q , . . .), . . . be any propositions. Since P ( p ,q , . . . ) does not depend upon the particular truth values of its variables p , 'I, . . ., we can substitute P , for p , P , for q, . . . in the tautology P ( p , q, . . . ) and still have a tautology. In other words: Theorem 2.2 (Principle of Substitution): If P ( p , q , . . .) is a tautology, then P(P,,P,, . . . ) is a tautology f o r any propositions P,, P,~,, . . . LOGICAL EQUIVALENCE Two propositions P(p, q, . . . ) and Q ( p , q , simply equicnleiit o r equal, denoted by P(l4 9 , . . .) a r e said to be logically equivalent, or . 1 = &(P, q, . . . ) * * if they have identical truth tables. Example 2.1: The t r u t h tables of -(]I q i and - p v - q folio\\. T T T F F Accordingly, the propositions - ( p "(p 10 T T A A q ) and -11 v - q 9) -pV -q are logically equivalent: ALGEBRA O F PROPOSITIONS CHAP. 21 11 The statement Example 2.2: “It is false that roses a r e red and violets a r e blue” can be written in the form - ( p A q ) where p is “Roses a r e red” and q is “Violets a r e blue”. B y the preceding example, - ( p A q ) is logically equivalent to - p v - q ; that is, the given statement is equivalent to the statement “Either roses a r e not red or violets a r e not blue.” ALGEBRA OF PROPOSITIONS Propositions, under the relation of logical equivalence, satisfy various laws or identities which are listed in Table 2.1 below. In fact, we formally state: Propositions satisfy the laws of Table 2.1. Theorem 2.3: LAWS OF THE ALGEBRA OF PROPOSITIONS Idempotent Laws = la. p v p lb. p p ~ p = p r = Associative Laws 2a. ip 3a. p 4 q = 4U. p / q) J = d T 2b. p v (q v r ) ( p A q) A p (q A r) /\ Commutative Laws 3b. q v p p =q q ~ ~ p Distributive Laws (4 J ( p V 4) T) A (p 4b. u‘ T ) p A = (q v r ) (p A q) v ( p A r ) Identity Laws 5a. p v f = p 6a. p t = ~ 5b. p 6b. t ~ t e p = f P A f Complement Laws 7a. p v -p 8a. --p 5 = f 7b. p ~ - p f 8b. p --t 3 f, -f t De Morgan’s Laws 9a. -(p V q) 3 -p A 9b. -q -(p A q) = -P” -g Table 2.1 I n the above table, t and f denote variables which are restricted to the truth values true and false, respectively. 12 [CHAP. 2 ALGEBRA O F PROPOSITIONS Solved Problems TAUTOLOGIES AND CONTRADICTIONS 2.1. Verify that the proposition p v - ( P A q ) is a tautology. Construct the t r u t h table of p v - ( p 2.2. 4): T T F T T F F T T F T F T T F F F T T 4 1 A C /\ 4) q ) is T f o r all values of p and q, i t is a tautology. A Construct the t r u t h table of 1 4 "(P T A 4) P q Verify that the proposition ( p A q ) A -(p p "(P p Since the t r u t h value of p v - ( p p A v q ) is a contradiction. ( p A q ) A - ( p v q): P A 4 I P V Q I "(PVq) 1 (PAq)*-(pvQ) T T T F F T F F T F T F F F F F T F F F Since the t r u t h value of ( p + q ) A - ( p v q ) is F for all values of p and q, i t is a contradiction. LOGICAL EQUIVALENCE 2.3. Prove the Associative Law: (11 A q ) A r = p A (4A r). Construct the required t r u t h tables: P - 1' T T T T F F F F Since the t r u t h tables a r e identical, the propositions a re equivalent. 2.4. Prove that disjunction distributes over conjunction; t h a t is, prove the Distributive Law: 11 v ( q A r ) = ( p v q ) A ( p v Y). Construct the required t r u t h tables: ALGEBRA O F PROPOSITIONS CHAP. 21 13 Since the t r u t h tables a r e identical, the propositions a r e equivalent. 2.5. Prove that the operation of disjunction can be written in terms of the operations of conjunction and negation. Specifically, p v (r = -(-?I A -(I). Construct the required t r u t h tables: Since the t r u t h tables a r e identical, the propositions a r e equivalent. 2.6. There are exactly four non-equivalent propositions of one variable; the t r u t h tables of such propositions follow: P 1 Pi(P0) 1 P,(?J) 1 1 p,(P) pi(?,) T F 1 i Find four such propositions. ;; Observe t h a t ~ F Hence 2.7. PI (11) p v -P, P2h) = T PVT-P P y P T F p, P 3 ( p ) = - p , P4(p) = P A -11. Determine the number of non-equivalent propositions of two variables p and 4. The t r u t h table of a proposition P ( p ,q ) will contain 22 = 4 lines. appear as follows: In each line T or F can In other words, there a r e 24 = 16 non-equivalent propositions of two variables p and (I. 14 2.8. ALGEBRA O F PROPOSITIONS Determine the number of non-equivalent propositions of: and 1’; (ii) 72 variables pl, p , , . . . ,p,]. (i) three variables 11, q (i) The truth table of a proposition P ( p , q , r ) will contain 2? = 8 lines. Since in each line T o r F can appear, there a r e 2 8 = 236 non-equivalent propositions of three variables. (iir The truth table of a proposition P ( p l , . . . , p,) will contain Z2” non-equivalent propositions of 71 variables. l l N E GAT1ON 2.9. [CHAP. 2 Prove De Morgan’s Laws: (i) - ( p A (I) = -p v In each case construct the required truth tables. (i’ ~ F F -(?;Y) F F T F T t ip iq T T F T 2l’ (ii) ”(7: lines; hence, as above, there are -(TI q) = -p A -q. -”-‘ t F F T t t - -p = p . 2.10. Verify: 1’ I -P I --P 2.11. TJse the results of the preceding problems to simplify each of the following propositions: (i) -(p (1) -[I1 V (ii) -(I), v -q) (ii) -(-11 A (iii) -(-11 v -9) q) -(-]I r * p A --q a), -1) - -p = p = - -p = = v -a). *Y v -y v -y A (iii) -(-p -q p \ 4 2.12. Simplify each of the following statements. It is not true that his mother is English or his father is French. (ii) It is not true that he studies physics but not mathematics. (iii) It is not true that sales are decreasing and prices are rising. (i) (iv) It is not true that it is not cold or it is raining. (ii Let p denote “His mother is English” and let q denote “His father is French”. Then the given statement is - ( p v q ) . But - ( p v y) = - p A - q . Hence the given statement is logically equivalent to the statement “His mother is not English and his father is not French”. CHAP. 21 15 ALGEBRA O F PROPOSITIONS (ii) Let p denote “He studies physics” and let y denote “He studies mathematics”. Then the given statement is -(PA -4). But - ( p A - q ) = - p v - - y 5 - p v q. Hence the given statement is logically equivalent t o the statement “He does not study physics or he studies mathematics” (iii) Since -()I (11 5 - p v -q, the given statement is logically equivalent to the statement “Sales a l e increasing or prices are falling”. (iv) Since -(-p q ) “It is cold and it = IS -q, the given statement is logically equivalent to the statement not raining”. ALGEBRA OF PROPOSITIONS 2.13. Simplify the proposition ( p v q ) -11 listed on Page 11. by using the laws of the algebra of propositions Statement (1) ( I ] \ 4) = -i PA -11 Reason (1) Commutative law (PV 4) (2) = (-PAP)v(-P (3) = = fv(-p/\q) (3) Complement law -pAq (4) Identity law (4) (2) Distributive law (I) 2.14. Simplify the proposition p v ( p ~ q by ) using the laws of the algebra of propositions listed on Page 11. Statement Reason = ( P A t)v ( P A 4) (2) = p A ( t v q) (2) Distributive law (3) E pAt (3) Identity law (4) = p (1) (11 4) (1) Identity law (4) Identity law 2.15. Simplify the proposition + v q ) propositions listed on Page 11. ( - p ~ q ) by using the laws of the algebra of Statement (1) -(P \ 4 ) ‘4 (‘11 Reason (“PA -q) 4Q ) V (”p A 4) (1) De Morgan’s law -p (3) -pAt (3) Complement law -p (4) Identity law (4) == MISCELLANEOUS PROBLEMS 2.16. The propositional connective “ p or q but not both”. A (-qv q) y (2) is called the exclusive disjunction; p y q is read (i) Construct a truth table for I-, y q. . (ii) Prove: p y q = ( p v q ) ~ - ( p ~ q ) Accordingly the original three connectives A , v and -. (i) Distributive law (2) y can be written in terms of Now p 1 q is true if p is true or if q is true but not if both a r e true: hence the t r u t h table of p y q is as follows: ALGEBRA OF PROPOSITIONS 16 (iii We construct the t r u t h table of ( pv q) A -(p A [CHAP. 2 q ) , by the second method, as follows: P I Y / ( P ” 4) A - (P A n) T F F T F F T T F F T F T T F T T T T F F F F F F T F 1 2 1 4 3 1 2 1 F T F Step 2.17. The propositional connective J, is called the joint denial; p J y is read “Neither p nor q”. (i) Construct a truth table f o r p J q . (ii) Prove: The three connectives V, connective J, as follows: ( a ) -13 (i) A and - may = P J P , ( b ) P A C ? = (PJP)J,(YLY), be expressed in terms of the (c) PJC? = (PL4)J(PJq)* Now p J q is t r u e only in the case tha t p is not true and y is not true; hence the truth table of p y is the following: 1 F ciij ; ; ”;‘ Construct the appropriate truth tables: (a) ;;1 U i ”%” (b) F F T F “ q F F f T T F T (pyqLq) F F t ALGEBRA OF PROPOSITIONS CHAP. 21 17 Supplementary Problems LOGICAL EQUIVALENCE = 2.18. Prove the associative law f o r disjunction: 2.19. Prove t h a t conjunction distributes over disjunction: p 2.20. Prove ( p v y) 2.21. Prove p v ( p A q ) 2.22. Pkove - ( p v q ) v ( - p 2.23. (i) Express (ii) Express 2.25. = -p A = -p p A ( p vy)v T A p v ( q v r) (qv r) = (p A q ) v ( q A r ) . q by constructing the appropriate t r u t h tables fsee Problem 2.13). by constructing the appropriate t r u t h tables (see Problem 2.14). q) = -p \i in ternis of A and -. A in terms of v and -. by constructing the appropriate t r u t h tables isee Problem 2.15). Write the negation of each of the following statements a s simply as possible. (i) H e is tall but handsome. (ii) H e has blond hair o r blue eyes. (iii) He is neither rich nor happy. (iv) He lost his job o r he did not go to mork today. (v) Neither Marc nor Erik is unhappy. (vi) Audrey speaks Spanish o r French, but not German. ALGEBRA OF PROPOSITIOKS 2.26. Prove t h e following equivalences by using the l a ~ of s the algebra of propositions listed on P a g e 11: (i) p ~ ( p v q = ) 11, iii) ( p ~ q ) v - p = - p v q , f i i i ) p ~ ( - p v q )5 p A q . Answers to supplementary Problems = -q), (ii) P A q -q, (iii) p v q. 2.23. (i) p v q 2.24. (i) - p v q , (ii) p 2.25. (iii) He is rich or happy. 2.26. (i) p ~ ( p v q = ) (pvt)r\(pvq) -(-p P = - ( - p v -4). (vi) Audrey speaks German but neither Spanish nor French. = p v ( f (I) =- p f E p. Chapter 3 Conditional Statements CONDITIONAL, p -+ y Many statements, particularly in mathematics, are of the form “If p then q”. statements are called coriditioizal statements and are denoted by Such p+q The conditional p + Q The truth value of p + can also be read: (i) p implies q (iii) (ii) p only if q (iv) (I is necessary f o r p . (I 23 is sufficient f o r (I satisfies: [T,] The conditional p + y is true except in the case that 1-I is true and q is false. The truth table of the conditional statement follows: Example 1.1 : Consider the following statements: If Paris is in France, then 2 1 - 2 = 4. (iij If P a r i s is in France, then 2 + 2 = 5. (iJ (iii) If P a r i s is in England, then 2 + 2 = 4. (iv) If P a r i s is in England, then 2 + 2 = 5. By the property I T J ] , only (ii) is a false statement; the others a r e true. JVe emphasize t h a t , by definition, (iv) is a t r u e statement even though its substatements “Paris is in England” and “ 2 f 2 = 5” a r e false. It is a statement of the type: If monkeys a r e human, then t h e earth is flat. Now consider the truth table of the proposition -11 J q: T T Observe that the above truth table is identical to the truth table of p - + y. is logically equivalent t o the proposition -p v q: p’q Hence 1 1 3 4 = -pvq I n other words, the conditional statement “If p then g” is logically equivalent t o the statement “Not 11 or q” which only involves the connectives \ and and thus was already a part of our language. We may regard p + q as an abbreviation f o r a n oft-recurring statement. - 18 CHAP. 31 19 CONDITIONAL S T A T E M E N T S BICONDITIONAL, p * q Another common statement is of the form ‘‘p if and only if q” o r , simply, “ p iff q“. Such statements, denoted by P f-$ (I are called bicoudifio/ial statements. The truth value of the biconditional statement p e q satisfies the following property: [T5J If p and Q have the same truth value, Then p - q truth values, then p - q is false. is true; if p and (I have opposite The truth table of the biconditional follows: 1 P T T F F Example 2.1 : 1 Q P*9 T F T F T F F T Consider the following statements: Paris is in France if and only if 2 2 = 4. (ii) Paris is in France if and only if 2 2 = 5. (i) (iii) P a r i s is in England if and only if 2 - 2 = 4. (iv) Paris is in England if and only if 2 +2 = 5. By property [T5], the statements (i) and (iv) a r e true, and (ii) and (iii) a r e false. Recall that propositions P ( p , (I, . . . ) and Q ( p , q, . . .) are logically equivalent if and only if they have the same truth table; but then, by property [Ti], the composite proposition P(p, (I,. . . ) cf Q ( p , Q, . . .) is always true, i.e. is a tautology. In other words, Theorem 3.1: P ( p , (I, . . , ) = Q ( p , q, . . .) if and only if the proposition PiPt (I, . .) f is a tautology. Conditional Theorem 3.2: P q P-’9 T T F F T F T F T F T T Converse (I A conditional statement p equivalent. + T T F T -+ P ‘ 3 &(P, q, . . . ) Inverse -P -9 Contrapositive T T F T T F T T + -q + ”11 q and its contrapositive -q + -p are logically [CHAP. 3 CONDITIONAL STA TEMEN TS 20 Example 3.1: Consider the following statements about a triangle A : p q Note t h a t p Example 3.2: + - q: If A is equilateral, then A is isosceles. p: If A is isosceles, then A is equilateral. 3 q is true, but q + p is false. Prove: ( p + q ) If ~2 is odd then x is odd. We show t h a t the contrapositive - q -p, “If x is even then x2 is even”, is true. Let J be even: then x = 2n where is a n integer. Hence x2 = (2n)(2n)= 2 ( 2 4 ) is also even. Since the contrapositive statement -q + - p is true, the original conditional statement p q is also true. -f --f Solved Problems CONDITIONAL 3.1. Let p denote “It is cold” and let q denote “It rains”. Write the following statements in symbolic form. It rains only if it is cold. A necessary condition for it to be cold is that it rain. A sufficient condition f o r it to be cold is that it rain. Whenever it rains it is cold. (v) It never rains when it is cold. (i) (ii) jiii) (iv) Recall t h at p + q can be read ‘‘p only if q”, “p is sufficient for q” or “ q is necessary f o r p”. (i) q+p iii)p+q (iii)q-p (iv) Now the statement “Whenever it rains i t is cold” is equivalent to “If i t rains then i t is cold”. Th at is, q + p. ( v ) The statement “It never rains when it is cold” is equivalent to “If i t is cold then it does not rain”. Th at is, p + -q. 3.2. Rewrite the following statements without using the conditional. (i) If it is cold, he wears a hat. (ii) If productivity increases, then wages rise. Recall t h a t “If p then q” is equivalent to “Not p or q”. (i) It is not cold o r he wears a hat. (ii) Productivity does not increase or wages rise. 3.3. Determine the truth table of ( p -$ q ) * ( p A 4). F T C,HAP. 31 CONDITIONAL STATEMENTS 3.4. Determine the truth table of - p 3.5. Verify that (p q ) -+ p). -+ ( p v q ) is a tautology. F F 3.6. (q -+ 21 T F T F F F T T Prove that the conditional operation distributes over conjunction: P P Q ?“ T T T T F F F F T T F F T T F F T F T F T F T F -+ ((I A 79 = (P -+ (I) A q“r P+(qfiTj P-+9 T F F F T F F F T F F F T T T T T T F F T T T T (P r ) -+ T t 4 BICONDITIONAL 3.7. Show that “ p implies (I and (I implies 11’’ is logically equivalent to the biconditional “71 if and only if 4”; that is, (21 q ) A ( q .+ p ) = p cs q. + F F I F l 3.8. T F T I 1 T 4 I Determine the truth value of each statement. 2 + 2 = 4 iff 3 + 6 = 9 (ii) 2 2 = 7 if and only if 5 +1= 2 (iii) 1 + 1 = 2 iff 3 + 2 = 8 (iv) 1 2 = 5 if and only if 3 +1= 4 (i) + + Now p e q is true whenever p and q have the same t r u t h value: hence (i) and (ii) a r e true statements. but (iii) and (iv) a r e false, (Observe tha t (ii) is a true statement by definition of the conditional, even though both substatements 2 2 = 7 and 5 1 = 2 a re false.) + + 22 3.9. CONDITIONAL STATEMENTS Show that the biconditional p * q connectives v, A and -. Now = q TI+ P*q can be written in terms of the original three = - p v q and q + p -qv p ; hence by Problem 3.7, (P+Q)A(q+P) (“P‘-‘q)A(-qVP) 3.10. Determine the truth value of (11 3.11. Determine the truth value of ( p t)-4) e ( 4 T T F F T F T F + 4 ) v - ( p H Wq). F T T F F T F T [CHAP. 3 T T F T -+ p). F T F F Method 1 I Step 1 1 4) F I T I T T F T F I T I F F I F ~ T T F TF 4 1 1 ( 2 / 1 3 Method 2 NEGATION 3.12. Verify by truth tables that the negation of the conditional and biconditional are as follows: (i) - ( p 4) = 13 A -q, (ii) - ( p t)4 ) = 11 H - 4 = -p t)q . + ii) P 1 Q T T i : F I T Remark: 1 I P+q F T F T F T Since p + q “(P’4) E 1 -q F T F 1 PA-4 F T F - p v q , we could have used De Morgan’s law t o verify (ii as follows: -(p + 4 ) -(-fJ V 4) 5 - - T I A -4 p A -q C 0 N D IT I 0 N A L STATEMENTS CHAP. 31 3.13. Simplify: (i) -01 (ii) -(-]I (iii) -(-/I (ii -(/I -i -q) = = (ii) - ( - ] I -q), - -q p -q) q) c) - -p * q -p A - H = p tf q = IJ tf (I = -q -11 q ) , (iii) -(-p 23 + -(I). 11 ~ 3.14. Write the negation of each statement as simply as possible. (i) If he studies, he will pass the exam. (ii) He sn-inis if and only if the water is warm. (iii) If it snows, then he does not drive the car. (i) B y Problem 3.12, - ( p + q ) = hence the negation of !ii is -q: 11 H e studies and he will not pass the exam. (ii) By Problein 5.12, - ( p f 011 ow in g : = t)q ) p = -q -p H q ; hence the negation of (ii) is either of the H e swims if and only if the water is not warm. H e does not swim if and only if the water is warm. (iii) Note t h a t -(]I+ -q) =p - -q A = p q. Hence the negation of (iiii is I t sno\vs and he drives the car. 3.15. Write the negation of each statement in as simple a sentence as possible. (i) If it is cold, then he wears a coat but no sweater. (ii) If he studies, then he will go t o college or to art school. (i) Let 11 he “ I t is cold”, q be “He weals a coat” and r be “He n e a r s a sweater”. Then the given Now statement can be written as p + ( q - 7 ) . -[p + ((I -?)I =_ 11 A -(q = A -?) 11 (-q v Y) Hence the negation of (i) is I t is cold and he wears a sweater (ii) The g i ~ e nstatement is of the f o l m 1 1 - ( q v 1.). - [ p + ((I 7) = p A 01 no coat. But - ( q v Y) = 11 -q I -r Thus the negation of (ii) is He studies and he does not go to college 01 t o a r t school. CONDITIONAL STATEMENTS AKD VARIATIONS 3.16. Determine the contrapositive of each statement. (i) (ii) (iii) (iv) If John is a poet, then he is poor. Only if Marc studies will he pass the test. It is necessary to have snow in order f o r Eric t o ski. is not positive. If ,I‘ is less than zero, then (i) The tontraposltive of 11- q is - q + -11. Hence the contrapositive of (i) is If John is not poor, then he is not a poet. (ii) The Riven statement is equivalent to “If Marc passes the test, then he studied”. contrapositive of (ii) is If Marc does not study, then he will not pass the test. Hence the 24 CONDITIONAL S T A T E M E N T S [CHAP. 3 (iiij The given statement i s equivalent to “If Eric skis, then it snowed”. Hence the contrapositive of iiii) is If it did not snow, then Eric will not ski. (iv) The contrapositive of p - -q is - -q + -p = q -p. + Hence the contrapositive of fiv) is If n: is positive, then x is not less than zero. 3.17. Find and simplify: (i) Contrapositive of the contrapositive of p + q. (ii) Contrapositive of the converse of p + q. (iii) Contrapositive of the inverse of p + q. (i, (ii, - - -11 -t -q The contrapositive of p - t q is - q + -p. The contrapositive of - q + - p is p + q , which is the original conditional proposition. The converse of p + q is q + p . The contrapositive of q - t p is -p+ - q , which is the inverse of p - t q. (iii) The inverse of p + q is - p i -q. which is the converse of p-+ q. The contrapositive of - p + -q is - -q + - -p = q + p, In other words, the inverse and converse a r e contrapositives of each other, and the conditional and contrapositive a r e contrapositives of each other! Supplementary Problems STATEMENTS 3.18. Let p denote “He is rich” and let q denote “He is happy”. Write each statement in symbolic form using p and q. (i) (ii) If he is rich then he is unhappy. H e is neither rich nor happy. (iii, It is necessary t o be poor in order to be happy. (iv) To be poor i s to be unhappy. (v) Being rich is a sufficient condition to being happy. (vi) Being rich is a necessary condition t o being happy. (vii) One is never happy when one is rich. (viii) He is poor only if he is happy. (ix) To be rich means the same as to be happy. (x) He is poor or else he is both rich and happy. N o t e . Assume “He is poor” is equivalent t o -p. 3.19. Determine the t r u t h value of each statement. (i) If 5 < 3, then - 3 < -5. (ii) I t is not true t h a t 1 1 = 2 iff 3 + + 4 = 5. (iii) A necessary condition t h a t 1 + 2 = 3 i s t h a t 4 + 4 = 4. +1= 5 (iv) I t is not t r u e t h a t 1 (vi iff 3 + 3 = 1. If 3 < 5 , then - 3 < - 5 . (vi) A sufficient condition t h a t 1 1.2 = 3 is t h a t 4 3.20. + 4 = 4. Determine the t r u t h value of each Statement. +3 = 5 2+1 = 3 + 1 = 2. 5 + 5 = 10. (i) It is not t r u e t h a t if Z T 2 = 4, then 3 or 1 (ii, If 2 1 . 2 = 4, then i t is not t r u e t h a t and (iii) If 2 + 2 = 4, then 3 + 3 = 7 iff 1’1 = 4. CHAP. 31 3.21. Write the negation of each statement in a s simple a sentence a s possible. (i) If stock prices fall, then unemployment rises. (ii) He has blond hair if and only if he has blue eyes. (iii) If Marc is rich, then both Eric and Audrey a r e happy. (iv) Betty smokes Kent o r Salem only if she doesn't smoke Camcls. (v) M a r y speaks Spanish or French if and only if she speaks Italian. (vi) If John reads Sewsweek then he reads neither Life nor Time. TRUTH TABLES 3.22. Find the t r u t h table of each proposition 3.23. 3.25. ii) ( - p v q ) + p , Find the t r u t h table of each proposition (4 -cl) (-11 q ) , (ii) (-q v P ) (i) ( P ++ 3.24. 25 CONDITIONAL S T A T E M E N T S ++ + Find the t r u t h table of each proposition: (p--q)+ (qv-p)], (i) [ p r \ I - q + p ) ] A - Prove: iii (pnq)+v = + (ii, q e ( - q ~ p ) . -PI. in) q e ( r - + - p ) ] v [ ( - q + p ) + + r ] . ( p + r ) v ( q + ~ ) . (iii z ~ + ( q + r = ) ( P A - ? ) + -q. CONDITIONAL XVD VARIATIONS 3.26. Determine the contrapositive of each statement. (i) If he has courage he will win. (ii) It is necessary to be strong in order to be a sailor. (iii) Only if he does not tire will he win. (iv) It is sufficient for i t to be a square in order to be a rectangle. 3.27. Find: ii) Contrapositive of p + -q. (ii) Contrapositive of - p + q. (iii) Contrapositive of the converse of p + -q. iiv) Converse of the contrapositive of - p -+ -q. Answers to Supplementary Problems 3.18. (i) P (ii) - p -q -+ A (iii) q iiv) - p + -q - iv) -p -q q p (vii 4 + 3.19. (i) T, (ii) T, (iii) F , (iv) F, (v) F, (vi) T 3.20. (i) F, (ii, F , (iii) T 3.21. (i) (ii) (iii) (iv) (v) (vi) 3.22. (i) T T F F , (ii) F F F T 3.23. (i) T F T T , iii) F T F T 3.24. (i) F T F F , (ii) T T T F T T F T 3.25. Hint. Construct the appropriate t r u t h tables. 3.26. (i) (ii) (iii) (iv) 3.27. (i) q-f - p , p Stock prices fall and unemployment does not rise. He has blond hair but does not have blue eyes. Marc is rich and E r i c or Audrey is unhappy. Betty smokes K e r t or Salem, and Camels. Mary speaks Spanish or French, but not Italian. John reads Sczcsweek, and Life o r T i m e . If If If If he does not win, then he does not have courage. he is not strong, then he is not a sailor. he tires, then he will not win. it is not a rectangle, then i t is not a square. iii) -q+ p, (iii) - p + q, (iv) p + q (vii) p 4 -q (viii) -11 q + (ix) P (x) -P" 4 (P A 4) ++ Chapter 4 Arguments, Logical Implication ARGUMENTS An aiyzrmerzt is a n assertion that a given set of propositions P I ,P,, . . ., P,I, called premises, yields (has as a consequence) another proposition Q, called the conclz~sion. Such a n argument is denoted by The truth value of an argument is determined as follows: [T,] An argument P I ,P,, . . . ,P,, - Q is true if Q is true whenever all the premises P,, P2, . . . , P I Iare true; otherwise the argument is false. Thus an argument is a statement, i.e. has a truth value. If an argument is true it is called a valid argument; if a n argument is false it is called a f a l l a c y . Example 1.1: The following argument is valid: 11, 11 + q + (Law of Detachment) Q The proof of this rule follows from the following t r u t h table. P T T F F 26 I Q j T F T F P”4 T F T T ARGUMEKTS, LOGICAL IMPLICATION CHAP. 41 I-, 4 T [(P T T T T F F F F T T F F T T F F T F T F T F T F T T T T F F F F Example 1.4: T I T T T F F F F T T T T T F T F ll (n + r)] T T F T F F F F T ’ T F T T F T F T F T T T F T T T F T 4) + 27 (P + T T T T T T T T + T T T T F F F F T F T F T T T T :~ F T F r) T F T F T F T F The following argument is a fallacy: For t h e proposition table below. F F F l(p 2’;q + ~ ; ; q) T T 1 P’q, -P t. - q - p ] .+ - q is not a tautology, as seen in the t r u t h (p+q;A-l’l T T ~ [(p+q)i-pl+-q T F T Equivalently, the argument is a fallacy since, in Case (line) 3 of the t r u t h table, p + q and - p a r e t r u e but - q is false. An argument can also be shown t o be valid by using previous results as illustrated in the next example. Example 1.5: We prove t h a t the argument p + -q, q t -p Statement (1) q is true. ( 2 ) p + - q is true. (3) q - p is true. (4) - p is true. + is valid: Reason (1) (2) (3) (4) Given Given Contrapositive of (2) Law of Detachment (Example 1.11 using (1) and (3) ARGUMENTS AND STATEMENTS We now apply the theory of the preceding section to arguments involving specific statements. We emphasize that the validity of a n argument does not depend upon the truth values nor the content of the statements appearing in the argument, but upon the particular form of the argument. This is illustrated in the following examples. Example 2.1 : Consider the following argument: S,: If a man is a bachelor, he is unhappy. S p : If a man is unhappy, he dies young. ....................................... S : Bachelors die young. Here t h e statement S below the line denotes the conclusion of t h e argument, and the statements S1 and S, above the line denote the premises. We claim t h a t the argument S1,S2 + S is valid. For t h e argument is of t h e form p+q,q+r t. P + T where p is “He is a bachelor”, q is “He is unhappy” and T is “He dies young”; and by Example 1.3 this argument (Law of Syllogism) is valid. [CHAP. 4 ARGUMENTS, LOGICAL IMPLICATION 28 Example 2.2: W e claim t h a t the following argument is not valid: S1: If two sides of a triangle a r e equal, then the opposite angles a re equal. SB: Two sides of a triangle a r e not equal. ...................................................................... S: The opposite angles a r e not equal. F o r t h e argument is of the form p - t q , - p c -q, where p is “Two sides of a triangle a r e equal” and q is “The opposite angles a r e equal”; and by Example 1.4 this argument is a fallacy. Although the conclusion S does follow from S 2 and axioms of Euclidean geometry, the above argument does not constitute such a proof since the argument is a fallacy. Example 2.3: We claim t h a t the following argument is valid: S1:If 5 is a prime number, then 5 does not divide 15. Sy: 5 divides 15. ............................................. S: 5 is not a prime number. F o r t h e argument is of the form p + -q, q t- - p where p is “ 5 is a prime number” and q is “ 5 divides 15”; and we proved this argument is valid in Example 1.5. We remark t ha t although the conclusion here is obviously a false statement, the argument as given is still valid. It is because of the false premise Slthat we can logically arrive a t the false conclusion. Example 2.4: Determine the validity of the following argument: S1:If 7 is less tha n 4, then 7 is not a prime number. Sy: 7 is not less tha n 4. .................................................. S: 7 is a prime number. W e translate the argument into symbolic form. Let p be “7 is less tha n 4” and q be “7 is a prime number”. Then the argument is of the form p-t-q, -P + 4 T T T The argument is a fallacy since in Case (line) + -q and - p a r e t r u e b u t q is false. The f act t h a t the conclusion of the argument happens to be a true statement i s irrelevant to the fa c t t h a t the argument is a fallacy. 4 of the adjacent truth table, p LOGICAL IMPLICATION A proposition P(p, q, . . .) is said to logically imply a proposition Q ( p , q , Q ( p , q, . . .) is true whenever P(p,q, . . . ) is true. Example 3.1: W e claim t h a t p logically implies p v q. For consider the t r u t h tables of p and p v q in the adjacent table. Observe t h a t p is t r u e in Cases (lines) 1 and 2, and in these Cases p v q is also true. In other words, p logically implies PVQ. Now if Q ( p , 4,. . .) is true whenever P ( p ,q, . . .) . . .) if F is true, then the argument P ( P ,Q, . . .) I- Q@, 4, . . . ) is valid; and conversely. Furthermore, the argument P c Q is valid if and only if the conditional statement P + Q is always true, i.e. a tautology. We state this result formally. CHAP. 41 29 ARGUMENTS, LOGICAL IMPLICATION Theorem 4.2: The proposition P(p, q , . . . ) logically implies the proposition & ( p , q, . . .) if and only if (i) the argument P(p,q , or, equivalently, . ..) (ii) the proposition P ( p ,q , . . . I Remark: Q ( p , q, - + . . .) is valid Q ( p , q, . . . ) is a tautology. The reader should be warned that logicians and many texts use the word “implies” in the same sense as we use “logically implies”, and so they distinguish between “implies” and “if . . . then”. These two distinct concepts are, of course, intimately related as seen in the above Theorem 4.2. Solved Problems ARGUMENTS 4.1. Show that the following argument is valid: p - q , 4 t t 11. Method 1. Construct the t r u t h table on the right. Now p e p is true in Cases (lines) 1 and 4, and Q is t r u e in Cases 1 and 3: hence p w q and q a re true simultaneously only in Case 1 where p is also true. Thus the argument p* q , q F p is valid. F Method 2. Construct the t r u t h table of Since [ ( p w 4 ) A 91 --f [ ( p t t q ) A 91 .+ p : p is a tautology, the argument is valid. 4.2. P T T F F Q [ ( P - 4) A T F T F T T F F T F T T T F T F F F F T 1 2 1 3 Step 4.3. Determine the validity of the argument Construct the t r u t h table of [(-]I+ q ) A py - - P T F T T F T F T ’ F T I F T T T I T T T F F 2 1 4 1 F 41 + -p+ q, p t -q. -q: 2 F F T ARGUMENTS, LOGICAL IMPLICATION 30 T F F F T F F T T Since the proposition is a fallacy. [(-p Observe t h at - p + 4.4. T F F F q) + A ?I’ ~i -p + q, p + -q q and p ar e both true in Case (line) 1 but in this Case -q is false. Prove that the following argument is valid: Method 1. T T T F is not a tautology, the argument -q + [CHAP. 4 p + -q, T+ q, r yw. I- Construct the following truth tables: ~ 1 4 ?, I r 1 1 p+-q r+q T T F 1 -p ‘ F F I F ;1; T F T T Now p + -y, 7 + q and r a r e true siinultaneously only in Case (line) 5 , where - p is also true; hence the given argument i s valid. Method 2. Construct the t r u t h table for the proposition [(P+-Q) A (‘)“-’q) A r] + -P I t will be a tautology, and so the given argument is valid. Method 3. Reason Statement (1) p + -y (1) Given ( 2 ) Given is true. ( 2 ) r + q is true. (3) -q + -7- (3) Contrapositive of (2) (4) Law of Syllogism, using (1) and (3) is true. (4) p + - ~ is true. (6) r is true. ( 5 ) Contrapositive of (4) (6) Given (7) Hence - p is true. (7) Law of Detachment, using (5) and (6) (5) r+-p is true. ARGUMENTS AND STATEMENTS 4.5. Test the validity of each argument: (ii) If i t rains, Erik will be sick. (i) If it rains, Erik will be sick. It did not rain. Erik was not sick. . . . ...................... ........ .,.............. * It did not rain. Erik was not sick. F i r s t translate the arguments into symbolic form: (i) P + Y, -11 i- -q (ii) P -+ q, - q I- -P where p is “It rains” and q is “Erik is sick”. By Example 1.4, the argument (i) is a fallacy; by Problem 4.2, the argument (ii) is valid. CHAP. 41 4.6. 31 ARGUMENTS, LOGICAL IMPLICATION Test the validity of the following argument: If 6 is not even, then 5 is not prime. But 6 is even. ................................ Therefore 5 is prime. Translate the argument into symbolic form. Let p be “6 is even” and let q be “5 is prime.” Then the argument is of the form -p-+-q, P t- 4 F Now in the adjacent t r u t h table, -11 + - q and p a r e both t r u e in Case (line) 2; but in this Case q is false. Hence the argument is a fallacy. , T The argument can also be shown to be a fallacy by constructing the truth table of the proposition [ ( - p + - 9 ) ~ p +/ q and observing t h a t the proposition is not a tautology. The f act t h at the conclusion is a true statement does not affect the fa c t t h a t the argument is a fallacy. 4.7. Test the validity of the following argument: If I like mathematics, then I will study. Either I study or I fail. ................................. If I fail, then I do not like mathematics. F i r s t translate the argument into synibolic form. Let p be “ I like mathematics”, q be “I study” and r be “I fail”. Then t h e given argument is of the form p + q , q v r F r+-p To test the validity of the argument, construct the truth tables of the propositions p + q , q v r and ~ + - p : P 9 7, P’4 9 “ r T T T T T F T T T T T F T T F T F T F F F F T T T F F T T T T F -11 F F F F T T T T 1 ?’+ -p F T F T T T T T Recall t h at an argument is valid if the conclusion is true whenever the premises a re true. Now in Case (line) 1 of the above t r u t h table, the premises p + q and q v 7‘ a r e both true but the conclusion 7.+ - p is false; hence the argument is a fallacy. 4.8. Test the validity of the following argument: If I study, then I ~7illnot fail mathematics. If I do not play basketball, then I will study. But I failed mathematics. ........................................ Therefore, I played basketball. 32 ARGCRIENTS, LOGICAL IMPLICATION p T T T T F F F F 1 q ‘ I I I I ’ r l; p+-q -r -r+ P F F T T T T T ~ T T T F T T T F T T F F T -q T T I F T I F F F I T I T F ~ F I T ~ T F T F F F ~ T F F T LOGICAL IMPLICATION 4.9. Show that p 4 (I logically implies p cs 4. Construct the truth table for ( p A q ) + (p Since (IJ q) - F q): Pi‘ p i q F T F ( p ++ q ) is a tautology, p - [CHAP. 4 ‘ (pAq)i(p-q) q logically implies p tf q. 4.10. Show that p tf (r logically implies p + q. Consider the truth tables of p 1’ :I: F and p + q : g I 4 T P I F I P-q T F F T I P-)Q T F T T 4.11. Prove: Let P(p,q, . . .) logically imply Q ( p , q , . . .). Then for any propositions P I ,P,, . . . , P(P,, P,, . . .) logically implies Q(P,,P,, . , ,). By Theorem 4.2, if P ( p , q , . . . ) logically implies Q ( p , q , . . . ) then the proposition P ( p , q , . . . ) -t Q ( p , q , , . .) is a tautology. By the Principle of Substitution (Theorem 2.2), the proposition P ( P , ,P 2 , , . , I Q(Pl,P,, . . .) is also a tautology. Accordingly, P(P,, P,, . . .) logically implies QIP,,P,, . . . ). + CHAP. 41 33 ARGUMEKTS. L O G I C A L IMPLICATION 4.12. Determine the number of nonequivalent propositions P ( p ,q ) which logically imply the proposition 11 ++ q. TI- Consdc,r the adjacent t r u t h tahle of p ++ q . Now P ( p , (I) logically imq IS t r u e plies p t-t (J i f IJ H q is t r u e whenever P ( p , qi is true. But p only in Cases linesi 1 and 4; hence Pi7i, q ) c a n n o t be t r u e in Cases 2 anti 3. There a r e foul, iuch propositions which a l e listed below. F T F F F F T 4.13. Show that pt-t-(1 does not logically imply Method I . Construct the t r u t h tables of 21 2' 1 T T F A I r! ' cf - q -rl1 I q. TI+ and p + q: I p--q l,+q T T T F T F T F T P F T T I F T Supplementary Problems ARGUMENTS 4.14. Test the validity of each argument: l i ) 4.15. Test the validity of each argument, 4.16. Test the validity of each argument: riI q , p t- - q ; -]I+ ti1 11- 9, r ' + -q li+ -4, Y--' (ii) -11- t- Y - p , q t- - r : ARGUMENTS AND STATERIENTS 1.17. Test the validity of the argument: If London is not in Denmark, then Paris is not in France. But Paris is i n France. ....................................................... Therefore, London is in Denmark. 4.18, Test the validity of the argument: If I study, then I will not fail mathematics. I did not study. .......................................... I failed mathematics. -p; - q , q I- p. (ii) p + - q , - r + - q iii) p+ q , YV - q , -r t- p + I- -p. -T. 34 4.19. [CHAP. 4 ARGTJPVIENTS, LOGICAL IMPLICATION Translate into symbolic form and test the validity of the argument: icr) If G is even, then 2 does not divide 7. Either 5 is not prime or 2 divides 7 . But 5 is prime. ........................................ Therefore, 6 is odd (not e v e n ) . ( b ) On iiiy wife’s birthday, I bring her flowers. Either it’s iny wife’s birthday or I work late. I did not bring m y wife flowers today. ............................................ Therefore, today I worked late. If I woi.k, I cannot study. Either I work, o r I pass mathematics. I passed mathematics. (c) ........................................ Therefore, I studied. i d ) If I work, I cannot study. Either I study, o r I pass mathematics. I worked. ........................................ ‘Therefore, I passed mathematics. LOGICAL IXIPLICATION 4.20. Show t h a t rii p A q logically implies p , (ii) p v q does not logically imply p . 4.21. Show t h a t iil q logically implies p - q, (ii) - p logically implies p + q . 4.22. Show t h a t p 4.23. Determine those propositions 4.24. Determine the number of nonequivalent propositions P ( p , q ) which logically imply the proposition p- q , and construct t r u t h tables for such propositions (see Problern 4.12). ( q v r ) logically implies ( p 15 ? q ) v r. hich logically imply (i) a tautology, (ii) a contradiction. Answers to Supplementary Problems 4.14. (I) fallacy, (11) valid 4.15. ti) valid, 4.16. (1) valid, i i i i valid 4.17. valid 4.18. fallacy 4.19. (a) 4.23. I ) E\ e i y pioposition logically implies a tautology. conti adiction. 4.24. Theie a r e eight such propositions (111 fallacy - p -q, - 7 v q, r + - p ; valid. ( b ) p + q , p v I * , - q + r ; valid. ( c ) p + - q , p v i , r F y, fallacy. ( d ) p + - q , q v T , p + 7 , valid. (ii, Only a contradiction logically implies a F F F T T Chapter 5 Set Theory SETS AND ELEMENTS The concept of a set appears in all branches of mathematics. Intuitively, a set is any well-defined list or collection of objects, and will be denoted by capital letters A , B, X , Y , . . , , The objects comprising the set are called its elements or menzbem and will be denoted by lower case letters a,b , x , y , . . . . The statement ‘‘p is an element of A” or, equivalently, “ p belongs to A” is written p EA The negation of p E A is written p 6!A. There are essentially two ways to specify a particular set. One way, if it is possible, is to list its members. For example, A = {a,e, i, 0,u ) denotes the set A whose elements are the letters a,e,i,o,u. Note that the elements are separated by commas and enclosed in braces { 1. The second way is t o state those properties which characterize the elements in the set. For example, B = { x : x i s a n integer, x >0} which reads “ B is the set of x such that x is an integer and x is greater than zero,” denotes the set B whose elements are the positive integers. A letter, usually x, is used to denote a typical member of the set; the colon is read as “such that” and the comma as “and”. Example 1.1: The set B above can also be written as B = {1,2,3, . . .I. Observe t h a t -6 4 B , 3 E B and Example 1.2: P 4 B. The set A above can also be written as A = {Z : x is a letter in the English alphabet, E is a vowel} Observe t h a t b 6Z A , e E A and p fZ A . Example 1.3: Let E = {x : xz-33x f 2 = 0 ) . In other words, E consists of those numbers which a r e solutions of the equation xz- 3% 2 = 0, sometimes called the solution set of the given equation. Since the solutions of the equation a r e 1 and 2, we could also write E = {1,2}. + Two sets A and B are equal, written A = B, if they consist of the same elements, i.e. if each member of A belongs to B and each member of B belongs to A . The negation of A = B is written A # B. Example 1.4: Let E = {x : + 3x 2 = 0 } , F = ( 2 , l} and G = {l,2,2,1,6/3}. Then E = F = G. Observe that a set does not depend on t h e way in which its elements are displayed. A set remains the same if its elements a r e repeated or rearranged. x2- 35 36 [CHAP. 5 S E T THEORY FINITE AND INFINITE SETS Sets can be finite or infinite. X set is finite if it consists of exactly n different elements, where 72 is some positive integer; otherwise i t is infinite. Example 2.1: Let M be the set of the days of the week. In other words, M = {Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday] Then M is finite. Example 2.2: Let Y = { 2 , 4, 6, 8 , Example 2.3: Let P = {x : s is a river on the earth}. Although it may be difficult to count the number of rivers on the earth, P is a finite set. . , ,}. Then Y is infinite. SUBSETS A set A is a subset of a set E or, equivalently, B is a superset of A , written A C E or B > A iff each element in A also belongs t o E ; that is, x E A implies x E B. We also say that A is contailzed in B or B contains A . The negation of A c B is written A $! E or B $ A and states that there is a n x E A such that x 6! E . Example 3.1: Consider the sets A = (1,3, 5, 7, C = {.c : 2 . . .}, is prime, T . B = { 5 , 10, 15, 20, . . .I > 2 } = { 3 , 5 , 7 , 1 1 , ...} Then C c A since every prime number greater than 2 is odd. On the other hand, B @ A since 10 E B but 10 @ A . Example 3.2: Let N denote the set of positive integers, 2 denote the set of integers, Q denote the set of rational numbers and R denote the set of real numbers. Then N c Z c Q c R Example 3.3: I The set E = { 2 , 4 , 6 ] is a subset of the set F = {6,2,4}, since each number 2, 4 and 6 belonging to E also belongs to F . In fact, E = 8’. I n a similar manner it can be shown t ha t every set is a subset of itself. As noted in the preceding example, A C B does not exclude the possibility that A = B. In fact, we may restate the definition of equality of sets as follows: Definition: I Two sets A and E are equal if A c B and E c A . In the case that A c B but A = E , we say that A is a proper subset of B o r B contains A properly. The reader should be warned that some authors use the symbol C for a subset and the symbol c only for a proper subset. The following theorem is a consequence of the preceding definitions: Theorem 5.1: Let A , B and C be sets. Then: (i) A c A ; (ii) if A C B and B A = B ; and (iii) if A C E and B C C, then A C C. C A , then UNIVERSAL AND NULL SETS In any application of the theory of sets, all sets under investigation are regarded as subsets of a fixed set. We call this set the universal set or universe of discourse and denote it (in this chapter) by C . Example 1.1: I n plane geometry, the universal set consists of all the points in the plane. Example 4.2: I n human population studies, the universal set consists of all the people in the world. CHAP. 51 37 SET THEORY It is also convenient to introduce the concept of the empty or null set, that is, a set which contains no elements. This set, denoted by 8, is considered finite and a subset of every other set. Thus, f o r any set A , $8 c A C U. Then A is empty, i.e. A = @. Example 4.3: Let A = {x : x’ = 4, z is odd}. Example 4.4: Let B be the set of people in the world who a r e older than 200 years. According to known statistics, B is the null set. CLASS, COLLECTION, FAMILY Frequently, the members of a set are sets themselves. For example, each line in a set of lines is a set of points. To help clarify these situations, other words, such as “class”, “collection” and “family” are used. Usually we use class o r collection for a set of sets, and family f o r a set of classes. The words subclass, subcollection and subfamily have meanings analogous to subset. Example 5.1: The members of the class {{2,3}, {2}, {5,6}} Example 5.2: Consider any set A . The power set of A , denoted by ‘?‘(A) or Z A , is the class of all subsets of A . In particular, if A = {a,b , c}, then a r e the sets {2,3}, {2} and { 5 , S}. T(A) = { A , {a,b } , {a,c>, ( 6 , c>, { a } , Cb), {c>, @I In general, if A is finite and has n elements, then “ ( A ) will have 2n elements. SET OPERATIONS The union of two sets A and B, denoted by A U B, is the set of all elements which belong to A o r to B: A U B = { x : x E A or x E B } Here “or” is used in the sense of andlor. The intersection of two sets A and B, denoted by A n B, is the set of elements which belong to both A and B: A n B = { x : x E A and x E B } If A n B = 0,that is, if A and B do not have any elements in common, then A and B are said to be disjoint or non-intersecting. The relative co?nplernent of a set 5’ with respect to a set A or, simply, the difference of A and B, denoted by A\B, is the set of elements which belong to A but which do not belong to B: A\B = { n : : x E A , x4B} Observe that A \B and B are disjoint, i.e. (A\B) n B = $3. The absolute complement or, simply, complement of a set A , denoted by Ac, is the set of elements which do not belong t o A : A“ = { x : x E U , x 4 A ) That is, Ac is the difference of the universal set U and A . Example 6.1 : The following diagrams, called Venn diagrams, illustrate the above set operations. Here sets are represented by simple plane areas and U , the universal set, by the area in the entire rectangle. SET THEORY 38 A U A n B is shaded. B is shaded. A\B Example 6.2: [CHAP. 5 A c is shaded. is shaded. Let A = {1,2, 3 , 4 ) and B = {3,4,5, 6} where ’7, = {1,2,3, . . .}. A U A\B Then: A n B = (3, 4) B = (1, 2, 3, 4, 5, 6} Ac = (5, 6 , 7, . ..} = {1,2} Sets under the above operations satisfy various laws o r identities which are listed in Table 5.1 below. In fact we state: Theorem 5.2: Sets satisfy the laws in Table 5.1. LAWS OF THE $LGEBRA OF SETS la. A U A = A 2a. (AuB)uC = Au(BUC) 3a. A U B = B u A 4a. Idempotent Laws lb. AnA = A Associative Laws 2b. (AnB)nC = An(BnC) Commutative Laws 3b. A n B = B n A Distributive Laws Au(BnC) = (AuB)n(AuC) 4b. A n ( B u C ) = ( A n B ) u ( A n C ) 5a. A u @ = A Identity Laws 5b. A n U = A 6a. AUU = U 6b. A n @ = @ 7a. AuAc = U 8a. (Ac)c = A 9a. ( A u B ) ~= AcnBc Complement Laws 7b. A n A c = @ 8b. Uc = 13, @c =U De Morgan’s Laws 9b. ( A nB)c = A c u B c Table 5.1 Remark: Each of the above laws follows from the analogous logical law in Table 2.1, Page 11. For example, A n B = { x : x E A and z E B } = { x : x E B a n d x E A } = B n A CHAP. 51 39 SET THEORY Here me use the fact that if 11 is I E A and q is x E B, then p equivalent to q ~ p :P A C != q ~ p . A q is logically Lastly we state the relationship between set inclusion and the above set operations: Theorem 5.3: Each of the following conditions is equivalent t o A c B: (i) A n B = A (iii) Bc c Ac (v) B (ii) A U B = B u A‘ = Lr (iv) A n Br = $3 ARGUMENTS AND VENN DIAGRAMS Many verbal statements can be translated into equivalent statements about sets which can be described by Venn diagrams. Hence Venn diagrams are very often used to determine the validity of an argument. Example 7.1 : Consider t h e following argument: S1: Babies a re illogical. S p : Nobody is despised who can manage a crocodile. S,: Illogical people a r e despised. ................................................ S : Babies cannot manage crocodiles. (The above argument is adapted from Lewis Carroll, Sumbolic L o g i c ; he is also the the set of babies is a subset of the author of Alice in WoizclerLa?zcZ.) Now by S1, set of illogical people: babies By S,, t h e set of illogical people is contained in the set of despised people: Furthermore, by S 2 , the set of despised people and the set of people who can manage a crocodile a r e disjoint: despised people people who can manage crocodiles illogical people But by t h e above Venn diagram, the set of babies is disjoint from the set of people who can manage crocodiles, o r “Babies cannot manage crocodiles” is a consequence of S,, S , and S3. Thus the above argument, is valid. s,, s,, s 3 + s [CHAP. 5 S E T THEORY 40 Solved Problems SETS, ELEMENTS 5.1. Let A = ( x :3x = 6 ) . Does A = 2 ? A is the set which consists of the single element 2, that is, A = {2}. The number 2 belongs t o A ; it does not equal A . There is a basic difference between a n element p and the singleton set { p } . 5.2. Which of these sets are equal: {T, t , s } , {s,t, Y,S}, { t ,s, t , r } , {s,r,s, t}? They are all equal. Order and repetition do not change a set. 5.3. M'hich of the following sets are finite? (iv) {x : x is an even number) (i) The months of the year. (ii) (1, 2, 3, . . ., 99, loo} (v) (1,2, 3, * . . > (iii) The number of people living on the earth. The first three sets a r e finite; the last two sets a r e infinite. 5.4. Determine which of the following sets are equal: 8, (01,{g}. set 5.5. Each is different from the other. The set ( 0 ) contains one element, the number zero. The is the empty set. The set (0)also contains one element, the null set. 0 contains no elements; it Determine whether or not each set is the null set: (i) X = { x :x2 = 9, 2x = 4}, (ii) Y = {x:x # x}, (iii) 2 = { x :x + 8 = 8 ) . (i) There is no number which satisfies both x2 = 9 and 2 2 = 4; hence X is empty, i.e. X = 0. (ii) We assume t h a t any object is itself, so Y is also empty. In fact, some texts define the null set as follows: $3 E {x:x#x) + (iii) The number zero satisfies z 8 = 8; hence 2 = (0). since i t contains 0. That is, Z # @ . Accordingly, 2 is not the empty set SUBSETS 5.6. Prove that A = {2,3,4,5} is not a subset of B = ( x :x is even}. It is necessary to show that at least one element in A does not belong to B . Now 3 E A and, since B consists of even numbers, 3 6? B ; hence A is not a subset of B . 5.7. Prove Theorem 5.l(iii): If A C B and B c C , then A c C. x 5.8. We must show t h a t each element in A also belongs to C. Let x E A . Now A c B implies c C; hence x E C. We have shown t h a t x E A implies x E C, t h a t is, that A c C. E B. But B Find the power set "(8) of the set S = { 1,2,3} . The power set T(S) of S is the class of all subsets of S; these a r e {1,2,3), {1,2}, {1,3), {2,3}, {l}, {2}, (3) and the empty set 0. Hence T(S) = { S , C1,3}, {2,3}, {1,2}, C1), {2>, { 3 ) , Note t h a t there a r e 23 = 8 subsets of S. CHAP. 51 5.9. 41 SET T H E O R Y Let V = ( d ) , W = { c , d } , X = ( a , b , c ) , Y = { a , b } and 2 = ( a , b , d } . Determine whether each statement is true or false: (i) Y c X , (ii) W + Z , (iii) Z 3 17, (iv) V c X , (v) X = W , ( ~ 7 i ) W c Y . (i) Since each element in Y is a member of X , Y c X is true. (ii) Now a E 2 but (-1 4 W ; hence W # 2 is true. (iii) The only element in V i s d and it also belongs to 2 ; hence Z > V is true. (iv) V is not a subset of X since d E V but d 4 X ; hence V C X is false. (v) Now u E X but a 4 W ; hence X = W is false. (vi) W is not a subset of Y since c E 68 but c 4 Y ; hence W C Y is false. 5.10. Prove: If A is a subset of the empty set @, then A = @ . 8 is a subset of every set; in particular, @ C A . But, by hypothesis, A c The null set hence A = 0. SET OPERATIONS 5.11. Let U = ( l , 2 , . . . , 8,9}, A = { l , 2 , 3 , 4 } , B = [2,4,6,8} (i) Ac, (ii) A n C, (iii) ( A n C ) [ , (iv) A u B , (v) B\C. (i) and C={3,4,5,6}. $3; Find: Ac consists of the elements in C t h at a re not in A ; hence Ac = {5,6, 7,8,9}. (ii) A n C consists of the elements in both A and C; hence A n C = { 3 , 4 } . (iii) ( A n C ) c consists of the elements in C tha t a r e not in A n C . so (AnCC)c= {1,2,5,6,7,8,9}. Now by iii), A n C = { 3 , 4 ) and (iv) A U B consists of the elements in A or B ( o r both): hence A LB = { 1 , 2 , 3 , 4 , 6 ,8}. (v) B\C consists of the elements in B which a r e not in C ; hence B\C 5.12. In each Venn diagram below, shade: (i) = {S,S}. [i) A U B, (ii) A n B. A L E consists of those elements which belong to A or B (or both); hence shade the a re a in A and in B as follows: A U B is shaded. (ii) A n B consists of the ar ea t h a t is common to both A and B . To compute A n B, first shade A with strokes slanting upward t o the right (////) and then shade B with strokes slanting downward t o the r i g h t (\\\\), as follows: Then A n E consists of t h e cross-hatched area which is shaded below: 42 SET THEORY [CHAP. 5 A n B is shaded. Observe the following: ( a ) A n E is empty if A and E a r e disjoint. ( b ) A r E = B if B c A . i c ) A , - I B = A if A C E . 5.13. In the Venn diagram below, shade: (i) B", (ii) ( A C B ) < ,(iii) (B\A)c, (il (iv) A' n B c . Bc consists of the elements which do not belong to B : hence shade the a r e a outside B a s follows: EC is shaded. (ii) F i r s t shade A U B ; then iA1-Bj' is the a r e a outside A I J B : A U B is shaded. (iii, F i r s t shade B\A, ( AU E)' is shaded. the area in B which does not lie in A ; then (B\A)c is the area outside E\A. B \A is shaded. ( E \ A)c is shaded. (ivr F i r s t shade Ac, the area outside of A , with strokes slanting upward to the right ( / / I / ) , and then Ac0iBc is the then shade Bc with strokes slanting downward t o the right (\\\\I; cross-hatched area: S E T THEORY CHAP. 51 A c and Bc a r e shaded. 43 AcnBc is shaded. Observe t h at ( A LIB)"= ACnBc, a s expected by De Morgan's law. 5.14. I n the Venn diagram below, shade (i) (i) A n ( B U C), (ii) ( A n B ) U ( An C). F i r st shade A with upward slanted strokes, and then shade B U C with downward slanted strokes; now A n (BU C) is the cross-hatched area: A and BUC a r e shaded. A n ( B u C ) is shaded. (ii) F i r st shade A nB with upward slanted strokes, and then shade A n C with downward slanted strokes; now ( An B ) U ( A n C) is the total a re a shaded: A n B and A 17 C are shaded. ( An B)U ( An C) is shaded. Notice t h a t A n ( B uC ) = ( A n B ) u (A n C), a s expected by the distributive law. B\A = B n A". Thus the set operation of difference can be written in terms of the operations of intersection and complementation. 5.15. Prove: B\A = {x: x E B , .,;@A) = {x: x E B , x E A C } = B n A c 44 SET THEORY [CHAP. 5 A n ( B U C) = ( A n B ) U ( An C). 5.16. Prove the Distributive Law: An(BuC) 17 = =x = = {x : x € A ; x E B u C } {x: x E A ; x € B or x E C } { x : % € A ,x E B ; o r x E A , x E C } { r e : x E A n B or a E A n C } (AnB)u(AnC) Observe t h a t in t h e third step above we used the analogous logical la w P 5.17. Prove: ( A \B) 4 (Q "4 = (P A 4) " (P A 4 n B = @. (A\B)nB = = { x : xEA\B, xEB} { x : x E A , x 4 B ;z E B } =!a The last step follows from the fa c t t h a t there is no element 2 satisfying z E B and z 48. 5.18. Prove De Morgan's Law: ( A U B ) c= A c n B c . ( A u B ) ~= {x: x 4 A u B } = {x: x4A, x4B} = {x: x E A c , x E B C } = AcnBc Observe t h at in the second step above we used the analogous logical law "(p V 4) 5.19. Prove: For any sets A and B, A n B C A -p C A -q A UB. Let z E A n B ; then x E A and x E B. I n particular, x E A . Since x E A n B implies x € A , A n B c A. Furthermore, if z E A , then x E A o r x E B , i.e. 3: E A u B . Hence A c AUB. In other words, A n B c A c A u B . 5.20. Prove Theorem 5.3(i): A c B if and onlyif A n B = A . Suppose A c B . Let x E A ; then by hypothesis, x E B. Hence x € A and x E B , i.e. x E A n B. Accordingly, A c A nB. On the other hand, i t is always true (Problem 5.19) t h a t A n B C A . Thus A n B = A . Now suppose t h a t A n B = A . Then in particular, A C A nB. A n B c B. Thus A c A n B c B and so, by Theorem 5.1, A c B. But i t is always true t h a t ARGUMENTS AND VENN DIAGRAMS 5.21. Show that the following argument is not valid by constructing a Venn diagram in which the premises hold but the conclusion does not hold: Some students are lazy. All males are lazy. ..................... Some students are males. Consider the following Venn diagram: 43 SET THEORY CHAP. 51 lazy people students males Notice t h a t both premises hold, but the conclusion does not hold. For a n argument to be valid, the conclusion must always be t r u e whenever the premises a r e true. Since the diagram represents a case in which the conclusion is false, even though the premises a r e true, the argument is false. It is possible to construct a Venn diagram in which the premises and conclusion hold, such as lazy people students males 5.22. Show that the following argument is not valid: All students are lazy. Nobody who is wealthy is a student. ............................... Lazy people are not wealthy. Consider the following Venn diagram: wealthy people Now the premises hold in the above diagram, but the conclusion does not hold; hence the argument is not valid. 5.23. For the following set of premises, find a conclusion such that the argument is valid: S,: All lawyers are wealthy. S,: Poets are temperamental. S,: No temperamental person is wealthy. .................................... S: By S,, the set of lawyers is a subset of the set of wealthy people: and by S3, the set of wealthy people and the set of temperamental people a r e disjoint. Thus temperamental people 46 CHAP. 5 S E T THEORY By S y , the set of poets is a subset of the set of temperamental people; hence Thus the statement “No poet is a lawyer’’ o r equivalently “No lawyer is a poet” is a valid conclusion. The statements “NO poet is wealthy” and “No lawyer is temperamental” conclusions which do not make use of all the premises. are also valid MISCELLANEOUS PROBLEMS 5.24. Let A = (2, (4,5}, 4). Which statements are incorrect and why? (i) ( 4 , 5 ) C A , (ii) {4,5} E A , (iii) ((4,5}} C A . The elements of A a r e 2, 1 and the set {4,5}. Therefore (iii is correct, but (i) is a n incorrect statement. Furthermore, (iii) is also a correct statement since the set consisting of the single element {4, 5 ) which belongs to A is a subset of A . 5.25. Let A = (2, {4,5}, 4}. Which statements are incorrect and why? (il 5 E A , (ii) (5) E A , jiii) {5) c A. Each statement is incorrect. The elements of A a re 2 , 4 and the set {4,5}; hence (ii and (ii) a r e incorrect. There a r e eight subsets of A and (5) is not one of them; so (iii) is also incorrect. 5.26. Find the power set T(S) of the set S = (3, { 1 , 4 ) } . Note first t h a t S contains two elements, 3 and the set {1,4). Therefore T(S) contains 22 = 3 elements: S itself, the empty set (3, and the two singleton sets which contain the elements 3 and { 1 , 4 } respectively, i.e. {3} and {{1,4}}. I n other words, Tu(S) = {S,{3}, {{L4}}> c)} Supplementary Problems SETS, SUBSETS 5.27. Let A = { 1 , 2,..., 8,9}, B = {2,4,6,8}, C = {1,3,5,7,9], D = {3,4,5} and E = { 3 , 5 } . Which sets can equal X if we ar e given the following information? C c but # A. (i) X and E a r e disjoint. (ii) X c D but X # B. (iii) X C A but x’ C. (iv) x x 5.28. S t a te whether each statement is t rue or false: (ii Every subset of a finite set is finite. rii) Every subset of an infinite set is infinite. 5.29. Find the power set “ ( A ) of A = (1,2,3,4} and the power set T(i3) of B = {l,{ 2 , 3 ) , 4 ) . CHAP. 51 5.30. 47 State whether each set is finite or infinite: (i) (ii) (iii) (iv) (v) (vi) 5.31. S E T THEORY The The The The The The set set set set set set of of of of of of lines parallel to the 7 axis. letters in the English alphabet. numbers which a r e multiples of 5. animals living on the earth. numbers which are solutions of the equation x 2 i circles through the origin (0,O). + 2 6 ~ -1 17x11 ~ + 7x3 - 10 = 0. State whether each statement is true o r false: (i) {1,4,3] = { 3 , 4 , 1 ] (ii) {1,3,1,2, 3 , 2 ) c {I,2,3} (iii) { 1 , 2 } = { 2 , 1 , 1 , 2 , 1 ] (iv) C4) E CC4)) (v) {4} c CC4)) (vi) 8 c CC43) SET OPERATIONS 5.32. Let U = { u , b , c , d , e , f , g } , A = { a , b , c , d , e ) , B = { a , c , e , g } and C = { b , e , f , g } . Find: ( i ] AUC (iii) C \ B (v) C c n A ivii) ( A \ B c ) c iiir B n A (vi) ( A \ C P iviii) ( An A c ) c (iv) B C ~ C (i) W\V (ii) V c U W (iiii V n W c (iv) Vc\Wc. 5.33. I n the Venn diagrams below, shade 5.34. Draw a Venn diagram of three non-empty sets A , B and C so t h a t A , B and C have the following properties: (iii) A c C, A # C, B n C = $3 ( i ) A c B , C c B , AnC = 8 (iv) A c ( B n C ) , B c C , C I B , A Z C (ii, A C E , CQB, AnC # 0 5.35. The formula A \ B = A nBc defines the difference operation in terms of the operations of intersection and complement. Find a formula that defines the union of two sets, A U B , in terms of the operations of intersection and complement. 5.36. Prove Theorem 5.3iiil: A c B if and only if A U B = B. 5.37. Prove: If A T E = (3, then A c B c . 5.38. Prove: Ac \ Bc = b' \A. 5.39. Prove: A c B implies A u ( B \A) 5.40. Prove: An(B\C) = ( A n B )\(A?C). Give an example to show t h a t A ( B \ C) f ( AU B ) \ ( AJ C ) . (ii) = B. (i) -ARGUMENTS AND VENS DIAGRAMS Determine the validity of each argument for each proposed conclusion. 5.41. No college professor is wealthy. Some poets a r e wealthy. (i) Some poets a r e college professors. (ii) Some poets a r e not college professors. SET THEORY 48 5.42. Determine the validity of each a r g ~ i m e n tf o r each proposed conclusion. All poets a r e interesting people. Audrey is a n interesting person. 5.43. iiI Audrey is a poet. iiii Audrey is not a poet. Determine the validity of the argument f o r each proposed conclusion. All poets a r e poor. I n order to be a teacher, one must graduate from college. No college graduate is poor. (i) Teachers a r e not poor. (ii) Poets a r e not teachers. (iii) If Marc is a college graduate, then he is not a poet. 5.44. Determine the validity of the argument f o r each proposed conclusion. A11 mathematicians a r e interesting people. Some teachers sell insurance. Only uninteresting people become insurance salesmen. Insurance salesnien a r e not mathematicians. Some interesting people a r e not teachers. Some teachers a r e not interesting people. Some mathematicians a r e teachers. Some teachers a r e not mathematicians. If Eric is a mathematician, then he does not sell insurance. Answers to Supplementary Problems 5.27. (i) C and E , (ii) D and E , [iii) A , E and D, (iv) None 5.28. (i) T, (ii, F 5.30. (i) infinite, (ii) finite, (iii) infinite, (iv) finite, (v) finite, ( v i ) infinite 5.31. (i) T, (ii) T, (iii) T, (iv) T, (v) F, (vi) T 5.32. AUC = U (vj (ii) B n A = { a , c , e } (vi) (i) (iii) C \ B = { b , f} (iv) BCUC = { b , d , e , f , g } 5.33. (a) CcnA = {a,c,d: = CL ( A \ C). = { b , e , f,d (vii) ( A \Bc)c = { b , cI, t’, gl (viii) (AnAc)c = U [CHAP. 5 CHAP. 51 49 SET THEORY W\V Observe t h at V cu W = U and 5.34. (i) @ VnW C = 0 in case ( b ) where V C VV. (iii) @@ (ii) 5.35. A u B = (AcnBc)C 5.40. (ii) A L ( E \ C) is shaded. ( A u B ) \ ( A u C) is shaded. 5.41. (i) fallacy, (iii valid 5.42. (i) fallacy, (iij fallacy 5.43. (i) valid, (ii, valid, iiii) valid 5.44. (i) valid, (ii) fallacy, riii) valid, (iv) fallacy, ivi valid, (vi) valid The following Venn diagrams show why (ii) and (iv) a r e fallacies: interesting insurance 0 mathematicians u teachers Chapter 6 Product Sets ORDERED PAIRS An ordeyed pair consists of two elements, say a and 0 , in which one of them, say a, is designated as the first element and the other as the second element. Such an ordered pair is written (a, b ) Two ordered pairs (a, b ) and (c, d ) are equal if and only if a = c and b = d. Example 1.1: The ordered pairs ( 2 , 3 ) and ( 3 , 2 ) a r e different. Example 1.2: The points in the Cartesian plane in Fig. 6-1 below represent ordered pairs of real numbers. Example 1.3: The set { 2 , 3 } is not a n ordered pair since the elements 2 and 3 a re not distinguished. Example 1.4: Ordered pairs can have the same first and second elements, such as (1,l), (4,4)and (5,5). Remark: An ordered pair ( u , b ) can be defined rigorously as follows: (a, b ) = {@It @,b)} the key here being that {a} c { a , b J which we use to distinguish a as the first element. From this definition, the fundamental property of ordered pairs can be proven: (a,b ) = (c, d ) if and only if u = c and b = d PRODUCT SETS Let A and B be two sets. The iwoduct set (or Cartesian product) of A and B, written A X B, consists of all ordered pairs (a, b ) where a E A and b E B: A x B = ( ( a , b ): a E A, b E B } The product of a set with itself, say A X A , is sometimes denoted by A2. Example 2.1: The reader is familiar with the Cartesian plane R2 = R X R (Fig. 6-1 below). Here each point P represents a n ordered pair ( u , b ) of real numbers and vice versa; the vertical line through P meets the x axis a t a , and the horizontal line through P meets t h e 7~ axis at b. B P b a t -1 J:: 1 2 Fig. 6-2 Fig. 6-1 50 8 A CHAP. 61 PRODUCT SETS 51 Since A and B do not contain many elements, it is possible to represent A X B by a coordinate diagram a s shown i n Fig. 6-2 above. Here the vertical lines through the points of A and the horizontal lines through the points of B meet in 6 points which represent A X B in the obvious way. The point P is the ordered pair (2, b ) . In general, if a finite set A has s elements and a finite set B has t elements, then A x B has s times t elements. If either A or B is empty, then A X B is empty. Lastly, if either A or B is infinite, and the other is not empty, then A x B is also infinite. PRODUCT SETS I N GENERAL The concept of a product set can be extended to more than two sets in a natural way. The Cartesian product of sets A , B, C, denoted by A X B X C, consists of all ordered triplets (a,b, c ) where u E A, 2, E B and c E C: A x B x C = ((ct,b,c): u E A , b E B , c E C ] Analogously, the Cartesian product of n sets A,, A,, . . . , A n , denoted by A, X A, consists of all ordered n-tuples (a,, a,, . . ., uil) where a, E A,, . . . , anE An: A , x A, x X - X An, . . ., an E A n } Here an ordered ?i-tuple has the obvious intuitive meaning, that is, it consists of n elements, * x An = {(cL,,. . .,an) : a, E A,, not necessarily distinct, in which one of them is designated as the first element, another as the second element, etc. Furthermore, (a,, Example 3.1 : . . .,an) = ( b l , . . ., b J a, = b,, . . ., ail= biz iff In three dimensional Euclidean geometry each point represents a n ordered triplet: its x-component, its y-component and its z-component. TRUTH SETS OF PROPOSITIONS Recall that any proposition P containing, say, three variables p, value to each of the eight cases below: P I q l T I T T T T F T F F T F T F F F F r T F T F T F T F (r and r , assigns a truth 52 [CHAP. 6 PRODUCT SETS 1- The truth set of a proposition P, written S ( P ) , consists of those elements of U for which the proposition P is true. Example 1.1: Following is the truth table of ( p + q ) A (q p+q T T T T T (P+q)A\(Q+r) T F T T T F T T T F F F T F T T T Accordingly, the truth set of ( p -+ q ) T ( ( p+ q) A ( q + Y)) A T): q+r T T F F F + ( q + T ) is = {TTT, FTT, FFT, FFF} The next theorem shows the intimate relationship between the set operations and the logical connectives. Theorem 6.1: Let P and Q be propositions. Then: (i) S(P + Q ) = 7 ( P )n S ( Q ) (ii) T ( P \ Q) = T ( P )U T ( Q ) (iii) T ( - P ) = (T(P))" (iv) P logically implies Q if and only if S(P)c 7 ( Q ) The proof of this theorem follows directly from the definitions of the logical connectives and the set operations. Solved Problems ORDERED PAIRS 6.1. Let W = {John, Jim, Tom) and let V = {Betty, Mary}. Find W X V . TV x V consists of all ordered pairs (a,b ) where a E IT' and b E V . Hence, W 6.2. X = V {(John, Betty), (John, Mary), (Jim, Betty), (Jim, ilIaryi, (Tom, Betty), (Tom, Rlary)} (x+ y, 1) = (3, n.- y). Find r and y. ( x - u,1) = (3, x - y) then, by the fundamental Suppose If .r+r/=3 and property of ordered pairs, l = x - y The solution of these simultaneous equations is giverrby .2: = 2, y = 1. CHAP. 61 6.3. 53 PRODUCT SETS Let A = { a , b , c , d , e , f } and B = (a, e , i , o , u } . Determine the ordered pairs corresponding to the points PI,P,, P , and P, which appear in the coordinate diagram of A X B on the right. The vertical line through P I crosses the A axis at b and the horizontal line through PI crosses t h e B axis a t i; hence P I corresponds to the ordered pair ( b , i). Similarly, P , = ( a ,a), P , = (d, u) and P, = ( e , e ) . B 0 i a PRODUCT SETS 6.1. Let A = {1,2,3}, B = {2,4} and C = {3,4,5}. Find AX B A convenient method of finding A X B >; X A C. C is through the so-called “tree diagram” shown below: The “tree” is constructed from the left to the right. listed to the r i g h t of the “tree”. 6.5. a b c d e f A X B X C consists of the ordered triples Let A = {a,b ) , B = {2,3) and C = {3,4}. Find: (i) A x ( B u C), (ii) ( A x B ) u ( A x C ) , (iii) A x ( B n C), (iv) ( A X B ) fl ( A X C). (i) F i r st compute B u C = {2,3,4}. A (ii) F i r st find A X X (BUG‘) B and A X Then = { ( a , 21, (a,3), (a,4),( b , 2 ) , ( b , 3), ( b , 4): C: A xB A xC = { ( a , 2), (a,3), ( b , 2 ) , ( b , 3 ) ) = { ( a , 31, (a,4),( b , 3), ( b , 4)) Then compute the union of the two sets: ( A x B ) u ( A x C) = { ( a , 21, (a,3), ( b , 2), ( b , 31, ( a , 41, ( b , 4)) Observe, from (i) and (ii), t h a t Ax(BuC) = (AXB)U(AXC) (iii) Fi r st compute B r C = {3}. Then A x ( Br ,C ) = { ( a ,31, ( b , 3)) 54 PRODUCT S E T S [CHAP. 6 (iv) Now 4 X B and A X C were computed above. The intersection of A X B and A X C consists of those ordered pairs which belong t o both sets: ( A x B ) n ( A x C ) = { ( a , 31, ( h3)) Observe from (iii) and (iv) that Ax(BnC)= (AxB)n(AxC) 6.6. Prove: A x ( B n C ) = ( A x B ) n ( A x C). A x (BnC) = {(x,y) : x € A , y E = {(x,y) : x E A , = = 6.7. y BnC} E B , y E C} {(x,Y) : (x,4 E A x B , ( A x B ) n ( A x C) (2, Y) E A Let S = { a , b } , W = { l , 2 , 3 , 4 , 5 , 6 } and V = { 3 , 5 , 7 , 9 } . x c> Find ( S x W ) n ( S X V ) . The product set (SX W ) n ( S K V ) can be found by first computing S X W and S X V , and then computing the intersection of these sets. On the other hand, by the preceding problem, ( S X W ) n ( S X V ) = S X ( W n V ) . NOW W n V = {3,5}, and so (S x W ) n 6.8. Prove: Let A C (Sx V ) = S x ( W n V ) = { ( a ,31, ( a , 5 ) , ( b , 3), ( b , 5)) B and C C D; then ( A X C) C ( B X D). Let (x,y) be a n y arbitrary element in A X C; then x € A and y EC. By hypothesis, A c B and C c D ; hence x E B and y E D. Accordingly, (x,y) belongs t o B X D. W e have shown t h a t ( x , y ) € A X C implies (z,y) E B X D ; hence ( A X C) C ( B X D). TRUTH SETS O F PROPOSITIONS 6.9. Find the truth set of p A -q. First construct the truth table of P A -q: F Note t h a t p A -q is true only in the case t h a t p is true and q is false; hence PA -q) = {TF) 6.10. Find the truth set of - p + q. First construct the truth table of - p + q: F Now - p + q is true in the first three cases; hence T(-p + q) = {TT, T F , F T ] PRODUCT SETS CHAP. 61 P rl 1 1 ?’ P”4 T T T T F F F F T T F F T T F F T F T F T F T F T T 6.12. Suppose the proposition P = P ( p ,q, T ( P )of the proposition P. , , 55 I ( P V d A Y T T F T T F T T F F T F F F . ) is a tautology. Determine the t r u t h set If P is a tautology, then P is true for any t r u t h values of its variables. Hence t h e t r u t h set of P i s the universal set: T ( P ) = U . 6.13. Suppose the proposition P = P ( p , g , , . . ) is a contradiction. set T ( P ) of the proposition P. Determine the t r u t h If P is a contradiction, then there is no case in which P is true, i.e. P is false f o r a n y t r u t h values of its variables. Hence t h e t r u t h set of P is empty: T ( P ) = $3, 6.14. Let P = P ( p , q , . , .) and Q = Q ( p , q , . . , ) be propositions such t h a t P A Q is a contradiction. Show t h a t t h e t r u t h sets T ( P ) and T ( Q )a r e disjoint. If P A Q is a contradiction, then its t r u t h set is empty: T ( P A Q) = 0. Hence, by Theorem 6.1(i), T ( P ) n 7 ( Q ) = T ( P A Q ) = $3 6.15. Suppose t h a t the proposition P = P(p,g, . . .) logically implies the proposition Q = Q ( p , g, . . .). Show t h a t the t r u t h sets T ( P ) and T ( - Q ) a r e disjoint. Since P logically implies Q, 7 ( P ) is a subset of T ( Q ) . B u t by Theorem 5.3, 7 ( P )c T ( Q ) is equivalent to T ( P )n ( 7 ( Q ) ) c = @ Furthermore, by Theorem 6.l(iii), ( T ( Q ) ) c = 7 ( - Q ) .Hence 7 ( P )n T ( - Q ) = @ 56 ‘CHAP. 6 PRODUCT S E T S Supplementary Problems ORDERED P-IIIRS, PRODUCT SETS (u-2, 2 x t 1) = :m 2 Find ) . x and y. 6.16. Suppose 6.17. Find the ordered pairs corresponding to the points P I , P,, P , and P , which appear below in the coordinate diagram of {l, 2, 3 , 4 > Y ( 2 , 4 , 6 , S}. (.I.- 1, ~ ~ 4 2 1 2 3 4 6.18. Let W = {Mark,Eric, Paul: and let V = {Eric,David]. t i ) IT’ v T’, (ii) V X W , (iii) 1’1 = T’X V . 6.19. Let A = {2,3}, B = { 1 , 3 , 5 ) and C = {3,4}. in Problem 6.4, and then find A X B X C. 6.20. Let S = { a , b , c}, T = { b , c, cl) and W = {a,d } . Construct the tree diagram of then find S X T X W . 6.21. Suppose t h a t the sets V , W and Z have 3, 4 and 5 elements respectively. Determine the number of elenients in (i) V x W x 2, iii) Z x V X W , (iii) W X 2 X V . 6.22. Let A = B r: C. Determine if either statement is true: rii A A A = ( B x B ) n ( ( C x C ) , (ii) A x A = ( B x C ) ? ( C X B ) . 6.23. Prove: A X Find: Construct the “tree diagram” of A SX X C, as ?’ A TI.‘ and X E (BUG‘) = ( A X B ) u ( A X C ) . TRUTH SETS OF PROPOSITIONS 6.2-1. Find the truth se t of p 6.25. Find the t r u t h set of - p v -q. 6.26. Find the t r u t h set of ( p v 6.27. Find the truth set of ( p .+ q ) 6.28. Let P = P ( p ,q, . . .) and Q = Q ( p , q , . . .) be propositions such t h a t P v Q is a tautology. t ha t the union of the t r u t h sets T ( P ) and T(Q)is the universal set: T ( P ) U T ( Q )= 7J. 6.29. Let the proposition P = P ( p ,q , . . . ) logically imply the proposition Q = Q ( p , q, . . . ) . Show t h a t the union of the t r u t h sets T ( - P ) and T ( Q )is the universal set: T ( - P ) u T ( Q ) = L‘. t)- q , Q) i A -r. ( p ++ 9.). Show Answers to Supplementary Problems = 2, u = 3. 6.16. .I. 6.17. P I = ( 1 , 3 ) , P, = (2,8), P; = ( 4 6 ) and P4 = (3,2). 6.18. ii) V A V = {(Mark, Eric), (Mark,David), (Eric, Eric), (Eric, David), (Paul, Eric), (Paul, David)) (ii) V (iii) V X X W = {(Eric, Ma rk), (David,Nark), (Eric, Eric), (David, Eric), (Eric, Paul), iDavid, Paul)} V = ((Eric, Eric), (Eric, David), (David, Eric), (David, David)} 57 PRODUCT S E T S CHAP. 61 6.19. 3, 3) 3, 4) 5 , 3) 5 . 4) 1, 3 ) 1, 4) 3, 3) 3 , 4) 5, 3) 5 , 4) The elements of A x E x C a r e the ordered triplets to the right 6.21. Each has 60 elements. 6.22. Both a r e true: 6.23. A X (BuC) = = = = = A X A = ( B X B ) n (C X C ) = ( B X C) n (C X 6’). { ( Y , y) : 2 E A , y EBUC} { ( x , ~ ) :% € A ;y E B o r y E C } { ( x , ~ :) x E A , ZJ E B : or .T E A , y E C} {(x,y) : (x,y) E A X B ; o r (s,u)E A X C} ( A x B ) u ( A X C) Observe t h a t the logical law p A (q \ T) E ( p 1, q ) v ( p A = {TF. FT} 6.24. T(p ts - q ) 6.25. T(--p v - q ) = { T F , F T , F F } 6.26. T ( ( p v q) -t -T) the tree diagram above. = { T T F , T F F , FTF, F F T , FFF] T) was used in the third step above. Chapter 7 Relations RELATIONS A binuisu i7elotion or, simply, relation R from a set A to a set B assigns to each pair (a, b ) in A Y E exactly one of the following statements: (i) “u is related to b”, written a R b (ii) “u is not related to b”, written a$ Z, A relation from a set A to the same set A is called a relation in A . Example 1.1: Set inclusion is a relation in any class of sets. For, given a n y pair of sets A and B , either A C B o r A d B . Example 1.2: Marriage is a relation from t h e set M of men t o the set W of women. For, given a n y man m E -11 and any woman w E TV, either ?ii is married to ~ t ‘ or nz is not married to w. Example 1.3: Order, symbolized by “<”, or, equivalently, the sentence “x is less than y”, is a relation in any set of real numbers. For, given any ordered pair ( u , b ) of real numbers, either a<b or a Q b Example 1.4: Perpendicularity is a relation in the set of lines in the plane. For, given any pair of lines a and b , either a is perpendicular t o b o r a is not perpendicular t o b. RELATIOKS AS SETS OF ORDERED PAIRS Now any relation R from a set A t o a set B uniquely defines a subset R“ of A follows: €2:: = { ( a ,b ) : a is related to b } = {(a, b ) : aR b } X B as On the other hand, any subset R,’of A x B defines a relation R from A to B as follows: n I: 2, iff ( a , b ) E In view of the correspondence between relations R from A to B and subsets of A X B , we redefine a relation by I Definition: 1 A relation R from A to B is a subset of A x B. Example 2.1: Let R be the following relation from A = {1,2, 3) B = {a, b } : R = CCL 4, (1, b ) , (370)) I l l 1 to Then 1R a, 2 @ b , 3 R a, and 3 $ b. The relation R is displayed on the coordinate diagram of A X B on the right. Example 2 . 2 : Let R be the following relation in W = { a , b , c]: R = { ( a , b ) , (a,c), (c, Then a$a, u R b , c R c and c $ a . 58 4,(c, b ) ) a 1 2 3 CHAP. 71 Example 2.3: 59 RELATIONS Let A be a n y set. The i d e x t i t y relation in A , denoted by 1 or P A , is the set of all pairs in A X A with equal coordinates: A.4 = { ( a , a ) : a E A } The identity relation is also called the diagonal by virtue of its position in a coordinate diagram of A X A . INVERSE RELATION Let R be a relation from A to B. The inverse of €2, denoted by R-l, is the relation from B to A which consists of those ordered pairs which when reversed belong to R: R-’ Example 3.1 : = { ( b ,a ) : (a, b ) E R } Consider the relation in A = {1,2,3}. R = {(1,2), (1,3), ( 2 , 3 ) ) Then R-’ = ((2, I), (3,1), (3,2)$ a r e identical, respectively, to the relations Observe t h a t R and R-’ in A , i.e., (a, b ) E R iff a Example 3.2: <b and < and > ( a , b ) E R-l iff a > b The inverses of the relations defined by “x is the husband of y” ‘‘x is taller than and y” a r e respectively “x is the wife of y” and “3: is shorter than y” EQUIVALENCE RELATIONS A relation R in a set A is called .i.e.flexive if a R a , i.e. (a,a) E R, for every a E A . Example 4.1: (i) Let R be the relation of similarity in the set of triangles in the plane. Then R is reflexive since every triangle is similar to itself. (ii) Let R be the relation < in any set of real numbers, i.e. (a,b ) E R iff a < b. Then R is not reflexive since a Q a for any real number a. A relation R in a set A is called symmetric if whenever a R b then b R a , i.e. if (a, b ) E R implies ( b , a ) E R. Example 4.2: ii) The relation R of similarity of triangles is symmetric. similar to triangle /3, then ,8 is similar t o 01. For if triangle 01 is (ii) The relation R = {(l, 3), (2,3), (2,2), (3, l ) } in A = {l,2,3} is not symmetric since (2,3) E R but ( 3 , 2 ) 4 R . A relation R in a set A is called t ~ a n s i t i v eif whenever a R b and b R c then a R c , i.e. if (a, 6) E R and ( b , c ) E R implies (a,c) E R. Example 4.3: (i) The relation R of similarity of triangles is transitive since if triangle similar to ,8 and /3 is similar to y, then a is similar to y. 01 is (ii) The relation R of perpendicularity of lines in the plane is not transitive. Since if line a is perpendicular to line b and line b is perpendicular to line c, then a is parallel and not perpendicular to c. A relation R is an equivalence r e l a t i m if R is (i) reflexive, (ii) symmetric, and (iii) transitive. Example 4.4: By the three preceding examples, the relation R of similarity of triangles is a n equivalence relation since it is reflexive, symmetric and transitive. 60 RELATIONS [CHAP. 7 PARTITIONS A partition of a set X is a subdivision of X into subsets which are disjoint and whose union is X , i.e. such that each a € X belongs to one and only one of the subsets. The subsets in a partition are called cells. of subsets of X is a partition of X iff: Thus the collection { A l ,A,, . . . , (ij X = A,UA,U UA,,,; (ii) for any A i , A , , either A , = A j or A , n A j = (2 - Example 5.1 : a + Consider the following classes of subsets of X = { l , 2 , . . . ,8,9}: (i) [{I,3,5}, { 2 , 6 ) , {4,8,931 !ii) [{I, 3,5}, {2,4,6,83, {5,7,9}] (iii) [{I,3,51, {2,4,6,8}, (7,911 Then (i) is not a partition of X since 7 E X but 7 does not belong to any of the cells. Furthermore, (ii) is not a partition of X since 5 E X and 5 belongs t o both { l , 3,5} and {5,7,9). On the other hand, ( 5 ) is a partition of X since each element of belongs to exactly one cell. x EQUIVALENCE RELATIONS AND PARTITIONS Let R be an equivalence relation in a set A and, for each a E A , let [a], called the equivalence class of A , be the set of elements t o which a is related: [a] = {X : ( a , ~E ) R} The collection of equivalence classes of A , denoted by A I R, is called the quotient of A by R: AIR = {[a]: a E A ) The fundamental property of a quotient set is contained in the following theorem. Theorem 7.1: Let R be an equivalence relation in a set A . Then the quotient set A I R is a partition of A . Specifically, (i) a E [ a ] , for every a E A ; (iij [a] = [ b ] if and only if (a,b ) E R; (iii) if [a]# [ b ] , then [a] and [ b ] are disjoint. Example 6.1: Let R , be the relation in Z, the set of integers, defined by x = y (mod 5 ) which reads “z is congruent to y modulo 5” and which means t h a t the difference x - y is divisible by 5 . Then R5 is an equivalence relation in Z. There a r e exactly five distinct equivalence classes in Z I Rj : A, zz {. . ., -10, -5, 0, 5 , 10, . . .} A , = { ..., -9, -4, 1, 6, 11, ...} A , = { ..., -8, -3, 2 , 7 , 12, ...} A , = { ..., -7, -2, 3, 8, 13, . . . } A4 = { ..., -6, -1, 4, 9, 14, . . . } + Observe t h a t each integer x which is uniquely expressible in the form x = 5 q T where 0 5 r < 5 is a member of the equivalence A , where r is the remainder. Note t h a t the equivalence classes a r e pairwise disjoint and t h a t 2 = A, U A, U A, U A, U A, 61 RELATIONS CHAP. 71 Solved Problems RELATIONS 7.1. Let El be the relation < from A = (1 ,2 ,3 ,4 ) t o B = (1,3,5}, i.e. defined by “x is less than y”. (i) Write R as it set of ordered pairs. (ii) Plot R on a coordinate diagram of ‘4x B. (iii) Find the inverse relation l?*. (i) R consists of those ordered pairs (a,b ) E A X B for < b ; hence R = h l , 3 ) , ( 1 , 5 ) , (2,31, 12, 51, 1 3 ,51, (4 , 5 )) which a (ii) R is sketched on the coordinate diagram of A X E as shown in the figure. (iii) The inverse of R consists of the same pairs as a r e in R but in the reverse order; hence R-’ 1 = ((3,1), ( 5 , 1 ) , (3,21, ( 5 , 2 ) , 1 5 , 3 ) ,(5,4)) Observe t h a t R-1 is the relation greater than y”. 7.2. IiEEE 1 >, i.e. 2 3 4 defined by ‘‘x is Let €2 be the relation from E = [ 2 , 3 , 4 , 5 ) to F = {3,6,7,10) defined by “x divides y”. (i) Write R as a set of ordered pairs. (ii) Plot R on a coordinate diagram of E XF . (iii) Find the inverse relation R-I. (i) Choose from the sixteen ordered pairs in E X F those in which the first element divides the second; then R = { ( 2 ,6), ( 2 , lo), ( 3 , 31, 13, 61, 1 5 , l O ) ) (ii) R is sketched on the coordinate diagram of E X F as shown in the figure. (iii) To find the inverse of R , write the elements of R but in reverse order. R-l = { ( 6,2) , (10,2), ( 3 , 3 ) ,f6,3i, 110,5)} 7.3. Let d l = (a,b , c, d } and let R be the relation in M consisting of those points which are displayed on the coordinate diagram of M x M on the right. d Find all the elements in A l which are related to b, that is, ( J . (x,b ) E R } . b (i) (ii) Find all those elements in J l to which d is related, that is, {x : (d,x)€ R ) . (iii) Find the inverse relation R-l. (i) C a a b c d The horizontal line through b contains all points of R in x h i c h b appears as the second element (cr, b ) , ( b , b ) and (d,b ) . Hence the desired set is { a , b , d ] . (ii) The vertical line through d contains all the points of R in Nhich ( I appears as the first element (d,a)and ( t l , b ) . Hence { u , b \ i b the desired set. (iii) F i i s t n r i t e R a s a set of ordered pails, and then write the pairs in reverse order R R-’ = ( ( a , b ) , ( b , a ) , (6, b), ( b , 4 , (c, c ) , ( d , (0, ( d , b ) ) = ( ( b , a),( a , b ) , ( b , b ) , (6b ) , (c, c), ( a , 4, ( b , 4 ) 62 [CHAP. 7 RELATIONS EQUIVALENCE RELATIONS 7.4. Let R be the relation 4 in N = {1,2,3, . . .}, i.e. (a, b ) E R iff a L b. Determine whether R is (i) reflexive, (ii) symmetric, (iii) transitive, (iv) an equivalence relation. (i) R is reflexive since a f a for every a € N . (ii) R is not symmetric since, for example, 3 f 5 but 5 (iii) R is transitive since a 5 b and b 6 c implies a 6 c. a 3, i.e. (3,5) E R b u t (5,3) 4 R. (iv) R is not an equivalence relation since i t is not symmetric. 7.5. Let R be the relation ( 1 (parallel) in the set of lines in the plane. Determine whether R is (i) reflexive, (ii) symmetric, (iii) transitive, (iv) an equivalence relation. (Assume that every line is parallel to itself.) (i) R is reflexive since, by assumption, (Y / I a f o r every line a. (ii) R is symmetric since if a ( 1 p then p 11 a, i.e. if the line a is parallel to the line /3 then /3 is parallel to a. iiii) R is transitive since if Q ) j /3 and /3 1) y then a I)y. (iv) R is an equivalence relation since it is reflexive, symmetric and transitive. 7.6. Let W = {1,2,3,4}. Consider the following relations in W : R , = ((1,2 ) , (4,317(2,2),(2,1), (3,1)) R, = ((2,219 (2,3), (392)) R, = {(1,3)) Determine whether each relation is (i) symmetric, (ii) transitive. (i) Now a relation R is not symmetric if there exists an ordered pair (a, b ) E R ( b , a ) 4 R. Hence: R , is not symmetric since (4,3) E Rl but (3,4) 4 R , , such that R3 is not symmetric since (1,3)E R3 but ( 3 , l ) 4 R3. On the other hand, R, is symmetric. (ii) A relation R is not transitive if there exist elements a, b and c , not necessarily distinct, such that ( a , b ) E R and ( b , c) E R but ( a , c ) 4 R Hence R , is not transitive since (4,3) E R1 and ( 3 , l ) E Rl but ( 4 , l ) 4 Rl but (3,3) 4 R, Furthermore, R, is not transitive since (3,2) E R , and (2,3) E R, On the other hand, R3 is transitive. 7.7. Let R be a relation in A . Show that: (i) R is reflexive iff (i) Recall t h a t A A C R ; (ii) R is symmetric iff R = R-l. A = {(a,a) : a E A } . Thus R is reflexive iff (a,a) E R for every a € A iff c R. (ii) Suppose R is symmetric. Let ( a , b ) E R , then ( b , a ) E R by symmetry. Hence ( a ,b ) E R-1, and so R c R-1. On the other hand, let (a,b ) E R-1; then ( b , a) E R and, by symmetry, ( a , b ) E R. Thus R-1 c R , and so R = R-1. Now suppose R = R-1. symmetric. Let (a,b) E R; then ( b , a ) E R-1 = R. Accordingly R is 7.8. 63 RELATIONS CHAP. 71 Consider the relation R = ((1,l), ( 2 , 3 ) , (3,2)} in X = { 1 , 2 , 3 } . Determine whether or not El is ( i ) reflexive, (ii) symmetric, (iii) transitive. (ij R is not reflexive since 2 E X but 12,2j 4 R. (ii) R is symmetric since R-1 = ((1,l), (3,2), (2,3)} = R. (iii) R is not transitive since ( 3 , 2 ) E R and 12,3) E R b u t (3,3)4 R. 7.9. Let N = [ l ,2 , 3 , . . . }, and let R be the relation = in N (a,b ) (c, d ) iff ad = bc Prove that R is an equivalence relation. X N defined by -- F o r every (a, b ) E N X N , ( a , b ) ( a , b ) since a6 = ba; hence R is reflexive. Suppose (a,b ) = (c, d). Then ad = bc, which implies t h a t cb = da. Hence (c, d ) and so R is symmetric. Now suppose (a,b ) ( e , d ) and (c, d ) = ( e , f ) . Then ad = bc and cf = de. Thus (a, b) iatZ)(cf) = (be)(&) and, by cancelling from both sides, at’ = b e . Accordingly, (a, b ) = (e, f ) and so R is transitive. Since R is reflexive, symmetric and transitive, R is a n equivalence relation. then the above relation R is, Observe t h a t if the ordered pair (a,b ) is written as a fraction b a c . in fact, the usual definition of equality between fractions, i.e. 7 = - 1ff ad = bc. j d 7.10. Prove Theorem 7.1: Let R be an equivalence relation in a set A . Then the quotient set A 1R is a partition of A . Specifically, (i) a E [ a ] , for every a € A ; (ii) [a] = rbl if and only if (a, b ) E R ; (iii) if [ a ] + Ibl, then la] and [bl are disjoint. Proot o f ( I ) , Since R is reflexive, (a, a ) E R f o r every a € A and therefore a € [ a ] . Proof o t (11). Suppose ( a ,b ) E R. We want to show t h a t [ a ] = [ b ] . Let x E [b]; then ( b , x) E R. But by hypothesis ( a , b ) E R and so, by transitivity, ( a ,a ) E R. Accordingly x € [ a ] . Thus [bl C [ a ] . To prove t h a t a c , b ] , we observe t h a t ( a , b ) E R implies, by symmetry, t h a t ( b , a ) E R. Then by a similar argument, we obtain [a]~ [ b l . Consequently, [a]= [ b ] . On the other hand, if [a]= [ b ] ,then, by ( I ) , b E [b]= [a]; hence ( a , b ) E R. Proof o t (1111. We prove the equivalent contrapositive statement if ‘a1 bl # @ then [a] = [bl If [a]r b] P $3, then there exists an element x E A with .c E [ a ] n [ b l . Hence (a,x)E R and (b,x) E R. By symmetry, (x,b) E R and by transitivity, ( a , b ) E R. Consequently by (ii), [ a ] = lbl. PARTITIONS 7.11. Let X = { a , b , c , d , e , f , g } , and let: (i) A , = { a , c , e ) , A , = { b } , A , = { d , ~ } ; (ii) B , = ( a , e , Y), B, = { c , d } , B, = { b , e, f } ; (iii) C, = ( a , b , e , g } , C, = { c } , C, = { d , j ) ; (iv) D,= { a , b , c, d, e , f, SI. Which of { A , ,A 2 ,A 3 } , { B l ,B2,BJ), {C,, C,, C3}, {D,} are partitions of X ? (i) { A , , A , , A , ] is not a partition of (ii) { B , , E 2 , E , ] is not a partition of (iii) {C,, C 2 ,C,] is a partition of X X = C , d C 2 u C3 and the sets (iv) {D,;is a partition of X . X since t E X but f does not belong to either A l , A > , o r A , . X since e E X belongs to both B , and B,. since each element in X belongs t o exactly one cell, i.e. a r e pairwise disjoint. 64 RELATIONS [CHAP. 7 7.12. Supplementary Problems RELATIONS 7.13. Let R be the relation in A = ( 2 , 3 , 4 , 5 ) defined by “x and ?/ a r e relatively prime”, i.e. “the only comnion divisor of x and y is 1”. (i) Write R as a set of ordered pairs. (ii) Plot R on a coordinate diagram of A X A . (iii) Find R-1. 7.14. Let N = {l,2 , 3 , .. . ] and let R be the relation in N defined by r R = {(rr,y): x , y E N , r + 2 y = 8 } (i) Write R as a set of ordered pairs. 7.15. = 8, i.e., (ii) Find R - l . Let C = { 1 , 2 , 3 , 4 , 5 } and let R be the relation in C consisting of the points displayed in the following coordinate diagram of C X C: (i) State whether each is t r u e or false: ( a ) 1 R 4, ( b ) 2 R 5, (c) 3 (ii) Find the elements of each of the following subsets of C: (a) {x : 3 R x}, ( b ) {x : (4,x ) E R } , ( c ) {x : ( x , 2 ) 62 R } , 7.16. + 2y Consider the relations diagonal relation. < and 5 in N = {l,2 , 3 , . . .}. $ 1, (d) 5 ( d ) {Z : x R 5). Show t h a t <UA = EQUIVALENCE RELAATIONS 7.17. Let TY = {l,2 , 3 , 4 } . Consider the following relations in W : Rl = {(1,1), ( 1 , 2 ) } R , = {(1,1), ( 2 , 3 ) , (431)) R4 = ((1, I), (2,217 (3,311 R, = T.V X W R , = {(1,3), ( 2 , 4 ) ) R, = Determine whether or not each relation is reflexive. 8 $ 3. f where 1 is the RELATIONS CHAP. 71 65 7.18. Determine whether o r not each relation in Problem 7.17 is symmetric. 7.19. Determine whether o r not each relation in Problem 7.17 is transitive. 7.20. Let R be the relation I of perpendicularity in the set of lines in the plane. Determine whether R is Ii) reflexive, (ii) symmetric, (iii) transitive, (iv) a n equivalence relation. 7.21. Let N = {l,2,3, . . . } and let = be the relation in N X N defined by (a,b) (i) Prore r is a n equivalence relation. (c,d) iff a +d = b +c (ii) Find the equivalence class of (2,5), i.e. [ ( 2 , 5 ) ] . 7.22. Prove: If R and S a r e equivalence relations in a set X , then R n S is also a n equivalence relation in X . 7.23. (i) Show t h a t if relations R and S a r e each reflexive and symmetric, then R reflexive and symmetric. U S is also (ii) Give a n example of transitive relations R and S f o r which R U S is not transitive. V = {l,Z,3 ) . 7.25. Find all partitions of 7.26. Let [ A , , A , , . . ., A,,,1 and [B,, B,, . . ., B,] be partitions of a set X . Show t h a t the collection of sets is also a partition (called t h e [ A , n B , : i = l , ...,m , j = l , . . . ,n] p ~ ~ , t i t i ~ofi l X ) . CTOSS 7.13. 7.14. 7.15. 7.17. 7.18. R,, R, and R , a r e the only symmetric relations. 7.19. All the relations a r e transitive. 7.20. (i) no, ( i i ) yes, iiii) no, (iv) no 7.21. (ii) [(Z,5)l = {(a,b ) : a = {(a,a + 5 = b + 2, a , b E N ) + 3) : a E NI = {(I,4), (2,5), (3,6), ( 4 , 7 ) , . . .I 1.23. (ii) R = { \ l , 2 ) ) and S = {(2,3)} 7.24. (i) no, (iii no, (iii) yes, (iv) yes 7.25. [ { 1 , 2 , 3 ) ] , [{l:, {Z, 3)], [C2>, C1,3$], [{3), {1,2)1 and [{I},{z:, (311 Chapter 8 Functions DEFINITION OF A FUNCTION Suppose that to each element of a set A there is assigned a unique element of a set B ; the collection, f , of such assignments is called a functioyz (or mapping) from (or on) A into B and is written f : ~ +B or A & B The unique element in B assigned to a E A by f is denoted by f’(a),and called the i m a g e of a under f o r the value of f at a. The domain of f is A , the co-domain B. The m x g e of f, denoted by f [ A ]is the set of images, i.e., f l A ] = { f ( a ): a € A } Example 1.1: Let f assign to each real number its square, t h a t is, for every real number f ( x ) = x2. Then the image of -3 is 9 and so we may write f ( - 3 ) = 9 o r f : -3 2 : let 9. + Example 1.2: Let f assign to each country in the world its capital city. Here the domain of f is t h e set of countries in the world; the co-domain is the list of cities of the world. The image of France is Paris, t h a t is, f\Fra nc e ) = Paris. Example 1.3: Let A = { a , b , c , d ) and B = { a, b, c } . a-b, The assignments b - t c , c + c and d + b define a function f from A into B. Here f(a) = b, f(b) = c, f(c) = c and f(c1) = b. The range of f is { b , c ) , tha t is, f [ A ]= { b , c } . Example 1.4: Let R be t h e set of real numbers, and let f : R + R assign to each rational number the number 1, and to each irrational number the number -1. Thus f(x) = 1 if 3: is rational -1 if x is irrational The range of f consists of 1 and -1: f [ R ]= {I,-1). Example 1.5: Let A = {a, b , c , d ) and B = {x,y,z}. f : A+B. The following diagram defines a function Here f ( a ) = y, f(b) = N , f(c) = x and f ( d ) = y. and t h e co-domain a r e identical. 66 Also f [ A ]= B , t h a t is, the range CHAP. S] F CN CTIO N S 67 GRAPH OF A FUNCTION To each function J : A B there corresponds the relation in A X B given by ((a,/'(a)) : a E A ) We call this set the g r a p h of f . Two functions f : A B and g : A B are defined to be equal, written f = y, if ?(a)= g(a) for every a E A , that is, if they have the same graph. Accordingly, we do not distinguish between a function and its graph. The negation of J = g is written i = y and is the statement: there exists an ci E A f o r which f ( a ) g ( a ) -+ + -+ - A subset f of A x B , i.e. a relation from A to B , is a function if it possesses the folloving property: [F] Each a E A appears as the first coordinate in exactly one ordered pair (a, b ) in f . Accordingly, if ( a , b ) E f, then t ( a ) = b , Example 2.1: Let f : A -+ B be the function defined by the diagram in Example 1.5. Then the graph of f is the relation { ( a ,Y), ( b , x), (c, x ) , ( d . L/)S Example 2.2: Consider the following relations in A = {1,2,3]: f is a function from A into A since each member of A appears as the first coordinate in exactly one ordered pair in f ; here f(1)= 3, f ( 2 ) = 3 and f(3) = 1. g is not a function from A into A since 2 € A is not the first coordinate of a n y pair in g and so g does not assign any image to 2. Also h is not a function from A into A since 1 € A appears as the first coordinate of two distinct ordered pairs in h , (1,3) and (1,2). If h is t o be a function i t cannot assign both 3 and 2 to the element 1 E A . Example 2.3: - A real-valued function ,f : R R of the form f ( x ) = ax t b (or: defined by y = a.c 4- b ) is called a linear f u x c t i o x : its graph is a line in the Cartesian plane RZ, The g r ap h of a linear function can be obtained by plotting ( a t least) two of its points. F o r example, to obtain the graph of f j x ) = 2%- 1, set up a table with a t least t w o values of x and the corresponding values of f ( x ) as in the adjoining table. The line through these points, (-2, -5), (0, -1) and (2,3), is the graph of f as shown in the diagram. Graph of f(x) = 2 x -1 68 [CHAP. 8 FUNCTIONS Esaiiiple 2.4: A real-valued function f’ : R + R of the form . . . a,,-,:c f ( l ) = q)T’* alX,--l + + + :c + a, is called a p o l y z o m i u l f m c t i o n . The g r a p h of such a function is sketched by plotting various points and drawing a smooth continuous curve through them. Consider, for example, the function f i x ) = .c’ - 2.1; - 3. Set up a table of values f o r s and then find the corresponding values f o r f ( x ) as in the adjoining table. The diagram shows these points and the sketch of the graph. -2 -1 0 1 2 3 4 I f(x) 5 0 -3 --1 -3 0 5 t Graph of f(x) = COMPOSITION FUNCTION Consider now functions f : A + E and g : B + - 2.1: - 3 C illustrated below: Let a E A ; then its image f ( a ) is in E , the domain of g . Hence we can find the image of f ( a ) under the function g, i.e. g ( f ( a ) ) . The function from A into C which assigns t o each a E A the element g(f(a)) E C is called the composition o r product of f and g and is denoted by g 0 f . Hence, by definition, ( g o f ) ( a ) = !ma)) Example 3.1: Let f : A .+ E and g : B C be defined by the following diagrams: f A W e compute ( g 0 t’) : A 3 + B C g C by its definition. (gof)(4 = s(f(a))= d u ) = t ( 9 0 f N b ) = g ( f ( b ) ) = g(z) = ?“ iu O f ) ( C ) = g ( f ( c ) ) = u ( u ) = t Notice t h a t the composition function g 0 t‘ is equivalent to “following the arrows” from A to C in the diagrams of the functions j and g. CHAP. 81 Example 3.2: 69 FUNCTIONS Let R be the set of real numbers, and let f : R follows: f(2) = R and g(x) = z and 22 + g :R -+ R be defined a s +3 ( f o g ) ( 2 ) = f ( d 2 ) ) = f(5) = 25 Then = d f ( 2 ) ) = g(4) = 7 ( g Of)@) Observe t h a t the product functions f o g and g o f a r e not the same function. We compute a general formula f o r these functions: ( f o g ) ( x ) = f(g(z)) = f ( x t 3 ) = ( x ( g o f ) ( x ) = g ( f ( x ) ) = g(22) = + 3)2 = +3 x2 + 62 + 9 ONE-ONE AND ONTO FUNCTIONS A function f : A + B is said to be one-to-one (or: one-one or 1-1)if different elements in the domain have distinct images. Equivalently, f :A + B is one-one if f ( a ) = f(a') implies Example 4.1: Consider the functions f : A ing diagram: A f - a = a' B , g :B B + C and h : C C B + D defined by the followh D a W Now f is not one-one since the two elements a and c in its domain have the same image 1. Also, g is not one-one since 1 and 3 have the same image y. On the other hand, h is one-one since the elements in the domain, x, y and x, have distinct images. A function f : A + B is said t o be onto (or: f is a function from A onto B or f maps A o n t o B ) if every Z, E B is the image of some a E A . Hence f : A B is onto iff the range of f is the entire co-domain, i.e. f [ A ]= B. + Example 4.2: Consider the functions f, g and h in the preceding example. Then f is not onto since 2 € B is not the image of a n y element in the domain A . On the other hand, both g and h a r e onto functions. Example 4.3: Let R be the set of real numbers and let f : R R, g : R + R and h : R defined as follows: f ( x ) = 22, g(z) = z3 - x and h ( z ) = x2 + The graphs of these functions follow: f ( x ) = 2" g(2) = 23 - x h(x) = 22 + R be [CHAP. 8 FUNCTIONS 70 The function f is one-one; geometrically, this means t h a t each horizontal line does not contain more than one point of f. The function g is onto; geometrically, this means t h a t each horizontal line contains a t least one point of g. The function h is neither one-one or onto; f o r h(2) = h ( - Z ) = 4, i.e. the two elements 2 and -2 have t h e same image 4, and h[R]is a proper subset of R ; f o r example, -16 4 h [ R ] . INVERSE AND IDENTITY FUNCTIONS In general, the inverse relation f - I of a function f C A X B need not be a function. However, if f is both one-one and onto, then f - ' is a function from B onto A and is called t,he inverse function. Example 5.1: Let A = { a , b , c ) and B = { r ,s, t}. Then f = { ( a ,s), ( b , t ) , ( c , 41 is a function from A into B which is both one-one and onto. seen by the following diagram of f : This can easily be Hence t h e inverse relation f-1 = ((5, a ) , ( 4 JJ), ( r ,4) is a function from B into A . The diagram of Observe t h a t the diagram of ing the arrows. f-'follows: f-'can be obtained from the diagram of f by revers- For any set A, the function f :A + A defined by f ( x ) = x, i.e. which assigns to each element in A itself, is called the i d e n t i t y function on A and is usually denoted by 1, or simply 1. Note that the identity function 1, is the same as the diagonal relation: 1, = A,. The identity function satisfies the following properties: Theorem 8.1: For any function f : A + B, lBOf = f = Theorem 8.2: f o l , If f : A + B is both one-one and onto, and so has a n inverse function f-' : B + A , then f-lof = 1, and f o f - ' = 1, The converse of the previous theorem is also true: Theorem 8.3: Let f : A-+ B and y : B + A satisfy g o f = 1, and f o g = 1, Then f is both one-one and onto, and y = f - ' . FU I\:C TIONS CHAP. 81 71 Solved Problems FUNCTIONS 8.1. State whether or not each diagram defines a function from A = ( a , b , c } into B = {r,y,z], m@gJm z C z C (i) C (ii) 2 (iii) (i) No. There is nothing assigned t o the element b € A . (ii) No. Two elements, x and x , a r e assigned t o c € A . (iii) Yes. 8.2. Rewrite each of the following functions using a formula: (i) To each number let f assign its cube. (ii) To each number let g assign the number 5. (iii) To each positive number let li assign its square, and to each non-positive number let 11 assign the number 4. (i) Since f assigns l o a n y number ?. its cube .v3, f can be defined by f(.c) = x3, (ii) Since g assigns 5 to a n y number s, we can define g by g(..) = 5. (iii) Two different rules a r e used to define k as follows: li (?.) 8.3. = $2 4 if x > 0 ifx"0 Let f , g and 11 be the functions of the preceding problem. Find: (i) f(4),f'(-2), f(@); (ii) ~ ~ ( 4 g1(,- 2 ) , ~(0);(iii) h(4), h(-2), 4 0 ) . (i) Now ,t'(:cs) = x3 (ii) Since y(x) = 5 f o r every number 2;: hence f(4) = 43 = 64. J ( - 2 ) = (-2)s = -8, 5, g(-2) = 5 and g(0) = 5. h(4) = 42 = 16. On the other hand, if (iii) If 3: > 0, then h(x) = x2; hence h(a) = 4: thus h(-2) = 4 and h i O ) = 4. 8.4. Let A = ( 1 , 2 , 3 , 4 , 5 } and let J:,4 f(0) O3 = 0. y(4) = f o r every number x , + x f 0, then A be the function defined in the diagram: (i) Find the range of f. (ii) Find the graph of f , i.e. write f as a set of ordered pairs. The range f[AI of the function f consists of all t h e image points. Kow only 2, 3 and 5 appear as the image of a n y elements of A ; hence f[A] = { 2 , 3 , 5 ) . (ii) The ordered pairs ( a , f ( a ) ) ,where a € A , form the graph of f. Now f(1) = 3, f ( 2 ) = 5, f(3) = 5, f(4) = 2 and f(5) = 3; hence f = {(1,3), ( 2 , 5 ) , (3,5), (4,2), (5,3)}. (i) 72 8.5. FUNCTIONS [CHAP. 8 Determine whether o r not each relation is a function from Let X = (1,2,3,4>. X into X. ti) f = ii2,3), (1,4), (2, I), ( 3 , 2 ) ,14,4)} iii) g = “3, l),(4,2), (1,1)) iiii) h = j(2, l ) , (3,4), (1,4), j2,11, (4,4)} Recall t h a t a subset f of X X X is a function f : X X if and only if each a E X appears as the first coordinate in exactly one ordered pair in f . iir No. Two different ordered pairs ( 2 , 3 ) and ( 2 , l ) in f’ have the same number 2 as their first coordinate. i i i l No. The element 2 E X does not nppear as the first coordinate in a n y ordered pair in 8 . iiiii Yes. Although 2 E X appears a s the first coordinate in two ordered pairs in h, these two ordered pairs a r e equal. --f 8.6. Find the geometric conditions under which a set f of points on the coordinate diagram of A x B defines a function f : A B. + The requirement t h a t each u E A appear a s the first coordinate in exactly one ordered pair in f is equivalent to the geometric condition t h a t each vertical line contains exactly one point in f . 8.7. :m- Let TV = { a , b, c, d } . Determine whether the set of points in each coordinate diagram of T I ’ x IT’ is a function froin IT’ into W . tM b a a b c d a a (i) b c f f!Ff a d a (ii) b c d (iii) No. The vertical line through b contains two points of the set, i.e. two different ordered pairs ( b , b ) and ( b , d ) contain the same first element b. ( i i ) KO. The vertical line through c contains no point of the set, i.e. c E W does not appear a s the first element in any ordered pair. (iiir Yes. Each vertical line contains exactly one point of the set. (i) GRAPHS OF REAL-VALUED FUNCTIONS 8.8. Sketch the graph of f ( x ) = 3 x - 2 . This is a linear function: only two points (three a s a check) a r e needed t o sketch its graph. Set up a table with three values of s, say, x = -2, 0, 2 and find the corresponding values of f(r): t’(-2) = 3(-2) - 2 = -8, ((0) = 3(0) - 2 = -2, f(2) = 3(2) - 2 = 4 D r a u the line through these points as in the diagram. X 10, - 2 ) CHAP. 81 8.9. FUNCTIONS Sketch the graph of (i) f ( x ) = 2 t .L‘ - 73 6 , (ii) g(x) = x? - 3x2 - x + 3. In each case, set up a table of values f o r x and then find the corresponding values of t ’ ( ~ ) . Plot the points In a coordinate diagram, and then d r a w a smooth continuous curve through the points (ii) x I gis) -2 -15 0 3 0 -3 0 15 -1 0 1 2 3 4 Graph of f . Graph of g. COMPOSITION OF FUNCTIONS 8.10. Let the functions f : A -+ B and y : B -+ C be defined by the diagram (i) Find the composition function y o I: : A (i) .+ C. (ii) Find the ranges of f , g and y o f . We use the definition of the composition function to compute. (Y {)(a) = M a ) ) = g(u) = t (gof)(b) = g(f(b)) = g(r) = s (Y t’)(C) = g ( f ( c ) ) = s(l/)= t Note t h a t we arrive at the same answer if we “follow the arrows” in the diagram: c+y +t a --t u + t , b + x + s , (ii) By the diagram, the images under the function f a r e x and y, and the images under g a r e T , s and t : hence and range of g = { Y , s, t } range of f = {.c, g } By the images under the composition function a r e t and s. hence range of g o f = {s, t ) i ~ , Note t h a t the ranges of g and y 8.11. Consider the functions f c t’ a r e different. = { ( 1 , 3 ) ,( 2 , 5 ) , (3,317 (4,1), (592)) g = { ( I ,4),(2, I), (3911, (4, a, 15,311 from X = j l , 2 , 3 , 4 , 5 } into X. (i) Determine the range of f and of g. the composition functions g o f and f o g . (ii) Find 74 FUNCTIONS (ij [CHAP. 8 The range of a function is the set of image values, i.e. the set of second coordinates: hence r an g e o f f = ( 3 , 5 , 1 , 2 } and range of g = { 4 , 1 , 2 , 3 } 8.12. Let the functions f and g be defined by f(x) = 2 x f l and g(x) = x 2 - 2 . formulas defining the composition functions (i) g 0 t' and (ii) f o g . (i) Compute g 0 f as follows: (g 0 f i ( r j = g ( f ( z ) ) = g(2.2. + 1) = ( 2 +~ 1)2 - 2 = 422 Find + 42 - 1. Observe t h a t the same answer can be found by writing y = f ( ~= ) 2.t +1 and z = g(y) = y2 - 2 and then eliminating y from both equations: x = (iil Compute fog as follows: Note t h a t f o g # g u2 -2 = (2% + 1)2 - 2 (fog)(x) = f(g(x))= f(x'- = 4x2 + 4.1 - 1 2 ) = 2 ( n 2 - 2 ) + 1 = 2%'- 3. of. 8.13. Prove the associative law for composition of functions: if A A B % C % D then ( J z o g ) o f = h o ( g o f ) . ( ( h 0s) 0 For every a E A , ({l = (hog)(f(a)) = h(u(f(aj)) (Y f ) j ( a ) = h((sO f)(a)) = h ( g ( f ( a ) ) ) Hence ( h 0 9 )0 f = h o ( g o f ) , since they each assign the same image to every a E A . Accordingly, we may simply write the composition function without parentheses: h 0 g 0 f . ONE-ONE AND ONTO FUNCTIONS 8.14. Let A = ( a , b , c , d , e J , and let B be the set of letters in the alphabet. functions f , g and h from A into B be defined as follows: f 9 a+ r b+a a+ x b+Y C+S C + X b+c c+e d-+r d+Y d-+r e-+e e+x e+s (iii) (9 Are any of these functions one-one? (ii) h a+ a Let the CHAP. 81 75 FUNCTIONS Recall t h a t a function is one-one if it assigns distinct image values t o distinct elements in the domain. (i) No. For f assigns T to both a and cl. (ii) No. F o r g assigns x to both a and e . (iii) Yes. F o r h assigns distinct images to different elements in the domain. 8.15. Determine if each function is one-one. (i) To each person on the earth assign the number which corresponds to his age. (ii) To each country in the world assign the latitude and longitude of its capital. (iii) To each book written by only one author assign the author. (iv) To each country in the world which has a prime minister assign its prime minister. (i) (ii) (iii) (iv) No. Many people in the world have the same age. Yes. No. There a r e different books with the same author. Yes. Different countries in the world have different prime ministers. 8.16. Prove: If f : A+ B and g : B + C are one-one functions, then the composition function g 0 f : A C is also one-one. + Let (g o f ) ( a ) = ( g o f ) ( a ’ ) ; i.e. &“(a)) = g(j(a’)). Then f(a)= f(a’) since g is one-one. Furthermore, a = a‘ since f is one-one. Accordingly, g o f is also one-one. 8.17. Let the functions f : A i A + B, g : B + C and h : C + D be defined by the diagram. C B (i) Determine if each function is onto. (i) The function f : A The function g : B + --f h D (ii) Find the composition function h o g o f . B is not onto since 3 E B is not the image of any element in A. C is not onto since z E C is not the image of any element in B. The function h : C + D is onto since each element in D is the image of some element of C. (ii) Now a+Z+x+4, b+l+y+6, c+Z+x+4. Hence h o g o f = {(a,4),(b,6),(~,4)}. 8.18. Prove: If f : A + B and g : B -+ C are onto functions, then the composition function g o f : A + C is also onto. Let e be any arbitrary element of C. Since g is onto, there exists a b E B such t h a t g ( b ) = c. Since f is onto, there exists a n a E A such t h a t f ( a ) = b . But then ( g o f ) ( 4 = g(f(a\) = g ( b ) = c Hence each c E C is the image of some element a € A . Accordingly, g 0 f is a n onto function. ;CHAP. 8 FUNCTIONS 76 INVERSE FUNCTIONS 8.19. Let W = { 1 , 2 , 3 , 4 , 5 } a n d l e t f : W - + W ,g : W + W and h : W - + W b ed efi n ed b y the following diagrams: Determine whether each function has a n inverse function. In order for a function to have a n inverse, the function must be both one-one and onto. Only h is one-one and onto; hence h , and only h, h a s a n inverse function. 8.20. Let f : R -+ R be defined by f ( x ) = 2x - 3. Now f is one-one and onto; hence f has an inverse function f - l . Find a formula f o r f F 1 . Let y be the image of x under the function f : y = f ( x ) = 2x - 3 Consequently, 2; will be the image of y under the inverse function in the above equation: x = (y 3)/2 f-1. Solve for x in terms of y + J-W Then = (~+3)/2 is a formula defining t h e inverse function. 8.21. Let A = (a,b, c, d } . Then f = ((a, b), ( b , d ) , (c,a), ( d , c)} is a one-one, onto function from A into A . Find the inverse function f - ' . To find the inverse function in reverse order: which is the inverse relation, simply write each ordered pair f-1, = { ( b , 4, (4b), ( a ,c ) , ( c , 4) f-' The function assigns to any element the square of the element minus 3 times the element plus 2. (a) = t(-3) (-3)2 - 3(-3) ( b ) f(2) = (2)2 - 3(2) +2 + + + 2 = 9 9 2 = 20 = 0, f(-4) = (-4)2 - 3(-4) f(2) - f(-4) 1(y) = (L/)2 - 3(y) (a2)2 - (t) (X2)Z - - 3(a*) - - - /I) = 30. = 0 - 30 = -30 + 2 = y2 3y + 2 + 2 = a1 3a* + 2 t(.') = 3(x2) + 2 = a4 32' + 2 + 2 = y2 - 2yz + (t ) f ( y - x ) = (y z)2 - 3(y = (x + h)2 3(a + h ) + 2 = x2 + 2xh + h' ( 9 ) f(.c + (c) ( d ) f(a2) = +2 - 2) - 22 - 3y + 32 + 2 +2 - 3%- 3h Then FUNCTIONS CHAP. 81 (h) f ( x + 3) = (X + 3)' - 3(x + 3) + 2 = +2 (N? (i) f ( 2 x - 3 ) = ( 2 2 - 3)2 - 3(2x - 3) 77 + 62 + 9) - 3% - 9 + 2 = x2 + 3%+ 2 + 9 - 6 x + 9 + 2 = 4x2 - 18% + 20 = 4r2 - 122 ( j ) Using ( h ) and (i), we have + f ( +~ 3) = (429 - 18s + 20) + (6+ 3%+ 2 ) = 5 ~ ' - 15%+ 22 - 6x3 + 1 0 ~ ' 3% (k) f ( 6 -3~ + 2 ) = ( 9 -3%+ 2)2 - 3 ( z * - 3%+ 2) + 2 = (1) f(f(x)) = f ( d - 3~ + 2) = x4 + 10x2 - 3% ( m ) f(f(x + 1)) = f([(x + 1)2 - 3 ( x + 1) + 21) = f([x2+ 22 + 1 - 3 x - 3 + 21) f ( 2 -~ 3) ~4 - 623 = ( n ) By (g), f ( z f(z' - x) +h) = x2 = Using ( n ) ,we have - x)' - 3(s'- x) +2 = x4 - 2x3 - 2x2 + 2xh + h2 3%- 3h + 2. Hence + 2 ~ +h h' 3.2: - 3 h + 2) ( ~ 2 3~ i- 2 ) = 2xh ($2- - + + [ f ( x h) - f ( z ) ] / h = (2rh h2 - 3 h ) h = 2r 8.23. Prove Theorem 8.1: + 32 + 2 - f ( 2 Ih ) - t ( x ) = (0) (22 For any function f :A + B, + h2 - 3h +h-3 = f = f~ 1,. lnof Let a be an y ar b i t r ar y element in A ; then ( 1 0!)(4 ~ = 1~(f(a)= ) f(a) Hence 1,o f = f = f 0 1, and (folA)(a) = f(la(a)) = f ( a ) since they each assign f ( a ) to every element a E A . Supplementary Problems FUNCTIONS 8.24. 8.25. S t a t e whether each diagram defines a function from { l , 2 , 3 } into { 4 , 5 , 6 ) . Define each function by a formula: (i) To each number let f assign its square plus 3. (ii) To each number let g assign its cube plus twice the number. (iii) To each number greater t h an o r equal to 3 let h assign the number squared, and to each number less than 3 let h assign the number -2. 8.26. Determine the number of different functions from { a , b } into {l, 2 , 3 ) . 8.27. Let f : R + R (iv) f ( x - 2 ) . 8.28. Let g : R + be defined by R be defined by f(x) = g(x) = z2 - 4 2 + 3. Find (i) f ( 4 ) , (ii) f(-3), (iii) f ( y - 2 x ) , 78 FUNCTIONS [CHAP. 8 8.30. Let the function g assign t o each name in the set {Betty, Martin, David, Alan, Rebecca} the number of different letters needed t o spell the name. Find the g r a p h of g , i.e. write g a s a set of ordered pairs. 8.31. Let it’ = [ 1 , 2 , 3 , 4 } and let g : W + be defined by the diagram (iii) Find iii) Plot g on the coordinate diagram of W X TT’. ( i i Write g a s a set of ordered pairs. the range of g . 8.32. TIT :mfffF ## Let V = { 1 . 2 , 3 , 4 } . Determine whether the set of points in each coordinate diagram of V X V is a function from V into 1’. 4 3 2 2 1 1 1 2 3 4 1 2 (i) 3 4 1 2 (ii) 3 4 1 (iii) 2 3 4 (iv) GRAPHS OF REhL-VALUED FUNCTIONS 8.33. Sketch the graph of each function: (i) f(z)= 2, (ii) g(x) = 4%- 1, (iii) k ( z ) 1 2%’- 4%- 3. 8.34. Sketch the g r a p h of each function: (i) f ( x ) = :c3 - 3%-I- 2, (ii) g(z) = :ci - 10x2 + 9, 0 if x = O (iii) h ( x ) = -if x#O COMPOSITION OF FUNCTIONS 8.35, The functions f :A + B, g : B + A , h : C + B, F : B + C, G : A + C a r e pictured in the diagram below. Determine whether each of the folloLving defines a product function and if i t does, find its domain and co-domain: (i) g o f , iii) h o f , (iii) F o f , (iv) G o f , (v) g c h , (vi) F o h , (vii) h o G o g , rviiil li 0 G. 8.36. @Q@- The following diagrams define functions .f, g and h which map the set {1,2,3,4} into itself. 4 [it (ii) 4 Find the ranges of f, y and h . Find the composition functions 4 4 4 (1) f o g , (2) h o j , (3) g2, i.e. gog. 4 8.37. 79 FUKCTIONS CHAP. 8 ) + + Let f : R 3 R and g R + R be defined by Ji.r) = z2 3% 1 and g ( r ) = 2s - 3. Find fomiulas defining the product functions (i) f c g , iii) g 3 f , (iii) g o g , (iv) f o f . ONE-ONE, ONTO .iKD INVERSE FUNCTIONS 8.38 Let f : X - 17. Which conditions define a one-one function: J ( n l = f ( b ) implies a = b (iii) f ( a ) # J ( b ) iniplies a # b iiir n = b implies f(n) = f(b) (iv) a f b implies J ( a ) # f ( b ) li) 8.39. (i) State whether o r not each function i n Problem 8.36 is one-one (ii) State whether or not each function in Problem 8.36 is onto. 8.10. Prove Theorem 8.2: If f : A B is one-one and onto, and so has an inverse function (i) f - 1 o . f = 1, and (ii) f o f - 1 = 1,. 8.41. Prove: If t : A 8.42. Let f : R 8.43. Let 8.14. Let R be an equivalence relation in a non-empty set A . The function from A into the quotient set AIR is defined by ~ ( a=) [ a ] , the equivalence class of a. Show t h a t 17 is a n onto function. 8.45. Prove Theorem 8.3: Let f : A + B f is one-one and onto, and g = f - 1 . -f + g :R + 4 E and g : B + A satisfy y of Find a formula f o r the inverse function and y : E .+ A (i) No, iii) Yes, iiii) No 8.25. 8.26. Nine. 8.27. (i) 3, (iil 24, ( i i i ) y2 - 4xy 8.28. (i) g ( 5 ) = 10. (ii) y(0) = 2, (iii) g(-2) = 0 8.29. (i) Yes, iii) No, iiii) Yes, (iv) No 8.30. g = {(Betty, 4 , (RIartin, 6), (David, 4 ) , (Alan, 3 ) , (Rebecca, 5)) 8.31. (i) g = C(1,2), (2,3), (3, I), (4,3)), 8.32. (i) No, (ii\ No, (iii) Yes, (iv) No 8.33. + 4x2 - 4y + 8x + 3, + -2 -3 Graph of f :R .+ R. g-1 : R 3 R. f-1 satisfy g 0 t' = 1,4 and f o g = lg. Then Answers to Supplementary Problems 8.24. then = l A ,then f is one-one and g is onto. R be defined by f ( x ) = 3 x - 7 . Find a formula f o r the inverse function R be defined by g(z) = x? -t 2. f-1, (iv) x2 - 8% 4- 15 (iii) : 2 , 3 , 1 ) -- 4 -- 2 # C H A P .8 FC N C TI 0N S 80 (iii) -3 -5 -3 #Graph of h 8.34. - 46 Graph o f f -15 Graph of g '---4 Graph of h 8.35. (i) g of :A + A , (ii) No, (iii) F ;1 to;^ ) I ,I of :A + C, 8.37. (i) iii) 4 (I/(Z) 1 {1,3} + 6r - 1 (iii) (y 0 g ) ( x ) = 4x - 9 (ivl (f o f ) ( x ) = x4 6x3 + range of f = {1,2,4}, range of g = { 1 , 2 , 3 , 4 } , range of h ( , f o g ) ( x )= 4 2 2 - 6 ~ + 1 (ii) (g 0 f)(x) = 2x2 (iv) No, (v) g 0 h : C + A , (vi) F 0 h : C -t C, (vii) h 0 G 0 g : B + B , (viii) h 0 G : A B 8.36. lii + + 14x2 + 152 + 5 8.38. ( i ) Yes, Iii) No, (iii) No, (iv) Yes 8.39. ti) Only g is one-one. 8.42. I-1 ( x ) = (x+ 7)/3 8.43. g-l 9;) (3:) = 2- (ii, Only g is onto. Chapter 9 Vectors COLUMN VECTOKS A column rector ~r is a set of numbers u l , u,,. . . ,zcn written in a column: ZG The numbers ti, = are called the components of the vector u. Example 1.1: The following a r e column vectors: The first two vectors have two components; whereas the last two vectors have three components. Two column vectors 2 i and v a r e equal, written u = v, if they have the same number of components and if corresponding components are equal. The vectors \3i' are not equal since corresponding elements are not equal. Example 1.2: Let Then, by definition of equality of column vectors, = 4 = 2 x - l = 3 x-y z+y Solving the above system of equations gives x = 3, y = -1, and x = 4. Remark: In this chapter we shall frequently refer t o numbers as scalars. VECTOR ADDITION Let u and 2' be column vectors with the same number of components. The sum of ZL and v, denoted by ti -t v , is the column vector obtained by adding corresponding components: 81 [CHAP. 9 VECTORS 82 Note that ii + 1: has the same number of components as with different numbers of components is not defined. 11 and 2'. The sum of two vectors Example 2.1: Example 2.2: (-a; The sum + f,-54)l// is not defined since the vectors have different numbers of components. A column vector whose components are all zero is called a zero vector and is also denoted by 0. The next Example shows that the zero vector is similar to the number zero in that, f o r any vector 11, u f O = u. r) p SCALAR RIULTIPLICATION The product of a scalar lc and a column vector u,denoted by k - u or simply kx, is the column vector obtained by multiplying each component of 21 by It: k * 11 = k = 4/ un Observe that ZJ and 1c.u have the same number of components. We also define: -u = -1.21 and u- 2j = z! t (-v) Example 3.1: The main properties of the column vectors under the operations of vector addition and scalar multiplication are contained in the following theorem: CHAP. 9 83 VECTORS Theorem 9.1: Let V T be L the set of all 1%-componentcolumn vectors, and let and let k , k r be scalars. Then: (i) (ii) (iii) (ivj (v) (vi) (vii) (viii) (21 t -L 19) I' t 2% ZI,I - , 20 E V, w = u + (c t 2c), i.e. addition is associative; v + u,i.e. addition is commutative: + 0 = 0 u = u; LI T ( - 1 1 ) = (-u) z/ = 0: l;(u t 2') = ku kv; ZI -t (k+~ + ' ) I I= + ku + lc'u; (kk'jrc = k(lc'u); l u = 21. The properties listed in the above theorem are those which are used to define an abstract niathematical system called a linear spuce 3r rector space. Accordingly, the theorem can be restated as follon-s: Theorem 9.1: The set of all ,i-component column vectors under the operations of vector addition and scalar multiplication is a vector space. ROM' VECTORS Analogously, a row vector 2) is a set of numbers = ZL (ul, u,,. . . , u,,, . .,u, written in a row: ifI, 21,) The numbers ut are called the coiiiyoilents of the vector 21. Two row vectors are equal if they have the same number of components and if corresponding components are equal. Observe that a row vector is simply an ordered ri-tuple of numbers. The sun1 of two row vectors ?( a n d 21 with the same number of components, denoted by u + r , is the row vector obtained by adding corresponding components from 11 and u: = (ul, u,,. . . , u,,) (vl, v2, . . ., w,,) = (211 + w l , u2+v2,. . . , 20, + 2yJ 1' 2' T + The product of a scalar lc and a row vector u,denoted by l;*u or simply kri, is the row vector obtained by multiplying each component of 21 by k : k-u = k(/,ll,u , , We also define - R = -1.u as v e did f o r column vectors. Example 4.1: . . . ) un) = (lW1, kLI,, . . rCuJ .) and u-zt = u + ( - w ) + ( 0 , 5, -7, 11) = (I, 3, -4, 7 ) = (5, -10, 15, -20) 5 * (1, -2, 3, -4) (1,-2, 3, -4) We also have a theorem f o r row vectors which corresponds to Theorem 9.1. Theorem 9.2: The set of all 11-component row vectors under the operations of vector addition and scalar multipIication satisfies the properties listed in Theorem 9.1, that is, is a vector space. MULTIPLICATION O F A ROW VECTOR AND A COLUMN VECTOR If a row vector u and a column vector v have the same number of components, then their product, denoted by 21 * 1' or simply uv, is the scalar obtained by multiplying corresponding elements and adding the resulting products: 86 VECTORS ROW VECTORS 9.6. Let 11. = (2,-7,1), iiii) -3u, (iv) -w. 1i1 ii) = (-3,0,4) and = (0,5,-8). IL' (i) u + v , (ii) ZI fw, Find: Add corresponding components: + u I 2: [CHAP. 9 2, = 12, - 7 , 1) + (-3, 0, 4) = ( 2 - 3, - 7 + 0 , 1 + 4 ) = (-1, - 7 , 5 ) Add corresponding components: 2, + = ( - 3 , 0 , 4) 20 + ( 0 , 5, - 8 ) = (-3 f 0, 0 + 5 , 4 - 8 ) iiiii Multiply each component of u by the scalar -3. -3u = (-3, 5, -4) = -3(2,-7,l) = (-6,21,-3). iiv) Xultiply each component of w by -1, i.e. change the sign of each component: -W 9.7. = -(O, 5, - 8 ) = ( 0 , -5, 8) Let u,v and w be the row vectors of the preceding problem. (ii) 22~ 3v - 5w. + Find (i) 3u-4v, F i r s t perform the scalar multiplication and then the vector addition. 9.8. (i) 3%- 42, = 3(2, - 7 , l ) - 4(-3,0,4) iii) 2x + 3v - 5w + + = (6, -21.3) + (12,0, -16) = (18, -21, -13) 3(-3, 0, 4) - 5(0, 5, - 8 ) = (4, -14, 2 ) (-9, 0, 12) (0,-25, 40) = (4 - 9 i0,-14 0 - 25, 2 12 440) = (-5, -39, 54) = 2(2, -7, 1) + + + (2x,3, v) = (4, x + x , 22). Find x, y and x if Set corresponding components equal to each other to obtain the system of equations 2x = 4 3 = x + x y = 22 Then x = 2, x = 1 and y = 2 . 9.9. Find x and ?-J if s(1,l)-C y(2,-1) = (1,4). First multiply by the scalars x and y and then add: x ( l , 1) f y(2, -1) = ( z , x) + (2y, -y) = (x + 2y, x - y) = (1, 4) Now set corresponding components equal to each other t o obtain x t 2 y = 1 x--l/ = 4 Solve the system of etpations to find x = 3 and y = -1. RilULTIPLICATION OF A ROW VECTOR AND A COLUMN VECTOR (i) Multiply corresponding coinponents and add: (2, -3,6)(-!) = 2.8 + ( - 3 ) * 2 + 6*(-3) = 16 - 6 - 18 = -8 CHAP. 9J \' E CT 0R S 87 (ii) The product is not defined since the vectors have different numbers of components. (iii) Multiply corresponding components and add: / 4 = (3,-5,2,1) 9.11. Let u = (1, li., -3) + (-51.1 + 2 * ( - 2 ) 3.4 \-!I f and v = 2' . = 4- 1 . 5 12 - 5 Determine k so that ZL*U - 4 +5 = 8 = 0. Compute z( * r and set i t equal to 0: 21.2, = (l,k,-3) :i = 1-2 -5 T (-5l.k = 2 - (-3).4 - 5 k - 12 = -10 - 5k = 0 = 0 is k = -2. The solution t o the equation -10-5k RIISCELLANEOUS PROBLEMS 9.12. Prove Theorem 9,1(i): F o r any vectors v and w, (uA u ) + zv = u 11, + (v +w). Let i t , , uZ and zc, be the ith components of I ! , 21 and w respectively. Then u,+ v, is the 7th component of z( - 2: and so (24% 71,) - ti', is the zth component of ( I ( - 2:) - z ~ . On the other hand, v,+ w, is the zth component of v ZL' and so z(, ( v , f w,)is the ith component of u ( t , + zc). But u,, T~ and u',a r e numbers for which the associative law holds. t h a t is, + + + vt) = u + (v + (u, Accordingly, (z( - 1 % )- w + A w,= i(, + + (v,+ w J , f o r each i since all their corresponding components a r e equal. 11) 9.13. Let 0 be the number zero and 0 the zero vector. Show that, for a n y vector u,OIL = 8. Method I : Let ? I , be the ith component of u. Then 0 i t i = O is the ith component of Ozi. component of 0(( is zero, Ozt is the zero vector, i.e. Ozt = e. Method 2: We use Theorem 9.1: Now adding -0" 0 u = (0 + 0)zc = ou Since every + ou to both sides gives e = Ozt + (-010 9.14. Prove Theorem 9.l(vi): ( k + k')u = 021 + [Oa+ (-Ou)] = kzi = OH e = OI( + k'u. Observe t h a t the first plus sign refers to the addition of the two scalars k and k' whereas the second plus sign refers to t h e vector addition of the two vectors k u and k'u. Let u, be the ith component of u . Then (k +k ')zi1 is the ith component of ( k + k')u. On the other hand, kii, and k'u, a r e the ith components of the vectors kzi and k'", respectively, and so ku, k'u, is the ith component of the vector kit - k'u. B u t k , k' and i!, a r e numbers; hence + (k Thus (k - k')zi = /;I( + k'zi, + k')zc, = + k z ~ , k'ul, f o r each z a s corresponding components a r e equal. 9.15. The norm 1 u 1 of a vector u = ( u l , I [ , , . . . , u,J is defined by IIull = du; +u;+ . * . +v; VECTORS 88 [CHAP. 9 Supplementary Problems COLUMN VECTORS 9.16. Compute: (i) (-!)+ -1 9.17. Compute: (i) 3 f (ii) -7 I-:) + 5 (1;) - 2 (ii) 3 (iii) - (1;) -4 9.19. Determine x and y if: 9.20. Determine 9.21. 2, y (i) and x if Determine r and y if (. z x + (t) = + = 2~ y ("6 (ii) x (i) = 2 (-t]i + .(8) (-:) . = (22). ROW VECTORS 9.22. Let u = (2,-1,0,-3), (iI 32(, (iij IL + 71, 9.23. Find 2 and y if 9.24. Find 2, u and x if w = (1,-1, -1,3) (iii) 2u - 3c, (ivj and w = (1,3,- 2 , Z ) . 524 - 3V - 4W, (v) --2l + 2 V - 22c. x(1,Z) = - 4 ( y , 3 ) . x(l,l,O) + y(2,0, -1) + z ( O , l , l ) Find: = (-1,3,3) MULTIPLIC.-ITION OF A ROW VECTOR AND A COLUMN VECTOR - 4 VECTORS CHAP. 91 9.26. Determine lc so t h a t 9.27. Prove: If u = ( u I , . . , u J i s a row vector such t h a t n components, then u is the zero vector. 89 (1, k , -2, -3) . u.2) = 0 f o r every column vector v with MISCELLANEOUS PROBLEMS 9.28. Prove Theorem 9 . l ( v ) : F o r a n y vectors u and v and scalar 9.29. Prove Theorem 9.1tvii): For a n y vector zi 9.30. Find I / (2, -3, 1 ii) 11 ( 3 , 5 , -4) 11, (ii) 6, 4) and scalars k,k', li, k(ic T IS) = k u + kv. ( k k ' ) ~= k ( k ' u ) . Answers to Supplementary Problems 9.16. 9.17. 9.18. 9.19. (i) x = 2, y = 4 9.20. z = 2 , y=3, x=-2 9.21. x = O , y = 0 or x = - 2 , 9.22. (ii) x = -1, y = - 3 / 2 y=-4 + 3% = (6, -3,0, -9), u ZI = (3, -2, -1,O), 2v - 2w = ( - 2 , - 7 , 2 , 5 ) -u + 9.23. 9.24. x = l , y=-l, 9.25. (i) 7 9.26. 9.30. 2 = 2 (ii) Not defined (iii) 0 22t - 3~ = (1,1,3,-15), 52t - 3~ - 4~ = (3, -14,11, - 3 2 ) , Chapter 10 Matrices MATRI C E S A rnatyiy is a rectangular array of numbers; the general form of a matrix with Tn rows and n columns is ,\ . . . . . . . . . . . . . . . . . .%::: '\ ... all a13 * * * a21 a22 a23 * * * UnLl am2 am3 * .' ainn 1 We denote such a matrix by (a,,),,,, o r simply (a,]) and call it an n z x "rz matrix (read ('nz by n"). Note that the row and column of the element atj is indicated by its first and second subscript respectively. Example 1.1: Consider the 2 X 3 matrix -;3 (: Its rows are (1, -3,4) and (0,5, -2) and its columns a r e In this chapter, capital letters A , B , . . . denote matrices whereas lower case letters a, b, . . . denote numbers, which we call scalars. Two matrices A and E are equal, written A = E , if they have the same shape, i.e. the same number of rows and the same number of columns, and if corresponding elements are equal. Hence the equality of two m matrices is equivalent to a system of mn equalities, one for each pair of elements. Example 1.2: The statement 22 ( x+ yy +w x-l.0 'I = X n (; :) is equivalent to the system of equations -I x+u = = 2i+w = 2-20 = x-y 3 1 5 4 The solution of the system of equations is x = 2, Remark: u = 1, 2 = 3 , w = -1. A matrix with one row is simply a row vector, and a matrix with one column is simply a column vector. Hence vectors are a special case of matrices. MATRIX ADDITION Let A and B be two matrices with the same shape, i.e. the same number of rows and of columns. The sum of A and B , written A E , is the matrix obtained by adding corresponding elements from A and B: + 90 all a12 91 i\IA T R I CE S CHAP. 101 . + bll be1 b12 bml bmz " + + + \ . . . aan + azl + bzl a22 + b.2 ................................. I + b,i,l an,?+ amn+ b,,,,/ all ' bz2 ... ................. amn a12 bll 2112 * * aln blll 212, bi7,2 anti * Note that A + B has the same shape as A and B. The sum of two matrices with different shapes is not defined. 3 0 -6) 2 -3 11 Example 2.1: Example 2.2: = 1+3 -2+0 3+f-6) 0 + 2 4 + ( - 3 ) 1'5 The sum is not defined since the matrices have different shapes. A matrix whose elements are all zero is called a zero matrix and is also denoted by 0. Part (iii) of the following theorem shows the similarity between the zero matrix and the scalar zero. Theorem 10.1: For matrices A , B and C (with the same shape), + + C = A + ( B+ C), i.e. addition is associative; (ii) A + B = B + A , i.e. addition is commutative; (iii) A + 0 = 0 + A = A. (i) ( A B) SCALAR MULTIPLICATION The product of a scalar lc and a matrix A , written k A or Ak, is the matrix obtained by multiplying each element of A by lz: all kall kazl a12 ................. Gn1 am2 - * kaln ka22 1 . 9 + . kaI n . kU2, .................... ka,l a kam2 * ' ka,,,, Note that A and /<Ahave the same shape. Example 3.1: /1 3(4 -2 3 0) -5 = (::: 3.(--2) 3.3 We also introduce the following notation: A - A = (-l)A and - 3.(-5) B = A + (-B) The next theorem follows directly from the above definition of scalar multiplication. Theorem 10.2: For any scalars k , and lz., and any matrices A and B (with the same shape), (i) (klk2)A = kl(kpA) (iv) 1 . A = A , and OA = 0 (v) A + (-A) = ( - A ) + A = 0. (ii) k l( A + B ) = k1A k1B (iii) (kl k2)A = klA kzA + + + 92 [CHAP. 10 RI AT RICE S Using (iii) and (iv) above, we also have t h at A + A = 2A, A + A + A = 3A, , . . MATRIX PIULTIPLICATION Let A and B be matrices such that the number of columns of A is equal to the number of rows of B . Then the product of A and B , written A B , is the matrix with the same number of rows as A and of columns as B and whose element in the ith row and jth column is obtained by multiplying the ith row of A by the j t h column of B: + + + where ciJ = aiibij a22b23 - * * atpbpJ. I n other words, if A = (atj)is an m x p matrix and B = ( b t l ) is a p A B = ( c i J ) is the nz X n matrix f o r which czj = atlblj &2b& f * ' . azpbpj = azkbkj + + X n matrix, then $ k=l If the number of columns of A is not equal to the number of rows of B , say A is m x p and B is q X 71 where p # q, then the product A 6 is not defined. val tal Example 4.1: + sb, + xbl m2 ta2 + sb, + ub, ra3 ta, + sb, + ttb, Example 4.2: (t :)(: :) = 1.1+1.3 1*2+1°4) (0*1+2*3 0.2+2*4 = (' ;) We see by the preceding example that matrices under the operation of matrix multiplication do not satisfy the commutative law, i.e. the products A B and B A of matrices need not be equal. Matrix multiplication does, however, satisfy the following properties: Theorem 10.3: (i) (AB)C = A ( B C ) (ii) A ( B + C ) = A B (iii) ( B + C ) A = B A + AC + CA (iv) k ( A B ) = ( k A ) B = A ( k B ) , where k is a scalar. We assume that the sums and products in the above theorem are defined. Remark: In the special case where one of the factors of A B is a vector, then the product is also a vector: CHAP. 101 h I 4 TRICE S Example 4.3: (2, - 3 , 4 ) -;I = (2 * 1 T (-3) * 5 93 + 4 . (-2), 2 . (-3) t ( - 3 ) + 1* (-1) (-3) * 2 5*(-1)+0*2 1-2) (-1) 4*2 1 -3 + ) = * 0 + 4.4) = (-21,lO) “I) Example 4.4: A system of linear equations, such a s is equivalent t o the matrix equation T h a t is, any solution to the system of equations is also a solution to the matrix equation, and vice versa. SQUARE MATRICES A matrix with the same number of rows as columns is called a square matrix. A square matrix with 71 rows and n columns is said to be of order 11, and is called a n n-square matrix. The main diagonal, or simply diagonal, of a square matrix A = (at,) is the numbers a,,, aZ2, . . ., ~ l , ~ ~ . Example 5.1: The matrix is a square matrix of order 3. The numbers along the main diagonal a r e 1, -4 and 2. The n-square matrix with 1’s along the main diagonal and 0’s elsewhere, e.g., I 10 \,o 0 0 1 0 0 1 is called the unit matrix and will be denoted by I . The unit matrix I plays the same role in matrix multiplication as the number 1 does in the usual multiplication of numbers. Specifically, Theorem 10.4: For any square matrix A , AI = I A = A 94 RIA TRI CE S [CHAP. 10 ALGEBRA OF SQUARE MATRICES Recall that not every two matrices can be added or multiplied. However, if we only consider square matrices of some given order n, then this inconvenience disappears. Specifically, any 12 x n matrix can be added to o r multiplied by another n x n matrix, or multiplied by a scalar, and the result is again an n X n matrix. In particular, if A is any n-square matrix, we can form powers of A : A 2 = A A , A 3 = A 2 A , . . . and A n = Z We can also form polynomials in A . That is, for any polynomial f(x) = + Uo we define f ( A ) to be the matrix f ( A ) zz aoZ Uln: + u2x2 + + *.. + alA + uzA2 + * a UnXn + anAn * In the case that J ( A ) is the zero matrix, then A is said to be a zero o r r o o t of the polynomial J(.T). Example 6.1: Let A = /1 (3 If f ( x ) = 2x9 '); -4 - 3~ + 5, = A2 = then g(x) = x2 On the other hand, if g(A) then + 3x - 10 (-:ii) + 3(' 3 then ') -4 - lo(: ;) = ('0 '1 0 Thus A is a zero of the polynomial g(z). TRANSPOSE The transpose of a matrix A , written At, is the matrix obtained by writing the rows of A , in order, as columns: a1 u2 a1 bl b2 * * * Note that if A is an m x n matrix, then .4t is an n X nz matrix. Example 7.1: The transpose operation on matrices satisfies the following properties. Theorem 10.5: + (i) ( A B)t = A t (ii) (At)t= A + Bt (iii) ( ~ c A=)kAt, ~ for k a scalar (iv) (AB)t= F A t . CHAP. 101 MATRICES 95 Solved Problems MATRIX ADDITION AND SCALAR MULTIPLICATION 10.1. Compute: 11 -1 1 -3 2 1 -2 (i) Add corresponding elements: 4 5 1+1 4+0 6 2+(-1) 6+(-5) 5+3 (ii) The sum is not defined since the matrices have different shapes. (iii) Add corresponding elements: 1 2 0 -5 3 -3 1 -1 -5 2 - -1) 6 0 -2 -3) 1 0 + 3 2 + (-5) (-3) + 6 + 2 (-5) + 0 1 + (-2) 4 -3 (i) + (-3) 3 -1 -4 -5 2 (-1) Multiply each element of the matrix by the scalar 3: ;) 3(-: 3.2 (3.(-3) = 3 3:;) 6 12‘ (-9 3) = (ii) Multiply each element of the matrix by the scalar -2: (-2) 1 (-2) 7 -Z(i 0 -1 = ( ((-2) - 2 ) **20 (-2)*(-3)) (-2) ’ (-1) -”) 1-2 ;) -14 (,,-; = (iii) Multiply each element of t h e matrix by -1, or equivalently change the sign of each element in the matrix: 2 -3 10.3. Compute: -:-:) 3%; - 1 -2 2 \’o -1 -3) 5 + qo 1 -2) 1 -1 -1 Fi r st perform the scalar multiplication, and then the matrix addition: 3(23 -: -:> 1 -2 - 2 ( 0 -1 -15 6 9 6 0 3) -12 + (-2) + 0 9 +0 +4 -3) 5/ + 2 (-2 0 + + {1 ’ 4 2 -1 6) -10 + (0 4 4 -4 + 6 + (-8) + (-10) + (-4) 3 -15 4 4 0 2 (-4) -12 + + -1 -2) -4 -8) 4 -7 -CHAP. 10 $1A TRI CE S 96 10.4. Find x, y, x and w if: x 6 2,) \-1 + 4 jz+w First write each side as a single matrix: ($ :;,) ( = .?: X Sx W + 4- 1 X+ZJ 3 + y + 6 2x+3 Set corresponding elements equal t o each other to obtain the four linear equations 22 = 4 32 = 2 + 4 2y = 6 f x 3y = z + u + 6 or 22 = Zt' - 1 3X = z t ? C - l zc: = 3 3w = 220 3 + The solution of this system of equations is z = 2, ?J = 4, z = I , w = 3. MATRIX NIULTIPLICATION 10.5. Let (Y x s) denote a matrix with shape r X s. When will the product of t w o matrices ( r x s ) ( t X ZL) be defined and what will be its shape? The product ( r X s ) ( t X u)of an Y X s matrix and a t X u matrix is defined if the inner numbers s and t a r e equal, i.e. s = t. The shape of the product will then be the outer numbers in the given order. i.e. T X ZL. 10.6. Let (Y x s) denote a matrix with shape r x s. Find the shape of the following products if the product is defined: (2 x 3)(3 x 4) (iv) ( 5 x 2)(2X 3) (ii) (4x 1)(1x 2) (iii) (1X 2)(3x 1) (v) (4 x 4)(3 x 3) (vi) (2 x 2 ) ( 2 X 4) (i) In each case the product is defined if the inner numbers a re equal, and then the product will have the shape of the outer numbers in the given order. (i) The product is a 2 X 4 matrix. (ii) The product is a 4 X 2 matrix. (iii) The product is not defined since the inner numbers 2 and 3 a r e not equal. (iv) The product is a 5 X 3 matrix. (v) The product is not defined since the innar numbers 4 and 3 a r e not equal. (vi) The product is a 2 X 4 matrix. I\' ") 10.7. Let A = 2 (i) -1 " and B = ( 3 -2 -') 6 ' Find (i) A B , (ii) BA. Now A is 2 X 2 and B is 2 x 3, so the product matrix A B is defined and is a 2 x 3 matrix. To obtain the elements in the first row of the product matrix A B , multiply the first row ( l , 3 ) of A by the columns , (-:) -*6 :I = ( 1.2 - 3.3 1.0 and + 3.(-2) (-$ of B , respectively: 1*(-4) , + 3.6) = (11 -6 14) To obtain the elements in the second row of the product matrix A B , multiply the second row (2,-1) of A by the columns of 6,respectively: CHAP. 101 97 MATRICES 1 3 \ / = 11 ( 2 * 2 4-(-1).3 Thus -C (-1)*(-2) = AB (ii) Now B is 2 X 3 and A i s 2 BA is not defined. X 14 -6 2.0 (111 -6 2 14) -14) 2. Since the inner numbers 3 and 2 a re not equal, the product 10.8. Let (i) (-1)*6 2.(-4) Find (i) A B , (ii) BA. Now A is 1 X 2 and B is 2 X 3, so the product A B is defined and is a 1 X 3 matrix, i.e. a row vector with 3 components. To obtain the elements of the product A B , multiply the row of A by each column of B : AB = (2,1) (i -:-:) = (2.1 + 1.4, 2 . ( - 2 ) + 1 . 5 , 2.0 + 1.(-3)) = (6,1,-3) (ii) Now B is 2 x 3 and A is 1 X 2. Since the inner numbers 3 and 1 a r e not equal, the product BA is not defined. 1 2 -1 1 0 10.9. Let A = i \-3 (i) 4 ) andB= (i -f -:I. Find (i) A B , (ii) BA. Now A is 3 X 2 and B is 2 X 3, so the product A B is defined and is a 3 X 3 matrix. To obtain the first row of the product matrix AB, multiply the first row of A by each column of B, respectively: \ =I: 2.1 + (-1)*3 2.(-2) + (-1)*4 + (-1)*O 2.(-5) = r -I0? To obtain the second row of the product matrix A B , multiply the second row of A by each column of B , respectively: -8 1.lt0.3 -10 = 1*(-2)+0*4 1*(-5)&0-0) -1 -8 1 -2 -8 -10 -5) To obtain the third row of the product matrix A B , multiply the third row of A by each column of B, respectively: -3 4/\ =I: (-3) -1 1 1 4 * 3 (-3) + * -8 -2 (-2) +4 i/ 4 -1 Thus AB = (-3) * -10 -5 (-5) - -I -1 +4 * 0 9 -8 -10 22 --: -8 -10 (ii) Now B is 2 x 3 and A is 3 X 2, so the product BA is defined and is a 2 X 2 matrix. To obtain the first row of the product matrix BA, multiply the first row of B by each column of A , respectively: [CHAP. 10 AT AT RICE S 98 =( 1 '2 + (-2) '1 + (-5) * 1 * (-1) + (-2) (-3) * 0 + (-5) 4 = ;" To obtain the second row of the product matrix B A , multiply the second row of B by each column of A : 15 = (3.2 -21 + 4 . 1 + 0*(-3) Thus 3*(-1) = 15 (10 + 4.0 + 0.4 ) = 15 (10 -21 -3) -21 -3) Remark: Observe t h a t in this case both A B and B A a re defined, but A B and B A a re not equal; in fact, they do not even have the same shapes. (i) Determine the shape of AB. (ii) Let cij denote the element in the ith row and j t h column of the product matrix AB, that is, A B = (el!). Find: c23, c14, C P I and CIZ. (ii Since A is 2 X 3 and B is 3 X 4, the product A B is a 2 X 4 matrix. (ii) Now c I j is defined as the product of the ith row of A by the j t h column of B . Hence: / o\ / 1\ C14 = (2,-1,O) = 2 ~ 1 + ( - 1 ) ~ ( - 1 ) + 0 ~= 0 2+1+0 -1 = 3 0:' c21 = ~ 1 21 (ii) (i) = (2, -1, ( -3 , (Tb) 1.1 + 0 . 2 + (-3)*4 = 2*(-4) = + (-l)*(-1) + 0 . 0 1 + 0 - 12 = -8 = -11 +1+0 = -7 '\ 5 / -7) The first factor is 2 X 2 and the second is 2 X 2, so the product is defined and is a 2 X 2 matrix: 1*4+6*2 1*0+6*(-l) (-3) 4 5 2 (-3) 0 5 ' (-1) + + -2 -5 99 RIA TRICES CHAP. 101 (ii) The first factor is 2 X 2 and the second is 2 X 1, so the product is defined and is a 2 X 1 matrix / 1.2+6.(-7) ) + 5*(-7) / \(-3)*2 - (iii) Now the first factor is 2 X 1 and the second is 2 X 2. Since the inner numbers 1 and 2 a r e distinct, the product is not defined.. (iv) Here the first factor is 2 X 1 and the second is 1 X 2, so the product is defined and is a 2 matrix: (v) The first factor is 1 X 2 and the second is 2 which we frequently write as a scalar. (2, -1) (-:) j: 1, (2. 1 = Y, 2 so the product is defined and is a 1 X 1 matrix + (-1). (-6)) = 181 = 8 / SQUARE MATRICES + . Find (i) A', [ii) As, (iii) f ( A ) , where J ( x ) = 2x3 - 42, 5 . \ 4 -3 (iv) Show t h a t A is a zero of t h e polynomial g(x) = M ? 2s - 11. 10.12. L e t A = (i) A2 I = AA = (: -;)(: -$ + + 1.2 4.2 1.1 2.4 (-3)*4 4.1 9 \-8 (-3).(-3) 1.(-4) f 2.17 4 * (-4) (-3) * 17 1 . 9 $- 2.(-8) (-3) * (-8) 4 9 + 2.(-3) + + -41 IT, 1 G O -67 (iii) To find J ( A ) ,first substitute A for x and 51 f o r the constant 5 in the given polynomial = 2x3 - 4 x - 5 : m Then multiply each matrix by its respective scalar: Lastly, add the corresponding elements in the matrices: - 1 /-14-4+5 \ l 2 0 - 16 0 + 60-8+0 ) -133 + 1 2 - 5, -13 104 52') -117 (iv) Now A is a zero of g(x) if the matrix g ( A ) is the zero matrix. Compute g ( A ) as was done f o r J ( A ) ,i.e. first substitute A for .z and 111 f o r the constant 11 in g ( x ) = x2+ 22 - 11: Then multiply each matrix by the scalar preceding it: >IAT RI CE S 100 ;CHAP. 10 Lastly, add the corresponding elements in the matrices. s(A) = / + 2 - 11 +0 9 (-8 t 8 + -4 4 i- o:, 17 - 6 - 111 - Since g i A ) = 0, A is a zero of the polynomial g(z). 10.13. Let B = [' -4" 3 0 and let j ( x ) = x2 - 4x - 3. Findf(B). -1 Fi r s t coinpute B2: 2 = BL BB 2 = + + + + + + 1.1 2 . 3 0.0 3.1 (-4).3 5.0 0 * 1 (-1) * 3 2 * 0 -3 2 + + + 2.(-4) T 0*(-1) 1.2 3.2 (-4).(-4) - 5*(-1) 0-2 (-1) * (-4) - 2 * (-1) + + + + + + 1.0 2.5 0.2 3.0 (-4)*5 5.2 0.0 (-1)*5 2.2 -1/ Then f ( B ) = 6' - 4B + 31 2 = 0 - (-: -6 1; -3 = 10.14. Let -4 = /1 j 4 -3 . (-9 -1 0 0 1 -;:)+ -1 7-4+3 - 12 - 0 -3-0+0 -6-8+0 17+ 16+3 2+4+0 10-0+0 -10 - 20 + O ) -1-8+3 Find a non-zero column vector 2~ = (-21 6 -3 -14 3; -;:I -6 such that AZL= 3u. = F i r s t set up t h e matrix equation Au = 3u: Write each side as a single matrix (column vector): Set corresponding elements equal to each other t o obtain the system of equations + x 321 = 3 2 4x - 3y = 3 y or -2.1. + 321 = 0 4~ - 621 = 0 The two linear equations a r e the same (see Chapter 11) and there exist a n infinite number of solutions. One such solution is N = 3, has the property t h a t A u = 3 2 ~ . = 2. In other words, the vector u = ("2> is non-zero and X A TR I C E S CHAP. 101 TRANSPOSE 101 1 0 1 0 \ 10.15. Find the transpose At of the matrix A = Rewrite the rows of A a s the columns of At: A t = [l "\ . 0 3 4 0 5 4 10.16.Let A be an arbitrary matrix. Under what conditions is the product AAt defined? Suppose A is an m X n matrix: then At is n x m . Thus the product AAt is always defined. Observe t h a t AtA is also defined. Here AAt is a n m X m matrix, whereas AtA is a n n X n matrix. 10.17. Let A = /j 3 ") -1 4 Find (i) A A t , (ii) AtA. * (i -:). 1 At = To obtain At, rewrite the rows of A as columns: - = + 2 * 2 + 0.0 + (-1)*2 + 4 . 0 + + i + + + 1.3 2*(-1) 0.4 3.3 (-1)*(-1) -I-4 . 4 ) 1.1 (3-1 1.1 3.3 2.1 (-1).3 0.1 + 4 - 3 3\ + 3*(-1) ++(-l)*(-l) 4*(-1) 1.2 2.2 0.2 = Then (n ++( -41 .) .44 2.0 0.0 2;) 10 -1 12 12 -4 16 PROOFS 10.18. Prove Theorem 10.3(i): (AB)C = A ( B C ) . Let A = ( a , ] ) , B = ( b j k ) and C = (chi). A B = S = (s,k) Furthermore, let T = (t,J. Then and BC = m s,k tji = = allblk bjicii a12brk f aambm, = -k + b@c,i + " ' + x a,,bJh 1=1 n bjncnL = 2 b,hChl h=l Now multiplying S by C, i.e. ( A B ) by C, the element in the ith row and lth column of the matrix (AB)G is n n m S,1C11 S,$21 ..' f s l , I C n ~ = 2 S l k c k [ = 2 (aLIb&?d + + k=l k = l )=1 On the other hand, multiplying A by T , i.e. A by BC, the element in the ath row and Zth column of t h e matrix A(BC) is Since the above sums a r e equal, the theorem is proven. 102 MATRICES 10.19. Prove Theorem 10.3(ii): A ( B + C) = AB [CHAP. 10 + AC. Let A = ( a i j ) , B = (bik) and C = (cjk). Furthermore, let D = B and F = AC = ( f i k ) . Then + djk = eik = ailbl, bjk fiirc + + aiibzk + alacak allClk m + *. * + . '* = aimbn,k CLijbjk j=1 ai~rlcink x = j=1 Hence the element in the ith row and kth column of the matrix A B eik + ??1 = fik + C = (djk),E = A B = (eiJ aijbJi, j=1 + m 2 j=1 m x = CLijCjk aijcjk + AC + aij(bjk j=1 is Cjk) On the other hand, the element in the ith row and kth column of the matrix A D = A ( B aildlk Thus A ( B f CLi2dzk + C) = A B + AC f ' ' ' + m ailndmk = in 2 = a,,dj, i=1 3 U,j(bjk j=1 + + C) is Cjk) since the corresponding elements are equal. 10.20. Prove Theorem 10.5(iv): ( A B ) t= P A t . Let A = matrix A B is (a,]) and B = (bJ. Then the element in the ith row and j t h column of the allb,, + a22b23 + . * * + (11 annbm, Thus ( 1 ) is the element which appears in the j t h row and ith column of the transpose matrix ( A B ) t . On the other hand, the j t h row of B f consists of the elements from the j t h column of B: (blJ bzt ". (21 bWIJ) Furthermore, the ith column of A f consists of the elements from the ith row of A : i'i (3) aim Consequently, the element appearing in the j t h row and ith column of the matrix BtAt is the product of ( 2 ) by ( 3 ) which gives ( 1 ) . Thus (AB)t = BtAt. Supplementary Problems MATRIX OPERATIONS For Problems 10.21-23, let -:), + C, C = ( - -3 2 0 5 -1 1 -4 0 10.21. Find: (i) A 10.22. Find: ii) A B , (ii) A C , (iii) A D , (iv) BC. (v) BD, (vi) CD. 10.23. Find: (i) A t , (ii) AtC, (iii) DtAt, (iv) B t A , (v) DtD, (vi) DDt. + B , (ii) A (iii) 3A - 4B. 0 ") 3 and D = (,-!I 103 $1AT R I CE S CHAP. 101 SQUARE MATRICES 10.24. g ( ~z ) 10.25. l2 Let A = ') (ii) If f ( x ) = (i) Find A2 and A3, -1 . 8, find g ( A ) . j3 .(.? - .C - "> 23 - 3 9 - 2r + + 4, (i) If f ( x ) = 2.c2 - 4.c 3, find f ( B ) . (ii) If g ( r ) = fj 15 3 . (iii) Find a non-zero column vector ti = f " such t h a t Bu = 6 2 4 . Let B = r2 \ Y) 10.26. Matrices A and E a r e said t o commute if A B = B A . Find all matrices 10.27. with (t t). Let A = 2\ 1) b , (E find f ( A ) . (iii) If - 42 - 12, find g ( B ) . s> which commute Find An. PROOFS (k, + k,)A = klA + k,A. 10.28. Prove Theorem 10.2iiii): 10.29. Prove Theorem 10.4[iii): k(AB) = ( k A ) B = A ( k B ) . 10.30. Prove Theorem 10.5(i): ( A + B)t = A + + B t . Answers to Supplementary Problems (-! -; 10.21. (i) 10.22. (i) Not defined. 5 (ii) (T1 10.23. 10.24. 10.26. 4 (i) A2 = (Io 3 (') -5) 18 (ii) Not defined. , 17 11 -12 4 -12 2) ( iiiii (iii) -2 -3 1 0 (i) (-1 3) 2 (ii) Not defined. A3 = (26 1 ) Only matrices of the form 27 (iii) ( 9 , 9 ) ( i d Is\ -1) ( 4 0 -3 (ii, f ( A ) = (8 i)commute with -7 -6 12 (y: -1) -1$ (vi) Not defined. (-: -!) 4 -2 (v) 14 (vi) (iii) g ( A ) = -31 Chapter 11 linear Equations LINEAR EQUATION IN TWO UNKNOWNS A linear equation in two unknowns x and y is of the form ax + by = c where a,b , c are real numbers. We seek a pair of numbers the equation, that is, for which ak, bk, = c N = (k,, k2) which satisfies + is a true statement. Solutions of the equation can be found by assigning arbitrary values to x and solving f o r y (or vice versa). Example 1.1: Consider the equation 2r+y = 4 If we substitute :c = -2 2*(-2)+y = 4 or in t h e equation, we obtain = 4 -4+y Hence (-2,8) is a solution. If we substitute equation, we obtain 2*3+y = 4 or 6 + y = 4 y = 8 or or N = 3 in the y = -2 Hence (3, -2) is a solution. The table on the right lists six possible values for z and the corresponding values for y , i.e. six solutions of the equation. T 2 3 Now any solution a = (k,,k,) of the linear equation ax b y = c determines a point in the Cartesian plane R'. If a,b are not both zero, the solutions of the linear equation correspond precisely to the points on a straight line (whence the name linear equation). + Example 1.2: + Consider the equation 2 x y = 4 of the preceding example. On the r i g h t we have plotted the six solutions of t h e equation which appear in t h e table above. Note t h a t they all lie on the same line. We call this line t h e g r a p h of the equation since i t corresponds precisely to the solution set of the equation. Graph of 2 x + y = 4 TWO LINEAR EQUATIONS I N w o UNKNOWNS We now consider a system of two linear equations in two unknowns x and p: + b,y = uPx+ b2y = U,X 104 c, c2 2 0 -2 CHAP. 111 105 L I N E A R EQUATIONS A pair of numbers which satisfies both equations is called a simultaneous solution of the given equations o r a solution of the system of equations, There are three cases which can be described geometrically. (Here we assume that the coefficients of x and y in each equation are not both zero.) t = -3 x+2y = 3 2 2 f 2y = 6 x+y = 1 3x 3y = 3 (a) (b) (4 z-cy x-y = 1 + Fig. 10-1 1. T h e system has exactly one solution. Here the lines corresponding to the linear equations intersect in one point as shown in Fig. 1 0 - l ( ~ above. ) 2. T h e system has n o solutions. Here the lines corresponding to the linear equations are parallel as shown in Fig. lO-l(b) above. 3. T h e systenz has an infinite number o f solictions. Here the lines corresponding t o the linear equations coincide as shown in Fig. 10-l(c) above. Now the special cases 2 and 3 can occur if and only if the coefficients of coefficients of y are proportional: bl - - - x and the b2 a2 There are then two possibilities: (i) If the constant terms of the equations are in the same proportion, i.e. - _ -b 1 - a, - Cl c2 b2 then the lines are coincident. Hence the system has an infinite number of solutions which correspond to the solutions of either equation. (ii) If the constant terms of the equations are not in the same proportion, i.e. then the lines are parallel. Example 2.1: a, _ -b1 f - a, b2 CZ C1 Hence the system has no solution. Consider t h e system + 6y 3% 2% + 4y = 9 = 6 Note t h a t 3/2 = 6/4 = 9/6. Hence t h e lines are coincident and the system has an infinite number of solutions which correspond t o t h e solutions of either equation. L I N E A R EQUATIONS 106 [CHAP. 11 Particular solutions of t h e first equation and hence of the system can be obtained as shown in Example 1.1. For example, substitute x = 1 in the first equation t o obtain 3*1+62/ = 9 or 3f6y = 9 6y = 6 or or y = 1 Thus x = 1 and y = 1 or, in other words, t h e pair of numbers a = (1,l) is a solution of the system. By substituting other values f o r x (or y) in either equation, we obtain other solutions of t h e system. Example 2.2: Consider the equations 3x + 62/ = 9 2x -t. 4y = 5 Note t h a t 312 = 614 but 312 = 614 # 915. Hence the lines are parallel and the system has no solution. On the other hand, if the coefficients of the unknowns are not proportional, i.e. a, bl f a2 b2 then case 1 occurs; that is, the system has a unique solution. This solution can be obtained by a process known as elimination, i.e. by reducing the equations in two unknowns to an equation in only one unknown. This is accomplished by: (i) multiplying each of the given equations by numbers such that the coefficients of one of the unknowns in the resulting equations are negatives of each other; (ii) adding the resulting equations. We illustrate this method in the following example. Example 2.3 : Consider the equations (1) 32 + 2y (2) 22 - = 8 5y = -1 Note t h a t 3 / 2 i .21-5, so the system h a s a unique solution. and ( 2 ) by -3 and then add in order to “eliminate” 2: 2 X (1): -3 X ( 2 ) : + 4y = - 6 ~+ 1571 = 6% 16 3 19y = 19 Addition: Multiply ( 1 ) by 2 or y =1 Substitute y = 1 in ( 1 ) to obtain 3x+2*1 = 8 or 3x+2 = 8 or 3% = 6 or 2 = 2 Thus x = 2 and y = 1 or, in other words, the pair 01 = ( 2 , l ) is the unique solution to t h e given system. GENERAL LINEAR EQUATION We now consider a linear equation in any arbitrary number of unknowns, say xl,x,, . . ., x I 1 . Such an equation is of the form a,x, a p , * . . a?txll= b + + + where a,,a,, . . .,a,,, b are real numbers. The numbers at are called the coefficients of xi, and b is called the constant of the equation. An n-tuple of real numbers (Y = (kl, k,, , , . ,k,), i.e. a n n-component row vector, is a solution of the above equation if, on substituting ki for xl,the statement a,k, a2k2 . - . a,kI1 = b + is true. We then say that N + + satisfies the equation. LINEAR EQUATIONS CHAP. 111 Example 3.1 : Consider t h e equation Now 107 = 3 x+2y-4x+w 01 = (3,2,1,0) is a solution of the equation since 3+2.2-4*1+0=3 or 3 + 4 - 4 + 0 = 3 or 3 = 3 is a t r u e statement. On the other hand, /3 = ( 1 , 1 , 2 , 2 ) is not a solution of the equation since 1+2*1-4*2+2 = 3 or 1+2-8+2 = 3 or -3 = 3 is not a t r u e statement. A linear equation is said to be d e g e n e m t e if the coefficients of the unknowns are all zero. There are two cases: (i) The constant is not zero, i.e. the equation is of the form 0x1 0x2 . . . + Oxn = b, with b i 0 Then there is no solution to the linear equation. + + (ii) The constant is also zero, i.e. the equation is of the form 0x1 0x2 * * * O X , = 0 Then every n-tuple of real numbers is a solution of the equation. + + + GENERAL SYSTEM O F LINEAR EQUATIONS A system of nz equations in n unknowns n.,, x2, . . .,xn is of the form a21 x1 + + XI + all x1 am1 U22XZ + + am2 ~2 + U1zX2 * ' ' * ' ' * ' + + + amnxn alnxn = Z;I ~ 2 n x n = bz = bm = (kl, k,, . . . , k J which where the ail, b, are real numbers. 4 n n-tuple of numbers satisfies all the equations is called a solution of the system. We first consider the special case where all the above equations are degenerate, i.e. where every a,, = 0. There are two cases: O( (i) If one of the constants b, # 0, i.e. the system has an equation of the form 0x1 0x2 . Ox, = b,, with b, # 0 then this equation, and hence the system, has no solution. + + -- + (ii) If every constant bt= 0, i.e. every equation in the system is of the form 0x1 0x2 . * Oxn = 0 then each equation, and hence the system, has every n-tuple of real numbers as a solution. + Example 4.1: The system + + ox ox + o y + o x + ow + oy + o x + ow Ox+Oy+Ox+Ow = 0 = 4 = -1 has no solution since one of the constants on the right is not zero. On t h e other hand, the system ox+ o y + o x + o w = ox + oy o x ow = ox oy o x ow = has every 4-tuple a = (kl, k,, k,,k,) as a solution + + + + + right a r e zero. 0 0 0 since all the constants on the 108 LINEAR EQUATIONS [CHAP. 11 In the usual case where the equations a r e not all degenerate, i.e. where one of the a,,#O, we reduce the system ( 1 ) to a simpler system which is equivalent to the original system, i.e. has the same solutions. This process of reduction, known as (Gauss) elimination, is as follows: (i) Interchange equations and the position of the unknowns so that all#O. (ii) Multiply the first equation by the appropriate non-zero constant so that a,,= 1. (iiij For each i > 1, multiply the first equation by -ai, and add i t to the ith equation so that the first unknown is eliminated. Then the system ( 1 ) is replaced by an equivalent system of the form XI . . . + al,x, bl aZ2x2 -+ a2353 + - . . + a&x,, = b; .................................... uLx.2 Example 4.2: (g3X3 + am353 + + a:;"& -+ + + * . * b: = am,5,, Consider the system 2 ~ + 6 y - x = 4 3x-22y- = 1 5% + 9y x - 22 = 12 4 so t h a t the system is replaced by Multiply the first equation by x+3y-*x 3%-22y- x + 9y - 5x 2x = 2 = 1 = 12 Multiply t h e new first equation by -3 and add it to the second equation so t h a t t h e system is replaced by x + 3 y - * x = 2 -1ly + 9y 52 + 32 = -5 - 2x = 12 Lastly multiply the first equation by -5 and add it to the third equation so t h a t the system is replaced by the following which is in the desired form: = x + 3 y - + z -1ly -6y Example 4.3: + + 42 = $2 = 2 -5 2 Consider the system of the preceding example. Here we reduce the system to a simpler form without first changing the leading coefficient to 1. This method has the advantage of minimizing the number of fractions appearing. Multiply the first equation by 3 and the second by -2 and then add t o elimin a t e x from the second equation: 3 X first: GX 1 8~ 3x = 12 -2 X second: -6.2: + + 4x1 + 22 = -2 22r/ - x = 10 Addition: Now multiply the first equation by 5 and the third equation by -2 to eliminate from t h e third equation: 5 -2 X first: X third: Addition: lox T 30y - 52 = -lox - 18y 12y 20 + 4x = -24 - = 2 -4 3 CHAP. 111 L I K E A R EQUATIONS 109 Thus the system is equivalent t o 2x + 61/ - x = 22y - 122) - 2 r+ 3u-3z= 4 10 z = -4 22y - z = 10 1 2 y - z = -4 or z = Continuing the above process, we obtain the following fundamental result: Theorem 11.1: Every system of m equations in n unknowns can be reduced t o an equivalent system of the following form: x1 f c12x2 C13s3 * ' ' f C ~ T f ~ T' ' ' ClnT,, = dl, ?" ??% x2 c.323 . . . czrx7 * . . c2,zxn = dz + + s j + + + ' + .+ c3rrr Xr + + + + * * * t C?,,Xn = dS + . . . + c77,xn= a, 0 = d T + l o = d , Remark: A system of linear equations in the above form, where the leading coefficient is not zero, is said to be in echelon form. The solutions of a system of equations in the echelon form ( 2 ) can easily be described and found. . . . , d m are not all zero, i.e. there is a n (i) Inconsisteizt Equutiotis. If the numbers equation of the form Oxr Oxz . . . Ox,, = dk, with d h = 0 + + + then the system is said to be inconsistent, and there a r e no solutions. (ii) Consistent Equations. If the numbers d,,,, . . . , d m a r e all zero, then the system is said to be consistent and there does exist a solution. There are two cases: ( a ) r = n , that is, there are as many non-zero equations as unknowns. Then we can successively solve uniquely for the unknowns xr, x r p l ,. . ., x, and so there exists a unique solution for the system. ( b ) r < n , that is, there are more unknowns then there are non-zero equations. Then we can arbitrarily assign values to the unknowns x,,,, , , .,xn and then solve uniquely f o r the unknowns xr, , . . , x,. L4ccordingly there exists a n infinite number of solutions. The following diagram shows the various cases: System of linear equations I Inconsistent I I I solution solution I L I N E A R EQUATIONS 110 Example 4.4: [CHAP. 11 Consider the system x + 2 y - 3x = 4 O x + y + 2x = 5 ox + Oy + ox x+2y-332 = 4 y+2x = 5 or = 3 0 = 3 Since the system has a n equation of the form 0 = c with c # 0, the system is inconsistent and has no solution. Example 4.5: ox+ y + 2 z = 5 y+2x = 5 z = 1 or Ox+Oy+ x = 1 o x oy 02 = 0 + + yt2x = 5 or x = l o = o Since there is no equation of the form 0 = c, with c f 0, t h e system is consistent. Furthermore, since there a r e three unknowns and three non-zero equations, the system h as a unique solution. Substituting x = 1 in the second equation we obtain y + 2 . 1 = 5 or y + 2 = 5 or y = 3 Substituting z = 1 and y = 3 in the first equation, we obtain = 4 xf2.3-3.1 or x + 6 - 3 = 4 or x + 3 = 4 or s = l Thus x = 1, y = 3 and x = 1 or, in other words, the ordered triple ( 1 , 3 , 1 ) is the unique solution of the system. Example 4.6: Consider the system z+2y-3x+ w = 4 02+ y + 2 2 + 3 w = 5 02 o y + O x + Ow = 0 2+2y-3x+ w = 4 y+2x+3w = 5 or + o = o The system is consistent, and since there a r e more unknowns tha n non-zero equations t h e system has a n infinite number of solutions. I n fact, we can arbitrarily give values to z and zu and solve for x and y. To obtain a particular solution substitute, say, x = 1 and w = 2 in the second equation to obtain or y+ 2*1+ 3*2 =5 y+2+6 = 5 or y + 8 = 5 or y = - 3 z = 1 and w = 2 in the first equation to obtain Substitute y = -3, x+2*(-3)-3*1+2 = 4 or = 4 or x - 7 = 4 x-6-3+2 o r x = 11 Thus x = 11, y = -3, x = 1 and w = 2 or, in other words, the row vector (11,-3,1,2) is a specific solution to the system. To obtain the general solution to the system substitute, say, z = a and zu = b in t h e second equation t o obtain y+2u+3b Substitute y = 5 - 2a - = 5-2a-3b or 3b, x = u and t*! = b into the first equation to obtain x+2(5-2a-3b)-3a+b or = 5 = 4 z+10-7u-5b or = 4 ~+10-4~-6b-33a+b = 4 or s = 7a+5b-6 Thus t h e general solution of the system is (7a + 5b - 6, 5 - 2a - 3b, a, b), where a and b are real numbers Frequently, the general solution is left in terms of z and w (instead of a and b ) as follows: or, ( 7 ~ + 5 ~ - 6 5, - 2 ~ - 3 ~ 2, , W) x = 7 2 f 5 %-~ 6 y = 5 - 2x - 3w 111 L I N E A R EQUATIONS CHAP. 111 The previous remarks give us the next theorem. Theorem 11.2: A system of linear equations belongs to exactly one of the following cases: 1. The system has no solution. 2 . The system has a unique solution. 3. The system has an infinite number of solutions. Recall that, in the special case of two equations in two unknowns, the above three cases correspond to the following cases described geometrically: 1. The two lines are parallel. 2. The two lines intersect in exactly one point. 3. The two lines are coincident. HOMOGENEOUS SYSTEMS OF EQUATIONS A system of linear equations is called homogeneous if it is of the form + + allxl a21 x1 u12x2 + ' * u22x2 -t ' ' ' . + u1nxn + uznx, = = 0 0 (3) . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . + + . + am, x n = 0 that is, if all the constants on the right are zero. By Theorem 11.1, the above system is equivalent to a system of the form am1 X1 + ClZX. X2 x1 + + * * * - . * urn222 * + C1,Zr + + + * . * C2,Xr * * * + + ClnXn = CznXn = 0 0, 7' L m o = o Clearly, the above system is consistent since the constants are all zero; in fact, the system trivial solution. It has a non-trivial solution only if r < n. Hence if we originally begin with fewer equations than unknowns in ( 3 ) , then T < n and so the system always has a non-trivial solution. That is, ( 3 ) always has the zero solution (0, 0 , , , . 0) which is called the Theorem 11.3: A homogeneous system of linear equations with more unknowns than equations has a non-zero solution. Example 5.1 : The homogeneous system x+zy-3x+ 2x+ w = 0 x-zw = 0 y-3x+5w = 0 x-sy+ has a non-trivial solution since there a r e more unknowns than equations. Example 5.2: We reduce the following system to echelon form: z+ yx = 0 x+ y - z = 0 2%-3y-t x = 0 x-4y+z2 = 0 to -5y -5y + 3x = 0 + 32 = 0 to x+yx = 0 y - " x5 = 0 o = o L I N E A R EQUATIONS 112 ‘CHAP. 11 The system has a non-trivial solution since we finally have only two non-zero equations in three unknowns. F o r example, choose r = 5 ; then y = 3 and TC = 2. In other words ( 2 , 3 , 5 1 is a particular solution. Example 5.3: We reduce the following system to echelon f o r m : x + u - r = 0 x+ y- r = 0 to 2y+r = 0 2x+4y2 = 0 -y-z = 0 3x 2y 22 = 0 + + to x + y 2 = 0 y++z = 0 -32 = 0 z = 0 UA*% = 0 r = O Since in echelon form there a r e three non-zero equations in t h e three unknowns, the system has a unique solution, the zero solution ( 0 , 0 , 0). z + y - to MATRICES AND LINEAR EQUATIONS The system ( 1 ) of linear equations is equivalent t o the matrix equation . . . . . .. . . . . . . . . . . am1 dm2 * * . amn in that every solution to (1) is a solution to the above matrix equation and vice versa. In other words, if A = ( a , , ) is the matrix of coefficients, X = (x,)is the column vector of unknowns, and B = ( b J is the column vector of constants, then the system ( I ) is equivalent to the simple matrix equation AX = B In particular, the homogeneous system of linear equations ( 3 ) is equivalent to the matrix equation AX = 0 Example 6.1: The following system of linear equations and the matrix equation a r e equivalent: 2 x2-+2 3y -y 5- 54 xx = 37 ’. (; -; I;)(:) = (;I In studying linear equations, it is usually simpler t o use the language and theory of matrices. Theorem 11.4: The general solution of a non-homogeneous system of linear equations A X = B is obtained by adding the general solution of the homogeneous system A X = 0 to a particular solution of the non-homogeneous system A X = B. Proof. Let a , be a fixed solution of A X = B. If p is any solution of the homogeneous system A X = 0, then A(a,+/3) = A a , + A P = B + O = B That is, the sum nl + p is a solution to the non-homogeneous system A X = B. On the other hand, suppose a, is a solution of A X = B distinct from a,. Then A(a2- a , ) = Aa, - Aa, = B - B = 0 That is, the difference N., - a1 is a solution of the homogeneous system A X = 0. But n2 = a, + ( a 2 - a,) hence every solution of A X = B can be obtained by adding a solution of AX=O to the particular solution a1 of A X = B. CHAP. 111 113 LINEAR EQUATIONS Theorem 11.5: If n l , a?, , . . , all a r e solutions to the homogeneous system of linear equations A X = 0, then every linear combination of the of the form k,a, + k p , + . . . + li,,a,, where the k Lare scalars, is also a solution of the homogeneous system AX = 0. Proof. We a r e given t h a t A(k,n,- . . . Accordingly, k,a, + ... Aa, = 0, . . ,, + k,{a,,) = T klAnl + . ila,, = 0. Hence . + k,Aa, = k,O + . . . + k,,O = 0 k p r l is also a solution of the homogeneous system A X = 0. In particular, every multiple ke of any solution a of A X = 0 is also a solution of AX = 0. Consider the homogeneous system in Example 5.2: Example 6.2: x = 0 y- .r+ 2.r - 3y + x = 0 = 0 .r-4y+2x = 1 2 , 3 , 5 ) is a solution of the above. Hence every multiple of a , As was noted, such a s ( 4 , 6 , 1 0 ) and i -6, -9, -15), is also a solution. (Y We now restate Theorem 11.2 using the language of matrix theory, and give an independent proof of the theorem. If A X = B has more than one solution, then it has an infinite number of solutions. Proof. Let a and 13 be distinct solutions to A X = B. Then the difference N - ,5 is a non-trivial solution to the homogeneous system A X = 0: A ( a - P ) =z An -- AP = B - B = 0 Theorem 11.2: Hence every multiple k ( ~ - p )of a - p scalar k, is a solution of A X = O . n + Accordingly, f o r each - p) k ( N Thus AX = B has a n infinite number of solutions as is a distinct solution to A X = B. claimed. r)+t) VECTORS AND LINEAR EQUATIONS The system ( I ) of linear equations is also equivalent to the vector equation + X1 x2 an,1 ... + = xs am2 bln &n In other words, if ( Y ~ , . . . , an and p denote the above column vectors respectively, then the system (1) is equivalent to the vector equation Xp, + Sp2 If the above equation has a solution, then o r is said to depend upon the vectors +. p ’ ’ A XnNn = p is said to be a linear combination of the vectors We restate this concept formally: The vector p is a linear combination of the vectors scalars k,, . . . , kll such that p = k l N l 4- k2a2+ * * . + k,a, al, . . . , N~ if there exist [CHAP. 11 L I N E A R EQUATIONS 114 i.e. if there exists a solution to p = where the Remark: X, X,a, + X2a2 + . . . + 5"an are unknown scalars. The above definition applies to both column vectors and row vectors, although our illustration was in terms of column vectors. ExampIe 7.1: Let I; /3= i), (ul = :-4 (i), a2 = Then /3 does depend upon the vectors a,, ue and + 7a2 p = -4q (a) a3 and a3 = ($ since - a3 t h a t is, the equation ( o r system) 2 = x+y+x 3 = x+y -4 = x has a solution ( - 4 , 7 , -1). :"u [I+. . VECTORS AND HOMOGENEOUS LINEAR EQUATIONS I[ The system ( 3 ) of homogeneous linear equations is equivalent to the vector equation XI + x2 + Xn f) = anlz ami Clmn That is, if a,, a2, . . . ,an denote the above column vectors respectively, then the homogeneous system ( 3 ) is equivaIent to the vector equation X,a1 + x2a2 + * . . + Xnafl = 0 If the above equation has a non-zero solution, then the vectors a l , . . .,an are said to be dependent; on the other hand, if the above equation has only the trivial (zero) solution, then the vectors are said to be independent. We restate this concept formally: 1 Definition: I The vectors al, a*, . . . , a l l are not all zero, such that kp, dependent if there exists scalars k,, k,, + kznz + * * ' +kp, . . . , kn, = 0 i.e. if there exists a non-trivial solution to Z,a, + X2a2 + where the xi are unknown scalars. in dependent. Remark: ' * + X,Ian = 0 Otherwise, the vectors are said to be The above definition applies to both column vectors and row vectors although our illustration was in terms of column vectors, LINEAR EQUATIONS CHAP. 111 Example 8.1: 115 The only solution to or :c+y+z .c+y x is the trivial solution s = 0, ?/ = 0 and x = 0. pendent. Example 8.2: = 0 = o = o Hence the three vectors a r e inde- The vector equation lor' system of linear equations) x+2y+ x = 0 z f 3 y t 3 x = 0 has the non-trivial solution ( 3 , - 2 , l ) . Accordingly, the three vectors a r e dependent. Solved Problems LINEAR EQUATIONS IN TWO UNKNOWNS 11.1. Determine three distinct solutions of 2n. - 3y = 14 and plot its graph. Choose any value for either unknown, say 2.(-2) - 3y = 14 Thus x = -2 and y = -6 or -4 - 2: = -2. 3u = 14 Substitute z = -2 into the equation t o obtain or -3y = 18 or y = -6 or, in other words, the pair (-2, -6) is a solution. Now substitute, say, z = 0 into the equation t o obtain 2.0 - 3y = 14 Thus (0, -14/3) = 14 -3u or or y = -14/3 is another solution. Lastly, substitute. say, y=O into the equation to obtain 2x-3.0 = 14 or 2x = 14 or z = 7 Hence (7,O) is still another solution. Now plot the three solutions on the Cartesian plane R2 as shown below; the line passing through these three points is the graph of the equation. t5 Y 116 LINEAR EQUATIONS [CHAP. 11 3x-2y = 7 x+2y = 1* 11.2. Solve the system: Since 311 # -212, the system has a unique solution. Now the coefficients of y a re already the negatives of each other: hence a d d both equations: 32 - 2y = 7 x+2y = 1 Addition: Substitute .r or = 8 4x .1:=2 = 2 into the second equation to obtain 2+2y = 1 2y = -1 or u or -4 = -$ -3) Thus :2 = 2 and y = or, in other words, the pair (Y = ( 2 , is the solution of the system. Check pour answer by substituting the solution back into both original equations: = 7 3*2-2*(-$) and 2+2*(-3) = 1 2-1 or 7 = 7 or = 1 1 = 1 or (I) 2 x + 5 y = 8 11.3. Solve the system: Note t h at 213 6+1 = 7 or ( 2 ) 3.2: - 2y = -7 hence the system ha s a unique solution. and then add: # 51-2: ( I ) by 3 and (2) by -2 3 -2 62 X (1): X (2): -6% + 15y + = 24 4~ = 14 19y = 38 Addition: To eliminate x, multiply y = 2 or Substitute y = 2 into one of the original equations, say ( I ) , t o obtain 2xl-5.2 = 8 23:+10 = 8 or 22 = -2 or Hence .c = --1 and y = 2 or, in other words, the pair ( - 1 , Z j x = -1 or is the unique solution to the system. Check your answer by substituting the solution back into both original equations: (1): 2*(--1) 4- 5 . 2 = 8 or -2 f 10 = 8 (2): 3.(-1)-2*2 = -7 or -3-4 = -7 8 = 8 or or -7 = -7 Multiply ( I ) by 2 and We could also solve the system by first eliminating y as follows. ( 2 ) by 5 and then add: 2 i< ( 1 ) : 4% 1Oy = 16 + Substitute 2 5 X (2): 15%- 1Oy = -35 Addition: 19x = -19 or x = -1 = -1 in ( 1 ) to obtain 2*(-1) Again we get (-1,Z) 11.4. Solve the system: + 5y = 8 or -2+5y = 8 or 6~ = 10 or y = 2 as a solution. (I) x -2y = 5 ( 2 ) -32 + 6y = -10' hence the system does not have a unique solution. But -2 5 -1 _ - -3 6 -10 Hence the lines a r e parallel and the system ha s no solution. Note t h a t 1/-3 = -216: +--- LINEAR EQUATIONS CHAP. 111 11.5. Solve the system: 117 (I) 5 x - 2 ~= 8 (2) 3 x + 4 y = 10 hence the sybtem has a unique solution. To eliminate y, multiply Note t h a t 5/3 f -2/4; ( 1 ) by 2 and add it t o ( 2 ) : 2 X (1): 10s - 4y = 16 32 (2): Addition: + 4y 13x = 10 or =26 x = 2 Substitute x = 2 in either original equation, say ( 2 ) , to obtain 3 . 2 + 4 y = 10 or 6 + 4y = 10 4y = 4 or or y = 1 Thus the pair 1 2 , l ) is t h e unique solution t o the system. Check the answer by substituting the solution back into both original equations: (1): 5.2-2-1 (2): 3.2+4.1 11.6. Solve the system: = 8 = 10 10-2 or or x-2v = 5 ( 2 ) - 3 ~ + 6 2 j = -15 = 8 or 6 + 4 = 10 or 8 = 8 10 = 10 (I) * Note t h at Accordingly, the two lines a r e coincident. Hence the system has an infinite number of solutions which correspond to the solutions of either equation. Particular solutions can be found as follows: Let y = 1 and substitute in ( 1 ) to obtain x - 2 * 1 = 5 or or x - 2 = 5 x = 7 Hence ( 7 , l ) is a particular solution. Let 11 = 2 and substitute in ( 1 ) t o obtain x - 2 * 2 = 5 or x - 4 = 5 or x = 9 Then ( 9 , 2 1 is another specific solution of the system. And so forth. The general solution to the system is obtained as follows. Let y = a and substitute in ( 1 ) t o obtain x-2y= 5 or z = 5 + 2 a Accordingly, the general solution to the system is (5 GENERAL SPSTERIS OF LINEAR EQUATIONS 11.7. Solve the following system in echelon form: x+2y-32+4w = 5 O x + y f 5 x I - 2 ~= 1 or + 2a, a ) , x where a is any real number. + zy - 32 + 4w ?/ + 5x +22u = 5 = 1 0 = 2 02+Oy+O2+Ow = 2 The system is inconsistent since i t has a n equation of the form 0 = c with c # 0. Hence the system has no solution. 11.8. Solve the following system in echelon form: x+3y-22X= 4 on:+ y - & = 2 or o x + o y + x = -1 ox o y + ox = 0 + 4 y-52= 2 x = -1 x+32j-22= o = 0 [CHAP. 11 LINEAR EQUATIONS 118 The system i s consistent since it has no equation of the form O = c with c f 0 ; hence the system has a solution. Furthermore, since there a r e three unknowns and three non-zero equations, the system has a unique solution. Substitute x = -1 into the second equation to obtain = 2 y-5*(-1) y+5 = 2 or y = or -3 Then substitute y = -3 and x = -1 into the first equation to obtain x+3.(-3)-2*(-1) = 4 or z - 9 + 2 = 4 o r 2 - 7 = 4 or x = 11 Thus x = 11, y = -3 and z = -1 or, in other words, the ordered triple (11,-3, -1) is the unique solution t o the system. 11.9. Solve the following system in echelon form: x+2y-32+4w ox x+2y-%+4w + + = 5 y +5 5 x +2 w = 1 = 5 or y +5x 2242 = 1 ox+oy+oz+ow = 0 o = o The system is consistent and so has a solution. Furthermore, since there a r e four unknowns and only two non-zero equations, the system h a s a n infinite number of solutions which can be obtained by assigning ar b i t r ar y values to the unknowns x and tu. To find the general solution to the system let, say, x = a and w = b, and substitute into the second equation to obtain y + 5 a + 2 b = 1 or y = 1 - 5 a - 2 b Now substitute x = a, w = b and y = 1- 5a - 2b into the first equation to obtain = 5 x+2(1-5a-2b)-3a+4b or z+2-10a-4b-3~+4b or x+2-13a=5 or = 5 x=3+13a Thus the general solution is (3 + 13a, 1 - 5a - 2b, a, b) where a and b a r e real numbers To find a particular solution t o the system let, say, a = 1 and b = 2 to obtain (3 + 13 * - 1, 1 - 5 1 - 2 * 2, 1, 2) or (16, -8, 1, 2) N o t e . Some texts leave the general solution in terms of x and w as follows: x = 3 + 13x = 1-5x-2w where z and w a r e a ny real numbers 132, 1- 5x - 2w,x , w) y (3 + z + ~ Y - ~= x-4 (2) 5x - 3y - 72 = 6 (I) 11.10. Solve the following system: (3) 3x-2y+32 . = 11 Reduce the system t o echelon form by first eliminating x from the second and third equations. Multiply (I) by -5 and add to ( 2 ) to eliminate z from the second equation: -5 x (I): -5x - 1oy + 202 5%- 3y - (2): Addition: -13y '72 + 13x = 20 = 6 = 26 or y -x = -2 Now multiply ( 1 ) by -3 and add to (3) to eliminate x from the third equation: -3 X (I): (3): Addition: - 3 ~- 6y t 1 2 2 = 12 3x - 2y + 32 = 11 -8y 4- 152 = 23 CHAP. 111 L I N E A R EQUATIONS 119 Thus t h e original system is equivalent to the system + 2y - x 4x = -4 x = - 2 y - -8y i152 = 23 Next multiply the second equation by 8 and add to the third equation to eliminate y from the third equation: 8 X second: 8y - 82 = -16 third: + 15x -8y Addition: = 23 or 7 7 x = x = l Thus the original system is equivalent t o the following system in echelon form: = -4 x t2y-4x z = -2 y-- x = 1 The system is consistent since i t has no equation of the form 0 = c with c # 0; hence the system has a solution. Furthermore, since there a r e three unknowns and also three non-zero equations, t h e system has a unique solution. Substitute z = 1 into t h e second equation to obtain = -2 y-1 y = -1 or Now substitute z = 1 and y = -1 into the first equation to obtain x+2.(-1)-4.1 = -4 or = -4 x-2-4 or 2-6 = -4 or z = 2 Thus x = 2 , y 2 -1 and z = 1 or, in other words, the ordered triple ( 2 , -1, 1) is the unique solution to the system. 11.11. Solve the following system: (1) +2y-% ( 2 ) -3x + = -1 y - 2x = -7 5 x 4-3y-42 = 2 (3) . Reduce to echelon form by first eliminating x from the second and third equations. Multiply ( 1 ) by 3 and add t o ( 2 ) to eliminate z from the second equation: 3 x 3s (1): -3x (2): + 6y - + Addition: y - 7y - 9~ = -3 2x = -7 112 = -10 Now multiply ( 1 ) by -5 and add to (3) to eliminate x from the third equation: -5 - 5 ~- 1Oy X (1): 52 (3): + Addition: + 1 5 ~= 5 3y - 4x = 2 + llx -7y = 7 Thus the system is equivalent to t h e system x + 2y - 7y - -7y 32 = -1 11z = -10 + 11z = 7 However, if we add the second equation t o the third equation, we obtain O x + Oy + Ox = -3 or 0 = -3 Thus the system is inconsistent since it gives rise to an equation 0 = c with c f 0; accordingly, the system has no solution. 120 'CHAP. 11 LINEAR EQUATIONS ~ + 2 y - 3 ~ = 6 (I) 11.12. Solve the following system: ( 2 ) 2x - y + 4x = 2 . (3) 4 x + 3 1 ~ - 2 = ~ 14 Reduce the system to echelon form by first eliminating :c from the second and third equations. Multiply ( 1 ) by -2 and add t o ( 2 ) ' -2 -2a - 4y f 6.2. = -12 X (1): 22 - y f 42 = (2): Addition: -5y Multiply ( 1 ) by -4 and add i t t o (3): -4 X (1): -4.2. (3): 42 - 8y 2 + 102 = -10 + 122 = -24 + 3y - 22 = y or y - 22 - 14 -5y f 102 = -10 Addition: 22 = 2 or = 2 Thus the system i s equivalent to x + 2 y - 3 ~ = 6 y-22 = 2 y-22 = 2 ~ f 2 u - 3 ~6 or simply y-22 = 2 ( S o t c . Since the second and third equations a r e identical, we can disregard one of them. I n fact, if we multiply the second equation by -1 and add to the third equation we obtain 0 = 0.) The system is now in echelon form. The system i s consistent and since there a r e three unknowns but only two non-zero equations, the system ha s a n infinite number of solutions which can be obtained by assigning ar b i t r ar y values to 2. To obtain the general solution t o the system let, say, z = a . second equation to obtain y-22a=2 or y = 2 + 2 a Then substitute z = a into the Now substitute z = a and y = 2 + 2a into the first equation t o obtain n+2(2+2a)-3a = 6 .?+4-44a-3~ = 6 or r + 4 + a = 6 or or x = 2-a Thus the general solution t o the system is (2 - a , 2 + 2a, a ) where a is any real number. To obtain a particular solution of the system let, say, a = l and substitute into the general solution t o obtain ( 2 - 1 , 2 + 2 . 1 , 1) o r (1, 4, 1). 11.13. Solve the system: + = 1 (2) 32 - 9 - 42 = 7 ( 3 ) 5~ + 2 1 ~- 62 1 5 (1) 2x 2/ - 3x . Method 1. Reduce to echelon form by first eliminating x from the second and third equations. Multiply ( I ) by -3 and (2) by 2 and then add to eliminate from the second equation: -3 X ( 1 ) : 2 X (2): -62 - 3y 6x + 92 - 22/- Addition: -5y + = -3 82 = 14 2 = 11 Multiply ( 1 ) by -5 and (5) by 2 and then add to eliminate z from the third equation: -5 X ( 1 ) : 2 x (3): Addition: -10% - 51/ + 152 = -5 10% i 4y - 122 = 10 -1/+ 32 = 5 or y - 32 = -5 CHAP. 111 LINEAR EQUATIONS 121 Thus the system is equivalent to the system 2.1- + y - 3x + x -5u y-3x = 1 1 22+2/-&= = 11 o r , interchanging equations, = -5 y-32 = -5 + x = 11 -5y Now niultiply the second equation by 5 and add to the third equation t o eliminate y from the third equation: 5 X second: 5y - 152 = -25 third: + -5y Addition: = x 11 = -14 -14x or = 1 x Thus the original system is equivalent t o the system + 23: - 3x = 1 y - 3 % = -5 1 x = Substitute z = 1 into t h e second equation to obtain = -5 y-3.1 Substitute 2 = 1 and y = - 2 22+(-2)-331 Thus 1: = 3, y = 1 or = -5 y-3 or y=--2 into the first equation to obtain or = 1 2x-2-3 or = 1 22-5 or 22 = 6 or z=3 = -2 and x = 1 is the unique solution to the system. Method 2. Consider y as the first unknown and eliminate y from the second and third equation. ( 1 ) to ( 2 ) to obtain 5 ~ - 7 2= 8 Add Multiply ( 1 ) by -2 and add to (3): -2 -4% X (I): - 5x (3): Addition: 2y + 62 + 2y - 62 x = -2 = 5 = 3 Thus the system is equivalent to 22+y-3x Substitute 1: = y - 3 2 + 2 ~ = 1 = 1 5% - 7x = 8 X = 3 or -7% + 5~ = 8 r = 3 3 into the second equation t o obtain -72+15 = 8 or -72 = -7 or x = 1 Substitute x = 3 and z = 1 into the first equation to obtain y - 3 + 6 = 1 Thus N or y + 3 = 1 or y = - 2 = 3, y = -2 and z = 1 is the unique solution of the system. VECTORS AND LINEAR EQUATIONS 11.14. Reduce the following vector equation to a n equivalent system of linear equations and solve: LINEAR EQUATIONS 122 [CHAP. 11 Multiply the vectors on the right by the unknown scalars and then add: Set the corresponding components of the vectors equal to each other to form the system x + y + x = 2 x + y =-4 X = 7 2 z T y + x = y + x = -4 or 7 x = Substitute r = 7 into the second equation to obtain y f 7 Substitute -c = -4 y = -11 or = 7 and y = -11 into the first equation to obtain x - l 1 + 7 = 2 or or x - 4 = 2 z = 6 Thus x = 7, y = -11 and x = 6 is the solution of the vector equation. 11.15. Write the vector v = (1,-2,5) as a linear combination of the vectors e l = (1,1,l), e, = ( 1 , 2 , 3 ) and e3 = (2, -1,l). We wish t o express ZI as v = x e l (1, -2, 5) + ye, + xe3, with x, y and z a s yet unknown. Thus we require: + y ( l , 2 , 3) + 4 2 , -1, x, x) + (Y, 2Y, 3Y) + (22, = x(1, 1, 1) = (2, = (x+y+2x, x+2y-x, -2, 1) 4 xf3yiz) Hence s+ x+y+2x= 1 y-22= x+2y- x = -2 .L:+31/- x = y-32 or 2y- 5 1 = -3 z = 4 x f y + 2 x = or or 1 = -3 y-32 2 x = Substitute z = 2 into t h e second equation t o obtain = -3 y-3.2 or y-6 = -3 or y = 3 Substitute z = 2 and y = 3 into the first equation to obtain s+ 3+ 2. 2= 1 or u = -6e, Consequently, 11.16. Write 1 x + 3 + 4 = or \-5 Set I' (-%) or [-:I, (::) e2 = (1;) and e3 = as a linear combination of the e, using unknowns x, y and z: w {-:) = x f 2 + y (1;) + (-:) (-:) = x =-6 + 3e2 + 2e3 as a linear combination of el = 2' = x +7 = 1 + f %el+ y e 2 (-5) ( = [-:I. +~ e 3 . z+2y+z 2 x - Y f 72 -32 - 4y - 52 ) CHAP. 111 123 LINEAR EQUATIONS Hence = 2 y+7%= 3 x t 2 y f 22- -3X - 41/ - 2 52 x+2y+ = -1 = 1 -5y+5x = -5 2 x = z+2y+ or 2 y - 22 or 2 x = g y - z = -5y+5x = -1 z t 2 y + x = 2 or = 1 y--x o = " The system is inconsistent and so has no solution. Accordingly, u cannot be written a s a linear combination of the vectors e l , e2 and e 3 . DEPENDENCE AND INDEPENDENCE 11.17. Determine whether the vectors il,1,-l), (2, -3,l) and (8, - 7 , l ) are dependent or independent. F i r st set a linear combination of the vectors equal to the zero vector: (0, 0, 0) = ~ ( 11, , -1) + ~ ( 2-3, , 1) + ~ ( 8-7, , 1) Then reduce the vector equation to an equivalent system of homogeneous linear equations: (0, 0, 0) = ( r ,2, - x ) = (X + ( 2 ~-,3 ~ ,II) + (82, - 7 2 , + 2u 4- 8x, x - 3y - 72, + y + -2 Z) X) r+2y+8x = 0 2-3y-7x = 0 -x+ y+ 2 = 0 01, Lastly, reduce the system to echelon form. Eliminate z+2y+ 3y + from the second and third equation to obtain r+2y+8x = 0 y+32 = 0 8x = 0 -5y - 152 = 0 5 or o = o y+3x = 0 92 = 0 = 0 y+3x = 0 z+2y+8z or Since there a r e three unknowns but only two non-zero equations, the system ha s a non-trivial solution. Thus the original vectors ar e dependent. Remark: We do not need to solve the system to determine dependence or independence; we only need to know if a non-zero solution exists. 11.18. Determine whether the vectors (-:I, [:) and \-3 are dependent or independent. \-1 F i r st set a linear combination of the vectors equal to the zero vector; and then obtain a n equivalent system of homogeneous linear equations: or -u/ -32 -1 xi2y+3x = 0 I -22 -32- + 3y + 2x y + 2 = 0 = 0 Eliminate z f r o m the second and third equations to obtain z + 2 y + 32 = 0 x+2y+32 = 0 + 82 = 5y + 10% = 7y 0 0 -3% - +Z 124 [CHAP. 11 LINEAR EQUATIONS Eliminate y from the third equation to obtain z+2y+3x = 0 y+22 = 0 -62 = 0 sf2yi-32 = 0 = 0 z = o or y+2x In echelon form the system has three unknowns and three non-zero equations; hence the system has the unique solution x = O , y = 0, z = 0, i.e. the zero solution. Hence the original vectors a re independent. Reduce the problem to a homogeneous system of equations by setting a linear combination of the vectors equal t o the zero vector: + + y 32 f 2v - 8~ -2% 4- 2y - 72 f UJ 32 y - 6v - 7~ -42 5y 22 - 5 v 4w x + + + X+ + y + 3 2 + 2 ~ - 8 ~= 0 -22+2y-72 3 ~ +y -42f $. W = 0 - 6 ~ - ' 7 ~= 0 52/+22-55v+4w = 0 Now the system of homogeneous equations ha s five unknowns but only four equations. Hence, by Theorem 11.3, the system has a non-trivial solution and so the vectors a r e dependent. 11.20. Let vl,v2, . . .,vm be n-component vectors. Show that if the number of vectors is greater than the number of components, i.e. m > 12, then the vectors are dependent. Note, as in the preceding problem, t h a t the vector equation x1v1 + 22v2 + ... + X m W m = 0 gives rise t o a homogeneous system of linear equations in which the number of unknowns equals the number of vectors and the number of equations equals the number of components. Hence if the number of vectors is greater than the number of components, then the system ha s a non-zero solution by Theorem 11.3 and so the vectors a r e dependent. CHAP. 111 LINEAR EQUATIONS 125 Supplementary Problems LINEAR EQUATIONS IN TWO UNKNOWNS 11.21. Solve: 22 - 5y = 1 32 2y = 11 + 11.22. Solve: 22 32 11.23. + 4y + 6y 2s 5s (ii) = 10 (ii) = 15 { + 3y + 7y = 1 = 3 (iii) 4s - 2y = 5 3y = 1 -62 + (iii) 32 2 i - 2y = 7 = -16 + 31/ 22 - 3y = 5t 32 y = 2t + Solve: GENERAL SYSTEMS OF LINEAR EQUATIONS 11.24. Solve: 22+ y-3% = 5 32 - 2y 22 = 5 5 2 - 3y - z = 16 + 11.25. Solve: + 3y - 2x = 5 2y 3% = 2 42- y+4x = 1 22 z + - x + 2 y + 3x = 3 2x 3y 8x = 4 3x 2y 172 = 1 + + + + + + 22 3y = 3 2-2y = 5 32 2y = 7 11.26. Solve: z+2y+2x = 32-2y2 = (i) 22 - 5y 3% = x+4y+62 = + 2 5 -4 0 Z + 5 y + 4 ~ -1 3 = ~ 3 (ii) 32 - y 22 5w = 2 4w = 1 22+2y+3x- + + VECTORS AND EQUATIONS 11.27. Write v = (9, 5, 7) as a linear combination of the vectors el = (1,1,1), e2 = (1,-1,0) and e3 = (2,0,1). 11.28. Determine whether the vectors (1, -2, 3, l), (3, 2, 1, -2) independent. 11.29. Determine whether the vectors 11.30. Prove: If the vectors el, e2 and e3 are independent, then any vector v can be written in a t most one way as a linear combination of the vectors e l , e2 and e3. (-i), (i) (-!) and and (1, 6, -5, -4) are dependent o r are dependent or independent. [CHAP. 11 LINEAR EQUATIONS 126 Answers to Supplementary Problems (ii) ( 2 , -l), o r x = 2 , y = -1 =3, y =1 11.21. (i) 13, 11, o r 11.22. (ii (5 - 2 a , a), or x = 5 - 2 a , y = a 11.23. (ii 16, 8) 11.24. (i) (1, - 3 , -2) 11.25. (ii (3, -1) 5 [ii) (5, 2 ) (ii) (-a+ 2b, 1 11.27. w = 2e, - 3e2 11.28. Dependent 11.29. Independent 21.30. Proof: (iii) (-1 - 7% 2 (ii) no solution (ii (2, 1, -1) (iii) ( t , -t), iii) no solution or z = -1, y = -5 or z = t , y = --t (iii) ( 5 , 2 ) + + 2a, a ) , o r x = -1-77x y=2+2x z = -x+2w 2u - 2 b , a , b ) , o r (iii) (6 - 2a - 5b, a, b, 2b - l), or 11.26. (iii) (-1, -5), y=1+2x-2w 6-2y-52 <(w5 = =22-1 (ii) no solution + 5e3 Suppose w = alel + a2e2+ u3e3 0 = w - and w = b,e, v = (al - bl)el + b,ez + b3e3. Subtracting, we obtain + (a2- b,)e, + (a3- ba)ea Since the vectors a r e independent, the coefficients must be zero, i.e. al - b , = 0 , a, Accordingly, a, = b,, u2= b , and u3 = b,. - b2 = 0 and u3 - b3 = 0 Chapter 12 Determinants of Order Two and Three INTRODUCTION To every square matrix there is assigned a specific number called the determinant of the matrix. This determinant function, first discovered in the investigation of systems of linear equations, has many interesting properties in its own right. In this chapter we define and investigate the determinant of square matrices of order two and three, including its application to linear equations. Determinants of matrices of higher order are beyond the scope of this text. DETERMINANTS OF ORDER ONE Write det(A) or ' A (for the determinant of the matrix A ; it is a number assigned to square matrices only. The determinant of a 1 x 1 matrix ( a ) is the number a itself: det(a) = a. Observe that in the linear equation in one unknown x, ax = b we can view a as a 1 x 1 matrix (a) applied to the one-component vector x. If det(a) # 0, 1 However, if det(a) = 0, i.e. i.e. if a # 0, then the equation has the unique solution x = ;b. if a = 0, then the equation has no solution if b f 0, and every number is a solution if b = 0; hence the solution is not unique. As we shall subsequently see, the preceding result holds true in general, i.e. det(A) # 0 is a necessary and sufficient condition f o r the linear equation A x = b to have a unique solution. w DETERMINANTS OF ORDER TWO The determinant of the 2 x 2 matrix (: :) is denoted and defined as follows: Example 1.1: We emphasize that a square array of numbers enclosed by straight lines is not a matrix but denotes the number that the determinant function assigns to the enclosed array of numbers, i.e. t o the enclosed square matrix. 127 DETERMINANTS O F ORDER TWO AND T H R E E 128 [CHAP. 12 A very important property of the determinant function is that i t is multiplicative; that is, Theorem 12.1: Remark: The determinant of a product of matrices is the product of the determinants of the matrices: det(AB) = det(A) det(B). The diagram on the right may help the reader obtain the determinant of a 2 x 2 matrix. Here the arrow slanting downward connects the two numbers in the plus term of the determinant ad - be, and the arrow slanting upward connects the two numbers in the minus term of the determinant ad - be. (X - + LINEAR EQUATIONS IN TWO UNKNOWNS AND DETERMINANTS Consider two linear equations in two unknowns: a1x a2,r + + bey b1y = c1 = cz Let us solve the system by eliminating y. Multiply the first equation by b, and the second equation by -bl, and then add: bp -bl X first: x second: albzx -arblx Addition: + b1bey = - blb2y = -blcz (a1bzra2b1)x = b2C1 b2C1 - blcz Now the system has a unique solution if and only if the coefficient of x in this last equation is not zero. i.e. Observe that D is the determinant of the matrix of coefficients of the system of equations. In this case, where D # 0, we can uniquely solve for x and y as quotients of determinants as follows: Here D, the determinant of the matrix of coefficients, appears in the denominator of both quotients. The numerators N Z and N y of the quotients for x and y, respectively, can be obtained by substituting the column of constant terms in place of the column of coefficients of the given unknown in the matrix of coefficients. Example 2.1: Solve using determinants: 2x - 3y = 7 3x 5y = 1 + The determinant D of the matrix of coefficients is 51 -3 D = ) 3 = 2.5 - 3.(-3) = 10 f 9 = 19 CHAP. 12) 129 D E T E R M I N A N T S O F ORDER TWO AND T H R E E Since D f 0, t h e system has a unique solution. To obtain t h e numerator LV, replace, in t h e matrix of coefficients, t h e coefficients of x by t h e constant terms: To obtain t h e numerator N, replace, in the matrix of coefficients, t h e coefficients of y by the constant terms: l3 2 = = 2.1 7 - 3.7 = 2 - 21 = -19 Thus the unique solution of the system is Example 2.2: Solve using determinants: ,2x = 5 + y \I 3+2y+3% = 0 F i r s t a r r a n g e the equations in standard form: 22- y = 5 3 x f 2 y = -3 The determinant D of the matrix of coefficients is Since D # 0, the system has a unique solution. Now, = 5 . 2 - (-3)(-1) NZ = = 10 - 3 = 7 Thus the unique solution of the system is Example 2.3: Solve using determinants: {::I:; 1 ; The determinant D of the matrix of coefficients is = 13 -' -61 = 2-(-6) - 3-(-4) = -12 + 12 = 0 Since D = 0, t h e system does not have a unique solution, and we cannot solve the system by determinants. (The discussion of the previous chapter shows t h a t the system h a s no solution since 2/3 = -41-6 f 7/5.) a1 bl c1 a2 b2 CP a3 bs c3 = alb2c3 + a2bscl + a3blcZ - alb3c2 - aZblc3 -- a3bZcl [CHAP. 1 2 DETERMINANTS O F ORDER TWO AND T H R E E 130 or which is a linear combination of three determinants of order two whose coefficients (with alternating signs) are the first row of the given matrix. Note that each 2 x 2 matrix can be obtained by deleting, in the original matrix, the row and column containing its coefficient: a1 + = 2(-20+2) - 3(0-2) - 4(0+4) = 0 (-4) -36 -4 + 6 - 16 = -46 -24 - 14 72 = Example 3.2: ' Remark: = 2(-12-00) - (10+4) + = 3(0-24) - -110 Problems 12.8 and 12.18 show that the determinant of a 3 x 3 matrix can be expressed as a linear combination of three determinants of order two with coefficients from any row or from any column, and not just from the first row as exhibited here. D = a1 bl c1 a2 b2 cz a3 b3 c3 # 0 131 D E T E R M I N A N T S O F ORDER TWO AND T H R E E CHAP. 121 In this case, the unique solution of the system can be expressed as quotients of determinants, bl d:! bz ds b3 dl N, 1 c1 dl dz d3 a1 , cz NU = a2 c3 a3 c1 CZ dt dz bi , N, = c3 ~ uz bz a3 u1 b3 d3 Example 4.1: Solve the following system by determinants: 2r+ x = 3 y- u f x = 1 2 + = 4 2-22y-3x The determinant D of t h e matrix of coefficients is obtained as follows: = 2(-3+2) - 1(-3-1) Since D # 0, the system h as a unique solution. f o r x, y and x respectively: N, = 1; 3 1 -1 11 -3 - N?, = = 2(-3-4) - 3(-3-1) N, = 2(4+2)-1(4-1)+3(-2-1) -2 = + -2 4 + = 3 5 We evaluate S,, S, and N,, the numerators = 3(-3+2) 1 = - l(-2-1) 1(-3-4) - = -3 1(-2-4) = -14 - l(4-1) = +7+6 = 10 + 12 - 3 = -5 1 2 - 3 - 9 = 0 Thus the unique solution is w, 2 = - - - D - 10 = 2 , 5 = -N = , - = -5 D 5 -1, x = ? N= - = O o D 5 INVERTIBLE MATRICES A square matrix A is said to be invertible if there exists a matrix B with the property that A B = B A = I , the identitymatrix Such a matrix B is unique; f o r AB1= B1A = I and ABz = BZA = Z implies B I = BIZ = B I ( A B P=) (B1A)BZ= IBa B2. 132 DETERMINANTS O F ORDER TWO AND T H R E E [CHAP. 12 We call such a matrix B the iwaerse of A and denote it by A-l. Observe that the above relation is symmetric; that is, if B is the inverse of A , then A is also the inverse of B. Example 5.1: 3 + -5 2) = 2.3 S*(-l) (1*3+3.(-1) 3 - 5 2 5 (-1 2)(1 3) = 3.2 (-5)al ((-1)*2+2.1 (f :)(-1 + 2*(-5) 5.2 1*(-5)+3*2) + = 1 (0 = (i + 3.5 (-5)*3 (-1)*5+2.3) 0 1) It is known that AB = Z if and only if BA = Z (Problem 12.17); hence it is necessary to test only one product to determine whether two given matrices are inverses, as in the next example. Example 5.2: 0 6 -1 1*(-11)+0*(-4)+2.6 2.(-11) (-1)*(-4) 3.6 4*(-11)f 1.(-4)+8.6 + + 1.2+0*0+2.(-1) (-l).O 3*(-1) 2.2 4*2+1*0+8.(-1) + + 1*2+0.1+2*(-1) 2.2 (-1)'l 3*(-1) 4*2+1*1+8*(-1) + + Thus the two matrices a r e invertible and a r e inverses of each other. Example 5.3: Find the inverse of (i . We seek scalars x,y, z and w f o r which y+2w = 0 3y+4w = 1 o r which satisfy The solutions of the systems a r e given matrix is (pi-:) . 2: = -2, z = 312 and y = 1, u)= -112. Thus the inverse of the We test our answer: INVERTIBLE MATRICES AND DETERMINANTS We now calculate the inverse of a general 2 X 2 matrix A = x , y , x and w such that ax + bx ex + dx (: i ). We seek scalars bw + dw a y f cy whjch reduces t o solving the following two systems of linear equations in two unknowns: ax + bx = 1 cx + dx = 0 ay+bw = 0 cy+dw = 1 CHAP. 121 DETERMINANTS OF ORDER TWO AND T H R E E 133 Note first that one equation in each system is homogeneous and has (0,O)as a solution, whereas the other equation does not have (0,O) as a solution. In other words, the lines in each system are not coincident and so each system has either a unique solution or no solution. Furthermore, the matrix of coefficients in each system is identical to the original matrix A . Accordingly, the systems have a solution (and so A has an inverse) if and only if the determinant of A is not zero. We formally state this result (which holds for matrices of all orders): Theorem 12.2: A matrix is invertible if and only if its determinant is not zero. Usually a matrix is said to be singular if its determinant is zero, and nonsingular otherwise. Hence the preceding theorem can be restated as follows: Theorem 12.3: A matrix is invertible if and only if it is nonsingular. Now if the determinant of the above matrix A is not zero, then we can uniquely solve for the unknowns 2 , y, x and w as follows: In other words, we can obtain the inverse of a 2 x 2 matrix, with determinant not zero, by (i) interchanging the elements on the main diagonal, (ii) taking the negative of the other elements, and (iii) dividing each element by the determinant of the original matrix. Example 6.1: Find the inverse of the matrix A = (i i ) . Now det(A) = IAI = 2 * 5 - 4 3 = 10 - 12 = -2. Hence the matrix does have a n inverse. To obtain A-1, interchange the elements 2 and 5 on the main diagonal and take the negative of the other elements to obtain the matrix (-: -;I Now divide each element by -2, the determinant of A , t o obtain A-I: Solved Problems DETERMINANTS OF ORDER TWO 12.1. Compute the determinant of each matrix: 134 [CHAP. 12 D E T E R M I N A N T S O F O R D E R TWO A N D T H R E E 12.2. Solve for R' and y, using determinants: 2x+ y = 7 (i, 3x-5y = 4 12.3. Determine those values of k for which Hence 2k=O is zero. ax-2by (ii) o r k = 0 , and k - 2 3ax-5by 1: = c = 2c , where a b # 0 ,",=o* = 0 or k = 2 . T h a t is, if k = O or k = 2 , the determinant 12.4. Prove Theorem 12.1 (in the case of 2 x 2 matrices): The determinant of a product of matrices is the product of the determinants of the matrices, i.e. det(AB) = det(A) det(B). Let A = \ and B = det(AB) = (ar \t s\ Zt/ ' andso AB = + b t ) ( c s + d u ) - (as -t bu)(cr+ d t ) + adrzc. + bcst + bdtu - acrs adst - bcru = acrs = adm and Thus - + bcst - bdtu a d s t - bcru det(A) det(B) = ( a d - b c ) ( ~ u - s t ) = d N A B ) = det(A) det(B). - adrzc - adst - bcru + best DETERMINANTS O F ORDER TWO AND TH R EE CHAP. 121 135 DETERMINANTS OF ORDER THREE 12.5. Find the determinant of each matrix: 1 (i) 4 0 2 -2 5 3 3 -1 11 = $ 3 = l(2-15) - 2(-4+0) = 4(2+6) + = 2(10-6) 12.6. Solve, using determinants: 1 4 0 l(0 + + 3(20+O) + 15) - 2(0-10) 3(5-3) + 4(-2+2) -2 5 + 8 + 60 = -13 + = 32 2 = 8 15 + +6+ 20 0 = 55 1 67 = 14 3y+2x = x + l 3% 2 2 = 8 - 5y 32-1 x x - 2 ~ + F i r s t arrange the equation in standard f o r m with the unknowns appearing in columns: 2.1:+3y- 2 = 1 3% 5y 22 = 8 :c - 2y - 32 = -1 + + Compute the determinant D of the matrix of coefficients: D -1 2 -3 3 5 = 1 -2 1 = 2(-15+4) - 3(-9-2) - (-6-5) = -22 + 33 + 11 = 22 Since D # 0, the system h a s a unique solution, To compute N,, N, and N,, replace the coefficients of the unknown in the matrix of coefficients by t h e constant terms: N, = 3 5 = ~ = 1 ( - 1 5 r 4 ) - 3(-24+2) - 1(-16*5) = -11 ~ -1 N, -1 2 -2 -3 12 1 -1 3 8 2 1 -1 -3 ~ 1 = 2(-24+2) - 1(-9 -2) - 1(-3-8) = -44 + 66 + 11 + 11 + 11 = 66 = -22 136 DETERMINANTS O F ORDER TWO AND T H R E E N, = I 3 5 1 -2 -1 1 - = 2(-5+16) 12.7. Consider the 3 x 3 matrix ::). (Et !: a3 3(-3-8) b3 + 1(-6-5) = 22 [CHAP. 12 + 33 - 11 = 44 Show that the diagram appearing below, c3 where the first two columns are rewritten to the right of the matrix, can be used to obtain the determinant of the given matrix: - - + + + - Form the product of each of t,he three numbers joined by an arrow slanting downward, and precede each product by a plus sign as follows: + albZc3 + blcza3 + Clazb, Now form the product of each of the three numbers joined by an arrow slanting upward, and precede each product by a minus sign as follows: - b l c1 a2 b2 c2 a3 b3 c3 a1 ' = alh,c3 a,b2cl + - b3c2al - c3a2b, + cla2b3 - a3b2cl - h1~2a3 b3c2al - c3a2bl 12.8. We represented the determinant of a 3 x 3 matrix A as a linear combination of determinants of order two with coefficients from the first row of A . Show that det(A) can also be represented as a linear combination of determinants of order two with coefficients from (i) the second row of A, (ii) the third row of A . (i) (ii) bl c1 a2 b2 c2 a1 a3 b3 c3 i = alb2c3 + u2h3cl + a3blc2 - alb3c, - a 2 h l c 3 = - ~ Z ( b , c 3 - b 3 ~ 1 )+ b Z ( ~ 1 ~ 3~ 3 ~ 1-) - a3b2c, cz(alb3 - aab,) CHAP. 12j DETERMINANTS O F ORDER TWO AND THREE 137 Observe t h a t the 2 X 2 matrix accompanying any coefficient can be obtained by deleting, in the original matrix, the row and column containing the coefficient. Furthermore, the signs accompanying the coefficients form a checkerboard pattern in t h e original matrix: Remayk: We can also expand the above determinant in terms of the elements of any column in a n analogous way (Problem 12.18). INVERTIBLE MATRICES 12.9. Find the inverse of (i 35) . Method 1. We seek scalars 2, y, x and w f o r which (; :)(: :) (; ;) = (322 x ++ 5232 Or 3y 2y + 5w + 3w 1 = (: ;) o r which satisfy + 52 + 32 32 22 = 1 = 0 { and 3 y + 5w = 0 2y+3w = 1 To solve the first system, multiply the first equation by 2 and the second equation by -3 and then add: 2 X first: -3 X 62 + 102 = 2 -62 - 9x = 0 second: Addition: x = 2 Substitute z = 2 into t h e first equation t o obtain 3x+5*2 = 1 32+10 = 1 or or 32 = -9 z = -3 or To solve the second system, multiply the first equation by 2 and the second equation by -3 then add: 2 X first: 6y low = 0 and + -3 X second: -6y - 9w = -3 Addition: Substitute ?L' w = -3 = -3 in the first equation to obtain 3 y f 5*(-3) = 0 or 3 y - 15 = 0 or 31,' = 15 or y = 5 Thus the inverse of the given matrix is Method 2. We use the general formula f o r the inverse of a 2 X 2 matrix. F i r s t find t h e determinant of the given matrix: [CHAP. 12 DETERMINANTS O F ORDER TWO AND T H R E E 138 Kow interchange the elements on the main diagonal of the given matrix and take the negative of the other elements to obtain (-2” -35) Lastly. divide each element of this matrix by the determinant of the given matrix, t h a t is, by -1: (-; -35) The above is the required inverse. o r which satisfy I 2x1 f 3y1 - X i = 1 21 2y, 2, = 0 -71 - y1 32, = 0 + 2x2 + + 22 -22 + 3y2 + 2y2 + 22 - yg + 3x2 Xg = 0 = 1 = 0 2x3 x3 -23 + 3y3 + 2y3 + - 763 f 23 = 0 23 =Z 323 0 = 1 We solve each system by determinants. Observe t h a t the matrix of coefficients in each system is the original matrix. Its determinant D is D = 1 2 1 -1 3 2 -1 f ( = 2(6+1) - 3(3+1) - 1(-lf2) = 14 - 12 - 1 = 1 -1 Since D # 0, each system has a solution and so the given matrix has a n inverse. each of the three systems follows: x1 = 7, y1 = -4, x1 = 1; x2 = -8, Thus the inverse of t h e given matrix is 2/2 = 5, zg = -1; xg = 5, y3 = -3, The solution of 23 1 1 CHAP. 121 DETERMINANTS OF ORDER TWO AND T H R E E 139 Supplementary Problems 12.11. Find the determinant of each matrix: 12.12. Solve each system using determinants: (32 (i) 1 4 x + 5y - = 8 2y = 1 2 2 - 3 y = -1 42 7y = -1 + ( ii) 5x = 2y - 7 3y = 4 4z + (iii) 12.13. Find t h e determinant of each matrix: 1 3 -2 -4 -2 -1 1 -3 12.14. Solve each system using determinants: + 2 2 f 3 = yJ-322 2%- 5y 22 = 7 z 2y - 42 = 3 3x - 4y - 6 2 = 5 + (ii) 3 y + 2 = 2-222 12.15. Find the inverse of each matrix: 12.16. Find the inverse of [-: 4 (i) , (; i) (ii) ( fSote: 2 -3 3) There will be 9 unknowns.) -2 12.17. Prove: A B = Z if and only if B A = I , where A and B a r e square matrices and I is the identity matrix. 12.18. Express the determinant of a 3 X 3 matrix as a linear combination of determinants of order two with coefficients from ti) t h e first column, (ii) the second column, (iii) the third column. Answers to Supplementary Problems 12.11. (i) -18, iii) -15, 12.12. 21 29 (i) x = -, y =26 26 (iii) 8, (iv) 1, (v) 44, 5 y =1 (ii) x = - 13' 13 (vi) -57 (iii) 13 Y = -8 = -7 ' 7 140 D E T E R M I N A N T S O F ORDER TWO AND T H R E E 12.13. (i) 21, (ii) -11, (iii) 100, (iv) 0 12.14. ii) x = 5 , y = 1, x = 1 (ii) z = z + y, y = -2. The determinant of this system = 0. [CHAP. 12 13 Chapter The Binomial Coefficients and Theorem FACTORIAL NOTATION The product of the positive integers from 1 to n inclusive is denoted by n ! (read ‘‘R factorial”): n! = 1.2.3.....(n-2)(n-l)n We can also define n ! recursively as follows: and l! = 1 n! = n - ( n - l ) ! It is also convenient to define O! = 1. Example 1.1: 2! = 1.2 = 2, 3 ! = 1 0 2 . 3 = 6, 5 ! = 5 * 4 ! = 5.24 B! _ - Example 1.2: 8.7*6! 6! 6! = 120, 1) = -T+ no?- 1)’‘ ‘(7%- r 6 ! = 6 * 5 ! = 6.120 = 8 . 7 = 56 12.11.10 + T Example 1.3: ?Z(n - 1). * *(?Z 4! = 1 . 2 . 3 . 4 n(12 - 1). . .(n- T l ) ( n - r ) ( n ( n- ~ ) (-n r - q . a . 3 + 1) = n ( n - - l ) . . . ( n - r + l ) 1* 2 * 3 ’ . . ( r - l ) r 1 - = r! = = 24, = 720 12*11.10*9! 9! - 12! 9! n! - 1). * a 3 ’ 2 1 (n- r ) ! 2*1 n! -._ (n-T)! 1 T ! n! r!(n-v)! BINOMIAL COEFFICIENTS The symbol (:) (read “nC r ” ) , where r and n are positive integers with r defined as follows: (:) 6 n, is n(n- l ) ( n - 2) * - - ( n- r + 1) 1 . 2 . 3 . * * ( T - l)r = These numbers are called the binomial coe,tficients in view of the theorem in the following section. Example 2.1: Note that (:) has exactly r factors in both the numerator and the denominator. Observe, by Example 1.3, that n(n- 1)* - . (n- T 1) 1.2’3.*.(r-l)r + 141 - n! r ! (n - ?.) ! 142 T H E BINOMIAL C O E F F I C I E N T S A N D T H E O R E M But n - (R- r ) = 1'; Theorem 13.1: ( n [CHAP. 13 hence we have the following important relation: ") ( = or, in other words, if a t Z, = ?a then :I 1 Example 2.2: Compute ( l o ) . By definition, \ 7 On the other hand, 10 - 7 = 8 and so we can also compute (lo' as follows: \7/ Observe t h a t the second method saves space and time. Example 2.3: Compute 'I4) !\ 12 and Now 1 4 - 1 2 = 2 and 2 0 - 1 7 = 3; hence (20'l. 17 // 2 0 . 1 9 - 1 8 = 1140 1.2.3 Remark: and the fact that 0 ! = 1, we can extend the Using the above formula for definition of (a) (:) to the following cases: n! = O!n! = 1 and, in particular, O! = 1 O!O! BINOMIAL THEOREM The Binomial Theorem, which can be proved by mathematical induction, gives the general expression for the expansion of (a b)qt. + Theorem 13.2 (Binomial Theorem) : (a + b)" = + na"-'l, + n(n- 1) + 1.2a'Z,"-' ?2(72 - 1) ,2 + + anP2b2 ?2abn-l + b" n(n - l)(n- 2) a n - 3 b 3 + . . , 1e2,3 CHAP. 131 143 T H E BINOMIAL C O E F F I C I E N T S A N D THEOREM + The following properties of the expansion of (a b)" should be observed: (i) There are 11 + 1 terms. (ii) The sum of the exponents of a and b in every term is n . (iii) The exponents of n decrease term by term from n to 0; the exponents of b increase term by term from 0 to n. (iv) The coefficient of any term is (y,) where k is the exponent of either a or b. (This property follows from Theorem 13.1.) (v) The coefficients of terms equidistant from the ends are equal. Example 3.2: Expand (2.c + 3 y 3 5 and simplify. Observe t h a t a and b in the Blnom1al Theorem correspond in this case to 22 and 3y2 respectively. Thus 5.4 5.4 (2x+ 3y2)5 = ( 2 . r ) ; 5 ( 2 ~ ) ~ ( 3 y2 ) ~ *( 22 r ) ? ( 3 y ~ 1). 2~ ( 2 . ~ ) ' ( 3 y ~ ) ~ 5 ( 2 ~ ) ( 3 y ~ ) (~3 ~ / ~ ) ; = = + + + + 2 j . 2 + 5 . 2 4 ~ 4 . 3 ! / ~ + 10 * 2 3 r s * 3 2 y 4 + 1 0 . 2 2 x L . 3 3 y 6 + 5 * 2 ~ * 3 ~ + y 8351/l0 3 2 ~ :+ 2 4 0 x 4 ~ 2+ 7 2 0 . ~ ' ? ( + ~ 1 0 8 0 + ~ ~810.c$ ~ ~ - 2432/'O PASCAL'S TRIANGLE The coefficients of the successive powers of a + b can be arranged in a triangular a r r a y of numbers, called Pascal's triangle, as follows: + b)O = 1 a + a2 + 2ab a3 + 3a2b + 3ab' + b3 a4 t 4a3b + 6a2b2 + 4ab3 -C + 5a4b (a (a-L b)' = (a (a T b)3 = ( a + b)4 = (a + b)5 ( a + b)G = = a6 + b)' a5 - 6a5b = 1_ 1 1 b 1 1 b2 1 b4 + lOa3b2 + 10aQb3+ 5ub4 + b5 15a4b2 + 20a3b3 + 15a2b4 + 6ab5 + 1 1 b6 5 6 1 2 3 1 3 1 'g lo4 5 8 8 1 5 1 6 1 ............................ ................................................................ Pascal's triangle has the following interesting properties. (i) The first number and the last number in each row is 1. (ii) Every other number in the array can be obtained by adding the two numbers appearing directly above it. For example: 10 = 4 f 6 , 15 = 5 10, 20 = 1 0 f 10. + Now since the numbers appearing in Pascal's triangle are the binomial coefficients, we can rewrite the triangular a r r a y of numbers as follows: [CHAP. 13 T H E BINOMIAL C O E F F I C I E N T S A N D THEOREM 144 (30) 3 j, !,3 (:) (;) ' ( 0 " ) i') (;) (i' ................................................... (fi) (:) ( ; ) . $ 1 ) (:) ' . . (nill\l :": ... ();: ... l) ............................................................................ (nil) (%;I) In order t o establish the above property (ii) of Pascal's triangle, it is necessary to prove the following theorem. Theorem 13.3: ( ) (I/L1) + n+l = In the next example we prove a special case of this theorem; the general proof of Theorem 13.3 is given as a solved problem. Example 4.1: We prove (7) = '11 + (I;) /, which is Theorem 13.3 when n = l l and T = 7. Now 7 5 Multiply the first fraction by - and the second fraction by 5 to obtain the same denominator, 7 and then add: 7-11! 7*6!5! - + -7 5! -*151. !4 ! + 7-11! 5-11! 7 ! 5! = - 5'11! 7-11! m +7-! 5! (7+5)*11! 7! 5! - 1-2 - 1-1 ! 7 !5 ! - 12! 7! 5 ! The general proof of Theorem 13.3 is very similar t o the proof given above. MULTINOMIAL COEFFICIENTS Let n1,112, . . . . nr be non-negative integers such that n . . . . n, ?il + n2 + . . . + n, = n. is defined as follows: n nl, n2, . . . . n, i= n! nl!na! * . . n,! Then the CHAP. 131 145 THE BINOMIAL COEFFICIENTS A N D THEOREM These numbers are called the ?nultiiioniial coefficients in view of the following theorem which generalizes the binomial theorem. Theorem 13.4: n ( u ~ t a z f ~ ~ ~ t=a , ) ~ Example 5.1: ( 2 3 - 7! - 2 ! 3! 2! - = 210 7*6.5.4.3*2*1 2-1.3.2.1.2.1 - al"1aF.. . Example 5.2: Expand (a nl + n2 + n3 = -b A c ) ~ . Consider all triplets of non-negative integers (nl,nz,n,) for which 3: = a3 + 3a'b + 3a2c + Gnbc b3 + 3bzc + 3ub2 + c3 3ac2 + 3bc2 Solved Problems FACTORIAL 13.1. Compute 4 ! , 5!, 6!, 7! and 8!. 4! = 1 ' 2 ' 3 ' 4 5! = 1 . 2 . 3 . 4 - 5 = 5 - 2 4 = 120 = 5.4! 6! = 1 - 2 . 3 . 4 . 5 . 6 13.2. Compute: 7! = 7 * 6 ! = 7 . 7 2 0 = 5040 = 2.1 13 ! (i) -, 11! 8! = 8 * 7 ! = 8.5040 = 40,320 = 6 * 5 ! = 6.120 = 720 (ii) 7! m, 1 13.3. Write in terms of factorials: (i) 27.26, (ii) 14. 13. 1 2 , (iii) 5 6 . (i) 27.26 (ii) = 27*26*25! 25! - 3 25! ll! 11! 1 - 14! 14 * 1 3 * 12 1 4 * 1 3 * 12 * 11! 6 -_ 55! (iii) 56 = 5 _ 55! - 56! - - 55! 146 [CHAP. 13 THE BINORIIAL C O E F F I C I E N T S A N D THEOREM 13.4. Simplify: (i) 72 ! __ - (i) - I)! (J1 + I)! n! m, iiii 11 + (n 1)! ( n- 1)! (iv) ____ (n-I)!' (I2 2) ! * . n(n- l ) ( n - 2 ) . ' . 3 * 2 ' 1 - n (72 - 1)(n- 2 ) ' ' ' 3 * 2 * 1 + ___ (n-l)! (n+ l ) n ( n - 1 ) ( ) ~2 -) . * . 3 * 2 *1 - ( n + l ) * n = ?"+n (n-l)(?z-2).'~3'2*1 ( n T l ) ! - ( n + l )( n. n- . (l n) !- l ) ! or, simply, - ( n + 1) n = (71 (iii) -__ - _ - n(n- - ?L! or, simply, l)! (n-l)! - % ~~ (n-l)! - - m- iivi (n - 1) ! (n-l)! ( n + 2) ! (n + 2 ) ( n+ 1) ' IZ ~~ * (11 - - 1)! BINOMIAL COEFFICIENTS 13.5. Compute: (i) ( y), ' 121' , (ii) '1 4 + 1 (iii) /' +n 1 - - 1) * n (n 2)(n n2 123 + 3n2 + 2% (y). Recall t h a t there a r e as m a n y factors in the numerator a s in the denominator. 13.6. Compute: ti) 18) \5/ (i) (:), (ii) 8.7.6.5.4 1.2.3.4-5 (;), (iii) (y). = 56 Note t h at 8 - 5 = 3; hence we could also compute a s follows: (8)1 \5/ 8.7.6 (ii) Now 9 - 7 = 2; hence (16p) (iii) Now 1 0 - 6 = 4; hence (140) = 10.9.8'7 = 1.2.3.4 = 210. 13.7. Prove: 16! 16! 11 3 to (?) Multiply the first fraction by 6 3 and the second by obtain the same denominator in both fractions; and then add: ):I( + 6-16! 11*16! = 6*5!.11! 6!.11.10! - 6-16! 11*16! 6!*11! + - - 6-16! G!*11! (6+11)*1G! 6 ! *11! + -1G 1! - 1 6 ! e l l ! 17-* 16! 6! 11! - 17! 6!*11! CHAP. 131 T H E BINOMIAL C O EFFICIEN TS AND THEOREM 147 (The technique in this proof is similar t o tha t of the preceding problem and Example 4.1.) n! 11 ! denomi1): r + I)! - r ) ! To obtain nt h-ersame )+(:) = +l nator in both fractions, multiply the first fraction by? and the second fraction by Hence Now I (r- I/\ T - 1) - * (ti - + r ! (n- n--r+l' r - 1) n ! ?-!(n-y+l)! (n-T+ T. * 1 2 ! ~~- r!(n-r+l)! + r - n ! t (n-r+l)*n! Y! (n-r+l)! __ Y! (iLtl)?i! - oz-r+m - T - Ir -- (12 - r - l)]* n ! r ! (n-rTfl)! ( n+ 1)! ! ( n- ?' 1 7 + BINOMIAL THEOREM 13.9. Expand and simplify: (z + 3 7 ~ ) ~ . - + + + 13.13. Given (a b)6 = a6 6a5b+ 15a4bbL + 20a3b3 15a2b4-L 6ab5 b6. and ( a b)'. - Expand (a+ b)' We could expand (u+ b)7 and ( a T b ) k using the binomial theorem. However, given the expansion of (a- b ) F , it is simpler to use Pascal's triangle to obtain the coefficients of (a+ b)' and ( a b ) h . F i r s t n r i t e down the coefficients of (a b)6 and then compute the next two rows of the triangle + + THE BINOMIAL COEFFICIENTS AND THEOREM 148 1 6 20 15 [CHAP. 13 1 6 Recall t h at each number is obtained by adding the two numbers above, e.g. 28 = 7 21, 56 = 35 21. From the above diagram we obtain + + + b)7 ( a + b)8 (a = = 35 = 1 5 + 2 0 , + 7a6b + 21ajbz + 35a4b3 + 35a3b4 + Zla'b5 + 7ab6 + b7 a6 + 8a7b + 28a6bz + 56a5b3 + 70a4b* + 56a3b5 + 28a2b6 + 8ab7 + b8 a7 + 13.14. (i) How does (a - b)" differ from (a b)"? (ii) Use Problem 13.11 t o obtain (2%- y2)5. ii) The expansion of ( a - b)n will be identical to the expansion of ( a + b)" except t h a t the signs in ( a - b)" will alternate beginning with plus, i.e. terms with even powers of b will be plus and terms with odd powers of b will be minus. This follows from the fa c t tha t ( - b ) r = br if T is even and ( - b ) r = -br if T is odd. ( i i ) By Problem 13.11, + 80x4~2+ 80x3~4+ 40x2y6 + 10xy6 + j u s t alternate the signs in (2s + y*)? = 32x5 - 80&42/2 + 8 0 ~ -~ 4Ox2y6 2 -~ 9 + 1 0 % -~ y'~O + (2% Y ' ) ~ = 3225 y10 To obtain (2%- y2)5, ( 13.15. Obtain and simplify the term in the expansion of (239 - y3)8 which contains do. Nethod 1. The general term in the expansion of (2&-y3)8 (;) Hence the term containing 210 (222)s-r has (:) (-y3)' ~16-2"= is 26-r %16-2r (- Y3)7 d o or 16 - 27 = 10 or T = 3, t h a t is, is the term Method 2. Each term i n the expansion contains a binomial coefficient, a power of 2x2 and a power of -y3; hence first write down the following: To obtain x10, the power of 2x2 must be 5; hence insert the exponent 5 to obtain () (2x215 ( 3 3 ) Now the sum of the exponents must add up to 8, the exponent of (-us) must be 3. Insert the 3 to obtain ( ) (2x95 (222-y3)8; hence the exponent of (-1/3)3 Last, the binomial coefficient must contain 8 above and either exponent below. term is (:) (2x2)s (-y3)3 = - 1792x10~9 Thus the required CHAP. 13! THE BINOMIAL COEFFICIENTS A N D THEOREM 13.16. Prove: 24 = Expand (1 16 149 = + 1)4 using the binomial theorern: 71 In\ 13.17. Prove the Binomial Theorem 13.2: (a+b)" = \a"-'@. r = ~\y) The theorem is true f o r n = 1, since + b)n and prove it is true f o r ( a + b)(a + b). ( a + b ) [ a n + (Y)an-lb + ... + We assume the theorem holds f o r ( a (a + b)nrl = = + (:)an-rb~ + ... + ( a + b)n+'. (Tnl) an-rtlbr-l (7) a bn-' + bil 3 N o w the term in the product which contains br is obtained from [(TnJ + But, by Theorem 13.3, Note that ( a (91 (?' + b ) ( a + b)n is a polynomial of degree n + 1 in b. Consequently, which was to be proved. MULTINOMIAL COEFFICIENTS 13.18. Compute: iii) ( 4 , 2 8 2 , O ) 8! - 8 * 7 * 6 * 6 * 4 * 3 * 2 * 1= 420 = 4!2!2!0! - 4*3*2*1*2*1*2.1*1 (iii) The expression ( ;\, 2 > h as no meaning since 5 + 3 + 2 + 2 5, # 10. br 150 [CHAP. 13 THE BINOMIAL C O E F F I C I E N T S AND T H E O R E M 13.19. Show that , Observe t h a t the expression Hence implicitly implies t h a t n, in2 = n or n2 = n - n,. n! n, ! (n- n,)! n! n l ! n,! - 13.20. Find the term in the expansion of ( 2 ~ ~ - 3 x y ~ +which z ~ ) ~contains xll and y4. The general term of the expansion is 2a3;:3a 1' (-3)b~byZb 2% (a, b , c / - 2 a ( - 3 ) b % 3 a C b @ b z2c (a, b, c / Thus the term containing x l l and ~4 h a s 3a+ b = 11 and 2 b = 4 or, b = 2 and a = 3. since a - b c = 6, we have c = 1. Substituting in the above gives + Supplementary Problems FACTORIAL 13.21. Compute: (I) 9!, 13.22. Compute Ii), :14! 16' (ii) l o ! , iiii) 11! 14! (ii) - 8' ll!' iiii), :l o ! 13.23. Write in terms of factorials: 13.24. Simplify, lo! (iv) - . 13! ii) 2 1 * 2 3 - 2 2 - 2 1 , ( n tl ) ! 72 ! (i) 7 (ii, , __ n. (Ti - 2 ) ! ' (iii) (ii) (n- 1) ! 1 (iii) 42. 10.11-12 ' (iv) (n-r (?z - + lI )) !!. - BINOMIAL AND MULTINOMIAL COEFFICIENTS 13.26. (7) (7) = + iii) (z2- 3y)4, (iii) (+a Show iwithout using Theorem 13.3) t h a t a );( BINOMIAL 1 S D MULTINOMIAL THEOREMS 13.28. Expand and simplify: (i) 12x y"3, + 2b)5, (iv) (2az- b)6. 13.29. The eighth row of Pascal's triangle is a s follows: 1 8 28 56 70 Compute the ninth and tenth rows of the triangle. 56 28 8 1 Also, 13.30. [n) 0 Show t h a t + (7) + (;) + - (;) - !\ 13.31. 13.32. 151 T H E BINOMIAL C O E F F I C I E N T S A N D THEOREM CHAP. 131 (c; Show t h a t (;) + ... + /71\ ,3/ (;I T 3 . + (E) = 2n. t (;) = 0. . Find the term in the expansion of (2x2 - +2/3)6 which contains x*. + 13.33. Find the term in the expansion of ( 3 ~ ~ 2 2) : which contains yG. 13.34. Find the term in the expansion of (2.j.' 13.35. Find the term in the expression of (ru - ~2 y? - + 32)' which contains T~ and z4. - 2 z ) 6 which ~3 and us. contains Answers to Supplementary Problems 13.21. (i) 362,880 13.22. (i) 240 13.23. (i) 24!/20! 13.24. (i) n 13.25. iiii 2184 13.27. (i) 504 13.28. (i) -n n2 + iiii) l / [ ? t ( n l)(n (iv) 15 iiiii 91 IV) (iv) (12 - r ) ( n- r 4-1) (vi) 816 1140 + 1 2 ~ ~ ; 1 / b~ y 4 + y6 7 (iii) a"/32 + 81yq + 5 d b / 8 + 5u3b2 + 20a2b3 + 40ab4 + 32b5 (iv) 64a" - (ii) x8 + 2)] iiii) 180 iiii 210 8 d (iii) 42!/41! or 7 ! / 5 ! iiii x(x - 1) = ( i i ) 35 (iv) 111716 (iii) 1/90 rii) 9 ! / 1 2 ! +1 (ij 10 (iii) 39,916,800 iii 3,628,800 - 1 2 . ~ 6-~5 4 x 4 ~-~108x2y3 192a"Jb A 240a8b2 - 160a6b3 - 60a4b4 - l W b 5 8 1 13.29. 1 1 + l)']. 13.30. Hint. Expand (1 13.31. Hint. Expand (1 - l ) n . 13.32. 7OxS.y" 13.33. 945x3yGz8 13.34. -(105/4)~4~'~4 13.35. -240~3~5~' 36 9 10 45 56 28 126 84 120 70 210 252 + 56 126 210 b6 28 84 9 36 120 1 8 45 1 10 1 [CHAP. 14 PE R 31 U T A T I 0 N S, 0RD E R E D SA 11P L E S 154 Theorem 14.4: The number of permutations of n objects of which n, a r e alike, n2 a r e alike, . . ., n,.are alike is n! nl! nz! * . * n,! We indicate the proof of the above theorem by a particular example. Suppose we want to form all possible 5 letter words using the letters from the word “DADDY”. Now there are 5 ! = 120 permutations of the objects DI, A, D2, Di, Y, where the three D’s are distinguished. Observe that the following six permutations D1DrD3AY,D2DD1D8AY, D3D1D2AY,D1D3DzAY, D2D3D1AY, and D3D2D1AY produce the same word when the subscripts a r e removed. The 6 comes from the fact that there are 3 ! = 3 . 2 . 1 = 6 different ways of placing the three D’s in the first three positions in the permutation. This is true for each of the three possible positions in which the D’s can appear. Accordingly there are 5 ! - 120 = 20 3! - 6 different 5 letter words that can be formed using the letters from the word “DADDY”. ~ Example 3.1: How many 7 letter words can be formed using the letters of t h e word “ B E N Z E N E ” ? We seek t h e number of permutations of 7 objects of which 3 a r e alike (the three E’s), and 2 a r e alike ( t h e t w o N’s). By Theorem 14.4, there a r e 7! 3 ! 2! ~ such words. Example 3.2: - 7.6.5.4-3.2.1 3.2.1-2.1 = 420 How many different signals, each consisting of 8 flags hung in a vertical line, can be formed from a set of 4 indistinguishable red flags, 3 indistinguishable white flags, and a blue flag? We seek t h e number of permutations of 8 objects of which 4 a r e alike and 3 a r e alike. There a r e 8.7-6.5.4.3.2.1 L -different signals. 1!3 ! = 280 4.3*2.1*3.2*1 ORDERED SAMPLES Many problems in combinatorial analysie and, in particular, probability, are concerned with choosing a ball from a n urn containing n balls (or a card from a deck, or a person from a population). When we choose one ball after the other from the urn, say T times, we call each choice a n ordered sample of size r. We consider two cases: (i) Sampling with replacewzeizt. Here the ball is replaced in the urn before the next ball is chosen. Now since there are n different ways to choose each ball, by the fundamental principle of counting there a r e r times & n - n - n. . . n = rzl different ordered samples with replacement of size T. (ii) Sampliizg without replucemcizt. Here the ball is not replaced in the urn after it is chosen. Thus there are no repetitions in the ordered sample. I n other words, a n ordered sample of size 7- without replacement is simply an r-permutation of the objects in the urn. Accordingly, there a r e :HAP. 141 P E R M U T A T I O S S . ORDERED SAMPLES P(n, r ) = njn - 1 ) ( 1 2 - 2 ) . ’ .(n- r + 1) = 155 12 ! ___ (12-r)! different ordered samples of size r without replacement from a population of n objects. Example 4.1: In how many ways can one choose three cards in succession from a deck of 52 cards iii with replacement, lii’l without replacement? If each card is replaced in the deck before the next card is chosen, then each card can be chosen in 52 different ways. Hence there a r e 5 2 * . 5 2 . 5 2 = 523 = 140,608 different ordered samples of size 3 with replacement. In case there is no replacement, then the first card can be chosen in 52 different ways; the second card can be chosen in 51 different ways; and, lastly, the third card can be chosen in 50 different ways. Thus there a r e 5 2 - 5 1 - 5 0 = 132,600 different ordered samples of size 3 without replacement. Solved Problems 14.1. There are 4 bus lines between A and B ; and 3 bus lines between B and C. (i) In how many ways can a man travel by bus from A to C by way of B ? (ii) In how many ways can a man travel roundtrip by bus from A to C by way of B ? (iii) In how many ways can a man travel roundtrip by bus from A to C by way of B, if he doesn’t want to use a bus line more than once? (i) There a r e 4 ways to go from A to B and 3 ways to go from E to C; hence there a r e 4 * 3 = 1 2 ways to go from A t o C by way of B. (ii) Method 1. There a r e 12 ways to go from A to C by way of B , and 1 2 ways to return. Hence there a r e 12 * 12 = 144 ways t o travel roundtrip. Method 2. The man d l travel from A to B t o C to B to A . E n t e r these letters with connecting arrows as follows: A+E+C+B+A Above each arrow write the nuinher of v a y s t h a t p a r t of the t r i p can be traveled; 4 ways from A t o E . 3 ways from B to C , 3 ways from C to B , and 4 ways from B to A : A%B$C-%’B%A Thus there a r e 4 * 3 * 3 . 4 = 144 ways to travel roundtrip. (iii) The man will travel from A t o E to C to E to A . E n t e r these letters with connecting arrows as follows: A+E+C+B+A The man can travel 4 ways from A t o E and 3 ways from E t o C; b u t he can only travel 2 ways from C to B and 3 ways from R t o A since he doesn’t want to use a bus line more than once. E n t e r these numbers above the corresponding arrows a s follows: A h E % C % B & A Thus there a r e 4 - 3 . 2 . 3 = 72 ways t o travel roundtrip without using t h e same bus line more than once. PERJIUTATIONS, ORDERED S A N P L E S 156 [CHAP. 14 14.2. If repetitions are not permitted, (i) how many 3 digit numbers can be formed from the six digits 2, 3, 5 , 6, 7 and 9 ? (ii) How many of these are less than 400? (iii) How many are even? (iv) How many are odd? (v) How many are multiples of 5 ? 0to represent a n a rbitra ry number, and then In each case d r aw three boxes write in each box the number of digits t h a t can be placed there. (ij The box on the left can be filled in 6 ways; following this, the middle box can be filled in --- 5 ways: and, lastly, t h e box on the right can be filled in 4 ways: a r e 6 . 5 . 4 = 120 numbers. I 6I I5 I I4 I. Thus there (ii) The box on the left can be filled in only 2 ways, by 2 or 3, since each number must be less than 400: the middle box can be filled in 5 ways; and, lastly, the box on the right can be filled I?] T&i . in 4 ways: Thus there a r e 2 5 . 4 = 40 numbers. (iii, The box on t h e right can be filled in only 2 ways, by 2 or 6, since the numbers must be even; the box on the left can then be filled in 5 ways; and, lastly, the middle box can be filled in 4 ways: Fl PI. Thus there a r e 5 4 * 2 = 40 numbers. (ivj The box on t h e right can be filled in only 4 ways, by 3, 5 , 7 o r 9, since the numbers must be odd: the box on the left can then be filled in 5 ways; and, lastly, the box in the middle can FI be filled in 4 ways: (v) . Thus there a r e 5 * 4 * 4 = 80 numbers. The box on the r i g h t can be filled in only 1 way, by 5 , since the numbers must be multiples of 5 : the box on the left can then be filled in 5 ways; and, lastly, the box in the middle can be fiIled in 4 ways: , Thus there a r e 5 * 4 1 = 20 numbers. 14.3. Solve the preceding problem if repetitions are permitted. (ij Each box can be filled in 6 ways: [q . Thus there a r e 6 6 6 = 216 numbers. (ii) The box on the left can still be filled in only two ways, by 2 o r 3, and each of the others in Thus there a r e 2 * 6 6 = 72 numbers. 6 ways: Lf._l . (iii) The box on the r i g h t can still be filled in only 2 ways, by 2 or 6, and each of the others in 6 ways: . Thus there a r e 6 * 6 2 = 72 numbers. - F1 (iv) The box on the right can still be filled in only 4 ways, by 3, 5, 7 or 9, and each of the other Thus there a r e 6 * 6 * 4 = 144 numbers. boxes in 6 ways: El (41. (vj The box on t h e left can still be filled in only 1 way, by 5, and each of the other boxes in 6 ways: Fl . Thus there a r e 6 * 6 1 = 36 numbers. 14.4. In how many ways can a party of 7 persons arrange themselves (i) in a row of 7 chairs? (ii) around a circular table? (ij The seven persons can ar r ange themselves in a row in 7 6 . 5 4 * 3 2 1 = 7 ! ways. (ii) One person can s i t at any place in the circular table. The other six persons can then a rra nge themselves in 6 * 5 4 3 * 2 1 = 6! ways around the table. This is a n example of a circular permutation. I n general, n objects can be arranged in a circle in (n - l)(n- 2) . . 3 2 1 = (n- l)!ways. - 9 157 PERMUTATIOKS, ORDERED S A M P L E S CHAP. 141 14.5. (i) In how many ways can 3 boys and 2 girls sit in a row? (ii) In how many ways can they sit in a row if the boys and girls a r e each to sit together? (iii) I n how many ways can they sit in a row if just the girls are to sit together? (9 The five persons can sit in a row in 5 ~ 4 . 3 . 2 . 1 = 5! = 120 ways. (ii) There a r e 2 ways to distribute them according to sex: BBBGG o r GGBBB. I n each case the boys can sit in 3 - 2 . 1 = 3! = 6 ways, and t h e girls can sit in 2 * 1 = 2! = 2 ways. Thus, altogether, there a r e 2 * 3 ! 2 ! = 2 6 * 2 = 24 ways. - - (iii) There a r e 4 ways to distribute them according to sex: GGBBB, BGGBB, BBGGB, BBBGG. Note t h a t each way corresponds t o the number, 0, 1, 2 o r 3, of boys sitting to the left of the girls. In each case, the boys can sit in 3 ! ways, and the girls in 2! ways. Thus, altogether, there a r e 4 * 3 ! * 2 != 4 . 6 - 2 = 48 ways. 14.6. Solve the preceding problem in the case of r boys and s girls. left in factorials.) (i) The P +s persons can s i t in a row in (T (Answers are to be + s) ! ways. (ii) There a r e still 2 ways to distribute them according to sex, the boys on the left or the girls on the left. In each case the boys can sit in r ! ways and the girls in s! ways. Thus, altogether, there a r e 2 * r ! s! ways. - + 1 ways t o distribute them according t o sex, each way corresponding t o the (iii) There a r e r number, 0,1,. . . , r , of boys sitting to the left of the girls. In each case the boys can sit in r ! ways and the girls in s ! ways. Thus, altogether, there a r e ( r 1)* r ! * s! ways. + 14.7. How many distinct permutations can be formed from all the letters of each word: (i) them, (ii) that, (iii) radar, (iv) unusual, (v) sociological. (i) 4 ! = 24, since there a r e 4 letters and no repetition. (ii) 4r = 12, 2! (iii) 5! = 30, 2! 2! since there a r e 4 letters of which 2 a r e t . since there a r e 5 letters of which 2 a r e r and 2 a r e a , 7' (iv) 3! = 840, since there a r e 7 letters of which 3 a r e u. 12! (") 3 ! 2 ! 2 ! 2 ! ' since there a r e 12 letters of which 3 a r e 0 , 2 a r e c, 2 a r e i, and 2 a r e 1. 14.8. How many different signals, each consisting of 6 flags hung in a vertical line, can be formed from 4 identical red flags and 2 identical blue flags? This problem concerns permutations with repetitions. there a r e 6 flags of which 4 a r e red and 2 a r e blue. 6! There a r e - = 15 signals since 4! 2 ! 14.9. In how many ways can 4 mathematics books, 3 history books, 3 chemistry books and 2 sociology books be arranged on a shelf so that all books of the same subject are together? F i r s t the books must be arranged on the shelf in 4 units according t o subject matter: 0 a. The box on the left can be filled by a n y of the four subjects; the next by a n y 3 subjects remaining; the next by any 2 subjects remaining; and the box on the right by the l a s t subject: a . Thus there a r e 4 . 3 - 2 - 1 = 4! ways to a r r a n g e the books on t h e shelf according to subject matter. In each of the above cases, the mathematics books can be arranged in 4 ! ways, the history books in 3 ! ways, the chemistry books in 3 ! ways, and the sociology books in 2 ! ways. Thus, altogether, there a r e 4 ! 4 ! 3 ! 3 ! 2 ! = 41,472 arrangements. [CHAP. 1 4 PERM U TAT I 0 N S, ORDERED SAM P L E S 158 14.10. Solve the preceding problem if there are r subjects, A,,A,, s,, s2,. . . ,s, books, respectively. . . . , A 7 , containing Since there a r e r subjects, the books can be arranged on the shelf in r ! ways according to subject matter. In each case, the books in subject A , can be arranged in sl! ways, the books in subject A ? in s,! ways, . . ., the books in subject A , in s,! ways. Thus, altogether, there a r e r ! s1 ! s,! . . . s, ! arrangements. 14.11. (i) In how many ways can 3 -4mericans, 4 Frenchmen, 4 Danes and 2 Italians be seated in a row so that those of the same nationality sit together? (ii) Solve the same problem if they sit a t a round table. (i) The 4 nationalities can be arranged in a row in 4! ways. In each case, the 3 Americans can be seated in 3 ! ways, the 4 Frenchmen in 4! ways, the 4 Danes in 4! ways, and the 2 Italians in 2 ! ways. Thus, altogether, there a r e 4! 3 ! 4! 4! 2 ! = 165,888 arrangements. (ii) The 4 nationalities can be arranged in a circle in 3! ways (see Problem 14.4 on circular permutations). In each case, the 3 Americans can be seated in 3 ! ways, t h e 4 Frenchmen in 4! ways, the 4 Danes in 4! wags, and the 2 Italians in 2! ways. Thus, altogether, there a r e 3 ! 3 ! 4 ! 4 ! 2 ! = 41,472 arrangements. 14.12.Find the total number of positive integers that can be formed from the digits 1, 2, 3 and 4 if no digit is repeated in any one integer. Note t h a t no integer can contain more than 4 digits. Let sl, sq, ss and s4 denote the number of integers containing 1 , 2, 3 and 4 digits respectively. We compute each si separately. Since there a r e 4 digits, there a r e 4 integers containing exactly one digit, i.e. s1= 4 . Also, since there a r e 4 digits, there a r e 4 * 3 = 1 2 integers containing two digits, i.e. s2 = 12. Similarly, there a r e 4 * 3 2 = 24 integers containing three digits and 4 * 3 * 2 * 1 = 2 4 integers containing four digits, i.e. s:]= 24 and s,$= 24. Thus, altogether, there a r e s1 s2 sg sq = 4 1 2 24 24 = 64 integers. - + + + + + + 14.13. Suppose an urn contains 8 balls. Find the number of ordered samples of size 3 (i) with replacement, (ii) without replacement. (i) Each ball in the ordered sample can be chosen in 8 ways; hence there are 8 8 8 = 83 = 512 samples with replacement. (ii, The first ball in the ordered sample can be chosen in 8 ways, the next in 7 ways, and the last in 6 ways. Thus there a r e 8 * 7 * 6 = 496 samples without replacement. 14.14. Find n if (i) (i) P(n, 2) = 72, (ii) P(n, 4) = 42P(?z, 2), (iii) 2P(n, 2) P ( n , 2 ) = n(n-1) = n2--n; hence n 2 - n = 72 or n * - n - 7 2 + 50 = P(2n, 2). = 0 or ( n - 9 ) ( n + 8 ) = 0. Since n must be positive, the only answer is n = 9. (ii) P ( n , 4) = n ( n - l ) ( n - 2)(n - 3) and P(n, 2) = n(n- 1). Hence n ( n - l)(n- 2)(n - 3) = 42n(n - 1 ) or +6 n2 - 5% = 42 or n2 or, if n # 0, # 1, (n - 2)(n - 3) = 42 - 5% - 36 = 0 or ( n - 9 ) ( n + 4) = 0 Since n must be positive, the only answer is n = 9. (iii) P ( J z2, ) = ?z(n- 1) = n2- ?z and P ( h , 2) = 2742% - 1) = 4n2 - 2%. Hence 2 ( n Z - n) t 50 = 4nz - 212 or 2x2 - 2% + 50 = 4n* - 2% Since n must be positive, the only answer i s n = 5. or 50 = 2n2 or n2 = 25 PERMUTATIONS, ORDERED SAMPLES CHAP. 141 14.15. Find the relationship between f 12 159 and p(n, r ) . ?' Recall t h a t (I) P(n, T) n! and = ,).! ( n - r ) ! = n! ________ (n- r ) ! ~ P(?i,T ) = r! n! -.(n-r)! T ! nl (n-r)!. - r! Multiply P ( n , r ) by 12 ! r ! (?Z--T)! $ to obtain = r!(:) Supplementary Problems 14.16. (i) How many automobile license plates can be made if each plate contains 2 different letters followed by 3 different digits? (ii) Solve the problem if the first digit cannot be 0. 14.17. There a r e 6 roads between A and B and 4 roads between B and C. (i) In how many ways can one drive from A to C by way of B ? (ii) In how many ways can one drive roundtrip from A to C by way of B ? (iii) In how many ways can one drive roundtrip from A to C without using the same road more than once? 14.18. Find the number of ways in which 6 people can ride a toboggan if one of three must drive. 14.19. (i) Find the number of ways in which five persons can sit in a row. (ii) How many ways a r e there if two of the persons insist on sitting next t o one another? 14.20. Solve the preceding problem if they sit around a circular table. 14.21. Find the number of ways in which a judge can award first, second and third places in a contest with ten contestants. 14.22. (i) Find the number of four letter words t h a t can be formed from the letters of the word HISTORY. (ii) How many of them contain only consonants? (iii) How many of them begin and end in a consonant? (ivj How many of them begin with a vowel? (vj How many contain t h e letter Y ? (vi) How many begin with T and end in a vowel? (vii) How many begin with T and also contain S ? (viii) How many contain both vowels? 14.23. How many different signals, each consisting of 8 flags hung in a vertical line, can be formed from 4 red flags, 2 blue flags and 2 green flags? 14.24. Find the number of permutations t h a t can be formed from all the letters of each word: (i) queue, (ii) committee, (iii) proposition, (iv) baseball. 14.25. (i) Find the number of ways in which 4 boys and 4 girls can be seated in a row if t h e boys and girls a r e to have alternate seats. (ii) Find the number of ways if they sit alternately and if one boy and one girl a r e t o sit in adjacent seats. (iii) Find the number of ways if they sit alternately and if one boy and one girl must not sit in adjacent seats. 14.26. Solve the preceding problem if they sit around a circular table. PERM U T A T I 0 N S, ORDERED SARI PLE S 160 [CHAP. 14 14.27. An urn contains 10 balls. Find the number of ordered samples ( i ) of size 3 with replacement, (ii, of size 3 without replacement, fiii) of size 4 with replacement, (iv) of size 5 without replacement. 14.28. Find the number of ways in which 5 large books, 4 medium-size books and 3 small books can be placed on a shelf so t h a t all books of the same size a r e together. 14.29. Consider all positive integers with 3 different digits. (Note t h a t 0 cannot be the first digit.) (i, How many a r e greater than 7001 iii) How many a r e odd? (iii) How many a r e even? (iv) How many a r e divisible by 5 ? 14.30. (i) Find the number of distinct permutations t h a t can be formed from all of the letters of the word E L E V E N . fii) How many of them begin and end with E l f i i i ) How many of them have the 3 E's together? (iv) How many begin with E and end with N ? Answers to Supplementary Problems 14.16. = 468,000 (i) 2 6 . 2 5 . 1 0 . 9 - 8 iii) 2 6 . 2 5 . 9 . 9 . 8 = 421,200 14.17. 14.18. 3 . 5 * 4 . 3 . 2 * 1 = 360 14.19. (i) 5 ! = 120 (ii) 4 . 2 ! * 3 ! = 48 14.20. (i) 4! = 24 (ii) 2 ! 3 ! = 1 2 14.21. 1 0 . 9 . 8 = 720 14.22. 14.23. 9' - 14.24. (ii) 14.25. (i) 2 * 4 ! * 4 ! = 1152 14.26. (i) 3 ! * 4 ! = 144 14.27. (i) 10 * 10 10 = 1000 (iii) 10 * 10 1 0 10 = 10,000 (ii, 1 0 ' 9 . 8 = 720 (iv) 10 * 9 * 8 7 * 6 = 30,240 2 ! 2! 2! 3 ! 5 ! 4 ! 3 ! = 103,680 14.29. (i) (iii) ~ 2! 3! 2! (ii) 2 . 7 * 3 ! . 3 ! = 504 (ii) 2 * 3 ! * 3 ! = 72 14.28. 3 ' 9 . 8 = 216 45,360 = 1,663,200 (iii) 1152-504 fiii) 144-72 8! (iv) _ _ = 5040 2! 2! 2! = 648 = 72 (ii) 8 ' 8 . 5 = 320 (iii, 9 ' 8 . 1 = 72 end in 0, and 8 ' 8 . 4 = 256 end in the other even digits; hence, altogether, 7 2 + 2 5 6 = 328 a r e even. (iv) 9.8.1 = 72 end in 0, and divisible by 5. 14.30. G! (i) 7 = 120 3. (ii) 4! = 24 8.8.1 = 64 end in 5; hence, altogether, 7 2 + 6 4 = 136 a r e iiii) 4 * 3! = 24 (iv) 4! = 12 2! - Chapter Combinations, Ordered Partitions Theorem 15.1: C(n,T ) = P(n, r ) = ~ r! abc a b c , a c b , bac, bca, c u b , c b a abd a b d , a d b , b a d , bda, d a b , dba acd a c d , a d c , c a d , c d a , d u e , dca bcd bcd, bdc, c b d , c d b , d b c , c k b n! T ! (n- T ) ! 161 15 162 COMBINATIOKS, ORDERED PARTITIONS n! Recall that the binomial coefficient We shall use C(n,r) and Example 1.3: (:) [CHAP. 15 I . hence I interchangeably. How many committees of 3 can be formed from 8 people? essentially, a combination of the 8 people taken 3 a t a time. Each committee is, Thus different committees can be formed. Example 1.4: A f ar mer buys 3 cows, 2 pigs and 4 hens from a man who h a s 6 cows, 5 pigs and 8 hens. How many choices does the fa rm e r have? (i The f ar mer can choose the cows in /6' ways, the pigs in ways, and \3 the hens in ways. Hence altogether he can choose the animals in 1 PARTITIONS AND CROSS-PARTITIONS Recall (Page 60) that a partition of a set X is a subdivision of X into subsets which are disjoint and whose union is X, i.e. such that each a E X belongs to one and only one of the subsets. In other words, the collection {A,,A,, . , . , A m }of subsets of X is a partition of X iff (ii) foranyA,,A,, either A l = A 3 or A t n A j = @ (i) X = A , u A 2 u UA,; The subsets of a partition are called cells. 9 Example 2.1: . . Consider the following classes of subsets of X = { 1 , 2 , 6) . . ., 8,9}: [{I, 3,51, { 2 , 6 } , {4,8,931 (ii) [{I,3,5), { 2 , 4 , 6 , 8 ) , { 5 , 7 , 9 ) ] (iii) [{I,3,51, { 2 , 4 , 6 , 8 } , (7,911 Then (i) is not a partition of X , since 7 6 X but 7 does not belong to a ny cell in (i). Also (ii) is not a partition of X since 5 € X and 5 belongs t o both { 1 , 3 , 5 } and {5,7,9}. On the other hand, (iii) is a partition of X since each element of X belongs t o exactly one cell. Suppose {A,,A,, ..., A,.} and {B,,B,, , . .,Bs} are both partitions of the same set X. Then the class of intersections {At n B , } form a new partition of X called the cross-partition. Example 2.2: Let { A , B, C, D} be the partition of the undergraduate students at a university into freshmen, sophomores, juniors and seniors, respectively; and let {M, F } be t h e partition of the students into maies and females, respectively. The crosspartition consists of the following: A n M : male freshmen A n F : female freshmen B n M : male sophomores B n F : female sophomores C n M : male juniors D n M : male seniors C n F : female juniors D n F: female seniors CHAP. 151 163 COMBINATIONS, ORDERED P A R T I T I O N S ORDERED PARTITIONS Suppose an urn A contains seven balls numbered 1 through 7. We compute the number of ways we can draw, first, 2 balls from the urn, then 3 balls from the urn, and lastly 2 balls from the urn. In other words. we want to compute the number of ordered partitions (AI, A2, A3) of the set of 7 balls into cells A , containing 2 balls, A, containing 3 balls and A , containing 2 balls. We call these ordered partitions since we distinguish between ({I, 21, {3,4,5), ( 6 7 ) ) and (C6,7), {3,4,51, {1,21) each of which determines the same partition of A . Now we begin with 7 balls in the urn, so there are (3 ways of drawing the first 2 balls, i.e. of determining the first cell A , ; follon.ing this, there are 5 balls left in the urn and so mays of drawing the 3 balls, i.e. of determining the second cell A , ; finally, /c, \ ways of determining the last cell A , . there are 2 balls left in the urn and so there are Hence there are 5 . 4 . 3 2.1 7.6 210 - different ordered partitions of A into cells A, containing 2 balls, A, containing 3 balls, and A, containing 2 balls. We state the above result in its general form as a theorem. Theorem 15.2: Let A contain n elements and let n,, n2,. . . , nr be positive integers with 12,1?1,4 .. n? = 12. Then there exist -+ (;$n i 2 n 1 )( n nz ). . . ( n - nl - n2 - * * - -nr-l nr - 1 different ordered partitions of A of the form (A,,A,, . . . , A r ) where A , contains n, elements, A , contains n, elements, . . ., and A , contains nT elements. Now observe that 7! -.-.2!5! 5! 2! 3!2! 2 ! 0 ! - 7! 2!3!2! since each numerator after the first is cancelled by the second term in the denominator of the previous factor. Similarly, (nnl)(i n - nl ) ( n - nl - nz n2 - - n! n3 -, n,!(n- n l ) !* nz!( n- n1- n2)!. n3!( n - n, - n2- n3)! n! nl ! n,! ns! . - n,! ... ~ n,! The above computation together with the preceding theorem gives us our next theorem. CO R I B I N A T I 0 N S, 0 RD E R E D P A R T IT I 0 K S 164 Theorem 15.3: [CHAP. 15 Let A contain 71 elements and let nl, 712, . . . , n , be positive integers with 111 n2 . . . n, = n. Then there exist + + + n! n l !na! ns! . . . n,! different ordered partitions of A of the form (A1,Az, . . .,A,) where A1 contains nl elements, A>contains n2 elements, . . . , and Ar contains nr elements. Example 3.1: In how many ways can 9 toys be divided between 4 children if t h e youngest child is to receive 3 toys and each of t h e others 2 toys? We wish to find the number of ordered partitions of t h e 9 toys into 4 cells containing 3, 2, 2 and 2 toys respectively. By Theorem 15.3, there a r e 3 ! 2 ! 2 ! 2! such ordered partitions. = 2520 Solved Problems COMBINATIONS 15.1. In how many ways can a committee consisting of 3 men and 2 women be chosen from 7 men and 5 women? ways, and the 2 women can be chosen from the 7.6.5 5-4 350 ways. ways. Hence the committee can be chosen in The 3 men can be chosen from the 7 men in 5 women in(') 2) Find the number of ways 4 balls can be drawn from the bag if (i) they can be of any color, (ii) 2 must be white and 2 black, (iii) they must all be of the same color. 3.5.2. A bag contains 6 white balls and 5 black balls. (ii The 4 balls (of a n y color) can be chosen from the 11 balls in the urn in 330 ways. (ii) The 2 white balls can be chosen in ways, and the 2 black balls can be chosen in ' 150 ways of drawing 2 white balls and 2 black balls. Thus there a r e (iii) There a r e (:) = 15 ways of drawing 4 white balls, and balls. Thus there a r e 15 )" = 5 ways of drawing 4 black + 5 = 20 ways of drawing 4 balls(4,of the same color. 15.3. A delegation of 4 students is selected each year from a college to attend the National Student Association annual meeting. (i) In how many ways can the delegation be chosen if there are 12 eligible students? (ii) In how many ways if two of the eligible students will not attend the meeting together? (iii) In how many ways if two of the eligible students are married and will only attend the meeting together? (y) 12 11 ' 10 * 9 = 495 ways. 1.2.3-4 (i, The 4 students can be chosen from the 12 students in (ii, Let A and B denote the students who will not attend the meeting together. = 165 COMBINATIONS, O R D E R E D PARTITIONS CHAP. 151 Method 1. If neither A nor B is included, then the delegation can be chosen in 210 ways. + 240 10-9*8.7 = 1.2.3.4 - If either A o r B , but not both, is included, then the delegation can be chosen in 2 * ( y ) = 2 . 11.02' 9. .38 210 (140) 1 240 ways. Thus, altogether, the delegation can be chosen in = 450 ways. Method 2. If A and B a r e both included, then the other 2 members of the delegation can be chosen in 1'" = 45 wags. Thus there a r e 495 - 45 = 450 ways t h e delegation can be chosen if (2, A and B a r e not both included. (iii) Let C and ll denote the married students. If C and D do not go, then t h e delegation can be chosen in (120\' /"" \4: 210 ways. ~ If both C and D go, then the delegation can be chosen in ~ = 45 wags. Altogether, the delegation can be chosen in 210 t 45 = 255 wags. 15.4. There are 12 points A , B, . . . in a given plane, no three on the same line. (i) How many lines are determined by the points? (ii) How many of these lines pass through the point A ? (iii) How many triangles are determined by the points? (iv) How many of these triangles contain the point A as a vertex? Since two points determine a line, there a r e ('2") = 1 2 * 11 1 . 2= 66 lines. To determine a line through A , one other point must be chosen: hence there a r e 11 lines through A . 12 - 1 2 * 1 1 * 1 0 Since three points determine a triangle, there a r e ( 3 ) 1.2.3 = 220 triangles. Method 1. To determine a triangle with vertex A , two other points must be chosen; hence there a r e ('2' ) = = 55 triangles with A a s a vertex. Method 2. There a r e (131 ) , = 165 triangles without A a s a vertex. = Thus 220 - 165 = 55 I of the triangles do have A a s a vertex. 15.5. A student is to answer 8 out of 1 0 questions on an exam. (i) How many choices has he? (ii) How many if he must answer the first 3 questions? (iii) How many if he must answer at least 4 of the first 5 questions? (i) The 8 questions can be selected in lo - lo ( 8 ) - (2) - = 45 ways. (ii) If he answers the first 3 questions, then he can choose the other 5 questions from the last 7 questions in '7 (5) - 7 - (2) - 7.6 = 1.2 21 ways. (iii) If he answers all the first 5 questions, then he can choose the other 3 questions from the last 5 in (i) = 10 ways. On the other hand, if he answers only 4 of the first 5 questions, then he can choose these 4 in from the last 5 in 25 ways. 5(i 4) = (5) = 5 ways, and he can choose the other 4 questions (i) = (f) = 5 ways; hence he can choose the 8 questions in 5 . 5 = Thus he h a s a total of 35 choices. 166 [CHAP. 15 COMBINATIOXS, ORDERED PARTITIONS 15.6. How many diagonals has an octagon? An octagon has 8 sides and 8 vertices. Any two vertices determine either a side or a diagonal. = 28 sides plus diagonals. 28 - But there a r e 8 sides; hence there a r e 8 = 20 diagonals. 15.7. How many diagonals has a regular polygon with ?z sides? sides has, also, n vertices. Any two vertices determine either a ’n\ n(n-1) n ( n - 1) sides plus diagonals. B u t there side or a diagonal. Thus there a r e 1 1.2 : 2 / a r e 71 sides: hence there a r e The regular polygon with n(n- 1) ~2 71 n = 722 -n 2 2n 2 - n2 - - 3n 2 - ’ 3 diagonals 2 15.8. Which regular polygon has the same number of diagonals as sides? The regular polygon with 71 sides has, by the preceding problem, we seek the polygon of n sides f o r which n2 - 3n - n 2 or n1 - 3 7 ~ = 2n or n2 - 5n = 0 nz - 3 n 7 diagonals. or Thus n(n-5) = 0 Since 71 must be a positive integer, the onIy answer is n = 5. In other words, the pentagon is the only regular polygon with the same number of diagonals a s sides. 15.9. A man is dealt a poker hand (5 cards) from an ordinary playing deck. I n how many ways can he be dealt (i) a spade flush (5 spades), (ii) an ace high spade flush (5 spades with the ace), (iii) a flush ( 5 of the same suit), (iv) a n ace high full house (3 aces with another pair), ( ~ 7 ) a full house (3 of one kind and 2 of another), (vi) 3 aces (without the fourth ace or another pair), (vii) 3 of a kind (without another pair), (viii) two pair? 13) - 13 * 1 2 - 1 1 1 0 - 9 = 1287 ways. The 5 spades can be dealt from the 13 spades in (i) ( 5 ,/ 1.2*3*4.5 (ii) If he receives the spade ace, then he can be dealt the other 4 spades from the remaining (iii) There a r e 4 suits, and a flush from each suit can be dealt (see (i)) in flush can be dealt in 4 . (iv) (7) (153) ways; hence a = 5148 ways. The 3 aces can be dealt from the i aces in (i) = (:> = 4 ways. in 12 ways, and the 2 cards of this pair can be dealt in ways. The pair can be selected Thus the ace high full house can be dealt in 4 12 {4) = 4 12 4 * 3 = 288 ways. \2 1.2 0- (v) One kind can be selected in 13 ways and 3 of this kind can be dealt in kind can be selected in 12 ways and 2 of this kind can be dealt in house can be dealt in 13 * 12 * = 3744 ways ($ (i) fobserve, by (iii), t h a t a flush is more common than a full house.) c1 ways; another ways. Thus a full COMBINATIOKS, ORDERED PARTITIOKS CHAP. 151 (vi) (:) kinds in ( y ) ways; and The 3 aces can be selected from 4 aces in the remaining 12 167 ways. The 2 other cards can be selected from 1 card in each kind can be dealt in 4 ways. / Thus a poker hand of 3 aces can be’ dealt in (vii) One kind can be selected in 13 ways: and 3 of this kind can be dealt in 2 other kinds can be selected from the remaining 12 kinds in (43) ways. The ways; and 1 card in each \ 2 / kind can be dealt in 4 ways. Thus a poker hand of 3 of a kind can be dealt in I ” ~ ‘ 13‘ ways; and 2 of each kind \ 2 ways. Another kind can be selected in 11 mays; and 1 of this kind in (viii) Two kinds (for the pairs) can be selected from the 13 kinds in can be dealt in (:I 4 ways. Thus a poker hand of t w o pair can be dealt in 15.10. Consider 4 vowels (including a) and 8 consonants (including b ) . (i) How many 5 letter “words” containing 2 different von~elsand 3 different consonants can be formed from the letters? (ii) How many of them contain b ? (iii) How many of them begin with b ? (iv) How many of them begin with a ? (v) How many of them begin with a and contain b? (i) (;) The 2 vov,els can be selected from the 4 vowels in ways, and the 3 consonants can be /8‘ selected from the 8 consonants in \ 3 1 ways. Furthermore, each 5 letters can be arranged in a row f a s a “word”) in 5 ! ways. Thus we can form (:; (:) * 5 ! = 6 56 * 120 = 40,320 words i\ ways. However, since b is one of the consonants, the ‘2 other 2 consonants can be selected from the remaining 7 consonants in ways. Again each 5 letters can be arranged as a word in 5! ways. Thus we can form (ii) The 2 vowels can still be selected in (i) (2‘) * * 5 ! = 6 * 21 * 120 = 15,120 Ivords containing b (z) (iii) The 2 vowels can be selected in ( 4 ) ways, and the other 2 consonants can be selected in \2/ ways. The 4 letters can be arranged, following b , in 4 ! ways. Thus we can form (i) (2’) - - 4! = 6 21 * 24 = 3024 words beginning with b 168 COMBINATIONS, ORDERED PARTITIONS [CHAP. 15 ( i d The other vowel can be chosen in 3 ways, and the 3 consonants can be chosen in The 4 letters can be arranged, following a , in 4! ways. 3 . 5 6 . 2 4 = 4032 words beginning with a. (vi (z) Thus we form - (:) 3 /8) \3/ ways. * 4! = The other vowel can be chosen in 3 ways, and the 2 other consonants can be chosen in 3 The 4 letters can be arranged, following a, in 4! ways. xays. (7) 2, - 4! = 3 - 2 1 - 2 4 Thus we can form = 1612 words beginning with a and containing b . 15.11. How many committees of 5 Tvith a given chairman can be selected from 12 persons? The chairman can be chosen in 12 ways and, following this, the other 4 on the committee can be chosen from the 11 remaining in such committees. (y) ways. Thus there a r e 1 2 . (121) = 12.330 = 3960 15.12. Find the number of subsets of a set X containing ?z elements. Method 1. The number of subsets of X with T subsets of X. The above sum (see Problem 13.30, Page 151) is equal to 2n, i.e. there a r e 2" subsets of X. Method 2. There a r e two possibilities for each element of X : either it belongs to the subset o r it doesn't; hence there a r e times 2 . 2 . . . . a 2 = 2n ways t o form a subset of X , i.e. there a r e 2n different subsets of X. 15.13. In how many ways can a teacher choose one or more students from six eligible students ? Method 1. By the preceding problem, there a r e 2s = 64 subsets of the set consisting of the six students. However, the empty set must be deleted since one o r more students a r e chosen. Accordingly, there a r e 26 - 1 = 64 - 1 = 63 ways to choose the students. Method 2. Either 1 , 2 . 3 , 4 , 5 or 6 students a r e chosen. Hence the number of choices is (F) + 4- (i) + (:) + (i) + (z) = 6 + 15 + 20 + 15 + 6 i1 = 63 15.14. In how many ways can three or more persons be selected from 12 persons? There a r e 21a - 1 = 4096 - 1 = 4096 ways of choosing one o r more of the twelve persons. Now there a r e ( y )+ (7) Hence there a r e 4096-78 = 12 + 66 = 78 ways of choosing one or two of = 4018 ways of choosing three or more. the twelve persons. CHAP. 151 COMBINATIONS, ORDERED PARTITIONS PARTITIONS AND CROSS-PARTITIONS 15.15. Which of the following are partitions of (i) 169 X = {a,b, c, d , e, f , g } ? [{a.0,c:, "Zj, if,9)] (ii) [{a, 0,c ) , {c, d , e}, { f , s } ] (iii) [{a, b , c ) , ( d , e ) , { f ,9 } ] (i) This is not a partition of X since e E X, but e does not belong to any cell. (ii) This is not a partition of X since c E A' and belongs to both (iii) This is a partition of X . Each element of T { a , 6, c) and {c, d, e } . belongs t o exactly one cell. 15.16. Let cA = { A , , A , , . . . , A , , L }and 49 = ( B , ,B,, . . . , B J be partitions of the same set X . Show that the cross-partition C = {Ai n B I : A i E d , B j E CB} is also a partition of X. Let z E X. Then x belongs to some d L o and to some Bi,, since 04 and '73 a r e partitions of X . Thus x E A , o n E , o and so belongs t o a member of the cross-partition. On the other hand, suppose 3: E A ; , , I ~ I B and ,~ AilnB,il. Then z E Aio and A , , , whence A . = A i since d is a partition of X . Similarly, B . = B j l . Accordingly, A i O n B j ,= A , l n B , l '0 1 30 and so C is a partition of X . 13.17. Let X = (1,2,3,4,5,6,7,8}. Find the cross-partition of each pair of partitions of X : (i) [{I,2,3, 4 } , .(5,6,7,8>1 and [{I,2 , 8 ) , ( 3 , 4 , 5 , 6 , 7 ) ] (ii) [{I,2 , 3 , 4 ) , { 5 , 6 , 7 , 8 } ] and [il, 2 ) , { 3 , 4 , 5 } , {6,7,8)] To obtain the cross-paTAition, intersect each set of one partition with each set of the other. (i) {1,2,3,4] C5,6,7,8> n {1,2, 8) = {8) n {l,2 , 8 } = {1,2} ; { 1 , 2 , 3 , 4 ) n{3,4,5,6,7} = {3,4}; Thus the cross-partition is {5,6,7,8}n{3,4,5,6,7] = {5,6,7} [{l, Z}, { 3 , 4 } , {8), {5,6,7}]. (ii) (1,2,3,4$ n {l, 2 ) = {1,2} ; = {3,4}; n {6,7,8> = P ; {5,6,7,8) n U , 2 } = 2' ) = (5) {1,2,3,4}n(3,4,5} {5,6,7,8)n{3,4,5} {1,2,3,4: ( 5 , 6, 7, 8) n {6,7,8} = (6, 7,8} Thus the cross-partition is [{l, 2 } , {3,4}, {5}, {6,7,8), 81 or, simply, [{I, a}, {3,4), (51, {6,7,8}] Remark: We do not distinguish partitions which differ by t h e empty set. 15.18. Let X denote the domain of a proposition P ( p , q, 7') of three variables: X = {TTT, TTF, TFT, T F F , FTT, F T F , F F T , F F F } (i) Partition X according to the number of T's appearing in each element. (ii) For the proposition P (11- q ) A r , partition X into T ( P ) and ( T ( P ) ) c . (Recall that 7 ( P ) is the truth set of P.) ) ('T(Q))c. (iii) For .the proposition Q = - p + ( q A T ) , partition X into 7 ( Q and (iv) Find the cross-partition of the partitions of X in (ii) and (iii). (i) Combine the elements of X with the same number of T's t o form the cells: [{FFF}, ( T F F , F T F , FFT}, {TTF, T F T , F T T ) , {TTT}] COMBIKATIONS, ORDERED PARTITIONS 170 iii) and (iii). Construct the t r u t h tables of P = ( p -+ q ) P f F Q r P+4 T and Q = - p [CHAP. 15 + (q A r): (P+Q)AT T T T T F T T F F T F F F F F F T T T T T F T F F T T T T T F F T F T T I B I 1,'l T F Note t h a t P is t r u e in Cases (lines) 1 , 5 and 7, and Q is t r u e in Cases 1 , 2 , 3 , 4 and 5. Thus T(P) = {TTT, FTT, F F T ) and I(Q) = {TTT, T T F , T F T , T F F , FTT} and so the required partitions a r e [{TTT, FTT, FFT}, {TTF, T F T , T F F , F T F , F F F } ] [{TTT, T T F , TFT, T F F , FTT}, { F T F , F F T , FFF}] and (iv) The cross-partition is obtained by intersecting each set of one partition with each set of the other: {TTT, FTT, F F T } n {TTT, TTF, TFT, T F F , FTT} = {TTT, FTT} {TTT, F T T , F F T } n { F T F , F F T , F F F } = {FFT} {TTF, T F T , T F F , F T F , F F F ) n {TTT, T T F , T F T , T F F , FTT} = {TTF, T F T , T F F } {TTF, T F T , T F F , F T F , FFF] n { F T F , F F T , F F F ) = { F T F , F F F } Thus t h e cross-partition is [{TTT, FTT}, {FFT}, {TTF, T F T , TFF}, { F T F , F F F } ] ORDERED AND UNORDERED PARTITIONS 15.19.In how many ways can 7 toys be divided among 3 chiIdren if the youngest gets 3 toys and each of the others gets 2? We seek the number of ordered partitions of 7 objects into cells containing 3, 2 and 2 objects, respectively. By Theorem 15.3, there a r e "' -= 210 such partitions. 3! 2 ! 2 ! 15.20.There are 12 students in a class. In how many ways can the 12 students take 3 different tests if 4 students are to take each test? Method 1. We seek the number of ordered partitions of the 1 2 students into cells containing 4 students 12! each. By Theorem 15.3, there a r e = 34,650 such partitions. Method 2. There a r e 1' ' (7) ways to choose 4 students to take the first test; following this, there a r e ways t o choose 4 students to take the second test. The remaining students take the third test. Thus. altogether, there a r e ( y ) (t) * = 495 70 = 34,650 ways f o r the students to take t h e tests. 171 COMBINATIONS, ORDERED PARTITIONS CHAP. 151 15.21. In how many ways can 12 students be partitioned into 3 teams, A , , A , and A,, so that each team contains 4 students? Method 1. Observe t h a t each partition { A l , A 2 , A 3 }of the students can be arranged in 3! = 6 ways as a n ordered partition. Since (see the preceding problem) there a r e partitions, there a r e 34,650/6 = 5775 (unordered) partitions. Method 2. Let A denote one of the students. Then there a r e (131) 12! = 34,650 such ordered ways to choose 3 other students to be on the same team as A . Now let B denote a student who is not on the same team as A ; then there are (:) ways t o choose 3 students of the remaining students t o be on the same team as B. The remaining 4 students constitute the third team. 165.35 = 5 7 7 5 ways t o partition the students. Thus, altogether, there a r e (Y) *(;I = 15.22. In how many ways can 6 students be partitioned into (i) 2 teams containing 3 students each, (ii) 3 teams containing 2 students each? (i) Method 1. 61 Since There ar e 3 ! 3! = 20 ordered partitions into 2 cells containing 3 students each. each unordered partition determines 2 ! = 2 ordered partitions, there a r e 20/2 = 10 unordered partitions. Method 2. Let A denote one of the students; then there a r e ([) = 1 0 ways to choose 2 other students to be on the same team as A . The other 3 students constitute the other team. I n other words, there a r e 10 ways to partition the students. (ii) Method 1. 61 There a r e = 90 ordered partitions into 3 cells containing 2 students each. Since each unordered partition determines 3! = 6 ordered partitions, there a r e 90/6 = 15 unordered partitions. & Method 2. Let A denote one of the students, Then there a r e 5 ways to choose the other student to be on the same team as A . Let B denote a student who isn't on the same team as A ; then there a r e 3 ways to choose another student to be on the same team as B. The remaining 2 students constitute the other team. Thus, altogether, there a r e 5 3 = 15 ways to partition the students. - 15.23.In how many ways can a class X with 10 students be partitioned into 4 teams A,, A,, B,, B,, where A , and A , contain 2 students each and B , and B, contain 3 students each ? Method 1. lo! There are 2 ! 2 ! 3 ! 3 ! = 25,200 ordered partitions of X into 4 cells containing 2,2,3 and 3 students, respectively. However, each unordered partition { A , ,A,, B1,B,} of X determines 2! 2! = 4 ordered partitions of X . Thus, altogether, there a r e 25,20014 = 6300 unordered partitions. Method 2. There a r e (y) ways to choose 4 students who will be on the teams A , and A,, and there a re 3 ways in which each 4 students can be partitioned into 2 teams of 2 students each. On the other hand, there ar e 10 ways (see preceding problem) t o partition the remaining 6 students into 2 teams containing 3 students each. Thus, altogether, there a r e (:)*3*10 to partition the students. = 2 1 0 * 3 * 1 0 = 6300 ways 172 COMBINATIONS, ORDERED PARTITIONS ;CHAP. 15 15.24.In how many ways can 9 students be partitioned into three teams containing 4, 3 and 2 students, respectively? Since all the cells contain different numbers of students, the number of unordered partitions equals the number of ordered partitions which, by Theorem 15.3, is - 1260. & 15.25. (i) In how many ways can a set X containing 10 elements be partitioned into two cells? (ii) In how many ways can 10 students be partitioned into two teams? (i, Method 1. Each subset A of X determines a n ordered partition ( A , A c ) of X (where Ac is the complement of A ) , and so there a r e 210 = 1024 such ordered partitions. However, each unordered partition { A , B } determines two ordered partitions, ( A , B ) and ( B , A ) , and so there a r e l 0 2 4 / 2 = 512 unordered partitions. Method 2. Let j . denote one of the elements in X . Now X - { x } contains 9 elements, and so there a r e 2q = 512 ways of choosing a subset of X - {x} t o be in the same cell as x. In other words, there a r e 512 such partitions. (Observe t h a t { X , @ ) is one of these partitions.) (iii Here we assume t h a t each team must contain at least one student, hence-we do not allow the partition f o r which one team contains 1 0 students, and the other team none. Accordingly, there a r e 5 1 2 - 1 = 511 possible partitions. Supplementary Problems COMBINATIONS 15.26. A class contains 9 boys and 3 girls. (i) In how many ways can the teacher choose a committee of 4? (iii How many of them will contain at least one girl? (iii) How many of them will contain exactly one girl? 15.27. A woman has 11 close friends. ( i ) In how many ways can she invite 5 of them to dinner? (ii) In how many ways if two of the friends a r e married and will not attend separately? iiii) In how many ways if two of them a r e not on speaking terms and will not attend together? 15.28. A woman has 11 close friends of whom 6 are also women. ii) In how many ways can she invite 3 o r more of them to a p a r t y ? (ii) In how many ways can she invite 3 or more of them if she wants the same number of men as women fincluding herself)? 15.29. There a r e 10 points A , R , . , . in a plane, no three on the same line. (i) How many lines a r e determined by the points? (ii) How many of these lines do not pass through A or E ? (iii) How many triangles a r e determined by the points? (iv) How many of these triangles contain the point A ? I W ) How many of these triangles contain the side A B ? 15.30. A student is t o answer 10 out of 13 questions on a n exam. (i) How many choices has he? iii) How many if he must answer the first two questions? (iii) How many if he must answer the first or second question but not both? (iv) How many if he must answer exactly 3 of t h e first 5 questions? (v) How many if he must answer a t least 3 of the first 5 questions? 15.31. How inany diagonals h a s a (i) hexagon, (ii) decagon? 15.32. Which regular polygon h as diagonals a s sides? 15.33. 173 COMBINATIONS, ORDERED PARTITIONS CHAP. 151 (i) twice as many diagonals as sides, (ii) three times as many (i) How many triangles a r e determined by the vertices of a n octagon? (ii) How many if the sides of the octagon a re not to be sides of a ny triangle? 15.34. How many triangles ar e determined by the vertices of a regular polygon with of the polygon ar e not t o be sides of any triangle? 12 sides if the sides 15.35. A man is dealt a poker hand (5 cards) from a n ordinary playing deck. In how many ways can he be dealt (i) a straight flush, (ii) four of a kind, (iii) a straight, (iv) a pair of aces, (v) two of a kind (a p a i r ) ? 15.36. The English alphabet has 26 letters of which 5 a r e vowels. (i) How many 5 letter words containing 3 different consonants and 2 different vowels can be formed? (ii) (iii) (iv) (v) (vi) (vii) (viii) How How How How How How How (ix) (x) How many of them begin with a and end with b? How many of them contain the letters a , b and c? many many many many many many many of of of of of of of them contain the letter b ? them contain the letters b and c ? them begin with b and contain the letter c? them begin with b and end with c? them contain the letters a and b? them begin with a and contain b? them begin with b and contain a? PARTITIONS AND CROSS-PARTITIONS 15.37. Which of t he following are partitions of (i) [{a,e , Q, Co), {u)] (ii) [{a,i,u>, { e l , (0,ull (iii) [{a,e ) , Co), Czt)] 35.38. 15.39. @I Find the cross-partition of each pair of partitions of X = { 1 , 2 , 3 , 4 , 5 , 6 ) : . (i) [{l, 2 , 3 ) , { 4 , 5 , 6 ) ] and [{I, 3,5), { 2 , 4 , 6 ) ] (ii) [{I, 2,3), { 4 , 5 , 6 ) ] and [{I, 21, {3,4, 5), (611 . Let X denote the domain of a proposition P ( p , q , ~ of ) three variables: X = {TTT, T T F , T F T , T F F , F T T , F T F , F F T , FFF) (i) (ii) (iii) (iv) 15.40. X = (a,e,i,o,u}? (iv) [ { a ,el, Ci), (01, {a,el, {ull (v) [ { a ,e , 4, (0,u1, (vi) [@, { a ) , Ce, u.1,@, {i, 011 Partition X according to the number of F's appearing in each element. F o r the proposition P = ( p v q ) A T , partition X into T ( P ) and ( 7 ( P ) ) c . Partition X into T(q) and T ( - q ) . Find the cross-partition of t h e partitions in (ii) and (iii). F o r a n y proposition P = P ( p , q, of P . . . .), show tha t [ T ( P ) ,T ( - P ) ] forms a partition of the domain that is, each 15.41. Let the partition { A , , A , , . . .,A,.} be a refinement of the partition {B,, B,, . . .,Bs}, A i is a subset of some Bj. Describe the cross-partition. 15.42. Show t h a t a cross-partition is a refinement (see preceding problem) of each of the original partitions. ORDERED AND UNORDERED PARTITIONS 15.43. I n how many ways can 9 toys be divided evenly among 3 children? 15.44. In how many ways can 9 students be evenly divided into three teams? 174 COMBINATIONS, ORDERED PARTITIONS 'CHAP. 15 15.45. In how many ways can 1 0 students be divided into three teams, one containing 4 students and the others 3 1 15.46. There a r e 12 balls in a n urn. In how many ways can 3 balls be drawn from t h e urn, f o u r times in succession? 15.47. In how many ways can a club with 1 2 members be partitioned into three committees containing 5,4 and 3 members respectively? 15.48. In how many ways can n students be partitioned into two teams containing at least one student? 15.49. In how many ways can 14 men be partitioned into 6 committees where 2 of the committees contain 3 men and the others 2 ? 15.50. In how many ways can a set X with 3 elements be partitioned into (ii, three (unordered) cells? (i) three ordered cells, 15.51. In how many ways can a set X with 4 elements be partitioned into (ii) three (unordered) cells? (i) three ordered cells, 15.52. In bridge, 13 cards a r e dealt to each of four players who a r e called North, South, E a s t and West. The distribution of the cards is called a bridge hand. ii) How many bridge hands a r e there? (ii, In how many of them will one player be dealt all four aces? (iii) In how many of them will each player be dealt a n ace? (iv) In how many of them will North be dealt 8 spades and South the other 5 spades? (v) In how many of them will North and South have, together, all four aces? (Rrmark. Leave answers in factorial notation.) Answers to Supplementary Problems 15.26. (i, 15.27. (i, ( y ) (:) ) (93) + ( y ) (';) (7) = 495, (ii) = 462, (ii) 15.28. (i) 211 - 1 - 15.29. (i) ('2", = 15.30. (i) = (ii) 15.31. (i) (ii) (i) or (iii) (iii) 2 * (:) *(:I (t:> + ('41) (130) = 252 = 252 + . * . + (") \ 11 = 120, (iv) = 36, = 1981 (v) 8 (7) = 286 (y)= (y)= (i) = 28, (iii) 3 = 210, = 1981 - 45, = 369, - - 6 = 9, 165 (ii) (Y) - 10 = 35 15.32. Hl?zt. By Problem 15.7, the regular polygon with n sides has ( n 2 - 3n)/2 diagonals. ( i ) (?zL - 3 n ) / 2 = 2% or n = 7, ( i i ) (nL- 3n)/2 = 3n or ?z = 9 15.33. (i) = 56, (ii) (:) -8 (t) - 8 = 16 COMBINATIONS, O R D E R E D PARTITIONS CHAP. 151 175 15.35. (i) 4 - 1 0 = 40, iii) 1 3 - 4 8 = 624, (iii) 10.45 - 40 = 10,200. (We subtract the number of straight flushes.) (iv) 4 /12 * 43 = 84,480, (v) 13 43 = 1,098,240 (:)(?) () ) 2 \ 3 ti)(:) (i) = (:)(:) 15.36. (i) (ii) * 5 ! = 1,596,000 (v) 19 5 ! = 228,000 (vi) 4 * (vii) 4 * (iii) 19 5! (iv) 19-(:) 4 ! = 4560 22,800 (i) (y) * (T) * 3! = 1140 (ix) 4 * 5! = 91,200 (x) 4 * 1 9 . 5 ! = 9120 * * 3! = 4456 (7) 4 ! = 18,240 fviii) 18,240 15.37. (i) yes, (ii) no, (iii) no, (iv) yes, (v) yes, (vi) yes 15.38. (i) [{1,31, { 2 } , C51, (4, Q], 15.39. (i) (ii) [C1,2}, {3}, @,C4,5}, (611 [{TTT}, {TTF, TFT, FTT}, {TFF, F T F , FFT}, {FFF}] (ii) [{TTT, TFT, FTT}, {TTF, T F F , F T F , F F T , FFF}] (iii) [{TTT, TTF, FTT, FTF}, {TFT, T F F , F F T , FFF}] (iv) [{TTT, FTT}, {TFT}, {TTF, F T F ) , { T F F , F F T , FFF}] 15.41. { A , , A , , . . . , A , } , i.e. the refinement partition. 15.42. A , n B , c A , and A , n B , c B , . 15.43. '! = 1680 3! 3! 3! 15.44. 1680/3! = 280 or 15.45. lo! 4 ! 3 ! 3 ! 2! 0 - - (:)(;) or 2100 = 280 ti)(;) = 2100 12! 15.46* 3! 31 3! 3! = 369,600 15.47. 12! = 5! 4! 3! 15.48. 2n-1 27,720 -1 14! .- 15'49' 3! 3! 2! 2! 2! 2! 2 ! 4! 15.50. (i) - 3,153,150 - 33 = 27 (Each element can be placed in any of the three cells.) (ii) The number of elements in the three cells can be distributed as follows: (4 [{31, (01, { O } ] , ( b ) [{2}, {1>,{Oil, (c) [{I}, c11, (111 Thus the number of partitions is 1 + 3 + 1 = 5. 15.51. (i) 3 4 = 81. (ii) The number of elements in the three cells can be distributed a s follows: ( a ) [{4}, (01, {O}], ( b ) [{3$, {I}, {O}], (4 [@I, C2}, {O)], (4 [@I, {I}, U}l. Thus the number of partitions is 1 + 4 + 3 + 6 = 14. 15.52. 52! 48! (ii) 4 9! 13!48! 13! 13! (iii) 4! 12! 121 12! 12! (i) 13! 13! 13! 13! 48! 48! 48! (v) 2 * 9! 13! 13! 13! 2 ' 4 * l o ! 12! 13! 13! + 2 * 3 * ll! ll! 13! 13! + iiv) (13) 39! 8 5!8!13!13! - 48! 2300 11! 13! 13! 13! Chapter 16 Tree Diagrams TREE DIAGRAMS A tree diagram is a device used to enumerate all the logical possibilities of a sequence of events where each event can occur in a finite number of ways. The construction of tree diagrams is illustrated in the following examples. Example 1: Find the product set A X B X C where A = ( 1 , 2 } , B = {a, b , c} and C = { 3 , 4 } . The tree diagram follows: (1,a, 3) (1, a, 4) 3 << '<A '<: '<: '<4 3 (1, b, 3) (1, b, 4) (1, c, 3) (1, c, 4) (2, a, 3) (2, a, 4) (2, b , 3) (2, b, 4) (2, c, 3) Observe that the tree is constructed from :<c left t o right, and (2, that c, 4) the number of branches at each point corresponds to the number of ways the next event can occur. Example 2: Mark and Eric a r e to play a tennis tournament. The first person to win two games in a row or who wins a total of three games wins the tournament. The following diagram gives the various ways the tournament can occur. Observe t h a t there a r e 10 endpoints which correspond to the 10 ways that the tournament can occur: MM, MEMM, MEMEM, MEMEE, MEE, EhIM, EMEMM, EMEME, E M E E , E E The path from the beginning of the tree to the endpoint describes who won which game in the individual tournament. 176 177 T R E E DIAGRAMS CHAP. 161 Example 3: A man h a s time t o play roulette at most five times. A t each play he wins or loses a dollar. The man begins with one dollar and will stop playing before the five times if he loses all his money or if he wins three dollars, i.e. if he h a s four dollars. The tree diagram describes the way the betting can occur. Each number in the diagram denotes t h e number of dollars he has at t h a t point. The betting can occur in 11 different ways. Note t h a t he will stop betting before the five times a r e up in only three of the cases. 3- Solved Problems 16.1. Construct the tree diagram f o r the number of permutations of {a, b, c } . abc c acb bac a bca The six permutations a r e listed on the right of the diagram. 16.2. Find the product set { 1,2,3} x ( 2 , 4 } x {2,3,4} by constructing the appropriate tree diagram. 2 6 1 2 3 4 (1,2, 2) (1, 2, 3) (1, 2, 4) (1, (1,4, 2) 4) 3) ,,/'54<: 245$: 4 (2, 2, 2) (2, 2, 3) (2, 2, 4) \ /2e! (3, 2, 2) (3,2, (3, 2, 4) 3) -3\4+23 (3, 4, 2) 3) 4 (3,4, 4) The eighteen elements of the product set a r e listed to the r i g h t of the tree diagram. 178 T R E E DIAGRAMS [CHAP. 16 16.3. Teams A and B play in a basketball tournament. The team that first wins 3 games wins the tournament. Find the number of possible ways in which the tournament can occur. Construct the appropriate tree diagram: The tournament can occur in twenty ways: AAA, AABA, AABBA, AABBB, ABAA, ABABA, ABABB, ABBAA, ABBAB, ABBB BAAA, BAABA, BAABB, BABAA, BABAB, BABB, BBAAA, BBAAB, BBAB, BBB 16.4. A man is a t the origin on the x-axis and takes a one unit step either to the left or to the right. He stops if he reaches 3 or -3, or if he occupies any position, other than the origin, more than once. Find the number of different paths the man can travel. Construct t h e appropriate tree diagram. There a r e 14 different paths, each path corresponding to a n end point of the tree diagram. TREE DIAGRAMS CHAP. 161 179 16.5. Teams A and B play a hockey game consisting of three periods. The number of goals (points) scored in each period satisfies the following conditions: (ij The total number of points scored by both teams in any one period is one 01' two. (iij The teams are never tied a t the end of any period. (iii) Neither team is ahead by more than 2 points a t the end of any period. (iv) Team A is leading a t the end of the first period. Find the number of ways the scoring can occur if the above conditions a r e satisfied. In how many ways will team B win? Note first, by condition (i), t h a t the scoring in a n y one period can occur in only five ways: B j B j 1jR.l The number a t the t op is the number of points scored by team A , and the number at t h e bottom by team B. Construct the appropriate tree diagram: Observe t h a t condition (iv) allows only two of the above five scores to occur in t h e first period. By the tree diagram, the scoring can occur in only 13 ways, and in 4 of these which a r e exhibited on the right of the diagram, Team B will win. T R E E DIAGRAMS 180 [CHAP. 16 16.6. Teams A and B play in baseball's world series. (Here, the team that first wins 4 games wins the series.) Find the number of ways the series can occur if (i) A wins the first game and (ii) the team that wins the second game also wins the fourth game. Construct the appropriate tree diagram: Observe t h a t there is only one choice in the fourth game, the winner of the second game. The series can occur in 15 ways. 16.7. In the following diagram let A , B , . . ., F denote islands, and the lines connecting them bridges. A man begins at A and walks from island to island. He stops for lunch when he cannot continue to walk without crossing the same bridge twice. Find the number of ways that he can take his walk before eating lunch. Construct the appropriate tree diagram: C A D D B D F E F There are eleven ways t o take his walk. Observe t h a t he must e a t his lunch at either B , D or E. CHAP. 161 181 TREE DIAGRAMS 16.8. Consider the adjacent diagram with nine points A , B , C , R , S, T,X,Y ,2. A man begins at X and is allowed t o move horizontally o r vertically, one step at a time. He stops when he cannot continue to walk without reaching the same point more than once. Find the number of ways he can take his walk, if he first moves from X to R. (By symmetry, the total number of ways is twice this.) A- 1 I-1-1 1-1 X-Y- z Construct the appropriate tree diagram: / ‘s/ / z‘ \ Y\ z ‘Y T Y z Y s Y z T C C B A There ar e 1 0 different trips. (Note t h at in only 4 of them a re all nine points covered.) Supplementary Problems 16.9. Construct the tree diagram f o r t h e number of permutations of { a , b , c , d } . 16.10. Find the product set {a,b , c } X {a,c } X {b} X { b , c} by constructing the appropriate tree diagram. 16.11. Teams A and B play in a basketball tournament, The first team tha t wins two games in a row o r a total of four games wins the tournament. Find the number of ways the tournament can occur. 16-12. A man has time t o play roulette five times. He wins or loses a dollar a t each play. The m a n begins with two dollars and will stop playing before the five times if he loses all his money or wins three dollars (i.e. has five dollars). Find the number of ways the playing can occur. 16.13. A man is at t h e origin on t h e x-axis and takes a unit step either t o the left or to the right. H e stops after 5 steps or if he reaches 3 or -2. Construct the tree diagram to describe all possible paths the man can travel. 16.14. Teams A and B play in baseball’s world series. (The team that first wins 4 games wins the series.) Find the number of ways the series can occur if the team t h a t wins the first game also wins the third game, and the team t h a t wins the second game also wins the fourth game. 16.15. Solve Problem 16.7 if the man (i) begins a t E , (ii) begins at F . 16.16. Solve Problem 16.8 if the man (i) begins a t Y and first moves to S, (ii) begins a t Y and first moves to 2. 16.17. Solve Problem 16.5 if condition (ii) is replaced by the following: (ii) The game does not end in a tie. 182 TREE DIAGRAMS [CHAP. 16 Answers to Supplementary Problems 16.12. 16.13. Hint. The tree is essentially the same as the tree of the preceding problem. 16.15. (i) (ii) E- C- D D F- C - D C - D E F- F \ TREE DIAGRAMS CHAP. 161 16.16. (i) sqB<; / R<y I’ 183 ‘4 B C R T X B A z T z R X T-=====Z (ii) - 2 0 - 2 0 __ __ 1 0 __ 0 __ 1 __ - 0 2 __ - and 1 0 - 1 __ 0 0 ~ 1 0 ~ 2 d Chapter 17 Probability INTRODUCTION Probability is the study of random or nondeterministic experiments. If a die is tossed in the air, then it is certain that the die will come down, but it is not certain that, say, a 6 will appear. However, suppose we repeat this experiment of tossing a die; let s be the number of successes, i.e. the number of times a 6 appears, and let n be the number of tosses. Then it has been empirically observed that the ratio f = sln, called the relative f r e q u e n c y , becomes stable in the long ru;, i.e. approaches a limit. This stability is the basis of probability theory. In probability theory, we define a mathematical model of the above phenomenon by assigning “probabilities” (or: the limit values of the relative frequencies) to each possible outcome of an experiment. Furthermore, since the relative frequency of each outcome is non-negative and the sum of the relative frequencies of all possible outcomes is unity, we require that our assigned “probabilities” also satisfy these two properties. The reliability of our mathematical model for a given experiment depends upon the closeness of the assigned probabilities to the actual relative frequency. This then gives rise to problems of testing and reliability which form the subject matter of statistics. Historically, probability theory began with the study of games of chance, such as roulette and cards. The probability 11 of an event A was defined as follows: if A can occur in s ways out of a total of n equally likely ways, then s p = P(A) = n For example, in tossing a die an even number can occur in 3 ways out of 6 “equally likely” ways; hence p = f = 4. This classical definition of probability is essentially circular since the idea of “equally likely” is the same as that of “with equal probability” which has not been defined. The modern treatment of probability theory is purely axiomatic. In particular, we require that the probabilities of our events satisfy the three axioms listed in Theorem 17.1. However, those spaces consisting of n equally likely elementary events, called equiprobable spaces, are an important class of finite probability spaces and shall provide us with many examples. SAMPLE SPACE AND EVENTS The set S of all possible outcomes of some given experiment is called the sample space. A particular outcome, i.e. an element in S, is called a sample point o r sample. An event A is a set of outcomes or, in other words, a subset of the sample space S. In particular, the set { a } consisting of a single sample a E S is an event and called an eZewzentury event. Furthermore, the empty set $2 and S itself are subsets of S and so are events; $3 is sometimes called the impossible event. 184 185 PROBABILITY CHAP. 171 Since an event is a set, we can combine events to form new events using the various set operations: (i) A U B is the event that occurs iff A occurs o r B occurs (or both); (ii) A n B is the event that occurs iff A occurs and B occurs; (iii) Ac, the complement of A , also written A, is the event that occurs iff A does not occur. Two events A and B are called mutually exclusive if they are disjoint, i.e. A n B = 8. In other words. A and B are mutually exclusive iff they cannot occur simultaneously. Example 1.1: Experiment: Toss a die and observe the number t h a t appears on top. sample space consists of the six possible numbers: Then the S = (1, 2, 3, 4, 5, 6 ) Let A be the event t h a t a n even number occurs, B t h a t a n odd number occurs and C t h at a prime number occurs: A = {2,4,6}, B = {1,3,5}, C = {2,3,5} Then: A u C = { Z , 3 , 4 , 5, 6} is the event t h a t a n even or a prime number occurs; B n C = (3, 5) is the event t h a t a n odd prime number occurs; Cc = (1, 4, S} is the event t h a t a prime number does not occur. Note t h a t A and B a r e mutually exclusive: A n B = 0;in other words, a n even number and a n odd number cannot occur simultaneously. Example 1.2: Experiment Toss a coin 3 times and observe the sequence of heads ( H ) and tails ( T) t h a t appears. The sample space S consists of eight elements: S = { HHH , H H T, H TH , H TT, TH H , TH T, TTH, TTT} Let A be t h e event t h a t two or more heads appear consecutively, and B t h a t all the tosses are the same: A = {HHH,HHT,THH} and B = {H H H ,TTT} Then A n B = (HHH} is the elementary event in which only heads appear. The event t h a t 5 heads appear is the empty set 8. Example 1.3: Experiment: Toss a coin until a head appears and then count the number of times the coin w a s tossed. The sample space of this experiment is S = (1,2,3, . . ., -}. refers t o the case when a head never appears and so the coin is tossed a n Here infinite number of times, This is a n example of a sample space which is co~72tabZy infinite. Example 1.4: Experiment: Let a pencil drop, head first, into a rectangular box and note the point on the bottom of the box t h at t h e pencil first touches. Here S consists of all the points on t h e bottom of the box. Let t h e rectangular ar ea on the r i g h t represent these points. Let A and B be the events t h a t t h e penciI drops into the corresponding areas illustrated on the right. This is a n example of a sample space which is not finite nor even countably infinite, i.e. which is non-denumerable. For theoretical reasons every subset of this space cannot be considered to be an event. I S The sample spaces in the preceding two examples, as noted, are not finite. The theory concerning such sample spaces, especially the non-denumerable case, lies beyond the scope of this text. Thus, unless otherwise stated, all our sample spaces S shall be finite. 186 PROBABILITY [CHAP. 17 FINITE PROBABILITY SPACES Let S be a finite sample space: S = {u,,a,, . . ., a , l ] . A finite probability space is obtained by assigning to each point a7E S a real number TI,, called the probability of at, satisfying the following properties: (i) each p l is non-negative, p i1 0 (ii) the sum of the p l is one, p , + p 2 + * * . +pn = 1. The probability of any event A , written P ( A ) , is then defined to be the sum of the probabilities of the points in A . For notational convenience we write P(u,) for P ( { a , } ) . Example 2.1: Let three coins be tossed and the number of heads observed; then the sample space is S = { 0 , 1 , 2 , 3 ) . We obtain a probability space by the following assignment P(0) = 4, P(1) = #, P(2) = Q P(3) = and 8 since each probability is non-negative and the sum of the probabilities is 1. Let A be the event t h a t a t least one head appears and let B be the event t h a t all heads or all tails appear’ A = (1, 2, 3) 2nd B = (0, 3) Then, by definition, P(A) = P(1) and Example 2.2: + P(2) P(B) = P(0) + 38 + 18 & +4 = P ( 3 ) = 38 + P(3) = * = 2 8 Three horses A , B and C a r e in a race; A is twice a s likely to win a s B and 13 is twice as likely to win a s C. W h a t a r e their respective probabilities of winning, i.e. P ( A ) ,P ( B ) and P(C)? Let P ( C ) = p ; since B is twice as likely t o win as C, P ( B ) = 2 p ; and since A is twice a s likely t o win as B , P ( A ) = 2 P ( B ) = 2 ( 2 p ) = 4p. Now the sum of the probabilities must be 1: hence p + Z p f 4 p = l or 7 p = 1 or p = + Accordingly, P(A) = 4p = 4, P(B) = 2 p = +, P(C) = p = 3 Question: Wha t is the probability t h a t B or C wins, i.e. P ( { B ,C})? By definition P ( { B ,C}) = P ( B ) + P(C) = + + 3 = + EQUIPROBABLE SPACES Frequently, the physical characteristics of a n experiment suggest that the various outcomes of the sample space be assigned equal probabilities. Such a finite probability space S, where each sample point has the same probability, will be called an equiprobable space. In particular, if S contains n points then the probability of each point is lln. T Furthermore, if an event A contains T points then its probability is r * 1- = -. In other words, n n number of elements in A = number of elements in S or P(A) = number of ways that the event A can occur number of ways that the sample space S can occur We emphasize that the above formula for P ( A ) can only be used with respect to a n equiprobable space, and cannot be used in general. The expression “at random” will be used only with respect to a n equiprobable space; formally, the statement “choose a point at random from a set S” shall mean that S is a n equiprobable space, i.e. that each sample point in S has the same probability, PROBABILITY CHAP. 171 187 Let a card be selected a t random from a n ordinary deck of 52 cards. Example 3.1: Let A = {the card is a spade} B = {the card is a face card, i.e. a jack, queen o r king} and We compute P ( A ) ,P ( B ) and P ( A n B ) . Since we have a n equiprobable space, number of spades 13 = 1 = number of cards 52 4 P ( An B ) = Example 3.2: P(B) = number of face cards - 12 number of cards 52 _ -3 - 13 number of spade face cards - 5 number of cards 52 Let 2 items be chosen a t random from a lot containing 12 items of which 4 a re defective. Let A = {both items a r e defective} Find P ( A ) and P ( B ) . and B = {both items a r e non-defective) Now = 66 ways, the number of ways tha t 2 items can be chosen from 12 items; 5' can occur in A can occur in (;) = 6 ways, the number of ways tha t 2 defective items can be chosen from 4 defective items; B can occur in (28) = 28 ways, the number of ways tha t 2 non-defective items can be chosen from 8 non-defective items. Accordingly, P ( A ) = = and P ( B ) = = E. 33 THEOREMS ON FINITE PROBABILITY SPACES The following theorem follows directly from the fact that the probability of an event is the sum of the probabilities of its points. Theorem 17.1: The probability function P defined on the class of all events in a finite probability space satisfies the following axioms: [P,] For every event A , 0 6 P ( A ) 1. [Pal P ( S )= 1. [P,] If events A and B are mutually exclusive, then P ( A U B) = P(A) P(B) + Using mathematical induction, [PSI can be generalized as follows: Corollary 17.2: If A , , A,, . . . ,AT are pairwise mutually exclusive events, then P(AJ P(A,UA,U * . U A r ) = P ( A , ) P(A,) . * . + + + Using only the properties of Theorem 17.1, we can prove (see Problems 4.19-4.22). Theorem 17.3: If @ is the empty set, and A and B are arbitrary events, then: (i) (ii) (iii) (iv) P(P) = 0; P(Ac) = 1 - P ( A ) ; P ( A \B) = P ( A ) - P ( A n B ) , i.e. P ( A n B c ) = P(A) - P ( An B ) ; A c B implies P ( A )L P ( B ) . Property (ii) of the preceding theorem is very useful in many applications. Example 4.1: Refer to Example 3.2, where 2 items a r e chosen at random from 12 items of which 4 a r e defective. Find the probability p t h a t at least one item is defective. Now is the complement of C = {a t least one item is defective} B = {both items a r e non-defective} [CHAP. 1 7 PROBABILITY 188 i.e. C = Be; and we found t h a t P ( B ) = g. Hence 14 19 p = P ( C ) = P(BC) = 1 - P ( B ) = 1 - 33 = 33 The odds t h at an event with probability p occurs is de$e$ to be the ratio p : (1- p ) . Th u s t h e odds t h at a t least one item is defective is 5 :33 or 19 : 14 which is read “19 t o 14”. Observe that axiom [P,] gives the probability of a union of events in the case that the events are mutually exclusive, i.e. disjoint. The general formula follows: Theorem 17.4: Corollary 17.5: For any events A and B , P ( A U B ) = P ( A ) + P ( B ) - P ( A n B ) . For any events A , B and C, P ( A U B U C ) = P ( A ) P(B) P ( C )- P ( A n B ) - P ( A n C ) - P(BnC) + P ( A n B n C ) Example 4.2: + + Of 100 students, 30 a r e taking mathematics, 20 a r e taking music and 10 a r e taking both mathematics and music. If a student is selected at random, find the probability p t h a t he is taking mathematics or music. Let A = {students taking mathematics} and B = {students taking music}. Then A n B = {students taking mathematics and music} and A U B = {students taking mathematics o r music}. 30 - 3 2 0 - 1 Since t h e space is equiprobable, then P ( A ) = - lo, P ( B ) = - s, P(AnB)= = Thus by Theorem 17.4, k. p = P(AuB) = P(A)+P(B)-P(AnB) = $+i-f = 5 CLASSICAL BIRTHDAY PROBLEM The classical birthday problem concerns the probability p that n people have distinct birthdays. In solving this problem, we assume that a person’s birthday can fall on any day with the same probability. Since there are n people and 365 different days, there are 365” ways in which the n people can have their birthdays. On the other hand, if the n persons are to have distinct birthdays, then the first person can be born on any of the 365 days, the second person can be born on the remaining 364 days, the third person can be born on the remaining 363 days, etc. Thus there are 365 * 364.363 . . (365 - n 1) ways the n persons can have distinct birthdays. Accordingly, 365-364.363 * (365 - n 1) - 365 364 363 . . . 365 - n 1 + e P = 365“ + -.-.-365 - 365 + 365 365 It can be shown that for n s 2 3 , p < $; in other words, amongst 23 or more people it is more likely that a t least two of them have the same birthday than that they all have distinct birthdays. Solved Problems SAMPLE SPACES AND EVENTS 17.1. Let A and B be events. Find an expression and exhibit the Venn diagram for the event that: (i) A but not B occurs, i.e. only A occurs; (ii) either A or B, but not both occurs, i.e. exactly one of the two events occurs. (i) Since A but not B occurs, shade the a re a of A outside of B as in Figure (a) below. Note t h a t Bc, the complement of B , occurs since B does not occur; hence A and Bc occurs. I n other words, the event is A n B c . CHAP. 171 PROBABILITY A but not B occurs. 189 Either A o r B , but not both, occurs. (b) (4 (ii) Since A or B but not both occurs, shade the a re a of A and B except where they intersect a s in Figure ( b ) above. The event is equivalent to A but not B occurs or B but not A occurs. Now, as in (i), A but not B is the event A n B c , and B but not A is the event BnAc. Thus the given event is ( A n B c ) u ( B n A c ) . 17.2. Let A , B and C be events. Find an expression and exhibit the Venn diagram for the event that (i) A and B but not C occurs, (ii) only A occurs. (i) Since A and B but not C occurs, shade the intersection of A and B which lies outside of C, as in Figure ( a ) below. The event is A nBn Cc. I 1 A and B but not C occurs. (4 (ii) Only A occurs. (b) Since only A is to occur, shade the area of A which lies outside of B and of C, as in Figure ( b ) above. The event is A nBcn Cc. 17.3. Let a coin and a die be tossed; let the sample space S consist of the twelve elements: S = {Hl, H2, H3, H4, H5, H6, T1, T2, T3, T4, T5, T6} (i) Express explicitly the following events: A = {heads and an even number appears}, B = {a prime number appears}, C = {tails and an odd number appears}. (ii) Express explicitly the event that: (a) A or B occurs, ( b ) B and C occurs, ( c ) only B occurs. (iii) Which of the events A , B and C are mutually exclusive? (i) To obtain A , choose those elements of S consisting of a n H and a n even number: A = (H2, H4, H6). To obtain B, choose those points in S consisting of a prime number: B = (H2, H3, H5, T2, T3, T5). To obtain C, choose those points in S consisting of a T and a n odd number: C = {Tl,T3, T5). (ii) (a) A or B = A u B = (H2, H4, H6, H3, H5, T2, T3, T5) ( b ) B and C = B n C = {T3,T5} of B which do not lie in A or C: B n A c n C c = {H3,H5,T2}. ( c ) Choose those elements (iii) A and C ar e mutually exclusive since A n C = @. 190 [CHAP. 1 7 PROBABILITY FINITE PROBABILITY SPACES 17.4. Suppose a sample space S consists of 4 elements: S = {u,, a,, a3,ad}. Which function defines a probability space on S ? (i) P(al)= +, P(a,) = $, P(a,) = 4, P(a,) = &. (ii) P(a,) = 4,P(a,) = $, P ( a J = -&, P(a,) = +. (iii) P(a,) = 3, P(a,) = d, P(aJ = 6, P(a,) = 6. (iv) P(al)= 4,P(a,) = t, P ( a , ) = $, P(aJ = 0. (i) Since the sum of the values on the sample points is greater than one, the function does not define a probability space on S. ++$ +i + Q = s, (ii) Since P(a,) = -4, a negative number, the function does not define a probability space on S. (iii) Since each value is non-negative, and the sum of the values is one, function does define a probability space on S. & + & +Q + Q = 1, the (iv) The values a r e non-negative and add up to one; hence the function does define a probability space on S. 17.5. Let S = {ul, a,, a,,a4}, and let P be a probability function on S. (i) Find P(a,) if P(a,) = +, P(a,) = &, P(a,) = +. (ii) Find P ( u l ) and P(a,) if (iii) Find P(a,) if (i) P(a,) = P(a,) = & and P ( a l ) = 2P(a,). P({a,,a,}) = b, P((a,,a,}) = + and P(a,) = Q. Let Pia,) = p . Then, f o r P t o be a probability function, the sum of the probabilities on the & f 8 = 1 o r p = 18 sample points must be one: ?I +++ (ii) Let P(a,) = p , then P ( a l )= 2 p . Hence 2 p f p - k P ( a l ) = +. & +i = 1 or p = P(a,) = P({az,u,}) - P(a2) = Q - (iii) Let P ( a l ) = p . P ( a J = P({a,,u4}) - P(a,) = Then p + 4+ + + Q = 1 or p = +, tha t is, P(aJ = + 4- Q 9. Thus P(a,) = & and = Q = Q 4. 17.6. A coin is weighted so that heads is twice as likely to appear as tails. Find P(T) and P(H). Let P I T )= p ; then P(H) = 2 p Now set the sum of the probabilities equal to one: p or p = $. Thus P(T)= p = Q and P ( H ) = 2 p = 3. +2p = 1 17.7. Two men, 7n1 and m,, and three women, w,,w, and w,,are in a chess tournament. Those of the same sex have equal probabilities of winning, but each man is twice as likely to win as any woman. (i) Find the probability that a woman wins the tournament. (ii) If m, and 20, are married, find the probability that one of them wins the tournament. Set P(zul)= p ; then P(w2)= P ( w 3 )= p and P ( m l )=P(m,) = 2 p . Next se t the sum of the probabilities of the five sample points equal to one: p p p 21, 2 p = 1 or p = +, + + + + We seek ii) P({wl, w,,wn}) and (ii) P({ml,wl}). Then, by definition, + P(WQ)= 3 + + + 3 = 4 P({rnl,w,})= P ( m J + P(W1) = 9 + + = $ P({Wl, w,,w,}) + = P(Wl) P(W,) CHAP. 171 191 PROBABILITY 17.8. Let a die be weighted so that the probability of a number appearing when the die is tossed is proportional to the given number (e.g. 6 has twice the probability of appearing as 3). Let A = {even number), B = {prime number), C = {odd number). (i) Describe the probability space, i.e. find the probability of each sample point. (ii) Find P ( A ) , P(B) and P ( C ) . (iii) Find the probability that: ( a ) an even o r prime number occurs; prime number occurs; (c) A but not B occurs. (i) ( b ) an odd Let P ( 1 ) = p . Then P ( 2 ) = 2 p , P ( 3 ) = 3 p , P ( 4 ) = 4 p , P ( 5 ) = 5 p and P ( 6 )= 6 p . Since the sum of the probabilities must be one, we obtain p 2 p 3 p 4 p 5 p 613 = 1 o r p = 1 / 2 1 . Thus 1 2 P(1) = 21, P ( 2 ) = E, P ( 3 ) $, P(4) = P(5) = P(6) = $ + + + + + i, (ii) P ( A ) = P ( ( 2 , 1, 6 ) ) = +, P ( B ) = P ( ( 2 , 3, 5 ) ) = g, &, P(C) = P({1,3,5}) = +. (iii) ( a ) The event that an even o r prime number occurs is A L I B = { 2 , 4 , 6 , 3 , 5 } , o r that 1 does not occur. Thus P ( A u B ) = 1- P(1) = g. ( b ) The event t h a t a n odd prime number occurs is 8 P ( { 3 , 5 ) ) = 21. (c) BnC = (3,5}. Thus P(BnC) = The event that A but not B occurs is A n B c = { 4 , 6 ] . Hence P ( A n B c ) = P ( { 4 , 6 ) ) = 8. 17.9. Find the probability p of an event if the odds that it will occur are a : b, that is, ‘‘a to b”. The odds that an event with probability p occurs is the ratio p : (1- p ) . Hence a a -P l - p -- - b or b p = a - a p or a p + b p = a or p = a f b 17.10. Find the probability p of an event if the odds that it will occur are “3 to 2”. i. 2 ,= f from which p = We can also use the formula of the preceding problem t o obtain 1-P 3 3 the answer directly: p = = = 5. 5 EQUIPROBABLE SPACES 17.11. Determine the probability p of each event: (i) an even number appears in the toss of a f a i r die; (ii) a king appears in drawing a single card from an ordinary deck of 52 cards; (iii) a t least one tail appears in the toss of three fair coins; (iv) a white marble appears in drawing a single marble from an urn containing 4 white, 3 red and 5 blue marbles. (i) The event can occur in three ways ( a 2 , 4 o r 6) out of 6 equally likely cases; hence p = (ii) There a r e 4 kings among the 52 cards; hence p = = = i. 3. (iii) If we consider the coins distinguished, then there a r e 8 equally likely cases: H H H , H H T , HTH,HTT,THH,THT,TTH,TTT.Only the first case is not favorable to the given event; hence p = $. (iv) There are 4 - 3 +5 = 12 marbles, of which 4 a r e white; hence p = 5 = f. PROBABILITY 192 [CHAP. 17 17.12. Two cards are drawn a t random from an ordinary deck of 52 cards. Find the probability p that (i) both are spades, (ii) one is a spade and one is a heart. (i) There a r e it2)= 1326 ways to draw 2 cards from 52 cards. There ar e (i3)= 78 ways to d r a w 2 spades from 1 3 spades; hence P = number of ways 2 spades can be drawn number of ways 2 cards can be drawn - __ 78 - 1326 - 3 - 51 (ii) Since there a r e 13 spades and 13 hearts, there a r e 13 * 13 = 169 ways to dra w a spade and a heart; hence p = 1326 = 102 =. 17.13.Three light bulbs are chosen at random from 15 bulbs of which 5 are defective. Find the probability p that (i) none is defective, (ii) exactly one is defective, (iii) a t least one is defective. There a r e ):'( (i) = 455 ways to choose 3 bulbs from the 15 bulbs. Since there a r e 15 - 5 = 10 non-defective bulbs, there a r e (f,") = 120 ways to choose 3 nondefective bulbs. Thus p = L455 ?! = %. 91 (ii) There ar e 5 defective bulbs and (120) = 45 different pairs of non-defective bulbs; hence there a r e 5 * 45 = 225 ways to choose 3 bulbs of which one is defective. 22s Thus p = 45j 45 = 91. (iii) The event t h a t at least one is defective is the complement of the event t h a t none a re defective 24 which has, by (i), probability 91. 24 - 67 Hence p = 1-91 - 91. 17.14. Two cards are selected a t random from 10 cards numbered 1 to 10. Find the probability p that the sum is odd if (i) the two cards are drawn together, (ii) the two cards are drawn one after the other without replacement, (iii) the two cards are drawn one after the other with replacement. (i) There a r e (io)= 45 ways t o and the other is even. 5.5 5 select 2 cards out of 10. The sum is odd if one number is odd There a re 5 even numbers and 5 odd numbers; hence there a r e 26 ways of choosing an even and a n odd number. 25 5 Thus p = 45 = 9. (ii) There ar e 1 0 . 9 = 90 ways t o draw two cards one a fte r the other without replacement. There a r e 5 5 = 25 ways t o draw a n even number and then a n odd number, and 5 5 = 25 ways to draw an odd number and then a n even number; hence p = 7 - 5 - g. - (iii) There ar e 1 0 . 1 0 = 100 ways t o draw two cards one a fte r the other with replacement. As in (ii), there a r e 5 . 5 = 25 ways t o dra w a n even number and then a n odd number, and - 25-25 - 5 5 = 25 ways to d r aw an odd number and then a n even number; hence p = 100 - 50 - 1 - 2. 17.15. An ordinary deck of 52 cards is dealt out a t random to 4 persons (called North, South, East and West) with each person receiving 13 cards. Find the probability p of each event (answers may be left in combinatorial and factorial notation): CHAP. 171 193 PROBABILITY (i) North receives all 4 aces. (ii) North receives no aces. (iii) North receives 5 spades, 5 clubs, 2 hearts and 1 diamond. (iv) North and South receive between them all the spades. (v) North and South receive between them all the aces and kings. (vi) Each person receives a n ace. (i) North can be dealt 13 cards in (!:) ways. If he receives the 4 aces, then he can be dealt the other 9 cards from the remaining 48 cards in (",") ways. (E)ways. (ii) North can be dealt 13 cards in froni the remaining 48 cards in Thus If he receives no aces, then he can be deaIt 13 cards ways. T h u s ways. He can be dealt 5 spades in (i3) ways, 5 clubs in (iii) North can be dealt 13 cards in (): (F) ways, 2 hearts in (i3)ways, and 1 diamond in (If) = 13 ways. Hence p = (iv) North and South can be dealt 26 cards between them in (ii) ways. 13 13 13 13 (5 )(5 )(2 1 ) If they receive the 13 spades, then they can be dealt 13 other cards from the remaining 39 cards in ):(; Hence p = (;!) (3 ways. / (!:). (ii)ways. (v) North and South can be dealt 26 cards between them in and the 4 kings, then they can be dealt the other 2 6 - 8 44 52 - 8 = 44 cards in (18) ways. Thus p = ):;( / (;",. If they receive the 4 aces = 18 cards from the remaining ways (an ordered partition). (vi) The 52 cards can be distributed among the 4 persons in 1 3 ! l& The remaining 48 cards can be distributed Now each person can get a n ace in 4 ! ways. among the 4 persons in 481 12! 121 12! 12! Thus ways. 48! 4 ! 1 2 ! 12: l e ! IZ! P = 521 13! 131 13! 131 - 4 ! 134 52 51 * 50 * 49 - 133 17 * 25 * 49 17.16. Six married couples a r e standing in a room. (i) If 2 people a r e chosen a t random, find the probability p that (a) they a r e married, ( b ) one is male and one is female. (ii) If 4 people are chosen a t random, find the probability p t h a t (a) 2 married couples are chosen, ( b ) no married couple is among the 4, (c) exactly one married couple is among the 4. (iii) If the 12 people are divided into six pairs, find the probability p that (a) each pair is married, ( b ) each pair contains a male and a female. 194 PROBABILITY (i) There a r e (7)= 66 ways t o choose 2 people from the 6 ((1) [CHAP. 17 12 people. 1 There a r e 6 married couples: hence p = 66 = fi. ( b ) There a r e 6 ways to choose a male and 6 ways to choose a female; hence p = (ii) There a r e 6 6 6 66 = 11. (y)= 495 ways t o choose 4 people from the 12 people. ( a ) There are (:) = 15 ways to choose 2 couples from the 6 couples; hence p 15 1 a = ,%. 1 ( b ) The 4 persons come from 4 different couples. There a r e (!) = 15 ways to choose 4 couples from the 6 couples, and there a r e 2 ways to choose one person from each couple. Hence p = - 2.2.2.2.15 - 16 495 (c) - 33. This event is mutually disjoint from the preceding two events (which are also mutually 16 disjoint) and a t least one of these events must occur. Hence p f $ f = 1 o r p = 33. (iii) There a r e in each. 12! - 12! 2!2!2!2!2!2! - 5 ; ways to partition the 12 people into 6 ordered cells with 2 people ((0 The 6 couples can be placed into the 6 ordered cells in 6! ways. Hence p = & = &. ( b ) The six men can be placed one each into the 6 cells in 6 ! ways, and the 6 women can be 6: 6 ! 16 placed one each into the 6 cells in 6! ways. Hence p = 12!/26= 231. 17.17. If 5 balls are placed in 5 cells a t random, find the probability p that exactly one cell is empty. There a r e 53 ways to place the 5 balls in the 5 cells. If exactly one cell is empty, then another cell contains 2 balls and each of the remaining 3 cells contains one ball. There a r e 5 ways to select the empty cell, then 4 ways t o select the cell containing 2 balls, and (:) = 10 ways t o select 2 balls to go into this cell. Finally there a r e 3 ! ways to distribute the remaining 3 balls among 103! - 48 the remaining 3 cells. Thus p = j.q. 125' 55 17.18.A class contains 10 men and 20 women of which half the men and half the women have brown eyes. Find the probability p that a person chosen a t random is a man or has brown eyes. Let A = {person is a man} and B = {person has brown eyes}. Then P ( A ) = lo = 30 $, P(Bj = 30 = 1 2' p = P(AuB) = P(A) P(AnB)= 30 = + P(B)- P ( A n B ) 5. We seek P ( A u B ) . Thus, by Theorem 17.4, = g++-Q = 3 PROOFS OF THEOREMS 17.19. Prove Theorem 17.3(i): If $3 is the empty set, then P ( @ )= 0. Let A be a n y event; then A and (3 a r e disjoint and A U $3 = A . + Thus P ( A ) = P ( A U @ ) = P ( A ) P(@), from which our result follows. 17.20. Prove Theorem 17.3(ii): For any event A , P ( A C )= 1- P ( A ) . The sample space S can be decomposed into t h e mutually exclusive events A and Ac: Thus 1 = P ( S ) = P ( A WAC) = P ( A ) P(Ac), from which our result follows. S = A UAC. CHAP. 171 195 PROBABILITY 17.21. Prove Theorem 1'7.3(iii): = P ( A )- P ( A n B ) P(A\B) A can be decomposed into t h e mutually exclusive events A \ B and A n B : A = ( A \B) u ( A n B ) . Thus P(A) = P(A\ B) from which our result follows. + P ( An B ) A is shaded. 17.22. Prove Theorem 17.3(iv): If A c B , then P ( A )6 P(B). If A c B , then B can be decomposed into the mutually exclusive events A and B\A. Thus P ( B ) = P ( A ) P(B\A) + The result now follows from t h e f act t h a t P ( B \A) 2 0. B is shaded. 17.23. Prove Theorem 17.4: For any events A and B, P ( AU B ) = P ( A ) P ( B ) - P ( A n B ) + A \B Note t h a t A u B can be decomposed into the mutually exclusive events and B : A UB = ( A \B)UB. Thus, using Theorem 1'7.3(iii), we have + P ( A u B ) = P ( A \B) P(B) = P ( A ) - P ( An B ) f P ( B ) = P ( A ) P ( B ) - P ( AnB ) ~ + A U B is shaded. 17.24.Prove Corollary 17.5: For any events A , B and C, P ( A u B u C ) = P ( A ) + P ( B ) + P ( C ) -P ( A n B ) - P ( A n C ) - P ( B n C ) + P ( A n B n C ) Let D = B u C . Then P(AnD) = P(AnB) A n D = A n ( B u C ) = ( A n B ) U ( A n C ) and + P(AnC)- P(AnBnAnC) = Thus P(AnB) + P(AnC)- P ( A n B n C ) + P(AuBuC) = P(AUD) = P(A) P(D) - P(AnD) = P ( A ) P ( B ) P ( C ) - P ( B n C ) - [ P ( A n B )+ P ( A n C ) - P ( A n B n C ) ] = P(A) P(B) P(C) - P ( B n C ) - P ( A n B ) - P(AnC) P ( A n B n C ) + + + + + MISCELLANEOUS PROBLEMS 17.25. Let A and B be events with P ( A ) = Q, P ( B ) = & and P ( A n B ) = &. Find (i) P ( AU B ) , (ii) &'(Ac) and P ( B c ) , (iii) P ( A c n B c ) , (iv) P ( A c U B c ) , (v) P ( A n B c ) , (vi) P ( B n A c ) . (i) P ( A u B ) = P ( A ) + P ( B )- P ( A n B ) = Q (ii) P(Ac) = 1 - P ( A ) = 1 - # = Q and +&-$ = Q P ( B ) = 1- P(B) = 1 - Q = 1 2 (iii) Using De Morgan's Law, ( A u B ) c = ACnBc, we have = 1- 8 = 8 P(AcuBc) = P ( ( A n B ) c ) = 1 - P ( A n B ) = 1 - & = p P (A cn B c) = P ( ( A u B ) c ) = 1 - P(AuB) (iv) Using De Morgan's Law, ( A n B ) c = A C U B Cwe , have [CHAP. 1 7 PROBABILITY 196 Equivalently, P(AcUt7c) = P(Ac) + P(Bc) - P(AcnBC) = P(A)-P(AnB) = (v, P ( A n B c ) = P(A\B) (vi) P ( B n A c ) = P ( B ) - P ( A n B ) = 8-$ (iii Substitute in P ( A u B ) = P ( A ) + P ( B ) - P ( A n B ) to obtain = .20 + = 8 4 and P ( A n B ) = Q. Find 4 = ++P(B)-& & -& = Number 1 2 3 4 5 6 Frequency 14 17 20 18 15 16 (iii, P ( A n B c ) = P ( A ) - P ( A n B ) = number of successes total number of trials The relative frequency f = 8 = Q P(A) = 1-P(Ac) = 1 - 20 2 +-& = 4 (i) (i) f = + 1- & = 17.26. Let A and B be events with P ( A U B ) = $, P ( A c ) = (i) P ( A ) , (ii) P(B),(iii) P ( A n B c ) . t 6 8 (ii) f = $ = .15 or P(B) = 3. ' 17+18+16 - (iii) f = 1 0 0 - .51 17+20+15 - (iv) f = 1 0 0 - .52 17.28. Let S = {ul, a,,. . ., as) and T = {b,, b,, . . ., b,} be finite probability spaces. Let the number p L j= P(a,)P ( b , ) be assigned to the ordered pair (ai, bj) in the product set S x T = ((8,t ): s E S, t E T } . Show that the pli define a probability space on S X T,i.e. that the p , are non-negative and add up t o one. (This is called the product probability space. We emphasize that this is not the only probability function that can be defined on the product set S x T.) Since P(a,),P(b,) Pll 1 0, for each i and each j , pij = P(uJ P(b,) + P12 + . - .+ Plt + P21 + P 2 2 + = + Pzt + * ' ' 0. Furthermore, + Psl + Ps2 + . . * + Pst + + P(a1)P(bt) + + P(aJ P(b1) + + P(as)P(b,) + P(as)[P(bl) + . +P(bt)] P(al)[P(bl)+ . + P(b,)] + P(al).l + + P(a,) '1 = P(al) + + P(as) = 1 = P(a1)P(bJ = * * . 2 * * * * * * * * " * * * * * * * * a ' . CHAP. 171 PROBABILITY 197 Supplementary Problems SAMPLE SPACES AND EVENTS 17.29. Let A and B be events. Find a n expression and exhibit the Venn diagram for the event t h a t (i) A or not B occurs, (ii) neither A nor B occurs. 17.30. Let A , B and C be events. Find an expression and exhibit the Venn diagram f o r the event tha t (i) exactly one of the three events occurs, (ii) a t least two of the events occurs, (iii) none of the events occurs, (iv) A o r B , but not C, occurs. 17.31. Let a penny, a dime and a die be tossed. (i) Describe a suitable sample space S. (ii) Express explicitly the following events: A = {two heads and a n even number appears}, B = {a 2 appears}, C = {exactly one head and a prime number appears}. (iii) Express explicitly t h e event t h a t ( a ) A and B occur, ( b ) only B occurs, ( c ) B o r C occur. FINITE PROBABILITY SPACES 17.32. Which function defines a probability space on S = {a,,a,,a,}? 4 (i) P(aJ = $, P(ad = g, P b 3 ) = 3 (ii) P ( a l ) = 8, P(a,) = -Q, P(a,) = Q (iii) P ( a l ) = Q, &‘(a2)= +, &‘(a,) = (iv) P(al) = 0, P(az)= 5, P(u3)= $ 17.33. Let P be a probability function on S = { a l , a ~ , a 3 } . Find P ( a l ) if (i) P(u2) = Q and P(u3) = 2, (ii) P ( a l ) = 2 P ( a 2 ) and P(a3) = $, (iii) P({aZ,a3}) = 2 P ( a 1 ) , (iv) P(a3) = 2P(u,) and P(a2) = 3 P ( a 1 ) . 17.34. Let A and B be events with P ( A u B ) = P ( A n Bc). 17.35. Let A and B be events with P ( A ) = P(AcuBc) and P ( B n A c ) . 4, P ( A n B ) = $ and P ( A c ) = 4,P ( A U B ) = 3 and P(B)= 3. 8. Find P ( A ) , P ( B ) and Find P ( A n B ) , P(AcnBc), 17.36. A coin is weighted so t h a t heads is three times as likely to appear a s tails. Find P(H) and P(T). 17.37. Three students A , B and C a r e in a swimming race. A and B have the same probability of winning and each is twice as likely t o win as C. Find the probability t h a t B o r C wins. 17.38. A die is weighted so t h a t t h e even numbers have the same chance of appearing, the odd numbers have the same chance of appearing, and each even number is twice as likely to appear a s a n y odd number. Find the probability t h a t (i) a n even number appears, (ii) a prime number appears, (iii) a n odd number appears, (iv) an odd prime number appears. 17.39. Find the probability of an event if t h e odds t h a t it will occur a r e (i) 2 to 1, (ii) 5 to 11. 17.40. I n a swimming race, the odds t h a t A will win a r e 2 to 3 and the odds t h a t B will win a r e 1 to 4. Find the probability p and t h e odds t h at A or B win the race. EQUIPROBABLE SPACES 17.41. A class contains 5 freshmen, 4 sophomores, 8 juniors and 3 seniors. A student is chosen at random to represent the class. Find the probability tha t the student is (i) a sophomore, (ii) a senior, (iii) a junior o r senior. 17.42. One card is selected at random from 50 cards numbered 1 to 50. Find the probability t h a t the number on the card is (i) divisible by 5 , (ii) prime, (iii) ends in the digit 2. 17.43. Of 10 girls in a class, 3 have blue eyes. If two of the girls a r e chosen at random, what is the probability t h a t ( i ) both have blue eyes, (ii) neither ha s blue eyes, (iii) at least one has blue eyes? 17.44. Three bolts and three n u t s a r e put in a box. probability t h a t one is a bolt and one a nut. If two pa rts a r e chosen at random, find the 198 PROBABILITY [CHAP. 17 17.45. Ten students, A , B , . . . , a r e in a class. If a committee of 3 is chosen a t random from the class, find the probability t h a t ( i ) A belongs t o t h e committee, (ii) B belongs t o the committee, ( i i i ' d and B belong to the committee, tivi A or B belongs t o the committee. 17.46. A class consists of 6 girls and 10 boys. If a committee of 3 is chosen at random from the class, find the probability t h a t (i) 3 boys a r e selected, (ii) exactly 2 boys a r e selected, (iii) a t least one boy is selected, (iv) exactly 2 girls a r e selected. 17.47. A pair of f a i r dice is tossed. greater than 1. Find the probability t h a t the maximum of the two numbers is 17.48. Of 120 students, 60 a r e studying French, 50 a r e studying Spanish, and 20 a r e studying French and Spanish. If a student is chosen a t random, find the probability t h a t the student (i) is studying French or Spanish, (ii) is studying neither French nor Spanish. 17.49. Three boys and 3 girls sit in a row. Find the probability t h a t (iil the boys and girls sit in alternate seats. 17.50. A die is tossed 50 times. occurrence: (i) the 3 girls sit together, The following table gives the six numbers and their frequency of Find the relative frequency of the event ii) a 4 appears, iii) a n odd number appears, (iii) a prime number appears. Answers to Supplementary Problems 17.29. (i) A u B ~ ,(ii) ( A u B ) ~ 17.30. (i) (AnBcnCc)u (BnAcnCc)u (CnAcnBc) (ii) ( A n B )u ( A n C ) U ( B n C ) 17.31. (i) (iii) ( A u B u C). (iv) ( A u B ) n CC S = (HH1, HH2, HH3, HH4, HH5, HH6, HT1, HT2, HT3, HT4, HT5, HT6, TH1, TH2, TH3, TH4, TH5, TH6, TT1, TT2, TT3, TT4, TT5, TT6) (ii) A = (HH2, HH4, HH6}, B = (HH2, HT2, TH2, TT2}, C = {HTZ, TH2, HT3, TH3, HT5, TH5) (iii) ( a ) A n B = (HH2) ( b ) B\(AuC) = (TT2) (c) B u C = (HH2, HT2, TH2, TT2, HT3, TH3, HT5, TH5} 17.32. (i) no, (ii) no, (iii) yes, (iv) yes 17.33. 17.34. 17.35. 17.36. 17.37. 3 5 17.44. 17.38. A, (i) k, 17.45. (i) 17.39. 17.46. 3 17.40. p = 5; the odds a r e 3 to 2. 17.47. 35 17.41. (i) I, iii) &, (iii) 3 11 17.42. 17.43. (i) A, (ii) &, (iii) 8 (ii) (ii) A, (iii) 3 (iv) 15 $, 1 8 27 15 (iii) 28, (iv) 17.48. (i) 4, 3 (ii) 17.49. (i) i,(ii) 1 17.50. (i) &, 2, (iii) 8 (ii) E Chapter 18 Conditional Probability. Independence CONDITIONAL PROBABILITY Let E be an arbitrary event in a sample space S with P ( E )> 0. The probability that an event A occurs once E has occurred or, in other words, the conditional probability of A given E , written P ( A ' E ) , is defined as follows: P ( AnE ) P ( A ' E )= P(E) As seen in the adjoining Venn diagram, P(A I E ) in a certain sense measures the relative probability of A with respect to the reduced space E . In particular, if S is an equiprobable space and we let \A]denote the number of elements in the event A , then P(AnE) (AnE( (El and so P(A1E) = P ( A n E ) = --,A n E l P(E) = P(E) IEl IS IS In other words, Theorem 18.1: If S is an equiprobable space and A and E are events of S, then number of elements in A n E number of elements in E number of ways A and E can occur P(A1E) = number of ways E can occur IE, or Example 1.1: Let a pair of f a i r dice be tossed. If the sum is 6, find the probability t h a t one of the dice is a 2. In other words, if E = {sum is 61 = {(1,5), (2,4), (3,3), (4,2), (5,111 A = {a 2 appears on at least one die} and find P ( A I E). Now E consists of five elements and two of them, (2,4) and (4,2), belong to A : A n E = {(2,4), (4,2)}. Then P ( A I E) = i. On t h e other hand, since A consists of eleven elements, A = ((2, l), 12,2), (2, 3), (2,4), (2,5), ( 2 , 6 ) , ( 1 , 2 ) , ( 3 , 2 ) , (4,2), (5,% (6,211 and S consists of 36 elements, P ( A ) = $. Example 1.2: A couple h as two children; the sample space is S = { b b , b g , g b , g g } with probability & f o r each point. Find the probability p t h a t both children a r e boys if ( i ) i t i s known t h a t one of the children is a boy, (ii) it is known t h a t the older child is a boy. ( i ) Here the reduced space consists of three elements, { b b , b g , g b } ; hence p = &. rii) Here the reduced space consists of only two elements, { b b , b g } ; hence p = 4. 199 200 [CHAP. 18 CONDITIONAL PROBABILITY. I N D E P E N D E N C E MULTIPLICATION THEOREM FOR CONDITIONAL PROBABILITY If we cross multiply the above equation defining conditional probability and use the fact that A n E = E n A , we obtain the following useful formula. Theorem 18.2: P ( E f l A ) = P ( E )P ( A 1 E ) This theorem can be extended by induction as follows: Corollary 18.3: P ( A l n A z n . . n A , ) = P ( A 1 ) P ( A 2 1 A 1 ) P (1 AA3l n A z ) P(A4 I A l n A e n A 3 ). . P ( A , I A1 n A p n . * * nA,-l) 9 We now apply the above theorem which is called, appropriately, the multiplication theorem. Example 2.1: A lot contains 12 items of which 4 are defective. Three items a r e drawn at random from the lot one a fte r the other. Find the probability p t h a t all three a r e non-defective. The probability tha t the first item is non-defective is since 8 of 12 items a r e non-defective. If the first item is non-defective, then the probability tha t the next since only 7 of the remaining 11 items a r e non-defective. item is non-defective is If the first two items a re non-defective, then the probability t h a t the last item is non-defective is since only 6 of the remaining 10 items are now non-defective. Thus by the multiplication theorem, = & . $ . " = 11 5 fi & 10 55 FINITE STOCHASTIC PROCESSES AND TREE DIAGRAMS A (finite) sequence of experiments in which each experiment has a finite number of outcomes with given probabilities is called a (finite) stochastic process. A convenient way of describing such a process and computing the probability of any event is by a tree diagram as illustrated below; the multiplication theorem of the previous section is used to compute the probability that the result represented by any given path of the tree does occur. Example 3.1: We a r e given three boxes as follows: Box I has 10 light bulbs of which 4 a r e defective. Box I1 has 6 light bulbs of which 1 is defective. Box I11 has 8 light bulbs of which 3 a r e defective. W e select a box a t random and then dra w a bulb at random. W h a t is the probability p t h a t the bulb is defective? Here we perform a sequence of two experiments: (i) select one of the three boxes; (ii) select a bulb which is either defective (D)or non-defective ( N ) . The following tree diagram describes this process and gives the probability of each branch of the tree: CHAP. 181 CONDITIONAL PROBABILITY. INDEPENDENCE 201 The probability that any particular path of the tree occurs is, by the multiplication theorem, the product of the probabilities of each branch of the path, e.g., the probability of selecting box I and then a defective bulb is 3.5 1 2 = IS. 2 Now since there a r e three mutually exclusive paths which lead to a defective bulb, the sum of the probabilities of these paths is the required probability: = 13 . 25 + ; . ; + 1 . 33 Example 3.2: = 113 8 360 A coin, weighted so t h a t P ( H ) = $ and P(T) = $, is tossed. If heads appears, then a number is selected at random from the numbers 1 through 9; if tails appears, then a number is selected at random from the numbers 1 through 5. Find the probability p that an even number is selected. The tree diagram with respective probabilities is Note t h a t the probability of selecting an even number from the numbers 1 through 9 is 4 since there a r e 4 even numbers out of the 9 numbers, whereas the probability of selecting a n even number from the numbers 1 through 5 is since there are 2 even numbers out of the 5 numbers. Two of the paths lead to a n even number: H E and TE. Thus p z P(E) = $ . + + ? . * = 55 5 9 Example 3.3: 3 3 135 Consider the stochastic process of t h e preceding example. If a n even number was selected, w h a t is the probability that tails appeared on the coin? In other words, we seek the conditional probability of T given E: P(T 1 E). Now tails and an even number can occur only on the bottom path which has 1 2 2 probability 3.5 = t h a t is, P ( T n E ) = 5. Furthermore, by the preceding example, P ( E ) = Thus by the definition of conditional probability, P ( T n E ) - 2/15 - 9 P ( T / E ) = ___ P(E) - 581135 - 29 &; %. ~ In other words, we divide the probability of the successful path by the probability of the reduced sample space. INDEPENDENCE An event B is said to be independent of an event A if the probability that B occurs is not influenced by whether A has or has not occurred. In other words, if the probability of B equals the conditional probability of B given A : P ( B ) = P(B1A). Now substituting P(B) for P ( B I A ) in the multiplications theorem P ( A nB) = P ( A )P(B 1 A ) , we obtain P(AnB) = P(A)P(B) We formally use the above equation as o u r definition of independence. 1 Definition:] Events A and B are independent if P ( A n B ) = P ( A ) P ( B ) ;otherwise they are dependent. Example 4.1: Let a f a i r coin be tossed three times; we obtain the equiprobable space S = {HHH, H H T , HTH, HTT, THH, THT, TTH, TTT} 202 [CHAP. 18 CONDITIONAL PROBABILITY. INDEPENDENCE Consider the events B = {second toss is heads) A = {first toss is heads}, C = {exactly two heads are tossed in a row} Clearly A and B are independent events; this fact is verified below. On the other hand, the relationship between A and C o r B and C is not obvious. We claim that A and C are independent, but that B and C are dependent. We have P ( A ) = P({HHH, HHT, HTH, HTT}) = P ( B ) = P({HHH, HHT, THH, THT}) = P ( C ) = P({HHT, THH)) = f $ = f = f = Then P ( A n B ) = P({HHH, HHT}) = 2, P ( A n C ) = P({HHT}) = 8, P ( B n C ) = P({HHT, THH}) = 2 Accordingly, P ( A ) P ( B ) = 3.4 = & = P ( A n B ) , and so A and B are independent; P ( A )P ( C ) = d * $ = 4 = P ( A n C ) , and so A and C a r e independent; 12 * $ P ( B )P(C) = = 4 P ( B n C ) , and so B and C a r e dependent. # Frequently, we will postulate that two events a r e independent, or the experiment itself will imply that two events are independent. Example 4.2: The probability that A hits a target is 2 and the probability that B hits it is What is the probability that the target will be hit if A and B each shoot a t the target? We are given that P ( A ) = 4 and P ( B ) = 5, and we seek P ( A u B ) . Furthermore, the probability that A or B hits the target is not influenced by what the other does; that is, the event that A hits the target is independent of the event that B hits the target: P ( A n B ) = P ( A ) P ( B ) . Thus 2. P ( A uB) = - P(A) + P ( B ) - P ( A nB ) t+$-'.? 4 5 = P(A) + P ( B ) - P ( A )P ( B ) = 11 20 Three events A, B and C are i n d e p e n d e n t if: P ( A n B ) = P ( A ) P ( B ) , P ( A n C ) = P ( A ) P ( C ) and i.e. if the events are pairwise independent, and (ii) P ( A n B n C ) = P ( A ) P ( B ) P ( C ) (i) P(BnC)= P(B)P(C) The next example shows that condition (ii) does not follow from condition (i); in other words, three events may be pairwise independent but not independent themselves. Example 4.3: Let a pair of fair coins be tossed; here S = {HH, HT, TH, TT} is an equiprobable space. Consider the events A = {heads on the first coin} = {HH, HT} B = {heads on the second coin} = {HH, TH} C = {heads on exactly one coin} = {HT, TH} Then P ( A ) = P ( B ) = P ( C ) = $ = f and P ( A n B ) = P({HH}) = $, P ( A n C ) = P({HT}) = t, P ( B n C ) = ({TH}) = $ Thus condition ii) is satisfied, i.e., the events are pairwise independent. A n B n C = $3 and so However, P ( A n B n C ) = P(@) = 0 # P(A)P(B)P(C) In other words, condition (ii) is not satisfied and so the three events are not independent. CONDITIONAL PROBABILITY. INDEPENDENCE CHAP. 181 203 Solved Problems CONDITIONAL PROBABILITY IN EQUIPROBABLE SPACES 18.1. A pair of fair dice is thrown. Find the probability p that the sum is 10 or greater if (i) a 5 appears on the first die, (ii) a 5 appears on at least one of the dice. ii) If a 5 appears on the first die, then the reduced sample space is A = { [ 5 , 1 ) , 15,2), ( 5 , 3 ) , (5,4), ( 5 , 5 ) , ( 5 , 6 ) } The sum is 10 o r greater on two of the six outcomes: ( 5 , 5 ) , ( 5 , 6 ) . Hence p = 2 1 = 3. (ii) If a 5 appears on a t least one of the dice, then the reduced sample space has eleven elements: B = ((5, I ) , (5,2), ( 5 , 3 ) , 15,4), 15,5), ( 5 , 6 ) , ( 1 , 5 ) , ( 2 , 5 ) , 13,5), (4,5),(6,511 3 The sum is 10 or greater on three of the eleven outcomes: ( 5 , 5 ) , ( 5 , 6 ) , (6,5). Hence p = a. 18.2. Three fair coins are tossed. Find the probability p that they are all heads if (i) the first coin is heads, (ii) one of the coins is heads. The sample space has eight elements: (i) S = {HHH, H H T , HTH, HTT, THH, THT, TTH, TTT}. If the first coin is heads, the reduced sample space is A = {HHH, HHT, HTH, HTT}. Since the coins are all heads in 1 of 4 cases, p = $. (ii) If one of the coins is heads, the reduced sample space is B = {HHH, HHT, HTH, HTT, THH. THT, TTH}. Since the coins are all heads in 1 of 7 cases, p = 3. 18.3. A pair of fa i r dice is thrown. If the two numbers appearing are different, find the probability p that (i) the sum is six, (ii) an ace appears, (iii) the sum is 4 or less. Of the 36 ways the pair of dice can be thrown, 6 will contain the same numbers: ( l , l ) , (2,2), Thus the reduced sample space will consist of 36 - 6 = 30 elements. . . ., ( 6 , 6 ) . (i) The sum six can appear in 4 ways: (1,5), ( 2 , 4 ) , (4,2), ( 5 , l ) . (We cannot include ( 3 , 3 ) since 4 2 the numbers are the same.) Hence p = 30 = 5. (ii) An ace can appear in 10 ways: ( 1 , 2 ) , (1,3), 10 p=,=,. 1 . . ., ( 1 , 6 ) and (2, l), ( 3 , 1 ) , . . ., (iii) The sum of 4 or less can occur in 4 ways: (3, I), (1,3), (2, l), ( 1 , 2 ) . Thus p = ( 6 , l ) . Hence 6 $. 18.4. Two digits are selected at random from the digits 1 through 9. If the sum is even, find the probability p that both numbers are odd. The sum is even if both numbers a r e even or if both numbers a r e odd. There a r e 4 even numbers i 2 , 4 , 6 , 8 ) ; hence there are (;) = 6 ways to choose two even numbers. There a r e 5 odd numbers i l , 3 , 5 , 7 , 9 ) : hence there are = 1 0 ways to choose two odd numbers. Thus there a r e 6 10 = 16 ways to choose two numbers such t h a t their sum is even: since 10 of these ways occur 10 - 5 when both numbers are odd, p = 16 - 2. + (3) 18.5. A man is dealt 4 spade cards from an ordinary deck of 52 cards. If he is given three more cards, find the probability p that at least one of the additional cards is also a spade. CONDITIONAL PROBABILITY. IN D EPEN D EN CE 204 [CHAP. 18 Since he is dealt 4 spades, there a r e 5 2 - 4 = 48 cards remaining of which 1 3 - 4 = 9 a r e spades. There a r e (",") = 17,296 ways in which he can be dealt three more cards. Since there a r e 48 - 9 = 39 cards which a r e not spades, there a r e = 9139 ways he can be dealt three cards which ar e not spades. Thus the probability q t h a t he is not dealt another spade is q = -; 157 hence p = 1 - q = 8 17,296 ' (y) 18.6. Four people, called North, South, East and West, are each dealt 13 cards from an ordinary deck of 52 cards. (i) If South has no aces, find the probability p that his partner North has exactly two aces. (ii) If North and South together have nine hearts, find the probability p that East and West each has two hearts. (i) There ar e 39 cards, including 4 aces, divided among North, (): ways t h a t North can be dealt 13 of the 39 cards. There a r e the four aces, and ):(; ways he can be dealt 11 cards from the not aces. Th u s (;)(::) - 6.12.13.25.26 p z - 36 * 37 * 38 * 39 (3 E a s t and West. There a re (;) ways he can be dealt 2 of 39 - 4 = 35 cards which a r e -650 2109 ways (ii) There a r e 26 cards, including 4 hearts, divided among E a s t and West. There a r e (;!) t h at , say, E a s t can be dealt 13 cards. (We need only analyze East's 1 3 cards since West must have the remaining cards.) There a r e ways E a s t can be dealt 2 hearts from 4 hearts, ways he can be dealt 11 non-hearts from the 26 - 4 = 22 non-hearts. Thus and );( (i) MULTIPLICATION THEOREM 18.7. A class has 12 boys and 4 girls. If three students are selected a t random from the class, what is the probability p that they are all boys? The probability t h a t t h e first student selected is a boy is 12/16 since there are 12 boys out of 16 students. If the first student is a boy, then the probability t h a t the second is a boy is 11/15 since there ar e 11 boys left out of 15 students. Finally, if the first two students selected were boys, then the probability t h a t the third student is a boy is 10114 since there a r e 10 boys left out of 14 students. Thus, by the multiplication theorem, the probability t h a t all three a r e boys is Another Method. There ar e (i6)= (y)= 220 ways t o select 3 boys out of 560 ways to select 3 students of the 16 students, and 12 boys; hence p = = 2 6. A T h i i d M et hod. If the students a re selected one a fte r the other, then there a r e 1 6 * 15 * 14 ways to select three students, and 12 * 11 * 10 ways to select three boys; hence p = -= $. 18.8. A man is dealt 5 cards one after the other from an ordinary deck of 52 cards. What is the probability p that they are all spades? The probability t h a t t h e first card is a spade is 13/52, the second is a spade is 12/51, the third is a spade is 11/50, the fourth is a spade is 10/49, and the last is a spade is 9/48. (We assumed in each case t h a t the previous cards were spades.) Thus p = = 2. $ 9 $ 0 %* & -. CHAP. 181 205 CONDITIONAL PROBABILITY. I N D E P E N D E N C E 18.9. An urn contains 7 red marbles and 3 white marbles. Three marbles are drawn from the urn one after the other. Find the probability p that the first two are red and the third is white. The probability that the first marble is red is 7/10 since there a r e 7 red marbles out of 1 0 marbles. If the first marble is red, then the probability t h a t the second marble is red is 6/9 since there a r e 6 red marbles remaining out of the 9 marbles. If the first two marbles a r e red, then the probability t h a t the third marble is white is 3/8 since there a r e 3 white marbles out of the 8 marbles in the urn. Hence, by the multiplication theorem, &.E.. ?, 9 8 = w 18.10. The students in a class are selected a t random, one after the other, for an examination. Find the probability p that the boys and girls in the class alternate if (i) the class consists of 4 boys and 3 girls; (ii) the class consists of 3 boys and 3 girls. (i) If the boys and girls a r e to alternate, then the first student examined must be a boy. The probability t h a t the first is a boy is 417. If the first is a boy, then the probability t h a t the second is a girl is 3/6 since there a r e 3 girls out of 6 students left. Continuing in this manner, we obtain the probability t h a t the third is a boy is.3/5, t h e fourth is a girl is 2/4, the fifth is a boy is 213, the sixth is a girl is 112, and the last is a boy is 1/1. Thus p = *.?.3.2.2.1.1 , 6 5 4 3 2 -r 1 - 3 5 (ii) There a r e two mutually exclusive cases: the first pupil is a boy, and the first is a girl. If the first student is a boy, then by the multiplication theorem the probability pl t h a t the students alternate is = g.;.;.;.;.; If the first student is a girl, then by the multiplication theorem the probability the students alternate is p 2 = 36 . 0~ . $ . $ .2 1 .1 1 = 1 YO Thus p = p,+pz 1 = %+& = p2 that $. CONDITIONAL PROBABILITY 18.11. In a certain college, 25% of the students failed mathematics, 15% of the students failed chemistry, and 10% of the students failed both mathematics and chemistry. A student is selected at random. (i) If he failed chemistry, what is the probability that he failed mathematics? (ii) If he failed mathematics, what is the probability that he failed chemistry? (iii) What is the probability that he failed mathematics o r chemistry? Let M = {students who failed mathematics} and C = {students who failed chemistry}; then P ( M ) = 25% = 2 5 , (i) P ( C ) = 15% = .15, P ( M n C ) = 10% = . l o The probability that a student failed mathematics, given t h a t he has failed chemistry is - .lo - 2 P ( M )C) = P ( M n C ) .15 3 P(C) ~ (ii) The probability t h a t a student failed chemistry, given t h a t he has failed mathematics is (iii) P(MuC) = P(M) + P(C) - P ( M n C ) = .25 + .15 - . l o = .30 = 3 206 CONDITIONAL PROBABILITY. INDEPENDENCE [CHAP. 18 18.12. Let A and B be events with P ( A ) = +, P(B) = Q and P ( A n B ) = $. Find (i) P ( A [ B), iii) P ( B I A ) , (iii) P ( AuB), (iv) P(Ac1 Bc), (v) P(BcI A'). (iii) P ( A U B ) = P ( A ) + P(B) 1 - P ( A nB) = 5 +3 1 - s - 4 = riv) First compute P(Bc) and P(AcnBc). P(Bc) = 1 - P ( B ) = 1 - Q = +. By De Morgan's ~ AcnBc; hence P(AcnBc) = P ( ( Ad B ) c ) = 1 - P(A U B ) = 1 - 1 1 2 = 2. 12 Law, ( A u B ) = 18.13. Let A and B be events with P ( A ) = 8, P ( B ) = # and P ( A U B ) = 9. Find P ( A 1 B ) and P ( B I A ) . + First compute P ( A n B ) using the formula P ( A UB) = P ( A ) P ( B )- P(A n B ) : _a - 4 + 8 -P(AnB) P(AnB) = or 4 and P(d) = P ( e ) = &i, and 18.14. Let S = { a , b , c , d , e } with P(a) = P ( b ) = A, P ( c ) = let E = { b , c , d } . Find: (i) P ( A IE) where A = {a,b , c } ; (ii) P ( B I E ) where B = { c , d , e } ; (iii) P ( C I E ) where C = { a , c , e } ; (iv) P ( D IE) where D = { a , e } . In each case use the definition of conditional probability + P(c) + P(d) = & P ( A n E ) = P ( b ) + P ( c ) = %+; = &, Here P ( E ) = P ( { b , c , d } ) = P ( b ) (ii APE = {b,c}, J - S tii) B n E = { c , d } , P ( B n E ) = P(c) t P ( d ) = f+& = i, P(x E) = +$ = P(* nE ) p . (El 5. & 1 (AnE) and P ( A / E ) = P P (E) - B ~ and P ( B / E ) = - 2' P(BnE) - - 5 - a P ( E ) - % - 6' ~ P(CnE) 1 1 - 5 - (iii) C n E = {c}, P ( C n E ) = P ( c ) = +, and P(C 1 E ) = P(E) # 3' ~ P(DnE) - 0 (iv) D n E = $3, P ( D n E ) = P ( @ ) = 0, and P(DI E ) = ___ P(E) - 3 = O. 18.15. Show that the conditional probability function P(* 1 E ) satisfies the three axioms of Theorem 17.1; that is: [P,] For any event A, 0 L P ( A 1 E ) f 1. [P,] For the certain event S , P ( S I E ) = 1. [P,] If A and B are mutually exclusive, then P ( A uB 1 E ) = P ( A I E ) P ( B E). + ~ Since A n E c E , P ( A n E ) 5 P ( E ) . Thus P ( A I E ) = n E ) 4 1 and is also non-negative; P(E) that is, 0 5 P(A 1 E ) f 1. P(E) Since S n E = E , P(S 1 E ) = P ( S n E ) - __ P(E) P(E) = If A n B = 8, then A n B n E = ( A n E ) n ( B n E ) = 8, i.e. A n E and B n E a r e mutually exclusive events; hence P ( ( A n E )u (BnE))= P ( A n E ) + P ( B n E ) . But ( A n E ) u ( B n E ) = ( A LB)n E. Thus P((Ac:B)n E ) = P ( A n E ) P ( B n E ) . Hence ~ ~ + CHAP. 181 CONDITIONAL PROBABILITY. I N D E P E N D E N C E 207 Remark: Since P(* I E ) satisfies the aKove three axioms, it also satisfies a n y consequence of these axioms: for example, P(AUB I E ) = P ( A I E ) + P(B1E) - P ( A i 7 B 1 Ej 18.16. Prove: Let E l ,E,, . . .,En form a partition of a sample space S , and let A be any event. Then P ( A ) = P ( E i ) P ( AI E l ) + P(Er) P ( A I Ez) + . . . + P(En)P ( A 1 En) Since the E , forin a partition of S, the E, a r e pairwise disjoint and S = E I U E z U . ~ ~ U E , , . Hence A = S n A = ( E , u E , u ~ ~ ~ u En,A, ) = ( E l n A ) u ( E , n A ) u . . . u (E,nA) and the E , n A a r e also pairwise disjoint. Accordingly, P(A) = P(E,nA) P(E,nA) + - . . + P(E,,nA) By the multiplication theorem, P ( E , p A ) = P ( E , )P ( A I Et); hence P(A) P(E1)P ( A 1 E l ) - P(E,j P ( A 1 Ez) + . . - P(E,,)P ( A I En) 18.17.Three machines A, B and C produce respectively 50%) 3 0 5 and 20% of the total number of items of a factory. The percentages of defective output of these machines are respectively 3%) 4% and 5 G . If an item is selected at random, what is the probability 71 that the item is defective? Let X be the event t h a t a n item is defective. preceding problem, P(X) = = - + Then by the - P ( A )P : X I A ) P ( B )P ( X B ) P ( C )P ( X 1 C ) 5OCr.3% 30%*4% 205.5% _ 5 0 , 3 + 3o.L + 2 o . z 37 = 3.7% + 100 100 100 100 + 100 100 - 1000 We can also consider this problem as a stochastic process having the adjoining tree diagram. 18.18. Prove Bayes’ Theorem: Let E l ,E,, . . . , E l , form a partition of a sample space S , and let A be any event. Then, for-each i, P ( E JP ( A I E l ) P ( E i A) = P(E1)P ( A E l ) + P(E2)P ( A 1 E z ) + . . . + P ( E n )P ( A 1 En) Using the definitlon of conditional probability, we obtain P ( E ,nA ) P(E,) P ( A I EL) P ( E i )P ( A I E l ) P(Ez)P ( A I E 2 j . . - P(En)P ( A 1 En) P(A) Here we used the multiplication theorem, P ( E ,n A ) = P(E,) P ( A 1 Ez),t o obtain the numerator, and Problem 18.16 t o obtain the denominator. P(E, A ) = + + 18.19. In a certain college, 4% of the men and 1% of the women are taller than 6 feet. Furthermore, 6 0 5 of the students are women. Now if a student is selected at random and is taller than 6 feet, what is the probability that the student is a woman? Let A = {students taller than 6 feet:. We seek P(W 1 A ) , the probability t h a t a student is a woman given t h a t the student is taller than 6 feet. By Bayes’ theorem, 208 CONDITIONAL PROBABILITY. I N D E P E N D E N C E [CHAP. 18 18.20. Find P(B A ) if (i) A is a subset of B , (ii) A and I3 are mutually exclusive. (i) If A is a subset of B , then whenever A occurs B must occur: hence P ( B 1 A ) = 1. Alternately, if A is a subset of B then A r B = A ; hence P(B A ) = ~ P(AnB) P(A) - P(A) P(A) = ~ (i) (iii (ii) If A and B a r e mutually exclusive. i.e. disjoint, then whenever A occurs B cannot occur; hence P ( B I A ) = 0. Alternately, if A and B a r e mutually exclusive then A n B = 0; hence FINITE STOCHASTIC PROCESSES 18.21. A box contains three coins; one coin is fair, one coin is two-headed, and one coin is weighted so that the probability of heads appearing is A coin is selected a t random and tossed. Find the probability p that heads appears. +. Construct the tree diagram a s shown in Figure ( a ) below. Note t h a t I refers t o the f a i r coin, I1 t o the two-headed coin, and I11 t o the weighted coin. Now heads appears along three of the paths: hence 1.1 + 1.1 + 1.1 = 11 P = 3 2 3 3 3 18 (4 0) 18.22.We are given three urns as follows: Urn A contains 3 red and 5 white marbles. Urn B contains 2 red and 1 white marble. Urn C contains 2 red and 3 white marbles. An urn is selected at random and a marble is drawn from the urn. is red, what is the probability that i t came from urn A ? If the marble Construct t h e tree diagram a s shown in Figure ( b ) above. We seek the probability t h a t A was selected, given t h a t the marble is red; t h a t is, P ( A 1 R). In order t o find P ( A 1 R ) , i t is necessary first t o compute P ( A n R ) and P ( R ) . The probability t h a t urn A is selected and a red marble drawn is 31 . 83 = g1 ; P ( A r R )= Thus f. t h a t is, Since there a r e three paths leading to a redmarble, P ( R ) = 1 * 3 +31 *3 ~ +3 1 5. g= 173 3 8 360' 1 P(A , R ) = ~ P(AnR) - s P ( R ) - II.? 360 - 45 173 CHAP. 181 209 CONDITIONAL PROBABILITY. I N D E P E N D E N C E 18.23. Box A contains nine cards numbered 1 through 9, and Box B contains five cards numbered 1 through 5. A box is chosen at random and a card drawn. If the number is even, find the probability that the card came from Box A . The tree diagram of t h e process is shown in Figure (a) below. We seek PIA ' E ) , the probability t h a t A was selected, given t h a t the number is even. The probability t h a t Box A and a n even number is drawn is 21 - y4 = $; t h a t is, P ( A n E ) = $. Since there are two paths which lead to an even number, P ( E ) = L*.i-L* 2 9 2 ; = L!4?5 . Thus 18.24. An urn contains 3 red marbles and 7 white marbles. A marble is drawn from the urn and a marble of the other color is then put into the urn. A second marble is drawn from the urn. 0) Find the probability p that the second marble is red. (ii) If both marbles were of the same color, what is the probability p that they were both white? Construct the tree diagram as shown in Figure ( b ) above. 6) Two paths of the tree lead to a red marble: p = 10 * & 10 & + 10 * 10 = !? 50 . ii. (ii) The probability t h a t both marbles were white is * $ = The probability t h a t both marbles were of the same color, i.e. the probability of the reduced sample space, is lo.lo+-'3 2 7 6 - Hence the conditional probability p = = 1o 1o bg. 5/g i. 18.25. We are given two urns as follows: Urn A contains 3 red and 2 white marbles. Urn B contains 2 red and 5 white marbles. An urn is selected at random; a marble is drawn and put into the other urn; then a marble is drawn from the second urn. Find the probability p that both marbles drawn are of the same color. Construct the following tree diagram: 210 CONDITIOKAL PROBABILITY. IISDEPENDENCE [CHAP. 18 Note t h a t if u r n A is selected and a red marble drawn and put into u r n B , then u r n C has 3 red marbles and 5 white marbles. Since there a r e f o u r paths which lead to two marbles of the same color, = $.;.; +$.$.2 + 1.2.2 - L . 5 . L 2 7 2 3 7 = 2 901 1680 INDEPENDENCE 18.26. Let A = event that a family has children of both sexes, and let B = event that a family has at most one boy. (i) Show that A and B are independent events if a family has three children. (ii) Show that A and B are dependent events if a family has two children. fi) We have the equiprobable space S = { b b b , b b g , b g b , b g g , g b b , g b g , g g b , ggg). Here P ( A ) = 68 = 34 A = W g , bgb, b g g , g b b , g b g , g g b ) and so B = Cbgg, g b g , g g b , s w > A n B = {bug,gbg,ggb) Since P ( A ) P ( B )= Iii) 2-4= # and so P(B) = P(AnB) = and so + z $ = P ( A n B ) , A and B a r e independent. We have the equiprobable space 5' = { b b , b g , g b , g g ) . A = {bg,gb) and so B = { b g , gb, gg} A n B = {bg,gb} Here and so P(B) = 4 2 and so P(AnB) = 4 P(A) = Since P ( A ) P ( B )# P ( A n E ) , A and B a r e dependent. 18.27. Prove: If A and B are independent events, then A c and B' are independent events. Let P ( A ) = x and P ( B ) = y. Then P(Ac) = l - - r and P(Bc) = 1-y. independent, P ( A n B ) = P ( A )P ( B ) = X U . Furthermore, P(AuB) = P(A) By De Morgan's theorem, + P(B)- P ( A n B ) = x + y - Since A and B a r e xy ( A u B ) c = AcnBc; hence P(AcnBc) = P ( ( A u B ) c ) = 1 - P ( A ~ J B = ) 1 -x -y On the other hand, P(AC)P(Bc) = (1 - x ) ( l - y) = 1 - j: - y + xy + Thus P(AcnBc) = P ( A c )P(Bc), and so A' and BC a r e independent. 18.28. The probability that a man will live 10 more years is 4,and the probability that his wife ill live 10 more years is +. Find the probability that (i) both will be alive in 10 years, (ii) at least one will be alive in 10 years, (iii) neither will be alive in 10 years, (iv) only the wife will be alive in 10 years. Let A = event t h a t the man is alive in 10 years, and B = event t h a t his wife is alive i n 10 years; then P ( A ) = and P ( B ) = t 11) 4. We seek P ( A n B ) . Since A and B a r e independent, P ( A n B ) = P ( A )P ( B ) = (ii) We seek P ( A u B ) . P ( A u B ) = P ( A )+ P ( B ) - P ( A n B ) = & - + = A. &++-A= 4. 4 4 3. iiii) We seek P(AcnBc). Now P(Ac) = 1- P ( A ) = 1- & = and P(i3c) = 1- P ( B ) = 1 - = Furthermore, since Ac and Ec a r e independent, P ( A c nSc) = P(AC)P(Bc) = 3 ' 42 3 = 1 2 . Alternately, since ( A U B ) C= ACnBc, P ( A c n B c ) = P ( ( Au B ) ~= ) 1- P ( A L'B) = 1 (iv) We seek P ( A c n B ) . Since P(Ac) = 1- P ( A ) = P ( A c ) P ( B )= f. 2 4 = 4. and Ac and B a r e independent, P ( A c n B ) = CHAP. 181 211 CONDITIONAL PROBABILITY. I N D E P E N D E N C E 18.29. Box A contains 8 items of which 3 are defective, and box B contains 5 items of which 2 are defective. An item is d r a x n at random from each box. (i) What is the probability p that both items are non-defective? (ii) What is the probability p that one item is defective and one not? (iii) If one item is defective and one is not, what is the probability p that the defective item came from box A ? (i) The probability of choosing a non-defective item from A is events a r e independent, p = = 5 8 ’ and from B is $. Since the fa; (ii) Method 1. = The probability of choosing two defective items is probability t h a t both a r e non-defective is i. Hence p = 1 20 R &. From fi) the = 19. 40 Method 2. The probability p , of choosing a defective item from A and a non-defective item from B is The probability p L of choosing a non-defective item from A and a :* = 9 1 defective item froin B is = $. Hence p = pl p , = l o t -4 = ?4! 0 . &. + (iii) Consider the events X = {defective item from A } and Y = {one item is defective and one and P(Y)= Hence non-defective}. We seek P ( X [ Y). By (ii), P ( X n Y ) = p , = g. 18.30. The probabilities that three men hit a target are respectively Q, & and +. Each shoots once a t the target. (i) Find the probability p that exactly one of them hits the target. (ii) If only one hit the target, what is the probability th at it was the first man ? Consider the events A = {first man hits the target}, B = {second man hits the target), and The three events a r e and P ( C ) = C = {third man hits the target}; then P ( A ) = +, P ( B ) = independent, and P(Ac) = g, P ( B c ) = 9 , P(CC) = a. t. (i) Let E = [exactly one man hits the target}. Then E = (AnBcnCc)u (AcnBnCc) In other words, if only one hit the target, then i t was either only the first man, A n B c r C c , o r only the second man, AcnBnCc, o r only the third man, A c n B c n C . Since the three events a r e mutually exclusive, p = P(E) = P(AnBcnCc) P(AcnBnCc) P(AcnBcnC) = P ( A )P(Bc)P(Cr) P ( A c )P ( B )P(Cc) T P(Ac)P(Bc) P ( C ) 1 3 2 5 1 2 5 3 1 = L t 5 - 5 = 31 - + + + -.-.---.-.__-.-. 6 4 3 6 4 3 6 4 3 12 36 24 72 (ii) We seek P ( A ’ E ) , the probability t h a t the first man h i t the target given t h a t only one man hit the target. Now A n E = A n Bc n Cc is the event t h a t only the first man h i t the target. hence By (i), P ( A r E ) = P ( A n B c n C c ) = $ and P ( E ) = g; Supplementary Problems CONDITIONAL PROBhBILITT IN EQUIPROBABLE SP-ACES 18.31. A die is tossed. If the number is odd, what is the probability t h a t it is prime? 18.32. A letter is chosen a t random from the word “trained”. If i t is a consonant, what i s the probability t h a t i t is the first letter of t h e word? [CHAP. 18 CONDITIONAL PROBABILITY. I N D E P E N D E N C E 2 12 18.33. Three f a i r coins a r e tossed. If both heads and tails appear, determine t h e probability t h a t exactly one head appears. 18.34. A pair of dice is tossed. If the numbers appearing a r e different, find the probability t h a t the sum is even. 18.35. A man is dealt 5 red cards from a n ordinary deck of 52 cards. W h a t is the probability t h a t they a r e all of the same suit, i.e. hearts o r diamonds? 18.36. A man is dealt 3 spade cards from a n ordinary deck of 52 cards. If he is given four more cards, determine the probability t h a t a t least two of the additional cards a r e also spades. 18.37. 18.38. 18.39. Two different digits a r e selected a t random from the digits 1 through 9. (i) If the sum is odd, what is the probability t h a t 2 is one of the numbers selected? (ii) If 2 is one of the digits selected, what i s t h e probability t h a t the sum is odd? Four persons, called North, South, E a s t and West, a r e each dealt 13 cards from a n ordinary deck of 5 2 cards. ii) If South h a s exactly one ace, what is the probability t h a t his p a r t n e r North h a s the other three aces? iii) If North and South together have 10 hearts, what is the probability t h a t neither E a s t nor West has the other 3 hearts? A class has 1 0 boys and 5 girls. Three students a r e selected from the class at random, one a f t e r the other. Find the probability t h a t ( i ) the first two a r e boys and the third is a girl, (ii) the first and third a r e boys and the second is a girl, (iii) the first and third a r e of the same sex, and the second is of the opposite sex. 18.40. In the preceding problem, if the first and third students selected a r e of t h e same sex and the second student is of the opposite sex, what is the probability t h a t the second student is a girl? CONDITIONAL PROBABILITY 18.41. In a certain town, 40% of the people have brown hair, 25% have brown eyes, and 15% have both brown hair and brown eyes. A person is selected at random from the town. ti) If he has brown hair, what is the probability t h a t he also has brown eyes? (ii) If he has brown eyes, what is the probability t h a t he does not have brown h a i r ? (iii) W h a t is the probability t h a t he has neither brown h a i r nor brown eyes? a, (i) P ( A IB), 18.43. Let S = ( a , b , c , d , e ,f} with P ( a ) = P(b) = P(c) = P(d) = A, P(e) = & and P ( f )= A . Let A = { a , c , e } , B = { c , d , e , f } and C = { b , c , , f } . Find (i) P ( A 1 B ) , (ii) P ( B I C), 16 (iii) P ( C ' A c ) , (iv) P(Ac 1 C). 18.44. In a certain college, 25% of the boys and 10% of the girls a r e studying mathematics. The girls constitute 60Cb of the student body. If a student is selected at random and i s studying mathematics, determine the probability t h a t the student is a girl. and A, P(AUB) = 4. Let A and B be events with P ( A ) = (ii, P ( B A ) , (iii) P ( A nBc), (iv) P ( A 1 Bc). A, P(B) = & 18.42. Find 4, FINITE STOCHASTIC PROCESSES 18.45. We a r e given two u r n s as follows: Urn A contains 5 red marbles, 3 white marbles and 8 blue marbles. Urn B contains 3 red marhles and 5 white marbles. A f a i r die is tossed; if 3 or 6 appears, a marble is chosen from B , otherwise a marble is chosen from A . Find the probability t h a t (i) a red marble is chosen, (ii) a white marble is chosen, (iiii a blue marble is chosen. CHAP. 181 CONDITIONAL PROBABILITY. I N D E P E N D E N C E 213 18.46. Refer to the preceding problem. (i) If a red marble is chosen, what is the probability t h a t it came from urn A ? {iii If a white marble is chosen, what is the probability t h a t a 5 appeared on the die? 18.47. An u r n contains 5 red marbles and 3 white marbles. A marble is selected a t random from the urn. discarded, and two marbles of the other color a r e put into the urn. A second marble is then selected from the urn. Find the probability t h a t i i ) the second marble is red, iii) both marbles a r e of the same color. 18.48. Refer to the preceding problem. (i) If the second marble is red, what is the probability t h a t the first marble is red? iii) If both marbles a r e of the same color, what is the probability t h a t they a r e both white? 18.49. A box contains three coins, two of them f a i r and one two-headed. A coin is selected at random and tossed twice. If heads appears both times, what is t h e probability t h a t the coin is two-headed? 18.50. We a r e given two urns as follows: U r n A contains 5 red marbles and 3 white marbles. Urn B contains 1 red marble and 2 white marbles. A f a i r die is tossed; if a 3 o r 6 appears, a marble is drawn from B and put into A and then a marble is drawn from A : otherwise, a marble is drawn from A and put into B and then a marble is drawn from B. (i) W h a t is the probability t h a t both marbles a r e red? (ii) What is the probability t h a t both marbles a r e white? 18.51. Box A contains nine cards numbered 1 through 9, and box B contains five cards numbered 1 through 5. A box is chosen a t random and a card drawn; if the card shows a n even number, another card is drawn from the same box; if the card shows a n odd number, a card is drawn from the other box. (i) W h a t is the probability t h a t both cards show even numbers? (ii) If both cards show even numbers, what is the probability t h a t they came f r o m box A ? (iii) W h a t is the probability t h a t both cards show odd numbers? 18.52. A box contains a f a i r coin and a two-headed coin. A coin i s selected a t random and tossed. If heads appears, the other coin is tossed: if tails appears, the same coin is tossed. (i) Find the probability t h a t heads appears on the second toss. (ii) If heads appeared on the second toss, find the probability t h a t it also appeared on t h e first toss. 18.53. A box contains three coins, two of them f a i r and one two-headed. A coin is selected at random and tossed, If heads appears the coin is tossed again; if tails appears, then another coin is selected from the two remaining coins and tossed. (i) Find t h e probability that heads appears twice. (ii) If the same coin is tossed twice, find the probability t h a t it is the two-headed coin. (iii) Find the probability t h a t tails appears twice. 18.54. U r n A contains z red marbles and y white marbles, and u r n B contains z red marbles and marbles. (i) 9 white If a n u r n is selected at random and a marble drawn, what is the probability t h a t t h e marble is red? (ii) If a marble is drawn from u r n A and put into u r n B and then a marble is drawn from u r n B , what is the probability t h a t t h e second marble is red? 18.55. A box contains 5 radio tubes of which 2 a r e defective. The tubes a r e tested one a f t e r the other until the 2 defective tubes a r e discovered. W h a t i s t h e probability t h a t t h e process stopped on t h e (i) second test, (ii) third test? 18.56. Refer to the preceding problem. If the process stopped on t h e third test, w h a t is the probability t h a t the first tube is non-defective? 214 CONDITIONAL PROBABILITY. IN D EPEN D EN CE [CHAP. 18 INDEPENDENCE 18.57. Prove: If -4 and B a r e independent, then A and Bc a r e independent and Ac and B a r e independent. 18.58. and P ( B ) = p . (i) Find p if A and B a r e Let A and B be events with P ( A ) = $, P ( A u B ) = mutually exclusive. (ii) Find p if A and B are independent. (iii) Find p if A is a subset of B. 18.59. Urn A contains 5 red marbles and 3 white marbles, and urn B contains 2 red marbles and 6 white marbles. + (i) If a marble is drawn from each urn, w ha t is the probability tha t they a r e both of the same color? (iii If two marbles a r e drawn from each urn, what is the probability t h a t all four marbles a r e of the same color? 18.60. Let three f ai r coins be tossed. Let A = {all heads o r all tails}, B = {at least two heads) and C = { at most two heads}. Of the pairs ( A , B ) , ( A , C) and ( B , C), which are independent and which a r e dependent? 18.61. The probability t h a t A hits a t ar g et is t and the probability t h a t B hits a ta rge t is Q. If each fires twice, what is the probability t h a t the ta rge t will be hit at least once? (i) (ii) If each fires once and the target is hit only once, what is the probability t h a t A hit the ta rget? (iii) If A can fire only twice, how many times must B fire so t h a t there is at least a 90% probability t h at the t ar g et will be hit? 18.62. Let A and B be independent events with P ( A ) = 8 and P ( A u B ) = (iii) P(Bc A ) . ' 8. Find (i) P ( B ) , (ii) P ( A 1 B ) , Answers to Supplementary Problems 15 - 18.31. $ 18.40. %= $ ~ 21 18.32. 2 18.33. 4 18.34. + 18.37. (i) 18.41. (i) Q , (ii) 3, (iii) fr 18.42. (i) Q, (ii) $, (iii) 18.43. (i) $, (ii) 3, (iii) 4, (iv) $ 18.44. Q 18.45. (i) + (ii) & (iii) 4 &, (ii) 5 4, (iv) Q b 8 18.39. (i) (ii) 10 - z *13 = 5 9 -.-.--10 15 15 (iii) 91 5 14 9 13 - 15 91 +- = 20 273 a, (ii) & 18.46. (i) 18.47. ( i ) z , (ii) 36 18.48. (i) 41' (ii) 13 15 91 5 21 41 13 20 3 W CHAP. 181 CONDITIONAL PROBABILITY. I N D E P E N D E N C E 215 16.49. $ 5 16.50. (i) 24 +& = 61 (ii) z , 3 16 + 8 = 81 3il 1206 ~ <-+: 6 w e ; 9 Tree diagram for Problem 18.50 1 18.51. (i) 5 16.52. (i) 8, 1 16.53. (i) 12 +1 (ii) 1 - 2 = 15' (ii) 12 = 2 15 5, (iii) 4+Q = Q 8 + 12 + 5 1 = f, 2 1 (ii) 3, (iii) 12 Tree diagram for Problem 18.55 Tree diagram for Problem 18.53 h, &; 16.55. (i) (ii) we must include the case where the three non-defective tubes appear first, since t h e l a st two tubes must then be t h e defective ones. 18.56. 3 16.60. Only A and B a r e independent. 1 1 1 (i) $, (ii) 9, (iii) 5 18.58. (i) 12, (ii) 9, (iii) 5 18.61. 7 18.59. (i) 16, (ii) 18.62. (i) 4, (ii) 4, (iii) 3 55 Chapter 19 Independent Trials, Random Variables INDEPENDENT OR REPEATED TRIALS We have previously discussed probabiIity spaces which were associated with an experiment repeated a finite number of times, as the tossing of a coin three times. This concept of repetition is formalized as follows: [Definition:I Let S* be a finite probability space. By ?i i n d e p e n d e n t or repeated trials, we mean the probability space S consisting of ordered n-tuples of elements of S" with the probability of an ?z-tuple defined to be the product of the probabilities of its components: P((s1,s2, . . . , Sn)) = P(s,)P(s2) . . . P(s,) Example 1.1: Whenever three horses a, b and c race together, their respective probabilities of I n other words, S = { u , b, c } with P(a) = +, P ( b ) = winning a r e +, f and and P ( c ) = 8. I f the horses race twice, then the sample space of the 2 repeated trials is S = {aa, ab, a c , h a , bb, bc, ca, cb, cc} A. F o r notational convenience, we have written a c f o r the ordered pair ( a , ~ ) . The Probability of each point in S is P(aa) = P(a)P(a) = 4.4 = $ P(ba) = Q P(ab) = P(u)P(b) = +*& = 4 P(bb) = & P(cb) = 3 P ( a c ) = P(u)P(c) = i.8 = & P(bc) = ilj- 1 P(cc) = P(ca) = & 1 & Thus the probability of c winning the first race and a winning the second race is P(ca) = A. From another point of view, a repeated trials process is a stochastic process whose tree diagram has the following properties: (i) every branch point has the same outcomes; (ii) the probability is the same f o r each branch leading to the same outcome. For example, the tree diagram of the repeated trials process of the preceding experiment is as shown in the adjoining figure. Observe that every branch point has the outcomes a, b and c, and each branch leading to outcome a has probability &, each branch leading to b has probability Q, and each leading to c has probability Q. 216 C CHAP. 191 217 INDEPENDENT TRIALS, RANDOM VARIABLES REPEATED TRIALS WITH TWO OUTCOMES We now investigate repeated trials of an experiment with two outcomes; we call one of the outcomes success and the other outcome failure. Let p be the probability of success, and so q = 1 - p is the probability of failure. Frequently we are interested in the number of successes and not in the order in which they occur. The following theorem applies. Theorem 19.1: The probability of exactly k. successes in n repeated trials is denoted and given by f ( n ,k , P) = (3P k r k The probability of at least one success is 1- q". Here (;) is the binomial coefficient (see Chapter 13). Example 2.1: A fair coin is tossed 6 times or, equivalently, six fair coins are tossed; call heads Then n = 6 and p = q = 4. a success. (i) The probability that exactly two heads occurs (i.e. k = 2) is f(6,2,4) = = ($)z(+)4 64 (ii) The probability of getting at least four heads (i.e. k = 4, 5 o r 6) is f(6,4, # f f(6,5,4) + f(6,6, Q) (3)4(g)2+ (E) = (#5 (4)+ (i)($J6 5 + 6 + 1 = 11 - 1 64 64 64 32 A, and so the (iii) The probability of no heads (i.e. all failures) is q6 = (4)6= probability of a t least one heads is 1 - q 6 = 1 - 614 = a. 64 Example 2.2: A fair die is tossed 7 times; call a toss a success if a 5 o r a 6 appears. Then n = 7, p = P({5,6}) = Q and q = 1- p = 2. 3 (i) The probability that a 5 or a 6 occurs exactly 3 times (i.e. k = 3) is f(7,3,Q) = (7) (')3 3 3 (")4 3 1 2187 ,187' (ii) The probability that a 5 or a 6 never occurs (i.e. all failures) is 47 = (Q)' = p. 2059 hence the probability that a 5 or a 6 occurs a t least once is 1- q7 = 2187. If we treat n and p as constants, then the function defined by f ( n ,k,p ) is called the binomial distribution since for k = 0,1,2, . . . , n it corresponds t o the successive terms of the binomial expansion: ( q +p)" = q" (Y) q n - ' p (;) q"-Zp2 + * + p" + = f(n,0,P) + a + f(n,1,P ) + f(n,2, P) + - . + f ( n n, , P) The use of the word distribution will be explained later in the chapter. This distribution is also called the Bernoulli distribution, and independent trials with two outcomes are called Bernoulli trials. RANDOM VARIABLES Let S be a sample space of some experiment. As noted previously, the outcomes of the experiment, i.e. the sample points of S , need not be numbers. However, we frequently wish to assign a specific number to each outcome of the experiment. Such a n assignment is called a random variable. In other words: 218 I N D E P E N D E N T TRIALS, RANDOM VARIABLES 1 Definition: I [CHAP. 19 A mndom variable X is a function from a sample space S into the real numbers. We shall let R, denote the range of a random variable X : R, = X ( S ) . We shall refer to R, as the range space. Remark: If the sample space S is nondenumerable, then f o r theoretical reasons lying beyond the scope of this text certain functions defined on S are not random variables. However, the sample spaces here are finite, and every function defined on a finite sample space is a random variable. Example 3.1: A pair of dice is tossed. The sample space S consists of the 36 ordered pairs of numbers between 1 and 6: S = ((1, I), ( 1 , 2 ) , . . . , ( 6 6 ) ) Let X assign to each point in S the sum of the numbers; then X is a random variable with range space Rx = (2, 3, 4, 5, 6, 7 , 8, 9, 10, 11, 12) Let Y assign t o each point the maximum of the two numbers; then Y is a random variable with range space RY = (1, 2 , 3, 4, 5, 6) Example 3.2: A sample of 3 items is selected from a box containing, say, 12 items of which 3 a r e defective. The sample space S consists of the = 220 different samples of size 3. Let X denote the number of defective items in the sample; then X is a random variable with range space R, = { 0 , 1 , 2 , 3 } . (y) PROBABILITY SPACE AND DISTRIBUTION O F A RANDOM VARIABLE Let R, = {x,,x,, . . .,x,} be the range space of a random variable X defined on a finite sample space S. Then X induces an assignment of probabilities on the range space R, as follows: p l = P(x,) = sum of probabilities of points in S whose image is x1 The function assigning p i to xior, in other words, the set of ordered pairs (xl, pl), . . . , usually given by a table ... is called the distribution of the random variable X . I n the case that S is an equiprobable space, we can easily obtain the distribution of a random variable from the following result. Theorem 19.2: Let S be an equiprobable space. If R, = {xl, . . . ,x,} is the range space of a random variable X defined on S , then pi = P(XJ = Example 4.1: number of points in S whose image is number of points in S xi A pair of f ai r dice is tossed. W e obtain the equiprobable space ' 5 consisting of the 36 ordered pairs of numbers between 1 and 6: S = {(LI), ( 1 , 2 ) , . . ., CHAP. 191 219 I N D E P E N D E N T TRIALS, RANDOM VARIABLES Let X be t h e random variable which assigns t o each point the sum of the two numbers; then X has range space R , = {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12) x. We use Theorem 19.2 t o obtain the distribution of There is only one point There a r e two points in S, (1,Z) and 2 (2, l), having image 3; hence P(3) = z.There a r e three points in S , (1,3), (2,2) 3 4 and (3, l),having image 4: hence P ( 4 ) = 36. Similarly, P(5) = x, P(6) = &, etc. The distribution of X consists of the points in R, with their respective probabilities: (1,l)whose image is 2: hence P(2) = Example 4.2: xi 2 ' 36 I 3 - f6 4 - ' 3 36 &. 5 6 7 1 8 1 36 -5 36 -6 36 1 - 5 36 9 - 6', 10 11 12 3 36 2 - 6', 36 A sample of 3 items is selected a t random from a box containing 12 items of which 3 a r e defective. The sample space S consists of the = 220 different equally likely samples of size 3. Let X be the random variable which assigns to each sample the number of defective items in the sample; then Rx = {0,1,2,3). Again we use Theorem 19.2 to obtain the distribution of X. ii') (i) There a r e = 84 samples of size 3 which contain no defective items; hence 84 P ( 0 ) = m. There a r e 3 * (:) = 108 samples of size 3 containing 1 defective item; There are (;) * 9 = 27 samples of size 3 containing 2 defective hence P(1) = 2items; hence P ( 2 ) = &. There is only one sample of size 3 containing the 3 defecThe distribution of X follows: tive items; hence P(3) g. =A. Example 4.3: + A coin weighted so t h a t P i H ) = 8 and P(T)= is tossed three times. The probabilities of the points in the sample space S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} a r e as follows: p ( H H H ) = g.:F.2 3 1 3 z 27 i7 = 6 P ( H H T ) = 2.2.1 a = $.;+*+ p(HTH) = - p ( H T T ) = +.l.+ = 4 27 P(THH) = PtTHT) = P(TTH) = P(,TTT) = t.3.g = 4 27 Q.8.Q = & g.1.2 3 3 &*&*& = 2 27 = 1 27 Let X be the random variable which assigns t o each point in S the largest number of successive heads which occurs. Thus, X(TTT) = 0 X ( H T H ) = 1, X(HTT) = 1, X(THT) = 1, X(TTH) = 1 X(HHT) = 2, X!THH) = 2 X(HHH) = 3 The range space of X is R , = {0,1,2,3}. We obtain P(O),P(1),P ( 2 ) and P(3) as follows: P(0) = P(TTT) = P(1) = P ( H T H ) & + P(HTT) + P(THT) + P(TTH) 2 P ( 2 ) = P(HHT) t P ( T H H ) = &-I $= P(3) = P(HHH) = & = & + $+& +$ = 10 I N D E P E K D E 9 T T I? IA L S, RAND 0 31 V A R IA B L ES 220 The distribution of X consists of t h e points in the range space of respective probabilities: ;CHAP. 19 X with their Now consider repeated trials of an experiment with two outcomes: that of success with probability p and that of failure with probability Q = 1- p . To each sequence of n outcomes, let X n assign the number of successes. Then X , 2 is a random variable with range space RY,,= {0,1,2, . . ., ti). Using Theorem 19.1, v e obtain PjO) = f ( t z , O , p ) = Q" P(1) = f ( i l , 1,p) = (:)qrz-1]1 P ( 2 ) = f(i1,2,p) = (;) q"-'p' .......................... P(H) = . f ( i r , n, p ) = p" I n other words, the distribution of X n is the binomial distribution as discussed previously: EXPECTED VALUE If the outcomes of an experiment are the numbers T , , x,, . , ., x r which occur with respective probabilities pl, p 2 , . . , p , , then the expected value is defined t o be , E = ~~p~ + ~ , p ,+ yrpr In particular, if the xZand p Z come from the distribution of a random variable X , then the expected value ic, called the expectation of the random variable and is denoted by E ( X ) . Example 5.1 : Consider the random variable X of Example 4.2. I t s distribution gives the possible numbers of defective items in a sample of size 3 with their respective probabilities of occurrence. Then the expected number of defective items in a sample of size 3, i.e. the expectation of the random variable X, is E(X) = + 2.sc 3 . L - s3 220 - o.g+1.E Example 5.2: A f a i r coin is tossed 6 times. The possible numbers of heads which can occur with their respective probabilities a r e as foilom-s: Note t h a t the above table is t h e binomial distribution with n = 6 and p = The expected value, i.e., the expected number of heads, is E = 0 . k + 1-& + 2 Observe t h a t E = np. - g + 3*% + 4 . 3 + 5 * & + 6-A = 3 3. CHAP. 191 I K D E P E N D E N T TRIALS, RANDOM VARIABLES 22 1 The result in the preceding example holds true in general; namely, Example 5.3: xi 0 Pi qn 1 2 (;)q"-'P (;)9"-2P2 ... n ... Pn The probability is 0.2 t ha t a n item produced by a factory is defective. If 3000 items are made by the factory, then the expected number of defective items is E = np = (3000)(0.2) = 60. GAMBLING GAMES I n a gambling game, the expected value E is considered to be the value of the game to the player. The game is said to be favorable to the player if E is positive, and tinfavorable if E is negative. If E = 0, the game is fair. xi 2 5 3 -1 -4 -6 The negative numbers -1, -4 and -6 correspond t o the fa c t t h a t the player loses if a non-prime number occurs, The expected value of the game is Thus t h e game is unfavorable to the player since the expected value is negative. Example 6.2: *+ A player tosses a f a i r coin. If a head occurs he receives 5 dollars, but if a tail 1* 4 = 3. occurs he receives only 1 dollar. His expected winnings E = 5 Question: How much money should he pay to play the game so t h a t the game is f a i r ? W e claim t h at the answer is 3 dollars. For if he pays 3 dollars t o play the game, then each outcome is decreased by 3 dollars. Thus when a head occurs he wins 5 - 3 = 2 dollars and when a tail occurs he wins 1 - 3 = -2 dollars, i.e., he loses 2 dollars; hence the expected value is now E = 2 * (-2) * $ = 0. ++ The result in the preceding example holds true in general; namely, Theorem 19.4: Let E be the expected value of a n experiment. If a constant k is added to each outcome of the experiment, then the expected value E* of the new experiment is E k : E" = E k. + + Example 6.3: A thousand tickets a t $1 each are sold in a lottery in which there is one prize of $500, four prizes of $100 each, and five prizes of $10 each. The expected winnings of one ticket are 500-& loo*& lo*& = 0.95 + + However, since a ticket costs $1, the expected value of the game is 0.95 - 1.00 = - $0.05, which is unfavorable to the player. 222 I N D E P E N D E K T TRIALS, RANDOJI VARIABLES [CHAP. 19 Solved Problems REPEATED TRIALS WITH TWO OUTCOMES 19.1. Find (i) f ( 5 , 2 , + ) , (ii) f ( 6 , 3 ,i), (iii) f ( 4 , 3 , t ) . Here f ( n ,k , P ) = (;) Pk (1- P y k 19.2. A fair coin is tossed three times. Find the probability P that there will appear (i) three heads, (ii) two heads, (iii) one head, (iv) no heads. Method 1. We obtain the following equiprobable space of eight elements: S = {HHH, HHT, HTH, HTT, T H H , THT, TTH, TTT] (i) Three heads (HHH) occurs only once among the eight sample points; hence P = 4. (ii) Two heads occurs 3 times IHHT, HTH, and THH); hence P = # . (iii) One head occurs 3 times (HTT, T H T and TTH); hence P = $ . (iv) No heads, i.e. three tails (TTT), occurs only once; hence P = $. Use Theorem 19.1 with .n = 3 and p = q = Method 2. (i) Here k = 3 and P = f ( 3 , 3 , 4 ) = (i) (3)O 4. = 1 . 81 * 1 = 1. 8 (+)z(4) = 3 1.1 4 2 = a. 8 (t)(+)l (4)z = 3 9 4 . 4 = #. (ii) Here k = 2 and P = f ( 3 , 2 , *) = (;) (iii) Here k = 1 and P = f(3,1,3) = (iv) Here k = 0 and P = f ( 3 , 0 , 4 ) = (:) (+)O(4)3 = 1. 1.18 = 1. 8 If 4 items are (i) 2 are defective, (ii) 3 are 19.3. Suppose that 20% of the items produced by a factory are defective. chosen a t random, what is the probability P that defective, (iii) a t least one is defective? Use Theorem 19.1 with n = 4, p = .2 and q = 1 - p = .8. (i) Here k = 2 and (ii) Here k = 3 and P = J(4,2, .2) = (i)(.2)2(.8)2 = .1536. P = f(4,3, .2) = (:) (.2)3(.8) = ,0256. (iii) P = 1 - 44 = 1 - (.8)4 = 1 - .4096 = .5904. 19.4. Team A has probability ;f of winning whenever it plays. If A plays 4 games, find the probability that A wins (i) exactly 2 games, (ii) at least 1 game, (iii) more than half of the games. Here 12 = 4, p = and q = 1 - p = $. CHAP. 191 (i) I N D E P E N D E N T TRIALS, RANDOM VARIABLES We seek f ( n , k , p ) when k = 2: f ( 4 , 2 , $ ) = (i) = ($)2(&)2 223 &. (ii) Here 44 = = $ is the probability t h a t A loses all four games. probability of winning at least one game. Then 1- 44 = is the (iii) A wins more than half the games if A wins 3 or 4 games. Hence the required probability is f(4,3,+, + f(4,4,9) = (;) (9)3 (g) + (94 g+g = 3 = 27 19.5. A family has 6 children. Find the probability that there are (i) 3 boys and 3 girls, (ii) fewer boys than girls. Assume that the probability of the birth of either sex i s 3. (i) The required probability is f ( 6 , 3, 4) = ( 6 ) (1)3 3 2 (1)s = 20 = -5- 16 64 2 (ii) There a r e fewer boys than girls if there a r e 0, 1 or 2 boys. Hence the required probability is + f(6,1, +) + f ( 6 , 2 , 4 ) = .f(6,0,4) (&6 + ($I5 (4) f (g) (4)4CgP = 19.6. The probability that a college student does not graduate is .3. Of 5 college students chosen at random, find the probability that (i) one will not graduate, (ii) three will not graduate, (iii) a t least one will not graduate. Here n = 5, p = .3 and q = 1 - p = .7, (i) f ( 5 , 1,.3) = (;) (.3) (.7)4 = .36015 (ii) f ( 5 , 3, .3) = (i)(.3)3(.7)2 = (iii) 1 - 45 = 1 - (.7)5 .1323 = 1 - .16807 = .83193 Consider n repeated trials with two outcomes and p = q = 3. (i) Find the probability of exactly h: successes. (ii) Find the probability P of more successes than failures if n is (a) odd, ( b ) even. (i) The probability of exactly k successes is f ( n ,k , 3, = (T;) (&)k = (T;) (&)"+ (3)n (ii) Recall t h a t the distribution of the number of successes appears in the expansion of the binomial ( q p)" = + (4+ 4)": + 1 = (1 2 1 2 )" = (+)n + (7) (3)"+ )(; (4)" + * - + (;I ' ($P+ (Y) (&). + (4)" Since the binomial coefficients are symmetrical about the center of the expansion and since each term is (4)" times a binomial coefficient, the above terms a r e also symmetrical about the center of the expansion. (a) If n is odd, then there is an even number of terms in the above expansion and so there are more successes than failures in exactly the terms in the last half of the expansion. Thus 1 = P + P or P = Q. ( b ) If n is even, then there is an odd number of terms in the above expansion, with the middle Now there a r e more successes than failures in the terms to the term M = (&)(:)". right of the middle term; hence by the symmetry, 1 = P M P o r P = A(1- M ) . + + 224 I N D E P E N D E K T TRIALS, RANDOM VARIABLES 'CHAP. 19 19.8. The probability of team A winning any game is 4. What is the probability P that A wins more than half of its games if A plays [ i ) 43 games, (ii) 8 games? (i) + since n is odd and 71 = q = +. of the expansion (4+ i.e. the probability t h a t A wins 4 games and = g,Then by the preceding problem, P = $(l M ) = + ( l- g) = z. By the preceding problem, P = (ii) The middle term M loses 4 games, is (:)(&)8 &)8, - 19.9. How many dice must be thrown so that there is a better than even chance of obtaining a six? The probability of not obtaining a six on n dice is which (Q)" is less than 4: Hence we seek the smallest ($)rl. 71 for Thus 4 dice must be thrown. 19.10.The probability of a man hitting a target is 4. (i) If he fires 7 times, what is the probability P of his hitting the target at least twice? (ii) How many times must he fire so that the probability of his hitting the target at least once is greater than t ? (i) We seek the sum of the probabilities f o r k = 2, 3, 4, 5, 6 and 7 . It is simpler in this case t o find the sum of t h e probabilities f o r k = 0 and 1,i.e. no hits or 1 hit, and then subtract i t from 1. f(7,0,$) = Then p = 1 2187 16,384 ($Ii = 5101 = 16,384 1187 m, f(7,1,%) = Ci) ($1 ($)6 5103 16,384 ~ 45-17. blOZ (ii) The probability of not hitting the t a r g e t is qn. Thus we seek the smallest n f o r which qn is less than 1 - $ = Q, where q = 1 - p = 1 - $ = $. Hence compute successive powers of q until q n < Q is obtained: (2)l = 34 Q 1. 3' ($)Z =A 16 Q A; d (9)3 = !7 Q 64 <I 3 1; but ( 9 ) 4 z 81 8 256 In other words, he must fire 4 times. 19.11.Prove Theorem 19.1: The probability of exactly k successes in n repeated trials is f ( n ,k , p ) = ([)pkq n P k . The probability of at least one success is 1- qn. The sample space of the 7% repeated trials consists of all ordered n-tuples whose components a r e either s (success) o r f (failure). The event A o f k successes consists of all ordered n-tuples of which Is components a r e s and the other n - k components a r e f . The number of n-tuples in the event A is equal to the number of ways t h a t k letters s can be distributed among t h e ? I components of a n n-tuple; hence A consists of (E) sample points. Since the probability of each point in A is p k qll- k , we have P ( A ) = ):( p k q n - k , The event B of no successes consists o f the single n-tuple (f,f,, , .,f), i.e. all failures, whose probability is qn. Accordingly, the probability of at least one success is 1 - qn. 19.12. The mathematics department has 8 graduate assistants who are assigned to the same office. Each assistant is just as likely to study at home as in the office. How many desks must there be in the office so that each assistant has a desk at least 90% of the time? If there a r e 8 desks, then 0% of the time an assistant will not have a desk at which t o study. CHAP. 191 INDEPENDENT TRIALS,RANDOM VARIABLES 225 Suppose theie a r e 7 desks; then an assistant will not have a desk a t which to study if all 8 assistants a r e in the office. The probability t h a t 8 will study in the office is ( $ ) 8 = = 0.45:. Thus there will not be enough desks 0 . d c r of the time. & Suppose there a r e 6 desks; then an assistant will not have a desk if 8 o r 7 assistants a r e in the office. The probability t h a t 7 xi11 study in the office is f ( 8 . 7 , i ) = = 3.2%. Thus there will not be enough desks 3.2% 0.Ac; = 3.GC; of the time. + Suppose there are 5 desks; then a n assistant will not have a desk if 8, 7 o r 6 assistants a r e in the office. The probability t h a t 6 will study in the office is f ( 8 , 6 ,$) = = 10.9%. Thus there will not he enough desks 10.9% 3.2"r 0.1G = 14.5% of the time. In other words, there will be enough desks only 85.5% of the time. + + Accordingly. 6 desks a r e required. RANDOM VARIABLES 19.13. A pair of fair dice is thrown. Let S be the random variable which assigns to each point the maximum of the two numbers which occur: X ( ( a ,b ) ) = max (a, b). (i) Find the distribution of X . (ii) Find the expectation of X . I (i) The sample space S is the equiprobable space consisting of the 36 ordered pairs of numbers between 1 and 6: S = {(l, l), (1,Z ) , , . , (6,6)}. Since X assigns to each point in S the maximum of the two numbers in the ordered pair, the range space of X consists of the numbers from 1 through 6: R , = { l , Z , 3 , 4 , 5 , 6 } . &. Each of three points There is only one point (1,l) whose maximum is 1; hence P ( 1 ) = Each of five points in S , in S, ( 1 , 2 ) , ( 2 , 2 ) and (2, l), has maximum 2; hence P(2) = Similarly, P ( 4 ) P(5) = ( 1 , 3 ) , ( 2 , 3 ) , 13,3), ( 3,2) and (3, l ) , has 3; hence P(3) 11 and P ( 6 ) = 1 ~ ., &. =A. The distribution of X consists of the points in I I =A, & Rx with their respective probabilities: I I I (ii) To obtain the expectation of X , i.e. the expected value, multiply each number x, by i t s probability p z and take the sum: E :i6+ 2 . A = 1.- + 4-$ + - 3 . 16 2 5-$ + 601 36 - 36 = 4.5 19.14. Suppose a random variable X takes on the values -3, -1, 2 and 5 with respective 2 x - 3 x + l x-1 x-2 __ and ___ . 10 ' 10 ' 10 10 expectation of X. probabilities __ ~ Determine the distribution and Set the sum of the probabilities equal t o 1, and solve t o find .2: = 3. Then put x = 3 into the above probabilities and obtain .3, .4, .2 and .1 respectively. The distribution of X is xi I -3 I -1 5 2 I I I The expectation is obtained by multiplying each value by its probability and taking the sum: E = (-3)(.3) + (-1)(.4) + 2(.2) + 5(.1) = --.4 226 INDEPENDENT TRIALS, RANDOM VARIABLES [CHAP. 19 19.15. Four fair coins are tossed. Let X denote the number of heads occurring. Find the distribution and expectation of the random variable X . nith Since this experiment consists of four independent trials, we obtain the binomial distribution 7 1 = 4 and p = $: P(0) = f(4,0,3) = P(3) = 1 ( 4 , 3 , 3 ) = 2 1L 6 P(1) = f ( 4 , 1 , + ) = P(2) = J ( 4 , 2 , 4 ) = P ( 4 ) = 1'(4,4,3) = & & Thus the distribution is I I I + + + + 1 - 1A 2 *1A 3 * &. 4 & = 2. E = 0 1 and the expectation is 16 6 6 Theorem 19.3 which states that the expected value E = ? ~ p= 4 = 2. This agrees with 4 19.16. A coin weighted so that P(H ) = $ and P ( T )= & is tossed three times. Let X be the random variable which denotes the longest string of heads which occurs. Find the distribution and expectation of X . The random variable X is defined on the sample space S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} The points in S have the following respective probabilities: P ( H H H ) = 3.8.3 = 27 4 4 . 4 P(HHT) = 3.3.1 4 4 P(THH) = 64 4 P(HTT) = $ * $ * $ = & A P(THT) = 1.3-1 = 8 & & P(TTH) = $ * $ a $ = P(TTT) = &.&.& = 4 64 P(HTH) = $ * $ * $ = = $ * $ a $ 4 4 64 & Since X denotes the longest string of heads, X(TTT) = 0; X(HTT) = 1, X(HTH) = 1, X(THT) = 1, X(TTH) X(HHT) = 2, X(THH) = 2; 1; X(HHH) = 3 Thus the range space of X is R, = {0,1,2,3}. The probability of each number x, in R, is obtained by summing the probabilities of the points in S whose image is xi: P ( 0 ) = P(TTT) = P(1) = P(HTT) & P(2) = P(HHT) + P(HTH) + P ( T H T ) + P(TTH) = + P(THH) = 8 P ( 3 ) = P(HHH) = The distribution of X consists of the numbers in the range space R, with their respective probabilities: 64 64 64 64 19.17. Prove Theorem 19.2: Let S be an equiprobable space. If R, = {xl, . . , , xt} is the range space of a random variable X defined on S , then pi = P ( x , ) = number of points in S whose image is xi number of points in S Let S have n points and let sl, s2, show that P ( x J = r/n. By definition, , . .,s, be the points in S whose image is x,. We wish to CHAP. 191 I N D E P E N D E N T TRIALS, RANDOM VARIABLES 227 P ( x i ) = sum of the probabilities of the points in S whose image is xi = P(Sl) P(s2) . . . P(s,) + + + Since S is an equiprobable space, each o l the points in S has probability lln. Hence ?t r times 19.18. Let X be a random variable defined on a sample space S and with distribution Show that the following are equivalent ways of defining the expectation E ( X ) of the random variable X : E ( X ) = 2,p1 * * . x,p, = X ( s )P ( s ) c + + S E S Let sil,si2,. , . , sini be the points in S with image x i , t h a t is, X ( s i j ) = x i . Then, p, = P ( z J = sum of the probabilities of the points in S whose image is z1 = p, = P(Z2) P(S1,) = + P(SI*) + P(S22) P(S21) .............................. p, = P(rJ = P(S,l) t + '.* + + ' ' ' Z2Pz = ,z 3=1 P(S1j) j=1 "2 + P(s2n2) = P(s2j) j=1 . . . ............................ P(Sr2) + ' a + * z nt P(S,,t) = P(Stj) j=1 nt n2 Similarly, 3 = P(Slnl) X ( S 2 ; ) P(SZ,), . . ., XtP, = B J = lX(s,,) P(Stj) Since the slJ range over all the points s E S , we obtain 19.19. Let X and Y be random variables defined on the same sample space S. Let X the random variable on S defined by (X Y ) ( s ) = X ( s ) Y ( s ) + + Prove that E ( X + Y ) = E ( X )+ E ( Y ) . Using the result of the preceding problem, we obtain E(X +Y) = 2 ( X + Y ) ( s )P ( s ) 2 ( X ( S ) P ( S )i 3 X(s)P(s) ses = = ses S E S + = 2 seS + ( X ( s ) Y ( s ) )P ( s ) Y(s)P(s)) z Y ( s )P ( s ) seS = E(X) + E(Y) + Y be 228 I N D E P E N D E K T TRIALS, RANDOM VARIABLES [CHAP. 19 EXPECTED VALUES 19.20. A player tosses two fair coins. He wins $2 if 2 heads occur, and $1 if 1 head occurs. On the other hand, he loses $3 if no heads occur. Determine the expected value of the game and if it is favorable to the player. Note t h at the probability t h at 2 heads occur is f, tha t 2 tails occur is f and tha t 1 head Thus the probability of winning $2 is $, of winning $1 is and of losing $3 is occurs is Accordingly, E = 2 & 1 * - 3 . i = & = .25. In other words, the expected value of the game is 25d in favor of the player. +. - a. 4, + 3 19.21. A player tosses two fair coins. He wins $3 if 2 heads occur, and $1 if 1 head occurs. If the game is to be fair, how much should he lose if no heads occur? Let .T denote the amount he should lose if no heads occur. We have t h a t the probability of winning 93 is $, of winning $1 is and of losing $x is Since the game is to be fa ir, set E = 0: a. 3, E = 3.$+1.1---~.1 = 0 or 2 5 -z -- 0 4 x = 5 or 4 In other words, the game is f a i r if he loses $5 when no heads occur. 19.22. A player tosses two fair coins. He wins $5 if 2 heads occur, $2 if 1 head occurs and $1 if no heads occur. (i) Find his expected winnings. (ii) How much should he pay t o play the game if it is to be f a i r ? (i) The probability of winning $ 5 is $, of winning $2 is 6, and of winning $1 is $; hence E = 5 2 * 3 1 4 = 2.50, t h a t is, the expected winnings a r e $2.50. - t+ + - (ii) If he pays $2.50 t o play the game, then the game is fair. 19.23. A box contains 10 items of which 2 are defective. If four items a r e selected from the box, what is the expected number of defective items in the sample of size 4? (io) There ar e = 210 different samples of size 4 (without replacement). Since there a r e 8 nondefective items in the box, then (:) = 70 different samples contain no defective items. Hence the = probability P ( 0 ) of no defective items in the sample is P(0) = 2 (B) (:) 5. = 112 different samples contain 1 defective item: hence P(1) = = 28 = 28 different samples contain the 2 defective items; hence P ( 2 ) = Thus the expected number of defective items is E = O * ' +3 l a & + &. - 2 - E. 2m2 15 = 5 * 19.24. A fair die is tossed 300 times. Compute the expected number of sixes. This is an independent trials experiment with n = 300 and p = Q. Hence by Theorem 19.3, E = n p = 300.Q = 50. 19.25.Determine the expected number of boys in a family with 4 children, assuming the sex distribution to be equally probable. What is the probability that the expected number of boys does occur? 4; This is an independent (repeated) trials experiment with ?i = 4 and p = hence by Theorem 19.3, 1 2 = 2. In other words, the family can expect to have 2 boys. The probability t h a t the family has 2 boys is f ( 4 , 2 , 3 ) = 8. E = np = 4 CHAP. 191 INDEPENDENT TRIALS,RANDOM VARIABLES 229 19.26. Prove Theorem 19.3: For the binomial distribution the expected value is E = np. n E 71 = k(I;)qn-! = i,ib k=O n (li - l)! q"-k since q - 11 = 1. = pk-1 11p (p + p)!l-l zz np 1111the above summation. n e dropped the term n h e n li = 0 since i t is 0 * q" = 0.) 19.27. A fair coin is tossed until a head or 4 tails occurs. tosses of the coin. Find the expected number of Only one toss occurs if heads occurs the first time, i.e. the ererit H. Two tosses occur if the first is tails and the second is heads, 1.e. the event TH. Three tosses occur if the first t n o a r e tails and the third is heads, i.e. the event TTH. Four tosses occur if either T T T H or T T T T occurs. Hence P 1 1= P I H I= P ( 2 ) = P ( T H = $, P 31 = P ( T T H ) = +, and 3, P(4) = P T T T H I - P T T T T ) = Thus E = 1.++2.&+3.&+.:.& = &+A = 8 8 19.28.A box contains 8 light bulbs of which 3 are defective. A bulb is selected from the box and tested until a non-defective bulb is chosen. Find the expected number of bulbs t o be chosen. The probability of choosing a non-defective bulb the first time is P(1) = 8. The probability of first choosing a defective bulb and then a non-defective one is P ( 2 ) = $.f = $. The probability of first selecting two defective bulbs and then a non-defective one is P ( 3 ) = f= Lastly, the probability of first selecting the three defective bulbs and then a non-defective one is P(4) = 3.2.1.5 = 1_ 56. Hence t.3. &. , = 1.: E 3-& L + = 4.k 84 56 19.29. Prove Theorem 19.4: Let E be the expected value of an experiment. If a constant h: is added t o each outcome of the experiment, then the expected value E* of the new experiment is E + k. T ~ . ,. . , y r be t h e outcomes of the original experiment with respective probabilities Let pl,p,, . . , , p r . Note p, p , ... 4 = 1. Thus + + E = (%l+k)pl - (/l = xlpl + kp, - - x,p1 + .'pz + = E+k - .f$J ' ' . k)p, + + kp, + + xtp, + k)Pt f ("t+ "' ... T a.,P, + kp, k(p1 - 112 - ..' - Pt) 230 I N D E P E N D E N T TRIALS, RANDOM VARIABLES [CHAP. 19 Supplementary Problems REPEATED TRIALS WITH TWO OUTCOMES 19.30. 19.31. A card is drawn and replaced three times from a n ordinary deck of 52 cards. Find the probability t h a t iii two hearts a r e drawn, ( i i ) three hearts a r e drawn, (iii, a t least one he a rt is drawn. 19.32. A baseball player’s batting average is .300. If he comes to bat 4 times, w ha t is the probability t h a t he will get (i) two hits, (ii) a t least one hit? 19.33. A box contains 3 red marbles and 2 white marbles. A marble is drawn and replaced three times from the box. Find the probability t h a t (i) 1 red marble was drawn, (ii) 2 red marbles were drawn, (iii) a t least one red marble was drawn. 19.34. Team A has probability Q of winning whenever i t plays. If A plays 4 games, find the probability t h a t A wins ( i ) 2 games, (ii) a t least 1 game, (iii) more than half of the games. 19.35. A card is drawn and replaced in an ordinary deck of 52 cards. How many times must a card be drawn so t h at (i) there is a t least a n even chance of drawing a heart, (ii) the probability of drawing a heart is greater than $ ? 19.36. The probability of a man hitting a target is 8. (i) If he fires 5 times, what is the probability of hitting the target a t least twice? iii) How many times must he fire so t h a t the probability of hitting the t ar g et at least once is more than 90%. 19.37. Ten typists ar e assigned to the same office. Each typist is a s likely to work at home a s in the office. How many typewriters must there be in the office so t h a t each typist ha s a typewriter a t least 90r2 of the time? RANDOM VARIABLES 19.38. A pair of f ai r dice is thrown. Let X be the random variable which denotes the minimum of the two numbers which occur. Find the distribution and expectation of X . 19.39. Five f a i r coins a r e tossed. Let X be the random variable which denotes the number of heads occurring. Find the distribution and expectation of X. 19.40. A f a i r coin is tossed f o u r times. Let X be the random variable which denotes the longest string of heads. Find t h e distribution and expectation of X . +. 19.41. Solve the preceding problem in the case t h a t the coin is weighted so t h a t P ( H ) = 3 and P ( T ) = 19.42. Two cards a r e selected from a box which contains five cards numbered 1 , 1 , 2 , 2 and 3. (i) Let X be the random variable which denotes the sum of the two numbers. Find the distribution and expectation of X . (ii) Let Y be the random variable which denotes the maximum of the two numbers. Find the distribution and expectation of Y . 19.43. Refer to the preceding problem. Let X Y be the random variable defined by ( X Y ) ( s ) = X ( s ) Y ( s ) , i.e. which denotes the sum of the two numbers added to the maximum number. F o r example, ( X + Y)(Z,3) = 5 A 3 = 8. Find the distribution and expectation of X Y . Verify that E ( X Y ) = E ( X ) E ( Y ) . + + + + 19.44. A man stands a t the origin. He tosses a coin. If a head occurs, he moves one if a tail occurs he moves one unit t o the left. (i) If X is the random variable distance from the origin af t er 4 tosses of the coin, find the distribution and (ii) If Y is the random variable which denotes his position a fte r 4 tosses of distribution and expectation of Y . 19.45. Solve (i) and [ii) of the preceding problem for the case of 5 tosses of the coin. + + unit to the right; which denotes his expectation of X . the coin, find the CHAP. 191 I N D E P E N D E N T TRIALS, RANDOM VARIABLES 231 EXPECTED VALUE 19.46. A player tosses three f a i r coins. He wins $5 if 3 heads occur, $3 if 2 heads occur, and $1 if only 1 head occurs. On the other hand, he loses $15 if 3 tails occur. Find the value of the game to the player. 19.47. A player tosses three f a i r coins. He wins $8 if 3 heads occur, $3 if 2 heads occur, and $1 if only 1 head occurs. If the game is to be f ai r, how much should he lose if no heads occur? 19.48. A player tosses three f a i r coins. He wins $8 if 3 heads occur, $3 if 2 heads occur, $1 if 1 head occurs, but nothing if no heads occur. If the game is to be fa ir, how much should he pay to play the game? 19.49. A box contains 8 items of which 2 a r e defective. A man selects 3 items from the box. Find the expected number of defective items he has drawn. 19.50. Of the bolts produced by a factory, 2 5 a r e defective. In a lot of 8000 bolts from the factory, how many a r e expected t o be defective? 19.51. Find the expected number of girls in a family with 6 children, assuming sex distribution t o be equally probable. Wh at i s the probability t h a t the expected number of girls does occur? 19.52. A lottery with 500 tickets gives one prize of $100, 3 prizes of $50 each, and 5 prizes of $25 each. (i) Find the expected winnings of a ticket. fii) If a ticket costs $1, w ha t is the expected value of t h e game? 19.53. A f a i r coin is tossed until a head or five tails occur. Find the expected number of tosses of the coin. 4 6 19.54. A coin weighted so t h at P ( H ) = and P ( T ) = the expected number of tosses of the coin. 19.55. A plays B in a tournament. The first team The probability of team A winning an y game is t o win 2 games in a row o r a total of three games wins the tournament. Find the expected number of games in the tournament. is tossed until a head or five tails occurs. Find 4. 19.56. A box contains 1 0 transistors of which 2 a r e defective. A transistor is selected from the box and tested until a non-defective one is chosen. Find the expected number of transistors to be chosen. 19.57. Solve the preceding problem in the case t h a t 3 of the 10 items a re defective. Answers to Supplementary Problems 21 &, (ii) 128, 27 19.30. (i) 19.31. (i) &, (ii) &, (iii) 19.32. (i) 0.2646, (ii) 0.7599 19.33. (i) g,(ii) a,(iii) 117 19.34. (i) g,(ii) m, (iii) 112 19.35. (i) 3, (ii) 5 19.36. (i) 19.37. 7 (iii) 1.r~ 54 544 g,(ii) 6 232 I N D E P E N D E N T TRIALS, RANDOM VARIABLES [CHAP. 19 19.38. 19.39. c xi 5 U 1 2 n Pi 19.40. 19.41. 2 1 xi 0 Pi 1 1 3 l 32 3 16 16 32 O L 32 3 - 5 4 4 2 - 16 -1 32 E(X)=$ 1'6 8 1 E(X)= - 81 81 81 g 81 19.42. E ( X ) = 3.6 (i) 19.43. x i 3 I pi 5 I .1 6 I .4 7 I .1 8 1 .2 (ii) E ( X + Y ) = 5.9 .2 19.44. 19.45. E ( X ) = 15 19.46. 2% 19.52. 19.47. $20 19.53. 19.48. $2.50 19.49. 23 19.54. 19.55. (i) 75d, (ii) - 25@ % 23 19.50. 160 11 19.56. s 19.51. 3, 19.57. E ( Y ) = 2.3 Chapter 20 Markov Chains PROBABILITY VECTORS A row vector u = (u1,u2,. . .,u,,) is called a probability vector if its components are non-negative and their sum is 1. Example 1.1 : Consider the following vectors: 16 = ( 3 0 -1 I) 7j = 4’ > 4’ ’’ Then: I (2, 4,0’ 1) 4 and zu = (11 0 1) 4’4’ ’2 ic is not a probability vector since its third component is negative; 1: is not a probability vector since the sum of its components is greater than 1: zt‘ is a probability vector. Example 1.2: Remark: - The non-zero vector 2’ = ( 2 , 3 , 5 , 0 , 1 ) is not a probability vector since the sum of its components is 2 f 3 5 O + 1 = 11. However, since the components of w a r e non-negative, u has a unique scalar multiple Xu which is a probability vector; it can be obtained from v by dividing each component of v by the sum of the ’ 3 5 1 components of v: & G = (z 11’ fi, fl. 0 , E ) . + Since the sum of the components of a probability vector is one, a n arbitrary probability vector with n components can be represented in terms of n - 1 unknowns as follows: ( X l , x2, . . ., x,1-1, 1 - x1 - * . ’ - x n - 1 ) In particular, arbitrary probability vectors with 2 and 3 components can be represented, respectively, in the form (x, 1-x) and (x,Y, 1 - x - y ) STOCHASTIC AND REGULAR STOCHASTIC MATRICES A square matrix P = ( p i J is called a stochastic matrix if each of its rows is a probability vector, i.e. if each entry of P is non-negative and the sum of the entries in every row is 1. Example 2.1 : Consider t h e following matrices: (ii) (i) (iii) is not a stochastic matrix since the entry in the second row and third column is negative; (ii) is not a stochastic matrix since the sum of the entries in t h e second row is not 1; (iii) is a stochastic matrix since each row is a probability vector. 233 234 [CHAP. 20 31ARKOV CHAINS We shall prove (Problem 20.7) Theorem 20.1: If A and B are stochastic matrices, then the product AB is a stochastic matrix. Therefore, in particular, all powers A" are stochastic matrices. We now define an important class of stochastic matrices whose properties shall be investigated subsequently. 1- A stochastic matrix P is said to be regular if all the entries of some power Pm are positive. +) 0 Example 2.2: 1 The stochastic matrix A = is regular since is positive in every entry. 1 Example 2.3: Consider the stochastic matrix A = 11 A2 = 0 j4 (+ 1 A3 = +J 0 (3 $1' Here 0 A4 = ;( A) In f act every power A'n will have 1 and 0 in the first row; hence A is not regular. FIXED POINTS OF SQUARE MATRICES A non-zero row vector u = (ul,u,,. . . ,u,) is called a fixed point of a square matrix A if u is left fixed, i.e. is not changed,- when multiplied by A : uA = u We do not include the zero vector 0 as a fixed point of a matrix since it is always left fixed by every matrix A : OA = 0. (::) . Then the vector u = (2, -1) is a fixed point Example 3.1: Consider t h e matrix A = of A . For, Example 3.2: Suppose u is a fixed point of a matrix A : zcA = u. W e claim t h a t every scalar multiple of u , say hti, is also a fixed point of A . For, (xu)A = k(zcA) = xu Thus, in particular, the vector 2u = (4, -2) t h e example above: is a fixed point of the matrix A of We state the result of the preceding example as a theorem. Theorem 20.2: If u is a fixed vector of a matrix A , then for any real number h # 0 the scalar multiple XU is also a fixed vector of A . 235 MARKOV CHAINS CHAP. 201 FIXED POINTS AND REGULAR STOCHASTIC MATRICES The main relationship between regular stochastic matrices and fixed points is contained in the next theorem whose proof lies beyond the scope of this text. Theorem 20.3: Let P be a regular stochastic matrix. Then: (i) P has a unique fixed probability vector t , and the components of t are all positive; (ii) the sequence P, P2,P 3 , .. . of powers of P approaches the matrix T whose rows are each the fixed point t ; (iii) if p is any probability vector, then the sequence of vectors pP,pP2,pP3, . . . approaches the fixed point t. Note: P" approaches T means that each entry of Prl approaches the corresponding entry of T,and pPn approaches t means that each component of pPn approaches the corresponding component of t. Example 4.1: Consider the regular stochastic matrix P = vector t = (5, 1 - 2) such that tP = t ( +). 0 1 We seek a probability Multiplying the left side of the above matrix equation, we obtain Thus t = 1= (Q,+) is the unique fixed probability vector of P. By Theorem 20.3, the sequence P , P 2 , P 3 , . . . approaches the matrix T whose rows are each the vector t: (4, 4) T = (i :) ( = *33 *67) .33 .67 We exhibit some of the powers of P to indicate the above result: Example 4.2: Find the unique fixed probability vector of the regular stochastic matrix 0 Method 1. 1 We seek a probability vector t = 0 (2,Y,1 - 2 = - y) such that tP = t: (x,y,l-z-y) Multiplying the left side of the above matrix equation and then setting corresponding components equal to each other, we obtain the system 236 MARKOV CHAINS 21 - l 2 z -2 l, Y [CHAP. 20 3x+y = 1 = x 2 = 1 Z + ~ - + Z - & = y z - 3 ~ = -1 or y = 1-x-y z+2y 5 or y = g = 1 Thus t = (&,$, 8) is the unique fixed probability vector of P. Method 2. We first seek a n y fixed vector u = (2,y,z) of the matrix P: 0 1 0 $2 = (z, Y,2 ) or Z + L Z 2 = x = y u = z We know t h a t the system h a s a non-zero solution; hence we can arbitrarily assign a value to one of the unknowns. Set z = 2. Then by the first equation x = 1, and by the third equation y = 2. Thus U. = ( 1 , 2 , 2 ) is a fixed point of P . But every multiple of u is a fixed point of P ; hence multiply u by 6 to obtain the required $). fixed probability vector t = QZL= (i,?, MARKOV CHAINS We now consider a sequence of trials whose outcomes, say, X , , X,, . . . , satisfy the following two properties: u2, . . .,am} called the state space (i) Each outcome belongs to a finite set of outcomes {a,, of the system; if the outcome on the nth trial is a l , then we say that the system is in state a?at time n or at the nth step. (ii) The outcome of any trial depends at most upon the outcome of the immediately preceding trial and not upon any other previous outcome; with each pair of states (a,,aj) there is given the probability p i j that uj occurs immediately after at occurs. Such a stochastic process is called a (finite) Alarkov chain. The numbers p,,, called the transifion probabilities, can be arranged in a matrix * * P12 P = .................. called the transition matrix. Thus with each state al there corresponds the ith row (pi,,pi2, . . .,pl,J of the transition matrix P; if the system is in state ui,then this row vector represents the probabilities of all the possible outcomes of the next trial and so it is a probability vector. Accordingly, Theorem 20.4: The transition matrix P of a Markov chain is a stochastic matrix. Example 5.1: A man either drives his car or takes a train to work each day. Suppose h e never takes the train two days in a row; but if he drives to work, then the next day he is just as likely t o drive again as h e is t o take the train. The state space of the system is { t (train), d (drive)}. This stochastic process is a Markov chain since the outcome on any day depends only on what happened the preceding day. The transition matrix of t h e Markov chain is t d CHAP. 20) 237 SIARKOV CHAINS The first row of the matrix corresponds to the fa c t t h a t he never takes the t r ai n two days in a row and so he definitely will drive the day after he takes the train. The second row of the matrix corresponds t o the fa c t tha t the day a fte r he drives he will drive o r take the train with equal probability. Example 5.2: Three boys A , B and C a re throwing a ball t o each other. A always throws the ball t o B and B always throws the ball to C; but C is just as likely to throw the ball to B as to A . Let X, denote t h e n t h person to be thrown t h e ball. The sta te space of t h e system is ( A ,B , C}. This is a Markov chain since the person throwing the ball is not influenced by those who previously had the ball. The transition matrix of the Markov chain is A B C B The first row of the matrix corresponds to the fa c t tha t A always throws the ball to B. The second row corresponds t o the fact tha t B always throws the ball to C. The last row corresponds to the fa c t tha t throws the ball t o A or B with equal probability (and does not throw it to himself). c Example 5.3: A school contains 200 boys and 150 girls. One student is selected after another t o take a n eye examination. Let X, denote the sex of the n t h student who takes the examination. The state space of the stochastic process is { m (male), f (female)}. However, this process is not a Markov chain since, for example, the probability t h a t the third person is a girl depends not only on the outcome of the second trial b u t on both the first and second trials. Example 5.4: (Random waIk with reflecting barriers.) A man is a t a n integral point on the x-axis between the origin 0 and, say, the point 5. H e takes a unit step to the r i g h t with probability p o r to the left with probability g = 1- p , unless he is at t h e origin where he takes a step to the right to 1 o r at the point 5 where he takes a step t o the left to 4. Let X, denote his position a fte r n steps. This is a Markov chain with state space {uo,a,, a,, a,, a,, a,) where a, means t h a t the man is a t the point i. The transition matrix is a0 a2 a1 a3 a4 a5 Each row of the matrix, except the first and last, corresponds to the fa c t tha t the man moves from sta te aito sta te a, + l with probability p or back to state ai-l with probability q = 1 - p . The first row corresponds to the fa c t t h a t the man must move from s t ate a. to sta te al,and the la st row that the nian must move f r o m state a5 to s t at e a,. HIGHER TRANSITION PROBABILITIES The entry p l j in the transition matrix P of a Markov chain is the probability that the system changes from the state a[ to the state a, in one step: a, acl. Question: What is the probability, denoted by pi;), that the system changes from the state at t o the state alin exactly n steps: + a k , + * * * + ak,-l a, U, -+ + The next theorem answers this question; here the pi;) are arranged in a matrix the n-step transition matrix: called 238 MARKOV CHAINS Theorem 20.5: [CHAP. 20 Let P be the transition matrix of a Markov chain process. Then the = P". n-step transition matrix is equal to the nth power of P: P(") Now suppose that, a t some arbitrary time, the probability that the system is in state ai is 11, ; we denote these probabilities by the probability vector p = (p,, p2, . . . , p,,,) which is called the probability distribution of the system a t that time. In particular, we shall let p'0' = (@', p y , . . .)p:;)) denote the initial probability distribution, i.e. the distribution when the process begins, and we shall let (n) = ( P y , P?), * * .> P,, ) P(ll) denote the nth s t e p probability distribution, i.e. the distribution after the first n steps. The following theorem applies. Theorem 20.6: Let P be the transition matrix of a Markov chain process. If p = ( p J is the probability distribution of the system a t some arbitrary time, then pP is the probability distribution of the system one step later and pPn is the probability distribution of the system n steps later. In particular, *(I) Example 6.1 : = p<o'p p )= p"'p f 9 @3) = p(2'p, . . . ) p(n) = p(0)pn Consider the Markov chain of Example 5.1 whose transition matrix is t d Here t is the sta te of taking a tra in to work and d of driving t o work. ample 4.1, By Ex- Thus t h e probability t h a t the system changes from, say, state t t o state d in exactly 4 steps is i.e. p $ ) = #. Similarly, p j : ) = #, = and p(4) = 11. 8, dd 16 Now suppose tha t on the first day of work, the man tossed a f a i r die and drove to work if and only if a 6 appeared. I n other words, p(O) = (&&) is the initial arobabilitv distribution. Then is the probability distribution a fte r 4 days, i.e. piq) = Example 6.2: and p d( 4 ) = 96' Consider the Markov chain of Example 5.2 whose transition matrix is A B C Suppose C was the first person with the ball, i.e. suppose initial probability distribution. Then 0 p"' = p'0'P = (0,0,1) 1 0 p(0) = ( O , O , 1) is the CHAP. 201 MARKOV CHAINS 239 Thus, a f t e r three throws, the probability t h a t A has the ball is &, t h a t B h a s the = 1 and p y ) = ball is $ and t h a t C has the ball i s $: pa31 = $, ):p fr. Example 6.3: Consider the random walk problem of Example 5.4. Suppose the man began a t the point 2; find the probability distribution a f t e r 3 steps and a f t e r 4 steps, i.e. ~ ( 3 and pC4). Now p(0) = p“’ p‘2’ p‘3’ p(4) ( O , O , 1,0, 0,O) is t h e initial probability distribution. Then = p(0’P = ( 0 , q , 0,p , 0 , 0 ) = p“’P = ( q ! , 0 , 2pq, 0 , p2, 0 ) = p(”P = ( 0 , q L 2p2q, 0 , 3p2q, 0,p 3 ) = p ( 3 ) ~= ( a 3 2 p q 2 , 0 , pq2 2p3q 3 ~ 2 q 2 0, , 3 ~ 3 q - + + + +9 , o ) Thus a f t e r 4 steps he is a t , say, the origin with probability q3 + 2p2q2. STATIONARY DISTRIBUTION OF REGULAR MARKOV CHAINS Suppose that a Markov chain is regular, i.e. that its transition matrix P is regular. By Theorem 20.3, the sequence of n-step transition matrices Pn approaches the matrix T whose rows are each the unique fixed probability vector t of P ; hence the probability p:;) that u3occurs for sufficiently large n is independent of the original state a, and it approaches the component t j of t. In other words, Theorem 20.7: Let the transition matrix P of a Markov chain be regular. Then, in the long run, the probability that any state u3 occurs is approximately equal to the component t3 of the unique fixed probability vector t of P . Thus we see that the effect of the initial state or the initial probability distribution of the process wears off as the number of steps of the process increase. Furthermore, every sequence of probability distributions approaches the fixed probability vector t of P , called the stationary distribution of the Markov chain. Example 7.1: Consider t h e Markov chain process of Example 5.1 whose transition matrix is t d t P = d (& 0 1 4) (4,g). By Example 4.1, the unique fixed probability vector of the above matrix is Thus, in t h e long run, the man will take t h e train to work of t h e time, and drive to work the other 8 of the time. 4 Example 7.2: Consider t h e Markov chain process of Example 5.2 whose transition matrix is A P = B C B By Example 4.2, the unique fixed probability vector of t h e above matrix is 3, g). Thus, in the long run, A will be thrown the ball 20% of the time, and B and C 40% of the time. (&, ) [CHAP. 20 I I A R K O V CHAINS 240 ABSORBING STATES A state ut of a Markov chain is called absorbing if the system remains in the state ai once it enters there. Thus a state azis absorbing if and only if the ith row of the transition matrix P has a 1 on the main diagonal and zeros everywhere else. Example 8.1: Suppose the following matrix is the transition matrix of a Markov chain: a1 a2 a5 a4 a3 The states a? and a 5 a r e each absorbing, since each of the second and fifth rows h a s a 1 on the main diagonal. Example 8.2: (Random walk with absorbing barriers.) Consider the random walk problem of Example 5.4, except now we assume t h a t the man remains a t either endpoint whenever he reaches there. This is also a RIarkov chain and t h e transition matrix is given by an ai a, a3 a4 a5 a1 “I‘ a4 a5 \ q o O O p O o O o O\ o o o o q o 0 0 0 0 0 ”I 1 W e call this process a random walk with absorbing barriers, since the a. and I n this case, phn) denotes t h e probability t h a t the man reaches the s t a t e a,, on o r before t h e izth step. Similarly, p p ) denotes t h e probability t h a t he reaches the a5 on o r before the n t h step. a5 a r e absorbing states. Example 8.3: A player has, say, z dollars. H e bets one dollar a t a time and wins with probability p and loses with probability q = 1 - p . The game ends when he loses all his money, i.e. h a s 0 dollars, or when he wins 3 - x dollars, i.e. h a s N dollars. This game i s identical to the random walk of the preceding example except t h a t here t h e absorbing barriers are at 0 and N . Example 8.4: A man tosses a f a i r coin until 3 heads occur in a row. Let X , = k if, a t t h e itth trial, t h e l a s t tail occurred a t t h e (n- k)-th trial, i.e. X , denotes the longest string of heads ending a t the nth trial. This is a RIarkov chain process with state space {ao, al, a2, as}, where a, means t h e string of heads h a s length i. The transition matrix is a, u1 a2 a3 a0 a1 a2 a3 + + o o ;i f ;) Each row, except the last, corresponds to t h e f a c t t h a t a string of heads is either broken if a tail occurs or i s extended by one if a head occurs. The l a s t line corresponds t o the f a c t t h a t t h e game ends if three heads a r e tossed in a row. Note t h a t a3 is a n absorbing state. CHAP. 201 241 MARKOV CHAINS Let ui be an absorbing state of a hlarkov chain with transition matrix P. Then, for i # j , the n-step transition probability p:;) = 0 for every n. Accordingly, every power of P has a zero entry and so P is not regular. Thus: Theorem 20.8: If a stochastic matrix P has a 1 on the main diagonal, then P is not regular (unless P is a 1x 1 matrix). Solved Problems PROBABILITY VECTORS AND STOCHASTIC MATRICES 20.1. Which vectors are probability vectors ? (i) u = (&, 0 , -&, 4,Q), (ii) v = (4,0, ft, +, h), (iii) w = (4,0 , 0, +,4). A vector is a probability vector if its components a r e non-negative and their sum is 1. (i) ZL is not a probability vector since its third component is negative. (ii) v is not a probability vector since the sum of the components is gre a te r tha n 1. (iii) PL' is a probability vector since its components a r e non-negative and their sum is 1. 20.2. MuItiply each vector by the appropriate scalar to form a probability vector: (i) (2, 1, 0, 2, 3), (ii) (4, 0, 1, 2, 0, 5), (iii) (3, 0, -2, l), (iv) (0, 0, 0, 0, 0). (i) + + The sum of the components i s 2 - 1 0 3 + 2 = 8; hence multiply the vector, i.e. each Q,0, $, 8). component, by & to obtain the probability vector (ii) The sum of the components is 4 component, by (t, + 0 + 1 + 2 + 0 + 5 = 12; t o obtain the probability vector hence multiply the vector, i.e. each (g, 0, &,Q,0 , A). (iii) The first component is positive and the third is negative; hence i t is impossible to multiply the vector by a scalar to form a vector with non-negative components. Thus no scalar multiple of the vector is a probability vector. (iv) Every scalar multiple of t h e zero vector is the zero vector whose components add to 0. Thus no multiple of the zero vector is a probability vector. 20.3. Find a multiple of each vector which is a probability vector: (i) (g, $, 0, 2, g), (ii) (0, 3, 1, Q , $). In each case, first multiply each vector by a scalar so that the fractions are eliminated. (i) F i r st multiply the vector by 6 to obtain ( 3 , 4 , 0 , 1 2 , 5 ) . Then multiply by 1/(3 & t o obtain (&,&,O,$,&) which is a probability vector. + 4 + 0 + 12 + 5) = (ii) F i r st multiply the vector by 30 t o obtain (0,20,30,18,25). Then multiply by 1/(0 -t20 20 30 18 25 18 25) = t o obtain ( 0 , ~3. ' 93'93) which is a probability vector. + & + 30 + 20.4. Which of the following matrices are stochastic matrices? (i) A is not a stochastic matrix since it is not a square matrix. (ii) B is not a stochastic matrix since the sum of the components in the last row is greater tha n 1. (iii) C is a stochastic matrix. (iv) D is not a stochastic matrix since the entry in the first row, second column is negative. 242 MARKOV CHAINS 20.5. Let A = (1: 1: 1:) az bz [CHAP. 20 be a stochastic matrix and let u = c, ( U I , U P , U ~ )be a proba- bility vector. Show that uA is also a probability vector. (nt i' :I\ b, ZLA= (u1, u g , 243) c1 = (ulal + + u3a3,u l b , + u2bZ + u2a2 + uzc~ ~ $ 3 , ulcl+ ~ b 3 ~ 3 ) Since the ui, a,, bi and cl a r e non-negative and since the products and sums of non-negative numbers a r e non-negative, the components of wl a r e non-negative a s required. Thus we only need t o show t h a t t h e sum of the components of uA is 1. Here we use the f a c t t h a t z i 1 + u 2 + u 3 , a, b , cl, u 2 b, c2 and u3 b , c3 a r e each 1: + + + + ~ 1 + ~~ 12 + + + + + + ~ 3 b 3+ + + + b, + + + b, + c,) + + + + u2.l + u , * l = + u 2 + UQ = 1 ~2 ~ 23 ~ 3~ ~ l b uzb, l = zll(~1 = Zll*l ~ ~~ 1 ~ 1 ) 12 ~~ 23 Z L ~ ( U ~b 3 ~ 3 ~ 3 ) u1 If A = (aij) is a stochastic matrix of order n and u = (ul,u,, , . . ,uJ is a probability vector, then uA is also a probability vector. 20.6. Prove: I: The proof is similar t o t h a t of the preceding problem for the case n = 3 : [::: :1 ..' ..* ................. a12 . . ., u,) ZLA = ( ~ 1 ~, 2 , ... an1 ann + uza21+ . - + unanl, ula12+ u2a2,+ = (ulall * * + * + unaR2,. . ., ulaln+ uZa2,,+ - + unarm) * 9 Since the uiand aZja r e non-negative, the components of uA a r e also non-negative. Thus we only need to show t h a t t h e sum of the components of uA is 1: u1a12 u2a22 * u,an2 * ulaln uZazn * * * u,a,, ~~u~~ z ~ ~. u ununl ~ ~ - u,(all+ a12 . ' ' un(anl a,, * * * a,,) * * * Ul,) u,(azl a22 * * * a,,) + - + 2L1'1 + + + u2.l + + + + + * ' . + + - + + + + + + + + + + + + + + + Un'l = u,+u,+ ' . . +u, = 1 20.7. Prove Theorem 20.1: If A and B are stochastic matrices, then the product A B is a stochastic matrix. Therefore, in particular, all powers An are stochastic matrices. The ith-row si of t h e product matrix A B is obtained by multiplying the ith-row ri of A by the matrix B: si = riB. Since each ri is a probability vector and B is a stochastic matrix, by the preceding problem, si is also a probability vector. Hence A B is a stochastic matrix. 20.8. Prove: Let p = ( p l , p 2 , . . .,p,) be a probability vector, and let T be a matrix whose rows are each the same vector t = ( t l ,t,, . . . , tRJ. Then pT = t . Using the f act t h a t p1 + p2 + . . . + p m = 1, we have i t l t, * . * tm\ 243 MARKOV CHAINS CHAP. 201 REGULAR STOCHASTIC MATRICES AND FIXED PROBABILITY VECTORS 20.9. Find the unique fixed probability vector of the regular stochastic matrix A = What matrix does A n approach? 2 2 We seek a probability vector f = (Y, 1 - 2) such t h a t t A = f: Multiply the left side of the above matrix equation and then set corresponding components equal to each other to obtain the two equations = $x++-+z 2, Solve either equation to obtain x = Q . Thus t = $Z+&-frx (3,+) = 1-2 is the required probability vector. Check the answer by computing the product tA: The answer checks since t A = t . The matrix An approaches the matrix T whose rows a r e each the fixed point t: T = (f i). z 20.10. (i) Show that the vector u = ( b , a ) is a fixed point of the general 2 x 2 stochastic matrix P = ('I" lab). (ii) Use the result of (i) to find the unique fixed probability vector of each of the following matrices: 1--a (i) UP = ( " a ) ( a 1-b ) = (b-ab+ab, a b f a - a b ) = ( b , a ) = u. (ii) By (i), z t = ( 1 , i ) i s a fixed point of A . Multiply u by 3 to obtain the fixed point (3,2) of A which has no fractions. Then multiply ( 3 , 2 ) by 1/(3 2) = 6 to obtain the required unique fixed probability vector (#, Q). + By (i), u = ($, 4) is a fixed point of B. Multiply u by 6 to obtain the fixed point ( 4 , 3 ) , t o obtain the required unique fixed probability vector and then multiply by 1 / ( 4 + 3 ) = 3 cq, $). Hence (8,3) and the probability vector By (i), ZL = (.8, .3) is a fixed point of C. a r e also fixed points of C. 20.11. Find the unique fixed probability vector of the regular stochastic matrix P = 0 Method 1. We seek a probability vector t = (2,y, 1 0 1 - x - y) such t h a t tP = t: = (Z,Y,1---Y) 0 1 0 ( A , 6) MARKOV CHAINS 246 [CHAP. 20 R L P R = .8 .2 L (.6 A ) The probability distribution for the first trial is p = ( . 5 , - 5 ) . To compute the probability distribution for the next step, i.e. the second trial, multiply p by the transition matrix P: = (.7, .3) (.5, .5) (*8 *2) .6 .4 Thus on the second trial he predicts that 70% of the mice will go right and 30% will go left. To compute the probability distribution for the third trial, multiply that of the second trial by P : (.7, .3) ( *8 ‘2) .4 .6 = (.74, .26) Thus on the third trial he predicts that 74% of the mice will go right and 26% will g o left. We assume t h a t t h e probability distribution for the thousandth trial is essentially the stationary probability distribution of the Markov chain, i.e. the unique fixed probability vector t of the transition matrix P . By Problem 20.10, u = (.6, .2) is a fixed point of P and SO t = ($,$) = (.75,.25). Thus he predicts that, on the thousandth trial, 75% of the mice will go to the right and 25% will go to the left. 20.17. Given the transition matrix p(O) = (i) P = (~*) 1 0 with initial probability distribution (i,+). Define and find: (i) pi:), (ii) p C 3 ’ , (iii) p i 3 ) . pa:) is the probability of moving from state u2 to state a l in 3 steps. It can be obtained from the 3-step transition matrix P3; hence first compute P3: p2 Then p;) (ii) =(; l)J =(; ;) p3 is the entry in the second row first column of P3: p:) = 2. 8 is the probability distribution of the system after three steps. It can be obtained by successively computing p ( 1 ) , p ( 2 ) and then ~ ( 3 ) : p(3) However, since the 3-step transition matrix P3 has already been computed in (i), p ( 3 ) can also be obtained as follows: (iii) p r ) is the probability t h a t the process is in the state a2 after 3 steps; it is the second component of the 3-step probability distribution ~ ( 3 ) : p?) = L 12‘ 247 MARKOV CHAINS CHAP. 201 0” 20.18. Given the transition matrix P = 2 i) and the initial probability distribution (4 4 0 1 0 p(O) = (3, O,&). Find: (i) and p t i ) , (ii) p(4)and p?), approaches, (iv) the matrix that Pi* approaches. (i) ‘. Fi r st compute the 2-step transition matrix P2: 0 1 1 0 1 = pa Then p::) = (ii) To compute 4 ”) (i i 1 3 - - = t h e 2-step transition matrix P2 and the initial probability distribution ~ ( ~ use 1 , p(0)p2 p(2) pi41 f 1 0 1 0 0 1 0 a 2 and pi:) = 0, since these numbers refer to the entries in P2. p(0J: Since ; ;)(: (iii) the vector that p(O)P” = (1 2 3’ 3’ is the third component of 0) and p(4) = p(2)Pz = (1 14’ 12’ A) p;) = Q. p(4), (iii) By Theorem 20.3, p(o)Pn approaches the unique fixed probability vector t of P. To obtain t , first find an y fixed vector u = (x,y, z ) : ’ ”\ 4 (O4 3 (%2/* 0 1 3y = (x,y, z ) = x = +x++y+z or y 0 Find any non-zero solution of the above system of equations. Set z = 1; then by the third equation x = 2, and by the first equation y = 4. Thus u = (2,4,1) is a fixed point of P and so t = (+,+,+). In other words, p(O)Pn approaches ($,$,$). (iv) Pn approaches the matrix T whose rows a r e each the fixed probability vector of P ; hence 2 Pn approaches 4 (! - - _ I 1 I 7 r 7 $ . 20.19. A salesman’s territory consists of three cities, A , B and C. He never sells in the same city on successive days. If he sells in city A , then the next day he sells in city B. However, if he sells in either B or C, then the next day he is twice as likely to sell in city A as in the other city. In the long run, how often does he sell in each of the cities? The transition matrix of t h e problem is a s follows: A p = A B C 0 1 0 * V ; ) c z 3 We seek the unique fixed probability vector t of the matrix P. First find a n y fixed vector u = (x,y,z): (E 1 $ 0 v,4 (2, 1 0 $y = (% Y,4 01 + 32 s+$z = = 2 y ly = z Set, say, z = 1. Then by t h e third equation y=3, a nd by the first equation x =+. Thus 8 u = ( ~ , 3 , 1 ) . Also 3u = (8,9,3) is a fixed vector of P . Multiply 3 2 ~by 1/(8+ 9 3) = $ to obtain the required fixed probability vector t = (z 5’ 20’ ”) 20 = (.40, .45, J 5 ) . Thus in the long r u n he sells 407;. of the time in city A , 45% of the time in B and 15% of the time in C . + 248 MARKOV CHAINS [CHAP. 20 20.20. There are 2 white marbles in urn A and 3 red marbles in urn B . At each step of the process a marble is selected from each urn and the two marbles selected are interchanged. Let the state a,of the system be the number i of red marbles in urn A . (i) Find the transition matrix P . (ii) What is the probability that there are 2 red marbles in urn A after 3 steps? (iii) In the long run, what is the probability that there are 2 red marbles in urn A ? (i) There a r e three states, ao, and a2 described by the following diagrams: a, B A B A A a1 a0 B a2 If the system is in state a,, then a white marble must be selected from urn A and a red marble from u r n B , so the system must move to state a,. Accordingly, the first row of the transition matrix is (0,1,0). Suppose the system is in state a,. It can move to state a. if and only if a red marble is selected from u r n A and a white marble from urn B; the probability of t h a t happening is 1.1 = 1. Thus p l o = Q. The system can move from state al to a, if and only if a white 2 3 6 marble is selected from u r n A and a red marble from urn B ; the probability of t h a t happening Accordingly, the probability t h a t the system remains in state is 12 . 23 = 1 a . Thus p12= Thus the second row of the transition matrix is (Q, (Note a , is p , , = 1- Q - Q = t h a t p , , can also be obtained from the f a c t t h a t the system remains in the state a , if either a white marble is drawn from each urn, probability 4 - i = Q, o r a red marble is drawn from each urn, probability * -g = thus p11 = & f = 4. 4. 4,g). 4 4; 4 4.) Now suppose the system is in state a2. A red marble must be drawn from urn A . If a red marble is selected from u rn B , probability Q, then the system remains in state a,; and if a white marble is selected from urn B , probability $, then the system moves to state al. Note t h at the system can never move from sta te a2 t o the state uo. Thus the third row of the Tha t is, transition matrix is (0,+, i). a, a1 a2 a0 P = a1 a2 (; ;) 1 3 3 (ii) The system began i n state ao, i.e. p(O) = (l,O,O). Thus: p"' = pco'p = ( 0 , 1, O), p(2) = pcup = (1 6 , ~, 1 g), p'3' = p(2)P = (L 12' 36' ") 18 5 Accordingly, t h e probability t h a t there a r e 2 red marbles in urn A a f t e r 3 steps is g . (iii, We seek the unique fixed probability vector t of the transition matrix P . F i r s t find a n y fixed vector u = (x,y,z): Qy = x 0 1 0 (5,Y, 2) J Q Q 4 (o g = (x,Y, 4 or x+&y+Qz = y &y+Qz = z Set, say, x = l . Then by the first equation y = 6 , and by the third equation x = 3 . Hence 3) = & to obtain the required unique fixed probazi = (1,6,3). Multiply u by 1/(1+6 bility vector t = (.1,.6, .3). Thus, in the long run, 30% of the time there will be 2 red marbles in u r n A . Note that the long r u n probability distribution is the same as if the five marbles were placed in a n u r n and 2 were selected a t random t o put into urn A . + 20.21. A player has $2. He bets $1 at a time and wins with probability 8. He stops playing if he loses the $2 or wins $4. (i) m'hat is the probability that he has lost his money at the end of, a t most, 5 plays? (ii) What is the probability that the game lasts more than 7 plays? 249 MARKOV CHAINS CHAP. 201 This is a random walk with absorbing barriers at 0 and 6 (see Examples 8.2 and 8.3). The transition matrix is ::/ 4 * a2 1 0 0 0 0 a P = ~ a as a with initial probability distribution (i) , o I 3 0 0 a6 0 0 0 + o 3 u5 0 0 \ 0 0 ‘ o o + o ’ o i I 0 o p(0) o 0 4 6 g a4 a3 0 o 0 0 + o 0 O o 0 + 0 0 + 0 o + , 0 1 / = (0, 0 , 1, O , O , 0,O) since he began with $2. We seek p;5), the probability t h at the system is in sta te a. a fte r five steps. 5th step probability distribution ~ ( 5 ) : +, 0, 4, 0, 0, 0 ) p“’ p‘2) p‘3’ = p‘0’P = (0, p‘4’ = p‘3’P = p “ ’ P = (&O, *, 0, *, 0, 0 ) = p‘2’P = (*, 0, #, 0, &, 0) p‘5’ = p‘4’P A) = (#, 0 , A, 0, *, 0, = (#, 0,&, 0 1 L) A, 4, Thus p i s ) , the probability t h at he ha s no money a f t e r 5 plays, is (ii) Compute p(7): p(6) = p(5)p= (29, 0, 64 A,0’ 13 0’ 1) 8 ’ 64’ Compute the ’ 8’ 16 8. p ( 6 ) P = (2 2 0 27 0 13 i) 6 4 ’ 6 4 ’ ’ 128’ ’ 128’ 6 p(7) The probability that t h e g ame lasts more tha n 7 plays, i.e. t h a t the system i s not in state a6 af t er 7 steps, i s = &+ + a, or 20.22. Consider repeated tosses of a fair die. g. Let Xn be the maximum of the numbers occurring in the first n trials. (i) Find the transition matrix P of the Markov chain. Is the matrix regular? (ii) Find p C 1 ) the , probability distribution after the first toss. (iii) Find p ( z ) and p C 3 ) . (i) The state space of t h e Markov chain is { l , 2,3,4,6,6}. 1 P = 2 3 The transition matrix is 4 5 6 o o o + g g 0 0 0 0 g Q We obtain, f o r example, the third row of the matrix as follows. Suppose the system is in state 3, i.e. the maximum of t h e numbers occurring on the first n tria ls is 3. Then the 3 system remains in state 3 if a 1, 2, o r 3 occurs on the (n 1)-st trial; hence p33 = 6. On the other hand, the system moves to s t at e 4 , 5 or 6, respectively, if a 4 , 5 o r 6 occurs on the (n4-1)-st trial; hence p34 = p,, = p S 6 = Q. The system can never move to state 1 or 2 since a 3 h a s occurred on one of the trials; hence p31= p 3 2 = 0. Thus the third row of the transition matrix is The other rows a r e obtained similarly. (0, O,:, + i,i,i). The matrix is not regular since s ta te 6 is absorbing, i.e. there is a 1 on the main diagonal in row 6. (ii) On the first toss of the die, the state of the system XI is the number occurring; hence p“’ = (Q, Q, Q, Q, Q, 9). (iii) #2’ = p“’ p = (23 s I 2 “)* p ‘ 3 ) = 369 361 36’ 36, 3 6 ’ 36 p(2’ p = (L 119 37 61 91). 216’ 216’ 216’ 216’ 216’ 216 250 MARKOV CHAINS [CHAP. 20 20.23. Two boys b , and b, and two girls gl and g, are throwing a ball from one to the other. Each boy throws the ball t o the other boy with probability 8 and to each girl with probability 4. On the other hand, each girl throws the ball to each boy with probability 4 and never t o the other girl. In the long run, how often does each receive the ball? This is a Markov chain with state space { b l , b,, g,, g 2 } and transition matrix bl P = b2 [;1 b, g1 b2 gz 0 8 t z 91 '4 92 ;) h We seek a fixed vector u = (2,u,z , w)of P : ( 2 ,y, x , w)P = components of UPequal to u t o obtain the system + y + + z + a 2w = 5 +5++z+$w = u,x , w). (2, Set the corresponding u hx+&y = x $x+&y = w We seek any non-zero solution. Set, say, x = 1; then w = 1, 5 = 2 and y = 2. Thus u = ( 2 , 2 , 1 , 1 ) and so the unique fixed probability of P is t = ($,Q,&,&). Thus, in the long run, each boy receives the ball Q of the time and each girl Q of the time. 20.24. Prove Theorem 20.6: Let P = (p,,) be the transition matrix of a Markov chain. If p = ( p J is the probability distribution of the system at some arbitrary time k , then pP is the probability distribution of the system one step later, i.e. at time k 1; hence pPn is the probability distribution of the system n steps later, i.e. at time ) p ( l )P , . . . and also pen) = p ( 0 ) P R . k n. In particular, pc1)= p(O)P,p C 2 = + + Suppose the state space is { a l ,u2, . . .,am}. The probability that the system is in state a, a t time k and then in state a, a t time k 1 is the product pjp,,. Thus the probability that the system is in state at a t time k f 1 is the sum m + PlPlt +~ 2 ~ 2 1 + Thus the probabiilty distribution at time k + Pmpmi + 1 is m = jg,P ~ P , , m \ P* However, this vector is precisely the product o f the vector p = (pi)by the matrix P = ( p i j ) : p* = p P . 20.25. Prove Theorem 20.5: Let P be the transition matrix of a Markov chain. Then the n-step transition matrix is equal to the nth power of P: Pn) = Pn. Suppose the system is in state ai at, say, time k. We seek the probability 13:;) that the system is in state aj a t time k + n . Now the probability distribution of the system at time k , since the system is in state ai,is the vector ei = (0,. . ., 0,1,0,. . ., 0) which has a 1 at the ith position and zeros everywhere else. By the preceding problem, the probability distribution a t time k + n . is the product eiPn. But ei Pn is the ith row of the matrix Pll. Thus P!?) i s the jth component of the ith row of Pn, and so P ( n ) = Pn. MISCELLANEOUS PROBLEMS 20.26. The transition probabilities of a Markov chain can be represented by a diagram, called a transition diagram, where a positive probability p i j is denoted by an arrow from the state ui to the state a j . Find the transition matrix of each of the following transition diagrams: CHAP. 201 (i) MARKOV CHAINS 251 Note first t h a t the s t at e space is {al,az ,u3$ and so the transition matrix is of the form a1 P a2 a3 = a3 The ith row of the matrix is obtained by finding those arrows which emanate from a, in the diagram; the number attached to the arrow from a, to aJ is the j t h component of the ith row. Thus the transition matrix is a, a2 a 3 P = (ii) The state space is {a1,a2,a3,a4}.The transition matrix is a1 P a2 a4 a3 = a3 20.27. Suppose the transition matrix of a Markov chain is as follows: a1 a2 a3 a4 0 0 a4 Is the Markov chain regular? Note t h a t once the system enters the state a , o r the state a2, then i t can never move to s t a t e a3 or state a4, i.e. t h e system remains in the sta te subspace { a l , u Z } . Thus, in particular, pi;) = 0 f o r every n and so every power Pn will contain a zero entry. Hence P is not regular. 20.28. Prove: An rn x m matrix A = (alJ)has a non-zero fixed point if and only if the determinant of the matrix A - I is zero, Here I is the identity matrix. The matrix A = (aij) h as a non-zero fixed point if and only if there exists a non-zero vector u = (q, xz, . . . , x m ) such t h a t all a12 am1 am2 ... ................... * " 252 MARKOV CHAINS or (allxl [CHAP. 20 + a21x2+ ... + umlzm,u12x1+ a,,%, + ... + um2xm, . . . . almxl+ a 2 m +~ 2... + ammxm) = (x1, x 2 , . . . . xm) or, equivalently, if there exists a non-zero solution to the homogeneous system + u21x2 + ... + arnlzm = a12x1 + uX2x2+ ... + amzxm = allxl .................................. aln,sl + aXmz2+ *.* + ammxm= + a21x2 + ... + + ( u * 2 - l)s, + + (all-l)xl z1 z2 a1221 * * . amlxm = 0 am2xm = 0 ............................................. almXl + a 2 m ~ 2 + ... + (amm-l ) x n L = x, 0 Now m homogeneous equations in ?n unknowns have a non-zero solution if and only if the determinant of the matrix of coefficients is zero. But the matrix of coefficients of the above homogeneous system of equations is the transpose of the matrix all a12 A-I 10 * *. ... = ................. am1 am2 ’ * ‘ amm all- * * * ... 1 a12 . * * ... .......................... .......... 0 0 *.. aml am, ... amm-l Since the determinant of a matrix is equal to the determinant of its transpose, A has a non-zero fixed point if and only if det(A - I ) = 0. 20.29. Prove: Every stochastic matrix P = ( p J has a non-zero fixed point, Since the rows of the stochastic matrix P a r e probability vectors, the sum of its column vectors is Accordingly, the sum of t h e columns of the matrix P - Z is Thus the columns of t h e matrix P - I a r e dependent (see Page 114). B u t the determinant of a matrix is zero iff its columns a r e dependent; hence de t ( P - I ) = 0 and so, by the preceding problem, P - Z has a non-zero fixed point. 20.30. Suppose m points on a circle are numbered respectively 1, 2, , . .,m in a counterclockwise direction. A particle performs a “random walk” on the circle; it moves one step counterclockwise with probability p or one step clockwise with probability g = 1 - p . Find the transition matrix of this Markov chain. The state space is {1,2, . . . . m}. The diagram to the right below can be used to obtain the transition matrix which appears to the left below. 1 1 m-1 2 3 4 ... rn-2 m-1 P 0 0 ... 0 0 0 P 0 0 0 ... ... 0 P 0 0 Q ................................................... ... 0 0 0 P 0 0 0 0 ... m m-2 ni- 1 0 Q CHAP. 201 MARKOV CHAINS 253 Supplementary Problems PROBABILITY VECTORS AND STOCHASTIC MATRICES 20.31. Which vectors are probability vectors? (i) 20.33. (5,4, -d., 4) (ii) (4, 0, Q, 4.9) (iii) (A, 4, Q, 0, $1. Which matrices are stochastic? REGULAR STOCHASTIC MATRICES AND FIXED PROBABILITY VECTORS 20.34. Find the unique fixed probability vector of each matrix: 20.35. (i) Find the unique fixed probability vector t of P = 0 (iii) What vector does (ii) What matrix does Pn approach? 20.36. Find the unique fixed probability vector t of each matrix: 20.37. (i) Find the unique fixed probability vector t of P = (ii) What matrix does Pn approach? (iii) What vector does (t, 0, +, &)Pn approach? :I. (;: 1 0 (a,$, +)Pn approach? 1; ;,” ; ) * \ 0 + 0 12 I (iv) What vector does (+, O,O, &)Pn approach? 20.38. (i) Given that (ii) Given that t = ( I , O , &) is a fixed point of a stochastic matrix P , is P regular? t = (t,f,f,&) is a fixed point of a stochastic matrix P , is P regular? 20.39. Which of the stochastic matrices are regular? 20.40. Show that (cf + ce + de, uf + b f + ae, ad + bd + bc) P = ( 1-a-b c e is a fixed point of the matrix a 1-c-d f b d 1-e-f ) 254 MARKOV CHAINS [CHAP. 20 MARKOV CHAINS 20.41. A man’s smoking habits a r e as follows. If he smokes filter cigarettes one week, he switches to non-filter cigarettes the next week with probability .2. On the other hand, the probability that he smokes non-filter cigarettes two weeks in succession is .7. In the long run, how often does he smoke filter cigarettes? 20.42. A gambler’s luck follows a pattern. If he wins a game, the probability of winning the next game i s .6. However, if he loses a game, the probability of losing the next game is .7. There is a n even chance t h a t t h e gambler wins the first game. (i) What is the probability t h a t he wins the second game? (ii) Wh at is the probability t h a t he wins the third game? (iii) In the long run, how often will he win? F o r a Markov chain, t h e transition matrix is P = (ii) p ( 0 ) = (&,$). (iv) p i 2 ) ; (v) the vector p(O)Pn approaches; 1 20.43. Find: (i) $:I; (vi) the matrix Pn approaches. 20.44. (ii) p i ; ) ; (iii) For a Markov chain, t h e transition matrix is ~(2); 1 P = and the initial probability disZ tribution is p ( O ) = (T,z,O). 1 1 20.45. with initial probability distribution Find (i) p i t ) , (ii) p;:), T Z (iii) P ( ~ ) , (iv) p y ) . Each year a man trades his car f o r a new car. If he ha s a Buick, he trades it for a Plymouth. If he has a Plymouth, he trades i t for a Ford. However, if he has a Ford, he is just a s likely to trade i t f o r a new Ford as t o trade i t f o r a Buick or a Plymouth. In 1955 he bought his first car which was a Ford. (i) Find the probability t h a t he ha s a ( d ) 1958 Ford. (a) 1957 Ford, ( b ) 1957 Buick, (c) 1958 Plymouth, (ii) In the long run, how often will he have a Ford? 20.46. There a r e 2 white marbles in u r n A and 4 red marbles in urn B. A t each step of the process a marble is selected f r o m each urn, and the two marbles selected a r e interchanged. Let X , be the number of red marbles in u r n A af t er n interchanges. (i) Find the transition matrix P. (ii) W hat i s the probability t h a t there a r e 2 red marbles in urn A a fte r 3 steps? (iii) In the long run, what i s the probability t h a t there a r e 2 red marbles in u r n A ? 20.47. Solve the preceding problem in the case t h a t there a r e 3 white marbles in u r n A and 3 red marbles in u r n B. 20.48. A f a i r coin is tossed until 3 heads occur in a row. Let X, be the length of the sequence of heads ending at the n t h trial. (See Example 8.4.) W h a t is the probability that there a r e a t least 8 tosses of the coin? 20.49. A player has 3 dollars. A t each play of a game, he loses one dollar with probability 3 b u t wins two dollars with probability He stops playing if he h a s lost his 3 dollars or he ha s won at least 3 dollars. &. (i) Find the transition matrix of the Markov chain. (ii) Wh at is the probability t h a t there a r e at least 4 plays to the game? 20.50. The diagram on t h e r i g h t shows f o ur compartments with doors leading from one to another. A y o u s e in a n y compartment is equally likely to pass through each of the doors of the compartment. Find the transition matrix of the Markov chain. CHAP. 201 MARKOV CHAINS 255 20.51. Find the transition matrix corresponding to each transition diagram: 20.52. Draw a transition diagram for each transition matrix: a1 a2 Answers to Supplementary Problems 20.31. Only (iii). 20.32. (i) (3/13,0, 2/13,5/13,3/13) (ii) (8/18,2/18,0, 1/18,3/18,0, 4/18) (iii) (4/45,24/45,6/45,0, 3/45,8/45) 20.33. Only (ii) and (iv). 20.34. (i) (6/11,5/11), (ii) (10/19,9/19), (iii) (5/13,8/13), (iv) 20.35. (i) t = (4/13,8/13,1/13), 20.36. (i) t = (2/9,6/9,1/9), (z,Q) (iii) t = (4/13,8/13,1/13) (ii) t = (5/15,6/15,4/15) 20.37. (i) t = (2/11,4/11,1/11,4/11), (iii) t , (iV) t 20.39. Only (iii). 20.41. 60% of the time. 20.42. (i) 9/20, (ii) 87/200, (iii) 3/7 of the time. 20.43. (i) 9/16, (ii) 3/8, (iii) (37/64,27/64j, (iv) 27/64, (v) (.6,.4), 20.44. (i) 3/8, (ii) l/2, (iii) (7/16,2/16,7/16), (iv) 7/16 20.45. (i) ( a ) 4/9, ( b ) 1/9, ( c ) 7/27, ( d ) 16/27. (ii) 50% of the time 0 1 0 (ii) 3/8 (iii) 215 (vi) (::::) 256 MARKOV CHAINS 1 1 0 10 20.47. (i) P = o\ 1) (ii) 32/81 o a a + \o 0 1 (iii) 9/20 o/ 1 0 0 0 0 0 0 20.52. (i) ~ 0 o % 0 o ~ o 0 ~ 3 t a 0 0 o (ii) [CHAP. 20 Chapter 21 Inequalities THE REAL LINE The set of real numbers, denoted by R, plays a dominant role in mathematics. We assume the reader is familiar with the geometric representation of R by means of the points on a straight line. As shown below, a point, called the origin, is chosen t o represent 0 and another point, usually to the right of 0, to represent 1. Then there is a natural way to pair off the points on the line and the real numbers, i.e. each point will represent a unique real number and each real number will be represented by a unique point. For this reason we refer to R as the real line and use the words point and number interchangeably. -- I -4 Il -3 4 1, -2 I -1 I I I I I 0 1 2 3 4 The real line R POSITIVE NUMBERS Those numbers to the right of 0 on the real line R, i.e. on the same side as 1, are the positive numbers; those numbers to the left of 0 are the negative numbers. The set of positive numbers can be completely described by the following axioms: [P,] If a E R,then exactly one of the following is true: a is positive; a = 0; -a is positive. rPP] If a, b E R are positive, then their sum a + b and their product a - b are also positive. It follows that a is positive if and only if -a is negative. Example 1.1: We show, using only [P,] and [PJ, that the real number 1 is positive. By [PI], either 1 or -1 is positive. If -1 is positive then, by [P2], the product (-l)(-1) = 1 is positive. But this contradicts [P,] which states t h a t 1 and -1 cannot both be positive. Hence the assumption that -1 is positive is false and so 1 is positive. Example 1.2: The real number -2 is negative. For, by the preceding example, 1 is positive and so, by [P2], the sum 1 1 = 2 is positive; hence -2 is not positive, i.e. -2 is negative. + ORDER An order relation in R is defined using the concept of positiveness. 1-1 The real number a is less than the real number b, written a difference b - a is positive. 257 < b, if the 258 [CHAP. 21 INEQUALITIES The following notation is also used: a > b , read a is greater than b, u 6 b , read a is less than o r equal to b, a 2 b, read a is greater than or equal to b, means b < a means a < b or a = b means b 6 a Geometrically speaking, a < b means a is to the left of b on the real line R a > b means a is to the right o f b on the real line R < 5, 2 Example 2.2: A real number 3: is positive iff x > 0, and z is negative iff z x2 is always greater than 0: x2 > 0. Example 2.3: The notation 2 < x < 5 means 2 2 and 5 on the real line. -6 f -3, 4 We refer to the relations <, >, 6 4, 5 > -8 Example 2.1: and < 0. For z f 0, < x and also z < 5; hence x will lie between as inequalities in order to distinguish them < and > as strict inequalities. We now state basic properties about inequalities which shall be used throughout. 1 from the equality relation = . We also shall refer to Theorem 21.1: Let a, b and c be real numbers. (i) The sense o f an inequality is not changed if the same real number is added to both sides: If a < b , then a + c < b + c . If a 6 b, then u + c 6 b + c . (ii) The sense of an inequality is not changed if both sides are multiplied by the same positive real number: If a < b and c > 0, then ac < be. If a 4 b and c > 0, then ac 4 be. (iii) The sense of an inequality is reversed if both sides are multiplied by the same negative number: If a < b and c < 0, then ac > be. If a 6 b and c < 0. then ac be. (FINITE) INTERVALS Let a and b be real numbers such that a < b . Then the set of all real numbers x satisfying a < x < b is called the open interval from a to b a 6 x 4 b is called the closed interval from a to b a < x 6 b is called the open-closed interval from a to b a L x < b is called the closed-open interval from a to b The points a and b are called the endpoints of the interval. Observe that a closed interval contains both its endpoints, an open interval contains neither endpoint, and a n open-closed and a closed-open interval contains exactly one of its endpoints: " 1 a b open interval: a " <x <b c - a b closed interval: (L 5 x 6b CHAP. 211 INEQUALITIES - " a 259 - b <x - - U b d <b The open-closed and closed-open intervals are also referred to as being half-open (or: half -closed). open-closed interval: a f b closed-open interval: a 6 2 INFINITE INTERVALS Let a be any real number. Then the set of all real numbers x satisfying x x > a or x A a is called an infinite i n t e m n l . < a, x 6 a, LINEAR INEQUALITIES I N ONE UNKNOWN Every linear inequality in one unknown x can be reduced to the form a x < b, a x ' b , an:> b or a x k b If a Z 0 , then both sides of the inequality can be multiplied by l l a with the sense of the inequality reversed if a is negative. Thus the inequality can be further reduced to the f o r m x < c , x g c , n . > c or x z c whose solution set is an infinite interval. Example 3.1 : + Consider the inequality 5~ 7 posing), we obtain 5 2 - 22 Multiplying both sides by x Example 3.2: 5 1 + 1. - Adding - 2 2 - 7 to both sides (or: trans- 7 or 3 x 5 -6 + (or: dividing both sides by 3) we finally obtain 6 Consider the inequality 22 22 22 - 4 -2 + 3 < 4 2 + 9. 42 < 9-3 : -2 0 2 Transposing, we obtain or < -22 6 -3 AIultiplying both sides of the inequality by (or: dividing both sides by - 2 ) and is negative, we finally obtain reversing t h e inequality since -+ - x > -3 u -3 0 3 r ABSOLUTE VALUE The absolute value of a real number x, written 1x1, is defined by = 1x1 { x if -n: X A O if x < O that is, if x is non-negative then = x, and if x is negative then 1x1 = -x. Thus the absolute value of every real number is non-negative: 1x1 2 0 f o r every x E R. Geometrically speaking, the absolute value of x is the distance between the point x on the real line and the origin, i.e. the point 0. Furthermore, the distance between any two points a, b E R is la- b( = Ib - a ' . ,-fil = fi Example 4.1: ,-21 = 2, 171 = 7, Example 4.2: ' 3 - 81 = 1-51 = 5 and I8 - 3'l - 151 = 5 1-7, = 7, 260 IN EQ U A LITIES [CHAP. 21 The statement 1x1 < 5 can be interpreted to mean t h a t the distance between x and t h e origin is less than 5; hence z must lie between -5 and 5 on the real line. I n other words, Example 4.3: Ix' < 5 and -5 < x -- have identical meaning and, similarly, 1x1 f 5 and - 5 5 2 6 - - <5 5 -5 n -5 n 5 5 have identical meaning. The graph of the function f(x) = 1x/, i.e. the absolute value function, lies entirely in the upper half plane since f ( x )2 0 for every x E R: -2 --4 t 2 4 t Graph of f(z) = 1x1 The central facts about the absolute value function are the following: Let a and b be any real numbers. Then: (i) la1 0, and la1 = 0 iff a = 0 (ii) -la1 a la1 (iii) lab1 = /a1* lbl Theorem 21.2: (iv) la (v) la + bl + bl 6 2 + la1 lbl la1 - Ibl Solved Problems INEQUALITIES 21.1. Write each statement in notational form: (iv) a is greater than b. (i) a is less than b. (ii) a is not greater than b. (v) a is not less than b. (iii) a is less than or equal to b. (vi) a is not greater than or equal t o b. A vertical or slant line through a symbol designates the opposite meaning of the symbol, (i) a < b, (ii) a 4 b, (iii) a 5 b, (iv) a > b, (v) a 4 b, (vi) a b 21.2. Rewrite the following geometric relationships between the given real numbers using the inequality notation: (iii) x lies between -3 and 7. (i) y lies to the right of 8. (ii) x lies to the left of -3. (iv) w lies between 5 and 1. Recall t h a t a < b means a lies to the left of 8 on the real line: (i) y (ii) z > 8 or < -3. 8 < g. < z and x < 7 or, more concisely, 1 < w < 5. (iii) -3 (iv) -3 < 2 < 7. CHAP. 211 261 I NE Q U A LITIES 21.3. Describe and diagram each of the following intervals: (i) 2 < x < 4, (ii) -1 6 x A 2, (iii) -3 < x 4 1, (iv) -4 (vi) x 2. (i) All numbers greater than 2 and less tha n 4: 6 x < -1, 4 b r l2 2 0 -1, (ii) All numbers greater t h an o r equal to -1 and less tha n or equal to 2: -- (iii) All numbers greater t h an -3 and less tha n or equal to 1: 21.4. ' ' - (vi) All numbers less t h an or equal to 2 : 4, I I , 2 0 Solve and diagram the solution set of each inequality: (i) 4x - 3 L 2x 3, (ii) 3x 2 < 6n: - 7. + (i) Transpose to obtain: or: + 4x - 2x 22 Divide by 2 : 3 6 4 4 +3 (ii) Transpose t o obtain: or: 32 - 6 x -3x - 4, x'3 < < -7 - -3 0 -3 0 2 -9 x > 3 Divide by -3: 3 - r L 3 Note t h a t the inequality is reversed since -3 is negative. 21.5. Solve each inequality and diagram its solution set: (i) 3%-1 4n:+2, (ii) x - 3 > 1 + 3 x , (iii) 2 % - 3 4 5x-9. (i) Transpose to obtain: 3x - 42 -x or: Divide by -1: x 2 3 4 2 ' -3 (ii) Transpose to obtain: or: Divide by - 2 : x - 32 > 1 (iii) Transpose t o obtain: 22 - 5% 6 -9 -2x 2 or : -32: Divide by -3: +1 > < f 4, I 0 3 -2 0 2 -2 0 2 -3 +3 4 -2 4, - +3 -- -6 2 2 2 In each case the inequality was reversed since division w a s by a negative number. 21.6. Solve: 4% + x-2 -< 2x-& 3 Multiply both sides by 12: or: Transpose: or: Divide by -14: +4(~-2)< + 42 - 8 < 6% + 4x - 2 4 ~< 62 6% 2 4~1 2 4~ 1 8-1 -14~< 7 5 > -Q * , , 2 0 * -2 _ ' 1 -2 2 0 ' ' 0 -4 I -2 , -2 (iv) All numbers greater t h an o r equal to -4 and less tha n -1: (v) All numbers greater t h an -1: , > -1, (v) x ' 2 262 [CHAP. 21 INEQUALITIES 21.7. Solve each inequality, i.e. rewrite the inequality so that x is alone between the inequality signs: 12, (iv) -6 A -2s 4 4. (i) 3 L x - 4 5 8, (ii) -1 6 x + 3 6 2, (iii) -9 6 3 x (i) Add 4 to each side t o obtain 7 f z 12. 6 (ii) Add -3 to each side t o obtain -4 5 x 4 -1. to obtain -3 (iii) Divide each side by 3 (or: multiply by 5) x 4. (iv) Divide each side by -2 (or: multiply by -+) and reverse the inequalities to obtain -2 (i) < Add 5 to each side to obtain: Divide each side by 2: 4 < 2 < 6 (ii) Add -3 t o each side to obtain: -10 f -2x 6 2. Divide each side by -2 and reverse inequalities: ABSOLUTE VALUE 21.9. Evaluate: (i) 13 - 51, (ii) 1-3 +5 21.10. Evaluate: (i) = 121 = 2 + (iii) 1-3 - 6 x 5 5. (iv) 13 - 71 51, - 1-51. = 1-61+l21 + 1-1 - 51 - 1-81 13 - 61 - 1-2 + 41 - 1-2 - 31 (iv) = 6+2 = 8 = 1-31 - 1-31 = 3 - 3 = 0 = 4 1-61 - 1-81 = 4 6 (iv) 13- 61 - 1-2+4/ - 1-2 - 31 = 3 - 2 - 5 = -4 (ij) -1 (iii) 4 (i) 12 - 81 '3 - 11 (ii) 12 - 51 - 14 - 71 12-81+'3-11 - 2% (iii) 1-3 - 51 = 1-81 = 8 (iv) 13 - 71 - 1-51 = 1-41 - 1-51 = 4 - 5 = -1 (i) 13 - 51 = '-21 = 2 (ii) 1-3 + 51, 3. < 12 8 22 x + 3 A 5. < 2x - 5 < 7, (ii) -7 21.8. Solve each inequality: (i) 3 f 12-51 - 14-71 (iii) 4 + + + 1-1 - 51 - 1-81 - 8 = 2 21.11. Rewrite without the absolute value sign: (i) 1x1 6 3, (ii) (x-21 < 5 , (iii) (2x-31 5 7. (i) -3 5 x 5 3 or (ii) - 5 < 2 - 2 < 5 (iii) -7 f 22 - 3 5 7 or -3<x<7 -4 5 22 5 10 or -2 21.12. Rewrite using a n absolute value sign: (i) -2 f x L 5 x 5 6 , (ii) 4 < x < 10. F i r s t rewrite t h e inequality so t h a t a number and its negative appear at the ends of the inequality. (i) Add -2 to each side to obtain -4 (ii) Add -7 to each side to obtain -3 6 x -2 44 which is equivalent to IX -21 6 < x - 7 < 3 which is equivalent to I$-- 71 < 4. 3. PROOFS 21.13. Prove: If a < b and b < c, then a < c. By definition, a < b and b < c means b - a and c - b a r e positive. By [P2],the sum positive numbers ( b - a) + (c - b ) = c - a is positive. Thus, by definition, a < c. of two CHAP. 211 INEQUALITIES < b, then a + c < b + c. (ii) If a < b and c is positive, then ac < be. (iii) If a < b and c is negative, then ac > be. 21.14. Prove Theorem 21.1: (i) By definition, a 263 If a (i) <b means b - a is positive. B u t ( b + ~ ) - ( a + ~=) b - a - Hence ( b + c) - (a + c ) < is positive and so a + c b + c. (ii) By definition, a < b means b - a is positive. But c is also positive; hence by [P2]the product c ( b - a ) = b c - ac is positive. Accordingly, a c < bc. (iii) By definition, a < b means b - a is positive. By [P,], if c is negative then -c is positive; hence by [P2] t h e product ( b - a)(-c) = a c - bc is also positive. Thus, by definition, bc < ac or, equivalently, ac > bc. + bl 21.15. Prove Theorem 21.2(iv): la Method 1. Since la[ = ? a , --/a1 f a 5 L la\; also, -(l al + la+ bl Therefore la/ -161 bl) 6 + JbJ. f 'a: f lb[. +b 6 + Ibl I = a f b Then, adding, + Ib/ la1 + 161 la1 Method 2. Now ab f d 21.16. Prove: implies 2ab 6 2 (a1lbl, and so + 2ab + b2 + 2 la( lb( + b2 = la12 + 2 lb( + [bl2 = (la1 + lW m = la + bl and so, by the square root of the above, 10 + bl 6 la1 4-lbl. (a+ b)i = But lab1 = la1 Ibl a? 5 (a- b / la1 a2 + Ibl. + Using the result of t h e preceding problem, we have la- bl = ( a (--b)\ 21.17. Prove: la - c J 6 ]a- bJ 3 is a la1 + 1-41= la/ + \ b / . + ] b- el. l~-~c/ 21.18. Prove that 6 = !(a-b) + (b-C)[ f la-bl + /b-c/ positive number. By [P,], either or 8 i s positive. Suppose -3 is positive and so, by [P2],the sum = -1 i s also positive. But by Example 1.1, the number 1 is positive and not -1. T h u s we have a contradiction, and so & is positive. -4 (-4)+ (-4) 21.19. Prove that the product a . b of a positive number a and a negative number b is negative. If b is negative then, by [PI],-b is positive; hence by [P2]the product a (-b) is also positive. B u t a (-b) = - ( a b). Thus -(a * b ) is positive and so, by [P,], a * b is negative. - 21.20. Prove that the product a - b of negative numbers a and b is positive. (-a) If a and b a r e negative then, by [PI], --a and -b are positive. Hence by [P2],the product B u t a . b = (-a) (-b), a nd so a *b is positive. * (-b) is positive. - 264 INEQUALITIES 21.21. Prove: If a and b are positive, then a iff a2 < b2. < bz; hence <b a and b a r e positive, a2 < ab and ab < b. Since Suppose a [CHAP. 21 + a2 < bz, On the other hand, suppose a2 < b2. Then b 2 - a2 = ( b a ) ( b - a ) is positive. Since a and b a r e positive, the sum b a is positive; hence b - a is positive or else the product ( b a ) ( b- a) would be negative. Thus, by definition, a < b. + + 21.22. Prove that the sum of a positive number a and its reciprocal l l a is greater than or equal t o 2: if a > 0, then a l l a 2 2. + If a = 1, then lla = 1 and so a + lla = 1 + l = 2. a - 1 # 0 and so (u-1)* > 0 or a2- 2 a + 1 > On the other hand, if or 0 az+ 1 a # 1, then > 2a Since a is positive, we can divide both sides of the inequality by a to obtain a+ lla > 2. Supplementary Problems INEQUALITIES 21.23. Write each statement in notational form: (i) x is not greater t h an y. (iii) m is not less tha n n. (ii) Y is greater t h an or equal t o t. 21.24. (iv) u is less tha n w. Using inequality notation, rewrite each geometric relationship between the given real numbers: (i) x lies to the right of y. (iii) x lies between 4 and - 6 . (ii) r lies to the left of t . 21.25. (iv) y lies between -2 and -5. Describe and diagram t h e following intervals: (i) -1 <x 6 (ii) 1 3, + f x 6 4, f x < 0, (iv) x f 1. > 4 x + 5. 21.26. Solve: (i) 6% 5 21.27. Solve: (i) 5 - 2 21.28. 2-1 Solve: - 1 5 -1 3 2 2 ' 21.29. Solve, i.e. rewrite so t h a t x is alone between the inequality signs: < 2x -6, (ii) 2x - 3 (ii) 3% +2 2 62+ 11. + (i) -2 21.30. 6 2x -7, (iii) -2 Solve: f x -3 (i) 4 6 f < x +2 < 4, (ii) -5 -2x f 10, (ii) -1 1, (iii) -12 < 2 s - 3 < 5, < 42 < (iii) -3 ABSOLUTE VALUES 21.31. Evaluate: (i, 14 - 71, (ii) 17 - 41, (iii) 14 f 71, (iv) 1-4 - 71. 21.33. Evaluate: (i) I 1-31 - 1-91 I, (ii) ! 12 - 61 - 11 - 91 I. -8. 5 - 2x 6 7. CHAP. 211 INEQUALITIES 265 21.34. Rewrite without the absolute value sign: (i) 1x1 1% 8, (ii) 5 - 31 + < 8, (iii) 12x 41 8, (iv) 12-2x1 < 8. 21.35. Rewrite using the absolute value sign: <x< (i) -3 8, (iii) -7 < < b and c is positive, then cla x 9, (ii) 2 < x (iv) -1 -1, f 2x+3 5. f THEOREMS ON INEQUALITIES 21.36. Prove: If 21.37. Prove: If a a <b and c < d, then a > c/b. + c < b +d. 6 ( a+ b ) / 2 . 21.38. Prove: If a and b are positive, then 21.39. Prove: If 21.40. Prove: For any real numbers a, b and c, (i) 2ab a< b where d Q, f 6 , c and d are positive, then a + c b S d ~ f a2 + b2 < c* d and (ii) ab THEOREMS ON ABSOLUTE VALUES 21.41. Prove: (i) (--a1 = 1a1, (ii) u2 = Ia12, (iii) 'a] = @, (iv) 1x1 < a iff -a 21.42. + ac + be f < x < a. Prove Theorem 21,2(iii): lab/ = /a] lbl. 21.43. Prove Theorem 21.2(v): ]a1- /bl 21.44. Prove: la' - lbl 5 la + bl. ja- bl. 6 Answers to Supplementary Problems 21.23. (i) x 4 y, 21.24. (i) x > y or (ii) r y 21.25. (i) (i) x f 21.27. (i) x > 4, -6 < x < 4, (iv) -5 < y < -2 (iii) (ii) x 4 2 0 (ii) .T f x 6 c I -4 6 -3 f 21.31. (i) 3, (ii) 3, (i) -2, 21.33. (i) 6, (i) -8 x f -2, < x < -1, (ii) 1 < x < -2 (5)-3 < x < 4, (iii) -1 f x f 4 (iii) 11, (iv) 11 (ii) 7, (iii) 6, (iv) 3 (ii) 4 f 21.35. (i) 1%-31 x f < 8, (ii) -5 < x < 11, (iii) -6 6, (ii) 'x-51 f 3, L x 6 (iii) ' ~ $ 4 1< 3, 2, (iv) -3 <x <5 (iv) 12%+1l f 0 -2 < -4 (ii) -7 7, 21.30. (i) -5 21.34. < w. (iv) u 7 21.29. (i) 1 21.32. 4: n, < x, (ii) T < t, (iii) -- , -2 -3, 21.26. 2 t , (iii) m r -4 21.28. x 2 3 1 2 I 4 + + c2. u2 b2 Chapter 22 Points, lines and Hyperplanes We assume the reader is familiar with the Cartesian plane, denoted by R2, as shown on the right. Here each point P represents an ordered pair of real numbers (a,b ) and vice versa; the vertical line through the point P meets the horizontal axis (x axis) a t a, and the horizontal line through P meets the vertical axis (y axis) a t b. Thus in discussing the Cartesian plane R2 we use the words point and b *P x axis a DISTANCE BETWEEN POINTS The distance d between two points 11 = (x,,y,) and q = (x2,y2) is given by d = d(., - x,)2 + (y, - y,)2 For example, the distance between p = (3, -1) and q = (7,2) is d = d(7-3)' 3. (2 -(-l))' = d16 +9 = 5 We shall denote the distance between p and q by d ( p , q ) . INCLINATION AND SLOPE OF A LINE The inclination B of a line I is the angle that I makes with the horizontal; that is, 6 = 0" if 1 is horizontal or B is the smallest positive angle measured counterclockwise from the positive end of the x axis to the line 1. The range of 0 is given by 0" 4 8 < 180". The slope 172 of a line is defined as the tangent of its angle t? of inclination: 7n = tan8. In particular, the slope of the line passing through two points p = (x,,y,) and q = (x2,y,) is given by !I* - 1-11 tv = t a n 0 = ~ x, - x, The slope of a vertical line, i.e. when x2=x1, is not defined. Two distinct lines I, and I , are parallel if and only if their slopes, say nz, and m2 respectively, are equal: m, = nz,. On the other hand, the lines are perpendicular if and only if the slope of one is the negative reciprocal of the other: m, = -Urn2 or 7n,?n, = -1. 266 CHAP. 221 267 POINTS, LINES A N D HYPERPLANES LINES AND LIKEAR EQUATIONS Every line 1 in the Cartesian plane R' can be represented by a linear equation of the form + by = c , where u and b a r e not both zero (1) i.e. each point on 1 is a solution to ( 1 ) and each solution to ( 1 ) is a point on 1. Conversely, every equation of the form ( 1 ) represents a line in the plane. Thus, in discussing the Cartesian plane, we sometimes use the terms line and linear equation synonymously. (See Chapter 11.) a.15 HORIZONTAL AND VERTICAL LINES The equation of a horizontal line. i.e. a line parallel to the n: axis, is of the form y = 1; where k is the point a t which the line intersects the y axis. In particular, the equation of the :I' axis is y = 0. x=a The equation of a certical line, i.e. a line parallel to the y axis, is of the form y=b r = k a where lc is the point a t which the line intersects the 3' axis. In particular, the equation of the y axis is z'= 0. - POINT-SLOPE FORRl A line is completely determined once its direction and a point on the line are known. The equation of the line having slope 02 and passing through the point (xl,yl) is y - y1 = m(.u - X I ) Example 1.1: Example 1.2: F i n d th e equation of the line h av i n g slope 3 an d passing t h r o u g h t h e point ( 5 , -21. Substitute in t h e above f o r m u l a to obtain y 2 = 31, - 5) or y = 3% - 1 7 . + F in d t h e equation of t h e line passing t h r o u g h the two points (3,2) a n d 15,-6). Yz-YI -G-2 112 = ____ - -4. T h u s t h e equation xq-xl 5-3 of the line is y - 2 = -4( 7. - 3 ) or 4 2 y = 1-1. T h e slope m of t h e line is + SLOPE-INTERCEPT FORM The equation of the line having slope vz and intersecting the y axis at k , i.e. having ?/-intercept k , is y = 023' + k Thus if the equation of a line 1 is a? - by 2 / = = c, where b f 0, then a --.r 11 and so m = - a / b Example 2.1: Example 2.2: + C - 13 is the slope of 1. The slope of t h e line I! = -2.). F i n d th e slope 7n of t h e line = 2,~X - - 33. , hence ) ) I = 3. 1/ +8 is -2. 2.x - 3y = 5. Rewrite t h e equation in t h e f o r m 268 POINTS, L I N E S AND H Y P E R P L A N E S [CHAP. 22 PARALLEL LINES. DISTANCE BETWEEN A POINT AND A LINE Let I be the line ax by = c. The set of all lines parallel to I is represented by the equation + ax + by = h. (2) i.e. every value for k determines a line parallel to 1 and every line parallel to l is of the above form f o r some k . We call k a p a m m e t e r , and the set (2) a one parameter family of lines. The distance d between a point 11 = ( x ~ , y ~ ) and the line 1 is given by d = lax, + byl c1 VGTi7 I n particular, the distance Example 3.1: - between the origin (0,O) and the line Z is Find the line I parallel to t h e line 2r - 51/ = 7 and passing through the point ( 3 , 4 ) . The line 1 belongs to the family 2x Iz 5y = - Since the point 1 3 , l ) belongs t o 1 it must satisfy the equation of I: 2.3 - 5.4 = k k = -14 or Thus t h e equation of I is 2%-55y = -14. Example 3.2: Let 1 be the line 3.c d - 4y = 2. The distance d between the point (2, - G ) and 1 is = 13.2+4*(--6)-21 The distance p between t h e origin and 1 is EUCLIDEAN m-SPACE Let ic and v be vectors with 112 - JIm p 1-201 5 = 121 - 2 = ___ - 5' l/m components of real numbers: u = (ul,u,, . . . ,uIn) and v = (vl,v2, . . ., v,) Recall (see Chapter 9) the following operations of vector addition, scalar multiplication, and dot product : u + v = ( 2 1 , 4G,, 26, + v*, . . ., un, + v,J ku U' v = (k'u,, ku2, . . . , ku,), = 21,2', + 1L2V2 + .- + * f o r any scalar k Ui,,7jli, The set of all such vectors with the above operations is called Euclidean m-space o r m dimensional Euclidean space o r , simply, m-space, and will be denoted by R". The distance between the above vectors u and v, written d(u,v), is defined by CHAP. 221 POINTS, LINES AND HYPERPLANES The norm (or: l e n g t h of the vector ilull = 269 written ]lull, is defined by ZL, -\/... = dzq + u;+ . + Uf, * * Observe that d(u,v) = Iju - ~111. The vectors u and 1: are said to be orthogonal (or: perpendicular) if their dot product is zero: u-v= 0. The set of vectors of the form 10, . . . , 0, Z L ~ ,0, . . ., 0) is called the xi axis. Note that the zero vector, also called the origin, belongs to every axis. Example 4.1 : Consider the following vectors in R4: u = (1, - 2 , 4 , 1 ) and ZL +v + = (1 3 , -2 - 1, -1 - 5 , 1+ 0 ) 7u = U.W = (3,1, -5,O). Then: (4, -1, -1, 1) (7, -14, 28, 7 ) = 1.3 + + 4.(-5) + 1 . 0 = 3 + (1+ 2 ) 2 + (-5 4)2 + (0 - 1 ) 2 (-2)*1 diu,w) = d ( 3- 1 ) 2 Example 4.2: = 2) - We frequently identify a point in, say, R2 with t h e arrow from the origin to the given point. Then the arrow corresponding to the sum p q is t h e diagonal of the parallelogram determined by the arrows corresponding to the points p and q as shown on the right. Furthermore, llpll, the norm of p, is precisely the length of the a r row to p . - 2 - 20 = +0 = -19 a + The sum p +$+ BOUNDED SETS A subset A of Rm is bounded iff there exists a real number m > 0 such that llull < nz for every point u E A . In the case of the plane R2, this says that the points of A lie inside of the circle of radius rn centered a t the origin, as indicated on the right. - + q. -m A is bounded. HYPERPLANES Consider a linear equation in the m unknowns x,, . . ., xm: CIXl + C2X, + * * * + CmXm = b (3) Using the notation of the dot product of RI'~,the above equation can be written in the f o r m C'X = b where c = (c1,c2, . . .,cJ and x = (x1,z2,. . .,x,). that c # 0, i.e. that the ci in ( 3 ) are not all zero. Definition: Unless otherwise stated, we assume The set of all points in Rm which satisfy the equation c x = b, i.e. the set of all solutions of the linear equation (3), is called a hyperplane. 270 POINTS, L I N E S AND H Y P E R P L A K E S Example 5.1: The hyperplanes i n the Cartesian plane R’ a r e the s t r a i g h t lines a s discussed a t the beginning of this chapter. Example 5.2: The hyperplanes in 3-space RJ, i.e. in solid geometry, correspond t o the planes a s illustrated on the right. I n particular, the equation of t h e x y p l a n e is z = 0, the ?z-plane is y=O, and the ?/:-plane is x = 0. [CHAP. 22 Y Hyperplane in R3. Observe that the “dimension” of a hyperplane in R’ is one, and in R3 is two; that is, we can identify the points of a hyperplane in R2 with the points on the real line R and we can identify the points on a hyperplane of R3 with the points in R2 with the further property that distances between corresponding points are equal. In each case the dimension of the hyperplane is one less than the dimension of the Ivhole space. In fact, this property completely characterizes hyperplanes in general; namely, Theorem 22.1: A subset A of R”‘ is a hyperplane if and only if there exists a one-to-one correspondence between A and Ril’-i such that distances between corresponding points are equal. The proof of this theorem is beyond the scope of this text. PARALLEL HYPERPLANES. DISTANCE BETWEEN A POINT AND A HYPERPLANE Let N be the hyperplane c , ~ , C J , t . - . cliplII = b . Then + + el,?-, + C _J , _ + . . + ClnXnl * = k is the equation of the family of all hyperplanes parallel to a, i.e. which do not have a point in common with a. The distance d between the point 71. = (ul,u2,. . ., un,) and N is given by 1 c , l ~ lt c,u, . . . e,,LzL;,,- 2, 1 + + a = dcf+ c; + * * * + c;l I n other words, the distance between u and the hyperplane c * R: = b is d = 1c-u - bl llcll I n particular, the distance between the origin, i.e. the zero vector, and p P Example 6.1: = Ibl dcf+ c; + - - - + C t - ~r is -P I ICIl I n solid geometry, i.e. R3, t h e family of hyperplanes parallel t o the r p p l a n e , i.e. t o z = O , is represented by x = k where k is the point at which the hyperplane intersects the i axis. ( S e e d i a g r a m be1ow.r Y Two members of the family z = k. 271 POINTS, LINES A N D HYPERPLANES CHAP. 221 Example 6.2: Let iy be the hyperplane 3 2 point (1,-4,-2,3) and a is d - 4y + x - 2w 1 7 in R4. The distance d between the + 1*(-2) - 2 ' 3 + (-4)2 + 1 2 + (-2)Z i 3 . 1 - 4.(-4) = d3' The distance p between and the origin is a p -7 - 4 ~ fi 171 = d32 + (-4)' + 1 2 + (-2)2 7 - - fi' Solved Problems DISTANCE I N THE PLANE 22.1. Find the distance d between (i) ( - 3 , 5 ) and (6,4), (ii) ( 5 , -2) and (-1, -4). d = In each case use the formula (i) + (4-5)2 = d ( - l - 5 ) * + (-4+2)2 d = d ( 6 + 3)? (ii) d = f + ~ ( X ' - X ~ ) ~ (yz-yJ'. ? = = d m fi = fi 22.2. Show that the points p = (--1,4), q = ( 2 , l ) and r = (3,5) are the vertices of an isosceles triangle. Compute the distance between each pair of points: d(p,q) = d(2+ 1)' + (1-4)2 d(q,r) = d ( 3 - 2)' + ( 5 - 1)2 = &m = fi = d r n = m Since d ( p , T ) = d ( q , T ) , t h e triangle is isosceles. 22.3. Show that the points p = (4,-2), q = (-1,3) and r = (1,4) are the vertices of a right triangle, and find its area. Compute the distance between each pair of points: Since d ( q , T ) Z + d(p, + (3+2)2 d(p,q) = l/-l- 4)' d(p,r) = d(q,r) = d(1+1)"(4-3)2 ~ ) 2= d(1- 4 ) 2 + (4 + 2)2 d ( p , q)2, = d r n dw-5 = m = rn = = 6 = the triangle is a right triangle. The area of the right triangle is A = +(a)(fi) = 2. 22.4. Find the point equidistant from the points a = (-4,3), b = ( 5 , 6 ) and c = (4,-1). Since d ( p , a) = d ( p , b ) , Let p = (x,y) be the required point. Then d ( p , a ) = d ( p , 6 ) = d ( p , c). d(z+ 4)2 + (y-3)Z = d ( z - 5 ) 2 + (y- 6)2 or 3z +y = 6 Since d ( p , a ) = d ( p , c ) , d ( z +4)2 + (y-3)2 = d ( x - 4)2 + (y + 1 ) 2 or 2%- y = -1 Solve the two linear equations simultaneously to obtain x = 1 and y = 3. required point. Thus p = (1,3) is the 2'72 POINTS, LIN ES AND H Y P E R P L A N E S Show that the points p = (1,-2), q = (3,2) and the same straight line. 1' [CHAP. 22 = (6,8) are collinear, i.e. lie on Method 1. Compute the distance between each p'air of points: d ( p , q ) = d ( 3 - 1)Z + (2+2)2 = lLi-TT6 = d(6-1)2 + (8+2)2 = d(p,r) Since d ( p , T ) = d ( p , q ) + d(y, T), d m = &G = 2 6 = = 5 6 the points lie on the same straight line. Method 2. Compute the slope m, of the line passing through p and y, and the slope m2 of the line passing through p and r: m2 = 8 - (-2) - lo = 2 ml = 2 - (-2) = 4 = 2, - 5 2 6 - 1 3-1 ~ ~ Since m, = m 2 , the three points lie on the same line. LINES 22.6. Plot the following lines: (i) x = -4, x = -1, x = 3 and x = 5; (i) (ii) y = 4, y = 1, y = -2, and y = -3. Each line is vertical and meets the x axis at its respective value of x. See diagram below. (ii) Each line is horizontal and meets the ~jaxis at its respective value of y. See diagram below. y=4 1 5 y=l , I I -5 t * 5 y=-2 y=-3 r t -5 22.7. Find the equation of the line (i) parallel to the x axis and passing through the point (-2, - 5 ) , (ii) parallel to the y axis and passing through the point (3,4). (i) If the line is parallel t o the z axis, then i t is horizontal and its equation is of the form y = k. Since i t passes through the point (-2,-5), k must equal the y-component of the point. Thus the required equation is y = - 5 . fii) If the line is parallel to the u axis, then i t is vertical and its equation is of the form 2 = k. Since it passes through the point (3,4), k must equal the 2-component of the point. Thus the required equation is r = 3 . 22.8. Plot the following lines: (ij 2%- 3y = 6, (ii) 4x t 3y = 6, (iii) 3x + 2y = 0. In each case find at least two points on the line, preferably the intercepts, and as a check a third point. The line drawn through these points is the required line. 273 POINTS, LINES A N D HYPERPLANES CHAP. 221 -- 5 22.9. Find the equation of the line (i) passing through (3,2) with slope -1, (ii) passing through (-4,1) with slope 4, (iii) passing through (-2, -5) with slope 2. In each case use the formula y point. (i) y - 2 = -l(.z-3) 1= i(xt4) (ii) y - (iii) y +5 = 2(2 + 2) - y1 = or x +y or x - or 22 - y = 1 m(.r - xl) where m is the slope and (xl, yl) the given = 5 2y = -6 22.10. Find the equation of the line passing through each pair of points: (i) (1,3)and (4, -5); (ii) (-2, -6) and ( 3 , l ) . Y2 - Y1 I n each case, first find the slope nt of the line using the formula m = ___ and then 2 2 - $1 find the equation of the line using the point-slope formula. 8 (i) m = - --5-- -3 The equation of the line is Y - 3 = --(X - 1) or 8~ 376 = 17. 4-1 3' 3 11-6-7 7 - (ii) m = 3 f 2 5 ' The equation of the line is y - 1 = -5( x - 3 ) o r 72 - 5 y = 16. + 22.11. Find the equation of the line passing through each point and parallel to the given line: (i) (3, -2) and 5% 4y = -5; (ii) (1,6) and 3x - 7y = 4. + (i) + The line belongs t o the family 6n: 4y = k. Substitute (3,-2) into the equation to obtain 5 . 3 i 4 * (-2) = k or k = 7. Thus the required line is 52 41/ = 7. + (ii) The line belongs t o the family 32 - 71/ = k . Substitute (1,6) into the equation to obtain Thus the required line is 32 - 7y = -39. 3 * 1 - 7 * 6 = k or k = -39. 22.12. Let ax + by = c be the equation of a line 1. Show that bx - ay = k (4 is the equation of the family of lines perpendicular t o 1. If b = 0, then 1 is vertical and ( 1 ) is the equation of the family of horizontal lines; hence ( 1 ) is the family of lines perpendicular to I . 274 [CHAP. 22 POINTS, LINES AND H Y P E R P L A N E S On the other hand, if a = 0 then 1 is horizontal and ( 1 ) is the equation of the family of vertical lines and so is the family of lines perpendicular to 1. Lastly, suppose b # 0 and a # 0. Then the slope of 2 is - a / b and ( 1 ) is the equation of the family of lines with slope b l a . But ( - a l b ) ( b / a ) = -1; hence, again, ( 1 ) is the family of lines perpendicular to 1. 22.13. Find the equation of the line passing through the given point and perpendicular to the given line: (i) (2,5) and 32 4y = 9; (5) (3, -1) and 2x - 49 = 7. + The line belongs to the family 4r -3y = k. Substitute ( 2 , 5 ) into the equation t o obtain 8 - 15 = k or k = -7. Thus the required line is 45.- 3 y = -7. ii) (ii) The line belongs to the family 4x - 2y = k. Substitute (3, -1) into the equation t o obtain 12 - 2 = k or k = 10. Thus the required line is 4r 2 y = 10 o r 2 2 + y = 5 . + 22.14. Determine the distance d of each line from the origin: (i) 3%- 4 = ~ 8, (ii) 5 x + 12y = -4. Use the formula d = lei (i) d = d r n ' _- 181 (rn (ii) d = 5' 4 - 13' 1-41 @Ti5 22.15. Determine the distance d between each point and line: (i) (5,3) and 5x-12y = 6; (ii) (-3,-2) and 4 x - 5y = -8. lax1 Use the formula d = + by1 - C I &Fiz . EUCLIDEAN m-SPACE 22.16. Find the distance d(u, v) and the dot product u * v for each pair of vectors: (i) u = (3,-2,0,5) and 2) = (6,2,-3,-4); (ii) u = ( 2 , 3 , - 5 , - 7 , l ) and v = (5,3,-2,-4,-1); (iii) ZL = (4,-1,6,5) and v = (1,-4,2,0,7). Use the formulas d ( u , v) = d ( v l - u1)2+ (iii) The distance d ( u , w ) and the dot product different spaces: u E R4 and v E R5. 22.17. Find the norm of each vector: In each case use t h e formula ZL * + (wnL- U"U + and U* w = UlW, + * * . + umvm. a re not defined since the vectors belong to = (2,-1,3,5); v = (3,-2,-1). dut + ui + . . . + u$ ~ l u = ~ 22.18. Let u = (1,-3, 2, 4) and v = (3, 5, -1, -2). (i) u u,,,)2 Find: v ; (ii) 5u; (iii) 2u - 371; (iv) 'urn v ; (v) ~ l u land ~ 11v11. POINTS, LINES AND H Y P E R P L A K E S CHAP. 221 273 (i) u + v = ( l + 3 , - 3 + 5 , 2 - 1 , 4 - 2 ) = ( 4 , 2 , 1 , 2 ) (ii) 5u = (5 1, 5 * (-3), 5 2, 5 . 4 ) = (5, -15, 10, 20) (iii) 2 u - 3v = (2, -6, 4, 8) (-9, -15, 3 , 6 ) = (-7, -21, 7 , 14) (iv) U . L ' = 3 - 15 - 2 - 8 = -22 - (v) 1 1 ~ 1= - + = d1+9+4+16 m. ,Jvll = d 9 + 2 5 + 1 + 4 = a. 22.19. For each pair of vectors, determine li so that the vectors are orthogonal: (i) (3, -2, k , 5 ) and (2k, 5, -3, -1); (ii) ( 2 , 3 k , -4,1,5) and (6, -1,3,7,2k). In each case, set the product of the vectors equal to zero and solve f o r k. + (-2) (i) 3 2k (ii) 2 . 6 A * 5 3k*(-1) -- +5 k (-3) (-1) = 0 + 7 + 5*2k = 0 - (-4).3 or 6k - 10 - 3k or 12 - 3k - 12 - 5 = 0 k =5 or + 7 + 10k = 0 or k = -1 22.20. Determine which of the following subsets of R" are bounded: (i) A consists of all probability vectors; (ii) B consists of the points on the x, axis; (iii) C is finite; (iv) D consists of all points with positive components. Recall t h a t X C R ~i s bounded if there exists a real number T ~ Z> 0 such t h a t IIu(, m f o r f every u E X . (i) A is bounded since IIuI/ 1 f o r every probability vector u . (ii) B is not bounded; f o r if rn is a n y positive real number, then ( m and has norm greater t h a n rn. f + 1, 0 , 0 , . . ., 0) belongs t o B (iii) C is bounded. Choose rn to be the maximum norm of the vectors in C; then rn satisfies the required property. (iv) D is not bounded. HYPERPLANES 22.21. Find the distance d of each hyperplane from the origin: (i) 2 ~ - 3 y + 5 x = 8; (ii) 2 ~ + 7 y - z + f z u = -4. bi where the hyperplane is c x = b . In each case use the formula d = - I (i) d = 181 d22 - 8 + (-3)Z + 52 6 . ~ ell iii) d = 1-41 d4+49+1+4 -~4 - 6' 22.22. Determine the distance d between each point and hyperplane: (i) (4,0,-2, -3) and x - 6.y 5 x 22.9 = 3; (ii) ( 2 , - 1 , 3 , 7 , -2, -4) and 3x, - 5 x , - 2x, 4x, X, - 3x, = -9. + + + In each case use the formula d = (i) d = '4 + 0 - 10 - 6 - 31 dl+36+25+4 - 15 -~ m' ~ * lt 1~ i + b ' where u i s the point and c * x = b the hyperplane. iii) cl = / 6 + 5 - 6 + 2 8 - 2 + 1 2 + 9 1 ~/9+25+4+16+1+9 - 52 a - 13 2 ' 22.23. Find the equation of the hyperplane passing through the given point and parallel to the given hyperplane: (i) (3,-2,1, -4) and 2 x + 5 y - 6 2 - 2 ~ : = 8; (ii) ( 1 , - 2 , 3 , 5 ) and 4 ~ - 5 y + 2 x + w = 11. The hyperplane belongs to the family 2 x f 5 y - 6 2 - 2 ~ = k . Substitute (3,-2,1,-4) into the equation to obtain 6 - 1 0 - 6 - 8 = k or k = -2. The required hyperplane is 2~ 5~ - 62 - 2 %=~ -2. (ii) The hyperplane belongs to the family 4 x - 5 y + 2 x + w = k . Substitute ( 1 , - 2 , 3 , 5 ) into the equation to obtain 4 + 1 0 + 6 + 5 = k o r k = 25. The required hyperplane is 4 ~ - 5 5 ~ 1 2 z + u= : 25. (i) + 276 POINTS, LINES AND H Y P E R P L A K E S [CHAP. 22 22.24. Show that the equation of the hyperplane in Rm which intersects the xi axis at aj f 0 is 2, 2, -+-+ . . . $X 2 1 1 a1 a2 am (Y + Let clxl i czxz+ c,x, = b be the equation of a. Substituting each x,-intercept (0,. . ., 0 , a,, 0 , . . ., 0) into the equation, we obtain the homogeneous system ale1 a2cz -b = 0 -b = 0 -b = 0 a,c, where the ci and b a r e the unknowns. The system is in echelon form and we can arbitrarily assign a value to b . Set b = 1; then c i = l / n i . Thus the equation of a is as claimed. 22.25. Find the equation of the hyperplane in R4 which intersects the axis at -2, 4,3 and -6, respectively. By the preceding problem, the equation is x1 -2 xq +3 += +4 3 -6 22 5 1 6x1 or - 35, - 4~~ 1 2x4 = -12 22.26. Find the equation of the hyperplane 01 in R4 which passes through the points (1,3,-4, a), (@,I,6, -3), (O,O, 2 , 5 ) and (O,O,3,7). Let a x + b y + c z + d w = e be the equation of 01. Substituting each point into the equation, we obtain the following homogeneous system of linear equation where a , b , c , d and e a r e the unknowns: a+3b-4c+2d-e = 0 a+3b-4c+2d-e = 0 I b+6c-3d-e 2c+5d-e 3c+7d-e = 0 = 0 = 0 Or I b+Gc-3d-e = 0 2c+5d-e = 0 d - e = 0 The system has been reduced to echelon form on the right. There is only one free variable, i.e. we can arbitrarily assign a value to e . Set e = 1. Then d = 1 by equation four, c = -2 by equation three, b = 1 6 by equation two, and a = - 4 1 by equation one. Thus the required equation is - 4 l ~ + l G y - 2 ~ +=~ 1. Remark: The hyperplane a is uniquely determined since there is only one free variable in the homogeneous system. I n other words, every value of e determines a different equation but all these equations a r e multiples of each other and so they determine the same hyperplane a. 22.27. Show that there is more than one hyperplane in R4 which passes through the points (1,6, -1,4), (@,I,2,2), (0,1,1,6) and (0,1,3,-2)- + Let ax b y + c z + d w = e be the general equation of the hyperplanes passing through the given points. Substitute each point into the general equation and then reduce the system of equations to echelon form as follows: ~ + 6 b - c+4d-e b+2c+2d-e 10 = 0 bS c + 6 d - e = 0 b+3c-2d-e = 0 a+Gb- or cf4d-e b+2c+2d-e = 0 = 0 C-4d 1 0 C-4d = 0 or a+6b- c+4d-e b+2c+2d-e C-4d = 0 = 0 = O The system has two free variables, e and d ; t h a t is, we can arbitrarily assign values t o both e and d and obtain a n equation of a hyperplane passing through the given points. Accordingly, all the equations will not be multiples of each other and so there is more than one hyperplane passing through the given points. CHAP. 221 POINTS, L I N E S A N D H Y P E R P L A N E S 277 22.28. Show that there is a t least one hyperplane in Rm passing through any m arbitrary points. + + + Substituting each of the points into the equation clxl c2x2 * * * z,c = b , we obtain a system of n z homogeneous linear equations in the m 1 unknowns, el, . . . , c, and b . Thus the system has a t least one free variable and so has a non-zero solution. The non-zero solution determines a hyperplane passing through the given points. + 22.29. Show that any r < m arbitrary points in Rttzdo not uniquely determine a hyperplane, i.e. that there is more than one hyperplane passing through the r points. + + + Substituting each of the r points into the equation clxl c2x2 c,x, = b, we obtain a system of r homogeneous equations in the m 1 unknowns, el, . . ., c, and b. Since r < m, the system has a t least two f r ee variables. Accordingly, all the solutions of the system will not be multiples of each other and so they determine more tha n one hyperplane. + 1 D 2 2 -1 3 2 * a * 3 -1 = 1(-2+2) - 2(4+3) + 3(4+3) = 0 - 14 + 21 = 7 2 Substitute in the second equation to obtain z = 2, and then substitute in the first equation t o find x = 1. Thus the hyperplanes intersect in the unique point (1, -3,2). 22.31. Determine whether the given hyperplanes in R3 intersect in a unique point: (i) x + 3 ~ - 2 x = 1, 2 x - y + x (ii) x + y + x = 1, 2 x + y - x = 2 and 4 x + 5 y - 3 x = 4; = 6 and 3 x - 3 y + x = -5. In each case, compute t h e determinant D of the matrix of coefficients: (i) D = 1 3 2 -1 4 5 -2 1 = l ( 3 - 5 ) - 3(-6-4) - 2(10+4) = -3 Since D = @ , the hyperplanes do not intersect in a unique point. 1 3 -3 11 Since D # @ ,the hyperplanes intersect in a unique point. -2 + 30 - 28 = 0 278 POINTS, LIN ES AND HYPERPLANES [CHAP. 22 Show that iii arbitrary hyperplanes u1 x = b,, ti, * x = b,, . . ., urn* x = bm in Rrn intersect i ; ~a unique point if and only if the ??z rectors P L ~ , U.~. ,.,urnare (linearly) independent. The system of linear equations in m unknowns has a unique solution if and only if the detemiinarit of the matrix A of coefficients is not zero. However, the determinant of A is not zero if and only if the rows of A ar e independent vectors. The result now follows from the fa c t that the rows of A a r e the vectors z i l . MISCELLANEOUS PROBLEMS 22.33. Prove the Cauchy-Schwarz Inequality: For any pair of vectors u = (ul, . . .,urn) and v = . . .,vJ in Rm, (til, c Iulvll m 11/ III * 6 6 1 ull 1 110 2=1 If 7~ = 0 o r w = 0, then the inequality reduces to 0 6 0 0 and is therefore true. Thus we need only consider the case in which z( # 0 and w # 0, i.e. in which I J Z L I I # 0 and IIwIJ # 0. Furthermore, f IU. + ... = IuIz.I w1 1ZtlnlJnll Iulwl( 6 + .'. + m ~z~l,lwm/ = (u,7J,l a=1 Thus we need only prove the second inequality. Now for any real numbers x , y E R. 0 f 2xy Set Y = ?(,I/ z ~ l l and y = 1~ / t (z - Y ) ~= r 2- 2ey f 22 + + y2 or, equivalently, (1) y2 in ( 1 ) to obtain, for any 1, But, b y definition of the norm of a vector, ((all = 2 a ; = 2 ( z L i i 2 and (IwI(= Thus summing ( 2 ) with respect t o i and using Iztiwil = izti' l w i l , we have 2 v? = 2 iv[l2. t h a t is, hIultiplyinp both sides by IIL' 1 w'l, we obtain the required inequality. 22.34. Prove Minkowski's Inequality: For any pair of vectors u = (ul, . . .,urn) and ZI = (vl, . . .,vrn) in R", I ' Z L + vll 6 I(u(I ~ ~ v ~ ~ . + If 7i + 2) +%I/ 1 = 0, t h e inequality clearly holds. Thus we need only consider the case in which # 0. Now 7(1 + v,I IIu 6 IU,~ + Iwl~ +w / p f o r a ny real numbers = I: + w , ) 2 f 2 I?(, Hence B IU, + w,/2 = 2 Ik + 4 1% + w,1 + Iv,l) = B I Z f , + vtl ILL,/ + B 1% + wtl Iw,l = (ZL, + w11 wul E R. (IZL,i But by the Cauchy-Schwarz Inequality (see preceding problem), ~ ~ u , + w , 1 I ~f, l I Z L + V I I llz~ll and ~ l z ~ , f wp,Il ~ (l~+vl\lI~11 CHAP. 221 POINTS, L I N E S AND H Y P E R P L A N E S 279 SuppIementary Problems DISTANCE BETWEEN POINTS 22.35. Compute the distance between each pair of points: (iii) (4,Oi and (-2, -71; (iv) (-3, -51 and 14, -11. (i) (5,-1) and ( 2 , G ) ; (ii) ( 2 , 2 ) and ( 5 , -5); 22.36. Determine k so t h a t the distance between (1,3) and (2k, 7) is 5. 22.37. Find the point equidistant from ( 1 , 2 ) , i - l , 4 ) and (5,3). 22.38. Determine li so t h a t the points (2, l ) ,(-2, k ) and 16,4) a r e collinear, i.e. belong to t h e same straight line. 22.39. Show t h a t the points 16,5), (2, -5) and (-1,2) a r e the vertices of a right triangle, and find its area. LINES 22.40. Plot the following lines: (i) x = -5, .t = -2, x = 1, x = 4; (ii) y = - 4 , y = -1, y = 2, ZJ = 6. 22.41. Find the equation of the line: (i) parallel to the J- axis and passing through the point (3, -4); (ii) parallel to the u axis and passing through the point (-2,l); (iii) horizontal and passing through the point 15, -7); (iv) vertical and passing through the point (6,-3). 22.42. Plot the following lines: (i) 3 2 - 5y = 15 and 2x (iii) 2 x + 5 y = 0 and 4 x - 3 y = 0. 22.43. Find the slope of each line: (v) ?J = 7 . 22.44. 22.45. Find the equation of (i) passing through (ii) passing through (iii) passing through (iv) passing through (i) y = -2x i 8; + 3y = 12; iii) 2 2 - 5 y + (ii) 42 - 3y = -6 = 14; and 5% 2 ~ 1= 8 ; (iv) x = -2; [iii) z = 3 y - 7 ; the line: ( 3,- 2) with slope &; (-5,O) and ( 3 , 5 ) ; (1, -3) and parallel to 2 s - 3y = 7; (2, -1) and perpendicular to 3 s 4y = 6. + Determine the distance of each line from the origin: (i) 4x 31/ = -8; (ii) 2x 5y = 3; (iii) 12s - 5y = 26. + + 22.46. Determine the distance between each point and line: (i) (2, -1) and 6s - 8y = 15; (ii) ( 3 , l ) and 3: - 3y = -6; + (iii) (-2,4) and 12% 5y = 3. EUCLIDEAN m-SPACE 22.47. Let u = (1,- 2 , 5 ) and v = (3,1, -2). Find: (i) u v ; (ii) 2 2 ~- 5v; (iii) 2 6 . v; (iv) I'dI; (v) d(u,w). + 22.48. Let u = ( 2 , 1 , - 3 , 0,4) and v = (5, -3, -1,2,7). Find: (i) u + v ; (ii) 32~-21;; (iii) u - v ; (iv) I;v:l; (v) d(u,v), i.e. IIu-dI. 22.49. F o r each pair of vectors determine the value of k so t h a t the vectors a r e orthogonal: (ii) ( 1 , 7 ,k + 2 , -2) and ( 3 , k , - 3 , k ) . (i) (5,k,-4,2) and (1, - 3,2,2k) ; 22.50. Determine the value of k so t h a t the distance between ( 2 , k , 1 , - 4 ) and ( 3 , - 1 , 6 , - 3 ) 22.51. Show t h a t a hyperplane is not bounded. 22.52. Show t h a t a set A is bounded if and only if there exists a real number -m* 6 u,5 177 for every component of any vector u E A . m* HYPERPLANES 22.53. Calculate the distance of each hyperplane from the origin, i.e. the zero vector: (i) 2x - y 32 4u: = 7; (ii) 4x - 12y - 32 = -10; (iii) 3% 2y 22 - w = -6. + + + is 6. >0 such t h a t 280 22.54. POINTS, LIN ES AND HYPERPLANES Calculate the distance between the point and the hyperplane: (i) 12, - 3 , l ) and 3 x + 4 y - l 2 x = 8; (ii) (5,-2,1,-3) and r and x + 8 y - 4 x = -7. 22.55. Find the equation o f the hyperplane in R3 which: (i) contains (2,-3,-5) and is parallel t o the xx plane; (ii) intersects t h e x, y and x axes at 2, -3 and 4 respectively; (iii) contains (1,-2,2), (0, 1 , 3 ) and (0,2, -1); (iv) contains (1,-5,4) and is parallel to 3%-7y-6x = 3. 22.56. Find the equation o f the hyperplane in R4 which: (i) contains the xl, x2 and x4 axes; (ii) contains (1,5,-4,3) and is parallel to xl-3x2-4x3-2x1 (iii) contains (1,1,1,l), ( O , l , -3, l), (O,O,1,4) and (O,O, 2, -1); (iv) intersects t h e axes at 3, -1,2, and G, respectively. 22.57. Determine whether t h e given hyperplanes in (i) 1 .v+3y+5x = 1 52 + y+ 12"- 22.58. x = 2; (ii) = 7 y-22 I 2$2y+x 32 - 2y [CHAP. 22 + 72/-5x+5w = 1; (iii) (4,1,-2) = -1; intersect in a unique point: 2-4y-2x = 1 = 1 R3 -x = 2; 52+2y+x = 4 (iii) 1 + + 22 2y \3x- y - x = 2. x = o Show t h a t the intersection of r < ? n hyperplanes in Rm is either empty or contains a n infinite number of points. Answers to Supplementary Problems 22.35. (i) m, (ii) m, (iii) m, (iv) a 22.36. k = 2 o r k = -1 22.37. (2.3, 5.3) 22.38. k = -2 22.39. 29 22.41. (i) y = -4, 22.43. (i) -2, 22.44. (i) 2 - 4 y 22.45. (i) 8/5, (ii) 3 / 6 9 , 22.46. (i) 22.47. 22.48. [ii) x = -2, (ii) 2/5, (iii) / n , (iv) x = 6 (iii) 1/3, (iv) not defined, = 11, (ii) 5x-8y 4, (ii) G = -7, = -25, (v) 0 (iii) 2%-33y = 11, (iv) 4 2 - 3 y (iii) 2 (iii) 7/13 + v = (4, -1,3); (ii) 2 u - 5w = (-13, -9,20); (iii) u v = -9; (iv) (i) u + w = (7, -2, -4,2,11); (ii) 3u - 2w = (-4,9, -7, -4, -2); (iii) u (i) u 2 a ; (v) d(2(,2)) = a 22.49. (i) k = 3; (ii) k = 3/2 22.50. k = 2 or k = -4 22.53. (i) 7/m, (ii) 10/13, (iii) 6/m= 2 / f i 22.54. (i) 2, 22.55. (i) y = -3; 22.56. (i) x3 = 0; (ii) x1 - 3x2 - 433 - 22, = -4; xq = 6 22.57. (i) No. (ii) 3, = 11 = 6; (v) d(u, w) = &i IIU/I w = 38; (iv) ;1w11 = fi = (iii) 3 (ii) 6 % - 4y f 3 x = 12; (iii) 13xl-4y (ii) No. (iii) Yes. +x = 7; (iv) 32 - 7y- 6x = 14 (iii) 20x1 - 23x2 - 52, - x4 = -9; (iv) 2x1 - 62, f 3x3 + Chapter 23 Convex Sets and linear Inequalities LINE SEGMENTS AND CONVEX SETS a2, . . .,am) and Q = (bl, b,, The line segment joining two points P = (ul, Rm, denoted by P&, is the set of points tP + (1- t)Q = (ta, + (1- t ) h , ta2 + (1- t ) h , , . ., ta,,, + (1- t ) b j , l ) , . . ., b,,) where 0 L t f in 1 A subset X of Rm is convex iff the line segment joining any two points P and Q in X is Q cX. contained in X : P,Q E X implies P Example 1.1. The rectangular and elliptical ar eas below ar e each convex since the line segment joining a ny two points P and Q, in either figure, is contained in the figure. On the other hand, the U-shaped a re a to the right below is not convex, since t h e line segment joining the points P and Q also has points lying outside of the figure. Example 1.2. Let X and Y be an y two convex sets in R7n. We show tha t the intersection X n Y is also convex. For let P and Q be any two points in X n Y ; then P , Q E X and P, Q E Y. Since X and Y a r e each convex, the line segment &P i s contained in both X and Y : P&cX and P&cY. Hence &P is contained in the intersection: &P c X n Y . Thus X n Y is convex. In fact, Theorem 23.1: The intersection of any number of convex sets is convex. We next state a theorem about line segments which will be used throughout. Theorem 23.2: Let f be a linear function defined over a line segment PQ. point A E P&, either (i) f ( P )6 f(A) 6 f ( Q ) In particular, if f ( P ) = f ( Q ) in P&. or (ii) f(Q) Then for every f(A) f(p) then f has the same value at every point In other words, the value of a linear function f at any point of a line segment lies between the values of f a t the end points. Remark: A function f defined over Rm is said t o be linear if it is of the form f = f(x1, X2) . . . ,Xm) 281 = ClXl + czxz + ' .. + Cn1Xm 282 CONVEX S E T S AND L I N E A R I N E Q U A L I T I E S LINEAR INEQUALITIES The linear equation a,x, + a,x,+ * * ;CHAP. 23 . + amxm = b partitions R" into 3 subsets: (i) those points which satisfy the given equation; (ii) those points which satisfy the inequality a,x, a,zz (iii) and those points which satisfy the inequality a,x, a,xz + - + amxnl < b ; + + - + a,llxnL> b. + The first set, as defined previously, is a hyperplane; the other two sets are called open half spaces. Accordingly, the solution set or, simply, solution (or: graph) of the linear inequality (or: a l x l ULT? . * + amxnl 2 b ) al x l + a2xz + - + a,,,~,,,6 Z, = b, called the boundconsists of the points of the hyperplane a,x, + a,x, + - . + a n l ~ m i n g hyperplane, and of one of the open half spaces; such a set is termed a closed half space or, simply, a hal f space. An important property of half spaces follows. - - + 9 Theorem 23.3: Half spaces are convex. I n particular, the solution of the linear inequality ax b y c (or: ax+ by 2 c ) is called a hal f plane. Geometrically speaking, the half plane consists of all of the points lying on the bounding line ax by = c and on one side of the line. + + 5 Example 2.1. -6, To d r a w the g r a p h of the inequality 2 r - 3 y first plot the line 2 x - 3 3 ~ = -6. Then choose a n arbit r a r y point t h a t doesn't belong to the line, say ( O , O ) , and test it to see if i t belongs t o t h e solution set. Since (0,O) does not satisfy the given inequality, the points on the other side of the line belong to the solution set as shown on the right. f Graph of 2%- 3y f -6 POLYHEDRAL CONVEX SETS AND EXTREME POINTS The intersection X of a finite number of (closed) half spaces in R" is called a polyhedral convex set. A point P E X is called an e x t r e m e point or c o m e r point of X if P is the intersection of 7n of the bounding hyperplanes determining X . Observe that Theorem 23.1 and Theorem 23.3 justify the use of the term convex; that is, half spaces are convex and so their intersection is also convex. Theorem 23.4: The solution of a system of linear inequalities a11n.1 a21r1 + + a12xz a2252 + + * . . -a1,,>:c,,1 6 bl -a2ii,:c,n 6 bz * ' ' ................................ is a polyhedral convex set. The theorem follows from the fact that the solution to each inequality is a half space and that the solution of the system is the intersection of the individual solutions. CHAP. 231 Remark: 283 CONVEX S E T S A N D L I N E A R I N E Q U A L I T I E S Since the sense of an inequality can be changed by multipIying che inequality by, say, -1, there is no loss in generality in assuming the inequalities in Theorem 23.4 are of the same given sense. Example 3.1. The bounded polyhedral convex set in the figure on the r i g h t is the solution of the following system of five linear inequalities: x + u + x f 2 2 - - y 2 0 l'x x ' 0 2 ' 0 The points A , B , C, D, E and 0 in the diagram a r e the corner points, i.e. extreme points of the solution; each point is the intersection of three of the bounding planes. For example, D is the intersection of the planes x y x = 2, x - y = 0 and x = 1; solving the three equations simultaneously. we obtain D = 1). Similarly we can obtain A = (2,0,0), B = (1,l , O ) , C = ( l , O , l ) , E = ( O , O , 1) and 0 = ( O , O , O ) . Y + + (a,+, Remark: Although there are (i) = 10 different ways to select 3 of the 5 given bounding planes in the above example, four of the ten sets of equations will not determine an extreme point. In general, a set of m bounding hyperplanes of a polyhedral convex set X in R" does not determine an extreme point of X if and only if one of the following holds: (i) its solution is empty, i.e. the equations are inconsistent; (ii) its solution is infinite, i.e. the equations are dependent; (iii) its solution is unique but does not belong to X . POLYGONAL CONVEX SETS AND CONVEX POLYGONS In the special case of the plane R*, a polyhedral convex set X is called a polugonal convex set and, if it is bounded, a convex polygon. Furthermore, a point P E X is a n extreme point of X iff it is the intersection of two of the bounding lines of X. z=1 -i = 4 y = -1 E x-y B = (3, -l), the solution of - -5 [CHAP. 23 CONVEX S E T S A N D L I N E A R I N E Q U A L I T I E S 284 Although there a r e (i)= 6 different ways to select 2 of the 4 bounding lines, E = (1,-3), p + 2 y = 4 t h e solution of F = (6, -1), t h e solution of lx = 1 , does not satisfy z -y = 4 , does not satisfy y = -1 Jx-y \y 5 L 4 -1 I n other words, E and F do not belong t o the convex polygon. Example 4.2. Consider the following f o u r systems of inequalities: x+2y f 4 x-y 5 0 y 22 -1 x t 2 y f x - y L y 4 x f 2 y 5 4 z+2y 0 2--y =' 0 x-y y 6 -1 -1 (ii) (i) y 4 g 0 f -1 (iv) (iii) + In each case the bounding lines of the given half planes are x 2y = 4, x - y = 0 and y = -1. are = 3 different ways to select 2 of the 3 bounding lines. Furthermore, (i) x-7J A = (-1, -l), is the solution of = 0 y = -1 x+21/ = 4 B = (6,-l), is the solution of 4 4 C = (3,s)' is the solution of -5 1 y = -1 -i x+2y = 4 x-y = 0 There CHAP. 231 CONVEX S E T S AND LINEAR INEQUALITIES I 285 / (iii) (iv) W e now discuss the solution of each system which is sketched above. Here the solutions a r e bounded and hence called a convex polygon. All three points A , B and C a r e its extreme points. Here the solutions a r e not bounded. Only the points A and C a r e its extreme points. x-y 6 0 , t h a t is, without the inequality Here the solutions a r e the same as the solutions to + x 2y 6 4. In this case we say t h a t the inequality x point A is the only extreme point. 1 + 2y --1 f 4 is redztndant (or: superfluous). The Here the solution is empty, and we say t h a t the system is inconsistent. There a r e no extreme points. LINEAR FUNCTIONS ON POLYHEDRAL CONVEX SETS We seek the maximum o r minimum value of a linear function f = + + ~ 1 x 1 ~ 2 x 2 * * + CmXm where the unknowns are subject to the following constraints: UllXl UZlXl + a12x2 + az2x2 + ar2x2 + + ’ ’ ‘ * * ’ + UlniXrn 6 + UZrnXrn I + amxm 6 ................................. U,lXl + * *. 61 ba b, That is, we seek the maximum or minimum value of a linear function over a polyhedral convex set. The following fundamental theorem applies: Theorem 23.5: Let f be a linear function defined over a bounded polyhedral set X in R”. Then f takes on its maximum (and minimum) value a t a n extreme point of x. Thus the solution of the above problem consists of two steps: (i) Find the extreme points of X . (ii) Evaluate the function f at each extreme point. The maximum of the values of f in step (ii) is the maximum value of f over the entire set and the minimum value of f in step (ii) is the minimum value of f over X . X, 286 CONVEX SETS AND LIN EA R IN EQ U A LITIES [CHAP. 23 Example 5.1. Find the maximum and minimum value of the function f = 2 r i 5y over the convex polygon X given by 2+2y L 4 2-y 5 4 x t l y 2 -1 The extreme points of X a r e as follows (see Example 4.1): A = (1,-1), B = (3, -l), C = (4,O) and D = (1,;) Evaluate f a t each of these points: f ( A ) = f(1,-1) = 2 . 1 - 5 . 1 = -3 j ( B ) = f(3,-1) = 2 . 3 - 5 . 1 = 1 f(C) = f(4,O) = 2 . 4 f ( D ) = f'(l,i) = 2 . 1 + 5.0 +5.; = 8 = 9.5 Thus the maximum value of f over the entire convex polygon X is 9.5 and occurs at D, and the minimum value i s -3 and occurs at A . We shall prove Theorem 23.5 f o r the case m = 2 as a solved problem (see Problem 23.17). However, it is instructive to show here how this result follows from geometric considerations in the case of the preceding example. We can consider the equation f = 2% 5y as a one parameter family of lines in the plane R2 (see adjoining diagram). We seek the maximum (or: minimum) value of f with the condition that the line f = 2 x + 5 y intersects the given convex polygon X . As indicated by the diagram, starting with lines of the family with large values of f , the family will first intersect X at the point D. Similarly, starting with lines having small values of f, the family will first intersect X at the point A . In other words, the linear function f = 2 x f 5 y will take on its maximum and minimum values over The family f = 2x + 5y and X a r e plotted. X at the points D and A, respectively. + Example 5.2. Find the maximum value of the function f = x + 2y 4-82 x+y+z x-y over the polyhedral convex set X given by 2 0 6 2 z " 1 2 2 0 2 ' 0 The extreme points of X are (see Example 3.1, Page 283). A = (2,0,0), B = (1,1,0), C = (1,0,1), D = ( + , + , I ) , E = ( O , O , l ) , 0 = (O,O,O) Evaluate f a t each of t h e six extreme points: f ( A ) = 2, f ( B ) = 3, f(C) = 9, f ( D ) = 9.5, f ( E ) = 8, f ( 0 ) = 0 Thus, by Theorem 23.5, t h e maximum value of f over the entire polyhedral convex set and is 9.5. X occurs a t Consider now a function f of the form f = + + ~1x1 ~ 2 x 2 * * * + CmXm +d over a polyhedral convex set X in Rm. The function f differs from the linear function D 287 CONVEX S E T S AND LINEAR I N E Q U A L I T I E S CHAP. 231 g = c1x1 + c2x2 + * * . + CmX,n a t every point of X by the fixed amount d ; hence f takes on its maximum (and minimum) value a t the same point as g. In other words, Theorem 23.6: A function of the form f = ClXl + CZXZ + * * * + C,nXm +d defined over a bounded polyhedral set X in Rm takes on its maximum (and minimum) value a t an extreme point of X. Example 5.3. Find the minimum value of the function f = 2z-4411+3z+2 over t h e bounded polyhedral convex set X of the preceding example. Evaluate f a t each of the six extreme points of X : + +0+2 +0+2 2+0 +3 2 f(A) = f(2,0,0) = 4 0 f(B) = f(l,l,O) = 2 - 4 f(C) = f ( l , O , 1) = T = 6 = 0 f ( D ) = f(+,+,1) = 1 - 2 f ( E ) = f ( O , O , 1) = 0 4 0 = 7 f(0) = f ( O , O , 0 ) = 0 +3+2 +3 +2 +0 +0 +2 = 4 = 5 = 2 Thus t h e minimum value of f over X occurs a t the point B and is 0. Remark: If a polyhedral convex set X is not bounded, then a linear function f may or may not take on a maximum (and minimum) value. However, the following theorem always applies (see Problem 23.9): Theorem 23.7: Let f be a linear function defined over a polyhedral convex set X. Then (i) f will take on a maximum (minimum) value over X if and only if the values of f are bounded above (below) and (ii) the maximum (minimum) value will always occur at an extreme point of X. Solved Problems LINEAR INEQUALITIES IN TWO UNKNOWNS 23.1. Graph each inequality: (i) 2x + y 4 -4, (ii) 4% - 3 y -6, (iii) 2x - 3y 6 0. In each case, first plot t h e line bounding the half plane by finding at least t w o points on t h e line; then choose a point which doesn't belong t o the line and test i t in t h e given inequality. (i) + On the line 22 y = -4: if R: = 0, I/ = -4: if y = O , z = - 2 . Draw t h e line through (0,-4) and ( - 2 , O ) . Test, say, the origin (0,O). Since (0,O) does not satisfy 21 y 5 -4, i.e. 0 =f -4, shade the side of the line which does not contain the origin. + (i) Graph of 2 x + y 6 -4. 288 CONVEX SETS AND LINEAR INEQUALITIES [CHAP. 23 (ii) On the line 42 - 3y = -6: if x = 0, y = 2; if y = 0, z = -1.5. Draw the line through (0,2) and (-1.5,O). Test the origin (0,O). Since (0,O) satisfies the given inequality 4x - 3y 2 -6, shade the side of the line containing the origin. (iii) On the line 22 - 3y = 0: if z = 0, y = 0; if z = 3, y = 2. Draw the line through (0,O) and ( 3 , 2 ) . Since the origin belongs t o the line, test the point (2,O). Since (2,O) does not satisfy the given inequality 22 - 3y 6 0, shade the other side of the line. -5 (ii) Graph of 4 2 - 3 y 5 23.2. Graph each inequality: (i) x (i) 1 (iii) Graph of 22 - 3y -6. 2, (ii) x 1, (iii) y 4 0. -1. First plot the line x = 2 , the vertical line meeting the x axis a t 2. Then shade the area to the left of the line since the x-coordinates of these points are less than 2. (ii) First plot the line z = 1, the vertical line meeting the x axis a t 1. Then shade the area to the right of this line since the x-coordinates of these points are greater than 1. (iii) First plot the line y = -1, the horizontal line meeting the y axis a t -1. Then shade the area above the line since the y-coordinate of these points are greater than -1. (i) Graph of 2 4 2. 23.3. Graph each inequality: (i) 1 (i) (ii) Graph of x x 4, (ii) -2 1. (iii) Graph of y 2 -1. 4 y 4 1. Recall that 1 5 x 6 4 means 1 fz and 3 5 4 . Plot the vertical lines s = l and x = 4 ; then shade the area between the two lines as shown in Fig. (i) below. (ii) Plot the horizontal lines y = -2 and y = 1; then shade the area between the lines as shown in Fig. 6.i) below. 289 CONVEX SETS AND LINEAR INEQUALITIES CHAP. 231 y = -2 -4 (i) Graph of 1 f z 5 (ii) Graph of -2 4. f y 5 1. 23.4. Plot the solution of each pair of inequalities: (i) 2%- 39 6 6 2x+y 6 4 (ii) 2x + 5y 10 X L l (iii) -1 x x-y 6 3 6 2 In each case, plot each half-plane with different slanted lines; the cross-hatched area is the required solution. 22 -3376 Graphsof 22-3y f 6 and 2 z + y f 4. Solution of 2z+y 5 6 f 4 (ii) Graphs of 2 2 + 5y 10 and z 2 1. Solution of { 22 + 5y f 2’1 10 (IJ 1 . 1 1 ~p ~ i u b cllci u - Y, vrlVllYl ...vvl---D ---- .. -. -. the left of the line since the 2-coordinates of these points a r e less than 2. L ~ L G YIIG I____ ~~ (ii) First plot the line x = 1, the vertical line meeting the s axis at 1. Then shade the area to the right of this line since the s-coordinates of these points a r e greater than 1. (iii) F i r s t plot the line y=-1, the horizontal line meeting the y axis at -1. Then shade the area above the line since the y-coordinate of these points a r e greater than -1. (i) Graph of x 6 2. 23.3. Graph each inequality: (i) (ii) Graph of z (i) 1 4 x 4 4, (ii) -2 5 (iii) Graph of y 1. 2 -1. 4 y 4 1. Recall t h a t 1 6 x 5 4 means 1 5 x and x r 4 . Plot the vertical lines x = l and x = 4 ; then shade the area between the two lines as shown in Fig. (i) below. (ii) Plot the horizontal lines y = -2 and y = 1; then shade the area between the lines a s shown in Fig. (ii) below. / / l / i l -5 Each half plane is plotted. The required solution. Observe that the solution of the system is not bounded. (ii) Each half plane is plotted. Here the solution of the system is bounded. The required solution. CHAP. 231 CONVEX S E T S A N D LIN EA R IN EQ U A LITIES 291 LINEAR FUNCTIONS OVER POLYHEDRAL CONVEX SETS 23.6. Find the maximum and minimum of each function over the convex polygon X shown in the diagram on the right: f = 2x +52J 8- A = (0,2) B = (4, -1) c = (5,l) D = (3,5) g = x-22J h = -3xf2J -2 -2 - Evaluate each function at each extreme, i.e. corner point: f(A) = t(0,2) = 0 f ( B ) = t(4,-1) + 10 = 10 = 8-5 = 3 +5 6 + 25 X is shaded. y ( A ) = 0 - 4 = -4 g(B) = 4 +2 h(A) = 0 +2 1 = -13 h ( B ) = -12 - +1= f ( C ) = t ( 5 , l i = 10 = 15 y(C) = 5 - 2 = 3 h ( C ) = -15 f ( D ) = t(3,5) = = 31 ~ ( 0=) 3 h ( 0 ) = -9 - = 2 = 6 10 = -7 Thus: maximum of f over X is 31 and occurs at the point D maximum of g over X is 6 and occurs a t the point B maximum of h over X is 2 and occurs a t the point A and minimum of i over X is 3 and occurs a t the point B minimum of g over X is -7 and occurs a t the point D minimum of lz over X is -14 and occurs a t the point C. +5 -14 = -4 over the convex polygon X of the 23.7. Find the maximum of the function f = 2 x - y preceding example. Evaluate t' a t each of t h e four extreme points of X : f ( A ) = i ( 0 , 2 ) = -2 f(C) = f ( 5 , l ) = 9 f ( B ) = f(4, -1) = 9 f ( D ) = f(3,5) = 1 Thus the maximum of f over X is 9 and occurs at two of the corner points, B = (4,-1) and C = ( 5 , l ) : hence by Theorem 23.2 it will occur at every point on the line segment joining B and C. The above can be interpreted geometrically: the member of the family of parallel lines determined by the equation f = 2%- y which has the maximum value of f and intersects X , passes through both B and C and so contains every point on the line segment joining B and C, as shown in the diagram. The diagram also indicates t h a t f takes on its minimum value at t h e point A . 23.8. Find the extreme points of the polygonal convex set X determined by the system 3,r+2y x-y G 6 0 x G 4 Method 1 (Algebraic). = 3 ways to select 2 of the 3 bounding lines: There a r e (i) 23.5. Plot the solution of each system of inequalities: (i) 3x + 2 y 6 (ii) ,T + 4y L 4 x-y 4 n: ‘1 -1 $ 5 2 y 1 0 (i) Each half plane is plotted. The required solution. Observe t h a t the solution of the system is not bounded. (ii) Each half plane is plotted. The required solution, Here the solution of the system is bounded. __.~.~ iiiinimum value of It over X. X and the family g X f 76. X and the family h = % - 276. 293 CONVEX S E T S A N D LINEAR INEQUALITIES CHAP. 231 23.10. Find the maximum and minimum of the function f = x - 2y 4 over the convex polygon X given by xsy 5 4 + Ic + 2y + -2 x-y 2 -2 x g 3 To obtain the extreme points of X, plot X as shown in the diagram. The points A , B . C and D a r e the corner points of X . The coordinates of each point can be obtained by finding the intersection of the appropriate pair of bounding lines: A = (-2,O) x + 2 y = -2 is the solution of x- x + 2 y = -2 { {x+: 1 ," B = (3, -%) is the solution of C = (3,l) is the solution of D = (1,3) is the solution of y = -2 2 = 3 i x + y = 4 x-y = -2 Next evaluate the function f at each extreme point: f(A) = --2'0+4 f(B) = 3 = 2 +5 +4 f(C) = 3 - 2 1 - 4 = 12 +4 f ( D ) = 1- 6 = 5 = -1 Thus the maximum of f over X is 12 and occurs at B, and the minimum is -1 and occurs a t D. 23.11. Find the extreme points of the polyhedral convex set X given by 2 + y + x g 3 4 0 2x+2y+2 x-y X k O 2'0 A point p E R3 is an extreme point of X iff it is the intersection of three of the bounding = 1 0 ways to choose 3 of the 5 bounding planes: planes of X. There a r e (g) i = 3 = 4 z = o = 3 = 4 z = o x + y + z = 3 x+y+x x + y + z 2x+2y+z = 4 2x+2y+x 2x+2y+x x-y = 0 ii) x+y+z 2-y = 3 = 0 x = o (v) (iii) (ii) 1 = 3 2x+2y+x x = o 2-y x + y + z x = o (vi) x--y = 4 = 0 = 0 x = o (iv) 2 = 0 (vii) x-ty+z = 3 294 CHAP. 23 CONVEX SETS AND L I N E A R I N E Q U A L I T I E S I I 2.'. 2y - z = 4 .t - II'-y x g = 0 z 7 I 0 = = oO 2 - 0 Each system except (iii) yields a unique solution: A, = (0, 3, 0) (vii) A , = (0, 0, 4) (vi) !iii) No solution, i.e. system is inconsistent. (viii) A s = (1, 1,0 ) IIV) A , = (0, 0 , 3 ) (IX) A, = (0, 2, 0) (x) A i o = (0, 0, 0) rv) A, = 3 3 (2' -2 ' 0) To find the extreme points of X, test each of the above points in the given inequalities. The following a r e not extreme points of X: A = ($, f , 0) does not satisfy 2.x + 2u - z A , = (0, 3,O) does not satisfy 2a i A ; = ( O , O , 4) does not s a t ~ s f y .L 2u x y-z L =' 4 4 3 The other six points do satisfy the given inequalities and a r e the extreme points of X. The adjacent diagram sho\\s the position of these points. Note t h a t we obtained the extreme points purely algebraically. 23.12. Find the maximum and minimum values of the function f = 4 x - 2 y - x over the bounded polyhedral convex set X of the preceding problem. Evaluate t' a t each of the six extreme points of X: t'(A1) 1 f(+,+,2) f ( A J =- f(0,1,2) t(A4) 1 f(O,O,3) = 2 - 1- 2 !(A,) -1 = f(l,l,O) +0 +0 0+0+0 = 4 -2 2 - 2 = -4 J(A,) = f ( 0 , 2 , 0 ) = 0 - 4 0 -0 - 3 f(A1,) = f(0,0 , O ) = 0 - -3 =z = 2 -4 z= 0 The maximum value of f over X is 2 and occurs a t the point A s . The minimum value of f over X is -4 and occurs at the extreme points A , and A , and so also occurs at each point on the line segment joining A , to A,. MISCELLANEOLIS PROBLEMS 23.13. Prove: Let f be a linear function defined over R": f(X1, x 2 , . . . , X!!!) = ClXl + C2Xr +. .+ * Then f o r any vectors P,Q E R"' and any real number k , (i) f ( p+ Q) = f(P)+ f(Q), (ii) f ( k P ) = k f(0 CnlXrn 295 CONVEX SETS AND LINEAR IN EQ U A LITIES CHAP. 231 23.14. Prove: Let f be a linear function over R”’ with f ( P )6 f ( Q ) . Then for 0 6 t f(p)L f ( t P + (1 Since t and ( 1 - ~ t)&) 1, f(&). a r e each non-negative, t ( P ) f ) 6 tj(P) 5 and tf(Qi f ( Q ) implies (1- t ) f ( P ) (1- t ) f ( Q j f Hence, using the iesult of the preceding problem, we have f(P) t f(P) - + (1 - t ) f ( P ) t t ( P ) f (1 tj f ( Q ) = t ( t P -- (1- t ) Q ) ~ t f ( Q ) + (1- t ) t ( Q ) = f ( Q ) t ( f P (1- t ) Q ) = t f ( P ) + (1 - t ) fCQ) and 23.15. Prove Theorem 23.2: Let f be a linear function defined over a line segment p& in R”‘. Then for every point A E P&, either (i) f ( P )A f ( A )6 f ( Q ) or (ii) f(Q) A f ( A ) f ( P ) . In particular, if f ( P )= f ( & ) then f’ has the same value a t every point in P&. The proof folloivs from the preceding problem and the fa c t t h a t A = f P f (1- t ) Q for some t between 0 and 1. 23.16. Prove Theorem 23.3: Let X be a half space convex: P , Q E X implies & P CX. + * . . + unIsm4 b. a,x, Then X is Now X consists of all the points A E R”’ such t h a t f(A) Hence if P , Q E 5 b X , then f(Pj 6 t(P) 6 f(A) f = alxl where + ~ 2 f% . . ~. + a,,,r,,, & b and f ( & ) f b. Thus, by Theorem 23.2, if A E P f f(Q) aE b f(Q) or 6 f(A) 5 f(P) f then either b In either case T”A) 5 b and so W c X . 23.17. Prove Theorem 23.5 (for the case w i = 2): Let f be a linear function defined over a convex polygon X . Then f takes on its maximum (and minimum) value at a corner point of X . Suppose t h at t’ takes on its largest corner value at the corner point P . Let A be any other point in X. We w a n t to prove t h a t f ( A )5 j ( P ) . Since X is bounded, the line passing through the points P and A will intersect a bounding line of X a t a point Q such t h a t A lies between P and Q: A E PQ. Furthermore, Q must lie on the line segment joining two corners, say, U and V : Q E P \ UT. Suppose f ( P ) < f ( A ) ; then by Theorem 23.2, f ( P ) < f ( A ) f ( Q ) . That is, f ( A ) lies between f ( P ) and f ( Q ) . On t h e other hand, by Theorem 23.2, either i ( Q ! f ( U ) or f(&) f(V’). Combining the various inequalities, we finally have f(p) f(A) f(Q) f ( l 7 o r f ( P ) < f ( A ) )V / f f f 4 f(Q) fW) However, both cases contradict the assumption t h a t f ( P ) w a s the largest corner value. Accordingly, f(P)< f ( A ! IS impossible and so f ( A ) t ( P ) . f 23.18. Prove Theorem 23.5: Let f be a linear function defined over a bounded polyhedral convex set X in R”. Then f takes on its maximum (and minimum) value a t a n extreme point of X . The proof is by induction on m. If m = 1, then X is a line segment and by Theorem 23.2 f takes on its nlasimum (and minimunii value a t a n extreme point, i.e. a n end point. CONVEX SETS AND LINEAR INEQUALITIES 296 'CHAP. 23 We assume the theorem holds f o r a polygonal convex set in Rm-1. Let A E X. Since X is bounded, the point A belongs to some line segment p& where P and Q each belongs to a bounding half space of X . By Theorem 23.2, f ( A ) must lie between f ( P ) and f(Q) and so f must take on its inaximum land minimum) value a t a point on a bounding half space. Kow every point B E X on a bounding half space of X belongs to the intersection Y of X and a hyperplane H . However, Y is a bounded polyhedral convex set of H and H is isomorphic to R1"-1. Thus by induction f takes on its maximum value a t a n extreme point of Y which is also an extreme point of X. Supplementary Problems LINEAR INEQUALITIES IN TWO UNKNON'NS 23.19. Graph each inequality: (i) x - 2y 23.20. Graph each inequality: (i) x 23.21. Graph each inequality: (i) -2 23.22. Plot the solution to each system: ii) 2.c - y 6 4 (ii) x y x 5 1 x-2y 23.23. f f 4, (ii) 3x 3, iii) x f x f + 2, f 2 2 +y (iii) y -4, (ii) 0 5 x 2 5 6 2, iiv) y 1- 2 0, (iv) 3x - 5y -15. -3. 3. (iv) -1 5 x 6 4 - 2 f y 5 1 (iii) 2 2 4- 3y 6 - 1 f y 6 2 5 2 Plot the solution to each system: ii) x - y e 2 (ii, 3 r 1 2y 6 6 2 x f y 2 -2 2 y ' l u s 1 y (iii) 2x + 5y 6, f (iii) x - 2y 2 -4 - 2 5 x 5 1 -1 y 2 -3 LINEAR FUNCTIONS OVER POLYHEDRAL CONVEX SETS 23.24. A = (-1,l) B (1,-2) Find the maximum and minimum values of each function over the convex polygon X shown in the diagram on the right: (i) f = 3 x f 2 y ; f i i ) .g = --z 2y - 4; (iii) h = x - 2y. C = (4, -1) D = (5,2) + 23.25. , Find the maximum and minimum values of over the convex polygon f = 3r-y x+2y x-y x f f % 4 1 -1 23.26. Find the maximum and minimum values of the function f = 2 2 given by 2-y 2 0 3x+y 2 3 x r 4 23.27. Draw the convex polygon X given by x +y x-y 6 +y +3 over the convex polygon 3 -3 y ' 0 x 2 -1 2 4 2 Find the corner points of over X . X 2 and the maximum and minimum values of the function f = 22 - y CHAP. 231 23.28. CONVEX SETS AND L I N E A R I N E Q U A L I T I E S Draw the convex polygon X given by s+2y x-y 297 6 2 0 2 ' 4 y r o Find the corner points of over X. 23.29. X and the maximum and minimum values of the function f = 2 x - 3 y Find the extreme points of the following bounded polyhedral convex set X in R3: 3x+3y+x 5 6 x-y 2 0 2'0 y ' 0 2 ' 0 2 5 3 Find the maximum and minimum values of the function f = x 23.30. Draw the polygonal convex set X given by zf2y 2-y + 2y - x over X . '4 2 -2 2 ' 2 Find the maximum and minimum values (if they exist) of each function over X f = 2x++, g = 22-yy, h = x-3y Answers to Supplementary Problems (ii) (iii) (ii) (iii) 23.20. ;CHAP. 23 C O N V E X SETS AND LINEAR I N E Q C A L I T I E S 298 23.21. 41 -4 (ii) 23.22. \ -4 23.23. \ (ii) (iii) (ii) (iii) 4i i i'i 23.24. (i) max f = 19, min f = -1; 23.25. max f = 5, min f = -5.5 23.26. max f = 15, niin f = 2 i,ii) max g = -1, min g = -10; (iii) rnax h = 6, min h = -3 23.27. A = B = c = D = E = (-1, 0) (2, 0 ) (2, 1) ( 0 , 3) (-1, 2) Maxi = 4, a t B. M i n f = -4, a t E. CHAP. 231 299 CONVEX S E T S A N D L I N E A R I N E Q U A L I T I E S 23.28. A = (0, 0) B = (4, 0) c = (4, 1) D = (2, 2) Maxf = 8, a t B. Min f = -2, a t D . 23.29. The corner points are: (O,O,O),(2,0,0), ( 1 , 1 , 0 ) ,(0,0,3), (1,0,3), (4,$ , 3 ) . Max f = 4, min i = -3. 23.30. Maxf = 5 a t A . niin t’ does not exist. Max g and niin g do not exist. Min h = -6 a t 8 . max h does not exist. Chapter 24 linear Programming LINEAR PROGRAMMING PROBLEMS Linear programming problems are concerned with finding the maximum or minimum of a linear function f = C I X I C Z X Z + * * . + CntXrn + called the objective function, over a polyhedral convex set X which includes the condition that the unknowns (variables) are non-negative: 21 2 0, X Z 0 ,~ . . ., X,n 2 0 Each point in X is called a feasible solution of the problem, and a point in X a t which f takes on its maximum (or minimum) value is called an o p t i m m solution. Example 1.1. (Rlaximum Problem.) A tailor has the following materials available: 16 sq. yd. cotton, 11 sq. yd. silk, 15 sq. yd. wool. A suit requires the following: 2 sq. yd. cotton, 1 sq. yd. silk, 1 sq. yd. wool. A gown requires the following: 1 sq. yd. cotton, 2 sq. yd. silk, 3 sq. yd. wool. I f a suit sells for $30 and a gown f o r $50, how many of each garment should the tailor make t o obtain t h e maximum amount of money? The following table summarizes the above facts: Let 5 be the number of suits the tailor makes and y t h e number of gowns. We seek to maximize + f = 30% 50y subject to 2x +y L_ 16, x + 2y f 11, x i - 3y 15, r 2 0, y % 0 Observe t h a t the inequality 2 x + y 6 16 corresponds to t h e f a c t t h a t 2 sq. yd. of cotton is required f or each suit and 1 sq. yd. f o r each gown, but only 16 sq. yd. is available. The inequalities x + 2 y f 11 and x + 3 y f 15 correspond to the analogous facts about t h e silk and wool, Fig. 24-1 is the g r a p h of t h e convex polygon X given by t h e above inequalities. The corner points of X a r e 0 = ( O , O ) , A = (8,0), B = (7,2), C = (3,4), D = (0,5) The values of t h e objective function f = 302 50y at t h e + corner points a r e f(0) = 0 , J ( A )= 240, f ( B ) = 310, f(C) = 290, f ( D ) = 250 Thus the maximum of f over X is 310 and occurs at t h e point B = ( 7 , 2 ) . I n other words, the maximum amount of money the tailor can g e t is $310, and this will happen if he makes 7 suits and 2 gowns. 300 Fig. 24-1 CHAP. 241 301 LINEAR PROGRAMMING Example 1.2. (Minimum Problem.) A person requires 10, 1 2 and 12 units of chemicals A , B and C, respectively, for his garden. product contains 5 , 2 and 1 units of A, B and C, respectively, per j a r ; and a dry product 1, 2 and 4 units of A , B and C, respectively, per carton. If the liquid product sells f o r $3 per the d r y product sells for $2 per carton, how many of each should he purchase to minimize the meet the requirements? The following table summarizes the above facts: Units per j a r Units per carton Units needed Chemical A Chemical B Chemical C 5 2 1 1 2 10 12 12 co s t $3 $2 4 A liquid contains j a r and cost and Let x denote the number of j a r s bought and y the number of cartons. We seek to minimize the function g = 32 2y subject to the conditions 12 5x y 2 10, 22 2y 2 12, z 4 y 2 12 P and t h e f a c t t h a t z and ?j a r e non-negative, z 0, u 2 0. + + + + Fig. 24-2 is the g r ap h of t h e polygonal convex set T of this minimum linear programming problem. The corner points of T a r e 8 P = (0, lo), Q = (1,5), R = (4,2), S = (12,O) N t h o u g h T is not bounded, the objective function y = 32 2y is never negative over T and so takes on a minimum value a t a corner point of T . Since g ( P ) = 20, g ( & ) = 13, g(R)= 16, g ( S ) = 36 4 + 4 8 12 16 Fig. 24-2 the minimum value of g over T is 13 and occurs a t Q = (1,5). In other words, he should buy 1 j a r and 5 cartons. DUAL PROBLEMS Every maximum linear programming problem has associated with it a corresponding minimum problem called its dual and, conversely, every minimum linear programming problem has associated with it a corresponding maximum problem called its dual. The original problem is termed the prinzal problem. To be more specific, we define each of the following problems to be the dual of the other: (i) Maximum problem: Maximize f = 0, xz and (ii) Minimum problem: Minimize g = subject to + CIXI b1zu1 + + . + CmXm ~ 2 x 2 0, * .... Xm 10. +- + b 2 ~ 2 * * bkWk + + + aklwii + a 2 m 2 + . . . + ak~wk 2 cz .............................. almzuI+ + + akmwk allwl a12wl alzc2 * * * ~ * ~* 2 2 c1 ~ 2 Cm 302 [CHAP. 24 L I N E A R PROGRAMMING Let us write the coefficients of the above inequalities in the form of a matrix as follows with the coefficients of the objective function as the last row: (i) Maximum problem: (ii) Minimum problem: In each case the matrix of coefficients of the dual problem can be obtained by taking the transpose of the matrix of coefficients of the primal problem. (Note that there is no last entry in the last row and last column of either matrix.) Example 2.1. Find the dual of the maximum problem of Example 1.1. Write the matrix of coefficients of the given (primal) problem and then take its transpose: Primal problem 30 3 15 50 * Dual problem 16 11 15 Thus the dual minimum problem is as follows: Minimize g = 16u subject to +v f w + 2v -I- 3w 2u u and + l l v + 15w ZL 2 0, v 2 30 2 50 0. 0, w Example 2.2. Find the dual of t h e minimum problem of Example 1.2. Write the matrix of coefficients of the given (primal) problem and then take i t s transpose: Primal problem 1 4 12 3 * 2 Dual problem 10 12 1 2 Thus the dual maximum problem is as follows: + 12v + 12w 5u + 2v + w 3 Maximize f = 10% subject t o f ZL+2V+4W and 21 2 0, v 2 6 0, w 2 5 0. In general, a linear programming problem need not have an optimum solution. However, the following theorem, which is the main result in the theory of linear programming, applies: CHAP. 241 303 LINEAR PROGRAMMING Duality Theorem: The objective function f of a maximum linear programming problem takes on a maximum value if and only if the objective function g of the corresponding dual problem takes on a minimum value and, in this case, m a x f = min g. Furthermore, if P and Q are feasible solutions such that f ( P ) = g(&), then P and Q are optimum solutions of their corresponding problems. The proof of this theorem lies beyond the scope of this text. Example 2.3. Consider the maximum problem of Example 2.2. The graph of the polyhedral convex set X given by the inequalities of t h a t problem appears on the right. The corner points a r e P = (lOil9, 0, 7/19) Q = ( 3 / 5 , 0,0) R = (114, 7/8, 0) s = 10,1, 0 ) T = (0,0 , 3) 0 = (0,0,0 ) + 2) + The objective function t' = lox 12u 12w takes on its maximum value a t the point R and m a x f = 13. On the other hand, min g = 13 for the corresponding miniinum problem appearing in Example 1.2, as predicted by the Duality Theorem. U MATRIX NOTATION Let the matrix A , the column vectors b and x, and the row vectors c and w be defined as follows: A = u": 1:: :::,):I ........*....... akl ah2 ' ' ' b = p, [;:I x = b k / akm c = ' 20 = ( C l , c2, . . ., e m ) . . ., W k ) (201, '202, Xm Then the maximum and minimum linear programming problems (1) in the preceding section can be rewritten in the following short compact form: (ii) Minimum problem: (i) Maximum problem: Minimize g = wb Maximize f = cx subject to A x and X Remark: b 0. subject to zuA and 20 c (1) 0. If ZL and v are vectors, then zc v shall mean that each component of u is less than or equal to the corresponding component of v. INTRODUCTION TO T H E SIMPLEX METHOD Let X be the polyhedral convex set, i.e. set of feasible solutions, of the above maximum linear programming problem. An optimum solution can be found by evaluating the objective function f a t each corner point of X. However, if the number of unknowns and inequalities is large, then i t can be a long and tedious task to find all the corners of X . An alternate way of finding an optimum solution is the following: 304 LINEAR PROGRAMMING [CHAP. 24 (1) Select a corner of X as a starting point. (2) Choose an edge through this corner such that f increases in value along the edge. (3) Proceed along the chosen edge to the next corner. (4) Repeat steps (2) and (3) until a n optimum solution is obtained, i.e. a corner is reached such that f does not increase in value along any of its edges. The above process is accomplished mathematically by the so-called simplex method. Observe, first of all, that the maximum linear programming problem (I), Page 301, is equivalent to the following problem which contains only linear equalities and not linear inequalities; the variables xm 1, . . , , rFl1 t lc are called "slack" variables: Maximize f = subject to . . . + c , x m + O ~ x , l t ~ + O ~ ~ m +.z. +. + O * x m t h clx1+cLx3+ + ~ ~ 1 1 U ~ U1 a21X1 + . . . + almXm + X~ + a2222 + ~ . + azmxm + ~ 1 = bl Xnit2 = bz - + ................................................ and aklXl + x1 A 0, ahax2 c ' ' ' + 0, . . ., XL f akmXm Xm 0, Xmtl = bk Zmtk 0, . . . . Xm+k % (2) 0. The algorithm of the simplex method consists of four steps (see Page 307) which correspond, respectively, to the above steps. For the sake of clarity we shall explain each step separately. Remark: In order to avoid special cases, we shall assume the following additional conditions on our linear programming problems: (i) Positive condition: Each component of b is non-negative ( b % 0) and each row of A contains at least one positive entry. (ii) N o n - d e g e n e r a c y condition: If r is the rank of A , i.e. the number of independent rows (or columns) of A , then c is not a linear combination of less than T rows of A and Z, is not a linear combination of less than r columns of A . INITIAL SIMPLEX TABLEAU The initial simplex tableau of the maximum linear programming problem (1) or the equivalent problem ( 2 ) is as follows: P1 Pa * ' * Pm .................. akl I akz " ' akm 1 ............ 0 0 I " ' 1 I I I J indicators Observe that the rows of the tableau, excluding the last, are the coefficients of the linear equalities in ( 2 ) . The columns of the coefficients of the non-slack variables are labeled Pi, respectively; their significance will appear later. The entries in the last row, excluding the last entry, are called i n d i c a t o r s and are the negatives of the coefficients of the objective function f in ( 2 ) . The entry in the lower right hand corner is zero; i t corresponds to the value of the objective function f at the origin. CHAP. 241 305 LINEAR PROGRAMMING Although the simplex method is stated in terms of the maximum linear programming problem, the method will give the optimum solution to both the maximum problem and to its dual minimum problem. Remark: Example 3.1. Maximize j = 30.1. subject to +y + 22/ 22: 6 2: 5 16 11 6 15 + 3y 3: and Initial simplex tableau: - 50y Pl 15 -30 0, u ’ O n Example 3.2. Maximize f = 1 0 ~ 12v ~ subject to and -50 0 0 0 0 Initial simplex tableau: + + 12w 5 u + 2w + w 3 + 2v 4w ‘2 5 ti pi? Pl pz p3 5 2 1 1 0 3 T Z L ~ O ,w 2 0 , W k O Example 3.3. Initial simplex tableau: Maximize f = x 2y subject to + 8x z .r.+y+x 1 f 2 pz p3 l o 1 -1 ‘0 -z+y and r PI 3 - 5 0 , y z o , 75’0 PIVOT ENTRY O F A SIMPLEX TABLEAU A pivot entry or simply a pivot of a simplex tableau is obtained as follows: (i) Arbitrarily select a column which contains a negative indicator. (ii) Divide each positive entry in this column into the corresponding element in the last column. The entry which yields the smallest quotient in step (ii) is the pivot; its column is called the pivotal column and its row the pivotal row. If the tableau has no negative indicator, then it is a terminal tableau and has no pivot. Properly interpreted, the terminal tableau yields a solution to our problem. The procedure which follows represents a method of passing to such a terminal configuration. Example 4.1. Consider the following simplex tableaux: Pl I 1 p2 2 1 1 0 0 1 1 2 1 PI pz 0 0 p3 1 I 1 1 @ 0 1 - 1 -30 -50 , 0 0 0 0 0 2 0 -1 0 0 0 0 1 0 1 1 1 1 7 1 0 (ii) 1 1 2 1 5 1 6 306 0 1 0 0 0 - 0 1 1 1 40 -10 4 290 f iv) (iii) (i) [CHAP. 24 LINEAR PROGRAMMING Either the first column o r the second column can be chosen a s the pivotal column. In the first case 2 is the pivot since 312 is less than 1 2 / 1 and 5/3. In the second case 1 is the pivot since 5/1 is less than 1212. We need not consider the quotient 3/(-1) since -1 is not positive. (ii) Here the second column must be chosen a s the pivotal column since i t is the only one with a negative indicator. Furthermore, 1 is the pivot since 2/1 is less tha n S/2. (iii) The fifth column is t h e pivotal column and 3 is the pivot since 6/3 is less tha n 4/1. (iv) There is no pivotal column and no pivot since there is no negative indicator. This tableau is a terminal tableau. CALCULATING THE NEW SIMPLEX TABLEAU Let D be a simplex tableau with rows R, whose pivot appears in the rth row and sth column; say D is the matrix (&) and so d,, is the pivot. A new tableau D* with rows RF is calculated from D (using appropriate “elementary row operations”) so that 1 appears in the original pivot position and zeros elsewhere in that column. This is accomplished as follows: (i) First divide each entry of the pivotal row R, of D by the pivot d,, to obtain the corresponding row R: of D*:: 1 R: = -I?, drs If the pivotal column of D is labeled P,, then this row of D* is labeled (or relabeled) Pi. (ii) Every other row €2: of D* is obtained by adding the appropriate multiple of the above row R: to the row Ri of D: R: = -disRT -k R, If one of these rows of D is labeled Pi, then the corresponding row of D* is given the same label; otherwise the row is not labeled. Example 5.1. Consider the following tableau with a pivot entry circled: R,: R,: R$ I R,: 1 - 8 - 6 1 0 I 0 I 0 1 0 1 We remark t h a t the circled 2 i s a pivot entry since i t appears in a column with a negative indicator and 412 is less than both 22/1 and 914. We now calculate the new tableau step by step: (i) Divide each entry of the pivotal row R , by the pivot 2 t o obtain the new row RZ; label label of the pivotal column, i.e. PI: Rl with the CHAP. 241 L I N E A R PROGRAMMING 307 2 R4 + (ii) RF is obtained by multiplying the above row R: by -1 and adding i t t o R,: RT = -R,* R,; R: is obtained by multiplying R,* by -4 and adding i t to R3: R; = -4R* R3; RZ is obtained by multiplying RZ by 8 and adding i t t o R,: RT = SR; R4. These rows do not change labels. Thus the completed new tableau is PI p, J - + The intermediate tableau appearing in the preceding example was illustrated separately from the completed tableau only f o r the sake of clarity; in general, we calculate the new tableau directly from the given tableau. We remark that if we choose the second column as the pivotal column and so obtain a different pivot, then we would calculate a different new tableau. This corresponds to the fact that in moving from corner to corner there may be more than one edge on which the objective function f increases in value; hence there is more than one way to finally reach the optimum solution. The simplex method begins with the initial tableau and then calculates new tableaux one after the other until a terminal tableau is obtained. We remark that in each step a t most one row, the pivotal row, changes its label; the columns keep the same label throughout. Example 5.2. Consider the following tableau with a pivot entry circled: Pl p2 Pl (*I 0 0 0 -10 4 0 0 16 Divide each entry of the pivotal row by the pivot 2 to obtain the row o l o l g a The new tableau is Pl p2 1 0 0 ~ 0 1 (**) 45 1 2 1 1 It is calculated as follows: (i) t h e first row is obtained by multiplying (**) by -3 and adding the result t o the first row of (*); (ii) t h e second row is obtained by multiplying (**) by fr and adding the result t o the second row of (*); it retains the label PI; (iii) t h e third row is (**); it i s labeled P,, the label of the pivotal column of (*); (iv) the fourth row is obtained by multiplying ( * c ) by 10 and adding the result t o the fourth row of (*). Note t h a t t h e new tableau is a terminal tableau, i.e. it ha s no negative indicators. 308 L I N E A R PROGRAMMING [CHAP. 24 INTERPRETING T H E TERMINAL TABLEAU Suppose after performing the algorithm of the simplex method on the linear programming problem ( I ) , Page 301, we obtain the following terminal tableau: ... some labeled with Pi Then: (i) ............. a * ... * * 4: . . . * ... ............. * a ... I I I I 4, ... (I, q k I v I v, the entry in the lower right hand corner of the tableau, is the maximum value of f (and so by the duality theorem is the minimum value of g: m a x f = m i n g = v); (ii) the point Q = (q,, qL, . . . . (I,) is the optimum solution of the minimum problem; its components are the last entries in the columns corresponding to the slack variables; (iii) the point P = ( p l , p l , . . . . p,,,), where 1the last entry of a row labeled Pt ), 0 if no row is labeled P, is the optimum solution of the maximum problem. 'I1 = Thus a necessary condition (and so a check) that the calculations are correct is that f ( P )= g ( Q ) = v. Example 6.1. Find the optimum solution Q = ( q l , . . . . q k ) of the minimum problem, the optimum solution P = (p,, . . ~,p,?,) of the maximum problem, and v = max f = min g according to each of the following terminal tableaux: PI p2 p, PI P , P , ::: * :: * * * (i) (i) Q = ( 3 , 4 , 5 ) , P = ( 0 , 2 , 6 ) , w = max f = min g = 19. no r o w is labeled P,. :* * (ii) Note tha t the first component of P is 0 since (ii) Q = ( 2 , 5 ) , P = (0,7,0), w = max f = min g = 11. Note tha t the first and third components of P a r e each 0 since no row is labeled P , o r P,. ALGORITHM OF T H E SIMPLEX METHOD The algorithm of the simplex method consists of four steps which correspond, respectively, to the steps on Page 304: (1) Construct the initial tableau. (2) Find and circle a pivot entry in the tableau. (3) Calculate the new tableau from the given tableau and circled pivot. (4)Repeat steps (2) and (3) until a terminal tableau is obtained. The following examples illustrate the use of this algorithm. CHAP. 241 309 L I N E A R PROGRAMMING Example 7.1. Applying the simplex method to Example 1.1, Page 300, we obtain the following sequence of tableaux: Pl -30 p2 -50 0 0 , 0 -1 0 0 Pl p2 0 0 0 40 -10 290 (iii) Thus, by the terminal tableau (iv), the optimum solution is P = (7,2) since 7 and 2 a r e the last entries in the rows labeled Pl and P , respectively, and the maximum value of f is 310. This agrees with the previous solution of the problem. Observe t h at the numbers 0, 250, 290 and 310 which a ppe a r respectively in the lower right hand corners of t h e tableaux correspond precisely to the values of the objective function f a t the points 0, D , G and B in Fig. 24-1. Example 7.2. We solve the problem in Example 1.2, Page 301, by the simplex method; the first tableau is the initial tableau of the dual maximum problem: I ; : I : :I:) Pl 0 -10 -12 -12 0 0 p2 p3 0 12 PZ 0 -4 12 (ii) 0 Remark: 0 9 1 1 5 13 The rows and columns of the tableaux in the simplex method are labeled so that we can obtain the optimum solution of the maximum problem from the terminal tableau. However, if only the optimum solution of the minimum problem is required, then we can dispense with the labels. For example, the labels in the preceding example were not needed. LI N E A R PRO GRAM PVII N G 310 [CHAP. 24 Solved Problems DUALITY 24.1. Find the dual of each problem. ii) Maximize f = 3n: t 5 y subject to 2x - y + 32 and X k O , + 22 subject to 6 8 4 + 6y + 72 x + 2y + 52 2 4 (ii) Minimize g = 22 2x-y+2x 3x+5y+x x+2y+4x 4 2/20, 220. and y”0, X k O , 3 1 A 2 220. In each case form the matrix of coefficients of the given problem with the objective function as the last row, and then form t h e transpose matrix. (i) Coefficient matrix: 3 2 - 1 Transpose: 2 6 1 Dual problem: Minimize g = 621 3 subject to 6 8 * and (ii) Coefficient matrix: Transpose: 1 2 + 8v + + 2u v -u+2v 3u 4v 2 ?= 2 3 5 2 uk0, vk0. Dual problem: Maximize f = 4 2 ~ 3v 3 subject to + +w u + 2v + 3w 2 2u - v + 5tu 6 f f 6 7 5u+2v+w * 7 f uso, v s o , wso. and 24.2. Find the dual of each problem. (i) Maximize f = 5 x -y + 22 7x + 2y - x subject to L 8 4 subject to x-39-28 A ~ x - Y + ~6z 5 and ii) x”0, 21’0, 2 6r-4s- and rZ0, sk0, F i r s t multiply the second inequality by -1 to change its sense to 7 2 - 1 [-:-: 5 -1 8 Transpose: 7 -1 3 -j [-: : -;-j 5 7 7 2 tho. 6. subject to Transpose: 9 -6 2 7u - v -t 3w 2u+3v-w - ~ t t2 Gw~ 5. Dual problem: Maximize f = 72 subject to 7 -2 * and + 5w 1 5 -1 2 2 2 u50, v t o , w t o . and F i r s t multiply the second inequality by -1 to change its sense to 9 - 3 L Dual problem: Minimize g = 8zc - 4v 8 -4 Coefficient matrix: 3t X L O . Coefficient matrix: iii) + 5s + t 9r - 3s + 5t (ii) Minimize y = 2r - 2y 9x - Gy - 3 x + 4y 5x t 3y f 5 f z20, y t o . 2 5 1 L I N E A R PROGRAMMING CHAP. 241 311 SIMPLEX METHOD 24.3. Find the initial tableau of each problem. (ii) Minimize y = 2r (i) Maximize f = 5% f 2 y - x subject to x - 3y f 4 x + + 6 8 5 2 y 62 L 2 ~ + 7 y -f~ 9 3x and (i) X L O , y’0, subject to + 5s + t 9 r t 2s + 4 t 2 67’+3~-2t 6 8 w[ and ,220. The initial tableau is of the form with the columns of A labeled Pi. p i Pl pz p3 0 0 1 1 9 1 1 0 0 0 1 0 1 7 - 1 2 -5-2 (ii) Find the dual maximum problem and then construct the initial tableau as i n (i). PI pz 24.4. Find a pivot entry of each of the initial tableaux in the preceding problem. In each case, (1) select any column with a negative indicator a s the pivotal column, (2) divide each positive entry in this column into the corresponding entry in the l a s t column, and (3) choose a s the pivot t h a t e n t r y which yields the smallest quotient in 12). (i) We can choose either the first o r second column as the pivotal column. ( a ) If the first column is chosen, f o r m the quotients 8/1, 513 and 912. Since 5/3 is the smallest number of the three, the entry 3 in the second row is the pivot. ( b ) If the second column is chosen, form the quotients 512 and 917. Since 9/7 is the smaller of the two numbers, the entry 7 in the third row is the pivot. Note t h a t the quotient 8 / ( - 3 ) is not considered since -3 is not a positive number. (ii) We can choose either the first o r the second column as the pivotal column. I n the first column, 9 is the pivot since 2/9 i s the smallest of the numbers 2/9, 5/2 and 114. In the second column, 6 is the pivot since 2/6 i s the smaller of the numbers 216 and 513. 2 1 1 1 0 0 7 1111 Coefficientmatrix: Transpose: 2 3 1 Dual problem: Maximize f = 4u 2 subject to \2 6 7 ZL 6 Zu-v+fw 5u+2v+w * and 24.2. + 3v + w + 2v + 32u 2 U h O , 6 7 6 f ?.‘so, w“0. Find the dual of each problem. (i) Maximize f = 5% - 3 ‘ix + 2y - x subject to (ii) Minimize y = 2 r + 5s + t 8 x-3y-22 1 4 3 x - y + 6 ~6 5 and (i) + 22 + subject to _L 9 r - 3s 5t 6r - 4s - 3t and T ~ O First multiply the second inequality by -1 to change its Sense to Transpose: 7 - 1 3 5 -1 8 -4 5 Coefficient matrix: + u”0, vso, Transpose: 7 9 -6 2 7 -2 Dual problem: Maximize f = 7 2 * and 3 1 obtain 1 1 t and add t h e result 1to the0 second row 0 z 1to 6 1fj z1 s 4 Multiply ( 1 ) by 7 and add t h e result oto theo lasto row$to obtain tableau is Thus the calculated Pl p2 W S O . k. subject to Multiply ( 1 ) by t”0. Dual problem: Minimize g = 8u - 4v 3. 5w subject to 7u- v 3w 5 2 u + 3 v - w 2 -1 -~+2v+6w 2 2 First multiply t h e second inequality by -1 to change its Sense to 5 7 2 6, and 9 - 3 f x--”o. x”0, y ” 0 , Coefficient matrix: 2 -1 8 (ii) s”0, , 1 - 276 9% - 6y -32+4y 5x+3y z”0, f 5 f y10. 2 5 1 CHAP. 241 313 L I N E A R PROGRAMMING 24.7. Using the pivot circled in the following tableau, calculate the new tableau and then find a pivot entry of it. pi I p, 2 3 ' 1 0 0 3 8 1 o I 1 : l ' 0 Divide each entry in t h e pivotal row, i.e. the third row, by the pivot 1 to obtain the (same) row 1 2 0 0 1 3 (1) Multiply ( 1 ) by -2 and add the result t o the first row to obtain 0 - 1 1 0 - 1 2 Multiply ( 1 ) by -3 and add the result to the second row to obtain 0 0 2 1 - 3 2 Multiply ( 1 ) by 2 and add the result t o the last row to obtain 0 - 6 Thus the new tableau i s Pl 0 2 0 6 pz where the third row is labeled P I , the label of the original pivotal column. The second column of this tableau has the only negative indicator and so must be chosen as the pivotal column. Since 2/2 is less t h an 3 / 2 , the 2 in the second r o w is the pivot. 24.8. Using the tableau and the pivot obtained in the preceding problem, calculate the new tableau. Divide each entry of t h e pivotal row, i.e. the second row, by the piTiot 2 to obtain the row 0 1 0 1 - 3 2 Then the new tableau i s Pl 1 p2 PZ Pl 0 0 2 0 0 2 2 -1 4 3 - 7 12 (1) 314 24.9. LINEAR PROGRAMMING [CHAP. 21 In each of the following terminal tableaux, find the optimum solution Q = (ql, . . ., q,) of the minimum problem, the optimum solution P = (pl, . . .,p,) of the maximum problem, and v = m a x f = m i n g . Pl I : ; Pl p , p2 p3 p3 Pl * * * * * * * * * * * 7 1 p3 2 12 * * 8: a: (i) p3 * * * 3 4 (ii) Recall that: (1) the components of Q a r e the la st entries, respectively, in the columns corresponding t o the slack variables; (2) the component p i of P is 0 if no row is labeled P , or the last entry in a row labeled P i ; ( 3 ) v is the entry in the lower right hand corner of the tableau. (i) Q = (7,1, 2), P = (5,0,3), v = 12. Note t h a t the second component of P is 0 since no row is labeled P2. (ii) Q = ( 3 , 4 ) , P = (0,0:2), v = 5 . Note t h a t the first two components of P a r e 0 since no row is labeled P , or P2. 24.10. Solve the following linear programming problem by the simplex method. Maximize f = 4 x - 2 y - x subjectto and + + x-y 3 4 0 y’0, x”0. x y x 2x+2y+x X k O , f A We obtain the following sequence of tableaux: Pl p, -4 2 Pl p3 p2 p3 0 (i) (ii) Pl p2 p3 (iii) Thus, by the terminal tableau, the maximum of f is 2 and occurs at t h e point P = (1,1,0). (See Problem 23.11, Page 293.) 24.11. Solve the following linear programming problem by the simplex method. Maximize f = 2x-32j + x subject to and 3x + 6 y + x 4x+2y+x x-y+x x s o , y’0, A 6 4 3 2’0. CHAP. 241 315 LIiYEAR PROGRAMMING 4 1 2 We obtain the following sequence of tabIeaux: Pl p, p3 3 6 1 1 1 0 4 2 1 1 0 1 - 1 0 0 0 3 - 1 -2 p, Pl 0 1 0 0 0 Pl 6 1 4 1 1 3 0 0 P3 p3 P3 0 -2 0 -1 1 - 0 2 3 3 :i 0-1 4 s3 3 3 1 0 1 3 3 2 3 ' 0 0 0 1 0 5 (iii) Thus, by the terminal tableau, the maximum value of t' is 1013 and occurs at the point P = (1/3,0,8/3). 24.12. Solve the following linear programming problem by the simplex method. Minimize g = 6 x + 5 y + 2 x + + subject to .P 3y 2x -I 5 2x+2y+x 3 2 4,2:-2y+3x '1 -1 and c r o , y'0, zro. We obtain the following sequence of tableaux. Pl -6 pz -2 p3 Pl 0 0 0 0 p, p3 0 (ii) (i) Pl p3 p2 - . . Pt 1 2 2 . 2 2 1 2 2 (iii) Note first t h a t ( I ) IS the tableau of the dual maximum problem. Note also t h a t the third I'OW of (iii) i s relabeled P , , the label of the pivotal column of the precedlng tabIeau (ii). By the terminal tableau, the minimum value of g IS 5 and occurs at the point Q = (0,0,5/2). LINEAR PROGRAMMING WORD PROBLEMS 24.13. A tailor has 80 sq. yd. of cotton material and 120 sq. yd. of wool material. A suit requires 1 sq. yd. of cotton and 3 sq. yd. of wool, and a dress requires 2 sq. yd. of each. How many of each garment should the tailor make to maximize his income if 316 [CHAP. 24 LINEAR PROGRAMMING (i) a suit and dress each sells for $30, (ii) a suit sells for $40 and a dress for $20, (iii) a suit sells for $30 and a dress for $20? Construct the following table which summarizes the given facts: Suit I Dress ~ Available Cotton 1 2 80 Wool 3 2 120 (ii) Here the objective function is g = 4 0 z + 2 0 y . I Evaluate g a t the corner points: g ( 0 ) = 0, g ( A ) = 800, g(B)= 1400, g ( C ) = 1600 The maximum value of g occurs a t C = (40,O). Thus he should make 40 suits (and no dresses), for which he will receive $1600. (iii) Here the objective function is It = 3@x+20y. Evaluate h at the corner points: h ( 0 ) = 0, h ( A ) = 800, h(B) = 1200, h ( C ) = 1200 The maximum value of h occurs a t both B = (20,30) and C = (40,O). Thus the tailor should make either 20 suits and 30 dresses o r only 40 suits; in either case he will receive $1200. (Actually, any point on the line segment joining B and C is an optimum solution.) 24.14. A truck rental company has two types of trucks; type A has 20 ft3 of refrigerated space and 40 f t 3 of non-refrigerated, and type B has 30 ft3 each of refrigerated and non-refrigerated space. A food plant must ship 900 f t 3 of refrigerated produce and 1200 f t 3 of non-refrigerated produce. How many trucks of each type should the plant rent so as to minimize cost if (i) truck A rents for 30e per mile and B for 40@ per mile, (ii) truck A rents for 30c per mile and B for 50C per mile, (iii) truck A rents for 2OC per mile and B for 30c per mile? Construct the following table which summarizes the given facts: Type A Type B Requirements Non-ref rigerated 1200 Let x denote the number of trucks of type A and y the number of type B t h a t the plant rents. Then x and y a r e subject to the following conditions: 20x + 30y 2 900, 40% + 30y 5 1200, x 2 0, 0 CHAP. 241 317 LINEAR PROGRAMMING The set of feasible solutions is drawn on the right; its corner points a r e 40 P = (0,40), Q = (15, 20), R = (45,O) (i) Here t h e objective function to minimize is f = 30s 4011. Evaluate f at the corner points: + f ( P ) = f ( 0 , 4 0 ) = 1600, f ( Q ) = f(15,ZO) = 1250, f ( R )= f(45,O) = 1350 30 20 10 The minimum value of f occurs a t Q = (15,ZO). Thus the plant should r en t 15 trucks of type A and 20 of B at a total cost of $12.50 per mile. (ii) Here the objective function is g 10 20 30 40 50 = 3 0 x f 5 0 y . Evaluate g a t the corner points: g(P)= 2000, g(&) = 1450, g ( R ) = 1350 occurs at R = (45,O). Thus the plant should re nt 45 trucks of type A The minimum value of g (and none of type B ) at a total cost of $13.50 per mile. (iii) Here the objective function is h = 202 + 30y. Evaluate h at the corner points: h(P) = 1200, h(Q) = 900, h(R) = 900 The minimum value of h occurs at both Q = (15,ZO) and R = (45,O). Thus the plant can r e n t either 15 trucks of type A and 20 of B o r only 45 of type A a t a total cost of $9.00 per mile. 24.15. A company owns two mines: mine A produces 1 ton of high grade ore, 3 tons of medium grade ore and 5 tons of low grade ore each day; and mine B produces 2 tons of each of the three grades of ore each day. The company needs 80 tons of high grade ore, 160 tons of medium grade ore and 200 tons of low grade ore. How many days should each mine be operated if it costs $200 per day to work each mine? 1 Construct the following table which summarizes the given facts: Mine B Requirements High Medium grade g r ad e Low g r ad e 200 Let x denote the number of days mine A is open and y the number of days B is open. Then x and y a r e subject to t h e following conditions: x 2y 80, 3% 2g 5 160, 52 2y 2 200, 2 5 0 , 2( 2 0 + + + We wish t o minimize t h e function g = 2002+200y subject to the above conditions. Method 1. Draw the set of feasible solutions as on the’right. The corner points are: p = (0, loo), Q = (20,50), R = (40,20), S = (80,O). Evaluate the objective function g = 200%4-2 0 0 ~at t h e corner points: g ( P ) = 20,000, g ( Q ) = 14,000, g(R)= 12,000, g(S) = 16,000. The minimum value of g occurs at R = (40,ZO). Thus t h e company should keep mine A open 40 days and mine B open 20 days f or a total cost of $12,000. Method 2. Consider t h e dual problem: + 160v + 2OOw u + 3w + 5w 200 2u + 2w + 2w 200 Maximize f = 80% subject to 6 and u20,w+o, W’O. 318 [CHAP. 24 L I N E A R PROGRAMMING Using the simplex method we obtain the following sequence of tableaux: 1 -80 -120 0 r-a 40 1 40 20 1 1 I 0 ; 100 0 40 8,000 I 12,000 (iii) Thus the minimum value of g is 12,000 and occurs a t the point (40, 20). (Note t h a t labels were not necessary since we were only solving the minimum problem.) 24.16. Suppose, in the preceding problem, the production cost at mine A is $250 per day and at mine B is $150 per day. How should the production schedule be changed? + In this case, the objective function to minimize is g = 2502 150y subject to the same set of feasible solutions, i.e. conditions. Evaluate g at the corner points of the set of feasible solutions: g ( P ) = g(0,lOO) = 15,000, g ( Q ) = g(20, 50) = 12,500, g ( R ) = g(40,20) = 13,000, g ( S ) = g(80,O) = 20,000 The minimum value of g occurs at Q = (20,50). Thus the company should now keep mine A open 20 days and mine B open 50 days f o r a total cost of $12,500. Supplementary Problems DUALITY 24.17. Find the dual of each problem. (i) hlaximize subject to and f = - 2y 52 + 32 + 22 2%+ 7y + 52 32 - 4y 2’0, yA0, (ii) Minimize f 9 f 11 subject to 32+9y+x 2 2 0 . and 24.18. + 3y + 72 6% - 2y + 6x 3 2 2 + 3y - 4x ’ -2 g = 122 ’8 2 2 0 , 2/20, x”0. Find the dual of each problem. + 216 - 2x (i) Maximize f = 32 subject to 42 + 3y 5% - 2y - 22 f 11 + 72 2 2 f 15 x+y+3z and 2’0, (ii) y’0, Minimize subject to + 7s + 2t 7r + 3s + 5 t 9 g = T 4r -i- 8s - t 10 2’0. SIMPLEX METHOD 24.19. Find the initial tableau and all possible pivot entries f o r each of the problems given in Problem 24.17. CHAP. 241 24.20. Calculate the new tableau from each tableau and circled pivot. 0) pi p, -3-4 2 0 1 0 6 0 0 1 2 1 0 0 0 1 0 3 4 p, p, p, Pl IT :I $ * * $ * * * * 3 1 Solve by the simplex method: 2 5 9 Maximize f = 9z subject t o and 24.23. PI I n each of the following terminal tableaux, find the optimum solution Q = (q,, . . . , q k ) of the minimum problem, the optimum solution P = (pl, . . ., p,) of the maximum problem, and ZI = max f = rnin g. P z I ; 21.22. (ii) p, 5 - 5 24.21. 319 L I N E A R PROGRAMMING Solve by the simplex method: + 2y + 52 + 3y - 5 x y + 3x 3x + y - 2x 2s 2.1: - X’O, y’0, 12 f 6 f 3 2 2’0. + 5y + x +y +x 3 2 2 + 2y - ’ 2 x + 2 y ’4 Minimize g = 4x subject t o .?: 1 2 and 2’0, y’0, x 10. LINEAR PROGRAMMING WORD PROBLEMS 24.24. A carpenter has 90, 80 and 50 running feet of plywood, pine and birch, respectively. Product A requires 2 , 1 and 1 running feet of plywood, pine and birch respectively: and product B requires 1, 2 and 1 running feet of plywood, pine and birch respectively. ti) If A sells f o r $12 and B sells f o r $10, how many of each should he make to obtain the maximum gross income? (ii) If B sells f o r $12 and A sells f o r $10, how many of each should he make? 24.25. Suppose t h a t 8, 1 2 and 9 units of proteins, carbohydrates and f a t s respectively a r e t h e minimum weekly requirements f o r a person. Food A contains 2, 6 and 1 units of proteins, carbohydrates and f a t s respectively per pound; and food B contains 1, 1 and 3 units respectively per pound. (i) If A costs 85c per pound and B costs 40r per pound, how many pounds of each should he buy per week t o minimize his cost and still meet his minimum requirements? (ii) If the price of A drops to 75( per pound while B still sells a t do6 per pound, how many pounds of each should he then buy? 24.26. A baker has 150. 90 and 150 units of ingredients A , B and C respectively. A loaf of bread requires 1, 1 and 2 units of A , B and C respectively; and a cake requires 5 , 2 and 1 units of A , B and C respectively. ( i ) If a loaf of bread sells f o r 35C an$ a cake f o r 80<,how many of each should he bake and what would his gross income be? (ii) If t h e price of bread is increased to 45C per loaf and a cake still sells f o r 80$, how many of each should he then bake and what would be the gross income? 24.27. A manufacturer h a s 240, 370 and 180 pounds of wood, plastic and steel respectively. Product A requires 1, 3 and 2 pounds of wood, plastic and steel respectively; and product B requires 3, 4 and 1 pounds of wood, plastic and steel respectively. (i) If A sells f o r $4 and B for $6, how many of each should be made to obtain the maximum gross income? (ii) If the price of B drops t o $5 and A remains a t $4, how many of each should then be made t o obtain the maximum gross income? 320 L I K E A R PROGRAMMIKG [CHAP. 24 24.28. An oil company requires 9,000, 12,000 and 26,000 barrels of high, medium, and low grade oil, respectively. It owns two oil refineries, say A and B. A produces 100, 300 and 400 barrels of high, medium and low grade oil respectively per day; and B produces 200, 100 and 300 barrels of high, medium and low grade oil respectively per day. (i) If each refinery costs $200 per day t o operate, how many days should each be r u n to minimize the cost and meet the requirements? (ii) If the cost a t A rises to $300 per day and B remains at $200 per day, how many days should each be operated? MISCELLANEOUS PROBLEMS 24.29. Solve Problem 24.16 by t h e simplex method. 24.30. Show t h a t the maximum linear programming problem (I), Page 301, is equivalent t o the problem (Z), Page 304. Answers to Supplementary Problems 24.17. (i) Minimize g = 9u subject to 3u + llv + 2v 2 -4u+7v 22 2 u + 5v 24.18. (i) Minimize subject to and (i) P, g = llu 4 u - 5v 3u 2v -2U - 7V v GZL -221 2a + 15w + to ’0, + 8w + 2v + 3w + 3v + 9w 57L - 4v -t w and + + to + 3ZC+ u 5 0, P, - .f = 3u - 2v subject t o 5 -2 3 u s o , V’O. and 24.19. 2 (ii) Maximize 2 3 2 5 -2 2 24. 2 (ii) Maximize 7.7: - 4y 32 - 8y 5x and P3 2 0, w 5 0. f = 9% - 1Oy subject t o zc 2 0. 0, v f 12 3 7 f f +y L_ f 6 1 7 2 s 2 0 , y’0. (ii) P I P, P, 0 -5 2 - 3 0 0 0 24.20. P, 24.21. (i) Q = (3,1,5), P = ( & , & , O ) , 24.22. v = 9. (ii) Q = (3,1), P = ( 0 , 0 , 7 , 0 ) , w = 11. The rnaximuin value of f is 49 and occurs at P = (0,12,5). 24.23. The minimum value of g is 11 and occurs at the point Q = ( 0 , 2 , 1 ) . 24.24. (i) 40 of A and 10 of B f o r $580. 24.25. (i) 1 of A and 6 of B f o r $3.26. 24.26. (i) 50 loaves of bread and 20 cakes for $33.50. 24.27. (i) 30 of A and 70 of B f o r $540. 24.28. (i) A r u n 50 days and B r u n 20 days. iii) 20 of A and 30 of B f o r $560. (ii) 3 of A and 2 of B f o r $3.05. (ii) 70 loaves of bread and 10 cakes for $39.50. (ii) 70 of A and 40 of B f o r $480. (ii) A r u n 20 days and B r u n 60 days. Chapter 25 Theory of Games INTRODUCTION TO MATRIX GAMES Every matrix A defines a game as follows: (1) There are two players, one called R and the other C. (2) A p l a y of the game consists of R’s choosing a row of A and, simultaneously, C’s choosing a column of A . (3) After each play of the game, R receives from C an amount equal to the entry in the chosen row and C O ~ L I I I ~ I Ia, negative entry denoting a payment from R to C. Example 1.1. Consider the following matrix games: I : [ (i) (ii) I n the first game, if R consistently plays the first row, hoping to win a n amount 3 or 4, then C can play the second column and YO win a n amount 1. However, if C consistently plays the second column then R can play the second row and so win a n amount 1. Thus we see t h a t if either player should consistently play a particular row or column, the other player can take advantage of t h a t fact. Now in the second game, player R can guarantee winning 1 or more by consistently playing row 2. Player C can then minimize his loss by playing column 1. Thus in this game i t is “best” if R plays a given row and C plays a given column. On the other hand, suppose a competitive situation is as follows: (i) there a r e two people (or: organizations, companies, countries, (ii) each person has a finite number of courses of action; (iii) the two people choose a course of action simultaneously; . . .); (iv) for each pair (i, j ) of choices, there is a payment b , , from one given player to the other. Then the above competitive situation is equivalent to the matrix game determined by the matrix B = ( b l J ) . Example 1.2. Each of the two players R and C sin~ultaneouslyshows 1 or 2 fingers. If the sum of t h e fingers shown is even, R wins the sum f r o m C ; if the sum is odd, R loses the sum t o C. The matrix of this game is a s follows: Player C 1 2 Player R 2 We shall show later (Example 3.1) t h a t this game is “favorable” t o player C. 321 322 THEORY O F GAMES [CHAP. 25 We remark that matrix games and the above competitive games which can be represented as matrix games are termed two-person zero sum games. The zero sum corresponds to the fact that the sum of the winnings and losses of the players after each play is zero. There also exists a theory f o r n-person games and non-zero sum games but that lies beyond the scope of this text. STRATEGIES By a strategll for R in a matrix game, we mean a decision by R to play the various rows with a given probability distribution, say to play row 1 with probability pl, to play row 2 with probability p 2 , . . . . This strategy for R is formally denoted by the probability vector p = (pl, p,, . . . ,pk). F o r example, if the matrix game has two rows and R tosses a coin to decide which row to play, then his strategy is the probability vector p = 3). (a, Similarly, by a s t r a t e g y for C we mean a decision by C t o play the various columns with a given probability distribution, say to play column 1 with probability q,, to play column 2 with probability qs, . . . . This strategy f o r C is formally denoted by the probability vector (I = (q,,qs,. . .,q,). A strategy which contains a 1 as a component and 0 everywhere else, i.e. where R decides to play consistently a given row or C decides to play consistently a given column, is called a p u r e s t r a t e g y ; otherwise it is called a m i x e d s t r a t e g y . OPTIMUM STRATEGIES AND THE VALUE OF A GAME F o r simplicity, we shall first consider a 2 X 2 matrix game, say A = m Suppose player R adopts the strategy 11 = ( p l , p 2 ) and player C adopts the strategy q = (q,,q,). Then R plays row 1 with probability pl and C plays column 1 with probability ql, and so the entry a will occur with probability p,q,. Similarly, b will occur with probability p,qz, c with probability pLql and d with probability p 2 q 2 . Hence the expected winning (see Page 220) of R is where qt, At and p t denote the transpose of g, A and p , respectively Similarly, if A is any k x n z matrix game and player R adopts the strategy p = (p,, . . .,p k ) and C adopts the strategy Q = (ql, . . ,qJ, then the expected winnings of R is given by E ( p ,q ) = p A qt = q A t p t . Now suppose that v1 is the maximum number f o r which there is a strategy pa for R such that the expected winnings of II is at least v1 f o r any strategy adopted by C: E(po,q) 2 v1, for every strategy q of C (11 323 THEORY O F G A M E S CHAP. 251 Since V I is the maximum of such numbers. the strategy p" is in a certain sense a "best" strategy that R can adopt; we call p o an optinzunz strategy for R. We show in Problem 25.9, Page 233, that ( I ) is equivalent to the condition p"A 2? ( V l , 2'1, . . ., Vl) On the other hand, suppose v2 is the minimum number for which there is a strategy qo for player C such that the expected loss of C is not greater than l j 2 f o r any strategy adopted by R: E( p, q") 4 vL, for every strategy 11 f o r R (2) Then the strategy q0 is in a certain sense a "best" strategy that C can adopt; we call qo a n optimum s t m t e y y for C . Similarly. ( 2 ) can be shown to be equivalent to the condition qo A' 6 ( ~ r 2'2, , . . ., 71.) The fundamental theorem on game theory by von Neumann states that V I and v z exist f o r any matrix game and are equal to each other: v = V I = vz. This number u is called the value of the game, and the game is said t o be f a i r if v=O. We observe that (1) and ( 2 ) imply that v1 6 E(po,qo) 6 v z ; hence by von Xeumann's theorem we have the equality relation We summarize and formally define the terms introduced above: FDefinition:] In a matrix game A , a strategy p o f o r player R is an optimum strategy, a strategy qo f o r player C is an optii?zumstrategy, and a number v is the calzie of the game if P O , qo and 21 are related as follows: (i) p o A 1 ( v , ~ ., . .,v); (ii) @ A t L ( v , ~ !. ,. .,v) If v = 0, the game is said to be f a i r . We point out that there may be more than one pair of strategies for which (i) and (ii) hold; that is, a player may have more than one optimum strategy. We remark that the solution of a matrix game means finding optimum strategies for the players and the value of the game. Before we give the solution to an arbitrary matrix game by means of the simplex method developed in the preceding chapter, we shall discuss two special cases which occur frequently and which offer simple solutions: strictly determined games and 2 x 2 games. STRICTLY DETERMINED GAMES A matrix game is called strictly d e t e i w i i l e d if the matrix has an entry which is a minimum in its row and a maximum in its column; such a n entry is called a "saddle point". The following theorem applies. Theorem 25.1: Let 2' be a saddle point of a strictly determined game. Then an optimum strategy f o r R is always t o play the row containing v, a n optimum strategy f o r C is always to play the column containing o, and v is the value of the game. Thus in a strictly determined game a pure strategy f o r each player is an optimum strategy. Example 2.1. Consider the following game: I I 3 -4 I 0 , 2 1 - 4 1 - 3 1 -1 3 324 THEORY O F GAMES [CHAP. 25 If we circle the minimum entry in each row and put a square about the maximum e ntry in each column, we obtain A = The point 1 is both circled and “boxed” and so is a saddle point of A . Thus w = 1 is the value of the game, and optimum strategies po for R and qo for C are as follows: pa = (O,l,Oj and qo = ( O , O , 1 , O j T h a t is, R consistently plays row 2, and C consistently plays column 3. We now show t h at po, q 0 and w do satisfy the required properties to be optimum strategies and the value of the game: 3 0 -4 -3 (i) pOA = (O,l,O)(f 1 3 )!2 = (2,3,1,2j (1,1,1,1) = (w,v,v,wj -1 2 We remark that the proof of Theorem 25.1 is similar to the proof above f o r the special case. 2 x 2 MATRIX GAMES +ll Consider the matrix game A = If A is strictly determined, then the solution is indicated in the preceding section. Thus we need only consider the case in which A is non-strictly determined. We first state a useful criterion to determine whether or not a 2 X 2 matrix game is strictly determined. Lemma 25.2: The above matrix game A is non-strictly determined if and only if each of the entries on one of the diagonals is greater than each of the entries on the other diagonal: (i) a,d > b and a,d > c ; or (ii) b, c > a and b,c > d. The following theorem applies. Theorem 25.3: Suppose the above matrix game A is non-strictly determined. Then p o = (xl,xz) is an optimum strategy for player R, qo = (y,,y2) is a n optimum strategy for player C, and v is the value of the game where d-c x, = a + d - b - c ’ II‘, = d-b y1= a+d-b-c’ and 2,= zJ2 a-b a+d-b-c a-c = a+d-b-c ad - bc ~ + d - b - ~ 325 THEORY O F GAMES CHAP. 251 Note that the five quotients above have the same denominator, and that the numerator of the formula f o r v is the determinant of the matrix A . Example 3.1. Consider the following matrix game (see Example 1.2): Using the above formula, we can obtain the value v 2: of the game: - 2 . 4 - (-3).(-3) = -1 12 Thus the game is not f a i r and is in favor of the column player C. Optimum strategies p o f o r R and qo for C a r e as follows: po = (L 1 2 ' 1A) 2 and qo = 2 f 4 - 3 f 3 (&,A). RECESSIVE ROWS AND COLUMNS Let A be a matrix game. Suppose A contains a row r asuch that r l L r , for some other row r,. Recall that r l L r l means that every entry of r z is less than or equal t o the corresponding entry of T,. Then r z is called a yecessive row and r, is said to dominate it. Clearly, player R would always rather play row r, than row rl since he is guaranteed to win the same o r a greater amount in every possible play of the game. Accordingly, the recessive row r? can be omitted from the game. On the other hand, suppose A contains a column Ca such that c , h c 3 for some other column cj. Then cl is called a recessive coLumri and c, is said to dominate it. For analogous reasons player C would always rather play column c, than column el. Hence the recessive column c, can be omitted from the game. We emphasize that a recessive row contains numbers smaller than those of another row, whereas a recessive column contains numbers larger than those of another column. Example 4.1. . Consider the game A = Note that ( - 5 , - 3 , l ) 6 (2,-1,2), i.e. every entry -2 in the first row is f the corresponding entry in the second row. Thus the first row is recessive and can be omitted from the game and the game may be reduced to 1 2 1 - 1 I 2 Now observe t h a t the third column is recessive since each entry is second column. Thus the game may be reduced to the 2 X 2 game A* = Fl 2 the corresponding entry in the The solution to A* can be found by using the formulas in Theorem 25.3 and is Thus the solution to the original game A is 326 THEORY O F GAMES JCHAP. 25 SOLUTIOS OF A MATRIX GAME BY THE SIMPLEX METHOD Suppose we are given a matrix game A . We assume that A is non-strictly determined and does not contain any recessive rows or columns. We obtain the solution to A as follows. (1) Add a sufficiently large number h. to every entry of A to form, say, the following matrix game which has only positive entries: I A' 1 1 x 1 .. ult'! nil a22 . . . a2,!( ................ /all \ ah1 a12 \ * 1 ah!,,1 ak2 (The purpose of this step is to guarantee that the value of the new matrix game is positive.) (2) Form the following initial tableau: PI Pr *. P,,, ' . . 0 0 1 ... 0 a21 a.2 U2r,1 ............................. 0 0 - .. 1 a k l ak2 ' ahnl a11 a12 . . . al,,, 1 0 1 ' 1 * * * 1 ' ' -1 -1 ... -1 I 0 0 ~~ * ' . 0 0 ~ ~ This tableau corresponds to the linear programming problem: Maximize f = subject t o and cc'i - xz + - . + T , ( ~ + - . + ui,,,Tl,l 1 + .+ 1 ............................ + + + 1 + ali?-l ~~12x2 * f ar13'1 t a22xz * * ahi.1'1 nk2x2 * ' s.1'0, X S S O , Cllr,!5m ' ..., 6 ahnis)'itr X,,,'O (3) Solve the above linear programming problem by the simplex method. Let P be the optimum solution to the maximum problem, Q the optimum solution to the dual minimum problem, and 2' = f ( P )= g ( Q ) the entry in the lower right hand corner of the terminal tableau. Set 1 1 1 (i) p o = -Q (ii) qo = --+P -k (iii) 2: = v* V* ' Then p o is an optimum strategy f o r player R in game A , qo is an optimum strategy f o r player C, and c is the value of the game. We remark that the games ,4 and A have the same optimum strategies for their respective players and that their values differ by the added constant k ; that is, p o is an optimum strategy f o r player R in game A * , 4" is an optimum strategy for player C in game A*, and 2' 4h. = 1/v* is the value of the matrix game A 4 . Example 5.1. Consider the mati I X game THEORY O F GAMES CHAP. 25! 327 Add 3 to each entry of A t o form the matrix game The initial simplex tableau and successive tableaux a r e as follows: (iii) Thus P = C O , & , i ) , Q = (i,d) and z- = 4; hence l/v* = 81'3. Then for the original game A , p o = (8/3)& = (2/3, 1/3) is a n optimum strategy f o r player R, qo = ( 8 / 3 ) P = (0, 1/3, 2/3) is a n optimum strategy f o r player C, v = 8/3 - 3 = -1/3 is the value of the game. Observe t h a t the game is favorable t o the column player C. ExampIe 5.2. Consider the matrix game A shown below: A = ; -1 -1 Adding 2 t o each entry of A , we obtain the matrix game A' on the right. The initial tableau and t h e successive tableaux a r e as follows: P, P1 P, P, 4 3 1 1 0 0 1 4 1 0 1 0 1 1 0 0 0 -1 -1 -1 ~~ (iii) 1 0 P, P, 328 THEORY O F GAMES [CHAP. 2 3 Thus P = (1/22,3/22,9/22), Q = (3/22,1/22,9/22) and v* = 13/22; hence l/w* = 22/13. f o r the original game A , po = (22/13)Q = (3/13, 1/13, 9/13) is a n optimum strategy for R , qo = (22/13)P = (1/13, 3/13, 9/13) is a n optimum strategy f o r C, w = 22/13 - 2 = -4/13 Accordingly, is the value of the game. 2 x m AND m x 2 MATRIX GAMES Consider the 2 x m matrix game Suppose we apply the simplex method t o the above game. Note that the optimum solution of the corresponding maximum linear programming problem will contain at most two non-zero components; hence so will an optimum strategy f o r the column player C. In other words, the 2 x m matrix game A can be reduced to a 2 x 2 game A*. I n the next example we show a way of solving such a 2 x nz game A and, in particular, how to reduce the game to a 2 x 2 game A X . A similar method holds f o r any m X 2 matrix game. Example 6.1. Consider the following matrix game (see Example 5.1): -1 3 A Let p0 = (2,1-2) = 1 (-2 -:) denote an optimum strategy for R and let v be the value of the game. Then or 32 - 2 ( 1 - 2 ) 5 w -x+(l-x) 5 v f v O-(l-x) L 5x-2 V or v f V L Let us plot the three linear inequalities in w and x on a coordinate plane where 0 x 1, as shown in the diagram. The shaded region depotes all possible values of v; however, R chooses his optimum strategy so t h a t v is a maximum. This occurs at the point W which is the intersection of v = -2x+ 1 and u = x - 1; t h a t is, where x = Q and v = -&. Thus t h e value of the game is -$ and the optimum strategy f o r R is ($,Q). f +1 -22 2-1 f To find a n optimum strategy f o r C, we can set qo = (y, z, 1 - y - x ) and use the condition qoAt 5 (w,v, v) = -Q, -Q). However, a simpler way is as follows. Observe that t h e two lines intersecting at W a r e determined by only the second and third columns of A . Omit the other column and reduce the game t o t h e 2 X 2 niatrix game 4 2 1 (-4, A* = El -1 -2 -3 -4 By Theorem 25.3, the solution t o A* i s V' = -9, po' = (3, Q) and q0' = (I 2 ) 3' 3 CHAP. 251 329 THEO.RY O F G A M E S An optimum strategy f o r the column player C in the matrix game A is then v = v’ and po = PO‘, as expected.) q0 = (O,&,$). (Note tha t Remark: In the diagram above, the three hounding lines intersect the line E = 0 at -2, 1 and -1 respectively, and the line r = l at 3, -1 and 0, respectively. Observe tha t these numbers correspond t o the components of the r o w of the original matrix A . SUMMARY I n finding a solution to a matrix game, the reader should follow the following steps: (1)Test to see if the game is strictly determined. (2) Eliminate all recessive rows and columns. (3) I n the case of a 2 x 2 game, use the formulas in Theorem 25.3. (4) In the case of a 2 Example 6.1. X m game or an ni X 2 game, reduce the game to a 2 X 2 game as in ( 5 ) Use the simplex method in all other cases. Solved Problems MATRIX GAMES 25.1. Which of the following games are strictly determined? For the strictly determined games, find the value v of the game and find an optimum strategy po for the row player R and an optimum strategy qo for the column player C. 1 I -8 0 1 -1 -3 (i) 3 1 - 1 7 3 4 (iii) (ii) In each case, circle the minimum entries in each row and put a box around the maximum entries in each column as follows: @ 1 (i) (i) mi01 1 I -3 (iii) (ii) The -1 in row 2 and column 2 is a saddle point; hence z1 = -1, p0 = (O,l,O), q0 = (0,1,0). (ii) There is no saddle point; hence the game is non-strictly determined. Note t h a t both 3’s in the first column a r e “boxed”, since both a r e maximum entries in the column. (iii) The 2 in row 2 and column 3 is a saddle point; hence v = 2, p o = (0,1, O), QO = (0,0,1,0). 330 25.2. THEORY O F GAMES [CHAP. 25 Find the solution to each of the following 2 x 2 games. (ii) po (iv) If the game is non-strictly determined, use the following formulas of Theorem 25.3, where ( y l , x2) and 40 = (ul, ur). a-b d - b d -c XI = 0 ‘ d x2 = a + c l - b - c ’ a+d-b-c’ b -c’ Llq (ii (iii) a-e = a f d - b - ~ ’ .u= The game is non-strictly determined. F i r s t compute ad - bc a + d - - b - ~ (L td -b -c = 2 + 2 - 0 + 1 = 5. Then That is, (g, +) is an optimum strategy f o r R and [g,Q I is a n optimum strategy for C. The 2.2-0.(-1) - L value of the game is ti = 5 - 5 . Observe t h a t the game I S not Pair and is in favor of t h e TOW player R. lii) This game is strictly determined with saddle point -1. Thus w = -1; and p @ = ( 1 , O ) qo = ( 0 , l), i.e. R should always play row 1 and C should always play column 2. and (iii) The game is non-strictly determined, F i r s t compute a - d - b - c = 1- 5 43 ’1 = 10. Then .yl = 8/10 = .8, x2 = . 2 , y1 = .6, yz = .4. Tha t is, (A,.2) is a n optimum strategy for R and 1.5- (-3)*(-1) - .2. i.6, .4) is an optimum strategy f o r C. The value of the game is t = 10 The game is not f ai r and, since v is positive, the game is in favor of the row player R. + iivj The game is non-strictly determined. First compute a d - b - c = -3 - 3 - 2 - 5 = -13. Then x1 = 5/13, y 2 = 8/13, y 1 = 8/13, y, = 5/13. Thus (&,&) is a n optimum strategy f o r R (-3).(-3)-2.5 - 1 and is a n optimum strategy f o r C. The value of the game is L’ = - 13 13 * The game is not f ai r and is in favor of the row player R. (A,&) 25.3. Find a solution to the matrix game First circle each row minimum and box each column maximum: Observe t h at there is no saddle point and so the game is non-strictly determined. We now te st f o r recessive rows and columns. Note t h a t the third column is recessive since each entry is larger than the corresponding entry in the second column; hence the third column can be omitted to obtain the game -5 CHAP. 251 331 THEORY OF G A M E S Note t h a t the second row is recessive since each entry is smaller than the corresponding entry in the first row. Thus the second row can be omitted to obtain the game I 1 1 - 3 1 Using the formulas in Theorem 25.3, the solution to the above 2 v' = -13/11, = (2 11' 5 1 1 ) (Io' = ,o' t 2 game is - f i (11' 2 -) 11 Thus a solution to the original game is v = -13/11, 25.4. pa = Find a solution t o the matrix game - (fi.O,,), 4 (I" -1 A = 1 Set p o = (:r, 1 - x ) . Then p o A (-;-;-;-;) (x,1-2:) or -2% +1 2 = (5,h. 0 ) 5 -3 1 -2 -2 is equivalent to 5 iil, I ' , c , i ' j (V,V,C,") v 8 ~ - 32 3x-2 % v -7x+5 5 v Plot t h e above four linear inequalities a s shown in the diagram. Note t h a t t h e maximum L' satisfying the inequalities occurs a t t h e intersection of the lines -2x+ 1 = 2; and 3 ~ - 2 = v, t h a t is, a t the point x= and L' = -L. Thus v = -8 is the value of the game, and p o = (2, 8) is a n optimum strategy f o r player R. Since the two lines determining L* come from the first and third columns of t h e matrix game A , omit the other columns t o reduce A t o the 2 X 2 matrix game A :1 - F l . 1 1 A solution of A " , by Theorem -2 (g, 0 , b, 0 ) Thus q 0 = in game A . 25.5. I is a n optimum strategy f o r player C Note t h a t v = v' and po 1P O ' , as expected.) Find a solution t o the matrix game or A = 47-1 -3.r -1 23. El. 4 v f 21 f v 5 332 THEORY O F GAMES [CHAP. 25 Plot the above three inequalities (where O‘x‘l) as shown in the diagram. W e seek the miximum v satisfying the given inequalities; note t h at i t occurs a t the intersection of the lines -3% 1 = v and 2x = v, i.e. at the and v = %. Thus v = is the value of point r = 1 5 the game and qo = (&,Q) i s an optimum strategy f o r player C. Since the two lines determining v come from the second and third rows of A , omit the other row to obtain + the 2 X + Fl. 2 matrix game A:: = A solution of A’?,by Theorem 23.5, is 3’ = 2 , 5 = (2 5 ) qo’ 5’5 ’ = (I, +) 5 mi. Thus po = (0,$, 8) is a n optimum strategy for player R . (Note t h at u = v’ and qo = q o ’ , as expected.) 25.6. Find a solution to the matrix game -1 The reader should first verify t h a t the game is non-strictly determined and tha t there a re no recessive rows or columns. We find a solution by the simplex method. F i r s t add 4 to each entry t o obtain the game 3 4 The initial and subsequent tableaux follow: Pl p2 Pl p3 pz p3 4 (i) (ii) (iii) (Renzayk: Since the l as t row in (iii) has no negative indicators, the tableau is a terminal one and so all of its entries need not be computed.) Thus P = (L li’ 5 17’ O ) , Q = (0’ 1 7 ’ ”) 17 and ~ 4 : hence 1/~:,: !?, Accordingly, po = ( $ ) Q = 9 3 ( 0 ,i ,~ 40 = ( ) y ) P, = (i 5 0 ) and 2: = 17 - 4 = -5 5’5’ 5 5’ =&; 25.7. Two players R and C simultaneously show 2 or 3 fingers. If the sum of the fingers shown is even, then R wins the sum from C; if the sum is odd, then R loses the sum to C. Find optimum strategies f o r the players and to whom the game is favorable. THEORY OF GAMES CHAP. 251 The matrix of the game is 2 333 3 The game is non-strictly determined; hence the formulas in Theorem 25.3, Page 234, apply. First compute a d - b - c = 4 6 5 5 = 20. Then + + + + (g, &), is an optimum strategy f o r R and (&,%) is a n optimum strategy f o r C. The value T h a t is, = 4 . 6 - - ( - 5 )20' ( - 5 ) - -L. of the game is 2 0 , and, since i t is negative, the game is favorable t o C . 25.8. Two players R and C match pennies. If the coins match, then R wins; if the coins do not match then C wins. Determine optimum strategies for the players and the value of the game. The matrix of the game is H T The game is non-strictly determined. Using the formulas in Theorem 25.3, Page 234, we obtain Thus (g, &) is an optimum strategy for both R and C. The value of x1 = x, = y1 = y2 = = 1.1 - ( - l ) . ( - i ) = 0: hence the game is fair. the game is 4 4. THEOREMS 25.9. Let p o be a given strategy for player R in a matrix game A . Show that the following conditions are equivalent: v, for every strategy q for C; (ii) p o A 2 (v,v, . . . , v). (i) p o A qt Assume that iii) holds. Then, for any strategy q f o r C, pOA qt 2 (v,v, . . ., v) qt = v On the other hand, assume t h a t (i) holds and t h a t pOA = ( a l , a 2 , . . ., a k ) . strategy q = (1,0, . . .,O), we have Choosing the pure pOA qt = (al,aL, . . ., ak)qt = a , s v Similarly, a2 2 w, . . ., ak 5 v. I n other words, @ A 2 (v, v, . . ., w). 25.10. Let p = ( a l , a r ,. . ., ak) and p* = ( b l ,bz, . . ., bk) be probability vectors. Show that each point on the line segment joining p and p * , i.e. t p -t (1- t)pa where 0 g t 4 1, is also a probability vector. Observe that tp + (1- t ) b z , . . ., t a L + (1- t ) b k ) Since t , 1 - f, the a , and the b, are all non-negative, ta, + (1- t ) b , is also non-negative every i. Furthermore, f a , + (1- t ) b , + ta2 + (1- t ) b , + + tak + (1- t)bh . . + (1- t ) b k = ta, + ta, + . + tah + (1 t ) b , + (1 t ) b z + (1- t)p" = (tal + (1 - t ) b l , ta2 * ' ' *. - - * = t ( a , + a 2 + ~ ~ ~ + a ~ ) + ( 1 - t ) ( b l + b , + ~ ~=~ + t +b (~ l)- t ) = 1 for 334 T H E O R Y O F GAMES 'CHAP. 2 6 25.11. Let p o and p be optimum strategies for player R in a matrix game A with value c . Show that p' = t p o + (1- t ) p , where 0 t 6 1, is also an optimum strategy for R. p A By the pieceding problem, p is a probability vector, hence we need only show t h a t (L,I , ,v). We a r e given t h a t pOA Thus (L, I!, .v) p A and p'A == (tpo - i l - t ) p ) A tpOA 2 4 + (1 t ) A~ - t ( v , v ,. . , l = (v,w,. . ,v) ( L , L, (1 - t ) ( C , w, . . ., w ) ) ,L) 25.12. Show that if a and b are saddle points of a matrix A , then a = b. If n and b a r e in the sanie row, then each is a minimum of the row and so a = ti. Similarly if u and b a r e in the same column, then n and b a r e maxima of the column and so a = b. Now assume t h a t a and b appear in different columns and different rows as illustrated below: *: :,: ::i ::: \:,: \* * :L * ::. .,. .'. Q r;;l ::: f :,: pJ :.: * * Q k.; Q * * @ : : p J :.k * *\ :I :,: :.: .:: :: :i i' *I i:: Since a is a saddle point, a G d and c f a ; and since b is a saddle point, b s c and d E b. Thus a ' d s b ' c 5 a Accordingly, the equality relation holds above: 25.13. Show that the game A = a = d = b = c. is strictly determined. Observe t h a t each a is a minimum of the first row. If a and if u 1 c then the second a is a saddle point. b then the first a is a saddle point, Thus n e are left with the case t h a t b > a and c > a, i.e. where b and c a r e maxima of their respective columns. Now if b f then b is a saddle point, and if c b then c is a saddle point. (3 Accordingly, in all cases A has a saddle point and so A is strictly determined. 25.14. Prove Lemma 25.2: The matrix game A = is non-strictly determined if and only if each of the entries on one of the diagonals is greater than each of the entries on the other diagonal: (i) a,d > b and a,d > c; or (ii) b,c > a and b,c > d. If ( i ) or l i i ) holds, then there is no saddle point for A and so A is non-strictly determined. Now suppose A is non-strictly determined. By t h e preceding problem, a # b . We consider two cases. Case (1): a < b. If a c then a is a saddle point; hence IL < c. We need only show t h a t and d < c. Suppose d 2 b: then d > c o r else d is a saddle point. But a < c and d > c implies c is a saddle point. Accordingly, d % b leads to a contradiction, and so d < b. Similarly, d < c. d <b Case ( 2 ) : n > b. The proof is similar t o t h e proof in Case (1). CHAP. 251 THEORY O F GAMES 335 +l+l 25.15. Prove Theorem 25.3: Suppose the matrix game A = is not strictly determined. Then a-c a-b a-b a-c ?Jo = ( a + d - b - c ' a + d - b - c ad - bc u = a+d-b-c are respectively an optimum strategy f o r R, a n optimum strategy for C and the value of the game. Set p0 = (x,1-x) and q0 = (y,1-y), +c ( b - d ) ~+ d or (a-c)z 2 5 Then pOA v v and 2 (w,w) and qoAf 6 (w,w) is equivalent to (a- b)y +b (c-d)y 4- d f t? f v Solving the above inequalities as equations f o r the unknowns 2 , y and v, we obtain the required result. We remark t h a t a b - c - d # 0 by the preceding problem, i.e. by Lemma 25.2. We also note t h a t in the case of a 2 X 2 non-strictly determined matrix game A , we have the following stronger result: poA = (w,v) and qoAt = ( t i , w), + 25.16. Let A be a matrix game with optimum strategy p o f o r R, optimum strategy qo f o r C, and value v ; and let k be a positive number. Show that f o r the game kA, p o is an optimum strategy for R, qo is an optimum strategy for C, and kv is the value of the game. We a r e given t h at pOA (w,w, . ., c) and qOA (w,W , . . ., c). Then pO(kA) = k(p0A) 2 k(w,v, . . ., w) = (kv,kv, . . . , kv) qO(kA)t = qOkAt = k(q0At) f k ( ~ W,, . . ., W ) = (lit],kv, . . ., kw) , f which was to be shown. Supplementary Problems MATRIX GAMES 25.17. Find a solution t o each game, i.e. an optimum strategy po for R , an optimum strategy qo f o r C, and t h e value of the game. (9 25.18. Find a solution to each game. (4 25.19. -4 - -3 -4 Find a solution t o each game. (ii) (iii) -1 4 -2 336 25.20. THEORY O F GAMES [CHAP. 25 Find a solution to each game. (i) -1 . Find a solution to the game if iii a < 1, (ii) 1 < a < 3, (iii) a > 3. 25.21. Consider the game 25.22. Each of two players R and C h a s a dime and a quarter. They each show a coin simultaneously. If the coins ar e the same, R wins C's coin; if the coins a r e different, C wins R's coin. Represent the game as a matrix game and find a solution. 25.23. Each of two players R and C has a penny, nickel and dime. They each show a coin simultaneously. If the total amount of money shown is even, R wins C's coin; if i t is odd, C wins R's coin. Represent the game as a matrix and find a solution. THEOREMS 25.24. Suppose every en t r y in a matrix game A is increased by a n amount k. Show tha t the value of the game also increases by k , but t h a t the optimum strategies remain the same. 25.25. Show t h at if every entry in a matrix game is positive, then the value of the game is positive. Answers to Supplementary Problems (i) p o = (0, l), q0 = (0,l), w = 1; (ii) p o = (l,O), q0 = (0,I), v = 2; = ( l , O ) , v = 1. (iii) 25.18. (i) po = (,7,.3), qo = (.4,.6), w = .2; (ii) p o = (3/4,1/4), (iiij p o = (7/8,1/8), qo = (5/8,3/8), v = 3/8. = (5/12,7/12), 25.19. (i) p o = (5/8,3/8), qo = (3/8,0,5/8), w = 1/8; (ii) PO = (5/7,2/7), q0 = (5/7,2/7,0), v = 3/7; (iii) P O = (2/7,0,5/7), q 0 = (4/7,3/7), w = 13/7. 25.21. (i) 25.17. p0 can be any strategy, yo 25.22. p o = (l,O), q 0 = (O,l), w = 1; (ii) p o = (0,1), q0 = (0,1), w = a ; =I; p o = (5/7, 2 / 7 ) , q' = 5 1 25.23. qo = 0. 10 5 10 (i,g), w ; po = ( l O / l l , 0, l / i l ) , yo = (1/2, 0, 1/2), -10 j -10 I 10 w = 0. v = 1/4; INDEX De Morgan’s laws, 11 Degenerate equations, 107 Dependent vectors, 114 Detachment, law of, 26 Determinants, 127 Disjoint sets, 37 Disjunction, 2 exclusive, 3, 1 5 Distance, 266 Distribution, 218 binomial, 217 Domain of a function, 66 Dot product, 84 Dual problem, 301 Duality theorem, 303 Absolute value, 259 Absorbing states, 240 Algebra of propositions, 11 Algebra of sets, 38 Algorithm of the simplex method, 308 Angle of inclination, 266 Argument, 26 Bayes’ theorem, 207 Biconditional statement, 19 Binomial coefficients, 141 theorem, 142 Binomial distribution, 217 Birthday problem, 188 Bounded sets, 269 Bounding hyperplane, 282 Echelon form, 109 Element, 35 Elementary event, 184 Empty set, 37 Equations, linear, 104 system of, 107 Equiprobable space, 186 Equivalence class, 60 Equivalence relation, 59 Equivalent, logically, 10 Euclidean space, 268 Event, 184 Exclusive disjunction, 3, 1 5 Expected value, 220 of a game, 221 Extreme point, 282 C player, 321 Cartesian product, 50 plane, 266 Cauchy-Schwarz inequality, 278 Cell, 60 Class, 37 equivalence, 60 Closed half space, 282 Closed interval, 258 Co-domain of a function, 66 Coefficients, binomial, 141 of equations, 106 Collection, 37 Column vector, 81 Combinations, 161 Complement of a set, 37 Components of vectors, 81, 83 Composition function, 68 Compound statements, 1 Conclusion of an argument, 26 Conditional probability, 199 Conditional statement, 18 Conjunction, 1 Consistent equations, 109 Constant, 106 Contradiction, 10 Contrapositive, 19 Converse statement, 19 Convex sets, 281 polygonal, 283 polyhedral, 282 Corner point, 282 Counting, fundamental principle of, 152 Cross-partitions, 162 Factorial, 141 F a i r game, 221, 323 Fallacy, 26 False statement, 1 Family, 37 of lines, 268 Feasible solution, 300 Finite set, 36 Fixed point, 234 Frequency, relative, 184 Function, 66 composition of, 68 identity, 70 inverse, 70 linear, 67, 281 objective, 300 polynomial, 68 337 INDEX Games, gambling, 221 f a i r , 221 Games, matrix, 321 f a i r , 323 strictly determined, 323 value of, 323 Graph of a function, 67 Half space, 282 plane, 282 Homogeneous equations, 111 Hyperplane, 269 Identity function, 70 Image, 66 Implication, 28 Inclination of a line, 266 Inconsistent equations, 109 inequalities, 285 Independent events, 201 trials, 216 Independent vectors, 114 Inequalities, 258 linear, 259 Infinite set, 36 Initial simplex tableau, 304 Inner product, 84 Intersection of sets, 37 Intervals, 258 Inverse, function, 70 matrix, 132 relation, 59 statement, 19 Invertible matrix, 131 Joint denial, 16 Line, in the plane, 267 real, 257 segment, 281 Linear, equation, 1 0 4 function, 67, 281 inequality, 259 programming, 300 space, 83 Logical implication, 28 Logically equivalent, 10 Mapping, 66 Markov chain, 236 Matrix, 90, 112 addition of, 90 game, 321 multiplication of, 92 non-singular, 133 regular, 234 scalar multiplication of, 91 singular, 133 square, 93 stochastic, 233 Matrix (cont.) transition, 236 transpose, 94 Member, 35 Minkowski’s inequality, 278 Mixed strategy, 322 Multinoniial coefficients, 144 Mutually exclusive events, 185 Negation, 3 Negative numbers, 257 New simplex tableau, 306 Non-singular matrix, 133 Norm of a vector, 269 Null set, 36 Numbers, real, 256 Objective function, 300 Odds, 188 One-to-one function, 69 Onto function, 69 Open half space, 282 Optimum solution, 300 Optimum strategy, 322 Order, 257 Ordered pair, 50 Ordered partition, 163 Origin, 257 Pair, ordered, 50 Parallel lines, 266 Parameter, 268 Partitions, 60, 162 ordered, 163 Pascal’s triangle, 143 Permutation, 152 Perpendicular lines, 266 Pivot, 305 Pivotal column, 305 row, 305 Plane, Cartesian, 266 Plane, half, 282 Play of a game, 321 Players, R and C, 321 Point, 266 corner, 282 extreme, 282 fixed, 234 saddle, 323 Polygonal convex set, 283 Polyhedral convex set, 282 Polynomial, 68 Premises, 26 Primal problem, 301 Probability, 184 conditional, 199 distribution, 238 vector, 233 Product set, 50 Proper subset, 36 Proposition, 3 P u r e strategy, 322 Quotient set, 60