Jennifer Bachand Math 482H Game Theory Game theory is defined as the logical analysis of situations of conflict and cooperation. A key question in game theory is to reason about the behavior we should expect to see when players take part in a given game. First, we will discuss what a mathematical game exactly is. A mathematical game has the following properties: • At least two players (by “players” we could mean anything, such as an individual, company, or entity) • Each player has a number of possible strategies • The strategies chosen determine the outcome • Each possible outcome has its associated “payoff”, representing the value of the outcome to that player Some examples of mathematical games are chess, bridge, and poker. Game theory generalizes the study of these and applies it to the real world. Note: Game theory models outcomes based on assumed rational play. This limits its application as real-world games are enormously complex and humans do not usually behave rationally. An example of a model of a game and its payoff table: Rose A B C A Colin (-2,2) (0,0) (-5,5) B (-3,3) (-2,2) Table 1 (10,-10) where Rose and Colin are players with strategies A, B, and C. This is an example of a zero-sum game, meaning that the payoffs for each outcome add to 0. Zero-sum games represent situations of pure conflict, where one player’s gain is directly the other player’s loss. We can simplify the situation by looking at the game through Rose’s perspective. We thus consider only her payoffs (Table 2) and then the motion diagram (Table 3), which is where we draw an arrow to the smallest entry in each row and the largest in each column. A A B 0 2 2 B C -3 -5 A C Table 2. This is Table 1 but from Rose’s perspective (Payoff Table) 10 A B 0 2 2 B Jennifer Bachand Math 482H -3 -5 Table 3, a motion diagram for Table 2. 10 From these tables, we see that we should play strategy C, with the hope that Colin will play B. However, if Colin plays A, the consequences will be the worst possible. But if Colin was planning to play strategy B, then Rose should play strategy A; this continues. We say that Strategy S dominates strategy T if every outcome in S is at least as good as the corresponding outcome in T, and at least one outcome in S is strictly better than an outcome in T. i.e., S ≥T for all entries. Example: Rose\Colin A B C D Rose/Colin A C 2 4 3 C A 12 D -16 B 5 3 -1 1 0 1 7 0 0 -20 16 Table 4. An example payoff table for Rose A B B C D D Table 5. Motion diagram for Table 4. Here, we see that Colin B dominates Colin C because Rose gains less from Colin B, no matter what she plays. Jennifer Bachand Math 482H Dominance Principle: A rational player should never play a dominated strategy. We may note that Rose C and Colin B are the most cautious strategies in this game, because by playing them, Rose knows she will gain at least 2 and Colin knows that he will lose no more than 2. Rose C/Colin B is called an equilibrium outcome, because they could not have forced a better outcome. Is there a way to maximize the occurrence of an equilibrium outcome? This brings us to the concept of equilibrium strategy. Consider an mxn game: A\B A1 A2 … Am βj Let B1 a11 a21 … am1 β1 B2 a12 a22 … am2 β2 … … … … … … Bn a1n a2n … amn βn αi α1 α2 … αm (αi) = min(over j) aij= minimum of the numbers in the ith row. This is the "most we can win"; our opponent will respond with strategy Bj for minimum yield, αij When choosing strategy A0 we must count on the fact that by a suitable action, our opponent can keep us from winning more than αi. So we stop at strategy Ai, where αi is a maximum. α = max(over i) αi Thus, α = max(over i) min(over j) aij. This is called the lower value of the game, the maximum of the minimum yield, or the maximin. Its corresponding strategy is called the maximin strategy; by playing the maximin strategy, we are assured a yield of greater than or equal to α in any case. Similarly, our opponent is trying to reduce our yield to the minimum, and examines each of our strategies from the point of view of maximum yield for that strategy. We set βi= max(over i) of aij. We find the minimum of βj: β= min(over j) βj. Jennifer Bachand Math 482H So β=min (over j) max(over i) aij. β is called the upper value of the game, or minimax. Its corresponding strategy is called the minimax strategy. By playing the minimax strategy, our opponent ensures that he or she will not lose more than the amount β, regardless of our adopted strategy. This is the Priniciple of Minimax. We further explain with the following example: A/B A1 B1 2 A2 -3 βj A3 B2 B3 -3 αi 4 -3 6 -5 4 4 -5 -5 4 4 6 Table 7 -5 The maximin strategy here is A1 (we lose no more than 3) and the minimax strategy here is B1 or B2 (opponent loses no more than 4). Sometimes, α=β. If α=β we call this value v, or the common value of the game. This represents a saddle point, and the entry at this outcome is ≤ any in its row and ≥ any in its column. Saddle Point Principle: If a matrix has a saddle point, both players should play a strategy which contains it. That is, if one party adheres to their optimal strategy and the other deviates, the deviating party can only decrease their yield. Saddle points are equivalent and interchangeable; any two saddle points in a matrix game have the same value. If Rose and Colin both play strategies containing a saddle point outcome, result will always be a saddle point. We confirm this with the following proof: Suppose a and b are saddle point entries in a matrix game, and c and d are other entries at the corners of a rectangle containing a and b. a … … … c d … … b Jennifer Bachand Math 482H Since a is smallest in its row and b is largest in its column, a≤c≤b. Since b is smallest in its row and a is largest in its column, b≤d≤a. So a≤c≤b≤d≤a, so a=c=b=d. So c and d are also the largest in their columns and smallest in their rows, and thus are saddle points. We will now go on to discuss mixed strategies. In most matrix games, the row maximin is not equal to the column minimax, which means there is no saddle point. Thus, the optimal strategy for each player, rather than one pure strategy, is a mixed strategy: this is where each party plays a mixture of strategies according to a certain fixed probability. We analyze this effect using expected value. The expected value of payoffs a1, a2,..ak with respective probabilities p1, p2, …,pk is E(x) = p1a1+p2a2+….+pkak, or the weighted average of those payoffs. If a party knows its opponent is playing a given mixed strategy and will continue to play it regardless of this party's play, the expected value principle states that the party should play its strategy with the largest expected value. Here is an example: Colin Rose A B βj A B 2 -3 2 3 0 3 αi -3 Table 8 0 In this example, the maximin is 0, which does not equal the minimax, which is 2. If Colin flips a coin to play CA 50% of the time and CB 50% of the time, then our expected payoffs are: RoseA = .5(2)+ .5(-3)= -.5 RoseB = .5(0) + .5(3)= 1.5 Thus, Rose should play strategy B if she knows Colin's mixed strategy. However, Colin understands that if he uses a ½ CA, ½ CB mixed strategy, Rosie can take advantage of this. So instead, he will use CA with a probability of x and CB with a probability of (1-x), 0<x<1. Our new expected values become: Jennifer Bachand Math 482H RoseA = x(2) + (1-x)(-3) = -3 + 5x RoseB = x(0) + (1-x)(3) = 3-3x If these are the same, Rose can't take advantage of Colin's strategy. So Colin will equalize them: -3 + 5x = 3 - 3x x=.75 So Colin plays (.75)CA, (.25) CB, and Rose wins on average ≤ ¾ . Remember that Colin must do this randomly. He might achieve this strategy any number of ways, but one way would be flipping two coins and playing B if he gets two heads. Now we can analyze this from Rose's point of view. We use the same calculations, but with probability y: ColinA: y(2) + (1-y)(0) = 2y ColinB: y(-3) + (1-y) (3) = 3- 6y 2y= 3-6y y= 3/8 So if Rose plays (3/8) RA, (5/8)RB, she is assured ≥ ¾ per game This is similar to a saddle point. Rose has a mixed strategy, which ensures she will have an expected payoff of ≥ ¾ . Colin has a mixed strategy which ensures Rose's expected payoff ≤ ¾ . So ¾ is the expected value of the game, and ¾ CA, ¼ CB is Colin's optimal strategy, while (3/8) RA, (5/8) RB is Rose's optimal strategy. This is the solution of the game. Application: Evolutionarily Stable Strategies The basic ideas of game theory can also be applied to situations in which the players are not making conscious choices or decisions; that is, game theory can model different forms of behavior. The analysis of this can give insight into which forms of behavior are “successful,” or have a tendency to persist, and which are not and have a tendency to be driven out by others. This approach is very well suited to evolutionary biology. Evolutionary biology emphasizes the way that an organism’s genes are responsible for its characteristics; an organism’s characteristics determine its fitness and ability to produce offspring. An organism that is more fit in a given environment is more likely to produce a larger amount of offspring, which increases their representation in the population; these genes are carried on in the offspring, so the organism’s offspring also has a greater rate of reproduction. Evolutionary game theory comes into play when we understand that behaviors of organisms involve interaction with other organisms, and an organism’s success depends on this interaction. In order to analyze the fitness of an organism, it must be analyzed in the context of its population. Thus, we can apply game theory in this way: an organism’s characteristics and behaviors, determined by its genes, are like its strategy in a game, and its fitness is its payoff; thus, the payoff depends on which strategies (or characteristics) the organism uses (or possesses) in its Jennifer Bachand Math 482H interactions with other organisms. Finding equilibrium is a helpful way to predict the results of evolution on a population. For example, we could consider a species of beetle. Suppose that the ability of each beetle to find food and use this food effectively is determined ultimately by its body size. Suppose further that a mutation is introduced: beetles with this mutation develop a significantly larger body size. Our population then separates into two separate kinds of beetles: small beetles and large beetles. Large beetles have higher metabolic requirements, and thus require more food to maintain their energy; this decreases their fitness. We may think that because of this mutation, large beetles are likely to starve and will therefore die out after several generations. However, we must consider also the interaction between organisms: the beetles compete for food, and larger beetles are more effective at claiming the largest shares of the food. For simplicity, we will consider only the interactions between two beetles at any point in time. However, note that the basic principles we will discuss can be extended to simultaneous interactions among many beetles. During a competition between two beetles for food, we have the following outcomes: • Beetles of the same size compete. Here, they get equal shares of the food. • Beetles of different sizes compete. Here, the large beetle gets the majority of the food. This brings us back to the topic of evolutionarily stable strategies. We can say that large beetles have less of a fitness benefit from a certain amount of food due to their metabolism. We can model the beetles’ fitness from any amount of food as payoffs in a two-player game between two beetles. Each beetle plays one of two strategies, Small or Large, depending on its body size. Table 9 displays this payoff table: Beetle 2 Beetle 1 Small Large Small Large 8,1 3,3 5,5 Table 9: The Body Size Game 1,8 These payoffs model the principles discussed: two small beetles will receive the same fitness from the food source, large beetles receive more fitness at the expense of the small beetles, and large beetles do not fully receive the fitness from the food, since they have a higher metabolism and also must spend energy fighting the other large beetle for an equal share. This game differs from the ones previously discussed primarily because the strategy played is not a choice. Each beetle, based on its genetics, is required to play one strategy its entire life. Thus, the strategy changes only come from long-term shifts in populations and the effects of evolutionary forces. This kind of game theory is known as evolutionary game theory: players’ strategies are genetically Jennifer Bachand Math 482H determined and will persist once they are sufficiently introduced into the population. We suppose that each beetle will, throughout its lifetime, repeatedly compete with other beetles for food. We also suppose that the population is big enough to prevent the same two beetles from interacting solely with one another repeatedly over their lifetimes. Carrying the game analogy further, we can say that a beetle’s overall fitness is the average fitness it gains from each of its pairwise competitions with others, and its overall fitness determines its reproductive ability and success. This success is manifested by the amount of offspring carrying its genes into the next generation. A strategy is evolutionarily stable if its widespread use over the population means that any small group of intruders adopting a different strategy will not be successful, and will die off over several generations. These intruders could be mutants born with new behaviors, or migrants who join the population. Mathematically, we can say that whenever the entire population is playing strategy S, a small group of intruders playing strategy T will have lower fitness than those playing strategy S (or, strategy S dominates strategy T). Since fitness determines reproductive success, it only makes sense that players using strategy T, as in a subpopulation, will less successful reproductively and the amount of players of this strategy will shrink over time and probably die out over multiple generations. Thus, we have carried the game analogy this far: • An organism in a population has fitness, which is the expected payoff it receives from interactions with random members of the population. • Strategy T invades strategy S at level x (x small and positive) if an x percentage of the population plays T and (1-x) percentage plays S. • Strategy S is evolutionarily stable if there exists a number y (y small, positive) such that when any strategy T invades strategy S at a level x<y, strategy S dominates strategy T; that is, an organism playing S has a strictly greater fitness than an organism playing T. We can apply these definitions to our beetle example. We will analyze whether the Small strategy is evolutionarily stable, then we will analyze the Large strategy. Suppose that for some small positive number x, a (1-x) fraction of the population plays Small, and an x fraction plays Large; this models the situation where a minor intruding population of large beetles has just arrived into a population of small beetles. The expected payoff to a small beetle in a random interaction, or game, in this population, is determined as usual. Its payoff is 5 with the probability (1-x) and 1 with a probability x (from meeting a small beetle and a large beetle, respectively). Thus, E(x) = 5(1−x)+1·x = 5−4x. The expected payoff to a large beetle in a random interaction in this population is as follows: the large beetle receives a payoff of 8 with a probability of (1-x) and a payoff of 3 with probability x (from meeting a small beetle and a large beetle, respectively). Thus, Jennifer Bachand Math 482H E(x)= 8(1−x)+3·x = 8−5x. For small values of x, or even relatively larger ones, E(x) for the fitness of the large beetles is larger than the E(x) for the fitness of small beetles. This tells us the the Small strategy is not evolutionarily stable. What about the Large strategy? We can now suppose that for some small, positive number x, a (1-x) fraction of the population plays the Large strategy and an x fraction of the population plays Small. This models the situation where a minor intruding population of small beetles has just arrived into a population of large beetles. The expected payoff to a large beetle in a random interaction in this population is as follows: the large beetle receives a payoff of 3 with probability (1-x) and a payoff of 8 with probability x (from meeting a large beetle and a small beetle, respectively). So E(x)= 3(1−x)+8·x = 3+5x. The expected payoff to a small beetle in a random interaction in this population is as follows: the small beetle receives a payoff of 1 with probability (1-x) and a payoff of 5 with probability x (from meeting a large beetle and a small beetle, respectively). So E(x) = (1−x)+5·x = 1+4x. We can see that E(x) for large beetles is greater than E(x) for small beetles, so the Large strategy is evolutionarily stable. These results tell us that if a small population of large beetles is introduced into a population of small beetles, the large beetles are very successful; since they mostly meet small beetles, the get most of the food in almost every interaction they have. Since the small beetles cannot compete with the large beetles, Small is not an evolutionarily stable strategy. Conversely, if a small population of small beetles is introduced into a population of large beetles, the small beetles will still lose in almost every competition for food. The population of large beetles is able to resist the intrusion of small beetles; Large is an evolutionarily stable strategy. Therefore, if a mutation to create larger body sizes is possible, we should expect that only populations of large beetles will exist in nature, as they will be evolutionarily stable; large beetles benefit at the expense of small ones. There is another way to interpret our results. Notice that small beetles receive more fitness from the food that they ingest than the large beetles. We can say that if we started from a population of small beetles, natural selection caused these beetles’ fitness to decrease over time as larger beetles were able to do better in interactions with them (a payoff of 5 turned into a payoff of 1, decreasing the small beetles’ fitness). This fits with the main principle of natural selection: natural selection increases the fitness if individual organisms in a fixed environment, but if the environment becomes more hostile to the organisms, as in the case where a population of large beetles invades, the organism’s fitness will intuitively go down. This is the case with the beetles; the increasing amount of large beetles can be seen as a shift to an environment which is more hostile for everyone. Only the fittest will survive. Jennifer Bachand Math 482H References: 1. Straffin, P. D. Game Theory and Strategy. The Mathematical Association of America, 1993. Print. 2. Ventzel, Elena S. Lectures on Game Theory: International Monographs on Advanced Mathematics and Physics. Delhi: Hindustan Publishing Corporation, 1961. Print. 3. Easley, David, and Jon Kleinberg. Networks, Crowds, And Markets, Reasoning About A Highly Connected World. Cambridge University Press, 2011. Print. 4. Vajda, Steven. Mathematical Games And How To Play Them. Chichester: Ellis Horwood Limited, 1992. Print.