CHT. 5 DATABASE MANAGEMENT

advertisement

Game Theory

Game Theory

 Two (or more) decision makers with conflicting interests are under competition.

 Two decision makers are assumed to be equally smart.

Zero-Sum vs. Non-zero-Sum

 In a zero-sum game, a player’s gain is equal to another player’s loss.

 In a non-zero-sum game, a player’s gain is not necessarily equal to another player’s loss.

Two-Person Zero-Sum Game

Two decision makers’ benefits are completely opposite i.e., one person’s gain is another person’s loss

 Payoff/penalty table (zero-sum table):

– shows “offensive” strategies (in rows) versus

“defensive” strategies (in columns);

– gives the gain of row player (loss of column player), of each possible strategy encounter.

Example 1 (payoff/penalty table)

Athlete

Strategies

(row strat.) A

Manager’s Strategies

(Column Strategies)

B C

1

2

$50,000 $35,000 $30,000

$60,000 $40,000 $20,000

Two-Person Constant-Sum

Game

 For any strategy encounter, the row player’s payoff and the column player’s payoff add up to a constant C.

 It can be converted to a two-person zerosum game by subtracting half of the constant (i.e. 0.5C) from each payoff.

Example 2 (2-person, constantsum)

During the 8-9pm time slot, two broadcasting networks are vying for an audience of 100 million viewers, who would watch either of the two networks.

Payoffs of NW1 for the constant-sum of 100(million)

Network 1

(NW1)

Network 2 (NW2) western Soap Comedy western soap comedy

35

45

38

15

58

14

60

50

70

An equivalent zero-sum table

Network 1

Network 2 western Soap Comedy western soap comedy

-15

- 5

-12

-35

8

-36

10

0

20

Equilibrium Point

 In a two-person zero-sum game, if there is a payoff value P such that

P = max{row minimums}

= min{column maximums} then P is called the equilibrium point, or saddle point, of the game.

Example 3 (equilibrium point)

Athlete

Strategies

(row strat.) A

Manager’s Strategies

(Column Strategies)

B C

1

2

$50,000 $35,000 $30,000

$60,000 $40,000 $20,000

Game with an Equilibrium

Point: Pure Strategy

 The equilibrium point is the only rational outcome of this game; and its corresponding strategies for the two sides are their best choices, called pure strategy.

 The value at the equilibrium point is called the value of the game.

 At the equilibrium point, neither side can benefit from a unilateral change in strategy.

Pure Strategy of Example 3

Athlete

Strategies

(row strat.) A

Manager’s Strategies

(Column Strategies)

B C

1

2

$50,000 $35,000 $30,000

$60,000 $40,000 $20,000

Example 4 (2-person, 0-sum)

Row

Players Column Player Strategies

Strategies 1

1 4

2 2

2

4

3

3

10

1

3 6 5 7

Mixed Strategy

 If a game does not have an equilibrium, the best strategy would be a mixed strategy.

Game without an Equilibrium

Point

 A player may benefit from unilateral change for any pure strategy.

Therefore, the game would get into a loop.

 To break loop, a mixed strategy is applied.

Example:

Company I Company II Strategies

Strategies B C

2

3

8

1

4

7

Mixed Strategy

 A mixed strategy for a player is a set of probabilities each for an alternative of the player.

With mixed strategy, row player’s

expected value of payoffs and column player’s expected value of losses are same.

Expected Value

 The outcome of an action A is uncertain.

X

1

, X

2

, …, X n are possible outcomes with probabilities p

1

, p

2

, …, p n respectively.

Expected value of A’s outcomes is:

E(A) = X

1

*p

1

+X

2

*p

2

+…+X n

*p n

 E(A) is interpreted as the average outcome of action A.

Daily Demand

 Possible daily demands of the low-fat milk in a store are:

– 20 cases with probability 0.05,

– 21 cases with probability 0.25,

– 22 cases with probability 0.35,

– 23 cases with probability 0.2,

– 24 cases with probability 0.1,

– 25 cases with probability 0.05.

 What is the expected value of daily demand of the milk?

A Slot Machine

 Possible numbers of tokens coming out of a slot machine for a token put in are as below:

Number of tokens out

0

4

5

1

2

10

100

1000

Probability

0.5213

0.33

0.1

0.03

0.01

0.008

0.0006

0.0001

 What is the expected value of tokens out for one token in?

Tossing Coins

Calculate the expected value of “guessing Head”.

Calculate the expected value of “guessing Tail”.

Would you “guess Head” or “Guess Tail”?

Guess ‘Head’

Guess ‘Tail’

Land on ‘Head’ Land on ‘Tail’

$100 - $60

- $80 $150

Example: Mixed Strategy

Company I Company II Strategies

Strategies B C

2

3

8

1

4

7

Let mixed strategy for company I be

{0.6, 0.4}; and for Company II be

{0.3, 0.7}.

Example

(continued)

 If Company I takes strategy 2, then the expected value of payoffs would be:

 If Company I takes strategy 3, then the expected value of payoffs would be:

 Expected value of payoffs for Company I is:

Example

(continued)

 If Company II takes strategy B, then the expected value of losses would be:

 If Company II takes strategy C, then the expected value of losses would be:

 Expected value of losses for Company II is:

Equilibrium Mixed Strategy

 An equilibrium mixed strategy makes expected values of any player’s individual strategies identical .

 Every game contains one equilibrium mixed strategy.

 The equilibrium mixed strategy is the best strategy.

Equilibrium

A status is equilibrium if it is balanced; and once it is off the balance, it will be brought back to balance automatically by its internal force.

Equilibrium in Game Theory

 In a game, a status is a combination of strategies of two players.

 A status is equilibrium if neither player will be better off by unilaterally leaving that status; and it would rationally come back once leaving the status.

How to Find Equilibrium Mixed

Strategy

 By linear programming (as introduced in book)

 By QM for Windows, – we use this approach.

Both Are Better Off at

Equilibrium

 At equilibrium, both players are better off, compared to maximin strategy for row player and minimax strategy for column player.

 No player would benefit from unilaterally changing the strategy.

A Care-Free Strategy

The row player’s expected gain remains constant as far as he stays with his mixed strategy (no matter what strategy the column player uses).

The column player’s expected loss remains constant as far as he stays with his mixed strategy (no matter what strategy the row player uses).

Unilateral Change from

Equilibrium by Column Player probability

0.6

0.4

Strat 2

Strat 3

0.1

B

8

1

0.9

C

4

7

Unilateral Change from

Equilibrium by Column Player probability

0.6

0.4

Strat 2

Strat 3

1.0

B

8

1

4

7

0

C

Unilateral Change from

Equilibrium by Row Player probability

0.2

0.8

Strat 2

Strat 3

0.3

B

8

1

0.7

C

4

7

A Double-Secure Strategy

 At the equilibrium, the expected gain or loss will not change unless both players give up their equilibrium strategies.

Note: Expected gain of row player is always equal to expected loss of column player, even not at the equilibrium, since 0-sum)

Both Leave Their Equilibrium

Strategies probability

0.5

0.5

Strat 2

Strat 3

0.8

B

8

1

0.2

C

4

7

Both Leave Their Equilibrium

Strategies probability

0.2

0.8

Strat 2

Strat 3

8

1

0

B

4

7

1

C

Penalty for Leaving

Equilibrium

 It is equilibrium because it discourages any unilateral change.

 If a player unilaterally leaves the equilibrium strategy, then

– his expected gain or loss would not change, and

– once the change is identified by the competitor, the competitor can easily beat the non-equilibrium strategy.

Find the Equilibrium Mixed

Strategy

 Method 1: As on p.573-574 of our text book. The method is limited to 2X2 payoff tables.

 Method 2: Linear programming. A general method.

 Method we use: Software QM.

Implementation of a Mixed

Strategy

 Applied in the situations where the mixed strategy would be used many times.

 Randomly select a strategy each time according to the probabilities in the strategy.

 If you had good information about the payoff table, you could figure out not only your best strategy, but also the best strategy of your competitor (!).

Dominating Strategy vs.

Dominated Strategy

 For row strategies A and B: If A has a better (larger) payoff than B for any column strategy, then B is dominated by A.

 For column strategies X and Y: if X has a better (smaller) payoff than Y for any row strategy, then Y is dominated by X.

 A dominated decision can be removed from the payoff table to simplify the problem.

Example:

Company I Company II Strategies

Strategies A B C

1

2

3

9

11

4

7

8

1

2

4

7

2-Person Non-zero Sum Game

One player’s gain is not equal to the other player’s loss.

Prisoners’ dilemma (see a separate

PowerPoint presentation)

Decision Theory Problems

 If one of players in a game theory problem is the “Mother Nature” or the “God”, then it becomes a Decision Theory Problem, as in

Chapter 12.

Examples of Decision Theory

Problems

 How high the dam should be built to deal with possible flood;

 Which stock would be selected for your investment;

 Hoe many cases of milk to order every week for a grocery store;

 How many cashiers to hire to serve customers at a satisfactory level.

Major Difference

 In a game theory problem

, one player’s strategy would affect the other’s strategy.

 In a decision theory problem , the action of

Mother Nature is not influenced by a human’s. Mother Nature’s action is simply random in the eyes of humans, which is called state of nature .

Approaches for Making

Decision (1)

 If probabilities of states of nature can be figured out, then the alternative with highest expected value of possible payoffs will be the best decision, by using a decision table or a decision tree.

Approaches for Making

Decision (2)

 If probabilities of states of nature are not known, then the alternative with highest average payoff will be the decision, or the alternative whose largest possible regret is the smallest will be the decision, dependent on decision maker’s preference.

Example:

States of Nature

Decision Favorable Unfavorable

Alternatives market market

0.4

0.6

Large plant $200,000 -$180,000

Small plant $100,000 -$20,000

Doing nothing $0 $0

What is the best alternative for the decision of investment?

Download