The Prisoner`s Dilemma Dan Kelsey Definition: The Blackwell

advertisement
The Prisoner’s Dilemma
Dan Kelsey
Definition: The Blackwell Encyclopedic of Managerial Economics (McAuliffe, 2005) provides the
following definition of the Prisoner’s Dilemma (PD).
“Two suspects are arrested by the police and are charged with a minor crime. They are
also suspected of a major crime but there is not enough evidence to convict them without a
confession. The two prisoners are then offered the following deal:
1.
2.
3.
If one prisoner confesses to the major crime and implicates the other, he will serve 6
months in jail while the other will serve 6 years in jail.
If neither prisoner confesses to the major crime, each will be convicted of the minor
offense and will serve 1 year in prison.
If both prisoners confess to the major crime, they will both serve 3 years in prison”
The following table illustrates the Prisoner’s Dilemma:
Prisoner A
Confess
Confess
3, 3
Don’t Confess
6, .5
Prisoner B
Don’t Confess
.5, 6
1, 1
(The first number represents Prisoner A, the second number represents Prisoner B.)
The PD allows an individual or firm to determine the best strategy to pursue based on
predictions of the effects of a rival individual or firm pursuing the same or opposite strategy.
A commonly used term is a Nash Equilibrium. This occurs when no player in the PD is able to
improve payoff by unilaterally changing their strategy, given the other player’s strategies (Baye, 2006, p.
357). Baye (2006, p. 361) illustrates a Nash Equilibrium in the following way:
Firm A
Firm B
Advertise
Don’t Advertise
Advertise
$4, $4
$20, $1
Don’t Advertise
$1, $20
$10, $10
In this example, the Nash Equilibrium for the game is for each Firm to advertise. The decision to
advertise cannot be improved on given the other firm’s decision. Another term is used in the Prisoner’s
Dilemma is a player’s dominant strategy. A dominant strategy is a strategy that results in the highest
payoff to a player regardless of the opponent’s action (Baye, 2006, p. 355). In the above example, each
firm has a dominant strategy of Advertise. If Firm B decides to advertise, then the best alternative is for
Firm A to advertise because they will earn $4. If firm B decides to not advertise, then the best alternative
for Firm A is to advertise. As a result, Firm A has a dominant strategy to advertise.
Discussion: The Prisoner’s Dilemma (PD) was created by John Nash, who was awarded the
Nobel prize for his work in originating game theory. In the definition provided above, the key piece of
information is that the prisoners are unable to communicate and arrange a mutually beneficial strategy.
Collusion is not an option. When the PD is played one time, the winning strategy is to “rat out the other
guy” (Steidtmann, 2007). But what happens when the game is played multiple times? This is most
similar to the business environment where competitors are fiercely competitive and trying to gain
market share. In this situation, the most effective strategy for multiple plays is “Tit-for-Tat”. Do to your
opponent what he did to you in the last turn (Steidtmann, 2007). To summarize, if you and your best
friend rob a bank and are picked up by the police, your best strategy is to confess to the crime and rat
out your friend. But if you and your best friend are business rivals, and one of you lowers your price,
then the best strategy is for the other to lower his price. However, this could result in a price war.
Application: The PD can be applied in any situation where a firm or situation is faced with a
dilemma. It was applied to management (Seale, Arend, & Phelan, 2006), leadership (Clark & Lee, 2005),
policy, global economic issues ("Playing games with the planet.," 2007), retail (Steidtmann, 2007), labor
negotiations (Haber, 2006), the effect on “free riders” (Gunnthorsdottir & Rapoport, 2006), and
standard setting (Axelrod & Mitchell, 1995).
The PD was applied to leadership and management (Clark & Lee, 2005; Seale et al., 2006) by
examining the effect interaction between a leadership and the ability of leadership to minimize the
effect of a PD. Clark and Lee (2005), determined that while good leadership can help groups reduce the
harmful effects of the PD, poor leadership can result in personal benefit at the expense of the
organization. Furthermore, they state “good leaders have a responsibility to reduce the harm their
group suffers from the non-cooperative behavior of others”. In other words, good leadership can result
in people acting for the good of the firm instead of the good of the individual.
When managers are considering alliances, Seale and Phelan (2006) suggest that managers
should, in addition to other factors, enter alliances with a bias toward to cooperation, remain in the
alliance for as a long as a partner’s expected level of cooperation exceeds the solution , favor alliances
with compact payoffs, and choose less experienced alliance partners. By remaining in alliance when
cooperation exceeds the solution, both parties are acting in one another’s best interest. Otherwise, the
alliance will turn into a one shot PD where a firm will “rat out the other firm”.
In applying the PD to labor negotiations in the 2002 Major League Baseball Players Association
contract talks, Haber (2006) uses the players and labor as the two sides in a PD. In this game, each party
can choose to be aggressive or conciliatory. In those cases where both sides choose to be aggressive,
strikes have happened and resulted in an impasse. This is a no-win situation for everyone. Currently,
many unions in the United States (US) are faced with this dilemma. Although many of the unions were
highly successful in the past, they are finding their ability to act in the best interests of their members is
somewhat limited. Because of the ability to outsource manufacturing at drastically lower labor costs, a
union may find itself in its’ own PD. Should the union act in the best interests of the organization as a
whole, or act in a way that benefits the workers. By taking a strong stance in representing the workers,
unions are increasing management’s willingness to pursue the option that is most advantageous for
them.
PD can also be applied to environmental and public policy decisions. In one article, the PD is
applied to the Kyoto Accord and governments of other countries either being unwilling to address global
warming or assuming others will address the issue ("Playing games with the planet.," 2007). In
discussing various forms of a PD, Arce and Sandler (2005) extend a scenario to a situation where one
country causes river pollution in a downstream country while the downstream country causes air
pollution in downwind in the first country. In this case they state mutual altruism can lead to net gains in
both countries.
From a business competition perspective, the best known public PD was the competition for
two competing industry standards for tapes used in video cassette recorders (VCRs) (Axelrod & Mitchell,
1995). The competition for standards represents an ideal situation for the PD. When a decision is made
regarding what will be the standard in a given industry, the potential profits to the creator of the
standard can be enormous. Because the adoption of their standard provides the greatest payback, an
organization will push for adoption and often times not attempt to reach an alliance with competitors
where a win-win solution can be reached.
The final area is the topic of free-riding. By using the concept of a PD, it was shown that a basic
reinforcement based learning model can reduce intra-group rivalry and thereby increase inter-group
success (Gunnthorsdottir & Rapoport, 2006). For example, a disruptive player on an athletic team can be
highly detrimental to the team despite the level of talent contained within the team. As a result, the
team is held back from reaching its’ potential. But when a reinforcement model is used and the
troublesome player learns how a change in behavior can impact the success of the team.
In some respects, the PD is applicable where collusion is not permitted. If collusion were
permitted, firms would act in a way that is most beneficial to both firms. Since collusion is not an option,
Nash was able to develop a model that is highly applicable to a variety of settings ranging from public
policy to relationships within a team. By applying the PD to a situation, an individual or organization is
better able to determine the best option.
Questions
A student is preparing for final exams. The student is trying to determine if study time should be
devoted to the economics final exam. The student, being a very bright and intelligent MBA
student, constructed the following PD. The student figures he can get a higher score on a true /
false test, but the professor only gives true / false tests 40% of the time and multiple choice
tests 60% of the time. The students will not find out the format of the test until the professor
distributes the test.
Student
Professor
Mult. Choice
Study
90, 60
T/F
95, 40
Not Study
75, 60
80, 40
1. What is the student’s dominant strategy?
a. Study
b. Not Study
c. The Student does not have a dominant strategy
2. The student figures he needs to score at least an 80 on the test to get an A in the class.
What is the best strategy for the student?
a. Not study and hope the test is true / false.
b. Study
c. A and B above
d. None of the above
3. Assume the PD above is representative of the entire class, and not one person. The
Professor has both tests when he enters the room, but has just come from a faculty meeting
and is very frustrated. He decides to take it out on the class and make sure the students get
the lowest grade possible. What test should he provide?
a. Multiple Choice
b. True/False
c. Essay
d. Does not matter, the students are all failing anyway.
4. You own a gas station on one corner, and a rival owns a gas station on the opposite corner.
You have both been in business for many years and plan to remain in business for many
years to come. Your rival drops the price of gas on Monday. What should you do?
a. Offer a free carwash.
b. Lower your price so it is less than the rival’s price.
c. Nothing, you have loyal customers you are certain will continue to come to your
station.
d. Raise your price as a means of differentiating yourself from the competition.
5. Many firms offer profit sharing plans and issue checks to employees for meeting certain
performance goals. What should a manager do to earn the maximum bonus available?
a. Assure the employees under him are working together in pursuit of departmental
and company goals.
b. Do the same as always and hope the company has a really good year.
c. Use a “slash and burn” policy in reaching the goals and then quit the company after
receiving the profit sharing check.
Answer Key:
Question 1: Correct answer is a. Study. If the professor gives a multiple choice test, the student will get
the highest grade through study. If the professor gives a T/F test, the student will get a higher grade
through study.
Queston 2: Correct answer is c. A and B above. If the student studies, the minimum grade is 90. But if
the student does not study, they only way to get the required grade is if the test is True/False.
Question 3: The professor should give the multiple choice test. If students have studied, they will get a
lower test score. Essay is not an option. The correct answer is a.
Question 4: Correct Answer is b. This is not a one play situation. Therefore, since multiple games are
involved, the correct answer is to use a tit-for-tat strategy. Although answer a may bring in some
customers, according to game theory the best option is tit-for-tat.
Question 5: According to PD theory, the best option is to take what you can when it is a one shot game.
In this case, the best answer is c. But if the manager intends on staying with the firm, then the correct
answer is A. Based on the question and one shot theory, the answer is a.
References
Arce M., D. G., & Sandler, T. (2005). The Dilemma of the Prisoners Dilemmas. Kyklos, 58(1), 3-24.
Axelrod, R., & Mitchell, W. (1995). Coalition Formation in Standard-setting Alliances. Management
Science, 41(9), 1493-1508.
Baye, M. R. (2006). Managerial Economics and Business Strategy (5th Edition ed.). New York: McGraw Hill Irwin.
Clark, J. R., & Lee, D. R. (2005). Leadership, Prisoners' Dilemmas, and Politics. CATO Journal, 25(2), 379397.
Gunnthorsdottir, A., & Rapoport, A. (2006). Embedding social dilemmas in intergroup competition
reduces free-riding. Organizational Behavior & Human Decision Processes, 101(2), 184-199.
Haber, L. J. (2006). Labor Negotiations and Game Theory: The Case of Asymmetric Bargaining Power.
Journal of Collective Negotiations, 31(1), 21-32.
McAuliffe, R. E. (2005). Prisoner's dilemma. Blackwell Encyclopedic Dictionary of Managerial Economics,
1-194.
Playing games with the planet. (2007). Economist, 384(8548), 83.
Seale, D. A., Arend, R. J., & Phelan, S. (2006). Modeling alliance activity: Opportunity cost effects and
manipulations in an iterated prisoners dilemma with exit option. Organizational Behavior &
Human Decision Processes, 100(1), 60-75.
Steidtmann, C. (2007). The Nash Equilibrium: Game Theory and Organizational Success. Gourmet
Retailer, 28(9), 138.
Download