Mixed Strategy

views updated

Mixed Strategy

In the theory of games a player is said to use a mixed strategy whenever he or she chooses to randomize over the set of available actions. Formally, a mixed strategy is a probability distribution that assigns to each available action a likelihood of being selected. If only one action has a positive probability of being selected, the player is said to use a pure strategy.

A mixed strategy profile is a list of strategies, one for each player in the game. A mixed strategy profile induces a probability distribution or lottery over the possible outcomes of the game. A (mixed strategy) Nash equilibrium is a strategy profile with the property that no single player can, by deviating unilaterally to another strategy, induce a lottery that he or she finds strictly preferable. In 1950 the mathematician John Nash proved that every game with a finite set of players and actions has at least one equilibrium.

To illustrate, one can consider the children’s game Matching Pennies, in which each of two players can choose either heads (H ) or tails (T ); player 1 wins a dollar from player 2 if their choices match and loses a dollar to player 2 if they do not. This game can be represented as follows:

	H	T
H	(1,–1)	(–1, 1)
T	(–1, 1)	(1,–1)

Here player 1’s choice determines a row, player 2’s choice determines a column, and the corresponding cell indicates the payoffs to players 1 and 2 in that order. This game has a unique Nash equilibrium that requires each player to choose each action with probability one-half.

Another example is provided by the Hawk-Dove game, which has been used by evolutionary biologists to model animal conflicts:

	H	D
H	(0, 0)	(4, 1)
D	(1, 4)	(2, 2)

In this game any strategy profile in which one player chooses H and the other picks D is in equilibrium. Hence, there are two pure strategy equilibria, (H, D ) and (D, H ). In addition, there is a mixed strategy equilibrium in which each player selects H with probability 2/3.

One feature of a mixed strategy equilibrium is that given the strategies chosen by the other players, each player is indifferent among all the actions that he or she selects with positive probability. Hence, in the Matching Pennies game, given that player 2 chooses each action with probability one-half, player 1 is indifferent among choosing H, choosing T, and randomizing in any way between the two. Because randomization is more complex and cognitively demanding than is the deterministic selection of a single action, this raises the question of how mixed strategy equilibria can be sustained and, more fundamentally, how mixed strategies should be interpreted.

In an interpretation advanced in 1973 by John Harsanyi, a mixed strategy equilibrium of a game with perfect information is viewed as the limit point of a sequence of pure strategy equilibria of games with imperfect information. Specifically, starting from a game with perfect information, one can obtain a family of games with imperfect information by allowing for the possibility that there are small random variations in payoffs and that each player is not fully informed of the payoff functions of the other players. Harsanyi showed that the frequency with which the various pure strategies are chosen in these perturbed games approaches the frequency with which they are chosen in the mixed strategy equilibrium of the original game as the magnitude of the perturbation becomes vanishingly small.

A very different interpretation of mixed strategy equilibria comes from evolutionary biology. To illustrate this, consider a large population in which each individual is programmed to play a particular pure strategy. Individuals are drawn at random from that population and are matched in pairs to play the game. The payoff that results from the adoption of any specific pure strategy will depend on the frequencies with which the various strategies are represented in the population. Suppose that those frequencies change over time in response to payoff differentials, with the population share of more highly rewarded strategies increasing at the expense of strategies that yield lower payoffs. Any rest point of this dynamic process must be a Nash equilibrium. In the special case of the Hawk-Dove game any trajectory that begins at a state in which both strategies are present converges to the unique mixed strategy equilibrium of the game. In other words, the long-run population share of each strategy corresponds exactly to the likelihood with which it is played in the mixed strategy equilibrium.

SEE ALSO Evolutionary Games; Game Theory; Nash Equilibrium; Nash, John

BIBLIOGRAPHY

Harsanyi, John C. 1973. Games with Randomly Disturbed Payoffs: A New Rationale for Mixed Strategy Equilibrium Points. International Journal of Game Theory 2: 1–23.

Maynard Smith, John. 1982. Evolution and the Theory of Games. Cambridge, U.K., and New York: Cambridge University Press.

Maynard Smith, John, and G. R. Price. 1973. The Logic of Animal Conflict. Nature 246: 15–18.

Nash, John F. 1950. Equilibrium Points in N-Person Games. Proceedings of the National Academy of Sciences 36 (1): 48–49.

Osborne, Martin J., and Ariel Rubinstein. 1994. A Course in Game Theory. Cambridge, MA: MIT Press.

Taylor, Peter D., and Leo B. Jonker. 1978). Evolutionarily Stable Strategies and Game Dynamics. Mathematical Biosciences 40: 145–156.

Rajiv Sethi

International Encyclopedia of the Social Sciences