Hurwicz's Criterion of Realism and decision-making under massive uncertainty
For a PDF version of this post, click here.
[UPDATE: After posting this, Johan Gustafsson got in touch and it seems he and I have happened upon similar points via slightly different routes. His paper is here. He takes his axioms from Binmore's Rational Decisions, who took them from Milnor's 'Games against Nature'. Hurwicz and Arrow also cite Milnor, but Hurwicz's original characterisation appeared before Milnor's paper, and he cites Chernoff's Cowles Commission Discussion Paper: Statistics No. 326A as the source of his axioms.]
In 1951, Leonid Hurwicz, a Polish-American economist who would go on to share the Nobel prize for his work on mechanism design, published a series of short notes as part of the Cowles Commission Discussion Paper series, where he introduced a new decision rule for choice in the face of massive uncertainty. The situations that interested him were those in which your evidence is so sparse that it does not allow you to assign probabilities to the different possible states of the world. These situations, he thought, fall outside the remit of Savage's expected utility theory.
The rule he proposed is called Hurwicz's Criterion of Realism or just the Hurwicz Criterion. He introduced it in the form in which it is usually stated in February 1951 in the Cowles Commission Discussion Paper: Statistics No. 356 -- the title was 'A Class of Criteria for Decision-Making under Ignorance'. The Hurwicz Criterion says that you should choose an option that maximises what I'll call its Hurwicz score, which is a particular weighted average of its best-case utility and its worst-case utility. A little more formally: We follow Hurwicz and let an option be a function from a set of possible states of the world to the real numbers . Now, you begin by setting the weight you wish to assign to the best-case utility of an option, and then you assign the remaining weight to its worst-case. Then the Hurwicz score of option is just
However, reading his other notes in the Cowles series that surround this brief three-page note, it's clear that Hurwicz's chief interest was not so much in this particular form of decision rule, but rather with any such rule that determines the optimal choices solely by looking at their best- and worst-case scenarios. The Hurwicz Criterion is one such rule, but there are others. You might, for instance, weight the best- and worst-cases not by fixed constant coefficients, but by coefficients that change with the minimum and maximum values, or change with the difference between them or with their ratio. One of the most interesting contributions of these papers that surround the one in which Hurwicz gives us his Criterion is a characterization of rules that depend only on best- and worst-case utilities. Hurwicz gave rather an inelegent initial version of that characterization in Cowles Commission Discussion Paper: Statistics No. 370, published at the end of 1951 -- the title was 'Optimality Criteria for Decision-Making under Ignorance'. Kenneth Arrow then seems to have helped clean it up, and they published the new version together in the Appendix of their edited volume, in which they contributed most of the chapters, often with co-authors, Studies in Resource Allocation. The version with Arrow is still reasonably involved, but the idea is quite straightforward, and it is remarkable how strong a restriction Hurwicz obtains from seemingly weak and plausible axioms. This really seems to me a case where axioms that seem quite innocuous on their own can combine in interesting ways to make trouble. So I thought it might be interesting to give a simplified version that has all the central ideas.
Here's the framework:
Possibilities and possible worlds. Let be the set of possibilities. A possible world is a set of possibilities--that is, a subset of . And a set of possible worlds is a partition of . That is, presents the possibilities at at a certain level of grain. So if , then is the most fine-grained set of possible worlds, but there are coarser-grained sets as well, such as or . (This is not quite how Hurwicz understands the relationship between different sets of possible states of the world -- he talks of deleting worlds rather than clumping them together, but I think this formalization better captures his idea.)
Options. For any set of possible worlds, an option defined on is simply a function from into the real numbers . So an option takes each world in and assigns a utility to it. (Hurwicz refers to von Neumann and Morgenstern to motivate the assumption that utilities can be measured by real numbers.)
Preferences. For any set of possible worlds, there is a preference relation over the options defined on . (Hurwicz states his result in terms of optimal choices rather than preferences. But I think it's a bit easier to see what's going on if we state it in terms of preferences. There's then a further question as to which options are optimal given a particular preference ordering, but we needn't address that here.)
Hurwicz's goal was to lay down conditions on these preference relations such that the following would hold:
Hurwicz's Rule Suppose and are options defined on . Then
(H1) If
. That is, you should be indifferent between any two options with the same maximum and minimum.
(H2) If
. That is, you should prefer one option to another if the worst case of the first is better than the worst case of the second and the best case of the first is better than the best case of the second.
Here are the four conditions or axioms:
(A1) Structure is reflexive and transitive.
(A2) Weak Dominance
(A3) Permutation Invariance For any set of worlds and any options defined on , if is a permutation of the worlds in and if for all in , then .
This just says that it doesn't matter to you which worlds receive which utilities -- all that matters are the utilities received.
(A4) Coarse-Graining Invariance Suppose is a set of possible worlds and suppose are options on with and . Then let , so that has the same worlds as except that, instead of and , it has their union. And define options and on as follows: and , and and for all other worlds. Then iff .
This says that if two options don't distinguish between two worlds, it shouldn't matter to you whether they are defined on a fine- or coarse-grained space of possible worlds.
Then we have the following theorem:
Theorem (Hurwicz) (A1) + (A2) + (A3) + (A4) (H1) + (H2).
Here's the proof. Assume (A1) + (A2) + (A3) + (A4). First, we'll show that (H1) follows. We'll sketch the proof only for the case in which , since that gives all the crucial moves. So denote an act on by a triple . Now, suppose that and are options defined on with the same minimum, , and maximum, . Let be the middle value of and the middle value of .
Now, first note that
After all, by Permutation Invariance. And, by Coarse-Graining Invariance, iff , where . And, by Permutation Invariance and the reflexivity of , . So , as required. And now we have, by previous results, Permutation Invariance, and Weak Dominance:
and
And so, by transitivity, . That gives (H1).
For (H2), suppose has worst case , middle case , and best-case , while has worst case , middle case , and worst case . And suppose and . Then as required.
In a follow-up blog post, I'd like to explore Hurwicz's conditions (A1-4) in more detail. I'm a fan of his approach, not least because I want to use something like his decision rule within the framework of accuracy-first epistemology to understand how we select our first credences -- our ur-priors or superbaby credences (see here). But I now think Hurwicz's focus on only the worst-case and best-case scenarios is too restrictive. So I have to grapple with the theorem I've just presented. That's what I hope to do in the next post. But here's a quick observation. (A1-4), while plausible at first sight, sail very close to inconsistency. For instance, (A1), (A3), and (A4) are inconsistent when combined with a slight strengthening of (A2). Suppose we add the following to (A2) to give (A2 ):
3. If for all in and for some in , then .
Then we have know from above that , but (A2 ) entails that , which gives a contradiction.
[UPDATE: After posting this, Johan Gustafsson got in touch and it seems he and I have happened upon similar points via slightly different routes. His paper is here. He takes his axioms from Binmore's Rational Decisions, who took them from Milnor's 'Games against Nature'. Hurwicz and Arrow also cite Milnor, but Hurwicz's original characterisation appeared before Milnor's paper, and he cites Chernoff's Cowles Commission Discussion Paper: Statistics No. 326A as the source of his axioms.]
In 1951, Leonid Hurwicz, a Polish-American economist who would go on to share the Nobel prize for his work on mechanism design, published a series of short notes as part of the Cowles Commission Discussion Paper series, where he introduced a new decision rule for choice in the face of massive uncertainty. The situations that interested him were those in which your evidence is so sparse that it does not allow you to assign probabilities to the different possible states of the world. These situations, he thought, fall outside the remit of Savage's expected utility theory.
The rule he proposed is called Hurwicz's Criterion of Realism or just the Hurwicz Criterion. He introduced it in the form in which it is usually stated in February 1951 in the Cowles Commission Discussion Paper: Statistics No. 356 -- the title was 'A Class of Criteria for Decision-Making under Ignorance'. The Hurwicz Criterion says that you should choose an option that maximises what I'll call its Hurwicz score, which is a particular weighted average of its best-case utility and its worst-case utility. A little more formally: We follow Hurwicz and let an option be a function
However, reading his other notes in the Cowles series that surround this brief three-page note, it's clear that Hurwicz's chief interest was not so much in this particular form of decision rule, but rather with any such rule that determines the optimal choices solely by looking at their best- and worst-case scenarios. The Hurwicz Criterion is one such rule, but there are others. You might, for instance, weight the best- and worst-cases not by fixed constant coefficients, but by coefficients that change with the minimum and maximum values, or change with the difference between them or with their ratio. One of the most interesting contributions of these papers that surround the one in which Hurwicz gives us his Criterion is a characterization of rules that depend only on best- and worst-case utilities. Hurwicz gave rather an inelegent initial version of that characterization in Cowles Commission Discussion Paper: Statistics No. 370, published at the end of 1951 -- the title was 'Optimality Criteria for Decision-Making under Ignorance'. Kenneth Arrow then seems to have helped clean it up, and they published the new version together in the Appendix of their edited volume, in which they contributed most of the chapters, often with co-authors, Studies in Resource Allocation. The version with Arrow is still reasonably involved, but the idea is quite straightforward, and it is remarkable how strong a restriction Hurwicz obtains from seemingly weak and plausible axioms. This really seems to me a case where axioms that seem quite innocuous on their own can combine in interesting ways to make trouble. So I thought it might be interesting to give a simplified version that has all the central ideas.
Here's the framework:
Possibilities and possible worlds. Let
Options. For any set
Preferences. For any set
Hurwicz's goal was to lay down conditions on these preference relations such that the following would hold:
Hurwicz's Rule Suppose
(H1) If
(H2) If
Here are the four conditions or axioms:
(A1) Structure
(A2) Weak Dominance
- If
for all in , then . - If
for all in , then .
(A3) Permutation Invariance For any set of worlds
This just says that it doesn't matter to you which worlds receive which utilities -- all that matters are the utilities received.
(A4) Coarse-Graining Invariance Suppose
This says that if two options don't distinguish between two worlds, it shouldn't matter to you whether they are defined on a fine- or coarse-grained space of possible worlds.
Then we have the following theorem:
Theorem (Hurwicz) (A1) + (A2) + (A3) + (A4)
Here's the proof. Assume (A1) + (A2) + (A3) + (A4). First, we'll show that (H1) follows. We'll sketch the proof only for the case in which
Now, first note that
and
And so, by transitivity,
For (H2), suppose
In a follow-up blog post, I'd like to explore Hurwicz's conditions (A1-4) in more detail. I'm a fan of his approach, not least because I want to use something like his decision rule within the framework of accuracy-first epistemology to understand how we select our first credences -- our ur-priors or superbaby credences (see here). But I now think Hurwicz's focus on only the worst-case and best-case scenarios is too restrictive. So I have to grapple with the theorem I've just presented. That's what I hope to do in the next post. But here's a quick observation. (A1-4), while plausible at first sight, sail very close to inconsistency. For instance, (A1), (A3), and (A4) are inconsistent when combined with a slight strengthening of (A2). Suppose we add the following to (A2) to give (A2
3. If
Then we have know from above that
What an interesting result. I suppose it would be easy to reframe it as an impossibility theorem? You could add another axiom along the lines of "it must be possible for the middle-cases to make a difference to the decision" and then the result would show that there is no decision rule that meets all the conditions.
ReplyDeleteSuch a result would seem intuitive. It would basically be saying: if you refuse to assign probabilities to scenarios, you will have no principled way of assigning weights to the middle-case scenarios, so you will have no adequate decision rule.
Thanks, Jonathan! Really nice point! In fact, after posting, Johan Gustafsson sent on a draft paper of his in which he gets the same result and does indeed reframe as an impossibility result. I've put a link in an update at the top of the post.
Delete