Decision making under interval uncertainty: toward (somewhat) more convincing justifications for Hurwicz optimism-pessimism approach

Warattaya Chinnakum (Department of Economics, Chiang Mai University, Chiang Mai, Thailand)
Laura Berrout Ramos (Department of Computer Science, The University of Texas – El Paso, El Paso, Texas, USA)
Olugbenga Iyiola (Department of Computer Science, The University of Texas – El Paso, El Paso, Texas, USA)
Vladik Kreinovich (The University of Texas – El Paso, El Paso, Texas, USA)

Asian Journal of Economics and Banking

ISSN: 2615-9821

Article publication date: 14 April 2021

Issue publication date: 27 April 2021

732

Abstract

Purpose

In real life, we only know the consequences of each possible action with some uncertainty. A typical example is interval uncertainty, when we only know the lower and upper bounds on the expected gain. A usual way to compare such interval-valued alternatives is to use the optimism–pessimism criterion developed by Nobelist Leo Hurwicz. In this approach, a weighted combination of the worst-case and the best-case gains is maximized. There exist several justifications for this criterion; however, some of the assumptions behind these justifications are not 100% convincing. The purpose of this paper is to find a more convincing explanation.

Design/methodology/approach

The authors used utility approach to decision-making.

Findings

The authors proposed new, hopefully more convincing, justifications for Hurwicz’s approach.

Originality/value

This is a new, more intuitive explanation of Hurwicz’s approach to decision-making under interval uncertainty.

Keywords

Citation

Chinnakum, W., Berrout Ramos, L., Iyiola, O. and Kreinovich, V. (2021), "Decision making under interval uncertainty: toward (somewhat) more convincing justifications for Hurwicz optimism-pessimism approach", Asian Journal of Economics and Banking, Vol. 5 No. 1, pp. 32-45. https://doi.org/10.1108/AJEB-07-2020-0029

Publisher

:

Emerald Publishing Limited

Copyright © 2020, Warattaya Chinnakum, Laura Berrout Ramos, Olugbenga Iyiola and Vladik Kreinovich.

License

Published in Asian Journal of Economics and Banking. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode


1. Formulation of the problem

Need to make decisions under interval uncertainty. In many real-life situations, we need to make a decision, i.e. we need to select one of the possible alternatives. For example, we want to select the best investment strategy, we need to decide whether to accept a new job offer, etc.

In the ideal world, we should know the exact consequence of each possible alternative. In such an ideal case, we select an alternative which is the best for us. For example, if the goal of the investment is to save for retirement, then we should select the investment strategy that will bring us the larger amount of money by the expected retirement date.

In real world, there is uncertainty. We can rarely predict the exact consequences of each action. In the simplest case, instead of knowing the exact amount of money m resulting from each alternative, we only know that this amount will be somewhere between the values m¯ and m¯. In other words, we do not know the exact value m, but instead we only know the interval [m¯,m¯] that contains the actual (not yet known) value m. Such a situation is known as the situation of interval uncertainty. If we know intervals corresponding to different alternatives, which alternative should we select?

In other cases, in addition to the bounds m¯ and m¯, we also have some information about which values from the corresponding interval are more probable and which are less probable. In other words, we have some information – usually partial – about the actual probability distribution on the interval [m¯,m¯]. Sometimes, we know the exact probability distribution. In this case, we can, e.g., select the alternative for which the expected gain is the largest – of, if we want to be cautious, e.g. the alternative for which the gain guaranteed with a certain probability (e.g. 80%) is the largest.

In practice, we rarely know the exact probability distribution. Even if we know that the distribution is, e.g. Gaussian, we still do not know the exact values of the corresponding parameters – from the observations, we can only determine parameters with some uncertainty. For different possible combinations of these parameters, the expected gain – or whatever else characteristic we use – may take different values. Thus, for each alternative, instead of the exact value m of the corresponding objective function (such as expected gain), we have a whole interval [m¯,m¯] of possible values of this objective function. So, we face the exact same problem as in the simplest possible case – we need to select an alternative in a situation when for each alternative, we only know the interval of possible values of the objective function.

How decisions under interval uncertainty are currently made. As we have mentioned earlier, decision-making under interval uncertainty is an important practical problem. Not surprisingly, methods for solving this problem have been known for many decades. Usually, practitioners use a solution proposed in the early 1950s by the future Nobelist Leo Hurwicz; see, e.g. Hurwicz (1951), Luce and Raiffa (1989), Kreinovich (2014). According to this solution, a decision-maker should:

  • first, select a parameter α from the interval [0,1]; and then

  • select an alternative for which the following combination attains the largest possible value:

    α·u¯+(1α)·u¯.

This idea is known as the optimism–pessimism criterion, and the selected value α is known as the optimism parameter. The reason for these terms is straightforward:

  • If α = 1, this means that the decision-maker simply selects the alternative with the largest possible value of m¯. In other words, the decision-maker completely ignores the possibility that the outcome of each alternative can be smaller than in the best possible case and bases his/her decision exclusively on comparing these best possible consequences of different actions. This is clearly an extreme case of an optimist.

  • Vice versa, if α = 0, this means that the decision-maker simply selects the alternative with the largest possible value of m¯. In other words, the decision-maker completely ignores the possibility that the outcome of each alternative can be better that in the worst possible case and bases his/her decision exclusively on comparing these worst possible consequences of different actions. This is clearly an extreme case of a pessimist.

Both these situations are extreme. In real life, most people take into account both good and bad possibilities, i.e. in Hurwicz terms, they make decisions based on some intermediate value α – which is larger than the pessimist’s 0 but smaller than the optimist’s 1.

How can we explain the current approach to decision-making under uncertainty. There exist reasonable explanations for Hurwicz criteria, both:

Remaining problem and what we do in this paper. In both monetary and utility cases, to derive Hurwicz’s formula, we need to make certain assumptions:

  1. some of these assumptions are more reasonable; and

  2. some of these assumptions are slightly less convincing.

Natural questions are as follows:

  • Do we need these somewhat less convincing assumptions?

  • Can we avoid them altogether – and, if not, can we replace them with somewhat more convincing assumptions?

These are the question that we will analyze – and answer – in this paper.

Structure of this paper. We will start this paper with the easier-to-describe and easier-to-analyze case of monetary alternatives. First, in Section 2, we describe the usual assumptions leading to the Hurwicz criterion, explain how the Hurwicz criterion can be derived from these assumptions (in this, we largely follow (Kreinovich, 2017)) and why some of these assumptions may not sound fully convincing. Then, in Section 3, we present new – hopefully more convincing – assumptions and show how Hurwicz criterion can be derived from the new assumptions.

Then, we deal with the utility case. In Section 4, we briefly remind the readers who are not familiar with all the technical details of decision theory, what is utility and what are the properties of utility. In Section 5, we describe the usual assumptions leading to Hurwicz criterion for the utility case (they are somewhat different from the monetary case), explain how Hurwicz criterion can be derived from these assumptions (in this, we also largely follow (Kreinovich, 2017)) and why some of these assumptions may not sound fully convincing. Finally, in Section 6, we show that the Hurwicz criterion can be derived from the (hopefully) more convincing assumptions in the utility case as well.

2. Monetary case: usual derivation of the Hurwicz criterion and the limitations of this derivation

To make a decision, we need to have an exact numerical equivalent for each interval. We want to be able to compare different alternatives with interval uncertainty. In particular, for each interval-valued alternative [m¯,m¯] and for each alternative with a known exact monetary value m, we need to be able to decide the following:

  • whether the exact-valued alternative is better; or

  • whether the interval-valued alternative is better.

Of course, if m<m¯, then no matter what is the actual value from the interval [m¯,m¯], this value will be larger than m. Thus, in this case, the interval alternative is clearly better. We will denote this by m<[m¯,m¯].

Similarly, if m>m¯, then no matter what is the actual value from the interval [m¯,m¯], this value will be smaller than m. Thus, in this case, the interval alternative is clearly worse: [m¯,m¯]<m.

If m<[m¯,m¯] and m′ < m, the clearly m<[m¯,m¯]. Similar, if [m¯,m¯]<m and m < m′, then [m¯,m¯]<m.

One can show that because of this, there is a threshold value separating the two cases, namely, the value.

sup{m:m<[m¯,m¯]}=inf{m:[m¯,m¯]<m}.

Let us denote this threshold value – depending on m¯ and m¯ – by f(m¯,m¯).

By definition, for every ε > 0, we have:

f(m¯,m¯)ε<[m¯,m¯]<f(m¯,m¯)+ε.

In particular, this property holds for an arbitrarily small ε, including such small ε that no one will notice the difference between the value m and the values mε and m + ε. So, from the practical viewpoint, we can say that the interval [m¯,m¯] is equivalent to the monetary value f(m¯,m¯). We will denote this equivalence by following:

[m¯,m¯]f(m¯,m¯).

From this viewpoint, all we need to do to describe decision-making under interval uncertainty is to describe the corresponding function f(m¯,m¯).

The numerical value f(m¯,m¯) should always be between m¯ and m¯. As we have mentioned earlier, for every value m<m¯, we have m<[m¯,m¯], the set {m:m<[m¯,m¯]} contains all the values from the set (,m¯). Thus, its supremum f(m¯,m¯) has to be greater than or equal to all the values m<m¯, in particular, than all the values m=m¯1/n. So, we must have:

m¯1/n<f(m¯,m¯)
for all n. In the limit n → ∞, we conclude that m¯f(m¯,m¯).

Similarly, as for every value m>m¯, we have [m¯,m¯]>m, the set:

{m:[m¯,m¯]<m}
contains all the values from the set (m¯,). Thus, its infimum f(m¯,m¯) has to be smaller than or equal to all the values m>m¯ – in particular, than all the values m=m¯+1/n. So, we must have f(m¯,m¯)<m¯+1/n for all m. In the limit n → ∞, we conclude that f(m¯,m¯)m¯.

Based on the two previous examples, we should always have m¯f(m¯,m¯)m¯.

Let us prepare for the usual derivation of Hurwicz criterion. To explain the usual derivation of Hurwicz criterion from several assumptions, let us first provide the usual motivation for these assumptions.

Monotonicity. Let us assume that we start with an interval [m¯,m¯], and then we:

  • delete all the lowest-value options, i.e. options for which mm¯ for some m¯>m¯; and/or

  • add several higher-value options, with m>m¯, e.g. all the values from m¯ to some larger value m¯>m¯.

After this, we get a clearly better interval [m¯,m¯]. Thus, we conclude that the function f(m¯,m¯) should be monotonic: if m¯m¯ and m¯m¯, then f(m¯,m¯)f(m¯,m¯).

Additivity. Suppose that we have two situations:

  1. in the first situation, we can get any value from a¯ to a¯; and

  2. in the second situation, we can get any value from b¯ to b¯.

By definition of the function f(m¯,m¯), we are willing to pay the value f(a¯,a¯) to participate in the first situation and the value f(b¯,b¯) to participate in the second situation.

What if we consider these two choices as a single situation? In this case, the smallest possible value that we get overall – in both situations – is when we get the smallest possible value a¯ in the first situation and the smallest possible value b¯ in the second situation. In this case, the overall value is a¯+b¯.

Similarly, the largest possible value that we get overall – in both situations – is when we get the largest possible value a¯ in the first situation and the largest possible value b¯ in the second situation. In this case, the overall value is a¯+b¯.

Thus, when we consider these two choices as a single situation, the interval of possible monetary gains has the form [a¯+b¯,a¯+b¯]. So, the equivalent monetary value of the two choices treated as a single situation is f(a¯+b¯,a¯+b¯).

It is reasonable to require that the price that we pay for two choices sold together should be equal to the sum of the prices that we pay for two choices taken separately, i.e. that f(a¯+b¯,a¯+b¯)=f(a¯,a¯)+f(b¯,b¯). This property is known as additivity.

The usual derivation of Hurwicz criterion. Now, we are ready to describe the usual derivation of Hurwicz criterion.

Definition 1:

  • By a value function, we mean a function f(m¯,m¯) that assigns to each pair (m¯,m¯) of real numbers for which m¯m¯ and a real number f(m¯,m¯) for which m¯f(m¯,m¯)m¯.

  • We say that a value function f(m¯,m¯) is monotonic if whenever m¯m¯ and m¯m¯, then f(m¯,m¯)f(m¯,m¯).

  • We say that a value function f(m¯,m¯) is additive if for all possible values a¯a¯ and b¯b¯, we have f(a¯+b¯,a¯+b¯)=f(a¯,a¯)+f(b¯,b¯).

  • We say that a value function f(m¯,m¯) has a Hurwicz form if it has the form f(m¯,m¯)=α·m¯+(1α)·m¯ for some α ∈ [0,1].

Proposition 1. For a value function f(m¯,m¯), the following two conditions are equivalent to each other:

  • the value function is monotonic and additive; and

  • the value function has the Hurwicz form.

Proof. It is easy to prove that a Hurwicz-form value function is monotonic and additive.

Vice versa, let us assume that a value function f(m¯,m¯) is monotonic and additive. Let us denote α=deff(0,1).

Because of additivity, for every natural number n, we have:

[0,1/n]++[0,1/n](n times)=[0,1],
thus
f(0,1/n)++f(0,1/n)(n times)=n·f(0,1/n)=f(0,1)=α,
hence f(0,1/n) = α · (1/n).

Similarly, for every m and n, we have:

f(0,m/n)=f(0,1/n)++f(0,1/n)(m times)=m·f(0,1/n)=α·(m/n).

For every real number r, we have m/n r ≤ (m + 1)/n, where m=defr·n. Thus, owing to monotonicity, we have f(0,m/n) ≤ f(0,r) ≤ f(0,(m + 1)/n), i.e. α · (m/n) ≤ f(0,r) ≤ α · (m + 1)/n. Here, 0 ≤ rm/n ≤ 1/n, so in the limit n → ∞, we have m/nr and (m + 1)/nr. Thus, the above inequality leads to f(0,r) = α · r.

In particular, for every m¯m¯, we have f(0,m¯m¯)=α·(m¯m¯). By the property of a value function, we have m¯f(m¯,m¯)m¯, i.e. f(m¯,m¯)=m¯. Thus, owing to additivity:

f(m¯,m¯)=f(m¯+0,m¯+(m¯m¯))=f(m¯,m¯)+f(0,m¯m¯)=m¯+α·(m¯m¯).

One can easily check that this is indeed the Hurwicz expression.

Limitations. The previously mentioned motivations are reasonably reasonable, but they may not be 100% convincing.

Indeed, we argued that if the worst-case scenario is possible for each of the two situations, then it is possible that we have the worst-case scenario in both situations. This may sound reasonable, but it is not in full agreement with common sense. Indeed, e.g. when we fly from point A to point B, we understand:

  • that there may an unexpected delay at the airport A;

  • that a plane may have a problem in flight and we will have to get back; and

  • that there may a problem at the airport B and we will get stuck on the plane.

But, we honestly do not believe that all these low-probable disasters will happen at the same – this only happens in comedies describing lovable losers who always get into trouble.

We can raise another issue about the additivity requirement that additivity assumes that for the combination of two items, we always pay the same price as for the two items separately. Sometimes, this is true, but often, this is not true: there are discounts if you buy several items (or several objects of the same type) at the same time.

What should we do? As the arguments that we used previously to justify the assumptions are not 100% convincing, maybe we can find somewhat more convincing arguments in favor of Hurwicz formula or, alternatively, maybe these more convincing arguments can lead us to a different formula?

This is what we will analyze in Section 3.

3. Monetary case: new, hopefully more convincing, derivation of the Hurwicz criterion

Shift-invariance. Suppose that we offer a user a package deal in which he/she gets m dollars cash and an alternative in which he/she gets between m¯ and m¯. The equivalent value for the interval-value alternative is f(m¯,m¯), so the overall value for this package is m+f(m¯,m¯).

On the other hand, if we consider this a package deal, then in this deal, we get any amount between m+m¯ and m+m¯. Thus, the value of this package deal should be equal to f(m+m¯,m+m¯). It is reasonable to require that these two valuations should lead to the same result, i.e. that we should have m+f(m¯,m¯)=f(m+m¯,m+m¯). In mathematical terms, this property is known as shift-invariance.

Discussion. At first glance shift-invariant is very similar to additivity. Indeed, it can be viewed as a particular case of additivity, in which the first interval is simply the interval [m, m] consisting of a single number m.

But good news is that both previously mentioned objections to general additivity do not apply here. Indeed, we are not talking about a combination of rare events, so the first objection is not applicable. The second objection is also not applicable, as although we may expect a discount if we buy two big bottles of milk, no one expects a discount if we buy a bottle of milk and a fixed amount of money (e.g. when we ask to change a big banknote when paying).

Need for additional assumptions. If we limit ourselves only to shift-invariance, we will get too many possibilities in addition to Hurwicz formula: specifically, one can see that we can have a more general expression:

f(m¯,m¯)=m¯+F(m¯m¯),
where F(z) is a monotonic function defined for all z ≥ 0 for which F(z) ≤ z for all z – e.g. F(z) = z/(1 + z) (by the way, it is possible to show that the above expression is the most general form of a monotonic shift-invariant value function).

To narrow down the class of possible value functions, we need to make additional reasonable assumptions. We will describe one such assumption right away.

A new assumption – transitivity. Let us start with the same interval [0,1] with which we started the proof of Proposition 1. Similarly to this proof, let us denote the value f(0, 1) corresponding to this interval by α.

What can we conclude that from the fact that f(0,1) = α? Well, owing to shift invariance, we can conclude that for every x, we have f(x,1 + x) = α + x. From the mathematical viewpoint, this is all that we can conclude. However, from the common sense viewpoint, we can make yet another conclusion.

Indeed, e.g. for each x from the interval [0,1], the alternative corresponding to the interval [x,1 + x] is equivalent to getting a monetary amount α + x: [x,1 + x] ≡ α + x. If we do not know which of these intervals the alternative corresponds to, but we know that it corresponds to one of these alternatives, this means that the actual gain can take any value from the union of these intervals. Each of these intervals is equivalent to the value α + x, thus, the union of these intervals is equivalent to the set of all possible values α + x when x ∈ [0,1]:

x[0,1][x,1+x]{α+x:x[0,1]}

Let us estimate the left-hand side and the right-hand side of this equality.

  • The smallest possible value in the left-hand side is when we take the smallest value from the interval [x,1 + x], i.e. the value x – for the smallest possible value x from the interval [0,1] (i.e., for the value x = 0). Thus, the smallest possible value in the left-hand side is equal to 0.

  • The largest possible value in the left-hand side is when we take the largest value from the interval [x,1 + x], i.e. the value 1 + x – for the largest possible value x from the interval [0,1] (i.e., for the value x = 1). Thus, the largest possible value in the left-hand side is equal to1 + 1 = 2.

So, the left-hand side of the previously mentioned equality is the interval [0,2].

Similarly:

  • The smallest possible value in the right-hand side is when we take the smallest possible value x from the interval [0,1], i.e. the value x = 0. Thus, the smallest possible value in the right-hand side is equal to α + 0 = α.

  • The largest possible value in the right-hand side is when we take the largest possible value x from the interval [0,1], i.e. the value x = 1. Thus, the smallest possible value in the right-hand side is equal to α + 1.

So, the left-hand side of the above equality is the interval [α, 1 + α].

Thus, the previously mentioned equivalent takes the form [0,2] ≡ [α,1 + α]. Good news is that we already known – as a particular case of shift-invariance – that the interval [α, 1 + α] is equivalent to the value α + α = 2α. Thus, by transitivity of equivalence, we conclude that the interval [0,2] is equivalent to 2α, i.e. that f(0,2) = 2α. Then, by shift-invariance, we will get f(x,2+x) = 2α + x for each x.

By similarly combining intervals [x,1 + x] corresponding to x ∈ [0,2], we conclude that [0,3] ≡ [α,2 + α], and as we already know that [α,2 + α] ≡ 2α + α, by transitivity, we will have f(0,3) = 3α.

Instead of stacking intervals of width 1, we could similarly stack intervals of a different width w.

New derivation of Hurwicz formula. It turns out that this way, we can indeed get a new derivation of Hurwicz formula. Let us describe all this in precise terms.

Definition 2.

  • We say that a value function f(m¯,m¯) is shift-invariant if for every m and for all m¯m¯, we have m+f(m¯,m¯)=f(m+m¯,m+m¯).

  • We say that a value function is transitive if for each w and for all m¯m¯, we have f(¯,¯)=f(r¯,r¯), where:

    [¯,¯]=defm[m¯,m¯][m,w+m]

and

[r¯,r¯]=def{f(m,m+w):m[m¯,m¯]}.

Comment: In this definition, we only described transitivity for the case when all combined intervals have the exact same width. Our main motivation for this restriction is that, as we will show, only such transitivity is needed – and in derivations, it is always desirable to avoid unnecessarily general assumptions and to limit ourselves only to weakest possible assumptions – weakest possible among those that will lead to the desired derivation.

There is another reason for this limitation: as we how later in this section, if we generalize this property too much, then there will be no realistic value function at all that would satisfy thus generalized property.

Proposition 2. For a value function f(m¯,m¯), the following two conditions are equivalent to each other:

  • The value function is monotonic, shift-invariant and transitive.

  • The value function has the Hurwicz form.

Proof. Similarly to our above arguments, we can see that [¯,¯]=[m¯,m¯+w], so f(¯,¯)=f(m¯,m¯+w).

Owing to the monotonicity of the value function, we have:

[r¯,r¯]=[f(m¯,m¯),f(m¯+w,m¯+w)]

Owing to shift-invariance, we have:

[r¯,r¯]=[f(m¯,m¯),w+f(m¯+w,m¯+w)]
so, again owing to shift-invariance – this time in relation to a shift by f(m¯,m¯) – we get [r¯,r¯]=f(m¯,m¯)+[0,w].

Thus, again owing to shift-invariance, f(r¯,r¯)=f(m¯,m¯)+f(0,w). Therefore, transitivity means that:

f(m¯,m¯+w)=f(m¯,m¯)+f(0,w)

One can easily see that the Hurwicz formula is shift-invariant and satisfies the previously mentioned property for all w and for all m¯m¯.

Vice versa, let us assume that we have a value function that satisfies this property for all w and for all m¯m¯. In particular, for m¯=0, this means that f(0,m¯+w)=f(0,m¯)+f(0,w). This is exactly the particular case of the additivity property that we used (as well as monotonicity) in the proof of Proposition 1 to prove that f(0, r) = α · r for all real numbers r. From this formula, in that proof, we used, in effect, shift-invariance to prove that the Hurwicz formula is indeed true for all m¯m¯. As we still assume shift-invariance, this means that we have a derivation of the Hurwicz formula in this case as well.

The proposition is proven.

Discussion: we cannot generalize the transitivity property too much. Let us show that the transitivity assumption cannot be realistically generalized too much, to cases when united intervals have different widths.

Definition 3: We say that a value function is fully transitive if for each family of intervals {[m¯(a),m¯(a)]}aA for which both sets aA[m¯(a),m¯(a)] and {f(m¯(a),m¯(a)):aA} are intervals, we have f(¯,¯)=f(r¯,r¯), where we denoted

[¯,¯]=aA[m¯(a),m¯(a)]
and
[r¯,r¯]={f(m¯(a),m¯(a)):aA}.

Proposition 3. For a value function f(m¯,m¯), the following two conditions are equivalent to each other:

  • The value function is monotonic, shift-invariant and fully transitive.

  • The value function has the Hurwicz form with α = 0 or α = 1.

Discussion. So, full transitivity is satisfied only in the two extreme (and unrealistic) cases:

  1. when α = 0 – the case of full pessimism; and

  2. when α = 1 – the case of full optimism.

Proof. One can easily check that both extreme value functions f(m¯,m¯)=m¯ (that corresponds to α = 0) and f(m¯,m¯)=m¯ (that corresponds to α = 1) are fully transitive.

Let us prove that, vice versa, every monotonic shift-invariant and fully transitive value function coincides with one of the two extreme functions. Indeed, as the general condition should be satisfied for all possible families of intervals [m¯(a),m¯(a)], in particular, it should be satisfied for all the families from Definition 2. Thus, owing to Proposition 2, the value function should have the Hurwicz form.

Now, for the family [0,a], where aA = [0,1], the union [¯,¯] is simply equal to [0,1], so f(¯,¯)=f(0,1)=α.

On the other hand, here, f(0,a) = α · a, so:

[r¯,r¯]={α·a:a[0,1]}=[0,α],
thus f(r¯,r¯)=α·α=α2. Thus, the generalized transitivity is satisfied only when α = α2, i.e. when either α = 0 or α = 1.

The proposition is proven.

4. What is utility and what are the properties of utility: a brief reminder

What is utility. To apply computer-based number-oriented tools for making decisions in a nonmonetary case, we need to describe the user’s preferences in numerical terms. In decision theory (Fishburn, 1969; Luce and Raiffa, 1989; Raiffa, 1997; Nguyen et al., 2009; Kreinovich, 2014), this is done as follows.

Let us select the two extreme alternatives:

  • a very bad alternative A which is worse than anything that we will actually encounter; and

  • a very good alternative A+ which is better than anything that we will actually encounter.

For each real number p from the interval [0, 1], we can form a lottery – we will denote this lottery by L(p) – in which:

  • we get the very good alternative A+ with probability p; and

  • we get the very bad alternative A with the remaining probability 1 − p.

To find how valuable is each alternative A for the decision-maker, we ask him/her to compare the alternative A with lotteries L(p) corresponding to different probabilities p. Here:

  • when p is small, close to 0, the lottery L(p) is similar to the very bad alternative A and is, thus, worse than A; we will denote this by A < A; and

  • when p is close to 1, the lottery L(p) is similar to the very good alternative A+ and is, thus, better than A: A < L(p).

Also, the smaller the probability p of getting a very good alternative, the worse the lottery L(p). Thus:

  • if L(p) < A and p′ < p, then L(p′) < A; and

  • if A < L(p) and p < p′, then A < L(p′).

Thus, similarly to the monetary case, there exists a threshold value:

sup{p:L(p)<A}=inf{p:A<L(p)};
We will denote this threshold value by u(A). This threshold value is known as the utility of the alternative A.

Similarly to the monetary case, for every ε > 0, we have L(u(A) – ε) A < L(u(A) + ε). This is true for arbitrarily small ε, in particular, for the values ε for which the difference in probabilities among u(A) – ε, u(A), and u(A) + ε are practically unnoticeable. So, we can conclude that from the practical viewpoint, the alternative A is equivalent to the lottery L(u(A)). We will denote this equivalence by AL(u(A)).

Utility is defined modulo a linear transformation. The numerical value of the utility u(A) depends not only on the alternative A, it also depends on which pair (A,A+) we select. What if we select a different pairs (A,A+) – e.g. a pair for which A < A < A+ < A+? How will that change the numerical value of utility?

If an alternative A has utility u′(A) with respect to the pair (A,A+), this means that this alternative is equivalent to the lottery L′(u′(A)), in which:

  • we get A+ with probability u′(A); and

  • we get A with the remaining probability1 – u′(A).

As A < A < A+, we can find a utility value u(A) for which the alternative A is equivalent to the lottery L(u(A)), in which:

  • we select A+ with probability u(A); and

  • we select A with probability 1 – u(A).

Similarly, we have A+L(u(A+)). Thus, the original alternative A is equivalent to a two-stage lottery, in which:

  • first, we select either A+ (with probability u(A′)) or A (with probability 1 – u(A′));

  • then, we select either A+ or A with probabilities depending on what we selected on the first stage: if we selected A+ on the first stage, then we select A+ with probability u(A+) and A with probability1 – u(A+), and if we selected A on the first stage, then we select A+ with probability u(A) and A with probability 1 – u(A).

As a result of this two-stage lottery, we get either A+ or A, and the probability of selecting A+ is equal to:

u(A)·u(A+)+(1u(A))·u(A).

By definition, this probability is the utility u(A) of the alternative A with respect to the pair (A, A+), thus:

u(A)=u(A)·u(A+)+(1u(A))·u(A).

The right-hand side is a linear expression in terms of u′(A). So, we conclude that utilities corresponding to different pairs can be obtained from each other by a linear transformation.

In other words, the numerical value of the utility is defined modulo a generic linear transformation – just like the numerical value of time and temperature, where the corresponding linear transformations mean selecting a different starting point and/or a different measuring unit.

5. Utility case: usual derivation of the Hurwicz criterion and the limitations of this derivation

Formulation of the problem. As we have mentioned earlier, in many practical situations, we do not know the exact consequence of each action, and thus, we do not know the exact value of the corresponding utility. Instead, for such situations, we only know the interval [u¯,u¯] of possible utility values. According to the general idea of utility, to describe the decision-maker’s preferences for such interval-valued situations, we must assign, to each such interval, an appropriate utility value. Similarly to the monetary case, we will denote this utility value by f(u¯,u¯), and we will call the corresponding function a value function. Clearly, we must have u¯f(u¯,u¯)u¯, and clearly, if u¯ and/or u¯ increase, the interval-valued alternative becomes better, i.e. the value function should be monotonic.

What are other natural properties of the value function?

We cannot reuse assumptions from the monetary case. We cannot simply use the same properties as in the monetary case. For example, additivity makes no sense:

  • it makes perfect sense to add dollar amounts; but

  • it makes no sense to add probabilities (and utilities, as we have explained, are probabilities).

So, we need alternative assumptions.

Assumptions used in the usual derivation of the utility-case Hurwicz formula. As utility is defined modulo a general linear transformation, it makes sense to require that the formulas transforming the bounds u¯ and u¯ into an equivalent utility should remain the same if we linearly “rescale” all utility values. In particular:

  • If we have f(u¯,u¯)=u, then after shifting all the utility values by u0 we should retain the same relation between the shifted utilities u¯=u¯+u0, u¯=u¯+u0 and u′ = u + u0: f(u¯,u¯)=u.

  • Similarly, if we have f(u¯,u¯)=u, then after rescaling all the utility values by a factor c > 0, we should retain the same relation between the scaled utilities u¯=c·u¯,u¯=c·u¯ and u′ = c·u: f(u¯,u¯)=u.

In the shift case, if we substitute the values u¯=u¯+u0,u¯=u¯+u0, and u=u+u0=f(u¯,u¯)+u0 into the desired equality f(u¯,u¯)=u, we get the requirement f(u¯+u0,u¯+u0)=f(u¯,u¯)+u0. One can see that this is exactly the property that we called shift-invariance.

In the rescaling case, if we substitute the values u¯=c·u¯,u¯=c·u¯, and u=c·u=c·f(u¯,u¯) into the desired equality f(u¯,u¯)=u, we get the requirement f(c·u¯,c·u¯)=c·f(u¯,u¯). We will call this property scale-invariance.

Definition 4: We say that a value function f(u¯,u¯) is scale-invariant if for every c > 0 and for all u¯u¯, we have f(c·u¯,c·u¯)=c·f(u¯,u¯).

Proposition 4. For a value function f(u¯,u¯), the following two conditions are equivalent to each other:

  • The value function is monotonic, shift-invariant and scale-invariant.

  • The value function has the Hurwicz form.

Proof. It is easy to check that the Hurwicz formula is monotonic, shift-invariant and scale-invariant. Let us show that, vice versa, every monotonic value function which is shift- and scale-invariant has the Hurwicz form.

Indeed, as in the proof of Proposition 1, let us denote α=deff(0,1). For all u¯<u¯, owing to shift-invariance with u0=u¯, we have f(u¯,u¯)=u¯+f(0,u¯u¯). Now, owing to scale-invariance with c=u¯u¯, we get f(0,u¯u¯)=(u¯u¯)·f(0,1)=(u¯u¯)·α.

Thus, f(u¯,u¯)=u¯+f(0,u¯u¯)=u¯+(u¯u¯)·α, which is exactly the Hurwicz formula.

The proposition is proven.

Limitations. Shift-invariance is indeed reasonable, but scale-invariance is not fully convincing. Yes, indeed, we can have different units for measuring utility – just like we can use different units for measuring money, but it is not very convincing to expect that people will make the same choices involving US$100 as in situations involving 100 Pesos (which at present, in 2020, represents a 20 times smaller amount of money).

It is therefore desirable to replace scale-invariance with a more convincing assumption.

6. Utility case: new, hopefully more convincing, derivation of the Hurwicz criterion

We cannot simply dismiss scale-invariance. We cannot simply dismiss scale-invariance and keep only shift-invariance: we already considered this scenario when discussing the monetary case, and we showed that in this case, there are too many values functions satisfying these requirements.

So, we need additional assumptions – assumptions which are more convincing that scale-invariance.

What we propose. What we propose is the previously described transitivity property. The arguments in favor of this property apply verbatim to the utility case. And we already know – from Proposition 2 – that if we require shift-invariance and transitivity, then the only value functions we get are Hurwicz ones.

Thus, indeed, we get a new, (hopefully) more convincing, derivation of the Hurwicz criterion in the utility case as well.

References

Fishburn, P.C. (1969), Utility Theory for Decision Making, John Wiley and Sons, New York, NY.

Hurwicz, L. (1951), “Optimality criteria for decision making under ignorance”, Cowles Commission Discussion Paper, Statistics, No. 370.

Kreinovich, V. (2014), “Decision making under interval uncertainty (and beyond)”, in Guo, P. and Pedrycz, W. (Eds), Human-Centric Decision-Making Models for Social Sciences, Springer Verlag, pp. 163-193.

Luce, R.D. and Raiffa, R. (1989), Games and Decisions: Introduction and Critical Survey, Dover, New York, NY.

Nguyen, H.T., Kosheleva, O. and Kreinovich, V. (2009), “Decision making beyond arrow’s ‘impossibility theorem’, with the analysis of effects of collusion and mutual attraction”, International Journal of Intelligent Systems, Vol. 24 No. 1, pp. 27-47.

Raiffa, H. (1997), Decision Analysis, McGraw-Hill, Columbus, OH.

Further reading

Kreinovich, V. (2017), “Decision making under interval (and more general) uncertainty: monetary vs utility approaches”, Journal of Computational Technologies, Vol. 22 No. 2, pp. 37-49.

Acknowledgements

This work was supported in part by the US National Science Foundation grants 1623190 (A Model of Change for Preparing a New Generation for Professional Practice in Computer Science) and HRD-1242122 (Cyber-ShARE Center of Excellence).

Corresponding author

Vladik Kreinovich can be contacted at: vladik@utep.edu

Related articles