Skip to main content

Rank dependent expected utility

Prospect theory is most well known for its assumption that gains are treated differently to losses. Another crucial part of the theory, namely that probabilities are weighted, typically attracts much less attention. Recent evidence, however, is suggesting that probability weighting has a crucial role to play in many applied settings. So, what is probability weighting and why does it matter?

The basic idea of probability weighting is that people tend to overestimate the likelihood of events that happen with small probability and underestimate the likelihood of events that happen with medium to large probability. In their famous paper on 'Advances in prospect theory', Amos Tversky and Daniel Kahneman quantified this effect. They fitted experiment data to equation


where Î³ is a parameter to be estimated. In interpretation, p is the actual probability and Ï€(p) the weighted probability. The figure below summarizes the kind of effect you get. Tversky and Kahneman found that a value of Î³ around 0.61 best matched the data. This means that something which happens with probability 0.1 gets a decision weight of around 0.2 (overweighting of small probabilities) while something that happens with probability 0.5 gets a decision weight of only around 0.4 (underweighting of medium to large probabilities).  



Why we observe this warping of probabilities is unclear. But the consequences for choice can be important. To see why consider someone deciding whether to take on a gamble. Their choice is either to accept £10 for certain or gamble and have a 10% chance of winning £90 and a 90% chance of winning nothing. The expected value of this gamble is 0.1 x 90 = £9. So, it does not look like a good deal. But, if someone converts a 10% probability into a decision weight of 0.2 we get value 0.2 x 90 = £18. Suddenly the gamble looks great! Which might explain the appeal of lottery tickets.

There is, though, a problem. It is not enough to simple weight all probabilities. This, as I will shortly explain, doesn't work. So, we need some kind of trick. While prospect theory was around in 1979 it was not until the early 1990's that the trick was found. That trick is rank dependent weighting. The gap of over 10 years in finding a way to deal with probabilities may help explain why probability weighting has had to play second fiddle to loss aversion. Lets, though, focus on the technical details.

Consider the example. Here there are no obvious problems if we just weight probabilities. The 10% chance of winning is converted into a 0.2 decision weight while the 90% chance of losing is converted into a 0.7 decision weight. The overall expected value is then 0.2 x £90 = £18. Everything looks fine.

So, consider another example. Suppose that the sure £10 is now a gamble with a 10% chance of winning £10.09, a 10% chance of winning £10.08, a 10% chance of winning £10.07, and so on, down to a 10% chance of winning £10. If we just simply weight all these 10% probabilities as 0.2 then we get expected value of 0.2 x 10.09 + 0.2 x 10.08 + ... + 0.2 x 10 = £20.09. This is absurd. A gamble that essentially gives £10 cannot be worth over £20! You might say that the problem here is we have ended up with a combined weight of 2. If, though, we normalize weights to 1 we will not have captured the over-weighting of small probabilities. So, normalizing is not, of itself, a solution. 

The problem with the preceding approach is that we have weighted everything - good or bad - by the same amount. Rank dependent probability does away with that. Here we rank outcomes from best to worst. The decision weight we place on an outcome is then the weighted probability of the outcome or something better minus the weighted probability of something better

In our original gamble the best outcome is £90 and the worst is £0. The weight we put on £90 is around 0.2 because there is 10% chance of £90, no chance of anything better, and a 10% probability is given weight 0.2. The weight we put on £0 is 0.8 because it is the weighted probability of £0 or better, namely 1, minus the weighted probability of £90, namely 0.2. So, not much changes in this example.

In the £10 gamble the best outcome is £10.09, the next best £10.08, and so on. The decision weight we but on £10.09 is around 0.2 because there is a 10% chance of £10.09 and no chance of anything better. Crucially, the weight we put on £10.08 is only around 0.1 because we have the weighted probability of £10.08 or better, a 20% chance that gives weight around 0.3, minus the weighted probability of £10.09, around 0.2. You can verify that the chance of winning £10.07, £10.06 and so on has an even lower decision weight. Indeed, decision weights have to add to 1 and so the high weight on £10.09 is compensated by a lower weight on other outcomes. For completeness the table below gives the exact weights you would get with the Tversky and Kahneman parameters. Given that decision weights have to add to 1 the expected value is going to be around £10. Common sense restored!




Generally speaking, rank dependent weighting means that we capture, and only capture, over-weighting of the extreme outcomes. So, we capture the fact a person may be overly optimistic about winning £90 rather than £0 without picking up the perverse prediction that every unlikely event is over-weighted. The discussion so far has focused on gains but we can do the same thing with losses. Here we want to capture, and only capture, over-weighting of the worst outcomes. 

So why does all this matter? There is mounting evidence that weighting of probabilities can explain a lot of behavior, including the equity premium puzzle, long shot bias in betting and willingness of households to buy insurance at highly unfavorable premiums. For a review of the evidence see the article by Helga Fehr-Duda and Thomas Epper on 'Probability and risk: Foundations and economic implications of probability-dependent risk preferences'. It is easy to see, for instance, why overweighting of small probabilities could have potentially profound implications for someone's view of insurance. A very small probability of loss may be given a much higher decision weight. That makes insurance look like a good deal.  

Comments

Popular posts from this blog

Revealed preference, WARP, SARP and GARP

The basic idea behind revealed preference is incredibly simple: we try to infer something useful about a person's preferences by observing the choices they make. The topic, however, confuses many a student and academic alike, particularly when we get on to WARP, SARP and GARP. So, let us see if we can make some sense of it all.           In trying to explain revealed preference I want to draw on a  study  by James Andreoni and John Miller published in Econometrica . They look at people's willingness to share money with another person. Specifically subjects were given questions like:  Q1. Divide 60 tokens: Hold _____ at $1 each and Pass _____ at $1 each.  In this case there were 60 tokens to split and each token was worth $1. So, for example, if they held 40 tokens and passed 20 then they would get $40 and the other person $20. Consider another question: Q2. Divide 40 tokens: Hold _____ at $1 each and Pass ______ at $3 each. In this case each token given to th

Nash bargaining solution

Following the tragic death of John Nash in May I thought it would be good to explain some of his main contributions to game theory. Where better to start than the Nash bargaining solution. This is surely one of the most beautiful results in game theory and was completely unprecedented. All the more remarkable that Nash came up with the idea at the start of his graduate studies!          The Nash solution is a 'solution' to a two-person bargaining problem . To illustrate, suppose we have Adam and Beth bargaining over how to split some surplus. If they fail to reach agreement they get payoffs €a and €b respectively. The pair (a, b) is called the disagreement point . If they agree then they can achieve any pair of payoffs within some set F of feasible payoff points . I'll give some examples later. For the problem to be interesting we need there to be some point (A, B) in F such that A > a and B > b. In other words Adam and Beth should be able to gain from agreeing.

Some estimates of price elasticity of demand

In the  textbook on Microeconomics and Behaviour with Bob Frank we have some tables giving examples of price, income and cross-price elasticities of demand. Given that most of the references are from the 70's I'm working on an update for the forthcoming 3rd edition. So, here is a brief overview of where the numbers come from for the table on price elasticity of demand. Suggestions for other good sources much appreciated. Before we get into the numbers - the disclaimer. Price elasticities are tricky things to tie down. Suppose you want the price elasticity of demand for cars. This elasticity is likely to be different for rich or poor people, people living in the city or the countryside, people in France or Germany etc.etc. You then have to think if you want the elasticity for buying a car or using a car (which includes petrol, insurance and so on). So, there is no such thing as the price elasticity of demand for cars. Moreover, the estimated price elasticity will depend o