Regret (decision theory)

{{Short description|Measure of value difference between best possible decision and made decision}}

In decision theory, regret aversion (or anticipated regret) describes how the human emotional response of regret can influence decision-making under uncertainty. When individuals make choices without complete information, they often experience regret if they later discover that a different choice would have produced a better outcome. This regret can be quantified as the difference in value between the actual decision made and what would have been the optimal decision in hindsight.

Unlike traditional models that consider regret as merely a post-decision emotional response, the theory of regret aversion proposes that decision-makers actively anticipate potential future regret and incorporate this anticipation into their current decision-making process. This anticipation can lead individuals to make choices specifically designed to minimize the possibility of experiencing regret later, even if those choices are not optimal from a purely probabilistic expected-value perspective.

Regret is a powerful negative emotion with significant social and reputational implications, playing a central role in how humans learn from experience and in the psychology of risk aversion. The conscious anticipation of regret creates a feedback loop that elevates regret from being simply an emotional reaction—often modeled as mere human behavior—into a key factor in rational choice behavior that can be formally modeled in decision theory.

This anticipatory mechanism helps explain various observed decision patterns that deviate from standard expected utility theory, including status quo bias, inaction inertia, and the tendency to avoid decisions that might lead to easily imagined counterfactual scenarios where a better outcome would have occurred.

Description

Regret theory is a model in theoretical economics simultaneously developed in 1982 by Graham Loomes and Robert Sugden,{{cite journal |last1=Loomes |first1=G. |last2=Sugden |first2=R. |year=1982 |title=Regret theory: An alternative theory of rational choice under uncertainty |journal=Economic Journal |volume=92 |issue=4 |pages=805–824 |doi=10.2307/2232669 |jstor=2232669 }} David E. Bell,{{cite journal |last=Bell |first=D. E. |year=1982 |title=Regret in decision making under uncertainty |journal=Operations Research |volume=30 |issue=5 |pages=961–981 |doi=10.1287/opre.30.5.961 }} and Peter C. Fishburn.{{cite book |last=Fishburn |first=P. C. |year=1982 |title=The Foundations of Expected Utility |series=Theory & Decision Library |isbn=90-277-1420-7 }} Regret theory models choice under uncertainty taking into account the effect of anticipated regret. Subsequently, several other authors improved upon it.{{cite journal |last1=Diecidue |first1=E. |last2=Somasundaram |first2=J. |year=2017 |title=Regret Theory: A New Foundation |journal=Journal of Economic Theory |volume=172 |pages=88–119 |doi=10.1016/j.jet.2017.08.006 |s2cid=36505167 }}

It incorporates a regret term in the utility function which depends negatively on the realized outcome and positively on the best alternative outcome given the uncertainty resolution. This regret term is usually an increasing, continuous and non-negative function subtracted to the traditional utility index. These type of preferences always violate transitivity in the traditional sense,{{cite journal |last1=Bikhchandani |first1=S. |last2=Segal |first2=U. |year=2011 |title=Transitive Regret |journal=Theoretical Economics |volume=6 |issue=1 |pages=95–108 |doi=10.3982/TE738 |doi-access=free |hdl=10419/150148 |hdl-access=free }} although most satisfy a weaker version.

For independent lotteries and when regret is evaluated over the difference between utilities and then averaged over the all combinations of outcomes, the regret can still be transitive but for only specific form of regret functional. It is shown that only hyperbolic sine function will maintain this property.{{cite journal |last1=Bardakhchyan |first1=V. |last2=Allahverdyan |first2=A. |year=2023 |title=Regret theory, Allais’ paradox, and Savage’s omelet |journal= Journal of Mathematical Psychology |volume=117 |doi=10.1016/j.jmp.2023.102807|arxiv=2301.02447 }} This form of regret inherits most of desired features, such as holding right preferences in face of first order stochastic dominance, risk averseness for logarithmic utilities and the ability to explain Allais paradox.

Regret aversion is not only a theoretical economics model, but a cognitive bias occurring as a decision has been made to abstain from regretting an alternative decision. To better preface, regret aversion can be seen through fear by either commission or omission; the prospect of committing to a failure or omitting an opportunity that we seek to avoid.{{cite web |title=Why do we anticipate regret before we make a decision? |url=https://thedecisionlab.com/biases/regret-aversion |website=The Decision Lab}} Regret, feeling sadness or disappointment over something that has happened, can be rationalized for a certain decision, but can guide preferences and can lead people astray. This contributes to the spread of disinformation because things are not seen as one's personal responsibility.

Evidence

Several experiments over both incentivized and hypothetical choices attest to the magnitude of this effect.

Experiments in first price auctions show that by manipulating the feedback the participants expect to receive, significant differences in the average bids are observed.{{cite journal |last1=Filiz-Ozbay |first1=E. |last2=Ozbay |first2=E. Y. |year=2007 |title=Auctions with anticipated regret: Theory and experiment |journal=American Economic Review |volume=97 |issue=4 |pages=1407–1418 |doi=10.1257/aer.97.4.1407 |s2cid=51815774 }} In particular, "Loser's regret" can be induced by revealing the winning bid to all participants in the auction, and thus revealing to the losers whether they would have been able to make a profit and how much could it have been (a participant that has a valuation of $50, bids $30 and finds out the winning bid was $35 will also learn that he or she could have earned as much as $15 by bidding anything over $35.) This in turn allows for the possibility of regret and if bidders correctly anticipate this, they would tend to bid higher than in the case where no feedback on the winning bid is provided in order to decrease the possibility of regret.

In decisions over lotteries, experiments also provide supporting evidence of anticipated regret.{{cite journal |last1=Zeelenberg |first1=M. |last2=Beattie |first2=J. |last3=Van der Pligt |first3=J. |last4=de Vries |first4=N. K. |year=1996 |title=Consequences of regret aversion: Effects of expected feedback on risky decision making |journal=Organizational Behavior and Human Decision Processes |volume=65 |issue=2 |pages=148–158 |doi=10.1006/obhd.1996.0013 |url=http://dare.uva.nl/personal/pure/en/publications/consequences-of-regret-aversion-effect-of-expected-feedback-on-risky-decision-making(795bf97f-53f9-4641-86a2-a0635c537491).html }}{{cite journal |last1=Zeelenberg |first1=M. |last2=Beattie |first2=J. |year=1997 |title=Consequences of regret aversion 2: Additional evidence for effects of feedback on decision making |journal=Organizational Behavior and Human Decision Processes |volume=72 |issue=1 |pages=63–78 |doi=10.1006/obhd.1997.2730 |url=https://research.tue.nl/nl/publications/consequences-of-regret-aversion-2-additional-evidence-for-effects-of-feedback-on-decision-making(f302d547-7f43-4a9b-96c7-ac50527548d9).html }}{{cite journal |last1=Somasundaram |first1=J. |last2=Diecidue |first2=E. |year=2016 |title=Regret theory and risk attitudes |journal=Journal of Risk and Uncertainty |volume=55 |issue=2–3 |pages=1–29 |doi=10.1007/s11166-017-9268-9 |s2cid=254978441 }} As in the case of first price auctions, differences in feedback over the resolution of the uncertainty can cause the possibility of regret and if this is anticipated, it may induce different preferences.

For example, when faced with a choice between $40 with certainty and a coin toss that pays $100 if the outcome is guessed correctly and $0 otherwise, not only does the certain payment alternative minimizes the risk but also the possibility of regret, since typically the coin will not be tossed (and thus the uncertainty not resolved) while if the coin toss is chosen, the outcome that pays $0 will induce regret. If the coin is tossed regardless of the chosen alternative, then the alternative payoff will always be known and then there is no choice that will eliminate the possibility of regret.

= Anticipated regret versus experienced regret =

Anticipated regret tends to be overestimated for both choices and actions over which people perceive themselves to be responsible.{{Cite journal|title = Looking Forward to Looking Backward The Misprediction of Regret|journal = Psychological Science|date = 2004-05-01|issn = 0956-7976|pmid = 15102146|pages = 346–350|volume = 15|issue = 5|doi = 10.1111/j.0956-7976.2004.00681.x|language = en|first1 = Daniel T.|last1 = Gilbert|first2 = Carey K.|last2 = Morewedge|first3 = Jane L.|last3 = Risen|first4 = Timothy D.|last4 = Wilson|citeseerx = 10.1.1.492.9980| s2cid=748553 }}{{Cite journal|title = Biased Forecasting of Postdecisional Affect|journal = Psychological Science|date = 2007-08-01|issn = 0956-7976|pmid = 17680936|pages = 678–681|volume = 18|issue = 8|doi = 10.1111/j.1467-9280.2007.01958.x|language = en|first1 = Nick|last1 = Sevdalis|first2 = Nigel|last2 = Harvey| s2cid=7524552 }} People are particularly likely to overestimate the regret they will feel when missing a desired outcome by a narrow margin. In one study, commuters predicted they would experience greater regret if they missed a train by 1 minute more than missing a train by 5 minutes, for example, but commuters who actually missed their train by 1 or 5 minutes experienced (equal and) lower amounts of regret. Commuters appeared to overestimate the regret they would feel when missing the train by a narrow margin, because they tended to underestimate the extent to which they would attribute missing the train to external causes (e.g., missing their wallet or spending less time in the shower).

Applications

Besides the traditional setting of choices over lotteries, regret aversion has been proposed as an explanation for the typically observed overbidding in first price auctions,{{cite journal |last=Engelbrecht-Wiggans |first=R. |year=1989 |title=The effect of regret on optimal bidding in auctions |journal=Management Science |volume=35 |issue=6 |pages=685–692 |doi=10.1287/mnsc.35.6.685 |hdl=2142/28707 |hdl-access=free }} and the disposition effect,{{cite journal |last1=Fogel |first1=S. O. C. |last2=Berry |first2=T. |year=2006 |title=The disposition effect and individual investor decisions: the roles of regret and counterfactual alternatives |journal=Journal of Behavioral Finance |volume=7 |issue=2 |pages=107–116 |doi=10.1207/s15427579jpfm0702_5 |s2cid=153522835 }} among others.

Minimax regret

The minimax regret approach is to minimize the worst-case regret, originally presented by Leonard Savage in 1951.{{cite journal |last=Savage |first=L. J. |year=1951 |title=The Theory of Statistical Decision |journal=Journal of the American Statistical Association |volume=46 |issue=253 |pages=55–67 |doi=10.1080/01621459.1951.10500768 }} The aim of this is to perform as closely as possible to the optimal course. Since the minimax criterion applied here is to the regret (difference or ratio of the payoffs) rather than to the payoff itself, it is not as pessimistic as the ordinary minimax approach. Similar approaches have been used in a variety of areas such as:

One benefit of minimax (as opposed to expected regret) is that it is independent of the probabilities of the various outcomes: thus if regret can be accurately computed, one can reliably use minimax regret. However, probabilities of outcomes are hard to estimate.

This differs from the standard minimax approach in that it uses differences or ratios between outcomes, and thus requires interval or ratio measurements, as well as ordinal measurements (ranking), as in standard minimax.

=Example=

Suppose an investor has to choose between investing in stocks, bonds or the money market, and the total return depends on what happens to interest rates. The following table shows some possible returns:

class="wikitable" ! Return !! Interest rates rise !! Static rates !! Interest rates fall	Worst return
Stocks \| −4 \|\| 4 \|\| 12 \|\| −4
Bonds \| −2 \|\| 3 \|\| 8 \|\| −2
Money market \| 3 \|\| 2 \|\| 1 \|\| 1
Best return \| 3	4	12

class="wikitable"

! Return !! Interest rates rise !! Static rates !! Interest rates fall

Worst return

Stocks

| −4 || 4 || 12 || −4

Bonds

| −2 || 3 || 8 || −2

Money market

| 3 || 2 || 1 || 1

Best return

| 3

The crude maximin choice based on returns would be to invest in the money market, ensuring a return of at least 1. However, if interest rates fell then the regret associated with this choice would be large. This would be 11, which is the difference between the 12 which could have been received if the outcome had been known in advance and the 1 received. A mixed portfolio of about 11.1% in stocks and 88.9% in the money market would have ensured a return of at least 2.22; but, if interest rates fell, there would be a regret of about 9.78.

The regret table for this example, constructed by subtracting actual returns from best returns, is as follows:

class="wikitable" ! Regret !! Interest rates rise !! Static rates !! Interest rates fall	Worst regret
Stocks \| 7 \|\| 0 \|\| 0 \|\| 7
Bonds \| 5 \|\| 1 \|\| 4 \|\| 5
Money market \| 0 \|\| 2 \|\| 11 \|\| 11

class="wikitable"

! Regret !! Interest rates rise !! Static rates !! Interest rates fall

Worst regret

Stocks

| 7 || 0 || 0 || 7

Bonds

| 5 || 1 || 4 || 5

Money market

| 0 || 2 || 11 || 11

Therefore, using a minimax choice based on regret, the best course would be to invest in bonds, ensuring a regret of no worse than 5. A mixed investment portfolio would do even better: 61.1% invested in stocks, and 38.9% in the money market would produce a regret no worse than about 4.28.

Example: Linear estimation setting

What follows is an illustration of how the concept of regret can be used to design a linear estimator.

In this example, the problem is to construct a linear estimator of a finite-dimensional parameter vector $x$ from its noisy linear measurement with known noise covariance structure. The loss of reconstruction of $x$ is measured using the mean-squared error (MSE). The unknown parameter vector is known to lie in an ellipsoid $E$ centered at zero. The regret is defined to be the difference between the MSE of the linear estimator that doesn't know the parameter $x$ , and the MSE of the linear estimator that knows $x$ . Also, since the estimator is restricted to be linear, the zero MSE cannot be achieved in the latter case. In this case, the solution of a convex optimization problem gives the optimal, minimax regret-minimizing linear estimator, which can be seen by the following argument.

According to the assumptions, the observed vector $y$ and the unknown deterministic parameter vector $x$ are tied by the linear model

: $y=Hx+w$

where $H$ is a known $n \times m$ matrix with full column rank $m$ , and $w$ is a zero mean random vector with a known covariance matrix $C_w$ .

Let

: $\hat{x}=Gy$

be a linear estimate of $x$ from $y$ , where $G$ is some $m \times n$ matrix. The MSE of this estimator is given by

: $MSE = E\left(||\hat{x}-x||^2\right) = Tr(GC_wG^*) + x^*(I-GH)^*(I-GH)x.$

Since the MSE depends explicitly on $x$ it cannot be minimized directly. Instead, the concept of regret can be used in order to define a linear estimator with good MSE performance. To define the regret here, consider a linear estimator that knows the value of the parameter $x$ , i.e., the matrix $G$ can explicitly depend on $x$ :

: $\hat{x}^o=G(x)y.$

The MSE of $\hat{x}^o$ is

: $MSE^o=E\left(||\hat{x}^o-x||^2\right) = Tr(G(x)C_wG(x)^*) + x^*(I-G(x)H)^*(I-G(x)H)x.$

To find the optimal $G(x)$ , $MSE^o$ is differentiated with respect to $G$ and the derivative is equated to 0 getting

: $G(x)=xx^*H^*(C_w+Hxx^*H^*)^{-1}.$

Then, using the Matrix Inversion Lemma

: $G(x)=\frac{1}{1+x^*H^*C_w^{-1}Hx}xx^*H^*C_w^{-1}.$

Substituting this $G(x)$ back into $MSE^o$ , one gets

: $MSE^o=\frac{x^*x}{1+x^*H^*C_w^{-1}Hx}.$

This is the smallest MSE achievable with a linear estimate that knows $x$ . In practice this MSE cannot be achieved, but it serves as a bound on the optimal MSE. The regret of using the linear estimator specified by $G$ is equal to

: $R(x,G)=MSE-MSE^o=Tr(GC_wG^*) + x^*(I-GH)^*(I-GH)x-\frac{x^*x}{1+x^*H^*C_w^{-1}Hx}.$

The minimax regret approach here is to minimize the worst-case regret, i.e.,

$\sup_{x\in E} R(x,G).$

This will allow a performance as close as possible to the best achievable performance in the worst case of the parameter $x$ . Although this problem appears difficult, it is an instance of convex optimization and in particular a numerical solution can be efficiently calculated.{{cite journal |first1=Y. C. |last1=Eldar |first2=A. |last2=Ben-Tal |first3=A. |last3=Nemirovski |title=Linear Minimax regret estimation of deterministic parameters with bounded data uncertainties |journal=IEEE Trans. Signal Process. |volume=52 |issue=8 |pages=2177–2188 |year=2004 |doi=10.1109/TSP.2004.831144 |bibcode=2004ITSP...52.2177E |s2cid=16417895 }} Similar ideas can be used when $x$ is random with uncertainty in the covariance matrix.{{cite journal |first1=Y. C. |last1=Eldar |first2=Neri |last2=Merhav |title=A Competitive Minimax Approach to Robust Estimation of Random Parameters |journal=IEEE Trans. Signal Process. |volume=52 |issue=7 |pages=1931–1946 |year=2004 |doi=10.1109/TSP.2004.828931 |bibcode=2004ITSP...52.1931E |s2cid=15596014 }}{{cite journal |first1=Y. C. |last1=Eldar |first2=Neri |last2=Merhav |title=Minimax MSE-Ratio Estimation with Signal Covariance Uncertainties |journal=IEEE Trans. Signal Process. |volume=53 |issue=4 |pages=1335–1347 |year=2005 |doi=10.1109/TSP.2005.843701 |bibcode=2005ITSP...53.1335E |s2cid=16732469 }}

Regret in principal-agent problems

Camara, Hartline and Johnsen{{Cite book |last1=Camara |first1=Modibo K. |last2=Hartline |first2=Jason D. |last3=Johnsen |first3=Aleck |chapter=Mechanisms for a No-Regret Agent: Beyond the Common Prior |date=2020-11-01 |title=2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS) |chapter-url=http://dx.doi.org/10.1109/focs46700.2020.00033 |pages=259–270 |publisher=IEEE |doi=10.1109/focs46700.2020.00033|arxiv=2009.05518 |isbn=978-1-7281-9621-3 |s2cid=221640554 }} study principal-agent problems. These are incomplete-information games between two players called Principal and Agent, whose payoffs depend on a state of nature known only by the Agent. The Principal commits to a policy, then the agent responds, and then the state of nature is revealed. They assume that the principal and agent interact repeatedly, and may learn over time from the state history, using reinforcement learning. They assume that the agent is driven by regret-aversion. In particular, the agent minimizes his counterfactual internal regret. Based on this assumption, they develop mechanisms that minimize the principal's regret.

Collina, Roth and Shao{{Cite arXiv |eprint=2311.07754 |last1=Collina |first1=Natalie |last2=Roth |first2=Aaron |last3=Shao |first3=Han |title=Efficient Prior-Free Mechanisms for No-Regret Agents |date=2023 |class=cs.GT }} improve their mechanism both in running-time and in the bounds for regret (as a function of the number of distinct states of nature).

References

External links

{{cite web|url=http://philosophy.hku.hk/think/strategy/decision.php|title=TUTORIAL G05: Decision theory|archive-url=https://web.archive.org/web/20150703104008/http://philosophy.hku.hk/think/strategy/decision.php|archive-date=3 July 2015}}

Category:Choice modelling

Category:Optimal decisions

Category: Decision theory