Download - A Model of Gain/Loss Asymmetry - 京都大学wakai/research/papers/mdis-rev2...A Model of Gain/Loss Asymmetry Katsutoshi Wakai January 2006 Abstract Based on a notion associated with

A Model of Gain/Loss Asymmetry

Katsutoshi Wakai�

January 2006

Abstract

Based on a notion associated with utility smoothing, this paper pro-

poses, in an axiomatic framework, a new model of discount factors that

captures an asymmetric attitude toward gains versus losses. Its key

departure from existing models is that preferences satisfy dynamic con-

sistency. This introduces the notion of recursive gain/loss asymmetry

in discount factors: The di¤erence between future and current utility

de�nes a gain or a loss, and gains are discounted more than losses. Un-

der this mechanism, current utility becomes the endogenous reference

point. The bene�t of dynamic consistency, as well as the economic im-

plications of gain/loss asymmetry, is shown through a repeated game

example.

Keywords: discount factors, gain/loss asymmetry, recursive utility,

reference points, time-variability aversion, utility smoothing

JEL Classi�cation Numbers: D90, D91

�Department of Economics, The State University of New York at

Bu¤alo, 427 Fronczak Hall, Bu¤alo, NY 14260; kwakai@bu¤alo.edu;

http://pluto.fss.bu¤alo.edu/classes/eco/kwakai/

1

Acknowledgement

This paper is a revised version of the �rst half of Chapter II of my disserta-

tion at Yale University (Wakai (2002)). I would like to express my appreciation

to my advisor, Stephen Morris, and dissertation committee members, John

Geanakoplos and Benjamin Polak, for their invaluable advice and support. I

would also like to thank Larry Epstein and Itzhak Gilboa for valuable dis-

cussions and suggestions. I have bene�ted from comments by Richard Braun,

Takao Kobayashi, Giuseppe Moscarini, Robert Shiller, Leeat Yariv, and semi-

nar participants at the State University of New York at Bu¤alo, the University

of Iowa, the University of Rochester, The University of Texas Austin, the Uni-

versity of Tokyo, Washington University, and the RUD 2004 conference. I

am also grateful to the Editor and referees, whose comments and suggestions

greatly improved this paper.

2

1 Introduction

Intertemporal choices involve an allocation of resources over time. To analyze

this choice problem, Samuelson (1937) proposes a special form of a weighted

summation decision rule, called the discounted utility model. For example, at

each time t, the conditional utility of a consumption sequence c = (c0; :::; cT )

is expressed in an additively separable form

Vt(c) �TX�=t

��tU(c� ); (1)

where � is called a single-period discount factor and U is called an instanta-

neous utility function. Since it is analytically tractable as well as axiomati-

cally driven (Koopmans (1960)), the discounted utility model has become the

standard framework for analyzing intertemporal choices. One of the de�ning

characteristics of this model (as well as the weighted summation decision rule

in general) is that it discounts each utility sequence (U(ct); :::; U(cT )) by the

same series of discount factors (1; :::; �T�t), regardless of the distribution of

utility over time (called utility independence). However, as Frederic, Loewen-

stein, and O�Donoghue (2002) show in a recent survey article, many studies

have reported results contradicting utility independence. Among them, we

focus on the following experiment conducted by Thaler (1981).

One set of subjects were asked to imagine that they had won a monetary

prize that could be received either now or later. They were asked to state

how much money they would need to be paid if payment could be delayed (by

three months, one year, or three years). Another set of subjects were asked a

comparable set of questions, with the prize replaced by a tra¢ c ticket. Since

the immediate payment was equally preferred to the later payment, the yield

to maturity (i.e., (1=�)� 1) was computed based on (1), where instantaneous

3

utility U(:) was replaced by a monetary payment. The study found that the

yields implied by the former set of questions were much higher than the yields

implied by the latter set (we refer to this speci�c result as yield asymmetry;

we use gain/loss asymmetry to refer to the general tendency that gains are

discounted more than losses). Moreover, the di¤erence in yields increased as

the sizes of gains and losses decreased. This implies that yield asymmetry

should exist even if a monetary payment is converted to the corresponding

utility by a concave U .1

Loewenstein and Prelec (1992) explain yield asymmetry by adapting the

reference-dependent utility as developed by Kahneman and Tversky (1979,

1991): The instantaneous utility function U(ct) is replaced by the value func-

tion �(ct � qt), where qt is called a reference point. This behavioral approach

has three di¢ culties: First, there is no general guideline for how to model a

reference sequence (q0; :::; qT ) from which gains and losses are computed. Sec-

ond, the relationship between gain/loss asymmetry and notions of intertem-

poral preferences such as consumption smoothing is not well de�ned. Third,

preferences do not necessarily satisfy dynamic consistency. To address the

�rst two issues, Shalev (1997) models gain/loss asymmetry based on Gilboa�s

(1989) variation aversion, that is, the aversion to the utility variation between

adjacent periods. However, Shalev (1997) does not address the third issue of

dynamic consistency. This is problematic in application because we must also

model how the con�ict between di¤erent selves is resolved.

Given this limitation, our objective is to provide a new model of discount

factors that not only captures an asymmetric attitude toward gains versus

losses but also satis�es dynamic consistency. Our model is based on a notion

1See Loewenstein and Prelec (1992).

4

of utility smoothing, time-variability aversion, meaning that a decision maker

(DM) is averse to utility variations de�ned over an entire time horizon. This is

second-order aversion in the sense that U already captures the notion of con-

sumption smoothing if U is concave. Formally, at each time t, the conditional

utility of a consumption sequence c is expressed in the recursive form

Vt(c) � min�t+12[�t+1;�t+1]

[(1� �t+1)U(ct) + �t+1Vt+1(c)]; (2)

where �t+1 and �t+1 describe the upper and lower bounds of single-period dis-

count factors, respectively, and satisfy 0 < �t+1 � �t+1 < 1. We provide an

axiomatization for the representation as an extension of the weighted sum-

mation decision rule. This is achieved by adapting a method developed in a

di¤erent context by Gilboa and Schmeidler (1989).

The representation (2) introduces the distinct feature called recursive gain/

loss asymmetry: The di¤erence between future utility Vt+1(c) and current

utility U(ct) de�nes a gain or a loss, and gains are discounted more than

losses. This is clearly seen by rewriting (2) as follows:

Vt(c) = U(ct) + min�t+12[�t+1;�t+1]

�t+1[Vt+1(c)� U(ct)]: (3)

More speci�cally, based on utility smoothing, (2) explains the reason for

gain/loss asymmetry shown in (3): When future utility is lower (higher) than

current utility, the DM has a stronger (weaker) incentive to increase future

utility by decreasing current utility. The DM expresses this incentive by shift-

ing weight onto (away from) the future utility. Hence, an increase from the

current utility is regarded as a gain, and it requires a lower discount factor;

a decrease from the current utility is regarded as a loss, and it requires a

higher discount factor. Under this mechanism, current utility U(ct) becomes

the reference point. Intuitively, from the reference point U(ct), the cost of

5

reducing the future utility Vt+1(c) outweighs the bene�t of increasing it. This

cost-bene�t gap is captured by a wideness in the range of discount factors

[�t+1; �t+1].

Besides being dynamically consistent, (2) addresses two other issues raised

for the behavioral approach: First, (2) provides a clear mechanism for reference-

dependence (i.e., how relative consumption matters) and a clear relationship

between the degree of gain/loss asymmetry and the size of a set of discount fac-

tors. In particular, a reference point is endogenous, meaning that it is speci�c

to each consumption sequence. Second, (2) explains the relationship between

gain/loss asymmetry and notions of intertemporal preferences, namely, time-

variability aversion and dynamic consistency.

Recursive gain/loss asymmetry also plays a crucial role in explaining yield

asymmetry, which is translated to a condition on [�t+1; �t+1]. In particular,

for a stationary and in�nite horizon version of (2), where [�t+1; �t+1] = [�; �]

for all periods, yield asymmetry exists if and only if � 6= �. Hence, recursive

gain/loss asymmetry and yield asymmetry become an equivalent notion.

Dynamic consistency makes our model tractable so that we can study the

economic implications of gain/loss asymmetry. In a simple repeated game

application, we show that as players become more gain/loss asymmetric, it

becomes easier (more di¢ cult) to �nd a subgame perfect Nash equilibrium

(SPNE) that supports constant (variable) payo¤ sequences.

The reminder of the paper proceeds as follows: Section 2 provides exam-

ples to motivate our objective. Section 3 axiomatizes the representation (2).

Section 4 characterizes the relationship of (2) to yield asymmetry and to the

discounted utility model. Section 5 provides a repeated game example. Sec-

tion 6 compares our model with other intertemporal utility functions. Section

7 concludes the paper.

6

2 Examples: Dynamic Consistency and Gain/Loss Asymmetry

Consider a three-period problem. At each t, the DM has a conditional ordering

on a set of consumption sequences c = (c0; c1; c2). Assume that orderings are

independent of the consumption history.

First, we show that if a reference point is exogenously given (i.e., the same

reference point is used to evaluate all c), preferences may not satisfy dynamic

consistency. For example, Loewenstein and Prelec (1992) propose the fol-

lowing descriptive model: Let q0 = (q00; q01; q

02) be an exogenous status quo

consumption plan. The ex-ante preference �0;q0 is represented by

V0;q0(c) =2Xt=0

v(ct � q0t )�(t); (4)

where v(ct � q0t ) is a continuous and increasing value function that evalu-

ates ct with the value v(0) = 0 and �(t) is a discount function (they adapt

the hyperbolic discount function). This model explains yield asymmetry if

v(y)=v(x) > v(�y)=v(�x) for all 0 < y < x, meaning that v(x) is less elastic

at x > 0 than at �x. Hence, (4) is based on �rst-order aversion: yield asym-

metry is de�ned as the functional form of v rather than as a characteristic of

discount factors.

Di¢ culties arise from an interpretation of the status quo (i.e., the reference

point). For simplicity, assume that the discount function is a usual geometric

series �(t) = �t, where 0 < � < 1. Also, let v(z) < �v(�z) for z > 0.2

Suppose that the conditional preference �1;q1 is represented by the same class

2The other possibility is the existence of z > 0 such that v(z) > �v(�z) for 0 < z � z.

A similar counterexample can be constructed.

7

as (4), with an exogenous status quo plan q1 = (q10; q11; q

12), i.e.,

V1;q1(c) =

2Xt=1

v(ct � q1t )�(t� 1):

Consider the following two consumption sequences:

c = (a; a+ d; a) and c0 = (a; a; a+ e),

where d; e > 0. Let fc; c0g be the DM�s choice set. Suppose that at time 0, the

DM is endowed with c so that q0 = c. Since c and c0 share the same time-0

consumption, we can investigate the consistency between the ordering at time

0 and the ordering at time 1. If c '0;q0 c0, the DM will choose either c or c0 at

time 0. For a decision problem at time 1, if c is chosen at time 0, the status

quo is q1 = c, and c '1;q1 c0. However, if c0 is chosen at time 0, the status quo is

changed to q1 = c0. Then a simple computation shows that c0 �1;q1 c. This is a

violation of dynamic consistency in the sense that the DM ex-ante knows that

c0 will be preferred at time 1 if c0 is chosen at time 0, but the DM is ex-ante

indi¤erent between c and c0. The model generates inconsistency because the

exogenous reference plans, q0 and q1, are unrelated: The conditional ordering

�1;q1 depends on the choice at time 0, but in (4), the DM fails to take into

account this relationship.

Second, we show that even if a reference point is endogenously given (i.e.,

sequence speci�c), preferences may not satisfy dynamic consistency. For exam-

ple, Shalev (1997) introduces the following model by extending Gilboa�s (1989)

representation, which is dependent on the utility variation between adjacent

periods: The ex-ante preference �0 is represented by

V0(c) � u0 +2Xt=1

��0;+t maxfut � ut�1; 0g+ �0;�t minfut � ut�1; 0g

; (5)

8

where ut is the instantaneous utility of consumption ct. When 0 < �0;+t < �0;�t ,

(5) explains gain/loss asymmetry; under appropriate values of discount factors,

(5) explains yield asymmetry. The time-t utility ut becomes an endogenous

reference point to evaluate the time-t+1 utility ut+1 but not the future utility

Vt+1(c), i.e., the utility of an entire consumption sequence evaluated at time

t + 1.3 Hence, the asymmetry introduced by (5) is not recursive. Similar to

(2), gain/loss asymmetry as shown in (5) is based on second-order aversion.

Suppose that the conditional preference �1 is represented by the same class

as (5), i.e.,

V1(c) � u1 + �1;+2 maxfu2 � u1; 0g+ �1;�2 minfu2 � u1; 0g:

Preferences are dynamically consistent if c �t+1 c0 implies c �t c0, where

c� = c0� for all � � t. Then a simple computation shows that if preferences

satisfy dynamic consistency,

�0;+1 = �0;�1 .

Hence, Shalev�s (1997) representation is dynamically inconsistent if we assume

gain/loss asymmetry (i.e., 0 < �0;+t < �0;�t ).4

The above inconsistency comes from the feature that for each �t, the dis-

count factor of ut, (1 � �t;:t+1), depends on ut+1 but not on Vt+1(c). More

precisely, utility smoothing makes discount factors dependent on the utility of

future consumption. In this case, endogenizing a reference point alone does not

3For comparison, (2) can be written as Vt(c) = U(ct) + �t+1maxfVt+1(c) � U(ct); 0g +

�t+1minfVt+1(c)� U(ct); 0g.4This result comes from the assumption used in Gilboa (1989) and Shalev (1997); their

models are based on Schmeidler�s (1989) non-additive prior model. For more details, see

Eichberger and Kelsey (1996); Sarin and Wakker (1998); Grant, Kajii, and Polak (2000).

9

necessarily resolve dynamic inconsistency; the recursive form of asymmetry is

additionally required.

3 Representation

We adapt the Anscombe-Aumann (1963) framework with a temporal interpre-

tation: Preferences are de�ned on the set of sequences whose outcome at any

period is a lottery de�ned over a consumption set. This domain is rich enough

to model preference for utility smoothing independently of a functional form

of instantaneous utility. The domain of our interest, the set of consumption

sequences, is identi�ed with a subset of this larger domain, where each element

of the subset is a sequence of degenerate lotteries.

Time is discrete and varies over T = f0; 1; :::; Tg, where T > 0 (Appendix

B considers the in�nite horizon case). For a convex consumption set X � R,

let M be the set of all probability distributions over X with �nite support; M

is a mixture space under a probability operation +. An act is a function l : T

! M , where l = (l0; l1; :::; lT ). Denote by L = MT the collection of all acts;

L is a mixture space under the operation + de�ned by (l + l0)t = lt + l0t. A

constant act p is a function l : T ! M such that lt = p 2 M for all t 2 T ; p

is also identi�ed with p 2M . Let C be a collection of all constant acts.

The primitive of the model is the collection of preference orderings f�tg

� f�t j t 2 T g, whose element is de�ned on L. Each conditional ordering on

C induces a conditional ordering on M , also denoted by �t.

First, consider the following four axioms:

A1 (Conditional Preference-CP): For each t 2 T and for all l; l0 2 L,

if l� = l0� for all � � t, then l 't l0.

10

A2 (Weak Order-WO): For each t 2 T and for all l; l0; l00 2 L, (i)

l �t l0 or l0 �t l and (ii) l �t l0and l0 �t l00 imply l �t l00.

A3 (Continuity-CT): For each t 2 T and for all l; l0; l00 2 L with

l �t l0 �t l00, there exist �, � 2 (0; 1) such that �l + (1 � �)l00 �t l0 and

l0 �t �l + (1� �)l00.

A4 (Non-Degeneracy-ND): For each t 2 T , there exist l; l0 2 L such

that l �t l0.

A1 states that a conditional ordering depends only on the corresponding

continuations of acts; it also implies that a preference ordering is independent

of a payo¤ history. A2-A4 are standard assumptions.

The next axiom relates the evaluation of lotteries to the evaluation of acts.

A5 (Strict Monotonicity-SMT): For each t 2 T and for all l; l0 2 L,

if l� �t l0� for all � 2 T , then l �t l0. In addition, if for some � � t, lt �t l0t,

then l �t l0.

A5 assumes that a lottery is evaluated independently at each time.

The discounted utility model fails to explain gain/loss asymmetry because

of utility independence, which is characterized by the following property:

A60 (Independence-ID): For each t 2 T , for all l; l0; l00 2 L, and for all

� 2 (0; 1), l �t l0if and only if �l + (1� �)l00 �t �l0 + (1� �)l00.

Applying A1-A3 and A60 to C, there exists an a¢ ne function Ut : M ! R

that represents �t on M . Then, by A5, a preference ordering on lottery se-

quences induces a preference ordering on utility sequences. Also, the mixture

operation + on L induces the addition between the corresponding utility se-

quences. Hence, A60 implies utility independence.

11

Gain/loss asymmetry is based on utility-sequence-speci�c discounting. To

allow such discounting, we replace A60 with the following weaker axiom:

A6 (Constant-Independence-CI): For each t 2 T , for all l; l0 2 L,

p 2 C, and for all � 2 (0; 1), l �t l0if and only if �l+(1��)p �t �l0+(1��)p.

First, A6 is equivalent to A60 on C so that there exists an a¢ ne function

Ut : M ! R that represents �t on M . Second, since p does not involve

utility deviations, the pattern of utility deviations implied by l is preserved

in the pattern of utility deviations in �l + (1 � �)p. Hence, CI allows for

the possibility that the same discounting is applied only to a subset of utility

sequences sharing a similar pattern of utility deviations.

Next, consider the reasoning behind gain/loss asymmetry. If gains are

discounted more than losses, the DM tends to put greater weight on a lower

utility than on a higher utility. The motive behind this pattern of discounting

resembles utility smoothing. Hence, we introduce a notion that expresses

aversion to utility deviations:

A7 (Time-Variability Aversion-TVA): For each t 2 T , for all l; l0 2

L, and for all � 2 (0; 1), l 't l0 implies �l + (1� �)l0 �t l.

Given A1-A6, A7 is e¤ectively imposed on utility sequences. Therefore, this

axiom does not imply consumption smoothing (i.e., an instantaneous utility

function can be convex on a degenerate outcome spaceX). Intuitively, hedging

the movement of instantaneous utility over time increases overall utility.

The conditional utility based on A1-A7 is a temporal version of Gilboa-

Schmeidler�s (1989) multiple-priors utility, which does not necessarily satisfy

dynamic consistency. Hence, we restrict our attention to the family of condi-

tional orderings, for which each ordering satis�es the following crucial axiom:

12

A8 (Dynamic Consistency-DC): For each t 2 T nfTg and for all l; l0

2 L such that l� = l0� for all � � t, if l �t+1 l0, then l �t l0; the latter ranking

is strict if the former is strict.

Dynamic consistency has a normative appeal, which has been mentioned in

the introduction and discussed in detail in Section 2 through counterexam-

ples. The analytical advantage of dynamic consistency will be shown through

applications in Section 5.

The following proposition is the main result of this paper (for the proof,

see Appendix A):

Proposition 1: The following statements are equivalent:

(i) f�tg satisfy A1 to A8.

(ii) There exist an a¢ ne function U : M ! R and a collection of sets of

discount factors f[�t; �t]g1�t�T satisfying 0 < �t � �t < 1 for all t 2 [1; T ]

such that: For each t 2 T , �t is represented by Vt(:); where fVt(l)g0�t�T are

recursively de�ned by

Vt(l) � min�t+12[�t+1;�t+1]

[(1� �t+1)U(lt) + �t+1Vt+1(l)] (6)

and VT (l) � U(lT ). Moreover, [�t; �t] is uniquely de�ned for all t 2 [1; T ], and

U is unique up to a positive a¢ ne transformation.

Time-variability aversion is captured by minimization on a set of discount

factors, whereas dynamic consistency forces this operation to be applied re-

cursively. A discount factor takes on an extreme value under the recursive

application of minimization. Hence, (6) generates gain/loss asymmetry.

A special feature of this model is that the scale of future utility is compatible

with the scale of current utility. In particular, under a constant act p, U(p) =

13

Vt+1(p). Hence, Vt+1(l) is greater than U(lt) if and only if l �t+1 lt, where

lt is a constant act that pays lt at each time. Intuitively, having U(lt) as

an endogenous reference point is equivalent to having a constant act lt as an

endogenous reference point.

4 Analysis

4.1 Yield Asymmetry and Conditions on Sets of Discount Factors

First, for the models based on utility smoothing, we formally de�ne yield

asymmetry as follows:

De�nition 1: For T � s > s0 � t, let u = (a; :::; a; a + bs; a; :::; a)

and u0 = (a; :::; a; a � bs; a; :::; a) be utility sequences, where bs > 0. Let v =

(a; :::; a; a + vs0 ; a; :::; a) and v0 = (a; :::; a; a � v0s0 ; a; :::; a) be utility sequences

satisfying u 't v and u0 't v0, where �t is an induced conditional ordering on

the set of utility sequences. Then the DM�s preferences show �yield asymmetry

at time t�if 0 < vs0 < v0s0 for any admissible combinations of a, bs, s, and s0.

Although highly related, recursive gain/loss asymmetry and yield asymme-

try are not equivalent notions: Recursive gain/loss asymmetry is based on a

comparison between di¤erent levels of utility in a given period, whereas yield

asymmetry is based on a comparison between two utility sequences, each of

which allocates a gain (or a loss) in a di¤erent period. Hence, to relate these

two notions, we derive necessary and su¢ cient conditions on sets of discount

factors that explain yield asymmetry (for the proof, see Appendix C).

Proposition 2: The DM�s preferences show yield asymmetry at every

14

period if and only if for each t satisfying T � t > 0,

�t(1� �t)

(1� �t+1) <�t

(1� �t)(1� �t+1); (7)

where �T+1 = �T+1 � 0.

For example, suppose that �t = � and �t = � for all t > 0. If � + � < 1

and � 6= �, yield asymmetry exists at all periods. Hence, the sets of discount

factors that satisfy the above condition are not empty.

To negate the last period e¤ect (i.e., �T+1 = �T+1 � 0), we may apply

a stationary and in�nite horizon version of (6) (see Appendix B), where our

representation becomes the following:

V ((lt; :::)) � min�2[�;�]

[(1� �)U(lt) + �V ((lt+1; :::))]: (8)

Then condition (7) becomes � < �, which is always satis�ed under � 6= �.

Hence, recursive gain/loss asymmetry and yield asymmetry become an equiv-

alent notion.

4.2 Relationship between the Discounted Utility Model and Time-

Variability Aversion

First, we investigate a binary relation of �more time-variability averse� (or

equivalently, �more gain/loss asymmetric�). From the representation of (6), it

is clear that the instantaneous utility function U in�uences e¤ective intertem-

poral substitution. Thus, we can only permit a binary relation of �more time-

variability averse�for pairs of preference relations that embody the same rank-

ing of a single-period consumption lottery. Then Vt is more time-variability

averse than V 0t if and only if U = U0, and

[�� ; �� ] � [�� ; �� ]0 for all � 2 ft+ 1; :::; Tg:

15

where [�� ; �� ] and [�� ; �� ]0 represent sets of discount factors in (6) for Vt and

V 0t , respectively.

The relationship between the discounted utility model and (6) is most

clearly shown by (8): If � = � = �, (8) is equivalent to the discounted utility

model with the discount factor 0 < � < 1, i.e.,

V ((lt; :::)) � (1� �)1X�=t

��tU(l� ): (9)

(9) is the least time-variability averse in the class of (8), i.e., there exists

no other Vt that is strictly less time-variability averse than (9). Hence, the

discounted utility model is a special case of our model, in which the DM is

time-variability neutral (i.e., in A7, l 't l0 implies �l + (1� �)l0 't l).

5 Application

To study the economic implications of gain/loss asymmetry, we present a sim-

ple repeated game application. The bene�t of dynamic consistency becomes

evident through the construction of an SPNE. Consider the following version

of the prisoner�s dilemma (the numbers are instantaneous utilities). The re-

sults in this example will not change under a positive a¢ ne transformation of

payo¤s. Hence, they are independent of the level of utility but dependent on

the pattern of utility deviations involved in a payo¤ sequence.

Player 1

Player 2

A B

A (5,5) (10,1)

B (1,10) (4,4)

16

We focus on two di¤erent payo¤ sequences. The �rst sequence is an in�nite

repetition of (5,5). To generate this sequence under an SPNE, consider the

following strategy.

(a) Play A until the opponent plays B.

(b) If the opponent plays B, then play B for the next N > 0 periods.

(c) If (b) happens, after N periods of punishment, play A until the opponent

plays B.

(d) If (c) happens, and if the opponent plays B, play B forever.

This strategy involves a combination of Nash reversion with a preceding

punishment phase. The intention of this strategy is to give the opponent an

opportunity to learn the bene�t of cooperation.

First, consider discounted-utility players with the same discount factor

0 < � < 1. For this strategy to be an SPNE,

5 � 10(1� �) + 4�, and (10)

5 � 10(1� �) + 4�(1� �N) + 5�N+1 = 10(1� �) + 4� + �N+1. (11)

For example, let � = 0:9, which satis�es (10). Then by (11), N � 8; since the

payment at a single-period derivation is high, it requires a long punishment

phase.

Second, suppose that players�preferences are represented by (8) with the

same set of discount factors [�; �], where 0 < � < � < 1. We assume that

� 2 [�; �] so that the players here are more gain/loss asymmetric than the

discounted-utility players above. For the above strategy to be an SPNE,

5 � 10(1� �) + 4�, and (12)

5 � 10(1� �) + 4� (1� �)(1� �)(1� �

N) + 5��N = 10(1� �) + 4� + ��N . (13)

17

By assumption, � � 0:9 � �, which satis�es (12). By (13), N � 1 if � �

(6� � 5)=�; the right hand side is increasing in � (it is 49at � = 0:9). This

shows that if players are su¢ ciently gain/loss asymmetric, the subgame perfect

strategy requires only a minimum punishment phase. Hence, as players become

more gain/loss asymmetric, it becomes easier to achieve cooperation if payo¤s

are constant.

The second sequence is (10,1) at an even period and (1,10) at an odd

period. To generate this sequence under an SPNE, consider the following

strategy using Nash reversion.

(a) Player 1 plays B if t is even and plays A if t is odd until Player 2 plays B

at an even t.

(b) If Player 2 plays B at an even t, Player 1 plays B forever.

(c) Player 2 plays A if t is even and plays B if t is odd until Player 1 plays B

at an odd t.

(d) If Player 1 plays B at an odd t, Player 2 plays B forever.

Consider the discounted-utility players. For this strategy to be an SPNE,

1 + 10� � 4(1� �2):

This condition is satis�ed under � = 0:9. Similarly for the gain/loss asymmet-

ric players,

V ((1; 10; 1; 10; :::)) � 4, and (14)

V ((1; 10; 1; 10; :::)) =1

1� ��(1� �) + �(1� �)10

= 10� 9(1� �)

1� ��: (15)

Again, � � 0:9 � �. By (14) and (15), � � 3=(9 � 6�); the right hand

side is increasing in � (it is 56at � = 0:9). For example, let � > 51

54. Then

3=(9�6�) > 0:9 so that the �rst strategy is an SPNE for someN but the second

18

strategy is not (if N � 8, the �rst strategy is an SPNE at any � � 0:9 � �).

Hence, as players become more gain/loss asymmetric, it becomes harder to

�nd an SPNE that supports variable payo¤ sequences.

6 Other Related Literature

Koopmans (1960) proposes the aggregator function that represents dynami-

cally consistent preferences. Concave, nonlinear, and smooth aggregator func-

tions can exhibit e¤ects similar to gain/loss asymmetry. However, the de-

gree of asymmetry converges to zero as the sizes of gains and losses decrease,

which contradicts the results shown by Thaler�s (1981) experiment. On the

contrary, our aim is to build a speci�c aggregator function under which the

degree of asymmetry does not diminish as the sizes of gains and losses de-

crease. As applications show, our approach provides a clear parametrization

of gain/loss asymmetry independently of a utility level, which is not possible

under a smooth aggregator function. Also, given the Euler inequality derived

in a di¤erent context by Epstein and Wang (1994), the analytical cost of (6)

caused by non-di¤erentiability is relatively minor.

In terms of a model of discount factors, Uzawa (1968), Epstein (1983), and

Shi and Epstein (1993) propose a model under which the discount factor at

any future date depends on historical consumption. However, their models are

not based on utility smoothing and do not show gain/loss asymmetry. On the

other hand, similarly to our model, Geo¤ard (1996) proposes, in a continuous-

time setting, the following non-additively-separable model under which the

discount rates depend on an entire sequence of consumption:

V (c) � minrt2R

Z 1

0

f(ct; Bt; rt)dt; (16)

19

where R is a set of admissible paths of discount rates, and f gives current

felicity as a function of the current values of consumption ct, discount factor

Bt, and discount rate rt. Although (16) does not satisfy dynamic consistency

in general, Geo¤ard (1996) shows that the recursive utility of Epstein (1987)

belongs to this class of the representation if f(ct; Bt; rt) = BtF (ct; rt). Indeed,

our model (8) can be written as follows:

V (c) � minB2B

1Xt=0

BtF (ct; �t+1);

where F (ct; �t+1) � (1� �t+1)u(ct) and B is a collection of admissible normal-

ized discount-factor processes B de�ned by B0 � 1 and by Bt+1 � �t+1Bt with

�t+1 2 [�; �]. Hence, in a discrete time setting, our model provides axiomati-

zation for a particular form of (16).

Finally, Epstein and Schneider (2003) propose recursive multiple-priors

utility, where a set of priors is recursively constructed. In (6), a set of dis-

count factors is similarly constructed. To exploit this similarity, we adapt

their proof and suitably modify it to �t our framework.

7 Conclusion

This paper makes the following contributions: (i) Based on time-variability

aversion, we develop a new model of discount factors that not only captures

gain/loss asymmetry but also satis�es dynamic consistency; (ii) Dynamic con-

sistency introduces the notion of recursive gain/loss asymmetry, where current

utility becomes an endogenous reference point to evaluate future utility. The

model provides a clear mechanism for reference dependence and explains the

relationship among gain/loss asymmetry, time-variability aversion, and dy-

namic consistency; and (iii) Under a stationary and in�nite horizon version of

20

the model, recursive gain/loss asymmetry and yield asymmetry are equivalent.

Also, through an example, we show the applicability of our model originated

by dynamic consistency.

Appendix A: Proof of Proposition 1

If T = 1, Proposition 1 is equivalent to Theorem 1 of Gilboa and Schmeidler

(1989) (GS). Hence, assume that T > 1. We prove (i) ) (ii), which is based

on Lemmas A.1-A.4 (given Theorem 1 of GS and Lemmas A.1-A.4, (ii)) (i)

is trivial). We adapt the arguments in Epstein and Schneider (2003).

Lemma A.1: There exists an a¢ ne function U : M ! R, unique up to a

positive a¢ ne transformation, such that for all t 2 T , U represents �t on M .

Proof. By CP, WO, CT, and CI, there exists an a¢ ne Ut : M ! R,

unique up to a positive a¢ ne transformation, that represents �t on M .

For a given t < T , consider l; l0 2 L such that l� = p 2M and l0� = q 2M

for all � � t+ 1 and l� = l0� for all � < t+ 1. By CP, WO, and SMT, l �t+1 l0

if and only if p �t+1 q. By WO and DC, l �t+1 l0 if and only if l �t l. By

CP, WO, and SMT, l �t l0 if and only if p �t q. Hence, p �t+1 q if and only if

p �t q. The conclusion follows by induction applied from t = 0 to T � 1. �

For the remaining lemmas, we use U in place of Ut. By ND, assume that

[�1; 1] � U(M). In particular, let p[0] 2 M satisfy U(p[0]) = 0. Assume a

standard topology on Rt.

Lemma A.2: For each t 2 T , there exists a non-empty, closed, and

convex set of strictly positive discount factors, Dt � RT�t+1; each of whose

21

elements, bt 2 Dt, satis�esPT

�=t bt� = 1 such that �t is represented by Vt(:);

where

Vt(l) � minbt2Dt

TX�=t

bt�U(l� ) (A.1)

Moreover, Dt is unique.

Proof. This follows from Theorem 1 of GS and Lemma A.1. �

For each t < T , let Ut � f(U(lt); Vt+1(l))j l 2 Lg. By (A.1), [�1; 1] �

[�1; 1] � Ut. De�ne the following function It : Ut ! R by

It(U(lt); Vt+1(l)) � Vt(l): (A.2)

By SMT and DC, (A.2) is well de�ned and increasing on Ut. Also, It(1; 1) = 1.

Lemma A.3: For a given t < T , there exists a unique [�t+1; �t+1] � R

satisfying 1 < �t+1 � �t+1 < 1 such that for all l 2 L,

It(U(lt); Vt+1(l)) = min�t+12[�t+1;�t+1]

[(1� �t+1)U(lt) + �t+1Vt+1(l)]: (A.3)

Proof. We adapt the arguments in Lemma 3.3 of GS.

(i) It is homogeneous on Ut: For any x; x0 2 Ut with x0 = �x for 0 < � � 1,

It(x0) = �It(x). Let l; l0 2 L satisfy l0 = �l+(1��)p[0] and (U(lt); Vt+1(l)) = x.

Since U is a¢ ne, U(l0t) = �U(lt). By (A.1), Vt+1(l0) = �Vt+1(l). By (A.2),

�It(x) = �Vt(l) and

It(x0) = It(�U(lt); �Vt+1(l)) = It(U(l

0t); Vt+1(l

0)) = Vt(l0):

By (A.1), Vt(l0) = �Vt(l) so that It(x0) = �It(x).

(ii) Extend It by homogeneity to R� R.

(iii) It is monotone on R� R (by construction).

22

(iv) It satis�es constant additivity on R� R: For any x; x0 2 R� R with

x0 = (x1; x2) and x1 = x2, It(x + x0) = It(x) + It(x0). By homogeneity, it is

enough to consider 2x; 2x0 2 Ut. Let l; p 2 L satisfy (U(lt); Vt+1(l)) = 2x and

(U(p); Vt+1(p)) = 2x0. Since U is a¢ ne,

1

2U(lt) +

1

2U(p) = U(

1

2lt +

1

2p). By

(A.1),1

2Vt+1(l) +

1

2Vt+1(p) = Vt+1(

1

2l +

1

2p). Hence, by (A.1) and (A.2),

It(x+ x0) = Vt(

1

2l +

1

2p) = Vt(

1

2l) + Vt(

1

2p) = It(x) + It(x

0):

(v) It is superadditive on R� R: For any x; x0 2 R� R, It(x + x0) �

It(x) + It(x0). By homogeneity, it is enough to consider 2x; 2x0 2 Ut. Let

l; l0 2 L satisfy (U(lt); Vt+1(l)) = 2x and (U(l0t); Vt+1(l0)) = 2x0, where l0� = l0� 0for all �; � 0 > t. Since U is a¢ ne,

1

2U(lt) +

1

2U(l0t) = U(

1

2lt +

1

2l0t). By (A.1),

1

2Vt+1(l) +

1

2Vt+1(l

0) = Vt+1(1

2l +

1

2l0). Hence, by (A.1) and (A.2),

It(x+ x0) = Vt(

1

2l +

1

2l0) � Vt(

1

2l) + Vt(

1

2l0) = It(x) + It(x

0):

The conclusion follows from Lemma 3.5 and Theorem 1 of GS. �

Let �t+1 � [�t+1; �t+1]. De�ne a collection of discount factors, D0t, by

D0t � fbt 2 RT�t+1++ jbt = (1��t+1; �t+1bt+1); where �t+1 2 �t+1 and bt+1 2 Dt+1g:

(A.4)

Each element bt 2 D0t satis�esPT

�=t bt� = 1. Since �t+1 andDt+1 are non-empty,

closed, and convex, so is D0t; D0t is also compact.

Lemma A.4: D0t = Dt.

23

Proof. For all l 2 L,

Vt(l) = minbt2Dt

TX�=t

bt�U(l� )

= min�t+12[�t+1;�t+1]

f(1� �t+1)U(lt) + �t+1Vt+1(l)g (by (A.3))

= min�t+12[�t+1;�t+1]

f minbt+12Dt+1

f(1� �t+1)U(lt) + �t+1TX

�=t+1

bt+1� U(l� )gg

= minbt2D0t

TX�=t

bt�U(l� ):

Hence, D0t with U represents �t. The conclusion follows from the uniqueness

of Dt, as implied by ND. �

Finally, (ii) of Proposition 1 follows by recursively applying Lemmas A.3

to A.4 from time 0 to T � 1. �

Appendix B: In�nite Horizon Model

Let Tt � ft; t+ 1; :::g be a countably in�nite number of periods, and T � T0.

Denote by �t the �-algebra that consists of all subsets of Tt. Since �t and

�t0 are identical except for the name of each element, for simplicity, let B

be a collection of all bounded, �t-measurable real valued functions on Tt for

any t 2 T . B is endowed with the sup norm. We use the weak�-topology on

B�, i.e., the dual space of B. Denote by 1 2 B a constant function of one.

De�ne by �A an indicator function on A 2 �t. For each l 2 L, de�ne the

continuation from time t, lt 2 L, by lt = (lt; lt+1; :::). Let0Y�=1

x� � 1 for any

sequence x : T1 ! R++.

We impose regularity and stationarity axioms:5

5For an in�nite horizon extension, Gilboa (1989) and Shalev (1997) require an additional

24

A9 (Best�Worst-BW): For each t 2 T , there exist l� and l� 2 L such

that l� �t l �t l� for all l 2 L.

A10 (Stationarity-ST): For each t 2 T , for all l; l0;el;el0 2 L such that(i) ls = els and l0s0 = el0s0 for all s < t and s0 < t + 1 and (ii) lt+s = l0t+1+s andelt+s = el0t+1+s for all s 2 T , l �t el if and only if l0 �t+1 el0.Proposition B.1: The following statements are equivalent:

(i) f�tg satisfy A1 to A10, where T � 1 in A8.

(ii) There exist (a) a set [�; �] � R satisfying 0 < � � � < 1, (b) a non-empty,

closed, and convex set of strictly positive and countably additive discount fac-

tors on �0, � � B�, each of whose elements, b 2 �, satis�es

b(a) �1Xt=0

btat for all a 2 B,

where bt � t� t+1 and t �tY

�=1

�� based on a sequence of atemporal discount

factors f�tg with �t 2 [�; �], and (c) an a¢ ne function U : M ! R such that:

For each t 2 T , �t is represented by Vt(:); where

Vt(l) = V (lt) � min

b2�f1X�=t

b��tU(l� )g = min�t+12[�;�]

[(1� �t+1)U(lt) + �t+1V (lt+1)]:

Moreover, � and � are unique, U is unique up to a positive a¢ ne transforma-

tion, and maxp2MU(p) and minp2MU(p) exist.

Proof. Given Proposition 4.1 of GS and � de�ned above, (ii) ) (i) is

trivial. We prove (i)) (ii), which consists of several steps.

For a function U :M ! R, de�ne a function U� : L ! B by U�(l)t = U(lt).

axiom to de�ne the continuity in discount factors. Here, dynamic consistency plays a similar

role.

25

Step 1: If f�tg satisfy A1-A9, then for each t 2 T , there exist (a) a set

[�t+1; �t+1] � R satisfying 1 < �t+1 � �t+1 < 1 and (b) a non-empty, closed,

and convex set of �nitely additive discount-factor measures on �t, Dt � B�,

each of whose elements, bt 2 Dt, satis�es bt(1) = 1 and bt(�f�g) > 0 for all

� 2 Tt such that the conditional ordering �t is represented by Vt(:), where

Vt(l) � minbt2Dt

ZTtU � (l)dbt = min

�t+12[�t+1;�t+1][(1� �t+1)U(lt) + �t+1Vt+1(l)];

and U : M ! R satis�es the conditions stated in Proposition B.1. Moreover,

Dt is compact and uniquely determined, and [�t+1; �t+1] is unique.

Proof. This follows from the suitable adaptation of Proposition 1 to the

in�nite horizon setting. In particular, we invoke Proposition 4.1 of GS at

Lemma A.2. Also, we replace (A.4) with

D0t � fbt 2 B�jbt = (1� �t+1; �t+1bt+1); where �t+1 2 �t+1 and bt+1 2 Dt+1g;

and where �t+1 � [�t+1; �t+1]. Each bt 2 D0t de�nes bt : B! R by

bt(at) =

ZTtatdbt = (1� �t+1)att + �t+1

ZTt+1

at+1dbt+1. (B.1)

It is easy to see that D0t is non-empty, closed, convex, and compact in B�. The

existence of maxp2MU(p) and minp2MU(p) follows from BW. �

Step 2: There exist V : L ! R, D � B�, and [�; �] � [0; 1] such that

V (lt) = Vt(l), D = Dt, and [�; �] = [�t+1; �t+1] for all t 2 T .

Proof. The existence of V follows from ST. The uniqueness of Dt proves

other results. �

For the remaining steps, we use V , D, and [�; �]. Let f�tg be a sequence of

atemporal discount factors, where �t 2 [�; �]. From f�tg, de�ne a sequence of

26

discount factors f tg by t �tY

�=1

�� . Construct � � B� by

� � fb 2 B�j For each a 2 B, b(a) �1Xt=0

btat, where bt � t � t+1g: (B.2)

� is non-empty. For any b 2 �,P1

t=0 bt =P1

t=0( t � t+1) = 0 = 1. Hence,

� is a collection of discrete, countably additive, and strictly positive discount

factor sequences. Under these conditions, � is closed and compact in B�.

Step 3: � is convex.

Proof. For any � 2 (0; 1) and any b; b0 2 �, let eb � �b+ (1� �)b0. Then,eb is based on fe�tg such that e�1 = ��1 + (1� �)�01 2 [�; �] and for t > 1,e�t = �t + (1� �) 0t�1(�0t � �t)

� t�1 + (1� �) 0t�12 [�; �];

where f�tg and f tg de�ne b and f�0tg and f 0tg de�ne b0. Hence, eb 2 �. �

Step 4: D =� so that for all l 2 L, Vt(l) = minb2�fP1

�=t b��tU(l� )g.

Proof. For D ��, let bb 2 D. For each k 2 T , let �k � t+kY�=t

�� , where �� is

an implied discount factor of bb de�ned by (B.1). Then, for each n 2 T , thereexists b(n) 2 � such that b(n)��t = ��t � ��+1�t for all � 2 ft; :::; t + ng. By

recursively applying (B.1), for each l 2 L,

bb(U � (lt)) = t+nX�=t

( ��t � ��+1�t)U(lt� ) + �n+1ZTt+n+1

U � (lt+n+1)dbbt+n+1, andb(n)(U � (lt)) =

t+nX�=t

( ��t � ��+1�t)U(lt� ) + �n+11X

�=t+n+1

b(n)��t �n+1

U(lt� ):

27

Therefore,

jbb(U � (lt))� b(n)(U � (lt))j= �n+1j

ZTt+n+1

U � (lt+n+1)dbbt+n+1 � 1X�=t+n+1

b(n)��t �n+1

U(lt� )j

< �n+1(2jV (l�)j+ 2jV (l�)j):

Since the bound is independent of n, t, and l and �n+1 converges to zero,

bb(U � (lt)) = lim b(n)(U � (lt)): (B.3)

And since (B.3) holds for any l 2 L, bb is either a limit point of � or a point

in �. Hence, bb 2 � because � is closed in B�.

For D ��, let b 2 �. Let f �t g be a sequence of implied discount factors

of b de�ned by (B.2). By (B.1), for each n 2 T , there exists bb(n) 2 D such thatfor each l 2 L,

bb(n)(U � (lt)) = t+nX�=t

( ��t � ��+1�t)U(lt� ) + �n+1ZTt+n+1

U � (lt+n+1)dbb(n);t+n+1.Then b 2 D follows from a similar argument leading to (B.3) because D is

closed in B�. �

Appendix C: Proof of Proposition 2

By a comparison between u = (a; :::; a; a + bt; a; :::; a) and v = (a; :::; a; a +

vt�1; a; :::; a), u '0 v if and only if

(1� �t)vt�1 = �t(1� �t+1)bt; (C.1)

where �T+1 = �T+1 � 0. Similarly, by a comparison between u = (a; :::; a; a�

bt; a; :::; a) and v = (a; :::; a; a� v0t�1; a; :::; a), u '0 v if and only if

(1� �t)v0t�1 = �t(1� �t+1)bt: (C.2)

28

Hence, by rearranging (C.1) and (C.2), yield asymmetry exists between time

t�1 and t if and only if (7) is satis�ed. Given dynamic consistency, this proves

that (7) is a necessary condition.

To show that it is su¢ cient, consider a comparison between time s and

time s0, where s > s0. Then

vs0 = bs

sY�=s0+1

��(1� �s+1)(1� �s0+1)

= bs

sY�=s0+1

��(1� �� )

(1� ��+1), and

v0s0 = bs

sY�=s0+1

��(1� �s+1)(1� �s0+1)

= bs

sY�=s0+1

��(1� �� )

(1� ��+1).

By (7), vs0 < v0s0. �

References

1. Anscombe, F., and R. Aumann (1963): �A De�nition of Subjective Prob-

ability,�Annals of Mathematical Statistics, 34(1), 199-205.

2. Eichberger, J., and D. Kelsey (1996): �Uncertainty Aversion and Dy-

namic Consistency,�International Economic Review, 37(3), 625-640.

3. Epstein, L. (1983): �Stationary Cardinal Utility and Optimal Growth

under Uncertainty,�Journal of Economic Theory, 31(1), 133-152.

4. Epstein, L. (1987): �The Global Stability of E¢ cient Intertemporal Al-

locations,�Econometrica, 55(2), 329-355.

5. Epstein, L., and M. Schneider (2003): �Recursive Multiple-Priors,�Jour-

nal of Economic Theory, 113(1), 1-31.

6. Epstein, L., and T. Wang (1994): �Intertemporal Asset Pricing under

Knightian Uncertainty,�Econometrica, 63(2), 283-322.

29

7. Frederic, S., G. Loewenstein, and T. O�Donoghue (2002): �Time Dis-

counting and Time Preference: A Critical Review,�Journal of Economic

Literature, 40(2), 351-401.

8. Geo¤ard, P. (1996): �Discounting and Optimizing: Capital Accumula-

tion Problems as Variational Minmax Problems,�Journal of Economic

Theory, 69(1), 53-70.

9. Gilboa, I. (1989): �Expectation and Variation in Multi-Period Deci-

sions,�Econometrica, 57(5), 1153-1169.

10. Gilboa, I., and D. Schmeidler (1989): �Maxmin Expected Utility with

Non-Unique Prior,�Journal of Mathematical Economics, 18(2), 141-153.

11. Grant S., A. Kajii, and B. Polak (2000): �Temporal Resolution of Un-

certainty and Recursive Non-Expected Utility Models,�Econometrica,

68(2), 425-434.

12. Kahneman, D., and A. Tversky (1979): �Prospect Theory: An Analysis

of Decision under Risk,�Econometrica, 47(2), 263-292.

13. Koopmans, T. (1960): �Stationary Ordinal Utility and Impatience,�

Econometrica, 28(2), 287-309.

14. Loewenstein G., and D. Prelec (1992): �Anomalies in Intertemporal

Choice: Evidence and an Interpretation,� Quarterly Journal of Eco-

nomics, 107(2), 573�597.

15. Samuelson, P. (1937): �A Note on Measurement of Utility,�Review of

Economic Studies, 4(2), 155-161.

30

16. Sarin B., and P. Wakker (1998): �Dynamic Choice and NonExpected

Utility,�Journal of Risk and Uncertainty, 17(2), 87-119.

17. Schmeidler, D. (1989): �Subjective Probability and Expected Utility

without Additivity,�Econometrica, 57(3), 571-587.

18. Shalev, J. (1997): �Loss Aversion in a Multiple-Period Model,�Mathe-

matical Social Sciences, 33(3), 203-226.

19. Shi, S., and L. Epstein (1993): �Habit and Time Preference,� Interna-

tional Economic Review, 34(1), 61-84.

20. Thaler, R. (1981): �Some Empirical Evidence on Dynamic Inconsis-

tency,�Economics Letters, 8, 201-207.

21. Tversky, A., and D. Kahneman (1991): �Loss Aversion in Riskless Choice:

A Reference-Dependent Model,�Quarterly Journal of Economics, 106(4),

1039-1061.

22. Uzawa, H. (1968): �Time Preference, the Consumption Function, and

Optimum Asset Holdings,�in J.N. Wolfe, ed., Capital and Growth: Pa-

pers in Honour of Sir John Hicks (Chicago: Aldine, 1968).

23. Wakai, K. (2002): �Linking Behavioral Economics, Axiomatic Decision

Theory and General Equilibrium Theory,�Ph.D. Dissertation, Yale Uni-

versity.

31

Download - A Model of Gain/Loss Asymmetry - 京都大学wakai/research/papers/mdis-rev2...A Model of Gain/Loss Asymmetry Katsutoshi Wakai January 2006 Abstract Based on a notion associated with

Top Related