random choice and market demandecon.ucsb.edu/~jabirche/papers/irrational.pdf · classical consumer...
TRANSCRIPT
Random Choice and Market Demand
Javier A. Birchenall∗
University of California at Santa Barbara
September 10, 2020
Abstract
I present a model of choice with no preferential basis or optimizing behavior,
a statistical model of choice. Individual consumption is randomly chosen while
respecting a linear budget set. I show that mean (i.e., market) demands satisfy
all the properties of classical demand theory, including symmetry and negative
semidefiniteness of the Slutsky matrix. The derived comparative statics serve to
introduce a new form of (random) duality to demand theory, characterize market
demands for general probabilistic choice behaviors, and examine the consistency
and statistical power to detect rational behavior in revealed preference tests.
Keywords: demand theory, random choice behavior, aggregation
JEL classification: D01; D11; E03
Communications to: Javier A. Birchenall
Department of Economics, 2127 North Hall
University of California, Santa Barbara CA 93106
Phone/fax: (805) 893-5275
∗I am grateful to audiences at several locations for comments and helpful suggestions. I am specially
grateful to Geir Asheim, Ted Bergstrom, Gary Charness, Luis Corchón, John Duffy, Martin Dufwenberg,
Enrique Fatas, Raya Feldman, Eric Fisher, Rod Garratt, Zack Grossman, John Hartman, David Hinkley,
Marek Kapicka, Tee Kilenthong, Natalia Kovrijnykh, John Ledyard, Steve LeRoy, Gustavo Ponce, Cheng-
Zhong Qin, Warren Sanderson, Perry Shapiro, Joel Sobel, Glen Weyl, Eduardo Zambrano, and Giulio
Zanella for useful conversations. The usual disclaimer applies.
1
1 Introduction
Classical demand theory is deterministic. To account for random variation in actual choices,
or to test the implications of revealed preference theory using statistical methods, econo-
mists typically view consumers as boundedly rational. A consumer’s deterministic choices,
or the process by which their choices are made, are often augmented by a random element.
Examples include additive error terms to individual demand functions or additive random
preference shocks to deterministic utility functions. This paper takes these bounded ra-
tionality arguments to the extreme. I study consumer demands derived from a statistical
model of choice where individual choices are entirely made at random.
A statistical model of choice has no preferential basis and there is no optimizing be-
havior at any stage of the decision process. I consider a continuum of individuals that
select their consumption over a linear budget set by drawing random choices from a con-
tinuous probability density function. This choice behavior is an extreme form of bounded
rationality. Individuals are actually “irrational” in the sense used by Becker [7]. Becker
[7] demonstrated by example that compensated price changes sometimes force “irrational”
individuals to, on average, satisfy the Law of Demand. Characterizing a mean (i.e., market)
demand function derived from a statistical model of choice was an important contribution
of Becker [7].1 Becker’s [7] example, however, is impractical and restrictive. It relies on
random choices drawn from a uniform distribution, two commodities only, and non-satiated
individual choices.2
In this paper, individual choices are drawn from a general probability density function, an
arbitrary number of commodities, and for demands that lie in the interior or the boundary
of the budget set when the “last” commodity is left as a residual. I show that market
1Hildenbrand ([32], p. 34) remarked “I do not know of any successful alternative for modelling the
dependence of [demand] on [prices]. There is, of course, the well-known example of Becker.” He also
pointed out some limitations in Becker [7]; particularly, the use of uniform distributions. Varian ([59], p.
105) remarked “there seem to be few alternative hypotheses other than Becker’s that can be applied using
the same sorts of data used for revealed preference analysis.”2The uniform distribution is special. Some aggregation properties in Caplin and Spulber [11] follow
entirely because of their use of uniform distributions. Using two-commodities is also special for demand
integrability; see Katzner ([35], Theorem 4.1-2). Finally, non-satiation is responsible for downward sloping
indifference curves and adding up in classical demand theory. Demsetz ([16], p. 489) was agnostic: “as
imaginative and informative as [Becker’s] arguments are, they cannot be sustained in full without appealing
to rationality.”
1
demands satisfy the compensated Law of Demand quite generally. Indeed, I show that
market demand functions in a statistical model satisfy all the properties of classical demand
theory, including symmetry and negative semi-definiteness of the Slutsky matrix (Theorems
1 and 3). This last property is the defining feature of classical demand theory. Mean
demands in a statistical model of choice are thus indistinguishable from classical demands.
Mean demands can be rationalized as being the result of utility maximization, even when
interior (Theorems 2 and 4).
The main theorems yield several insights. First, mean demand integrability implies that
classical demand functions can be derived either from randomizing or optimizing behavior
(i.e., utility maximization or expenditure minimization). The paper hence introduces a
new (random) dual representation to classical demand theory. While the characterization is
partial, I make headway into the formulation of random duality in demand theory. Second, a
statistical perspective on choice behavior yields comparative statics that characterize a wide
range of probabilistic choice models. It is possible, for instance, to incorporate restrictions
based on behavioral postulates or choice heuristics. An example listed here is Luce’s [42]
independence from irrelevant alternatives, but many other examples based on preferential
and ‘pure’ (i.e., non-preferential) random choice behaviors are possible.3
Third, statistical models of choice have been used to derive statistical power to detect
rational behavior in revealed preference tests; see, e.g., Varian ([57], [58], and [59]), Bronars
[9], and Echenique et al. [18]. In statistical testing, consistency is generally viewed as a
necessary condition. It ensures that hypotheses can be distinguished in sufficiently large
samples. The statistical model of choice provides an alternative hypothesis to revealed
preference theory.4 It also provides power calculations. Power calculations have used the
uniform distribution, the logistic distribution, and random perturbations from observed
3The model does not differentiate between preferential and ‘pure’ (i.e., non-preferential) random choice
behavior. Traditional decision theory considers ‘pure’ randomizing behavior useful when preferential choices
have limited value; see, e.g., Elster [20] and Moore [49]. Recent laboratory and field experiments that
showcase the value of randomizing behavior in individual decision-making settings include Agranov and
Ortoleva [1] and Levitt [40].4Revealed preference tests abound including ”irrational” (i.e., non-standard) populations such as animal
subjects (Kagel et al. [34] and Chen et al. [12]), psychiatric patients (Cox [14]), children (Harbaugh et al.
[31]), and individuals under the influence of alcohol (Burghart et al. [10]) to name a few.
2
choices; see, e.g., Bronars [9], Choi et al. [13], and Andreoni et al. [4].5 I find here that re-
vealed preference tests and tests based on the amount of money pumped out of non-rational
individuals have low (asymptotic) power. Asymptotic power is generally independent of the
distribution function used in the alternative hypothesis (Corollaries 3 and 4).
Many other literatures have made use of statistical models of choice. In the spirit of
Becker [7], Grandmont [29] derived through aggregation market demands of the Cobb-
Douglas type (see also Hildenbrand [32] and Kneip [37]). Their positive income effects
ensure equilibrium uniqueness. I show that positive income effects are not general to the
statistical model of choice. In Becker [7], positive income effects follow due to the log-
concavity of the uniform distribution (Corollary 5). I provide a general characterization of
income effects (Corollary 2). Gode and Sunder ([25], [26]), in another application, used a
double auction to study the allocative efficiency of market mechanisms for zero-intelligence
traders, also inspired by Becker [7]; see Duffy [17] for a discussion of agent-based modeling
and decision-making in economic experiments. This paper suggests that the literature’s
reliance on a uniform distribution to model zero-intelligence traders is unnecessary.
2 Simple Examples
This section presents simple examples to illustrate the intuition behind the general theorems.
I consider an economy with two infinitely divisible commodities, 1 and 2. Their prices
= (1 2) and income are given and are assumed to be strictly positive.
There is a continuum of individuals whose choices must respect a linear budget set
() ≡ ©(12) ∈ R2+ : 0 ≤ 11 + 22 ≤ ª. (1)
The only choice is 1. Once 1 is chosen, the remaining income is spent in 2. (Simul-
taneous choices are discussed below.) The model of choice is statistical : 1 is selected at
random by draws from a distribution function (1) with density (1).
5Varian ([57], [58], and [59]) provides an overview of revealed preference theory. Bronars [9] first applied
Becker’s [7] model to derive statistical power. Choi et al. [13] considered boundedly rational individuals that
follow Luce’s [42] choice axioms; see also Andreoni et al. [4] for many alternatives. Beatty and Crawford
[6] discuss some nonstatistical aspects in revealed preference tests.
3
I assume that (1) is absolutely continuous and that first and second moments are
finite. Measure zero events are neglected throughout and all subsets are assumed non-
empty and measurable. To make sure that some choices are unfeasible, I assume that the
support of (1) is sufficiently large (i.e., possibly infinite). I also assume that (1) is
invariant with respect to ( ). This is the analog of assuming that tastes are invariant to
the economic environment. As in classical demand theory, in which preferences are given
and independent of the budget set, the density function (1) is given and does not vary
with (). Throughout the paper, I maintain these “well-behaved” requirements for all
other densities from where choices are drawn.
The Law of Demand. The maximum quantity of 1 that individuals can afford
in (1) is Xmax1 ≡ 1. Therefore, statistical choices must lie in the restricted choice set
0 ≤ 1 ≤Xmax1 . Let (Xmax
1 ) ≡ Pr{1 : 0 ≤ 1 ≤Xmax1 }. The mean demand for 1 is:
1() ≡ E [1| 0 ≤ 1 ≤Xmax1 ] =
Z max1
0
1(1)
(Xmax1 )
1. (2)
Given1,2 satisfies 22 = −11 so its mean demand is 2() = (−11())2.Let ·| denote the standard Slutsky compensation, = 1( )1.
6
Proposition 1 Suppose that individuals randomly choose 1 from (1) using a well-behaved
probability density function (1), and determine 2 as a residual. Then, the compensated
own price effects are negative for 1() and 2( ), i.e.,
1()
1
¯
0, (3)
and the Slutsky matrix of substitution effects is symmetric and negative semidefinite.
The proof, as other technical results, is in the Appendix. The relevant derivative for (2)
is:1( )
1
¯
=E [1| 0 ≤ 1 ≤Xmax
1 ]
Xmax1
µXmax
1
1
¯
¶, (4)
6When 1 changes, mean demand changes to 1(1 + 1 2 + ), where = 1()1 is the
Slutsky compensation. Income compensations are not so that everyone can afford mean demands but so that
individuals can, on average, afford their original bundle. Obviously, a Hicksian compensation or standard
duality results are not possible here because demands do not rely on utility functions.
4
so (3) follows as each term in (4) is given by:
E [1| 0 ≤ 1 ≤Xmax1 ]
Xmax1
=(Xmax
1 )
(Xmax1 )
(Xmax1 − 1( )) 0, and (5)
Xmax1
1
¯
= −Xmax1 − 1()
1 0. (6)
In Proposition 1, the (compensated) Law of Demand for 1() holds due to a statis-
tical fact and an economic fact. First, 1(), a right-truncated mean, is lower than the
truncation point Xmax1 and its value increases as the truncation point increases; see (5).
Second, even after taking into account the income compensation, the maximum amount of
1 that can be purchased declines as 1 increases; see (6).
Figure 1 illustrates Proposition 1. Values of 1 that exceed Xmax1 are not feasible so
(1) is truncated atXmax1 , i.e., the dark-shaded area to the right ofXmax
1 is not feasible at
(). When 1 increases, the maximum quantity of 1 that individuals can afford declines
to Xmax 01 . Once income is compensated, the truncation point changes to Xmax
100 Xmax
1 .
A compensated increase in 1 thus reduces Xmax1 and the mean demand declines from
1 ≡ 1(1 2 ) to 001 ≡ 1(
01 2
0).7
Some remarks. Proposition 1 does not depend on functional form assumptions on
(1). Becker [7] assumed a uniform distribution with (1) = 1+ and support [0 +], so
1() =1
(1)
Z 1
0
11 =1
2
1, (7)
and 2() = (12)(2). These demands coincide with those of a symmetric Cobb-
Douglas utility. (A power density (1) = (+)(1
+)−1 yields non-symmetric Cobb-
Douglas demands for 6= 1.)Proposition 1 examines mean demands but other aggregates also satisfy the Law of
Demand. For example, let med1 ( ) ≡ −1( (Xmax1 )2) denote 1’s median demand.
Since med1 ()Xmax1 = (12)(Xmax
1 )(med1 ( )) 0, median demands increase
7The only generalizations of Becker [7] I have been able to find are in Sanderson ([52], [53]). They assume
that an increase in 1 expands the choice set in a first-order stochastic sense. Since the relevant choice
distribution is (1) (Xmax1 ), the orderings assumed in Sanderson ([52], [53]) are always obtained here:
(1) (Xmax1
0) stochastically dominates (1) (Xmax1 ) in the first degree sense in [0Xmax
10].
5
6
-
(1)
1
2
Xmax1Xmax
100Xmax
101
2
001
002
2
02
eeeeeeeeeeeeeee
eeeeeeeeeeeeeee
eeeeeeeeeeeeeee
ZZZZZZZZZZZZZZZZZZZZ
ZZZZZZZZZZZZZZZZZZZZ
ZZZZZZZZZZZZZZZZZZZZ
eeeeeeeeeeeeeeeeeee||||
||||||||
|
rr
Figure 1: Mean demand for 1, drawn from (1). Initial choices are truncated by Xmax1 .
As prices change, the truncation point is Xmax 01 . For compensated changes, the truncation
point is Xmax 001 .
with Xmax1 hence they satisfy the Law of Demand.
The Law of Demand holds on average but not for every realization. When 1 increases,
the fraction of individual demands given by (1())− (0) satisfies the Law of Demandbut a fraction (Xmax 00
1 ) − (1()) violates it since demands increase for these indi-
viduals; see Figure 1. These violations have been used to measure the statistical power of
revealed preference tests. I return to this application of statistical choices in Section 5.
As in any demand function, uncompensated price effects satisfy the standard Slutsky
equation,1()
1=
1( )
1
¯
− 1()1( )
, (8)
with income effects given by 1() = E [1| 0 ≤ 1 ≤Xmax1 ] Xmax
1 (11) 0.
Hence mean demand 1() is normal (Figure 1), but positive income effects for 2()
are not general. A sufficient condition for 2( ) to be a normal good is that (1) is log-
concave; see Appendix. This condition is implicit in Becker [7] as the uniform distribution
is log-concave.
6
001
002
QQQQQQQQQQQQQQQQQQQQ
QQQQQQQQQQQQQQQQQQQQ
QQQQQQQQQQQQQQQQQQQQ
eeeeeeeeeeeeeeeee
6
- 1
2
Xmax1Xmax
1001
2
Xmax2 (0)
Xmax2
00(0)
(1 2) =
¡¡ª
rr
Figure 2: The Law of Demand for interior demands and = 2 when 1 is stochastically
decreasing in 2.
Mean demands are homogeneous (of degree zero) in (), add up to income, and
symmetric with respect to compensated cross-price changes. Means demands can even be
rationalized as the outcome of optimizing behavior. Integrability, however, is special under
two commodities because there is no room for non-transitive behavior; see, e.g., Mas-Colell
et al. ([45], p. 36 and Exercise 2.F.15) and Katzner ([35], Theorem 4.1-2). I provide general
integrability results in Section 3.
Interior demands. Suppose now that individuals choose 1 and 2 simultaneously
with draws from a distribution function (1 2) with density (1 2). The maximum
feasible consumption of 1 is still Xmax1 ≡ 1. The maximum feasible consumption of
2 depends on 1, as in 2Xmax2 (1) = − 11 or 2X
max2 (1) = 1[X
max1 − 1], for
1 ≤ Xmax1 . Its bounds are Xmax
2 (0) = 2 and Xmax2 (Xmax
1 ) = 0. Hence, for feasibility,
one now needs to consider choices drawn from
(Xmax1 Xmax
2 (1)) ≡ Pr{(12) : 0 ≤ 11 + 22 ≤ }
=
Z max1
0
Z max2 (1)
0
(1 2) 21. (9)
7
Mean demands 1( ) and 2( ) rely on the truncated distribution (9). Adding up
is violated (a.s.), but 1( ) and 2() satisfy all other properties of classical demand
theory. Figure 2 illustrates the Law of Demand for interior demands using a contour of
(1 2). A compensated increase in 1 ‘pivots’ the budget line and removes the dark-
shaded triangular area along (Xmax 001 Xmax
1 ). The area along (Xmax2 (0)Xmax 00
2 (0)) is
added. The Law of Demand holds because the area that is removed once 1 increases favors
high values of 1 whereas the area that is added favors low values of 1.
Figure 2 assumes that 1 and2 are “negatively associated.” It is possible to see that as
income increases, the feasible set favors more low values of 1. This yields inferior demands
for 1() and normal demands for 2(). If 1 and 2, are “positively associated,”
both goods would be normal. I formalize these observations later on. I was unable to find
conditions to make both demands inferior, an impossibility for classical demand theory.
3 General Theorems
This section will deal with generalizations of the previous examples. Let ≡ (1 )
denote the commodity vector and let ≡ (1 ) be the corresponding price vector.
The budget set,
() ≡ © ∈ R+ : 0 ≤ · ≤
ª, (10)
may hold as an inequality depending on whether one or more commodities are randomly
chosen or determined as a residual. I first study a system of mean demands where the “last”
commodity is chosen randomly. The corresponding analysis for the case when is
determined as a residual, which is less general, follows.
3.1 Interior demands
Choices are simultaneously drawn from (1 ) ≡ Pr(1 ≤ 1 ≤ ) with
density (1 ). To ensure that individual choices respect (10), (1 ) are drawn
from
(Xmax) =
Z max1
0
Z max (1−1)
0
(1 ) 1, (11)
8
where Xmax ≡ (Xmax1 Xmax
(1 −1)) is the vector of maximum feasible consump-
tions.8 The vector Xmax satisfies 1Xmax1 = and
Xmax (1 −1) = −1[X
max−1 (1 −2)− −1] , for = 2 . (12)
This vector is ordered: Xmax (1 −1) is a subset of −1X
max−1 (1 −2) be-
cause a positive consumption for commodities ≤ limits the maximum consumption of
commodities . If = Xmax (1 −1), positive consumption for commodities
becomes unfeasible as Xmax (1 −1) = 0 for = + 1 ; see, e.g., (12).
The upper bound for feasible consumptions is Xmax (1 = 0 −1 = 0) = for all
= 1 . These upper bounds are the vertices of the budget hyperplane.
Compensated price effects. The mean demand for commodity is given by
() ≡ E[|0 ≤ · ≤ ], or
() =
Z max1
0
Z max (1−1)
0
(1 )
(Xmax) 1, (13)
with () ≡ (1( ) ()) denoting the demand system.These demands are the truncated means of the random realizations of individual choices.
Mean demands depend on the () because the maximum feasible consumptionsXmax de-
pend on (). I have omitted the dependence ofXmax on ( ) for notational convenience.
The vector of maximum feasible consumption, however, is homogeneous (of degree zero) in
(). As a function of ( ), max( ) = max() for all ( ) and 0. Notice
also that in (11) and (13), the order of integration is irrelevant. Changing it amounts to a
relabeling of the different commodities.9
Compensated price changes are summarized by the × Slutsky matrix of substitution
effects, whose ()-entry is
S() ≡ ()
¯
. (14)
8For a uniform density, (Xmax) is the volume of a simplex, (Xmax) = (!)−1Q
; see Ellis [19].9The order of integration in (11) and (13) can be changed without consequences due to Fubini’s Theorem;
see, e.g., Fikhtengol’ts ([22], Vol. II, Section 344). This change would be somewhat analogous to changing
the order in which first-order conditions are obtained in the classical model.
9
The following theorem characterizes the resulting demand system:
Theorem 1 Suppose that individuals randomly choose subject to (10) using a well-
behaved probability density function (). Then, mean demands () are: interior,
· () ; homogeneous (of degree zero) in (); and the Slutsky matrix S()is symmetric and negative definite.
To simplify notation, let ≡ (1 ) and let () denote the vector when
is excluded, i.e., () ≡ (1 −1). Likewise, let ≡ (1 ) be the vector ofdifferential changes and let () denote this vector when is excluded, i.e., () ≡(1 −1). Finally, let max(()) denote the density () when X
max (()) takes the
place of . That is, max(()) ≡ (()X
max (())), which is only a function of ().
This density is central in all the comparative statics of this paper. The term Xmax (())
plays a central role because it is the only term that depends on the prices of all commodities,
i.e., all changes in ( ) are channeled through Xmax (()).
Proof (overview). Mean demands are (a.s.) interior since exhausting the budget
set is a zero-probability event. Homogeneity follows since Xmax is homogeneous in (),
as previously discussed. Symmetry and negative definiteness in the Slutsky matrix follow
from differential changes to (13). The relevant derivatives are based on repeated differen-
tiation under the integral sign so they are collected in the Appendix. I show there that a
compensated change in on commodity yields
S( ) = −Z max
1
0
Z max−1((−1))
0
[ − ( )][ − ()]
max(())
(Xmax)(). (15)
This term can be written as the second-order moment of linearly independent random
variables. For instance, let Σ be the × variance-covariance matrix of the random vari-
ables (() − ()( )Xmax (())− ()). By definition, then, Σ = [ ] ≡
E[( − ())( − ())| ≤Xmax] so one has
S() = −[], (16)
for = 1 − 1, and with = Xmax (()) when = . Since the variance-
10
covariance matrix Σ is symmetric and positive definite; see, e.g., Fisz ([23], Theorem 3.6.6),
the Slutsky matrix () = −Σ is symmetric and negative definite.Individual and mean demands are interior, but ( ) is homogeneous in (). As in
classical demand theory, this is a feature of the budget set (10). Symmetry and negative
definiteness are technical properties that have an intuitive interpretation. Individual choices
are random variables with a well-behaved variance-covariance matrix. The Slutsky matrix
S( ) is the negative of an × positive definite variance-covariance matrix. Hence
S( ) is symmetric and negative definite.Integrability. There are no preferences or optimizing behavior behind the demand
system (). Mean demands are actually interior so they would violate non-satiation.
Still, as the following theorem shows, () can be rationalized as being the result of the
maximization of some utility function:
Theorem 2 Under the conditions of Theorem 1, there exists some continuous, non-decreasing,
and quasi-concave utility function : R+1+ → R such that ( ) and 0( ) ≡ − ·
() are the unique solution to max0≥0 {( 0) : · + 0 = }.
Proof (overview). The system () is an incomplete demand system whose Slutsky
matrix S( ) satisfies the integrability conditions in Epstein [21] (see also LaFrance andHanemann [39]).
Incomplete demand systems can be rationalized if S() permits so. Theorem 2 con-
siders a “residual” commodity 0 with 0 = 1. The role of 0 is to exhaust the budget set
in order to determine a constant of integration for the expenditure function associated with
( 0). Not all incomplete demand systems can be rationalized. For integrability, S()must be symmetric and negative definite, as in Theorem 1. Negative definiteness in S()cannot be weakened for semidefiniteness; see Epstein ([21], Example 1).
3.2 Non-interior demands
As in Becker [7], and the simple example depicted in Figure 1, assume now that individuals
choose () commodities randomly but the “last” commodity is a residual. The density
11
function is (1 −1), which is only a function of () arguments. Mean demands
()() are given by (13) and, by construction, () = ( − () · ()()).
Theorem 3 Suppose that individuals randomly choose () subject to (10) using a well-
behaved probability density function (()). The “last” commodity is determined as a
residual. Then, mean demands (): add up to income, · () = ; are homoge-
neous (of degree zero) in (); and the Slutsky matrix S() is symmetric and negativesemidefinite.
The proof is an adaptation of the proof of Theorem 1. In contrast to Theorem 1, the
Slutsky matrix S( ) is now symmetric and negative semidefinite instead of just symmetricand negative definite. This is so because the “last” commodity is redundant. Knowing
the individual realizations of demands for commodities () and the budget set (10) is
enough to determine individual and mean demands for the “last” commodity .10 Since
the random variables()−()( ) andXmax (())−() in (16) are linearly related,
their variance-covariance matrix Σ is positive definite but singular; see, e.g., Fisz ([23],
Theorem 3.6.6).11 As Σ is positive semidefinite, the Slutsky matrix ( ) is symmetric
but only negative semidefinite.
As before, there is no optimizing behavior at any stage, but ( ) can rationalized :
Theorem 4 Under the conditions of Theorem 3, there exist some continuous, non-decreasing,
and quasi-concave utility function : R+ → R such that ( ) is the unique solution to
max≥0 {() : · = }.
Interior and non-interior demands ( ) derived from randomizing behavior can be
rationalized as the outcome of maximizing behavior. A “residual” commodity 0 in the
incomplete system may seem restrictive but rational behavior is typically tested in terms of
expenditure and not in terms of income. Since it is not generally possible to study all the
goods relevant for an individual, empirical tests of revealed preferences effectively analyze
10In classical demand theory, homogeneity and adding up also imply that the negative semidefiniteness
cannot be extended to negative definiteness; see, e.g., Mas-Colell et al. ([45], Proposition 2.F.3).11For example, when = 2 and2 =X
max2 (1), as in the non-interior demands in Section 2,X
max2 (1)−
2() becomes −(12)(1 − 1()) which is a linear in (1 − 1()).
12
incomplete demand systems.12 Of course, if only prices and demands are observed, but
not income , one can define 0 so that 0 ≡ · ( 0).
4 Some Extensions
This section discusses some simple extensions of the previous findings.
Bounded rationality “errors.” Applied consumer demand studies and revealed
preference tests typically augment deterministic individual demands with small “errors.”
These errors are seen as a random component in the decision process (i.e., mental errors)
or as additive measurement error in prices or individual choices; see, e.g., Choi et al. [13],
Echenique et al. [18], Lewbel [41], and Varian [58].
Individual demands here do have a “rational” and a “error” component. Deviations from
“rational” behavior are also bounded. Let denote = 1 independent sample real-
izations of an individual’s vector of demands. I assume that the realizations hold ( ) fixed.
The sample average for the individual demand for is defined by ≡ −1
X
=1
.
Corollary 1 Individual demands for in the statistical model of choice have a “rational-
plus-error” representation
= ( ) +
, (17)
with ≡
− ( ) and E[
] = 0. Moreover, for a given 0, individual demands
satisfy
Prn¯
− ( )¯ o
[]
2. (18)
Proof. Expression (17) is a standard construction and (18) is Chebyshev inequality.
By construction, a sample of individual demands generates a random variable that de-
viates from a “rational” mean demand by a purely stochastic “error” term. Since mean
demands can be rationalized by a well-behaved utility function, individual choices in the sta-
tistical model also admit a random utility representation given by () = (())+ ,
12Applied work relies on separability, composite goods, or an incomplete system of demands. Conventional
revealed preference tests, for example, do not test non-satiation. It is typically assumed to avoid trivial
data rationalizations; see, e.g., Varian ([57], p. 969).
13
where ≡ (() + ) − (()) and ≡ − () with E[] = E[()] −(E[]) 0. Corollary 1 thus shows that purely random choice data can be represented
as a rational-plus-error demand function, or as random utility model in which an individ-
ual’s utility () results from perturbations to a deterministic indirect utility function
(()) or from perturbations to a deterministic demand function ().
Income effects. As in classical demand theory, no special restrictions on () are
needed in Theorems 1 and 3. Income effects, however, cannot be unambiguously signed.
Income effects depend on the association between commodity and the “last” com-
modity . The random variables ( ) are said to be “positively associated” if one
is more likely to observe and taking larger (or smaller) values together than any
mixture of these. For 0 and 0 , while the remaining − 2 variables are fixed,“positive association” restricts () according to (0
0)( ) (0 )(
0);
see Shaked and Shanthikumar ([54], Section 9D).
Corollary 2 Under the conditions of Theorem 1, suppose that and are “positively
(resp. negatively) associated” random variables. Then, income effects for () are posi-
tive (resp. negative).
Proof (partial). For , with = 1 − 1, mean demand () satisfies:
()
=
Z max1
0
Z max−1((−1))
0
[ − ()]
max(())
(Xmax)(). (19)
This expression compares E[|() ≤ Xmax() = Xmax
(())], the mean value of
when takes its highest possible value, with ( ), or E[| ≤ Xmax]. For “posi-
tively associated” random variables, E[|() ≤Xmax() =X
max (())]− () 0,
yielding positive income effects. For = , expression (19) replaces byXmax (()), which
always exceeds ( ). Income effects for are therefore positive.
Income effects depend on the relationship between and . To understand this
relationship, recall the example of random choices over a single good in Figure 1. Income
effects are positive for 1 because 1 is trivially associated with 1. In the two-good case
of Figure 2, income effects for 1 are negative because 1 is “negatively associated” with
14
2. Income effects are positive for 2 since 2 is the “last” commodity.
The “last” commodity is central because changes in are channeled throughXmax (()).
Expression (12) implies that Xmax (()) = −
X−1=1
(), which is increasing in
. As the (hyper) surface Xmax (()) shifts out, the feasible values of expand. When
and are “positively associated,” and the budget set expands, the relevant density
added to determine mean demands for favors high values of , so () increases. For
“negatively associated” variables, the added area favors high values of but low values
of . In such case, () would be an inferior good. As before, it is not possible for all
goods to be inferior. Income effects for the “last” commodity are always positive.
A preferential random choice example. Other than having a sufficiently large
support, the restrictions required from () are in general implied by probability theory. The
previous theorems thus apply to choice probabilities restricted through behavioral postulates
or choice heuristics.
A well-known choice restriction is Luce’s [42] independence from irrelevant alternatives
(IIA). As an illustration, let (|) be the probability that an alternative subset is chosenwhen the set of available alternatives is and IIA holds. Since (|) = (| 0) ( 0|),for ⊆ 0 ⊆ , one has a continuous logit formulation with
(|) =
Z
exp{()}Z
exp{(0)}0, and (|) = exp{()}Z
exp{(0)}0, (20)
where () is a direct utility function associated with reflexive, transitive, and complete
preferences over ∈ .13 The parameter accommodates extremes in the rationality
spectrum. The larger is, the greater the degree of “irrationality.” The limit of (|) as→∞ is the uniform distribution used by Becker [7], and when → 0, the choice problem
becomes a deterministic utility maximization problem; see Anderson et al. ([3], p. 42).
Using (|), it is possible to derive a system of mean demands of the form ( |).These mean demands, holding the set of alternatives constant, satisfy the general theorems
13Continuous choice models can be derived as the infinitesimal limit of discrete choice models; see, e.g.,
Ben-Akiva and Watanatada ([8], p. 327) and McFadden ([46], pp. 311-312). In location theory, continuous
choices are unavoidable.
15
presented before. They, therefore, follow standard comparative statics for compensated price
changes and can be rationalized. This is a more general characterization than Mossin’s [50]
early study of the market demand behavior in Luce [42].14
Non-preferential random choice behaviors also satisfy the previous theorems.15 ‘Pure’
random choice may arise out of fairness or strategic considerations; see, e.g., Moore [49]
for a classical discussion on the use of an “impersonal and relatively uncontrolled process”
to coordinate social actions. ‘Pure’ random choices may also arise when preferential or
rational choices have limited value; see, e.g., Elster [20]. Agranov and Ortoleva ([1], p.
2) study random choice behavior in laboratory experiments. They argue for deliberate
randomization, especially when subjects are dealing with ‘hard’ (i.e., complex) questions.
Levitt [40] reports happiness changes from following actual random choice behavior in a
field experiment.16
Random duality. Mean demands obtained from the statistical model of choice admit
a random dual representation:
E[|0 ≤ · ≤ ] = argmax∈R+
{() : 0 ≤ · ≤ )}. (21)
The demand system () can hence be derived either from randomizing or optimizing
behavior (i.e., utility maximization or expenditure minimization).
Expression (21) does not imply that any classical demand function can be represented
as the mean outcome of random choice behavior. For instance, a random choice procedure
will (a.s.) lead to interior individual and mean demands. Non-interior demands based on a
14Luce’s [42] formulation, and the more general elimination process studied by Tversky [56], can be rep-
resented as random utility models; see, e.g., Anderson et al. ([3], Chap. 2). Tversky [55], Gonzalez-Vallejo
[28], and Roe et al. [51] are additional examples of preferential choice models based on the comparative eval-
uation of the different alternatives. A recent literature in economics studies sophisticated random choices.
A very small set of examples is Fudenberg et al. [24], Gul and Pesendorfer [30], Machina [43], and Manzini
and Mariotti [44]. These papers focus on individual and not on market behavior.15Individual choices are often represented by “observed” choice probabilities. This is the starting point
of the stochastic revealed preference theory developed by McFadden and Richter [47]. The probability that
individual choices lie in ⊂ () is () = () (Xmax). “Observed” choices here are not required
to satisfy any form of stochastic transitivity. The current statistical model of choice is “irrational” not only
in the standard deterministic sense, but also in the more general stochastic sense.16Anecdotal and ethnographic evidence also exists. McGrath ([48], p. 433) observed male shoppers in
gift shops on Christmas Eve tended “to make large, rapid, spontaneous, and often random purchases.”
16
residual “last” commodity also restrict cross-price effects. In the = 2 example in Section
2, non-interior demands 1( ) cannot vary with 2, as 2 is not part of Xmax1 .
To establish random duality starting with a continuously differentiable classical demand
system ∗( ) ≡ (∗1() ∗( )), one requires a density ∗() that satisfiesZ max1
0
Z max (())
0
[ − ∗()] ∗() = 0, for = 1 . (22)
Expression (22) is an orthogonality condition. The density ∗() must exist and be
independent of (). It should also integrate to one over a (sufficiently large) support.
In order to induce a classical demand system, the density ∗() must be such that mean
demands drawn from ∗() agree with ∗() along a ( )-expansion path.
Since () is a truncated mean, random duality requires characterizing a probability
distribution function in terms of its truncated mean, as in Kotz and Shanbhag [38]. By
homogeneity, ∗() can be written as a function of the extreme points of the budget
set (10). These points are Xmax1 = 1 and X
max (1 = 0 −1 = 0) = for
= 2 . Classical demands ∗() can then be written as ∗() ≡ ∗(1 ) with
≡ . As changes, ∗() change and so does the maximum feasible consumptions
that determine (). (The dependence ofXmax on is implicit to avoid cluttered notation.)
Differentiate (22) for commodity with respect to , with 6= . Then, the relevant
density ∗() can be constructed using the inversion formula for truncated means, as in
Z max1
0
Z max−1((−1))
0
Z max (())
0
[ − ∗()]∗() =
∗()
∗(). (23)
This generates a system of partial differential equations. By simultaneously tracing the
-paths, one can trace a density ∗() that induces ∗(). (For the “last” commodity,
the relevant term is of the form [Xmax ((); ) − ∗()]. If demands are non-interior,
Xmax (()) = ∗() for all() so the “last” equation is redundant.) Unfortunately, because
each affects multiple terms in Xmax, it is not generally easy to solve the system (23),
even when = 2. I describe the = 2 case under interior demands in the Appendix, but
the complete characterization of random duality requires a dedicated paper.
17
Even though mean demands can be rationalized, utility functions and welfare measures
have no content in a statistical model of choice. There is no conflict between the “represen-
tative agent” and the disaggregated individuals, as in Mas-Colell et al. ([45], Section 4D). In
here, there is no preferential or behavioral basis for normative assessments. As Mas-Colell
et al. ([45], pp. 121-122) note “[t]he moral of all this is clear: The existence of preferences
that explain behavior is not enough to attach them any welfare significance. For the latter,
it is also necessary that these preferences exist for the right reasons.”
5 Statistical Power and Rational Choice
Instead of starting from a choice behavior, revealed preference theory confronts observed
individual choices in order to infer an individual’s preferences. In this section, I apply the
general theorems to measure the statistical power to detect rational behavior in revealed
preference tests. A bundle is weakly revealed preferred to 0 (i.e., satisfies WARP) if
whenever · ≥ · 0 it is false that 0 · 0 0 · ; see, e.g., Mas-Colell et al. ([45], Chap.2). I consider compensated price changes to ensure testability.17
Revealed preference theory is non-statistical. Thus I formulate the null and the alterna-
tive hypotheses as:
0 : Choices are derived from a deterministic demand function in which no ‘true’
violation of WARP exists but there are (additive) errors to the consumer’s choices.
: Choices are drawn from a statistical model of choice according to a distribution
function ().
The weak axiom is the simplest possible setting for testing of rational economic behavior.
Demands that satisfyWARP do not require symmetry in cross-priced effects and hence allow
for non-transitive behavior. Consistency with utility maximization requires a generalized
axiom of revealed preference (GARP). I present some remarks about GARP later on.
17Tests are uninformative if budget sets do not intersect; see Beatty and Crawford [6]. Intersecting
budget sets is a standard assumption; see, e.g., the equal marginal utility of income (EMUI) assumption in
Echenique et al. [18]. Testability is discussed in detail by Andreoni et al. [4].
18
Tests on the size of the violations. Given a pair of consumption bundles of the
form () and (0 0), the statistic = · (− 0)+0 · ( 0−) measures the potentialprofits or losses of an “arbitrager” that exploits violations of rationality. The positive part
+ ≡ max{0 } and the negative part − ≡ max{− 0} denote the money pump and themoney drain, respectively. The significance of violations of rationality can be measured by
the amount of money ‘pumped out’ of individuals that violate WARP; see Echenique et al.
[18]. Because the asymptotic properties of the money pump + agree almost completely
with those of the -statistic, I first consider tests based on .
Under 0, = ( − 0) · (∗( ) − ∗(0 0)) + ( − 0) · ( − 0), where ∗( ) and
∗(0 0) represent the unobserved ‘true’ choices and and 0 are distributed according to
N ( 2). The test is one-sided and rejects 0 if the -statistic exceeds a critical test-value.
The distribution of the -statistic under the null cannot be calculated without additional
assumptions on the ‘true’ choices. As first explored by Varian [57], under the null of no ‘true’
violations and normality, there is an upper bound statistic given by = (− 0) · (− 0),
with ≥ and with normally distributed with 2 ≡ 2·k− 0k2 ·2 . Given a significancelevel and a value for 2, one can calculate a critical test-value 0 from the normal
distribution according to Pr{ |0} = N ( : { }) = . For the size of the test,
one has Pr{ |0} ≤ .
Define the power of tests based on the -statistic as Pr{ |}. To characterizeasymptotic power, let { : = 1 } denote the sample data. The obser-
vations are obtained at ( ) and the observations at (0 0). Let ≡ ( ) =
( − 0) · ( − ) denote the sample -statistic for the ( )-pair. The -statistic can
be computed for a total of ( − 1)2 pairwise sample comparisons. I consider a simple
matched sample average of the -statistic, ≡ −1X=
, or
= −1X
=(− 0) · ( −). (24)
The -statistic under is standard: converges (weakly) to ≡ (−0) · (()−
(0 0)) ≤ 0 (i.e., → when → ∞), where the inequality follows from Theorems
19
1 and 3. Given 2 ≡ 2 · k− 0k2 · Σ, with Σ as the variance-covariance matrix of and
[ ] = 2 , Chebyshev’s inequality bounds the power function:
Pr{ |} 22 +( − )
. (25)
This upper bound depends on the first two moments of the -statistic under the alternative:
the more “rational” the statistical model, the less powerful the test would be, i.e., lower
values of and/or 2 (a small price change k− 0k and/or a small variance in ) yield
lower statistical power.
The “money pump” statistic. Tests based on the money pump are related to the
-statistic. Under the null, the +-statistic is + = max{0 (−0) · (∗()−∗(0 0))+
(− 0) · (− 0)} and the critical values for + ≡ max{0 } are given by + = max{0 (−0) · ( − 0)}. The critical values are defined analogously as Pr{+ +
|0} = N (+ :{+ +
}) = , which relies on a truncated normal distribution. Define the power of
tests based on the money pump as Pr{+ + |}. Statistical power for the and the
+ statistics decreases with the size of the critical area and + ; see, e.g., (25). Also, the
power function of tests based on + cannot be smaller than the power of tests based on the
-statistic, i.e., Pr{+ +
|} ≥ Pr{ + |} since
+ = + − by definition.
In the limit as →∞, however, both tests behave similarly:
Corollary 3 For a given and well-behaved distribution function () in , the asymptotic
power of revealed preference tests based on the -statistic and the money pump +-statistic
satisfy lim→∞Pr{ |} = lim→∞ Pr{+ +
|} = 0.
Proof. The proof follows from the one-sided Chebyshev inequality (25). Further, notice
that lim→∞ Pr{+ +
|} ≡ lim→∞ Pr{max{0 } + |}. Hence the limit for
the power function of the +-statistic follows from the continuous mapping theorem, i.e.,
lim→∞ Pr{max{0 } + |} = max{0Pr{lim→∞ } +
|} = 0. (The maxfunction is continuous although not differentiable.)
The -statistic and the money pump are inconsistent. A statistical test is consistent if
its power against an alternative hypothesis tends to one as → ∞. (This is a standard
20
requirement in hypothesis testing.) The inconsistency of these tests is not due the use
of a particular distribution function under the alternative hypothesis . At any level of
significance , there is no well-behaved distribution function () that generates a consistent
test of individual rationality based on the - or the +-statistics.
Tests on the number of violations. Tests can also be based on the number of
revealed preference violations. Assume without loss of any generality that 0 . Consider
the bundles () and (0 0) but study them in relation to (). The test involves
pairwise comparisons between and (), and between 0 and ().
There are two kinds of violations: 0· 0 might exceed 0·() thus revealing a violationof 0 being weakly preferred to (), or · () might exceed · thus revealing a
violation of ( ) over . In the case of = 2 in Figure 1, these violations lie in the
segments [2 1] and [1Xmax 001 ]. An indifference curve tangent to these segments would
be concave to the origin leading to an inconsistency with the weak axiom.
The sample distributions of these kinds of violations depend on their frequency, as in
Π+{} ≡ #{ · (
− ( ))}+
, and Π−{} ≡ #{− · (
− ())}+
. (26)
The sample distribution of a violation of either the first or the second kind (or of both) is
given by
Π{()} = Π+{}+ Π
−{}− Π+{}Π
−{}.
The test statistic Π under 0 must be such that the probability of observing a ‘true’
violation should be zero. The appropriate test is also one-sided and rejects the null if the
number of observed violations exceeds some critical test-value. Under the null,
Π+{ 0|0} = N (0 : {∗(0 0) + 0 ( )}), andΠ−{|0} = N ( : {() ∗() + }).
The size of the test can be determined (or at least bounded) as before so thatΠ{( 0)|0} ≤, where is the significance level. Define the asymptotic power of the test asΠ{( 0)|},or as the value of the test statistic Π under .
21
Corollary 4 For a given and well-behaved distribution function () in , the asymptotic
power of revealed preference tests based on the Π-statistic satisfies
Π{( 0)|} = 1− (( ))[ (Xmax)− (( )) + (0)]
(Xmax 00) (Xmax). (27)
Proof. Under , the power components in (26) converge (strongly) to
lim→∞
Π+{ 0|} = 1− (( ))
(Xmax 00), and lim
→∞Π−{|} = (( ))− (0)
(Xmax), (28)
where Xmax 00 corresponds to a budget set (0 0). Since the power function is given by
Π{( 0)|} ≡ lim→∞ Π{( 0)|}, (27) follows from simple rearrangements.
Tests based on the number of violations are also inconsistent. The asymptotic power for a
test based on Π{( 0)|} equals one if and only if (( )) = 0 or (())− (0) = (Xmax). These conditions essentially require () to be insensitive to price changes. As
in Corollary 5, at any level of significance , there is no well-behaved distribution function
() that generates a consistent test of individual rationality based on the Π-statistic.
Asymptotic power for the Π-statistic, however, is sensitive to the distribution function in
the alternative. To maximize power, (()) = [ (Xmax) + (0)]2, so at most
Π{( 0)|} = 1− [ (Xmax) + (0)]2
4 (Xmax 00) (Xmax),
with the highest value given by (Xmax) = 1 and (0) = 0. Power would increase further
if (Xmax 00) is close to (Xmax). As (Xmax 00) → 1, power approaches 3/4. In contrast,
if (Xmax 00) = 14, asymptotic power would be zero.
Some remarks. A test of individual rationality is consistent if it can, with unlimited
observations, perfectly discriminate between demands that satisfy WARP and demands
based on the statistical model of choice. For a given probability distribution function ()
in , the statistical model of choice should eventually make only choices that violate the
weak axiom.18 This contradicts Theorems 1 and 3, and Corollary 1.
18Consistency in hypothesis testing is defined in a large-sample limit. In numerical examples available in
an Appendix not for publication, I find that random choice behavior leads to well-behaved demand functions
for realistic sample sizes. Inconsistency seesm to be a practical concern.
22
The previous tests are problematic. The null and the alternative hypotheses are not
mutually exclusive. The reason is that 0 and are not standard statistical hypotheses,
i.e., 0 and are not nested. (This is why statistical power is not bounded by size, as in
standard statistical hypothesis testing.) The issue is that economic theory does not impose
any meaningful restriction on the stochastic behavior of individual demands under 0. One
can assume that prices may be mismeasured, that measurement error is multiplicative, or
that there is a stochastic component in the decision process. These specifications change the
meaning of ‘small’ failures under the null but they do not affect power under the alternative
hypothesis.
The literature has considered additional rationality indices.19 Afriat’s efficiency index
measures the maximum “margin of error” across violations of the weak axiom (Echenique et
al. [18] and Varian [58]), theminimum cost index measures the monetary cost of breaking all
revealed preference violations in a given data set (see Dean and Martin [15]), and the swaps
indices measure the number of “choice swaps” required to rationalize choice inconsistencies
in a given data set (Apesteguia and Ballester [5]). Even the Slutsky matrix can be used
to assess individual rationality (Aguiar and Serrano [2]). These and similar deterministic
indices can be used to formulate statistical tests of rational behavior but it seems unlikely
that they will yield consistency seeing that the statistical model delivers, on average, well-
behaved demand functions.
I examined cycles of length two and not the more general cycles associated with the
generalized axiom of revealed preferences (GARP). A data set satisfies GARP if for each
pair of bundles and with = 1 if · ≥ · then it is false that
· · . Given a sample sequence of observations of the form 1 2 , one
can generalize the - and +-statistics as T = −1P
=1
P
=1 · ( − +1), with
+1 = 1 , and T + = max{0 T }. For a finite sample of size , the power of tests based
on the generalized statistics T and T + cannot be smaller than the power of tests based
on the and +-statistic since longer cycles allow for more revealed preference violations.
19Corollary 3 analyzes the average money pump cost. Echenique et al. [18] also studied the median
money pump cost. I have not studied median demands in general. Median demands in Section 2 satisfy
the compensated Law of Demand. Thus the asymptotic power of tests of revealed preferences based on the
median behavior of the and +-statistics is also likely to be zero.
23
Under the alternative hypothesis, however, mean demands ( ) satisfy GARP since
they are the outcome of maximizing a quasi-concave utility function according to Theorems
2 and 4. Asymptotic statistical power is thus likely to be small for T and T + . Power
calculations for tests of GARP in terms of the number of violations cannot be derived in
closed-form.20
6 Some Final Remarks
This paper derived a well-behaved consumer demand system from a statistical model of
choice with no preferential basis or optimizing behavior. The findings, colloquially and
loosely, represent the economic equivalent of the proverbial typing monkeys. The interpre-
tation of these findings is open and depends on whether one views the glass as half full or
half empty.
On one hand, the findings are reassuring. Psychologists and experimental economists
have raised numerous objections against the consistency of preferences and maximizing
behavior. These objections are valid but not very forceful in a market context. The testable
predictions of classical demand theory can be reached by an alternative route that abandons
all behavioral assumptions. In this defense of the classical model, which confirms and
extends the one advanced by Becker [7], pure behavioral objections are misguided because
the predictive power of demand theory depends primarily on how budget sets change and not
on the psychological or neurological process of reasoning involved in an individual’s decision
making. In effect, the comparative statics derived in this paper can be applied to realistic
random choice behaviors rooted in decision theory and aware of systematic departures from
individual rationality.
On the other hand, the fact that random choices are as predictive as sophisticated ra-
tional choices is troublesome; especially for empirical tests of the behavioral postulates of
rational choice theory. The essence of revealed preference theory is to deduce properties
of an individual’s preferences from observed choices. But statistical tests of revealed pref-
erences are unrevealing since even a naïve (i.e., statistical) model of choice will pass these
20The literature uses simulations to measure power in general cases; see, e.g., Bronars ([9], p. 695).
24
tests with flying colors. Moreover, and this is a difficulty for demand integrability and wel-
fare analyses, successfully recovering an individual’s preferences from observed choice data
does not necessarily represent a meaningful exercise. Observed choices could, in principle,
be rational yet lack normative content. The findings, in short, diminish the economists’
confidence on the rational choice paradigm.
References
[1] Agranov, M. and Ortovela P. (2017) “Stochastic Choice and Preferences for Random-
ization,” Journal of Political Economy, Vol. 125(1), 40-68.
[2] Aguiar, V.H. and Serrano, R. (2017) “Slutsky Matrix Norms and the Size of Bounded
Rationality,” Journal of Economic Theory, Vol. 172, 163-201.
[3] Anderson, S.P., De Palma, A., and Thisse, J.F. (1992) Discrete Choice Theory of
Product Differentiation, Cambridge: MIT Press.
[4] Andreoni, J., Gillen, B., and Harbaugh, W.T. (2013) “The Power of Revealed Prefer-
ence Tests: Ex-Post Evaluation of Experimental Design,” UCSD Working Paper.
[5] Apesteguia, J. and Ballester, M.A. (2015) “A Measure of Rationality and Welfare,”
Journal of Political Economy, Vol. 123(6), 1278-1310.
[6] Beatty, T.K.M., and Crawford, I.A. (2011) “HowDemanding is the Revealed Preference
Approach to Demand?,” American Economic Review, Vol. 101(4), 2782-2795.
[7] Becker, G.S. (1962) “Irrational Behavior and Economic Theory,” Journal of Political
Economy, Vol. 70(1), 1-13.
[8] Ben-Akiva, M. andWatanatada, T. (1981) “Application of a Continuous Spatial Choice
Logit Model,” in Manski, C.F. and McFadden, D. (Eds.) Structural Analysis of
Discrete Choice Data with Econometric Applications, Cambridge: MIT Press.
[9] Bronars, S.G. (1987) “The Power of Nonparametric Tests of Preference Maximization,”
Econometrica, Vol. 55(3), 693-698.
[10] Burghart, D.R., Glimcher, P., and Lazzaro, S.C. (2013) “An Expected Utility Maxi-
mizer Walks Into a Bar...,” Journal of Risk and Uncertainty, Vol. 46(3), 215-246.
[11] Caplin, A. and Spulber, D.F. (1987) “Menu Costs and the Neutrality of Money,” Quar-
terly Journal of Economics, Vol. 102(4), 703-725.
[12] Chen, M. K., Lakshminarayanan, V., and Santos, L.R. (2006) “How Basic are Behav-
ioral Biases? Evidence from Capuchin Monkey Trading Behavior,” Journal of Political
Economy, Vol. 114(3), 517-537.
25
[13] Choi, S., Kariv, S., Müller, W., and Silverman, D. (2012) “Who Is (More) Rational?,”
American Economic Review, Vol. 104(6), 1518-1550.
[14] Cox, J.C. (1997) “On Testing the Utility Hypothesis,” Economic Journal, Vol. 107(443),
1054-1078.
[15] Dean, M. and Martin, D. (2016) “Measuring Rationality with the Minimum Cost of
Revealed Preference Violations,” Review of Economics and Statistics, Vol. 98(3), 524-
534.
[16] Demsetz, H. (1996) “Rationality, Evolution, and Acquisitiveness,” Economic Inquiry,
Vol. 34(3), 484-495.
[17] Duffy, J. (2006) “Agent-Based Models and Human Subject Experiments,” in Tesfat-
sion, L. and Judd, K.L. (Eds.) Handbook of Computational Economics, Vol. 2:
Agent-Based Computational Economics, Amsterdam: Elsevier.
[18] Echenique, F., Lee, S., and and Shum, M. (2011) “The Money Pump as a Measure of
Revealed Preference Violations,” Journal of Political Economy, Vol. 119(6), 1201-1223.
[19] Ellis, Richard S. (1976) “Volume of an N-Simplex by Multiple Integration,” Elemente
der Mathematik, Vol. 31, 57-59.
[20] Elster, J. (1987) Taming Chance: Randomization in Individual and Social
Decisions, University of Utah Press.
[21] Epstein, L.G. (1982) “Integrability of Incomplete Systems of Demand Functions,” Re-
view of Economic Studies, Vol. 49(3), 411-425.
[22] Fikhtengol’ts, G.M. (1965) The Fundamentals of Mathematical Analysis, Lon-
don: Pergamon Press.
[23] Fisz, M. (1963) Probability Theory and Mathematical Statistics, New York:
Wiley.
[24] Fudenberg, D., Iijima, R., and Strzalecki, T. (2015) “Stochastic Choice and Revealed
Perturbed Utility,” Econometrica, Vol. 83(6), 2371-2409.
[25] Gode, D.K. and Sunder, S. (1993) “Allocative Efficiency of Markets with Zero-
Intelligence Traders: Market as a Partial Substitute for Individual Rationality,” Journal
of Political Economy, Vol. 101(1), 119-137.
[26] Gode, D.K. and Sunder, S. (1997) “What Makes Markets Allocationally Efficient?,”
Quarterly Journal of Economics, Vol. 112(2), 603-630.
[27] Goldberger, A. (1983) “Abnormal Selection Bias,” in Karlin, S., Amemiya, T., and
Goodman, L. (Eds.) Studies in Econometrics, Time Series and Multivariate
Statistics, New York: Academic Press.
26
[28] Gonzalez-Vallejo, C. (2002) “Making Trade-Offs: A Probabilistic and Context-Sensitive
Model of Choice Behavior,” Psychological Review, Vol. 109(1), 137-155.
[29] Grandmont, J.M. (1992) “Transformation of the Commodity Space, Behavioral Het-
erogeneity, and the Aggregation Problem,” Journal of Economic Theory, Vol. 57(1),
1-35.
[30] Gul, F. and Pesendorfer, W. (2006) “Random Expected Utility,” Econometrica, Vol.
74(1), 121—146.
[31] Harbaugh, W., Krause, K., and Berry, T. (2001), “GARP for Kids: On the Develop-
ment of Rational Choice Behavior,” American Economics Review, 91(5), 1539-1545.
[32] Hildenbrand, W. (1994) Market Demand: Theory and Empirical Evidence,
Princeton: Princeton University Press.
[33] Joe, H. (1997) Multivariate Models and Multivariate Dependence Concepts,
Chapman & Hall.
[34] Kagel, J.H., Battalio, R.C., and Green, L. (1995) Economic Choice Theory: An
Experimental Analysis of Animal Behavior, Cambridge: Cambridge University
Press.
[35] Katzner, D.W. (1970) Static Demand Theory, New York: MacMillan.
[36] Khuri, A.L. (2002) Advanced Calculus with Applications in Statistics, New
York: Wiley.
[37] Kneip, A. (1999) “Behavioral Heterogeneity and Structural Properties of Aggregate
Demand,” Journal of Mathematical Economics, Vol. 31(1), 49-79.
[38] Kotz, S. and Shanbhag, D.N. (1980) “Some New Approaches to Probability Distribu-
tions,” Advances in Applied Probability, Vol. 12(4), 903-921.
[39] LaFrance, J.T. and Hanemann, W.M. (1989) “The Dual Structure of Incomplete De-
mand Systems,” American Journal of Agricultural Economics, Vol. 71(2), 262-274
[40] Levitt, S. (2020) “Heads or Tails: The Impact of a Coin Toss on Major Life Decisions
and Subsequent Happiness,” Review of Economic Studies, forthcoming.
[41] Lewbel, A., (2001) “Demand Systems with and without Errors,” American Economic
Review, Vol. 91(3), 611-618.
[42] Luce, R.D. (1959) Individual Choice Behavior: A Theoretical Analysis, New
York: Wiley.
[43] Machina, M.J. (1985) “Stochastic Choice Functions Generated fromDeterministic Pref-
erences over Lotteries,” Economic Journal, Vol. 95, 575-594.
27
[44] Manzini, P. and Mariotti, M. (2014) “Stochastic Choice and Consideration Sets,”
Econometrica, Vol. 82(3), 1153—1176.
[45] Mas-Colell, A., Whinston, M. and Green, J. (1995)Microeconomic Theory, Oxford:
Oxford University Press.
[46] McFadden, D. (1976) “The Mathematical Theory of Demand Models,” in Stopher, P.R.
and Meyburg, A.H. (Eds.) Behavioral Travel Demand Models, Lexington, MA:
Lexington Books.
[47] McFadden, D. and Richter, M.K. (1990) “Stochastic Rationality and Revealed Stochas-
tic Preference,” in Chipman, J., McFadden, D., and Richter, M.K. (Eds.)Preferences,
Uncertainty, and Optimality, Essays in Honor of Leo Hurwicz, Boulder: West-
view Press.
[48] McGrath, M. A. (1989) “An Ethnography of a Gift Store: Wrappings, Trappings and
Rapture,” Journal of Retailing, Vol. 65(4), 421-449.
[49] Moore, O.K. (1957) “Divination: A New Perspective,” American Anthropologist, Vol.
59(1), 69-74.
[50] Mossin, A. (1968) “Elements of a Stochastic Theory of Consumption,” Swedish Journal
of Economics, Vol. 70(4), 200-220.
[51] Roe, R.M., Busemeyer, J.R., and Townsend, J.T. (2001) “Multialternative Decision
Field Theory: A Dynamic Connectionist Model of Decision Making,” Psychological
Review, Vol. 108(2), 370-392.
[52] Sanderson, W.C. (1974) “Does the Theory of Demand Need the Maximum Principle?,”
in David, P.A. and Reder, N.W. (Eds.) Nations and Households in Economic
Growth: Essays in Honor of Moses Abramovitz, New York: Academic Press.
[53] Sanderson, W.C. (1980) “A Nonutilitarian Economic Model of Fertility and Female
Labor Force Participation,” Revue économique, Vol. 31(6), 1045-1080.
[54] Shaked, M. and Shanthikumar, J.G. (2007) Stochastic Orders, New York: Springer.
[55] Tversky, A. (1969) “Intransitivity of Preferences,” Psychological Review, Vol. 76(1),
31-48.
[56] Tversky, A. (1972) “Elimination by Aspects: A Theory of Choice,” Psychological Re-
view, Vol. 79(4), 281-299.
[57] Varian, H.R. (1982) “The Nonparametric Approach to Demand Analysis,” Economet-
rica, Vol. 50(4), 945-973.
[58] Varian, H.R. (1990) “Goodness-of-Fit in Optimizing Models,” Journal of Econometrics,
Vol. 46(1-2), 125-140.
28
[59] Varian, H.R. (2006) “Revealed Preference,” in Szenberg, M., Ramrattan, L., and
Gottesman, A. (Eds.) Samuelsonian Economics and the Twenty-First Century,
pp. 99-115. Oxford University Press.
29
7 Appendix A: Omitted proofs and derivations
Proof Proposition 1 (completed). Mean demands for 1 were studied in the text.
Consider the mean demand for 2. Some steps in this case are analogous to the partial
proof in the text. Consider first the own-price effect. By definition,
2()
2
¯
=
µ(2)
2
¯
¶− 1( )
2
¯
1
2− 1()1
22,
with
µ(2)
2
¯
¶=
1( )1
22,1()
2
¯
=E [1| 0 ≤ 1 ≤Xmax
1 ]
Xmax1
2( )
1, so
2()
2
¯
= −E [1| 0 ≤ 1 ≤Xmax1 ]
Xmax1
µ2( )
2
¶ 0.
Moreover, the Slutsky matrix is such that |S()| can be written as
|S( )| = 2()
µE [1| 0 ≤ 1 ≤Xmax
1 ]
Xmax1
¶ ¯ −221 1111 −12
¯,
with |S( )| = 0. Symmetry and negative semidefiniteness hold here. Integrability followsfrom these properties; see Katzner ([35], Theorem 4.1-2).
For the Slutsky equation for 1, notice that the Slutsky compensation at income is
= 1( )1. The Slutsky equation (8) follows by definition. For completeness, notice
thatXmax
1
1
¯
=Xmax
1
1+
1( )
1.
The Slutsky equation (8) is a consequence of this expression as well as of (4). The Slutsky
equation for 2 follows from simple rearrangements. For instance, notice that the uncom-
pensated own-price effect is 2()2 = −2( )2 which rules out Giffen goods.Income effects, however, can be negative for 2(). That is,
2()
=
µ1− E [1| 0 ≤ 1 ≤Xmax
1 ]
Xmax1
¶1
2. (A1)
Corollary 5 Under the conditions of Proposition 1, suppose (1) is a log-concave distri-
bution. Then, 2() is a normal good.
Proof. The proof relies on (A1) and the following fact about log-concave distributions
0 ≤ E [1| 0 ≤ 1 ≤Xmax1 ] Xmax
1 ≤ 1; see Goldberger ([27], Appendix A).
Omitted derivations from the general theorems. In order to simplify the deriva-
tions in this section, I first present an auxiliary Lemma that will be used repeatedly. The
Lemma serves to simplify the calculations of price and income effects. The central implica-
tion of the following Lemma is that in order to determine the response in mean demands to
changes in prices and income, one only needs to evaluate the changes in the most interior
A1
integral. Economically, this result makes sense: recall that Xmax (()) = − () · (),
which is the only maximum feasible consumption that depends on the entire price vector
and income . Hence, any price change will invariably alter Xmax (()).
Lemma 1 Let ( ) ≡Z max
1
0
Z max (())
0
(), where Xmax is given by (12). Then,
( )
¯
=
Z max1
0
"Z max (())
0
()
#¯¯
(),
=
Z max1
0
Z max−1((−1))
0
(()Xmax (()))
µXmax
(())
¯
¶(),
for = 1 .
Proof. The proof is a repeated application of Leibniz’s rule for differentiation under the
integral sign. (The case of = 2 and 1 = 2 = 1 is available in Khuri ([36], pp. 307-308).)
For the general proof, it is enough to consider changes in 1 since this price enters in all
limits of integration. All other price changes can be seen as special cases.
()
1
¯
=
"Z max2 (1)
0
Z max (1−1)
0
(1 )1
#1=
max1
Xmax1
1
¯
+
Z max1
0
1
"Z max2 (1)
0
Z max (1−1)
0
(1 )2
#¯¯
1.
The first term is zero since Xmax2 (Xmax
1 ) = (− 1(1))2 = 0. Further, as noted in
the text, sinceXmax2 (Xmax
1 ) = 0, thenXmax = 0 for ≥ 2 also as there would be no income
left for the consumption of these commodities.
The second term becomesZ max1
0
⎧⎨⎩"Z max
3 (12)
0
Z max (1−1)
0
()2
#2=
max2 (1)
Xmax2 (1)
1
¯
⎫⎬⎭ 1
+
Z max1
0
Z max2 (1)
0
1
"Z max3 (12)
0
Z max (1−1)
0
()3
#¯¯
12,
whose first component is also zero as 3Xmax3 (1X
max2 (1)) = 2(X
max2 (1)−Xmax
2 (1)) =
0. Thus, as before, the first component evaluates a definite integral over a degenerate
interval and this equals zero. The only relevant component is the second one which also
needs to be evaluated using Leibniz’s rule.
By the way the limits of integration are defined, the derivative operator moves toward
A2
high values of . For instance, the -th step of the sequence of integrals is given by
( )
1
¯
= 0 + + 0 ( − 1 times)
+
Z max1
0
1
"Z max (1−1)
0
Z max (1−1)
0
()
#¯¯
1−1,
where the evaluation of the integral with the upper limit of integration Xmax+1 (1 )
evaluated at =Xmax (1 −1) will also equal zero. Moreover, X
max = 0 for all ≥ .
The last term in the sequence is
( )
1
¯
=
Z max1
0
1
"Z max (1−1)
0
()
#¯¯
1−1,
which under Leibniz’s rule simply becomes
()
1
¯
=
Z max1
0
Z max−1(1−2)
0
(()Xmax (()))
µXmax
(())
1
¯
¶().
Proof of Theorems 1 and 3 (completed). The proof was started in the text and is
completed here. I focus on the case of interior demands but provide the needed adaptations
for the case of a residual commodity at the end of the proof.
Consider (13) and write this expression as () = ( ) (Xmax), with
( ) ≡Z max
1
0
Z max (())
0
(), and
(Xmax) =
Z max1
0
Z max (())
0
(),
as the numerator and the denominator respectively.
Using Lemma 1, notice that
()
¯
=
Z max1
0
Z max (())
0
(()Xmax (()))
µXmax
(())
¯
¶(),
for all 6= and with Xmax (()) taking the place of for = . Likewise
(Xmax)
¯
=
Z max1
0
Z max (())
0
(()Xmax (()))
µXmax
(())
¯
¶(),
A3
for all . The quotient rule implies
( )
¯
=
µ1
(Xmax)
¶ ∙()
¯
− () (Xmax)
¯
¸.
This last expression, upon substitution, yields
()
¯
=
Z max1
0
Z max−1((−1))
0
{ − ( )}µmax(())
(Xmax)
Xmax (())
¯
¶().
(A2)
where, as in the text, max(()) ≡ (()Xmax (())).
A compensated own-price change in only needs to be evaluated in terms of its effect
on Xmax (()). This result implies that price effects are channeled through changes in the
maximum feasible consumption. Recall that feasibility implies
Xmax (1 −1) =
−X−1
=1
µ
¶, (A3)
and that income is compensated according to = ( ). Then,
Xmax (())
¯
=1
∙µ
¶−
¸=
()−
. (A4)
Finally, substitution of (A4) into (A2) yields the own-price effect. For commodity , the
own price effect is given by
()
¯
= −Z max
1
0
Z max−1((−1))
0
©Xmax
(())− ( )ª2µmax(())
(Xmax)
1
¶().
(A5)
The main difference is that has been substituted by Xmax (()), which is a (hyper)
surface along (). Expression (A5) is consistent with the ones in the simple examples.21
The cross-partial effects follow from the appropriate derivative of Xmax (()). For in-
stance
()
¯
=
Z max1
0
Z max (())
0
(()Xmax (()))
µXmax
(())
¯
¶(),
and similarly for the denominator. Following Lemma 1, and repeating the steps just taken,
one can show that:
()
¯
=
Z max1
0
Z max−1((−1))
0
{ − ( )}µmax(())
(Xmax)
Xmax (())
¯
¶(),
(A6)
where Xmax (())
¯= (()− ). Simple substitutions yield (15).
21Notice that in the example of Section 2, the own-price effect (4) can be written as 1()1| =−{Xmax
1 − 1()}2(Xmax1 ) (Xmax
1 )1, which is the analog of (A5).
A4
The only property that deserves some further comment is the negative (semi)definiteness
of S(), which is analogous to the positive (semi)definiteness of Σ. The term S()can be written as a covariance term given by
[ ] =
Z max1
0
Z max−1((−1))
0
{ − ( )} { − ()} (())(),
where (()) is proportional to (()Xmax (())). Consider a vector and the non-
negative quadratic form [ · ( − ( ))]2 = · ( − ( ))( − ()) · . Theconditional expectation (16) satisfies E[ · ( − ( ))]2 = · Σ , which only takesnon-negative values. This is a well-known property of variance-covariance matrices; see Fisz
([23], pp. 89-90).
To verify the conditions for positive definiteness, one only needs to check that the “last”
commodity is linearly independent from the remaining − 1 commodities because theseother commodities have a joint density, i.e., their distribution is non-degenerate; see Fisz
([23], p. 90). Notice that Xmax (()) is a linear function of (). That is, for a given
realization of demands (), the maximum feasible consumption Xmax (()) is linear in
(), i.e., Xmax (()) = −()·() for all possible realizations of(); see (12). Notice,
however, that is not necessarily equal to Xmax (()). In particular, = X
max (())
for all possible realizations of () only when is selected as a residual. In such case,
() will be linearly related to ()( ) since ( ) = − () · ()( ). If is a residual, then Σ is positive semidefinite but not positive definite since · Σ = 0
holds for some 6= 0. In other words, the -dimensional distribution has a degenerated
character as the probability is not distributed over the whole -dimensional space, but is
concentrated on − 1 dimensions.Preliminaries and remarks for integrability conditions. In terms of notation,
let ( ) denote the expenditure function: ( ) ≡ min≥0{ · : () ≥ }. Thus,( )
= ∗( ) , (A7)
for = 1 and with ∗( ) as the Hicksian demand for , i.e., ∗( ) = ∗( ( )).
Integrability requires a boundedness condition in terms of the partial derivative of mean
demands with respect to income; see Katzner [35]. Boundedness is assumed to hold here
due to finite first and second moments.
Proof of Theorems 2 and 4. Symmetry and a negative definite Slutsky matrix
S( ) is necessary and sufficient for the existence of a solution for the partial differentialequation system (A7); see, e.g., Epstein [21]. These conditions hold here (Theorem 1). The
constant of integration in (A7) cannot be uniquely determined in an incomplete demand
system. (In a complete system, the budget constraint serves to determine the constant of
integration; see, e.g., Katzner ([35], Chap. 4).) In an incomplete demand system, one must
make assumptions about how to complete the system. Assuming the existence of 0 with
0 = 1 accomplishes this in the simplest possible way. See the remarks below for additional
ways to complete the system.
A5
If the “last” commodity is a residual, () becomes a standard demand system.
A negative semidefinite S() is necessary and sufficient for the solution of the previoussystem to be fully determined. These requirements hold here (Theorem 3). The utility
function can then be recovered from the expenditure function; see, e.g., Katzner ([35],
Chap. 4).
In general, an incomplete system can be completed in many ways. Let 0( 0 ) de-
note the vector of demands for commodities that complete the demand system and let
0 represent their corresponding vector of prices. If ( 0 ) = () and 0( 0 ) =
0(0 ), then there is a utility function : R++ → R such that () and 0(0 )
are the solution to max0≥0 {(Ψ(0)) : · + 0 · 0 = }, with Ψ(0) linearly homo-
geneous. Epstein [21] and LaFrance and Hanemann [39] provide additional remarks about
the integrability of incomplete demand systems.
Income effects. Before completing the proof of Corollary 2, I need to define the
notion of “positive association.” The variables random and are said to be “positively
associated” in the positive likelihood ratio dependence sense of Shaked and Shanthikumar
([54], Section 9D) or the Total Positivity order TP2 in Joe ([33], Chapter 1), if
(0 0)( ) (0 )(
0), (A8)
for 0 and 0 , when the − 2 remaining variables are fixed. Positive association
means that one is more likely to observe that and take larger values together and
smaller values together than any mixture of these. Figure 2 depicts the case of negatively
associated variables.
Proof of Corollary 2. To derive income effects (19), one simply needs to evaluate the
appropriate derivative of Xmax (()). Using the notation of the previous proofs,
()
=
Z max1
0
Z max (())
0
(()Xmax (()))
µXmax
(())
¶(),
and similarly for the denominator. Once these expressions are substituted back into the
quotient rule, one obtains (19).
As noted in the text, expression (19) can be written as E[|() ≤ Xmax() =
Xmax ] − E[|() ≤ Xmax
() ≤ Xmax ]. To characterize this difference, I focus on a
pair of commodities as this implies the general case. Let Φ( ) denote the conditional
probability distribution for a feasible value of when = , as in
Φ( ) ≡ Pr{ | ≤Xmax = } =
Z
0
( )Z max
0
(0 )0, (A9)
and define Φ( 0) for
0 similarly. Since is positive, E[| ≤ Xmax
=
] =Xmax −
Z max
0
Φ( ).
A6
Notice that E[| ≤ Xmax = 0] E[| ≤ Xmax
= ] requires
Φ( ) Φ( 0), which, from (A9), corresponds toZ
0
Z max
0
[( )(0 0)− ( 0)(
0 )]0 0.
By assumption, this expression holds when and are “positively associated” in (A8).
A7
8 Appendix B: Additional results [NOTFORPUB-LICATION]
Extended random (“dual”) representations. Mean demands that arise from individ-
uals randomly choosing their consumptions can be represented as demands that arise from
the maximization of some utility function. This sub-section provides a partial examina-
tion of the “converse” problem for two random representations. The first representation
assumes that demands are on average feasible whereas the second is individually feasible
but it restricts the support of the distribution. Consider a given consumer demand function
∗() for commodity with ∗( ) ≡ (∗1() ∗( )). Demands ∗() arehomogeneous in ( ) and satisfy ·∗() = . These are the basic properties of demand
functions in the behavioral model of choice.
(i) Average feasibility. This feasibility condition is the same as the one used by Katzner
[35] to discuss errors and shocks to individual demand functions. In particular, as in Katzner
([35], p. 161), “if [the consumer] were to choose from the same budget set many times, ‘on
average’ he would choose the utility maximizing bundle.” The goal in the first representation
is to find a distribution function ∗() such that
∗() =Z 1
0
Z
0
∗() ∗()
, (B1)
for all = 1 where ∗() ≡ ∗(1 ) and ∗() is the joint density of ∗().Notice that (B1) integrates random individual demands over a rectangular area instead of
over the triangular area given by (10).
As in the text, the derivations needed to construct the first representation rely on an
inversion formula for the right-truncated mean. Write (B1) as
∗(z)∗() =
Z
0
µ ∗(1 −1 +1 )
¶,
which on differentiation with respect to yields (∗())
∗() + ∗()(∗()) =
(∗()). As in the text, let (z) ≡ [∗()][ − ∗()]. A convenient way to
write the previous expression is ln ∗() = (). Integration yields
∗(() ) = exp
½−Z +∞
(() )
¾, (B2)
where ∗(() ) ≡ ∗(1 −1 +1 ), with the ‘constant’ of integration
determined such that ∗() is a distribution function over its support. This distributionfunction ∗() can be found by solving the simultaneous equations (B2).(ii) Individual feasibility. A second representation can be constructed using results from
convex analysis. Every point in a convex and compact set of finite dimension can be written
B1
as a convex combination of the extreme points of the set; see, e.g., Phelps ([1], p. 1). Let
be a compact convex subset of R and let denote the extreme points of . Every point
∈ can be written as a convex combination of z:
=X
=1,
where ≥ 0 andP
=1 = 1. There is a probabilistic interpretation based on Minkowski’s
integral representation; see Phelps [1]:
Theorem 5 (Minkowski) Let be a compact convex subset of R. Then, for every point
∈ , there exists a probability measure concentrated in the extreme points of such
that =R(). If is a simplex, is unique.
Proof. Let be the Dirac measure on the point , i.e., for every Borel set B, (B) =1 if ∈ B and zero otherwise. Then, () =
P
=1 = 1 and is a measure with support
{1 }. Therefore,R() =
P
=1
R() =
P
=1 = . Uniqueness can
be established generally for the case of a simplex; see Phelps ([1], Chap. 10).
To apply the previous representation it is enough to notice that the budget set (10) is
convex and compact; see Mas-Colell et al. ([45], p. 22). This means that it is possible to
construct a probability measure concentrated in that is always feasible and represents any
behavioral demand function ∗() as the mean demand from a random choice procedure
that selects bundles on the budget hyperplane. An alternative way to interpret the previous
theorem is by making reference to the coarea integral formula applied to the level set of
(10).
Random random (“dual”) representation for = 2. Consider as an example with
interior demands. Standard differentiation in (22) for ∗2(), in terms of 2, yieldsZ max1 ()
0
½[Xmax
2 (1; )− ∗2()]∗(1X
max2 (1; ))
Xmax2 (1; )
2
−∗2()
2
Z max2 (1;)
0
∗()2
)1 = 0.
One can rewrite the previous term asZ max1 ()
0
"
2log
Z max2 (1;)
0
∗()2 − 2(1; )
#1 = 0, (B3)
for 2(1; ) ≡ (Xmax2 (1; )− ∗2())
−1∗2(), so upon integrationZ max2 (1;12)
0
∗(1 2)2 = exp
½−Z ∞
2
2(1; 1 )
¾,
for a ‘constant’ of integration . By varying 2 it is possible to trace the density along the
2 dimension, while holding 1 constant.
B2
Now consider ∗1(). Using integration by parts, expression (22) can be written as
[Xmax1 ()− ∗1()]
∗(Xmax1 ()Xmax
2 (1; )) =
Z max1 ()
0
∗(1Xmax2 (1; ))1.
Differentiation with respect to 1 yields
0 = 1()−
1log ∗(Xmax
1 ()Xmax2 (1; ))
− 1
[Xmax1 ()− ∗1()]
1
∗(Xmax1 ()Xmax
2 (1; ))
Z max1 ()
0
1 ∗(1X
max2 (1; ))1,
for
1() ≡ ∗1()1
1
Xmax1 ()− ∗1()
.
This equation is more complex than (B3) due to the fact that as 1 varies, bothXmax1 ()
and Xmax2 (1; ) change. If there is a solution, one can then trace the cumulative density
∗() over its support as the extreme points (1 2) vary. The constant of integration mustbe then such that the density integrates to one over its support.
Simulation exercises. A key feature in all previous derivations is that mean demand
curves are defined from the aggregation of individual choices. In here, I explore some
simple simulation exercises whose purpose is to determine how ‘large’ the economy needs
to be in order to observe consistent results. I consider a simple uniform distribution and a
multivariate log-normal distribution.
(i) Uniform distribution. Assume two goods and non-interior choices. This is the setting
used in the simple example of Section 2. Assume also that 2 and are constant throughout.
At a fixed price level 1 let 1 , = 1 be an i.i.d. sequence of uniform random
variables on [0 1]. Each represents an individual realization of demand for good 1.
The mean demand is:
1 = −1
X=1
1 ,
which, by the strong law of large numbers, satisfies 1 → 1(). Moreover, when prices 1
change, one can trace an uncompensated demand curve whose elasticity should be ε = −1.In the following simulations = 1 and 1 varies from 1 to 2. The results consider two
incremental steps. The first is 0005 and the second 005. This means that each individual
has 200 realizations of demand in the first case and 20 realizations in the second case. Mean
demand is computed for different values of that range from = 1 to = 1 000. In each
sample, and for each value of , a log-log linear regression estimates the uncompensated
elasticity of the demand curve. The number of cross-samples is 500.
Table B1 shows that ‘individual’ demands are on average negatively sloped with an
elasticity consistent with the predicted pattern. The estimates of the elasticity, however,
are unreliable when only one individual realization is considered and the sample variation
in prices is small. In fact, the elasticity cannot be statistically distinguished from zero in
B3
this case. Further, the standard deviation across and between samples is 1.090 and 0.954
respectively. Both estimates suggest that some individuals have a positively-sloped demand
curve. Finally, the goodness of fit from the R2 is, on average, about 5 percent. Thus, overall,
individual demands are not consistently determined.
Table B1. Simulation results for uniform distribution.
Number of individuals aggregated
= 1 5 10 25 50 100 500 1 000
A. Grid size for 1 of 200 sample points
ε −1.0086 −1.0095 −1.0061 −1.0005 −1.0011 −1.0015 −1.0007 −1.0001std.err. (0.349) (0.100) (0.067) (0.041) (0.029) (0.020) (0.009) (0.006)
std.dev. [0.345] [0.102] [0.067] [0.040] [0.029] [0.020] [0.008] [0.006]
R2 0.0396 0.3343 0.5242 0.7423 0.8538 0.9219 0.9834 0.9917
[0.027] [0.054] [0.043] [0.026] [0.014] [0.008] [0.001] [0.0008]
B. Grid size for 1 of 20 sample points
ε −1.101 −0.9901 −0.9966 −0.9977 −0.9992 −1.0013 −0.9998 −1.0002std.err. (0.954) (0.276) (0.188) (0.118) (0.083) (0.058) (0.026) (0.018)
std.dev. [1.090] [0.297] [0.208] [0.125] [0.085] [0.061] [0.027] [0.018]
R2 0.0562 0.3586 0.5497 0.7633 0.8672 0.9316 0.9857 0.9928
[0.109] [0.160] [0.146] [0.075] [0.043] [0.022] [0.004] [0.002]
Note: The number of cross-samples is 500. The elasticities are estimated using a linear fit to
log[1 ] and log[1]. The average value of the standard errors across samples is in parentheses.
The cross-sample standard deviation for the estimate of the elasticity of demand and the R2 is in
brackets.
Consider the two cases in which = 5. In these cases, the goodness of fit increase to
over 30 percent and the estimates of the elasticity of demand become (statistically) close
to ε = −1. When the sample variation in prices is 200, and = 10, the across sample
standard deviation for the elasticity of demand is about 0.06. The goodness of fit and
the statistical significance of the estimates also suggest that average demands are precisely
estimated. When only 20 sample points for 1 are considered, a similar conclusion follows
if the aggregation takes place for 50 individuals. As expected, with = 500 or = 1000,
the goodness of fit is well over 98 percent and the elasticity is precisely estimated up to
three digits.
(ii) Multivariate log-normal distribution. Consider next three goods and non-interior
choices. Prices are 1, 2, and 3. Prices 2 = 1 and 3 = 1, and = 3 are constant
throughout. 1 varies from 1 to 2. Let (1
2
3), = 1 be an i.i.d. sequence
of log-normal random variables on (). Each represents an individual realization
of demand for the three goods 123. The draws are from a log-normal distribution with
mean [−05−05−05], variances [2 2 2] and with positive pairwise correlations betweenall variables of 05.
In the following simulations 1 varies in incremental steps of 005, so there are 20 sample
points. Mean demand is computed for different values of that range from = 1 to
= 10 000. In each sample, and for each value of , a log-log linear regression estimates
the compensated elasticity of the demand curve, ε. This elasticity should be negative,
B4
although there are no specific numerical values suggested by the theory. Again, the number
of cross-samples is 500.
Table B2 shows again that ‘individual’ demands are negatively sloped but unreliable.
As the number of individuals aggregated increases, mean demands become better behaved
especially when more than 100 individuals are considered. The goodness of fit from the
R2 increases and the precision of the estimated elasticity of demand also increases. The
compensated elasticity is negative.
Table B2. Simulation results for multivariate log-normal distribution.
Number of individuals aggregated
= 1 10 100 1 000 10 000
ε −0.0965 −0.2325 −0.2598 −0.2524 −0.2541std.err. (0.723) (0.363) (0.107) (0.0384) (0.022)
std.dev. [2.301] [0.614] [0.184] [0.056] [0.018]
R2 0.0813 0.0869 0.222 0.656 0.858
[0.162] [0.152] [0.213] [0.134] [0.036]
Note: The number of cross-samples is 500. The elasticities are estimated using a linear fit
regression. The average value of the standard errors across samples is in parentheses. The cross-
sample standard deviation for the estimate of the elasticity of demand and the R2 is in brackets.
In conclusion, Tables B1 and B2 show that consistent mean demand curves do not
require an excessively large number of individuals aggregated; this is specially true with
enough price variation. That is, while the theoretical analyses rely on a large number of
individuals, numerical approximations with fewer individuals still give support to the main
theoretical conclusions of the analysis.
References
[1] Phelps, R.P. (2001) Lectures on Choquet’s Theorem, Berlin: Springer.
B5