characterization of moment points in terms of christoffel numbers

19
CHARACTERIZATION OF MOMENT POINTS IN TERMS OF CHRISTOFFEL NUMBERS* By SAMUEL KARLIN AND LARRY SCHUMAKER in Stanford, California, U.S.A. w Introduction The setting for this paper is the theory of generalized moment spaces induced by a system of functions. The theory of moment spaces has served crucially as a tool in carrying out certain statistical analysis and as a format for various probabilistic studies. Aspects of moment spaces are also of value in dealing with boundary value problems associated with Sturm-Liouville differential operators and, more generally, in developing canonical representations of operators in Hilbert spaces whose spectra possess simple multiplicity. Various extremal problems can be expeditiously analyzed using the geometry of moment spaces. Applications in the theory of interpolation and approx- imation are very familiar, see e.g., Krein [-4] and Karlin and Studden [-3]. Krein [-4] set forth the structure of generalized moment spaces induced by a Tchebycheff system (see Section 3) while independently Karlin and Shapley [,2] elaborated the geometry of reduced moment spaces generated by the special system {1, t, ..., tn}. The geometry of generalized moment spaces is further developed in Karlin and Studden [-3] which also includes an ex- tensive bibliography of the subject. In order to state the objectives and results of this paper we first need to review some facts concerning the reduced moment spaces induced by the system of functions {1,t,...,t ~} on [,0,1]. (The choice of 1-0,1] is no restric- tion; any other closed bounded interval [,a, b] would do as well.) The follow- ing definitions and results are relevant, see [-3, Ch. 2] for detailed discussions and proofs. * Research supported in part under contracts NONR 225(28) and NSF GP 2487 at Stanford University, Stanford, California. 213

Upload: samuel-karlin

Post on 22-Aug-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

CHARACTERIZATION OF MOMENT POINTS

IN TERMS OF CHRISTOFFEL NUMBERS*

By SAMUEL KARLIN AND LARRY SCHUMAKER

in Stanford, California, U.S.A.

w Introduction

The setting for this paper is the theory of generalized moment spaces induced

by a system of functions. The theory of moment spaces has served crucially

as a tool in carrying out certain statistical analysis and as a format for various

probabilistic studies. Aspects of moment spaces are also of value in dealing

with boundary value problems associated with Sturm-Liouville differential

operators and, more generally, in developing canonical representations of

operators in Hilbert spaces whose spectra possess simple multiplicity.

Various extremal problems can be expeditiously analyzed using the geometry

of moment spaces. Applications in the theory of interpolation and approx-

imation are very familiar, see e.g., Krein [-4] and Karlin and Studden [-3].

Krein [-4] set forth the structure of generalized moment spaces induced

by a Tchebycheff system (see Section 3) while independently Karlin and

Shapley [,2] elaborated the geometry of reduced moment spaces generated

by the special system {1, t, ..., tn}. The geometry of generalized moment spaces

is further developed in Karlin and Studden [-3] which also includes an ex-

tensive bibliography of the subject.

In order to state the objectives and results of this paper we first need to

review some facts concerning the reduced moment spaces induced by the

system of functions {1,t, . . . , t ~} on [,0,1]. (The choice of 1-0,1] is no restric-

tion; any other closed bounded interval [,a, b] would do as well.) The follow-

ing definitions and results are relevant, see [-3, Ch. 2] for detailed discussions

and proofs.

* Research supported in part under contracts NONR 225(28) and NSF GP 2487 at Stanford University, Stanford, California.

213

214 SAMUEL K A R L I N and LARRY SCHUMAKER

The reduced moment space IIL,+ 1 is the set of points

(1.1)

1

I I [ , + l = { c = ( c o , C l , . . . , c , ) e E " + l [ c i = f tidu(t), i - - 0 , 1 , . . . , n

0

where u(t) traverses the set of all non-decreasing right continuous functions

of bounded variation. The moment space ll[,+l is manifestly a closed convex

cone in E "+1 not contained in an n-dimensional hyperplane of E "+~ . More-

over, the cone nl,+ 1 is the convex conical hull of the curve C,+ i in E "+l,

whose expression in parametric form is

(1.2) C,+ 1 = {c(t) = (1,t, ...,t") [0 -< t <: 1}.

In view of the characterization of 1~,+~ as the convex conical hull of C,,+l,

a classical convexity theorem due to Carath6odory asserts that every vector

c e Ill,,+ 1 admits a representation of the form

p

(1.3) c i = ~2 2j.t~, i = 0 , 1 , . . . , n , O < t l < t z < . . . < t a < l j = l

where p < n + 2 and 2j > 0, j = 1 ,2 , . . . , p .

Much of the structure of the interior and boundary of nl,+1 as well as

the nature of the extreme rays of n~,,+ 1 is discernible by studying the proper-

ties of representations of moment points of the form (1.3). A useful basis for

classifying points of nl ,+l is the concept of the index value. The index I(c) of a moment point c e 111,+~ is defined to be the minimal number of points

of C,+ a which span c (i.e., are involved in a representation of c of the form

(1.3)) with the special convention that the two end points (1,0, .- . ,0) and

(1,1,. . . , 1) of C,+ 1 are counted as half points while the point (1, t , . . . , t") ~ C,+ 1 receives a full count for any t satisfying 0 < t < 1.

With this definition, the boundary of ILL,+ 1 (abbreviated Bd ll[,+l) admits

(n + 1) the following simple characterization: c e Bd 11~, + 1 if and only if I(c) <

Moreover, every boundary point of lll,+~ possesses a unique representation

(n + 1) of type (1.3) with p <

CHARACTERIZATION OF MOMENT POINTS 215

The interior of ll~n+ 1 may also be studied in terms of the representations

(1.3). The values of {ti} f (which we always assume to be arranged in in-

creasing order) involved in a representation are called the roots of the re-

presentation. The increasing function a(t) which exhibits points of increase

only at the {ti} p and jumps of magnitude 2j at tj is referred to as the asso-

ciated measure. By the index of the set of roots {t~,...,tp} we will mean the

number obtained by counting interior roots as one and the end points 0 and

1 as one-half. We sometimes will refer to the index of a measure which is

to be interpreted as the index of the set of roots belonging to the measure.

For points in the interior of lll~§ 1 (abbreviated Int lll~+ 1) there exist pre-

cisely two representations with index (n + 1)/2. These are called the principal

representations. One of them involves the point 1 (i.e., has a root at 1) and

is called the upper pr incipal representation while the other does not involve 1

and is called the lower principal representation.

A section of I ~ + 1 is any subset S of I1~+1 with the property that if c~ llln+t and c ~ 0 then there exists a unique 2 > 0 such that 2 c e S . Sec-

tions of 11l~+ 1 are generated by specifying a strictly positive polynomial

p(t) = ~2 air ~ on [0,1] with the correspondence determined as follows: The i = 0

set of all c e Ill~+i satisfying ~2 aic i = 1 comprises a section. We denote by i = 0

M" the section of 111~+~ corresponding to the special positive polynomial

p~ = 1. The section M n is equivalently defined by the condition that the

moment point c =(Co,Cl , . . . ,c , ) satisfies Co = 1. Henceforth we confine at-

tention to the section M ~ unless stated explicitly to the contrary.

In all discussions concerning Int M ~ it is necessary to separate the cases of

n odd or even.

Case 1 ( n = 2 m ) . Let C=(1,Cl,C2,'",C2m ) be an interior point of M n

i.e., c ~ Int ll~n+l and c o = 1. This entails the existence of a probability mea-

sure tr on [0,1] whose first n + 1 moments are

1

c i = f t~da(t) i = 0 , 1 , 2 , . - . , n

0

216 SAMUEL KARLIN and LARRY SCHUMAKER

where a ( t ) possesses at least m + 1 points of increase. As stated previously,

associated with c are two principal representations which we describe in terms

of their roots and weights by the symbols {tj; ~-,jjl~m+~ and {sj;/~j}lm+l . Here

tj and sj are the roots of the lower and upper representations, respectively,

and 2j and pj denote the corresponding weights. The explicit expression of

c in terms of these representations is given by

m + l m + l i (1.4) c i = ~2~ 2 j t ~ = ~ l l j S j , i = 0 ,1 ,2 , . . . ,n .

j = l j = l

It is convenient for later reference to display the roots and weights of the

two principal representations as follows:

( 1 . 5 ) 0 = t I < t 2 < . . . < tin+ 1

2 1 , 2 2 , " " , • m + l

(roots of lower principal representation)

(weights)

(1.6) s~ < s z < . . . < Sin+ 1 = 1 (roots of upper principal representation)

I~1, P2 , " " , Pm + 1 (weights).

The pairs of roots and weights associated with a given moment point c are

not arbitrary but satisfy certain relations. Indeed, it is established in Cor-

ollary 11.3.1 of [3] that the roots of the principal representations strictly

interlace in the manner

(1.7) 0 = t 1 < s z < t 2 < 5 2 < . . . < tm+ 1 < Sm+ 1 : 1 .

Furthermore, Theorem III.2.1 of [3] asserts that the weights interlock in

the sense that the inequalities

k k k + l

(1.8) 0 < 12 2j < ~ #j < ]~ ;tj j = l j = l j = l

m + l m + l

12 2j = ]~ ~j = 1 j = l j = l

k = l , 2 , . . . , m

hold.

C H A R A C T E R I Z A T I O N OF MOMENT POINTS 217

f~ "l.m + 1 The roots toj~2 may be calculated as the zeros of the mth orthogonal

polynomial Pro(t) with respect to the measure tda(t). Furthermore, the roots

{sj}] are the zeros of Qm(t) which is the m th orthogonal polynomial with

respect to the measure (1-t)dtr(t). The associated weights {2j} and {#j}

appear as the multipliers in the quadrature formulas approximating integrals

of functions with respect to the measures tda(t) and ( 1 - t)da(t), respectively.

They are commonly called Christoffel-Cotes numbers (see Szeg(5 1-5] page 47)

and bear much importance in the theory of approximation and interpolation.

Case 2. (n = 2m + 1). Let c = ( 1 , c l , c 2 , "",C2,n+l) be an interior point of

M". In this case the roots and weights of the two principal representations

can be displayed in the form

(1.9)

and

(1.10)

0 < tt < t2 < "-" < tm+l < 1

~ 1 , / ] ' 2 , " " , ~'m+ 1

0 ~--~- S 1 <( S 2 <~ " '" < S m + 1 <~Sm+ 2 ~ 1

] / 1 , ] 2 2 , ' " , , / / m + l , ~ / m + 2

(lower principal representation)

(upper principal representation)

and the roots strictly interlace in the manner

(1.11) 0 = S 1 < t 1 < S 2 • t 2 < " ' " < tin+ 1 < S m + 2 : 1,

while the weights satisfy the inequalities

(1.12)

k k k + l

0 <~2 /~j < ~ 2j < ]~ ~j k = l , 2 , . . . , m j = l j = l j = l

m + l m + l m + 2

]E ~j < ~ 2 j = ~ ~ j = l . j = t j = l j = l

In this paper we investigate the problem of ascertaining for a given set

of roots and/or weights the existence and uniqueness of a moment point in

Int M" whose principal representations involve the prescribed data.

218 SAMUEL KARLIN and LARRY SCHUMAKER

We discuss the results for the even case n = 2m; the odd case is analogous.

It is a trivial observation that when a set of roots and corresponding weights, m + l

. m + l say {tj,2j} 1 with ~2 2 j = l are prescribed as in (1.5), the point j = l

c = (1,q,.. . ,CZm) explicitly defined by (1.4) is a moment point belonging to

IntM". The fact that c actually lies in the interior of M" is a consequence

of the fact that the representation which determines c has index (m + 1)/2 while

any boundary point of M n necessarily possesses a unique representation of

index at most n/2 = m. It further follows from the general theory that the

roots and weights of the other principal representation of c symbolized by

{sj;/~j}~ '+1 are uniquely determined and the pair of roots {tj} and {s j} enjoy

property (1.7) while the corresponding sets of weights satisfy (1.8).

A less elementary result is that if we prescribe two sets of roots t l m + l j~l and {sj}'~ "+1 then property (1.7) is necessary and sufficient for the

existence of a unique moment point c ~ [n tM" whose principal representations

involve the given sets of roots. We state this fact formally as follows:

T h e o r e m 1. (n = 2 m ) , Suppose two sets o f points {tj}~ '+1 and {sj}• +1

are prescribed satisfying (1.7). Then there exists a unique moment point

c e I n t M " such that the lower and upper principal representations of c in-

volve the roots {t~}~ n+l and ~ ~m+l (si~l , respectively. A similar statement is

valid for n = 2m + 1.

The proof of Theorem 1 is rather elementary and amounts to showing

that a certain system of homogeneous linear equations possesses a solution

in the positive orthant.

Consider next the situation that two sets of weights {2j}] '+1 and {pj}]'+~

are prescribed for which 2j > 0 and pj > 0, j = 1,2, .- . ,m + 1 and where m + l m + l

]E 2 i = 1 and ~E #i = 1. We inquire as to what conditions guarantee the j = l j = l

existence of a moment point in Int M" whose principal representations involve

the given weights. As pointed out previously a necessary condition is that

the sets of numbers {2j}~ +1 and r ,m+x l P j h satisfy the inequalities (1.8). It is a remarkable fact that the relations (1.8) also provide a sufficient condition.

Precisely, we have

CHARACTERIZATION OF MOMENT POINTS 219

Theorem 2. (n = 2m). Let (2j)'~ +1 and {/~),~+1 be positive numbers sa-

tisfying (1.8). Then there exists a unique moment point c ~ I n t M" such that r ,,n+l m+l (#j~l and {2i} x are the weights of the upper and lower principal

representations of c, respectively. A similar statement is valid for n odd.

The proof of Theorem 2 seems to be inherently intricate and delicate. It

amounts to establishing the existence and uniqueness of a solution to a certain

non-linear system of equations. The special nature of the equations is de-

cisively exploited in the analysis. We also make use of the theory of canonical

representations (see Section 2) which play a fundamental role in many aspects

of the theory of moment spaces.

The proofs of Theorems 1 and 2 are developed in Section 2. The extensions

of the above results to the case of moment spaces generated by Tchebycheff

systems of functions is described in Section 3.

w P r o o f s of T h e o r e m s 1 and 2.

We begin with the proof of Theorem 1 for the case n = 2m. The proof in

the odd case is similar.

P r o o f of T h e o r e m 1.

:t l -m+1 U r ~ m + l L e t {rk}] +2 = ( j J1 lSj)l arranged in increasing order. Consider the

system of n + 1 equations

n+2

(2.1) ~2 akrfk = 0 i = O , 1,- . . ,n k = l

in the n + 2 variables {a~}] +2. The matrix ]] r~ ]] of (2.1) has rank n + 1 and

hence the components of the unique solution of (2.1), apart from a multi-

plicative constant, are proportional to the cofactors obtained by successively

striking out a single column. The appropriate minors are Vandermonde deter-

minants evaluated at a set of n + 1 of the points {rk}] +2 and so are pos-

itive since the {rk} occur in increasing order. Thus the ak strictly alternate

in sign.

Since the {s j} and {ti} interlace as in (1.7) we may rewrite (2.1) in the form

220 SAMUEL KARLIN and LARRY SCHUMAKER

m + l m + l

i = 0 , 1 , . . . , n (2.2) )-2, 2it) = ~ Iris j , j=l j=l

where now 2 j = a z j _ 1 > 0 and / ~ j = - a z j > 0 , j = l , 2 , . . . , m + l . Setting

the components of c equal to the values of (2.2), and fixing the multiplying m + l m + l

constant so that Co = Y~ 2j = ~ /t j = 1, we obtain a moment point j=l j=l

c~ IntM" with the desired principal representations. It is clear that c is actually

in the interior of M" since a boundary point of M" has a unique representation

involving at most p < m + 1 points with index value < n / 2 while the

expression in (2.2) indicates that c has index (n + 1)/2.

The remainder of this section is devoted to the proof of Theorem 2. To

this end it is first essential to introduce additional terminology and review

several other facts concerning moment spaces described more fully in [3].

A representation of an interior point c ~ I n t 11l,+1 is called canonical if

I(c) < n +____.22 For any c ~ Int 11l,+ 1 and any prescribed t* ~ [0,1] there exists = 2 "

a canonical representation of c involving the root t*. The roots of canonical

representations also satisfy certain interlacing properties. Indeed, suppose

that a and a* are the measures associated with two different representations

of c e lnt 11~,+1 and suppose a is canonical. Then for every pair of interior

roots t j , tj+ 1 of a there exists a point of increase of a* in (t~, ti+ 1). This fact

implies, in particular, that no two canonical representations of an interior

point of 111,+ 1 may share an interior root. For the proof of the above as-

sertion we refer to Karlin and Studden [3, Chapter 2].

We are now prepared to establish the main result of the paper.

P r o o f o f T h e o r e m 2.

The proof will be elaborated in a series of lemmas. The first group of

lemmas establish the existence of a moment point with the desired principal

representations. The remaining lemmas provide the demonstration of the

uniqueness assertion.

We proceed by induction. Assuming that the theorem is proved for n = 2m

CHARACTERIZATION OF MOMENT POINTS 221

we wish to show its validity for n = 2m + 1. The inductive step proceeding

from n = 2 m + l to n = 2 m + 2 is carried out in a similar manner while

the initial step corresponding to n = 1 is trivial.

The confirmation of the existence of a moment point in In tM"(n = 2m + 1)

possessing lower and upper principal representations with weights J'2 ~m+ t jj1 and

{#~}~+z is equivalent to establishing the existence of points {tj}~ +1 and

{sj}~ '+2 interlacing in the manner

(2.3) 0 = s t < tl < "'" < sm+t < tm+t < Sin+2 = 1

and satisfying the non-linear system of equations

m + l m+2

(2.4) d t = • 2 j t J - Z /zfl~ l = l , 2 , . . . , n . j = l j = t

m + l

= 21t~ + ~ ( - # f i ~ + a j t ~ ) - / ~ m + 2 = 0 j=2

We will show that a solution of (2.4) possessing property (2.3) exists by emp-

loying a continuity method.

Consider the open simplex A in E"-1 defined by

A : {7 = (~)1,~2, "" ,~2m)] 0 < 71 < ~)2 '<~ "'" < ~)2m < 1} .

It is convenient to also label the components of a point ~ of A in an alternate

manner. Thus we will frequently use the identification

~1 : ( ~ l , 7 2 , ' " , Y 2 m ) = (s2, t 2 , ' " ' S m + l ' t m + l ) " We now define a mapping f of A

into E n-~ with components

m + l

( 2 . 5 ) f t (~)=f l ( s2 , t2,"' ,Sm+l,tm+l) ~ I t -- (--/~fly q- 2jtj) - / lm+2, j=2

l = 1 , 2 , . . . , n - 1 .

The mapping f is locally 1:1 on A. In fact an explicit computation of the

Jacobian of the transformation yields

222 SAMUEL K A R L I N and LARRY SCHUMAKER

(2.6) J ( f l , f 2 , ' " , f . -2 , f . -1 )

\82, t2~ "" ",Sm+ 1, tin+ 1 /

m + l (O,l,..., n-3, n-2) (n-a)! 1-I (-'~j#j)

= j = 2 V \ s 2 ~ t 2 , . . . , Sm+l , tm+ 1

where V denotes the Vandermonde determinant

( 0, 1, . . . , p ~ = det l[ ~J II~.Joo" V \~o, ~ , "", ~ /

Since the 2j and #i are by stipulation strictly positive numbers and the Vander-

monde determinant is well known to be non-zero when evaluated at distinct

points, we see that the Jacobian never vanishes on A and therefore the map-

ping f is locally homeomorphic. Moreover, f (from direct inspection) and

the inverse mapping f - 1 which is well defined locally by virtue of the implicit

function theorem, see Apostol [1, p. 147], possess continuous partial deri-

vatives of all orders. Viewed globally, f - 1 could conceivably be multivalued

although we will later prove t h a t f -1 is actually 1 : 1 in thelarge (i.e., 1:i on

the range f(A)).

Our first lemma shows the existence of a point in A which is mapped into

zero by f .

L e m m a 1. There exists a point ?o in A such that f(?o) = O.

ProoL

If we set t t = 0 in (2.4), examination of the first n - 1 equations in terms of

the variables s2, t2,"',Sm+ 1, tin+ 1 reveals that they are precisely the equations

corresponding to the existence of a moment point in In tM "-1 (n - 1 = 2m)

with the weights of its lower and upper principal representations prescribed as

(2.7)

1, 1 - /~1 ' ' 1 - Pl) (lower)

~2 ~3 ~m+2 / 1 p l ' 1 - px' ' 1 ~ p l j (upper).

C H A R A C T E R I Z A T I O N OF MOMENT POINTS 223

These sets of weights clearly satisfy the restrictions (1.8) for the case n = 2m

and hence invoking the induction hypothesis we obtain a solution of the

first n - 1 equations of (2.4) (with tl = 0) which we designate by {t~ +x and c 0 ~ m + l 0 0 0 0 0 �9 ? = (s2,t2,...,sm+l,t,~+O is in the simplex A since the ~sj 12 The point

roots of the principal representations of a moment point in M n-1 inter-

lock in the manner

(2.8) 0 0 0 < s ~ 1 7 6 < Sm+l<tra+l<l .

An equivalent expression of this fact (cf. (2.4) for t~ = 0 and (2.5)) is that

the point ?o of A is mapped into zero by the func t ionf . The proof of Lem-

ma 1 is complete.

Appealing to the implicit function theorem we deduce the existence of

functions {tj(t)}~ '+l and {sj(t)}7 +1 of class C~(-e,e) such that

(2.9) ~(t) = (s2(t), t2(t ) ,'--,Sm+ l ( t ) , tin+ l(t))

is a point of A and

(2.1o) .fz(s2(t), t2(t), ...,sin+ 1(0, t,,+ 1(0) = - 2it z,

1 = 1 , 2 , . . . , n - 1

for all t 6 ( - e , e ) where e is positive and sufficiently small. Of course,

~(0) = ~o. The relations (2.10) may be written out explicitly as

(2.~1) m + !

at(t) = 21[h(t)] t + ~, (-pj[s2(t)] t + 2j[tj(t)] t) -/2m+2 = 0 j = 2

I = 1 , 2 , - . . , n - 1

where we choose t l ( t )= t. We underscore an important property of the

components of (2.9) in the next lemma.

L e m m a 2. The functions {tj(t)}~ '+1 and (sj(t)}~ +1 appearing in (2.9)

are monotone increasing for t 6 ( - 8 , e ) .

224 SAMUEL KARLIN and LARRY SCHUMAKER

P r o o L

Differentiating (2.11) with respect to t yields

(2.12) 21t~ -~ dtl(t) m+l {_p.sl._ 1 dsj(t) dtj(t)t at + ~ ~ ~ ~ --,t-i-- + ~jtj-~ dt / = 0 . /=2

I=1,2, . . . ,n-1.

The equations (2.12) may be regarded as a system of n - 1 homogeneous

equations in the n unknowns {2flj)~ +1 and " ,~m+l , , ~--lljSj)2 where tj and sj are

shorthands for dtj(t)/dt and dsj(t)/dt respectively. The matrix of the system

has rank n - 1 and hence

(2.13) 2fl~ = xV ( 0'1'2' . . . , n - 3 , n - 2 )

\ t l , s 2 , t 2, ""~Sj, Sj+ I, "'%Sin+l, tm+ l l

j = 1 ,2 , . . . ,m + 1.

,1, . . . , n - 3 , n - 2 ) Pis~

xV\q,s2, "',tj-l,tj,-",sin + 1, tin+ a

j = 2 ,3 , . . . ,m + 1

where ~c is a fixed constant. Since the point ?(t) of (2.9) is contained in A the

determinants occurring in (2.13) are all strictly positive. But by choice t] _--I

and by stipulation 21 > 0 from which we infer that tc > 0, and thus t~. > 0 r

and sj > 0 ( j = 2 ,3 , . . . ,m + 1).

The relations (2.11) are satisfied also when t = e if {tj(e)}~ '+t and

{s/(e))~ '+1 are chosen suitably. In fact, consider a sequence e, increasing to e.

Lemma 2 tells us that the components of (2.9) increase to finite limits; viz.

t j(8,)--, t / 0 (2.14) j = 2,3,. . . ,m + l

sj(~r) --, sA0

or what is the same

(2.15) r(8,) --, r(,)

CHARACTERIZATION OF MOMENT POINTS 225

where ?(e) = (Sz(e), t2(e),..., St, + l(e), tm+ t(e)) and the equations (2.11) hold for

t = e. By continuity we infer, since ?(e,) ~ A for all r , that

s2(0 ) < s2(e ) = < t2(e ) = < ... = < sm+t(e ) = < tm+l(e ) = < 1

i.e., ?(z) is contained in the closure of A which we denote by A. The point

y(e) is either in A or lies on the boundary of A. It is on the boundary of

A only if at least two successive components of ?(e) are equal and/or

if ~,~_l(e) = tm+l(e) = 1. The next lemma shows that the first contingency

is impossible, i.e., if ?(e) as constructed above belongs to BdA then necessarily

0 < e < ?2(8) < 73(~) < "'" < ?,-2(e) < ?,-x(e) = 1.

L e m m a 3. Let ?(t) be defined as above for t e [0,e]. Then

O < t < s 2 ( t ) < t 2 ( t ) < ' " < s , ~ + t ( t ) < t m + l ( t ) < = l for O<_t<=e.

Proof.

For each t E [0, e] the equations (2.11) may be interpreted as asserting that

(tj(t)}'~ +l and m+2 {sj(t)} t , (st(t)-- O,sm+2(t)-- 1) qualify as the roots of two

distinct canonical representations of some moment point c(t)~ In tM ~-1. As

noted above, however, two canonical representations may not share an interior

root and hence the t / s and s /s are distinct. Since the points are in the in-

dicated order for t = 0, invoking the continuity we see that they remain in

this order for all t~ [0,e]. Further, since 0 < s2(0) and S2(t ) is strictly

monotone increasing, t < SE(t ) if e is sufficiently small.

We now extend the definition of ?(t) up to the "upper face" of the bound-

ary o f A (BdA) defined by ?2,, = 1.

L e m m a 4. There exists a point t in (0,1) and a curve ?(t) of the form

(2.9) defined on [0 , [ ] whose components are continuously differentiable

such that the relations (2.11) hold for all 0 <- t <_ t. Furthermore, ?( t )sA

for 0 <_ t < t while ?(~) is in the "upper face" of Bd A, i.e., ?z,,(~) = tm+ t(t) = 1

and 0 < t -- ?t(t) < ? 2 ( l ) • -.. < ? 2 m _ t ( t ) < 72ra(t) = 1 .

226 SAMUEL KARLIN and LARRY SCHUMAKER

Proof.

The required y(t) is determined on [0,~] by the preceding analysis. Lemma

3 tells us that ?(e) is either in the open simplex A or in the "upper face" of

Bd A. In the latter circumstance the proof of the lemma is complete. On the

other hand, if t , ,+l(e)< 1 then 7(e)~A and the implicit function theorem

applies at ~ with initial point ?(e). The theorem affirms the existence of a

unique curve ?(t) possessing components of continuity class C t defined on

an open neighborhood ( e - 6 , ~ + 6) of the point e with 6 > 0, such that

7(t)~A and (2.11) holds. Owing to the local homeomorphic character of f

we infer that ?(t) is of continuity class C 1 on a - 6 < t < e + 6. Lemma 2

continues to apply implying that the components of 7 are monotone increas-

ing. Employing a limiting argument as previously we may construct the point

7(e + 6). Lemma 3 now applies on [e,e + 6] which shows that 7(e + 6) either

belongs to A or to the "upper face" of BdA. This process of extension of

7(0 can be repeated on increasing t intervals. It necessarily ends for some

f < 1, however, since by Lemma 3, t < t,,+~(t) so that for some f in (0,1)

the point 7(0 lies on the "upper face" of BdA.

The existence part of Theorem 2 will be complete if we show that

m + l ra+2

(2.16) d,(t) = • 2j[ti(t)]" -- ~E pi[sj(t)]" j = l j = l

changes sign as t traverses the interval [0, Q since d,(t) is clearly a continuous

function of t. This is the intent of the next lemma.

Lemma 5. Let d.(t) be defined as in (2.16). Then d.(O)<0 and

d.(t) > O.

ProoL

To compute the sign of d,(0) we regard (2.11) and (2.16) for t = 0 as n

linear equations in the n variables whose solutions are the constants (2j]~ n+l

and {-# i}~ +z (recalling that h ( 0 ) = 0). Solving by Cramer's rule for -P2

yields

CHARACTERIZATION OF MOMENT POINTS 227

1, 2, . . . , n - l , n )

-/a2V \sz(O),t2(O),...,tm+ l(O),Sm+2(O)

A, 2, . . . ,n-2 , n - 1 x = d.(0)(- 1)"-~v( | .

t 2 (0), S 3 (0), "", t m + l(0),S m + 2 ( 0 ) 1

The determinants are strictly positive because of (2.8) and hence d,(0)< 0

since /z z > 0 and n is odd.

To compute d,(0 we regard (2.11) and (2.16) for t = t as n equations in

n variables with the solution given by the constants {2i}~', (-ILj}~ +l and

(2m+1--/tin+2) (noting that tin+l(0= Sin+2(?)= 1). Solving for hi yields

/1, 2, . . . , n - l , n )

~l V~q( O,S2( O,'",Sm+ l( O, tm+ l( O

1, 2, . . . , n -2 , n - 1 1

= dn(O(- 1)"-1 V \S2(O, t2(O,...,Sm+t(O, tm+l(~)]

where as before the determinants being Vandermondian are strictly positive

while 21 > 0 and n is odd implying that d,(?) > 0.

Our next task is to establish the uniqueness part of Theorem 2. The first

lemma proves that the point 7(0 =(Sz(t),t2(t),'",Sm+l(t),tm+i(t)) in A such

that (2.11) holds is uniquely determined.

Lemma 6. Suppose O< t 1 < 1 and let 7(tl) and ~(ti) be two points

of A such that (2.11) is satisfied. Then y(t l )= ~(ti).

P r o o L

The proof is by reductio ad absurdum. Suppose that V(tl) is distinct from

~7(ti). Applying the implicit function theorem at the point tl with the two

different initial points y(ti) and ~(tl) and repeating the process as carried out

in Lemmas 2 and 4 we obtain two distinct curves v(t) and ~7(t) in A defined on

[0,tl] whose components are all monotone increasing. Now if y(0)= ~(0)

we violate the fact that the mapping f from A into E "-I defined in (2.5)

is locally 1:1 since ~(0 and ~(t) describe distinct curves.

228 SAMUEL KARLIN and LARRY SCHUMAKER

On the other hand, if yr(0) # 7r(0) for some r, 1 < r < n, we have produced

two distinct solutions (namely ?(0) and ~(0)) to the problem in the moment

space Int M"-1 with prescribed weights (2.7). This is incompatible with the

uniqueness assertion of the inductive hypothesis. Thus in either circumstance

we have a contradiction thereby implying the result of the lemma.

We need the following additional lemma.

L e m m a 7. Let 7(t) be a curve o f the form (2.9) in A such that (2.11)

holds. Then the function dn(t) defined in (2.16) is strictly increasing, i.e.,

d',(t) > O.

P r o o L

Consider the system of equations (2.12) coupled with the equation

m+ 1 d'(t) n- 1 , n- I , , - 1 , = n21t 1 t x + n 2jtj t j). ( - - f l j S j Sj JF

j = 2

Solving for 21t~ = 21 by Cramer's rule yields

( D"-~d'(+" ~ , 1 , ' " , n - 3 , n - 2 ) 21V (O, 1 , . . . , n - 2 , n - 1 ] _ , - , , ,OV

\ t l , S 2 , . . . , S m + l , t m + l / n s 2 , t 2 , . . . , S m + l , t m + i "

By Lemma 2 the points appearing in the Vandermonde determinants are

distinct and arranged in increasing order and consequently the determinants

are strictly positive. Since 21 > 0 by specification and n is odd we deduce

that d',(t) > O.

We are prepared to complete the proof of the uniqueness assertion of

Theorem 2.

St )+m+ 1, S t )m+l L e m m a 8. Let ~ j j t {sj}~ '+t and ~,./jl , {~j}~,+l be two sets o f

roots satisfying (2.3) and (2.4). Then t~= tj, j = 1 , 2 , . . . , m + 1 and

s = ~j, j = 2 , . . . ,m + 1.

CHARACTERIZATION OF MOMENT POINTS 229

ProoL

Assume that the sets of roots are not identical. Invoking the implicit func-

tion theorem as above we start at t~ and Tt (say i~ > t~) and generate two

curves ~(t) and ~(t) in A such that (2.11)is satisfied for all t e [ 0 , ? l ] .

By Lemma 6 these curves coincide for all t~ [0, rl] . Since by hypothesis (2.4)

is satisfied we have d,(tl)= d, (? l )= 0, contradicting the fact that dn(t) is

strictly monotone increasing (Lemma 7). This contradiction proves the lemma.

w Extens ions .

The results of Section 2 have been specifically elaborated in the context

of the system of functions {1, t,...,t"} which generate the classical reduced

moment space 111,+1. In this section we indicate the extensions of these re-

sults to the case of generalized moment spaces induced by a Complete Tche-

bycheff system of functions {ui}~. For the relevance, utility and importance

of general Tchebycheff systems in analysis and applications, we refer the

reader to [3].

Let uo(t),ul(t),...,u,(t) be continuous functions on [a,b]. The set

{ui}~ is said to be a Tchebycheff(T) system provided the determinants

t~ ) (3.1) U = det[] u,(tj) n > 0

\ O,l,...,n

for all a < to < tt < ".. < t, ~ b. The set {ui}~o is called a Complete Tche- bycheff (CT) system if {ui} k is a T-system for k = 0,1,.-., n. I f ui ~ C"[a, b], i = 0,1,. . . , n then {ui}~ is referred to as an Extended Tchebycheff(ET) system

provided

(3.2) u* ) = detl[ u,(ti)[I > 0 \ 0, 1, . . . ,n ./

whenever a < t 0 < t l < . . . _ < t . < b where if for some j and p

t i _ l < t j = t j + 1 . . . . . t j+p<t j+ j ,+ l , then the j + l + l st row of (3.2) is (t) (0 (t) to be replaced by the vector of components [Uo(tj),ut (tj),...,u, (tj)]

230 SAMUEL KARLIN and LARRY SCHUMAKER

for l = 0 , 1 , . . . , p . When {ui}ko is an ET system for each k = 0 , 1 , . . . , n then

{ui}g is called an Extended Complete Tchebycheff(ECT) system.

For any given set of functions w~(t)~ c"-i[a,b] which are strictly positive

on [a, b] we form the system of functions

(3.3)

uo(t ) = Wo(t )

u~(t) = Wo(t)

t

f w~(~)d~ a

Un(t) = Wo(t) f wl('t) ~ W 2 ( ' 2 ) " ' " f Wn(,n)d,n...d,1 a a 0

and it is simple to verify that these constitute an ECT-system. It is proved in I-3, Chapter 11] that every ECT system satisfying the boundary conditions

i = 1,2, . . . ,n (3.4) u~J~(a) = 0,

j = 0 , 1 , 2 . . . , i - 1

may be represented in the form (3.3). I f an ECT-system does not satisfy (3.4)

it is a simple matter to subtract a Iinear combination of Uo,Ul,...,Uk_t from

uk (for k = 1,2,...,n) to obtain an ECT-system satisfying (3.4). It is also known

that the system {Vi(t ) = [-(Ui+ l(t))/Wo(t)]'}"o-1 (called the first reduced system)

is also an ECT-system.

If {ui} ~ is a Tchebycheff system on [0,1], then all of the definitions and

results from [3] mentioned in Section 1 are valid for the moment space lq,+l

defined by

1

,0 o / 0

where a traverses the set of all non-decreasing right continuous functions

of bounded variation. The appropriate generating curve (1.2) is

C,+ t = ((Uo(t),ut(t), ...,u,(t)) [0 -< t ~< 1}.

CHARACTERIZATION OF MOMENT POINTS 231

When {ui}~ is an ECT-system with u o - 1 the arguments of Theorem 2

proceed mutatis mutandis. The determinants occurring in the proof are po-

sitive owing to the ECr-property. The analog of the Jacobian (2.6) is positive

since as noted above the first reduced system {[(ui+t(t))/Wo(O]'}"o -1 is also

an ECT-system.

Theorem 2 is also valid if {ui}~ is merely a CT-system on [0,1] with uo --- 1.

This is proved by using a standard perturbation technique. Specifically, (see

[3]), we form the modified system of functions

1

�9 ~ ~/2-~ ~ u , ( x ) d x , i = O, 1 , . " , n

For e > 0, {ui(e;t)}~ is an ECT-system when {tt,}~ is a CT-system. Then for

each e > 0 (let n = 2m + 1, say) there exists a unique moment point c(e) s Int M"

and roots {ti(e)}7 '+1 and {s/~)}'~ +z satisfying (2.3) such that

m+l m+2 (3.5) c/(e) = ]~ ).flt~(t/~)) = ~2 i~jui(sj(~)) , i = 0 , 1 , . . . , n

j = l j = l

are the lower and upper principal representations of c(~). Now executing a

standard diagonal selection procedure corresponding to a sequence of ek

decreasing to zero, we produce {rj}~ '+1 and {~i}~ "+2 such that t j ( e k ) ~ 7i,

j = 1,2, . . . ,m + 1 and Sj(ek)'-->gj, j = 1 ,2 , . . . ,m + 2. Then clearly {?j}7 +~

and {gj}~+2 are the roots of the lower and upper principal representations

of a moment point ~.

REFERENCES

1. Tom M. Apostol, Mathematical Analysis, Addison-Wesley, Reading, Mass., 1960. 2. S. Karlin and L. S. Shapley, "Geometry of Moment Spaces," Mem. Am. Math. Soc.

No. 12, 1953. 3. S. Karlin and W. Studden, Tchebycheff Systems with Applications in Analysis

and Statistics, Interscience, New York, 1966. 4. M. G. Krein, "The Ideas ofP. L. Cebysev and A. A. Markov in the Theory of Limiting

Values of Integrals and Their Further Developments," Am. Math. Soc. Transl., Ser. 2,12,3-120. 5. G. Szeg6, Orthogonal Polynomials, revised edition, New York, 1959.

DEPARTMENT OF MATHEMATICS, STANFORD UNIVERSITY,

STANFORD, CALIFORNIA, U.S.A. (Received March 25, 1966)