fundamentals of real analysis by

FUNDAMENTALS OF REAL ANALYSISby

Dogan Comez

II. MEASURES AND MEASURE SPACES

II.1. Prelude

Recall that the Riemann integral of a real-valued function f on an interval [a, b] is defined as∫ b

a

f(x)dx := lim‖P‖→0

n∑i=1

f(xi)(xi − xi−1), (if the limit exists)

where P = {a = x1 < x2 < · · · < xn−1 < xn = b} is a partitioning of [a, b], ‖P‖ = maxi |xi −xi−1|, and xi ∈ [xi−1, xi] is arbitrary. Then, it follows that if f : [a, b] → R is a continuousfunction, then it is Riemann-integrable. Also, if f : [a, b]→ R has finitely many bounded jumpdiscontinuities, then it is Riemann-integrable. Indeed, there are functions with infinitely manybounded jumps which are still Riemann-integrable. However, there are “very simple” functionswith countably many bounded jumps which are not Riemann-integrable. For example, let

f(x) =

{1 if x ∈ [0, 1] ∩Q0 if x ∈ [0, 1] ∩Qc.

Then, lim sup‖P‖→0

∑ni=1 f(xi)(xi − xi−1) = 1 6= 0 = lim inf‖P‖→0

∑ni=1 f(xi)(xi − xi−1)!

Riemann integral also suffers from a convergence problem: let {ri} be an enumeration (listing)of rational numbers in [0, 1], and define a sequence of functions {fn} on [0, 1] by

fn(x) =

{1 if x = r1, r2, r3, . . . , rn

0 otherwise,

and f be the function in the example above. Then each fn is Riemann-integrable, fn → fpointwise on [0, 1], but f is not Riemann-integrable.

Verdict: Riemann integral has several deficits!

Next, observe that if f is a positive, continuous function on [a, b] or has finite jump discon-

tinuities, then∫ bafdx gives the area of the region R = {(x, y) : a ≤ x ≤ b, 0 ≤ y ≤ f(x)}.

However, if the discontinuities of f is more “complicated” or if f is continuous but has oscilla-tions on a “large” set, then f need not be Riemann-integrable; hence, for such regions “area”is not defined.

Question. Can we define area of a region without involving Riemann-integral? If so, how?

This was the question asked by mathematicians of late 19th century that led to the conceptof “measure” that would measure the size (length, area, volume, . . . etc.) of a set (in Rn).For, one needs a working definition of measure.

Recall that the area of a region A ⊂ R2 bounded by two “nice” (continuous, piecewiselinear) functions as A = {(x, y) : a ≤ x ≤ b, f(x) ≤ y ≤ g(x)} is given by the Riemann-

integral∫ ba|f(x) − g(x)|dx. This definition is still valid if f and g have finitely many bounded

jump discontinuities. This is a good start; however, as seen in the example above, there arefunctions with countably many bounded jump discontinuities which are not Riemann-integrable.

1

2

Furthermore, there are continuous curves (so called “space-filling curves”) which do not fit tothis scheme. Therefore, defining area via Riemann-integral has serious inadequacies.

Here comes Peano. In order to address this deficiency, Peano proposed the following geometricapproach (inspired from Archimedes’ idea on calculating areas enclosed by conic sections). ForS ⊂ R2, define

ai(S) = inner area of S = supremum of areas of all polygons contained in S,

ao(S) = outer area of S = infimum of areas of all polygons containing S,

and define A(S) = ai(S) if both the inner and outer area exist and the equality ao(S) = ai(S)holds. Although this is a significant improvement over the definition of area via Riemannintegral, it still has some drawbacks. For instance, if S = {(x, y) : x, y ∈ [0, 1] ∩ Q}, thenao(S) = 1 6= 0 = ai(S). Thus, some rather simple regions do not have area!

Jordan’s try. Inspired by Peano, Jordan proposed using the concept of “content” to addressthe problem. For S ⊂ [a, b], let P = {(xi, xi+1)} be any finite partition of [a, b] by open intervals.Let

Ji(S) = {∑

l(Ii) : S ⊃ Ii ∈ P contains interior points of S},

Jo(S) = {∑

l(Ii) : Ii ∈ P contains points of S ∪ ∂S},and let ci(S) = supP Ji(S) and ci(S) = infP Jo(S). Then, if ci(S) and co(S) exist and are equal,define c(S) = content of S = ci(S) and call S Jordan-measurable. It turns out that form manysets in R their content is well defined (which includes regions defined by space filling curves).Jordan also defined integral of a real-valued function f on [a, b] as follows. Let∫ b

a

f = supP

∑infEi

f(x).c(Ei)∫ b

a

f = infP

∑supEi

f(x).c(Ei),

where P = {Ei} is a partitioning of [a, b] into Jordan measurable sets. If∫ baf =

∫ baf, then the

common value is called the integral of f.

Jordan integral improves Riemann integral; furthermore, the collection of Jordan-measurablesets is richer than those having area in the sense of Peano. On the other hand, the contentconcept also has some problems; for example, ci([0, 1]∩Q) = 0 = ci([0, 1]∩Qc) and co([0, 1]∩Q) =1 = co([0, 1] ∩Qc); thus, both of these sets are not Jordan-measurable. Furthermore, there areeven some open (or closed) bounded sets which are not Jordan-measurable, which is a seriousdrawback.

Despite it’s shortcomings, Jordan measure (content) and Jordan measurable sets have thefollowing “nice” properties:

1) If A and B are Jordan measurable, then so are A ∩ B, A ∪ B, A \ B, B \ A (closureunder set operations).

2) c(E) ≥ 0 (non-negativity).3) If E and F are disjoint Jordan measurable sets, then c(E ∪ F ) = c(E) + c(F ) (finite

additivity); if E and F are not necessarily disjoint, c(E∪F ) ≤ c(E)+c(F ) (subadditivity).4) If A ⊂ B are Jordan measurable sets, then c(A) ≤ c(B) (monotonicity).5) If A is Jordan measurable and α is a real number, then α+A is also Jordan measurable

and c(A) = c(α + A) (translation invariance).

Exercise Prove the properties 2)-5) stated above.

3

Borel’s approach. The drawback of Jordan-measurability led Borel to define the propertiesthe measure of sets should satisfy explicitly. He stated that:

1) measure of a set should be non-negative2) measure of a (finite or infinite) union of disjoint sets should be the sum of measures of

the individual sets3) if A ⊂ B, then measure of B \ A should be measure of B minus measure of A4) every set whose measure is non-zero should be uncountable, and5) the location of the set should not affect the measure of the set (i.e. measure of A and

measure of α + A should be the same).

Notice that, these are essentially the features of Jordan content. Having described propertiesthat a measure must carry, Borel also restricted it to those sets which are constructible, wherehe defined a set constructible if it is

a) a union (finite or countably infinite) of disjoint intervals,b) the complement of a constructible set A wrt any other constructible set B covering A.

He prescribed that the measure of an interval would be its length. With these, he hoped that:(i) any subset of R would be constructible, and (ii) a measure satisfying the properties listedabove could be constructed.

Bad news! (Or, may be good!?) It turns out that it is impossible to construct a measureon subsets of R having all five properties above (see pp: 19-20 in the text). However,when n = 1, 2, it is possible to construct a set function on subsets of Rn if only finite additivityis required (Banach), and when n ≥ 3 even this is impossible (Hausdorff). On the other hand,constructing a set function on subsets of Rn satisfying first four properties is possible (thoughvery delicate).

The property (5) above is, for obvious reasons (it makes sense!), very essential; hence, havinga set function satisfying only first four properties does not address the problem. Furthermore,it turns out that not every subset of R is constructible either (we’ll see an example ofsuch a set later). Thus, since we would like to preserve as much of the properties (1)-(5) andsince many sets constructed over intervals seem to satisfy them, we have one option to follow:weaken one of the properties/requirements of Borel. Weakening (1), (3) and (4) does not makesense. As seen above, although weakening (4) by requiring only finitely additivity does not solvethe problem, one can weaken (2) in an appropriate fashion while keeping measure well-defined.These considerations provide compelling rationale to give up defining a measure on all of P(R),and, instead, let a measure be defined on certain (proper) subclasses of P(R).

First of all, we must be careful in applying “additivity” of a measure m. If additivity alsoincludes adding over not-necessarily countable index sets, then we would encounter paradoxicalsituations. For instance, (0, 1] = ∪x∈(0,1]{x}; hence,

1 = m((0, 1]) =∑x∈(0,1]

m({x}) =∑x∈(0,1]

0 = 0!

Therefore, it’d be safer to restrict it to “countable additivity”. This clears way to

Program. Develop: (1) a good understanding of constructible sets, and (2) a good concept ofmeasure on constructible sets.

Let’s start with (1). Following Borel’s prescription of constructibility, we see that constructiblesets must satisfy the following axioms:

4

(i) if A and B are constructible, then A ∪B is also constructible(i′) if {Ai} is a countable collection of constructible sets, then ∪iAi is also constructible(ii) if A is constructible, then so is Ac

(iii) R and ∅ are constructible.

Notice that, since intervals are a priori constructible, all open (closed) sets are constructible(why?), and so are arbitrary union (intersection) of closed (open) sets.

Definition. The smallest collection B of all subsets of R which contains all open sets andsatisfying (i′), (ii) and (iii) is called Borel sets (or Borel σ-algebra).

Remark. From the definition, the collection B includes:

(a) all closed sets(b) all arbitrary union of closed sets (Fσ sets)(c) all arbitrary intersection of open sets (Gδ sets)(d) all arbitrary intersection of sets in (b) (Fσδ sets)(e) all arbitrary union of sets in (c) (Gδσ sets)

and so on. Notice that this process goes on indefinitely; hence the definition of B is stated abovein a “peculiar” manner.

Exercises. 1. Show that Z, Q, Qc, C are Borel sets, where C is the Cantor set. Which one ofthe classes Fσ, Gδ, Fσδ, . . . each of these sets belong?

2. Prove that translate of any Fσ-set ( Gδ-set) is a Fσ-set ( Gδ-set)

Question. Does the collection B actually exist?

The answer is a resounding Yes as the following statement proves.

Fact.1 Given any collection O of subsets of R, there exists a smallest collection A of subsets ofR satisfying (i′), (ii), (iii) and containing O.

Note. Such a family A is called a σ-algebra (of subsets of R).

Proof. (of Fact.1) Let ∆ be the family of all σ-algebras of subsets of R that contain O. Then∆ is nonempty (it contains P (R)). Let A = ∩{D : D ∈ ∆}. Then, O ⊂ A. Furthermore,

(i) if A,B ∈ A, then A,B ∈ D for all D ∈ ∆. So, since such a D is a σ-algebra, A ∪ B ∈ Dfor all D ∈ ∆. Hence, A ∪B ∈ A.

(ii) if A ∈ A, then A ∈ D for all D ∈ ∆; and it follows that Ac ∈ D for all D ∈ ∆. Hence,Ac ∈ A.

(iii) By definition, ∅,R ∈ D for all D ∈ ∆; and hence, ∅,R ∈ A.

Therefore, A is a σ-algebra. From the definition of A, if D is a σ-algebra containing O, thenA ⊂ D; hence, A ⊂ D for all D ∈ ∆. Hence, A is the smallest σ-algebra containing O.

Remark. |B| = c. (See Real and Abstract Analysis, by Hewitt & Stromberg, Theorem 10.23,pp:133-134.) It is well known that |P(R)| = 2c > c; hence, B is a proper subset of P(R)! Thus,it appears that the smallest collection of sets that are constructible over all intervals in R doesnot include all subsets of R.

This remark suggests that in order to complete the program identified above, we might keep(1), (3)-(5) intact and restrict (2) to countable additivity and have measures defined onlargest possible subcollection (σ-algebra) A smaller than P(R).

5

Now, let’s see how can we approach the problem of constructing a measure m on a σ-algebra.Following Borel, the starting point is defining measure on finite sets and intervals. Definemeasure of an interval as its length. So m([a, b]) = l([a, b]) = l((a, b)) = l((a, b]) = l([a, b)) =b− a = m((a, b)). If a = −∞ or b =∞, then m([a, b]) = l([a, b] =∞. Since [a, b] = [a, b) ∪ {b},we must have m({b}) = 0 for any singleton {b} ⊂ R; consequently, measure of any finite set is0. Also, m([a, b]) = l([a, b]) < l([a, 2b]) = m([a, 2b]); hence m(A) ≤ m(B) should hold if A ⊂ B.This discussion, with a little luck, tells us that if we are to construct a measure on subsets ofR, on that collection we’d hope it to satisfy the following properties:

(i) 0 ≤ m(A) ≤ ∞(ii) m(A) ≤ m(B) if A ⊂ B (monotonicity)

(iii) m(∅) = 0 = m({a}), ∀a ∈ R(iv) m(I) = l(I) for any interval I ⊂ R(v) m(∪∞k=1Ak) =

∑∞k=1m(Ak) for all disjoint collection of sets {Ak} (countable additivity)

(vi) m(a+ A) = m(A), ∀a ∈ R (translation invariance).

Exercises

1. Recall that for S ⊂ [a, b] and P = {(xi, xi+1)} any finite partition of [a, b] by open intervals,we define

Ji(S) = {∑

l(Ii) : Ii ⊂ S, Ii ∈ P contains interior points of S},

Jo(S) = {∑

l(Ii) : Ii ∈ P contains points of S ∪ ∂S},and let ci(S) = supP Ji(S) and co(S) = infP Jo(S). Then, if ci(S) and co(S) exist and are equal,we define c(S) = content of S = ci(S) and call S Jordan-measurable. Prove that

(i) c(E) ≥ 0 (non-negativity).

(ii) If A ⊂ B are Jordan-measurable sets, then c(A) ≤ c(B) (monotonicity).

(iii) If E and F are disjoint Jordan-measurable sets, c(E∪F ) = c(E)+c(F ) (finite additivity);if E and F are not necessarily disjoint, c(E ∪ F ) ≤ c(E) + c(F ) (subadditivity).

(iv) If A is Jordan measurable and α is a real number, then α+A is also Jordan measurableand c(A) = c(α + A) (translation invariance).

2. Provide simple justifications that any Fσ, Gδ, Fσδ and Gδσ-set is a Borel set.

3. (i) Show that Z, Q, Qc, C are Borel sets, where C is the Cantor set. Which one of theclasses Fσ, Gδ, Fσδ, . . . each of these sets belong?

(ii) Prove that translate of any Fσ-set ( Gδ-set) is a Fσ-set ( Gδ-set).

II.2. Lebesgue Measure

Having Jordan’s improvement on Peano’s original ideas and Borel’s specifications of a measureand constructible sets, Lebesgue developed a theory of measure that addressed the limitationsof Jordan measure and applicable to a large subclass of P(R) that avoids pathological setsmentioned above. In this section, we will construct this measure known as Lebesgue measure.As expected, we’ll begin with measure of intervals.

Clearly, we have the following for bounded intervals:

6

a) 0 ≤ l(I) <∞b) l(I) ≤ l(J) if I ⊂ J (monotonicity)c) l(I ∪ J) = l(I) + l(J) if I, J are disjoint (finite additivity)d) l(a+ I) = l(I), ∀a ∈ R (translation invariance).

Exercise. Let I be a bounded open interval and I1, I2, . . . , In be a finite collection of boundedopen intervals. If I ⊂ ∪ni=1Ii, then `(I) ≤

∑ni=1 `(Ii).

Fact.2 Let I be a bounded open interval and I1, I2, . . . , In, . . . be a countable collection ofbounded open intervals such that I ⊂ ∪∞i=1Ii. Then `(I) ≤

∑∞i=1 `(Ii).

Proof. Let I = (a, b) and let ε > 0 be arbitrary. Consider

I ′ = (a− ε

4, a+

ε

4) and I ′′ = (b− ε

4, b+

ε

4).

Then the collection {Ii}∞i=1 ∪ {I ′, I ′′} is an open cover for [a, b]. Hence by Heine-Borel theorem,there exists a finite sub-cover, say {Ii}∞i=1 ∪ {I ′, I ′′}. Now, apply the Exercise above.

Remarks. 1. If the collection I1, I2, . . . , In, . . . in Fact.2 above were a disjoint, countablecollection of bounded open intervals covering I, then `(I) ≤

∑∞i=1 `(Ii) would follow (almost)

trivially from (c) above. (Exercise)

2. If, furthermore, {Ii} is a disjoint, countable collection of bounded open intervals withI = ∪∞i=1Ii, then `(I) =

∑∞i=1 `(Ii). (Exercise)

Now, we would like to extend these (nice) properties of the measure m(= `) on intervals tomore general sets of real numbers.

Definition. The set function m∗ : P(R)→ R#+ defined by

m∗(A) = inf

{∞∑k=1

`(Ik) : A ⊆∞⋃k=1

Ik, each Ik is a bounded open interval

}is called the (Lebesgue) outer measure.

Remarks. It is easy to show that:

(i) m∗(∅) = 0 and, m∗({a}) = 0 for all a ∈ R.(ii) m∗(A) ≤ m∗(B) if A ⊂ B.

Exercises. 1. Calculate m∗(A) if (i) A is a singleton or a finite set, (ii) A = Q ∩ [0, 1].

2) Show that m∗(A) = m∗(α + A) for all α ∈ R.

Fact.3 For any interval I ⊂ R, m∗(I) = `(I).

Proof. Assume I = (a, b), then , given any ε > 0, we have I ⊂ (a− ε2, b+ ε

2). Then, by Remark

(ii) above, m∗(I) ≤ `(I) < b − a + ε which implies that m∗(I) ≤ b − a = `(I). On the otherhand, b− a is a lower bound for the set of numbers

U =

{∞∑i=1

`(Ii) : I ⊂∞⋃i=1

Ii, {Ii} open

}.

Since m∗(I) = inf(U), which must be greater than or equal to any other lower bound, we havem∗(I) ≥ b− a = `(I). Hence m∗(E) = `(I).

7

Next assume that I = [a, b]. Then, for all ε > 0 (a, b) ⊂ I ⊂ (a− ε, b+ ε). Thus,

m∗(a, b) ≤ m∗(I) ≤ m∗(a− ε, b+ ε)

⇒ `(I) ≤ m∗(I) ≤ `(E) + 2ε

⇒ `(I) = m∗(I).

The other two bounded cases are proved similarly.

If I is an unbounded interval (i.e., if a = −∞ or b =∞), then there exists an interval I ′ ⊂ Iwith `(I ′) = n. Then m∗(I) ≥ m∗(I ′) = n for all n ≥ 1; hence, m∗(I) =∞ = `(I).

Fact.4 The outer measure m∗ is countably sub-additive, namely, if A1, A2, . . . is a sequence ofsubsets of R, then

m∗

(∞⋃i=1

Ai

)≤

∞∑i=1

m∗(Ai).

Proof. Fix ε > 0. Given any Ai, pick a cover of Ai consisting of open intervals {Ii,n} such that

m∗(Ai) ≤∞∑n=1

`(Ii,n) < m∗(Ai) +ε

2i.

Next consider the collection {Ii,n}∞i,n=1, which covers⋃∞i=1Ai. Hence

m∗

(∞⋃i=1

Ai

)= inf

{∞∑k=1

`(Jk) :∞⋃k=1

Jk ⊃∞⋃i=1

Ai

}.

Therefore,

m∗

(∞⋃i=1

Ai

)≤

∞∑i,n=1

`(Ii,n) =∞∑i=1

∞∑n=1

`(Ii,n) <∞∑i=1

(m∗(Ai) +

ε

2i

)=∞∑i=1

m∗(Ai) + ε. �

Remark. It was shown by Vitali (1905) that there are A,B ⊂ R disjoint such that m∗(A) > 0,m∗(B) > 0, and m∗(A ∪B) 6= m∗(A) +m∗(B).

Fact.5 m∗ is finitely additive implies m∗ is countably additive.

Proof. If {Ai}∞i=1 is a disjoint family of subsets of R and m∗ is finitely additive, then by Fact.4n∑i=1

m∗(Ai) = m∗

(n⋃i=1

Ai

)≤ m∗

(∞⋃−=1

Ai

)≤

∞∑i=1

m∗(Ai).

Now by letting n→∞ we have m∗(⋃∞i=1 Ai) =

∑∞i=1 m

∗(Ai).

Consequently, in order to have m∗ be countably additive, we need to seek a smaller class ofsubsets of R (than P(R)) on which m∗ is finitely additive.

A simplification: From Fact.4 and Fact.5, in order to show that m∗ is finitely additive on aclass of sets, all we need to show is that m∗(A∪B) ≥ m∗(A) +m∗(B) for any pair A,B in thatclass.

Definition. A set E ⊂ R is called measurable if given any A ⊂ R, then

m∗(A) = m∗(A ∩ E) +m∗(A ∩ Ec), i.e., E splits A m∗-additively.

Remarks. 1. ∅ and R are measurable.

2. E measurable implies that Ec is measurable.

8

Question. What is the structure and the size of the the class of measurable sets?

In order to answer this question we need to make an observation.

Fact.6 If m∗(E) = 0, then E is measurable.

Proof. Let A ∈ R. Then (A ∩ E) ⊂ E and (A ∩ Ec) ⊂ A. Hence by monotonicity of m∗

m∗(A ∩ E) +m∗(A ∩ Ec) ≤ m∗(E) +m∗(A) = m∗(A).

Hence m∗(A ∩ E) +m∗(A ∩ Ec) ≤ m∗(A). Conversely we notice that

m∗(A) ≥ m∗(A ∩ Ec) = 0 +m∗(A ∩ Ec) = m∗(A ∩ E) +m∗(A ∩ Ec).

Therefore, E is measurable.

Definition. A nonempty collection G of subsets of R is called an algebra if

i) R ∈ G,ii) if {Ai}ni=1 ⊂ G, then ∪ni=1Ai ∈ G, and

iii) A ∈ G implies Ac ∈ G.

Remark. It follows from (i) and (iii) that ∅ ∈ G. Also, it follows from (ii) and (iii) that if{Ai}ni=1 ⊂ G, then ∩ni=1Ai ∈ G.

Definition. A nonempty collection G of subsets of R is called a σ-algebra if it is an algebraand (ii′) if {Ai}∞i=1 ⊂ G, then ∪∞i=1Ai ∈ G.

Theorem.1 F = {E ⊂ R : E is measurable} is a σ−algebra.

Proof. We first note that ∅,R ∈ F . Hence (i) is satisfied. Also E ∈ F which implies thatEc ∈ F . Thus (iii) is also satisfied. Therefore, we must show {Ei}∞i=1 ⊂ F ⇒

⋃∞i=1 Ei ∈ F .

First we will consider the finite case. Let E1, E2 ∈ F . In order to prove that E1 ∪E2 ∈ F weneed to show for all A ⊂ R that

m∗(A) ≥ m∗(A ∩ (E1 ∪ E2)) +m∗(A ∩ (E1 ∪ E2)c).

Observe that

A ∩ (E1 ∪ E2) = (A ∩ E1) ∪ (A ∩ Ec1 ∩ E2).

Now

m∗(A ∩ (E1 ∪ E2)) +m∗(A ∩ (E1 ∪ E2)c) = m∗(A ∩ (E1 ∪ E2)) +m∗(A ∩ Ec1 ∩ Ec

2)

≤ m∗(A ∩ E1) +m∗(A ∩ E2 ∩ Ec1) +m∗(A ∩ Ec

1 ∩ Ec2)

= m∗(A ∩ E1) +m∗(A ∩ Ec1) = m∗(A)

since E2 ∈ F . Hence (E1 ∪ E2) ∈ F . Therefore F is an algebra.

Claim. If {E1, E2, . . . , En} is a disjoint family of sets in F , then for all A ⊂ R we have

m∗

(A ∩

(n⋃i=1

Ei

))=

n∑i=1

m∗(A ∩ Ei).

(Exercise: Prove this claim. Hint: Use induction.)

Next let {Ei} ⊂ F be an arbitrary (countable) family of sets. Without loss of generality, wecan assume that it is a disjoint family. Let E =

⋃∞i=1 Ei. We now need to show for all A ⊂ R

m∗(A) ≥ m∗(A ∩ E) +m∗(A ∩ Ec).

9

First observe that

Ec =∞⋂i=1

Eci ⊂

n⋂i=1

Eci =

(n⋃i=1

Ei

)c

for any n. Then,

m∗(A) = m∗

(A ∩

(n⋃i=1

Ei

))+m∗

(A ∩

(n⋃i=1

Ei

)c)since F is an algebra

≥ m∗

(A ∩

(n⋃i=1

Ei

))+m∗(A ∩ Ec) by the observation above.

By the Claim above, we have

m∗

(A ∩

(n⋃i=1

Ei

))+m∗(A ∩ Ec) =

n∑i=1

m∗(A ∩ Ei) +m∗(A ∩ Ec).

Now if we let n→∞, we have

m∗(A) ≥∞∑i=1

m∗(A ∩ Ei) +m∗(A ∩ Ec)

≥ m∗

(A ∩

(∞⋃i=1

Ei

))+m∗(A ∩ Ec)

= m∗ (A ∩ E) +m∗(A ∩ Ec).

Hence m∗(A) ≥ m∗(A ∩ E) + m∗(A ∩ Ec) and⋃∞i=1Ei ∈ F . This proves that F is a

σ−algebra.

Proposition. All intervals are in F (Hence B ⊂ F).

Proof. We will show that (a,∞) ∈ F for a ∈ R. Let A ⊂ R, which without loss of generalitycan be assumed with m∗(A) < ∞. Now, from the definition of m∗, we can find a countablefamily of open intervals {Ek} covering A such that for ε > 0

m∗(A) ≤∞∑k=1

`(Ik) < m∗(A) + ε.

For any k, let Ik = I ′k ∪ I ′′k where I ′k = Ik ∩ (a,∞) and I ′′k = Ik ∩ (−∞, a]. Then

m∗(Ik) = `(Ik) = `(I ′k) + `(I ′′k ) = m∗(I ′k) +m∗(I ′′k ).

Hence

m∗(A ∩ (a,∞)) ≤ m∗

((∞⋃k=1

Ik

)∩ (a,∞)

)= m∗

(∞⋃k=1

I ′k

)≤

∞∑k=1

m∗(I ′k).

Similarly we have have

m∗(A ∩ (−∞, a]) ≤∞∑k=1

m∗(I ′′k ).

Then

m∗(A ∩ (a,∞)) +m∗(A ∩ (−∞, a]) ≤∞∑k=1

[m∗(I ′k) +m∗(I ′′k )] =∞∑k=1

m∗(Ik) ≤ m∗(A) + ε.

Hence (a,∞) ∈ F . Proof for other types of intervals are similar and left as an exercise.

10

Remark. It follows from this proposition that the σ-algebra F is rather large (at least as largeas B). We will see below that

B ⊂ F ⊂ P(R),

an each of these inclusions is proper.

Exercise. Let E ⊂ R with m∗(E) > 0. Show that there exists a bounded subset of E that alsohas positive outer measure.

Definition. The set function m = m∗|F : F → [0,∞] is called the Lebesgue measure andthe triple (R,F ,m) is sometimes called the Lebesgue measure space.

Fact.7 The Lebesgue measure m inherits all of its properties from m∗, namely:

i) m(∅) = 0, m(R) =∞ii) If A ⊂ B, then m(A) ≤ m(B) for all A,B ∈ F

iii) m(A ∪B) = m(A) +m(B) for any A,B ∈ F with A ∩B = ∅ (finite additivity)iv) m(A) = `(A) if A is an intervalv) m is translation invariant

Furthermore, m is countably additive, namely,

vi) m(∪∞i=1Ai) =∑∞

i=1m(Ai) for any countable disjoint collection {Ai} ⊂ F .

Proof. All these properties, except (iii), (v) and (vi), easily follow from the properties of theLebesgue outer measure. SinceA ∩ B = ∅, it follows that Ac ∩ B = B. Hence, since A ismeasurable,

m∗(A ∪B) = m∗(A ∩ (A ∪B)) +m∗(Ac ∩ (A ∪B)) = m∗(A) +m∗(B),

which proves (iii). Proof of (v) is straightforward (exercise). For (vi), first observe that fromthe claim in Theorem.1 it follows that m∗ is finitely additive on F , which together with Fact.5implies (vi).

Another proof goes as follows: We need to show that if {Ek}∞k=1 ⊂ F is a disjoint collection,then

m

(∞⋃k=1

Ek

)=∞∑k=1

m(Ek).

We already know (from finitely additivity) that, for any n ≥ 1

m

(n⋃k=1

Ek

)=

n∑k=1

m(Ek).

So by monotonicity

m

(∞⋃k=1

Ek

)≥ m

(n⋃k=1

Ek

)=

n∑k=1

m(Ek).

Now if we let n→∞ we get the desired result. �

Now, we will exhibit some important features of Lebesgue measurable sets and Lebesguemeasure. Let’s begin with the approximation property of the Lebesgue measurable sets.

Theorem.2 Let E ⊂ R. The following are equivalent:

(i) E ∈ F(ii) For all ε > 0, there exists O open such that E ⊂ O and m∗(O \ E) < ε.(iii) For all ε > 0, there exists C closed such that C ⊂ E and m∗(E \ C) < ε.

11

(iv) There exists G ∈ Gδ with E ⊂ G such that m∗(G \ E) = 0.(v) There exists F ⊂ Fσ with F ⊂ E such that m∗(E \ F ) = 0.

Furthermore, if E ∈ F with m(E) <∞, then (ii)-(v) are equivalent to

(vi) For all ε > 0, there exists a finite union U of open intervals s.t. m∗(U M E) < ε whereA M B = (A \B) ∪ (B \ A).

Proof. First we make an important observation: if m∗(A) <∞ is measurable and A ⊂ B, then

m∗(B) = m∗(A ∪ (B \ A)) = m∗(A) +m∗(B \ A)⇒ m∗(B \ A) = m∗(B)−m∗(A).

We will prove (i) ⇒ (ii) ⇒ (iv) ⇒ (i) and (ii) ⇒ (iv) ⇒ (i). The other implications will beleft as an exercise.

(i) ⇒ (ii): Assume first that m∗(E) <∞. From definition, given ε > 0 there exists an opencover {Ik} of E such that,

∞∑k=1

l(Ik) < m∗(E) + ε.

Let O = ∪∞k=1Ik, then, since m∗ is an outer measure,

m∗(O) = m∗

(∞⋃k=1

Ik

)< m∗(E) + ε.

Hence m∗(O)−m∗(E) < ε, and since E has finite outer measure, we have

m∗(O \ E) = m∗(O)−m∗(E) < ε.

Next let m∗(E) = ∞ and let Ek = E ∩ [−k, k] for any k ≥ 1. So m∗(Ek) < ∞. Hence byabove, there exists Ok open containing Ek such that

m∗(Ok \ Ek) <ε

2k.

Since E =⋃∞k=1Ek ⊂

⋃∞k=1Ok = O, where O is open. Then

m∗(O \ E) == m∗

((∞⋃k=1

Ok

)\

(∞⋃k=1

Ek

)).

Claim. (⋃∞k=1 Ek) \ (

⋃∞k=1Ek) ⊂

⋃∞k=1(Ok \ Ek). (Exercise: Prove the claim.)

Hence

m∗(O \ E) ≤ m∗

(∞⋃k=1

(Ok \ Ek)

)≤

∞∑k=1

m∗(Ok \ Ek) <∞∑k=1

ε

2k= ε.

(ii) ⇒ iv): Given E, by (ii), pick an open set Ok such that m∗(Ok \ E) = 1k. Now let

G =⋂∞k=1Ok, then G ∈ Gδ. Since E ⊂ Ok for all k, E ⊂ G and we have

G \ E =

(∞⋂k=1

Ok

)\ E =

∞⋂k=1

(Ok \ E) ⊂ Ok \ E

for any k. Then m∗(G \E) ≤ m∗(Ok \E) < 1k

for any k ≥ 1.. This implies that m∗(G \E) = 0.

(iv) ⇒ i): For any set E ⊂ G, we have G = E ∪ (G \E) and E = G ∩ (G \E)c. If G is a setsatisfying (iv), then from the fact that Gδ ⊂ F we have G ∈ F . Also,

m∗(G \ E) = 0⇒ G \ E ∈ F ⇒ (G \ E)c ∈ F .

12

Hence E = G ∩ (G \ E)c ∈ F .

(ii) ⇒ (vi): Since E ∈ F , there exists G open with G ⊃ E such thatm∗(G \ E) < ε/2. Since G is open, G =

⋃(k disjoint) Ik. Then

⋃k disjoint

Ik = E ∪

( ⋃k disjoint

Ik \ E

)⇒ m∗

( ⋃k disjoint

Ik

)< m∗(E) +

ε

2.

Note that

m∗

( ⋃k disjoint

Ik

)=∑k

m∗(Ik) =∑k

`(Ik).

So∑

k `(Ik) is convergent. Hence there exists N such that

∞∑k=N+1

`(Ik) <ε

2.

Now let U =⋃Nk=1 Ik. Then U \ E ⊂ G \ E ⇒ m∗(U \ E) < ε

2. Thus

E \ U = E ∩ U c = E ∩

(N⋃k=1

Ik

)c

= E ∩

(N⋂k=1

Ick

).

Hence E \ U =⋃Nk=1(E \ Ik) ⊂

⋃∞k=N+1 Ik and

m∗(E \ U) < m∗

(∞⋃

k=N+1

Ik

)≤

∞∑k=N+1

`(Ik) <ε

2.

Therefore m∗(U M E) < ε.

(vi) ⇒ (i): We begin by making two observations. First, for any sets U, V ⊂ R, we haveU c \ V c = V \U (and V c \U c = U \ V ); hence, U M V = U c M V c. Consequently, m∗(U M V ) =m∗(U c M V c).

Secondly, if m∗(U) ≥ m∗(V ), then U ⊂ V ∪ (U M V ), which implies that m∗(U) ≤ m∗(V ) +m∗(U M V ). Similarly, if m∗(V ) ≥ m∗(U), then V ⊂ U ∪ (V M U), and therefore, m∗(V ) ≤m∗(U) +m∗(U M V ). Thus, |m∗(U)−m∗(V )| ≤ m∗(U M V ).

Now, assume (vi); so, given ε > 0, let B be an open set (finite union of open intervals) suchthat m∗(A M B) < ε

2. Hence, we have |m∗(A)−m∗(B|) < ε

2and |m∗(Ac)−m∗(Bc)| < ε

2. Since

B is measurable, we have, for any X ⊂ R, m∗(X) = m∗(B ∩X) +m∗(Bc ∩X). Thus,

|m∗(A ∩X) +m∗(Ac ∩X)−m∗(X)|= |m∗(A ∩X) +m∗(Ac ∩X)−m∗(B ∩X) +m∗(Bc ∩X)|≤ |m∗(A ∩X)−m∗(B ∩X)|+ |m∗(Ac ∩X)−m∗(Bc ∩X)|≤ m∗((A M B) ∩X) +m∗((Ac M Bc) ∩X) ≤ m∗(A M B) +m∗(Ac M Bc) < ε.

Therefore, m∗(A ∩X) +m∗(Ac ∩X) = m∗(X), implying that A ∈ F .

Corollary. For all E ∈ F , E = B ∪N where B ∈ B and m∗(N) = 0.

Proof. By the previous theorem (v), there exists F ∈ Fσ such that m∗(E \F ) = 0 and F ⊂ E.Then E = F ∪ (E \ F ). Take B = F and N = E \ F . �

This corollary says that the ”completion” of B is F .

13

Remark. The analysis above also proves that on (all of ) P(R) Lebesgue outer measure cannotbe additive (hence, cannot be countably additive); however, it’s additive on a (proper) subsetof it, i.e., on F .

Exercises. 1. Prove that A ∈ F if and only if for any ε > 0 there is a closed set C and anopen set O with C ⊂ A ⊂ O such that m∗(O \ C) < ε.

2. Let E ⊂ R have finite outer measure. Prove that E ∈ F if and only if for each boundedinterval (a, b),

b− a = m∗((a, b) ∩ E) +m∗((a, b) \ E).

Theorem.3 (Continuity of m, or Monotone convergence theorem for Lebesgue mea-sure) Let {Ek} ⊂ F .

i) If Ek ⊂ Ek+1 for all k ≥ 1, then m(⋃∞k=1Ek) = limn→∞m(En).

ii) If Ek ⊃ Ek+1 for all k ≥ 1 and m(E1) <∞, then m(⋂∞k=1 Ek) = limn→∞m(En).

Proof. (i) Assume m(Ek) <∞ for all k ≥ 1. Let

A1 = E1, A2 = E2 \ E1, A3 = E3 \ E2, . . . , Ak = Ek \ Ek−1

for k ≥ 2. Then {Ak} is a disjoint family in F , and furthermore∞⋃k=1

Ek =∞⋃k=1

Ak.

Hence

m

(∞⋃k=1

Ek

)= m

(∞⋃k=1

Ak

)=∞∑k=1

m(Ak) = limn→∞

n∑k=1

m(Ak) = limn→∞

n∑k=1

m(Ek \ Ek−1).

We note that the last sum is telescoping. So, it follows that

m

(∞⋃k=1

Ek

)= lim

n→∞m(En).

If there exists an n0 such that m(En0) =∞, then

En0 ⊂∞⋃n=1

En ⇒ m

(∞⋃n=1

En

)≥ m(En0) =∞.

When n ≥ n0, we have En0 ⊂ En, which implies that limn→∞m(En) =∞.

ii) Let Bk = E1\Ek for k ≥ 1. Then, since {Ek} is decreasing, {Bk} is an increasing collectionin F . Also

∞⋃k=1

Bk =∞⋃k=1

(E1 \ Ek) = E1 \∞⋂k=1

Ek.

Therefore, since m(E1) is finite, we now have

m

(∞⋃k=1

Bk

)= m

(E1 \

∞⋂k=1

Ek

)= m(E1)−m

(∞⋂k=1

Ek

).

By (i) and the fact that m(Ek) <∞ for each k, we get

m

(∞⋃k=1

Bk

)= lim

n→∞m(Bn) = lim

n→∞[m(E1)−m(Ek)] = m(E1)− lim

n→∞m(Ek).

Hence m(⋂∞k=1 Ek) = limn→∞m(En).

14

Remark. The condition m(E1) < ∞ in (ii) above cannot be removed. Exercise: Give anexample that if this condition is removed than the assertion of (ii) is no longer valid.

At this point, we would like to define a set operation that will be studied below in a specialsetting. Given a collection of subsets {En}∞n of a set X, similarly to the way we defined thelim sup an and lim inf an of a sequence, we define

lim supnAn = ∩∞n=1 ∪∞k=n Ak, and

lim infnAn = ∪∞n=1 ∩∞k=n Ak.

Exercises. Let {An}n ⊂ F .

1. m(lim infnAn) ≤ lim infnm(An).2. If m(∪∞n=1An) <∞, then m(lim supnm(An) ≤ m(lim supnAn).

Definition. We say that a property P holds almost everywhere on a measurable set E ifthere exists E0 ⊂ E with m(E0) = 0 such that P holds for all x ∈ E \ E0.

Observation: Let {Ek}∞k=1 ⊂ F . A point x ∈ R belongs to infinitely many E ′ks ⇔ for alln ≥ 1 there exists k ≥ n such that x ∈ Ek ⇔ for all n ≥ 1, x ∈

⋃k≥nEk ⇔

x ∈∞⋂n=1

(⋃k≥n

Ek

)(= lim sup

nEn).

Theorem.4 (Borel-Cantelli Lemma) Let {Ek}∞k=1 ⊆ F be a countable collection such that∞∑k=1

m(Ek) <∞,

then m(lim supnEn) = 0; i.e., almost every x ∈ R belongs to at most finitely many E ′ks.

Proof. By the above observation, we need to only show that the set of x ∈ R which belong toinfinitely many E ′ks has measure 0. Equivalently,

m

(∞⋂k=1

(⋃k≥n

Ek

))= 0.

First, notice that

m

(∞⋃k=1

Ek

)≤

∞∑k=1

m(Ek) <∞.

Lets “decreasify” by letting

A1 =∞⋃k=1

Ek, A2 =∞⋃k=2

Ek, . . . , An =∞⋃k=n

Ek, . . .

Then Ak ↓, and m(A1) <∞. Therefore, by continuity of m,

m

(∞⋂n=1

(⋃k≥n

Ek

))= m

(∞⋂n=1

An

)= lim

n→∞m(An)

= limn→∞

m

(∞⋃k=n

Ek

)≤ lim

n→∞

[∞∑k=1

m(Ek)

]= 0. �

15

Exercises

1. Show that m∗(A) = m∗(α + A) for all A ⊂ R, α ∈ R.

2. Calculate m∗(A) if

(i) A is a singleton or a finite set,(ii) A = Q ∩ [0, 1].

3. Show that

(i) For any a, b ∈ R, a < b, the sets (a, b), [a, b), (a, b], and [a, b] are measurable.(ii) Q and Qc are measurable, and calculate m(Qc ∩ [0, 1]).

4.Let E ⊂ R with m∗(E) > 0. Show that there exists a bounded subset of E that also haspositive outer measure.

5. Show that if A,B ∈ F , then m(A ∪B) +m(A ∩B) = m(A) +m(B).

6. Prove (i) ⇔ (iii) ⇔ (v) in Theorem.2; that is, TFAE:

(i) E ∈ F(iii) For all ε > 0, there exists C closed such that C ⊂ E and m∗(E \ C) < ε.(v) There exists F ⊂ Fσ with F ⊂ E such that m∗(E \ F ) = 0.

[Hint: Use the fact that (i) ⇔ (ii).]

7. Prove that A ∈ F if and only if for any ε > 0 there is a closed set C and an open set O withC ⊂ A ⊂ O such that m∗(O \ C) < ε.

8. Let E ⊂ R have finite outer measure. Prove that E ∈ F if and only if for each boundedinterval (a, b),

b− a = m∗((a, b) ∩ E) +m∗((a, b) \ E).

9. Given a collection of subsets {En}∞n of a set X, similarly to the way we defined the lim sup anand lim inf an of a sequence, we define

lim supnAn = ∩∞n=1 ∪∞k=n Ak, and

lim infnAn = ∪∞n=1 ∩∞k=n Ak.

10. Show that for {An}n ⊂ F .

(i) m(lim infnAn) ≤ lim infnm(An).(ii) If m(∪∞n=1An) <∞, then m(lim supnm(An) ≤ m(lim supnAn).

11. Let A ∈ F and r ∈ R. Show that the sets r + A and rA are also measurable and

m(r + A) = m(A), and m(rA) = |r|m(A).

12. Show that if A,B ∈ F , then m(A ∪B) +m(A ∩B) = m(A) +m(B).

II.3. Non Measurable Sets

As we promised in section I.1 above, we will now “construct ”a non-measurable set; hence,proving that not all subsets of R belong to F (equivalently, not every subset of R is con-structible).

16

Lemma. Let E ∈ F be a bounded set with m(E) ≥ 0. If Λ is a countably infinite bounded setof real numbers such that {λ+ E}λ∈Λ is disjoint, then m(E) = 0.

Proof. Exercise.

For a set E ∈ F let a relation ∼ on E be defined by

x ∼ y if and only if x− y ∈ Q.

It is easy to show that ∼ is an equivalence relation on E (Exercise).

Recall that an equivalence relation partitions a set into equivalence classes, the equiva-lence classes of E with respect to ∼ are either disjoint or the same. Namely, E = {[x] :[x] is the equivalence class of x with respect to ∼}; and, for all x, y ∈ E, either [x]∩ [y] = ∅ or[x] = [y]. Now, let

CE = {set of single element representatives from each E/ ∼}.

Then we observe that:

(i) For all x ∈ E, there exists c ∈ CE such that x = c+ q for some q ∈ Q.(ii) For all x, y ∈ CE, x 6= y, x− y ∈ Qc.

Hence, for any pair of distinct rationals, say p, q, we have (p+CE)∩ (q+CE) = ∅, for otherwise,it follows that p + x = q + y for some x, y ∈ CE, which would imply x − y = p − q ∈ Q,contradicting (ii) above.

Theorem.5 (Vitali) If E ⊂ R with m∗(E) > 0, then there exists A ⊂ E such that A 6∈ F .

Proof. Since m∗(E) > 0, we can assume that E is bounded. So consider CE for this set E.

Claim. CE 6∈ FThis claim proves the assertion by letting A = CE. Hence, all we need is to prove the Claim.

For, assume for a contradiction that CE ∈ F . Since E is bounded, E ⊂ [−m,m] for somem ∈ Z.

Pick Λ0 = [−2m, 2m] ∩Q. Then Λ0 is countably infinite, bounded, and dense in [−2m, 2m].Then, for any x ∈ E, there exists c ∈ CE such that x = c + q for some q ∈ Q. Sincex, c ∈ [−m,m], q ∈ [−2m, 2m], ⋃

λ∈Λ0

(λ+ CE)

is bounded. By previous Lemma, m(CE) = 0. Hence

0 < m∗(E) ≤ m∗

( ⋃λ∈Λ0

(λ+ CE)

)≤∑λ∈Λ0

m∗(λ+ CE) =∑λ∈Λ0

m∗(CE) = 0.

This is a contradiction! Hence the claim must hold. �

Corollary. There exists A,B ⊂ R, A ∩B = ∅, such that

m∗(A ∪B) < m∗(A) +m∗(B).

Proof. For otherwise Theorem.5 would be false. �

Remark. It follows from the Corollary that F ( P(R).

17

Exercise. Let E ⊂ R be a non-measurable set with m∗(E) < ∞. Show that there is a Gδ-setG with E ⊂ G such that m∗(E) = m∗(G) and m∗(G \ E) > 0.

II.4 The Miraculous Cantor Set

First, let’s recall the construction of the Cantor set:

C0 = [0, 1]

C1 =

[0,

1

3

]∪[

2

3, 1

]C2 =

[0,

1

9

]∪[

2

9,1

3

]∪[

2

3,7

9

]∪[

8

9, 1

]...

Let C =⋂∞n=1Cn. Then we have Cn ⊃ Cn+1, for all n ≥ 1. Thus by the Nested Set Theorem,

C 6= ∅. The set C is called the Cantor Set.

Fact.8 (i) C is closed.

(ii) m(C) = 0.

(iii) C is uncountable.

Proof. By construction C is closed. Next, C ∈ F (why?). Also, since m(C1) = 23, m(C2) =

49, m(C3) = 8

27, . . . , we have m(Cn) = (2

3)n for n ≥ 1. Hence by the continuity of m,

m(C) = m

(∞⋂n=1

Cn

)= lim

n→∞m(Cn) = 0,

proving (i).

Now, observe that if x ∈ [0, 1] can be expressed in ternary form as

x =∞∑k=1

ak3k, where ak ∈ {0, 2},

then x ∈ C. The converse is true as well. Recall that this (ternary) expansion is unique withthe convention that, if

x =∞∑k=1

ak3k, where am = 1 and ak = 0 for k ≥ m+ 1,

then instead take its equivalent

x =m−1∑k=1

ak3k

+∞∑k=m

2

3k.

Therefore, it follows that C ∼ {0, 2}N; and consequently |C| = |{0, 2}N| = c.

Exercise. Prove that C ⊕ C = [0, 2] and C C = [−1, 1].

Now we will construct a function that will be instrumental in many examples/constructions;namely, the Cantor-Lebesgue Function.

18

Let ϕ1 ∈ (C[0, 1], || · ||∞) be defined as

ϕ1(x) =

0 if x = 01 if x = 112

if x ∈(

13, 2

3

)interpolate linearly on the rest.

So ϕ1 is continuous, non-decreasing, and ϕ1([0, 1]) = [0, 1]. Next define

ϕ2(x) =

0 if x = 01 if x = 112

if x ∈(

13, 2

3

)14

if x ∈(

19, 2

9

)34

if x ∈(

79, 8

9

)interpolate linearly on the rest.

Again, ϕ2 is continuous, non-decreasing, and ϕ2([0, 1]) = [0, 1]. Continue constructing ϕn, n ≥3, in this manner. So {ϕn} ⊂ C[0, 1] for each n ≥ 1, defined as

ϕn(x) =

0 if x = 01 if x = 1

2k−12n

if x ∈ Ink , 1 ≤ k ≤ 2n−1

interpolate linearly on the rest,

where {Ink }1≤k≤2n−1 , are the open intervals removed from the set Cn−1 to obtain Cn in theconstruction of Cantor set. Now, for m < n are integers, we have

|ϕn(x)− ϕm(x)| < 1

2m

for all x ∈ [0, 1] (notice that |ϕn+1 − ϕn| ≤ 12|ϕn − ϕn−1| for n ≥ 2). Hence

||ϕn − ϕm||∞ = maxx∈[0,1]

|ϕn(x)− ϕm(x)| < 1

2m;

and hence, {ϕm} is Cauchy in C[0, 1]. Since C[0, 1] is a complete metric space, {ϕn} convergesuniformly to a function ϕ, that is continuous and non-decreasing with ϕ([0, 1]) = [0, 1]. Thusϕ : [0, 1]→ [0, 1] is an onto function such that

• ϕ is non-decreasing,• ϕ(x) = k

2n, n ≥ 1. 1 ≤ k ≤ 2n−1 when x ∈ Ikn, where Ikn’s are intervals removed at the nth

stage of the Cantor set construction.

Let O = [0, 1] \ C, which is measurable. Then [0, 1] = O ∪ C and

[0, 1] = ϕ([0, 1]) = ϕ(C) ∪ ϕ(O).

Since ϕ(O) is a countable set, m∗(ϕ(O)) = 0. Hence, ϕ(O) ∈ F . Consequently, ϕ(C) is alsomeasurable with measure m(ϕ(C)) = 1. Thus by Vitali’s Theorem, there exists P ⊂ ϕ(C) suchthat P 6∈ F . On the other hand ϕ−1(P ) ⊂ C. Since m(ϕ−1(P )) = 0, it follows that ϕ−1(P ) ∈ F .Thus, we proved that

Fact.9 The function ϕ maps a measurable set (a subset of C) onto a non-measurable set.

Next let f : [0, 1] → [0, 2] by f(x) = x + ϕ(x). Then it follows from the properties of ϕ andthe function x that

• f is continuous and onto [0, 2].• f is strictly increasing.

19

Since f is strictly increasing, it maps Borel sets onto Borel sets. (Why?) Note that if A ∈ Fsuch that ϕ(A) 6∈ F , then f(A) 6∈ F either. Then A 6∈ B holds, for otherwise if A ∈ B, so wouldf(A). This implies that f(A) ∈ F which is a contradiction. Hence:

Corollary. There exists a nonempty set A ∈ F \ B.

Exercise. Show that there exists a continuous, strictly increasing function f : [0, 1] → R thatmaps a set of positive measure onto a set of measure zero.

II.5. General Measure Spaces (A Brief Survey)

In this section, following similar steps as in Section II.2, we will construct measures on setsnot necessarily equal to R, generalizing the Lebesgue measure.

Definition. A nonempty collection G of subsets of a set X is called an algebra if

i) X ∈ G,ii) if {Ai}ni=1 ⊂ G, then ∪ni=1Ai ∈ G, andiii) A ∈ G implies X \ A ∈ G.

Remark. It follows that ∅ ∈ G and that if {Ai}ni=1 ⊂ G, then ∩ni=1Ai ∈ G.

Definition. A nonempty collection G of subsets of X is called a σ-algebra if it is an algebraand (ii′) if {Ai}∞i=1 ⊂ G, then ∪∞i=1Ai ∈ G.

Remark. The collection G = {∅, X} and P(X) are σ-algebras for any X, where {∅, X} is thesmallest and P(X) is the largest σ-algebra.

It is easy to see that the intersection of a family {Aα} of σ−algebras of a set X is anotherσ−algebra (Exercise). As before, we actually have a more general statement:

Fact.10 Given a collection E of subsets of X, then there exists a smallest σ−algebra A ofsubsets of X such that E ⊂ A.

Proof. Exercise. [Hint: Mimic the proof of Fact.1 in Section II.1]

Notation: A := σ(E) is called as the σ−algebra generated by E .

Exercise. If G is a σ−algebra and E ∈ G, then the collection A = {A ∩ E : A ∈ G} is also aσ−algebra.

Definition. Let X be a (non-empty) set and A be a σ−algebra of subsets of X, then the pair(X,A) is called a measurable space.

Examples. 1. Let X be any uncountable set and let

A = {A ⊂ X : |A| ≤ ℵ0 or |Ac| ≤ ℵ0}.

Then A is a σ-algebra and (X,A) is a measurable space.

2. Let X = R and

A = {finite, disjoint union of sets [a, b), where −∞ ≤ a <∞, −∞ < b ≤ ∞}.

Then (X,A) is not a measurable space. Note that A is an algebra, but not a σ−algebra. Onthe other hand, σ(A) = B, which is a σ-algebra.

20

Definition. Let (X,A) be a measurable space. A set function µ : A → [0,∞] is called ameasure (on A) if:

i) µ(∅) = 0.ii) µ(

⋃∞k=1 Ek) =

∑∞k=1 µ(Ek) for any disjoint collection {Ek} ⊂ A.

The triple (X,A, µ) is called a measure space. If µ(X) <∞, then (X,A, µ) is called a finitemeasure space; if, in particular, µ(X) = 1, then (X,A, µ) is called a probability space. IfX =

⋃∞n=1 Xn such that µ(Xn) < ∞ for each n, then (X,A, µ) is called a σ−finite measure

space.

Definition. A set function µ : A → [0,∞] satisfying

i) µ(∅) = 0.ii) µ(

⋃ni=1Ei) =

∑ni=1 µ(Ei)

for any disjoint collection {Ei}ni=1 ⊂ A, is called a finitely additive measure.

Examples. 1. (R,F ,m) and (R,B,m) are both measure spaces.

2. For any a, b ∈ R, a < b, the triple ([a, b],F|[a,b],m), where F|[a,b] = {A ∩ [a, b] : A ∈ F}, isa finite measure space; in particular, ([0, 1],F|[0,1],m) is a probability space.

3. Let X be any infinite set and let A = P(X). Define µ : A → [0,∞] by

µ(A) =

{0 if |A| <∞∞ if A is infinite.

Then µ is not a measure, but is a finitely additive measure.

4. Let X = Z, A = P(Z), and define c : A → [0,∞] by c(A) = |A|. Then c is a measure onZ (called the counting measure).

5. Let X be any uncountable set and let

A = {A ⊂ X : A or Ac is countable}.If a set function µ : A → [0,∞] is defined by

µ(A) =

{0 if A is countable1 if Ac is countable

,

then µ is a measure.

Remark. Sets of measure zero and almost all properties are defined in any measure spaceexactly the same as in the case of the Lebesgue measure.

Theorem.6 Let (X,A, µ) be a measure space. Then

i) For all A,B ∈ A such that A ⊂ B, µ(A) ≤ µ(B).ii) If {Ai}∞i=1 ⊂ A, then

µ

(∞⋃i=1

Ai

)≤

∞∑i=1

µ(Ai) (Subadditivity).

iii) If {Ai} ⊂ A, and Ai ⊂ Ai+1 for all i ≥ 1, then

µ

(∞⋃i=1

Ai

)= lim

n→∞µ(An) (Continuity from below).

21

iv) If {Ei} ⊂ A, and Ei ⊃ Ei+1, for all i ≥ 1 and µ(E1) <∞, then

µ

(∞⋂i=1

Ei

)= lim

n→∞µ(En) (Continuity from above).

v) Borel-Cantelli Lemma holds.

Proof. Similar to the proofs of the analogous properties in the case of m. �

Discussion: (R,B,m) vs. (R,F ,m).It is known that C ∈ B, but C contains an E ⊂ C such that E is not a Borel set. Thus,in (R,B,m), C has a subset, say E, such that E 6∈ B. However, E ∈ F . Notice m(C) = 0,hence there exists a measure space with the property that not every subset of measure zero ismeasurable. Such measure spaces are called incomplete.

Definition. A measure space (X,A, µ) is called complete if E ∈ A where µ(E) = 0 andF ⊂ E, then F ∈ A.

Theorem.7 Let (X,A, µ) be a measure space and N = {N ∈ A : µ(N) = 0}. Let

A = {E ∪ F : E ∈ A, F ⊂ N for some N ∈ N},and define µ : A → [0,∞] by µ(E ∪ F ) = µ(E). Then A is a σ−algebra and µ is a measure onA such that (X,A, µ) is a complete measure space.

Proof. First we will show that A is a σ−algebra. Let E = A ∪ F ∈ A. Without loss ofgenerality we can assume that A ∩ F = ∅ (otherwise replace F by F \ A and N by N \ A).

Observe that A ∪ F = (A ∪N) ∩ (N c ∪ F ). Then

Ec = (A ∪N)c ∪ (N c ∪ F )c = (A ∪N︸︷︷︸∈A

)c ∪ (N \ F︸︷︷︸∈N

).

Thus Ec ∈ A. If {Ai ∪ Fi} ⊂ A, then∞⋃i=1

(Ai ∪ Fi) = (∞⋃i=1

Ai︸︷︷︸∈A

) ∪ (∞⋃i=1

Fi︸︷︷︸∈N

).

Thus A is a σ−algebra. It is left as an exercise to show that µ satisfies all the properties ofbeing a measure.

Now, all we need to show is that µ is well defined. Let E ⊂ A, and A1 ∪ F1 = E = A2 ∪ F2.Then A1 ⊂ A2 ∪ F2 ⊂ (A2 ∪N2). Hence

µ(A1) ≤ µ(A2 ∪N2) = µ(A2).

Similarly we get µ(A1) ≥ µ(A2). Thus µ(A1) = µ(A2), so µ is well defined. �

Question: How can we construct measures?

We will answer this question, as in the case of Lebesgue measure, via “outer measures.”

Definition. Let X 6= ∅ be a set. A set function µ∗ : P(X) → [0,∞] is called an outermeasure (on X) if :

i) µ∗(∅) = 0.ii) µ∗(A) ≤ µ∗(B) when A ⊂ B.

iii) µ∗(⋃∞k=1Ak) ≤

∑∞k=1 µ

∗(Ak).

22

Exercise. If an outer measure is is finitely additive, then it is a measure.

Given an outer measure on X, then we define µ∗−measurable sets as:Definition. A set E ⊂ X is called a µ∗−measurable set if for all A ⊂ X,

µ∗(A) = µ∗(A ∩ E) + µ∗(A ∩ Ec).

Examples. 1. Let X be an infinite set and define µ∗(E) = |E| if E is finite and µ∗(E) = ∞if E is infinite. Then µ∗ is an outer measure (Show). Exercise: Determine the µ∗-measurablesets.

2. Let X be any set and define µ∗(∅) = 0 and µ∗(E) = 1 if E 6= ∅. Then µ∗ is an outermeasure (Show). Exercise: Determine the µ∗-measurable sets.

3. 1. Let X be an uncountable set and define µ∗(E) = 0 if E is countable and µ∗(E) = 1 if Eis uncountable. Then µ∗ is an outer measure (Show). Exercise: Determine the µ∗-measurablesets.

Theorem.8 (Caratheodory) Given a set X and an outer measure µ∗ on X, the collection

A := {E ⊂ X : E is µ∗ −measurable}

is a σ−algebra of subsets of X, and µ∗|A = µ is a (complete) measure on X.

Proof. Similar to the corresponding proofs for m (Theorem.1, Theorem.2 and its Corollary inSection II.2). �

Question: How can we construct outer measures?

Definition. Let X 6= ∅ be a set. A collection κ of subset of X is called a (sequential)covering class for X if for all A ⊂ X, there exists {Ei}∞i=1 ⊂ κ such that A ⊂

⋃∞i=1 Ei.

Examples.

1. In R, κ = {∅} ∪ {(n, n+ 2)}n∈Z is a covering class.2. In R, κ = {All intervals with rational endpoints} ∪ {∅} is a covering class (for m).

.

Exercise. Find a suitable sequential covering class κ for X in each of the examples 1-3 above.

Let λ : κ→ [0,∞] be a set function such that λ(∅) = 0. For any A ⊂ X, let µ∗λ(A) be the setfunction (induced by λ) on P(X) defined as

µ∗λ(A) = inf

{∞∑k=1

λ(Ek) : {Ek} ∈ κ,∞⋃k=1

Ek ⊃ A

}.

Theorem.9 The set function µ∗λ defined above is an outer measure on X.

Proof (sketch). µ∗λ(∅) = 0, µ∗λ(A) ≥ 0 for all A ∈ X, and A ⊂ B ⇒ µ∗λ(A) ≤ µ∗λ(B) isleft as an exercise to the reader. We need to prove the countable subadditivity property. Let

23

{Ak} ⊂ P(X). Given ε > 0,

A1 ⊂∞⋃k=1

E1k such that {E1

k} ⊂ κ with∞∑k=1

λ(E1k) < µ∗(A1) +

ε

2

A2 ⊂∞⋃k=1

E2k such that {E2

k} ⊂ κ with∞∑k=1

λ(E2k) < µ∗(A2) +

ε

22

......

Now show that µ∗λ(⋃∞m=1Am) ≤

∑∞m=1 µ

∗λ(Am) + ε. Since ε > 0 is arbitrary, we get the desired

result. �

Examples. 1. If X = R, κ = {all open intervals}, and λ = length, then we obtain µ∗λ = m∗

which gives the Lebesgue measure m.

2. (Lebesgue Stieltjes measure) Let X = R, κ = {(−∞, a] : a ∈ R} or κ = {(a, b] : a, b ∈ R#}.Let f : R→ R be such that f is monotone increasing and right continuous (i.e. limx→a+ f(x) =f(a)). Define λ(a, b] = f(b) − f(a) with the convention that λ(∅) = 0. Let µ∗f denote the setfunction “induced by λ” (via f). Then by Theorem.9, µ∗f is the outer measure induced by f ,and by Theorem.8, µf = µ∗f |A=F = is the induced measure, called the Lebesgue-Stieltjesmeasure induced by f .

Exercises

1. If µ1, µ2, . . . , µn are measures on a measurable space (X,A) and α1, α2, . . . , αn ∈ [0,∞),then µ =

∑ni=1 αiµi is also a measure on (X,A).

2. If (X,A, µ) is a measure space and E ∈ A, then the set function µE : A → [0,∞], definedby µE(A) = µ(E ∩ A) for all A ∈ A, is also a measure on A.

3. Let µ be a finitely additive measure on a σ-algebra A of subsets of a set X. µ is a measureif and only if it is continuous from below.

4. Let µ be a finitely additive measure on a σ-algebra A of subsets of a set X with µ(X) <∞.µ is a measure if and only if it is continuous from above.

5. Let (X,A, µ) be a measure space.

a) If A,B ∈ A and µ(A4B) = 0, then µ(A) = µ(B).b) Define: A ∼ B if and only if µ(A4B) = 0. Then “∼” is an equivalence relation on A.c) For A,B ∈ A, define ρ(A,B) = µ(A4B). Then ρ is a metric on A/ ∼.

(Hence, (A/ ∼ , ρ) is a metric space).

6. If an outer measure µ∗ on a set X 6= ∅ is finitely additive, then it is a measure.

7. Let X 6= ∅ and κ = {∅, X, one-point sets}. Define λ(X) = ∞, λ(∅) = 0, and λ(E) = 1 if∅ 6= E ( X. Describe the outer measure µ∗λ.

8. Let X be an uncountable set and κ = {∅, X, one-point sets}. Define λ(X) = 1 and λ(E) = 0if E ( X. Describe the outer measure µ∗λ.

24

9. Let X 6= ∅, κ be a covering class for X and λ be a set function on P(X). If E ∈ κ, thenµ∗λ(E) ≤ λ(E). Give an example where strict inequality occurs.

10. Let X 6= ∅, κ be a covering class for X, λ be a set function on P(X) and µ∗λ be the outermeasure induced by κ and λ.

a) If κ is a σ-algebra and λ is a measure, then µ∗λ(E) = λ(E) for all E ∈ κ.b) If κ is a σ-algebra and λ is a measure, then every set in κ is µ∗λ-measurable.

[The measure µλ given by µ∗λ is called an extension of λ.]

11. Let X 6= ∅, κ = {∅, X} and λ be a set function on P(X) given by λ(E) = 0 if E = ∅and λ(E) = 0 if E = X. Determine the outer measure µ∗λ and describe the σ-algebra of µ∗λ-measurable sets. The same question when X = [0, 1].

12. Let X = R, κ = P(R) and define λ(E) = |{k ∈ Z : k ∈ E}|. Determine the outer measureµ∗λ and describe the σ-algebra of µ∗λ-measurable sets.

13. Let F be a real-valued, increasing and right continuous function on R. If µF is the associatedLebesgue-Stieltjes measure, then prove that, for any a, b ∈ R,

(i) µF ({a}) = F (a)− F (a−),(ii) µF ([a, b)) = F (b−)− F (a−),

(iii) µF ([a, b]) = F (b)− F (a−), and(iv) µF ((a, b)) = F (b−)− F (a).

fundamentals of real analysis by

Documents