honors mathematics ii functions of a single variable -...
TRANSCRIPT
Honors Mathematics IIFunctions of a Single Variable
Dr. Horst Hohberger
University of Michigan - Shanghai Jiaotong UniversityJoint Institute
Fall Term 2015
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 1 / 624
Introduction
Office Hours, Email, TAs Please read the Course profile, which has been uploaded to the
Resources section on the SAKAI course site. My office is Room 218 in in the Law School Building. My email is [email protected] and I’ll try to answer email queries
within 24 hours. Office hours will be announced on SAKAI. Please also make use of the chat room on SAKAI for asking
questions, making comments or giving feedback on the course. The recitation class schedule and the TA office hours and contact
details will be announced on SAKAI. Of course, you can also contact the TA with questions outside of
office hours, but please observe basic rules of politeness. It is notappropriate to phone your TA at 10 pm in the evening and requesthelp for the exercise set due the next day. Your TA is there to helpyou, but is not a 24/7 help line.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 2 / 624
Introduction
Coursework Policy
The following applies mostly to the paper-based homework at thebeginning of this term:
Hand in your coursework on time, by the date given on each set ofcourse work. Late work will not be accepted unless you come to mepersonally and I find your explanation for the lateness acceptable.
You can be deducted up to 10% of the awarded marks for anassignment if you fail to write neatly and legibly. Messiness will bepenalized!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 3 / 624
Introduction
Use of Wikipedia and Other Sources; Honor Code Policy
When faced with a particularly difficult problem, you may want to refer toother textbooks or online sources such as Wikipedia. Here are a fewguidelines:
Outside sources may treat a similar sounding subject matter at amuch more advanced or a much simpler level than this course. Thismeans that explanations you find are much more complicated or fartoo simple to help you. For example, when looking up the “inductionaxiom” you may find many high-school level explanations that are notsufficient for our problems; on the other hand, wikipedia contains alot of information relating to formal logic that is far beyond what weare discussing here.
If you do use any outside sources to help you solve a homeworkproblem, you are not allowed to just copy the solution; this isconsidered a violation of the Honor Code.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 4 / 624
Introduction
Use of Wikipedia and Other Sources; Honor Code Policy
The correct way of using outside sources is to understand thecontents of your source and then to write in your own words andwithout referring back to the source the solution of the problem. Yoursolution should differ in style significantly from the published solution.If you are not sure whether you are incorporating too much materialfrom your source in your solutions, then you must cite the source thatyou used.
You may cooperate with other students in finding solutions toassignments, but you must write your own answers. Do not simplycopy answers from other students. It is acceptable to discuss theproblems orally, but you may not look at each others’ written notes.Do not show your written solutions to any other student. This wouldbe considered a violation of the Honor Code.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 5 / 624
Introduction
Use of Wikipedia and Other Sources; Honor Code PolicyIn this course, the following actions are examples of violations of theHonor Code:
Showing another student your written solution to a problem. Sending a screenshot of your solution via QQ, email or other means
to another student. Showing another student the written solution of a third student;
distributing some student’s solution to other students. Viewing another student’s written solution. Copying your solution in electronic form (LATEX source, PDF, JPG
image etc.) to the computer hardware (flash drive, hard disk etc.) ofanother student. Having another student’s solution in electronic formon your computer hardware.
If you have any questions regarding the application of the Honor Code,please contact me or any of the TAs.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 6 / 624
Introduction
Grading Policy
It is JI policy to curve the grades for standard undergraduate coursesso that the median grade is a “B.” For this Honors course, the gradewill be curved to achieve a median grade of “B+.”
The grade will be composed of the course work and the exams asfollows:
First midterm exam: 20% Second midterm exam: 25% Final exam: 30% Course work: 25%
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 7 / 624
Introduction
What is Mathematics?
Mathematics is about certainty.
Mathematics is about modeling.
Mathematics is about thinking.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 8 / 624
Introduction
Course Focus
Please forget everything you have learned in school; you haven’tlearned it.
Edmund Landau, Foundations of Analysis, 1929
The course will focus on functions of one variable, in particular Foundations: Set Theory, Logic, Numbers, Algebraic Structures. Functions, Limits and Continuity, Differentiation Integration
The main textbook for this course is Calculus, M. Spivak, 3rd Edition.However, additional material from other sources will be added to thelecture. Further textbook references will then be given.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 9 / 624
Introduction
NotationThe notation used in these notes should be fairly well-known from school;some symbols will be defined explicitly to make sure that they areunderstood unambiguously (for example, the summation symbol
∑).
A particular convention needs some explaining: when an equality is adefinition (and not a statement that can be proven), the equal symbol willsometimes have a colon added on the side of the defined quantity. Forexample,
A ∪ B := x : x ∈ A or x ∈ B
indicates that that the union A ∪ B is being defined through this equality.Another example might involve
1 + 2 + · · ·+ n =n(n + 1)
2=: an,
indicating that the number an is defined to be n(n + 1)/2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 10 / 624
Introduction
Notation
On the other hand, it would be wrong to write
1 + 2 + · · ·+ n :=n(n + 1)
2
because the objects on the left and the right are already defined and theequality is a statement that can be proven; it does not define anything.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 11 / 624
Basic Tools and Definitions
Part I
Basic Tools and Definitions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 12 / 624
Basic Tools and Definitions
Basic Concepts in Logic
Basic Concepts in Set Theory
The Natural Numbers and Mathematical Induction
The Rational Numbers
Roots, Bounds & Real Numbers
Complex Numbers
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 13 / 624
Basic Tools and Definitions Basic Concepts in Logic
Basic Concepts in Logic
Basic Concepts in Set Theory
The Natural Numbers and Mathematical Induction
The Rational Numbers
Roots, Bounds & Real Numbers
Complex Numbers
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 14 / 624
Basic Tools and Definitions Basic Concepts in Logic
Propositional Logic, StatementsReference L. Loomis and S. Sternberg, Advanced Calculus, Sections 0.1-0.4.Freely available here (clickable link):
http://www.math.harvard.edu/∼shlomo/docs/Advanced_Calculus.pdf.
A statement (also called a proposition) is anything we can regard as beingeither true or false. We do not define here what the words “statement”,“true” or “false” mean. This is beyond the purview of mathematics andfalls into the realm of philosophy. Instead, we apply the principle that “weknow it when we see it.”We will generally not use examples from the “real world” as statements.The reason is that in general objects in the real world are much to looselydefine for the application of strict logic to make any sense. For example,the statement “It is raining.” may be considered true by some people(“Yes, raindrops are falling out of the sky.”) while at the same time falseby others (“No, it is merely drizzling.”) Furthermore, importantinformation is missing (Where is it raining? When is it raining?).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 15 / 624
Basic Tools and Definitions Basic Concepts in Logic
Statements1.1.1. Examples.
“3 > 2” is a true statement. “x3 > 0” is not a statement, because we can not decide whether it is
true or not. “the cube of any number is strictly positive (> 0)” is a false
statement.
The last example can be written using a statement variable x : “For any number x , x3 > 0”
The first part of the statement is a quantifier (“for any number x”), whilethe second part is called a statement frame or predicate (“x3 > 0”).A statement frame becomes a statement (which can then be either true orfalse) when the variable takes on a specific value; for example, 13 > 0 is atrue statement and 03 > 0 is a false statement.We will treat quantifiers and predicates in more detail later.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 16 / 624
Basic Tools and Definitions Basic Concepts in Logic
Working with Statements
We will denote statements by capital letters such as A,B,C , ... andstatement frames by symbols such as A(x) or B(x , y , z) etc.1.1.2. Examples.
A : 4 is an even number. B : 2 > 3. A(n) : 1 + 2 + 3 + ... + n = n(n + 1)/2.
We will now introduce logical operations on statements. The simplestpossible type of operation is a unary operation, i.e., it takes a statement Aand returns a statement B.1.1.3. Definition. Let A be a statement. Then we define the negation of A,written as ¬A, to be the statement that is true if A is false and false if Ais true.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 17 / 624
Basic Tools and Definitions Basic Concepts in Logic
Negation
1.1.4. Example. If A is the statement A : 2 > 3, then the negation of A is¬A : 2 > 3.We can describe the action of the unary operation ¬ through the followingtable:
A ¬AT FF T
If A is true (T), then ¬A is false (F) and vice-versa. Such a table is calleda truth table.We will use truth tables to define all our operations on statements.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 18 / 624
Basic Tools and Definitions Basic Concepts in Logic
ConjunctionThe next simplest type of operations on statements are binary operations.The have two statements as arguments and return a single statement,called a compound statement, whose truth or falsehood depends on thetruth or falsehood of the original two statements.1.1.5. Definition. Let A and B be two statements. Then we define theconjunction of A and B, written A ∧ B, by the following truth table:
A B A ∧ B
T T TT F FF T FF F F
The conjunction A ∧ B is spoken “A and B.” It is true only if both A andB are true, false otherwise.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 19 / 624
Basic Tools and Definitions Basic Concepts in Logic
Disjunction1.1.6. Definition. Let A and B be two statements. Then we define thedisjunction of A and B, written A ∨ B, by the following truth table:
A B A ∨ B
T T TT F TF T TF F F
The disjunction A∨B is spoken “A or B.” It is true only if either A or B istrue, false otherwise.1.1.7. Example.
Let A : 2 > 0 and B : 1+1 = 1. Then A∧B is false and A∨B is true. Let A be a statement. Then the compound statement “A ∨ (¬A)” is
always true, and “A ∧ (¬A)” is always false.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 20 / 624
Basic Tools and Definitions Basic Concepts in Logic
Proofs using Truth TablesHow do we prove that “A ∨ (¬A)” is an always true statement? We areclaiming that A ∨ (¬A) will be a true statement, regardless of whether thestatement A is true or not. To prove this, we go through all possibilitiesusing a truth table:
A ¬A A ∨ (¬A)T F TF T T
Since the column for A ∨ (¬A) only lists T for “true,” we see thatA ∨ (¬A) is always true. A compound statement that is always true iscalled a tautology.Correspondingly, the truth table for A ∧ (¬A) is
A ¬A A ∧ (¬A)T F FF T F
so A ∧ (¬A) is always false. A compound statement that is always false iscalled a contradiction.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 21 / 624
Basic Tools and Definitions Basic Concepts in Logic
Implication1.1.8. Definition. Let A and B be two statements. Then we define theimplication of B and A, written A⇒ B, by the following truth table:
A B A⇒ B
T T TT F FF T TF F T
We read “A⇒ B” as “A implies B,” “if A, then B” or “A only if B”.The fact that F⇒ T is a true statement may seem a little strange. Beforewe go into the reason for this, note that Definition 1.1.8 is a definition!That means, we don’t have to justify it in any way, there is nothing toprove, it is just a notation for a certain compound statement. Hence, wecould leave it at that and give no further explanation. However, it wouldbe useful and appropriate to motivate why we are defining the implicationin this way.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 22 / 624
Basic Tools and Definitions Basic Concepts in Logic
ImplicationTo illustrate why the implication is defined as it is, it is useful to look at aspecific example:
For all real numbers x , x > 0⇒ x3 ≥ 0 (1.1.1)
It is hoped that the reader will agree that this should be a true statement.In order to verify this, we would need to verify the truth of the statementobtained from the predicate “x > 0⇒ x3 ≥ 0” for all real numbers x .We shall see later that a real number can be either strictly positive, strictlynegative or zero (cf. the trichotomy law on Slide 66).
If x is a strictly positive number, both statements are true, so wehave T⇒ T.
If x is a strictly negative number, both statements are false, so wehave F⇒ F.
If x is zero, the first statement is false but the second statement istrue, so we have F⇒ T.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 23 / 624
Basic Tools and Definitions Basic Concepts in Logic
EquivalenceHence, if we want (1.1.1) to be a true statement, then we need to defineA⇒ B as being true in all of the three cases above. We next introduceyet another, important, compound statement.1.1.9. Definition. Let A and B be two statements. Then we define theequivalence of A and B, written A⇔ B, by the following truth table:
A B A⇔ B
T T TT F FF T FF F T
We read “A⇔ B” as “A is equivalent to B” or “A if and only if B”. Sometextbooks abbreviate “if and only if” by “iff.”If A and B are both true or both false, then they are equivalent.Otherwise, they are not equivalent. In propositional logic, “equivalence” isthe closest thing to the “equality” of arithmetic.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 24 / 624
Basic Tools and Definitions Basic Concepts in Logic
EquivalenceOn the one hand, logical equivalence is strange; two statements A and Bdo not need to have anything to do with each other to be equivalent. Forexample, the statements “2 > 0” and “100− 1 = 99” are both true, sothey are equivalent.1.1.10. Definition. Two compound statements A and B are called logicallyequivalent if A⇔ B is a tautology. We then write A ≡ B.
1.1.11. Example. The two de Morgan rules are the tautologies
¬(A ∨ B)⇔ (¬A) ∧ (¬B), ¬(A ∧ B)⇔ (¬A) ∨ (¬B).
To indicate that they are tautologies, we write
¬(A ∨ B) ≡ (¬A) ∧ (¬B), ¬(A ∧ B) ≡ (¬A) ∨ (¬B).
These logical equivalencies will be proved in the assignments.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 25 / 624
Basic Tools and Definitions Basic Concepts in Logic
Contraposition
An important tautology is the contrapositive of the compound statementA⇒ B.
(A⇒ B)⇔ (¬B ⇒ ¬A).
For example, for any number x , the statement “x > 0⇒ x3 > 0” isequivalent to “x3 > 0⇒ x > 0.” This principle is used in proofs bycontradiction.We prove the contrapositive using a truth table:
A B ¬A ¬B ¬B ⇒ ¬A A⇒ B (A⇒ B)⇔ (¬B ⇒ ¬A)T T F F T T TT F F T F F TF T T F T T TF F T T T T T
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 26 / 624
Basic Tools and Definitions Basic Concepts in Logic
Rye Whiskey
The following song is an old Western-style song, called “Rye Whiskey” andperformed by Tex Ritter in the 1930’s and 1940’s.
If the ocean was whiskey and I was a duck,I’d swim to the bottom and never come up.But the ocean ain’t whiskey, and I ain’t no duck,So I’ll play jack-of-diamonds and trust to my luck.For it’s whiskey, rye whiskey, rye whiskey I cry.If I don’t get rye whiskey I surely will die.
The lyrics make sense (at least as much as song lyrics generally do).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 27 / 624
Basic Tools and Definitions Basic Concepts in Logic
Rye WhiskeyOne can use de Morgan’s rules and the contrapositive to re-write the songlyrics as follows
If I never reach bottom or sometimes come up,Then the ocean’s not whiskey, or I’m not a duck.But my luck can’t be trusted, or the cards I’ll not buck,So the ocean is whiskey or I am a duck.For it’s whiskey, rye whiskey, rye whiskey I cry.If my death is uncertain, then I get whiskey (rye).
These lyrics seem to be logically equivalent to the original song, but arejust humorous nonsense. This again illustrates clearly why it is futile toapply mathematical logic to everyday language.This example is due to (clickable link) W. P. Cooke, The AmericanMathematical Monthly, Vol. 76, No. 9 (Nov., 1969), p. 1051.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 28 / 624
Basic Tools and Definitions Basic Concepts in Logic
Logical QuantifiersIn the previous examples we have used predicates A(x) with the words “forall x .” This is an instance of a logical quantifier that indicates for which xa predicate A(x) is to be evaluated to a statement.In order to use quantifiers properly, we clearly need a universe of objects xwhich we can insert into A(x) (a domain for A(x)). This leads usimmediately to the definition of a set. We will discuss set theory in detaillater. For the moment it is sufficient for us to view a set as a “collection ofobjects” and assume that the following sets are known:
the set of natural numbers N (which includes the number 0), the set of integers Z, the set of real numbers R, the empty set ∅ (also written ∅ or ) that does not contain any
objects.If M is a set containing x , we write x ∈ M and call x an element of M.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 29 / 624
Basic Tools and Definitions Basic Concepts in Logic
Logical QuantifiersThere are two types of quantifiers:
the universal quantifier, denoted by the symbol ∀, read as “for all” and the existential quantifier, denoted by ∃, read as “there exists.”
1.1.12. Definition. Let M be a set and A(x) be a predicate. Then wedefine the quantifier ∀ by
∀x∈M
A(x) ⇔ A(x) is true for all x ∈ M
We define the quantifier ∃ by
∃x∈M
A(x) ⇔ A(x) is true for at least one x ∈ M
We may also write ∀x ∈ M : A(x) instead of ∀x∈M
A(x) and similarly for ∃.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 30 / 624
Basic Tools and Definitions Basic Concepts in Logic
Logical Quantifiers
We may also state the domain before making the statements, as in thefollowing example.1.1.13. Examples. Let x be a real number. Then
∀x : x > 0⇒ x3 > 0 is a true statement; ∀x : x > 0⇔ x2 > 0 is a false statement; ∃x : x > 0⇔ x2 > 0 is a true statement.
Sometimes mathematicians put a quantifier at the end of a statementframe; this is known as a hanging quantifier. Such a hanging quantifierwill be interpreted as being located just before the statement frame:
∃y : y + x2 > 0 ∀x
is equivalent to ∃y∀x : y + x2 > 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 31 / 624
Basic Tools and Definitions Basic Concepts in Logic
Contraposition and Negation of QuantifiersWe do not actually need the quantifier ∃ since
∃x∈M
A(x) ⇔ A(x) is true for at least one x ∈ M
⇔ A(x) is not false for all x ∈ M
⇔ ¬ ∀x∈M
(¬A(x)) (1.1.2)
The equivalence (1.1.2) is called contraposition of quanifiers. It impliesthat the negation of ∃x ∈ M : A(x) is equivalent to ∀x ∈ M : ¬A(x). Forexample,
¬(∃x ∈ R : x2 < 0
)⇔ ∀x ∈ R : x2 < 0.
Conversely,
¬ (∀x ∈ M : A(x)) ⇔ ∃x ∈ M : ¬A(x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 32 / 624
Basic Tools and Definitions Basic Concepts in Logic
Vacuous Truth
If the domain of the universal quantifier ∀ is the empty set M = ∅, thenthe statement ∀x ∈ M : A(x) is defined to be true regardless of thepredicate A(x). It is then said that A(x) is vacuously true.1.1.14. Example. Let M be the set of real numbers x such that x = x + 1.Then the statement
∀x∈M
x > x
is true.This convention reflects the philosophy that a universal statement is trueunless there is a counterexample to prove it false. While this may seem astrange point of view, it proves useful in practice.This is similar to saying that “All pink elephants can fly.” is a truestatement, because it is impossible to find a pink elephant that can’t fly.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 33 / 624
Basic Tools and Definitions Basic Concepts in Logic
Nesting Quantifiers
We can also treat predicates with more than one variable as shown in thefollowing example.1.1.15. Examples. In the following examples, x , y are taken from the realnumbers.
∀x∀y : x2 + y2 − 2xy ≥ 0 is equivalent to ∀y∀x : x2 + y2 − 2xy ≥ 0.Therefore, one often writes ∀x , y : x2 + y2 − 2xy ≥ 0.
∃x∃y : x + y > 0 is equivalent to ∃y∃x : x + y > 0, often abbreviatedto ∃x , y : x + y > 0.
∀x∃y : x + y > 0 is a true statement. ∃x∀y : x + y > 0 is a false statement.
As is clear from these examples, the order of the quantifiers is important ifthey are different.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 34 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Basic Concepts in Logic
Basic Concepts in Set Theory
The Natural Numbers and Mathematical Induction
The Rational Numbers
Roots, Bounds & Real Numbers
Complex Numbers
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 35 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Naive Set Theory: Sets via Predicates
We want to be able to talk about “collections of objects”; however, we willbe unable to strictly define what an “object” or a “collection” is (exceptthat we also want any collection to qualify as an “object”). The problemwith “naive” set theory is that any attempt to make a formal definitionwill lead to a contradiction - we will see an example of this later. However,for our practical purposes we can live with this, as we won’t generallyencounter these contradictions.We indicate that an object (called an element) x is part of a collection(called a set) X by writing x ∈ X . We characterize the elements of a setX by some predicate P :
x ∈ X ⇔ P(x). (1.2.1)
We write such a set X in the form X = x : P(x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 36 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Notation for SetsWe define the empty set ∅ := x : x = x. The empty set has noelements, because the predicate x = x is never true.We may also use the notation X = x1, x2, ... , xn to denote a set. In thiscase, X is understood to be the set
X = x : (x = x1) ∨ (x = x2) ∨ · · · ∨ (x = xn).
We will frequently use the convention
x ∈ A : P(x) = x : x ∈ A ∧ P(x)
1.2.1. Example. The set of even positive integers is
n ∈ N : ∃k∈Z
n = 2k
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 37 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Subsets and Equality of Sets
If every object x ∈ X is also an element of a set Y , we say that X is asubset of Y , writing X ⊂ Y ; in other words,
X ⊂ Y ⇔ ∀x ∈ X : x ∈ Y .
We say that X = Y if and only if X ⊂ Y and Y ⊂ X .We say that X is a proper subset of Y if X ⊂ Y but X = Y . In that casewe write X ⫋ Y .Some authors write ⊆ for ⊂ and ⊂ for ⫋. Pay attention to the conventionused when referring to literature.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 38 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Examples of Sets and Subsets1.2.2. Examples.
1. For any set X , ∅ ⊂ X . Since ∅ does not contain any elements, thedomain of the statement ∀x ∈ X : x ∈ Y is empty. Therefore, it isvacuously true and hence ∅ ⊂ X .
2. Consider the set A = a, b, c where a, b, c are arbitrary objects, forexample, numbers. The set
B = a, b, a, b, c , c
is equal to A, because it satisfies A ⊂ B and B ⊂ A as follows:
x ∈ A ⇔ (x = a) ∨ (x = b) ∨ (x = c) ⇔ x ∈ B.
Therefore, neither order nor repetition of the elements affects thecontents of a set.If C = a, b, then C ⊂ A and in fact C ⫋ A. Setting D = b, c wehave D ⫋ A but C ⊂ D and D ⊂ C .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 39 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Power Set and CardinalityIf a set X has a finite number of elements, we define the cardinality of Xto be this number, denoted by #X , |X | or cardX .We define the power set
P(M) := A : A ⊂ M.
Here the elements of the set P(M) are themselves sets; P(M) is the “setof all subsets of M.” Therefore, the statements
A ⊂ M and A ∈ P(M)
are equivalent.1.2.3. Example. The power set of a, b, c is
P(a, b, c) =∅, a, b, c, a, b, b, c, a, c, a, b, c
.
The cardinality of a, b, c is 3, the cardinality of the power set is|P(a, b, c)| = 8.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 40 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Operations on SetsIf A = x : P1(x), B = x : P2(x) we define the union, intersection anddifference of A and B by
A ∪ B := x : P1(x) ∨ P2(x), A ∩ B := x : P1(x) ∧ P2(x),A \ B := x : P1(x) ∧ (¬P2(x)).
Let A ⊂ M. We then define the complement of A by
Ac := M \ A.
If A ∩ B = ∅, we say that the sets A and B are disjoint.Occasionally, the notation A− B is used for A \ B and Ac is sometimesdenoted by A.1.2.4. Example. Let A = a, b, c and B = c , d. Then
A ∪ B = a, b, c , d, A ∩ B = c, A \ B = a, b.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 41 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Operations on SetsThe laws for logical equivalencies immediately lead to several rules for setoperations. For example, de Morgan’s laws (see Example 1.1.11) for ∧ and∨ imply
(A ∪ B)c = Ac ∩ Bc, (A ∩ B)c = Ac ∪ Bc,
Other such rules are, for example, A ∩ (B ∪ C ) = (A ∩ B) ∪ (A ∩ C )
A ∪ (B ∩ C ) = (A ∪ B) ∩ (A ∪ C )
(A ∪ B) \ C = (A \ C ) ∪ (B \ C )
(A ∩ B) \ C = (A \ C ) ∩ (B \ C )
A \ (B ∪ C ) = (A \ B) ∩ (A \ C )
A \ (B ∩ C ) = (A \ B) ∪ (A \ C )
Some of these will be proved in the recitation class and the assignments.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 42 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Operations on SetsOccasionally we will need the following notation for the union andintersection of a finite number n ∈ N of sets:
n∪k=0
Ak := A0 ∪ A1 ∪ A2 ∪ · · · ∪ An,
n∩k=0
Ak := A0 ∩ A1 ∩ A2 ∩ · · · ∩ An.
This notation even extends to n =∞, but needs to be properly defined:
x ∈∞∪k=0
Ak :⇔ ∃k∈N
x ∈ Ak ,
x ∈∞∩k=0
Ak :⇔ ∀k∈N
x ∈ Ak .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 43 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Operations on SetsIn particular,
∞∩k=0
Ak ⊂∞∪k=0
Ak .
1.2.5. Example. Let Ak = 0, 1, 2, ... , k for k ∈ N. Then∞∪k=0
Ak = N,∞∩k=0
Ak = 0.
To see the first statement, note that N ⊂∪∞
k=0 Ak since x ∈ N impliesx ∈ Ax implies x ∈
∪∞k=0 Ak . Furthermore,
∪∞k=0 Ak ⊂ N since
x ∈∪∞
k=0 Ak implies x ∈ Ak for some k ∈ N implies x ∈ N.For the second statement, note that
∩∞k=0 Ak ⊂ N. Now 0 ∈ Ak for all
k ∈ N. Thus 0 ⊂∩∞
k=0 Ak . On the other hand, for any x ∈ N \ 0 wehave x /∈ Ax−1 whence x /∈
∩∞k=0 Ak .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 44 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Ordered PairsReferences Loomis/Sternberg, Advanced Calculus, Chapter 0.6.See also Spivak, Calculus, Appendix to Chapter 3.
A set does not contain any information about the order of its elements,e.g.,
a, b = b, a.
Thus, there is no such a thing as the “first element of a set”. However,sometimes it is convenient or necessary to have such an ordering. This isachieved by defining an ordered pair, denoted by
(a, b)
and having the property that
(a, b) = (c , d) ⇔ (a = c) ∧ (b = d).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 45 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Cartesian Product of SetsThere are (at least) two ways of defining an ordered pair as a set:
(i) (a, b) := a, a, b or(ii) (a, b) := 1, a, 2, b.
The first definition does not need the natural numbers and uses only settheory, but both are of course equivalent.If A,B are sets and a ∈ A, b ∈ B, then we denote the set of all orderedpairs by
A× B := (a, b) : a ∈ A, b ∈ B.A× B is called the cartesian product of A and B.
In this manner we can also define an ordered triple (a, b, c) or, moregenerally, an ordered n-tuple (a1, ... , an) and the n-fold cartesian productA1 × · · · × An of sets Ak , k = 1, ... , n.If we take the cartesian product of a set with itself, we may abbreviate itusing exponents, e.g.,
N2 := N× N.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 46 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Problems in Naive Set TheoryIf one simply views sets as arbitrary “collections” and allows a set tocontain arbitrary objects, including other sets, then fundamental problemsarise. We first illustrate this by an analogy:Suppose a library contains not only books but also catalogs of books, i.e.,books listing other books. For example, there might be a catalog listing allmathematics books in the library, a catalog listing all history books, etc.Suppose that there are so many catalogs, that you are asked to createcatalogs of catalogs, i.e., catalogs listing other catalogs. In particular, youare asked to create the following:
(i) A catalog of all catalogs in the library. This catalog lists all catalogscontained in the library, so it must of course also list itself.
(ii) A catalog of all catalogs that list themselves. Does this catalog alsolist itself?
(iii) A catalog of all catalogs that do not list themselves. Does thiscatalog also list itself?
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 47 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
The Russel Antinomy
In the previous analogy, we can view “catalogs” as “sets” and being “listedin a catalog” as “being an element of a set”. Then we have
(i) The set of all sets must have itself as an element.(ii) The set of all sets that have themselves as elements may or may not
contain itself. (This may be decidable by adding some rule to settheory.)
(iii) It is not decidable whether the “set of all sets that do not havethemselves as elements” has itself as an element.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 48 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
The Russel Antinomy
Formally, this paradox is known as the Russel antinomy:
1.2.6. Russel Antinomy. The predicate P(x) : x /∈ x does not define a setA = x : P(x).
Proof.If A = x : x /∈ x were a set, then we should be able to decide for any sety whether y ∈ A or y /∈ A. We show that for y = A this is not possiblebecause either assumption leads to a contradiction:
(i) Assume A ∈ A. Then P(A) by (1.2.1), i.e., A /∈ A. (ii) Assume A /∈ A. Then ¬P(A) by (1.2.1), therefore A ∈ A.
Since we cannot decide whether A ∈ A or A /∈ A, A can not be a set.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 49 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
The Russel Antinomy
There are several examples in classical literature and philosophy of theRussel antimony and other“paradoxes”:
1. A person says: “This sentence is a lie.” Is he lying or telling thetruth? (An example of a sentence which is not a logical statement,since a truth value can not be assigned to it.)
2. In a mountain village, there is a barber. Some villagers shavethemselves (always) while the others never shave themselves. Thebarber shaves those and only those villagers that never shavethemselves. Who shaves the barber? (An illustration of the Russelantimony.)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 50 / 624
Basic Tools and Definitions Basic Concepts in Set Theory
Russel Antinomy
We will simply ignore the existence of such contradictions and build onnaive set theory. There are further paradoxes (antinomies) in naive settheory, such as Cantor’s paradox and the Burali-Forti paradox. All of theseare resolved if naive set theory is replaced by a modern axiomatic settheory such as Zermelo-Fraenkel set theory.
Further Information:Set Theory, Stanford Encyclopedia of Philosophy,http://plato.stanford.edu/entries/set-theory/.T. Jech, Set Theory: The Third Millennium Edition, Revised andExpanded, Reprinted by World Publishing Corporation Beijing, ¥78.00元
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 51 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Basic Concepts in Logic
Basic Concepts in Set Theory
The Natural Numbers and Mathematical Induction
The Rational Numbers
Roots, Bounds & Real Numbers
Complex Numbers
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 52 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
The Natural Numbers
We denote the set of natural numbers by N, i.e.,
N = 0, 1, 2, 3, 4, ....
The natural numbers can be constructed rigorously from set theory, butwe will not do so here.We assume that the familiar operations of addition and multiplication areknown for the natural numbers, i.e., for any two natural numbers a, b ∈ Nwe can define the natural number c = a+ b ∈ N called the sum of a andb. The addition has the following properties (here a, b, c ∈ N):
1. a+ (b + c) = (a+ b) + c (Associativity)2. a+ 0 = 0 + a = a (Existence of a neutral element)3. a+ b = b + a (Commutativity)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 53 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Multiplication
Similarly, we can define multiplication, where a · b ∈ N is called theproduct of a and b. We have the following properties (here a, b, c ∈ N):
1. a · (b · c) = (a · b) · c (Associativity)2. a · 1 = 1 · a = a (Existence of a neutral element)3. a · b = b · a (Commutativity)
We also have a property that essentially states that addition andmultiplication are consistent,
a · (b + c) = a · b + a · c . (Distributivity)
Note that we are not able to define subtraction or division for all naturalnumbers.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 54 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Notation for Addition and Multiplication
For any numbers a1, a2, ... , an we define the notation
a1 + a2 + · · ·+ an =:n∑
j=1
aj =:∑
1≤j≤n
aj
anda1 · a2 · · · an =:
n∏j=1
aj =:∏
1≤j≤n
aj .
(We will use this notation for rational and real numbers ak ; see thefollowing sections.) For n ∈ N we define
0! := 1 and n! := n · (n − 1)! for n > 0.
This is an example of a recursive definition.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 55 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Powers and OrderingWe are also able to define the exponential, “intuitively” written as
ab = a · a · ... · a︸ ︷︷ ︸b times
for a, b ∈ N
by setting a0 := 1 and an := a · an−1 (another recursive definition). Wenote that
ab+c = ab · ac and (ab)c = ab·c .
We can imbue the natural numbers with an ordering, denoted by <. Wesay that a < b if there exists some c ∈ N∗ := N \ 0 such that b = a+ c.We write a > b if b < a.We define the symbol ≤ to mean “< or =”, so
a ≤ b ⇔(a < b ∨ a = b
).
We write a ≥ b if b ≤ a.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 56 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Mathematical InductionReference Spivak, Chapter 2.
Often one wants to show that some statement frame A(n) is true for alln ∈ N with n ≥ n0 for some n0 ∈ N. Mathematical induction works byestablishing two statements:
(I) A(n0) is true.(II) A(n + 1) is true whenever A(n) is true for n ≥ n0, i.e.,
∀n∈Nn≥n0
(A(n)⇒ A(n + 1)
)Note that (II) does not make a statement on the situation when A(n) isfalse; it is permitted for A(n + 1) to be true even if A(n) is false.The principle of mathematical induction now claims that A(n) is true forall n ≥ n0 if (I) and (II) are true. This follows from the fifth Peano axiom(the induction axiom).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 57 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Binomial FormulaIn practice, one first establishes A(n0), then shows that A(n + 1) is true,assuming that A(n) holds.
As an example, we will prove the binomial formula:
A(n) : (a+ b)n =n∑
k=0
(n
k
)akbn−k ,
where a, b ∈ R, n ∈ N and(n
k
):=
n!
k!(n − k)!, n! :=
n∏k=1
k = 1 · 2 · 3 · ... · n, 0! := 1.
We will need the relation(n + 1
k
)=
(n
k − 1
)+
(n
k
). (1.3.1)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 58 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Binomial Formula
We start by showing that A(0) is true. We see that
(a+ b)0 = 1, and0∑
k=0
(0
k
)akb0−k =
(0
0
)a0b0−0 = 1.
Hence A(0) is true, since both sides of the equation are equal to unity.
We now consider A(n + 1):
(a+ b)n+1 = (a+ b)(a+ b)n = (a+ b)n∑
k=0
(n
k
)akbn−k (1.3.2)
We have inserted the statement A(n) in (1.3.2), since we want to showthat A(n + 1) is true under the condition that A(n) is true.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 59 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Binomial Formula
Applying some algebraic manipulations, we obtain
(a+ b)n+1 = (a+ b)n∑
k=0
(n
k
)akbn−k
=n∑
k=0
(n
k
)ak+1bn−k +
n∑k=0
(n
k
)akbn+1−k
=n+1∑j=1
(n
j − 1
)ajbn+1−j +
n∑k=0
(n
k
)akbn+1−k
where we have set j = k + 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 60 / 624
Basic Tools and Definitions The Natural Numbers and Mathematical Induction
Binomial FormulaWe can now rename the index back to k and split off some terms of thesums:
(a+ b)n+1 = an+1 +n∑
k=1
(n
k − 1
)akbn+1−k + bn+1 +
n∑k=1
(n
k
)akbn+1−k
= an+1 + bn+1 +n∑
k=1
((n
k − 1
)+
(n
k
))akbn+1−k
Now we apply (1.3.1):
(a+ b)n+1 = an+1 + bn+1 +n∑
k=1
(n + 1
k
)akbn+1−k
=n+1∑k=0
(n + 1
k
)akbn+1−k ,
which is just A(n + 1). Hence the binomial formula holds for all n ∈ N.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 61 / 624
Basic Tools and Definitions The Rational Numbers
Basic Concepts in Logic
Basic Concepts in Set Theory
The Natural Numbers and Mathematical Induction
The Rational Numbers
Roots, Bounds & Real Numbers
Complex Numbers
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 62 / 624
Basic Tools and Definitions The Rational Numbers
Basic Properties of Numbers - AdditionReference Spivak, Chapter 1.
We assume that the set of rational numbers (fractions) Q is known andhas been introduced from set theory. (This procedure will be discussed indetail in the course Ve203 Discrete Mathematics). We recapitulate thebasic properties of rational numbers, using the notation of Spivak’s book.For addition we have
associativity ∀a,b,c∈Q
a+ (b + c) = (a+ b) + c (P1)
neutral element ∃0∈Q
∀a∈Q
a+ 0 = 0 + a = a (P2)
inverse element ∀a∈Q
∃−a∈Q
(−a) + a = a+ (−a) = 0 (P3)
commutativity ∀a,b∈Q
a+ b = b + a. (P4)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 63 / 624
Basic Tools and Definitions The Rational Numbers
Multiplication
For multiplication, we have a similar set of properties
associativity ∀a,b,c∈Q
a · (b · c) = (a · b) · c, (P5)
neutral element ∃1∈Q1 =0
∀a∈Q
a · 1 = 1 · a = a, (P6)
inverse element ∀a∈Qa =0
∃a−1∈Q
a · a−1 = a−1 · a = 1, (P7)
commutativity ∀a,b∈Q
a · b = b · a. (P8)
With these properties we can prove that if a = 0 and a · b = a · c , thenb = c . Furthermore,if a · b = 0, then either a = 0 or b = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 64 / 624
Basic Tools and Definitions The Rational Numbers
Combining Addition and Multiplication
These twice four properties for addition and multiplication are basicallyindependent of each other (the only connection was that 0 = 1). Thefollowing property ensures that the two operations “work well together”,
distributivity ∀a,b,c∈Q
a · (b + c) = a · b + a · c . (P9)
It is this relation that allows us to prove that a− b = b − a only if a = b,and it is also the law of distributivity that we employ to do elementarymultiplications.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 65 / 624
Basic Tools and Definitions The Rational Numbers
The Positive Rational NumbersWe assume that we know what a strictly positive rational number is, i.e.,that there exists a set P with the property that for every number a oneand only one of the following holds:
1. a = 0,2. a ∈ P,3. −a ∈ P .
This is known as the trichotomy law (P10) for the rational numbers. Wealso assume that the set of positive numbers is closed under addition andmultiplication:
If a and b are in P , then a+ b is in P , (P11)If a and b are in P, then a · b is in P . (P12)
We then writeP = p ∈ Q : p > 0.
so that p > 0 is defined by p ∈ P.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 66 / 624
Basic Tools and Definitions The Rational Numbers
The Absolute ValueWe define the absolute value (sometimes called the modulus) |a| of arational number a by
|a| =
a if a ≥ 0,
−a if a < 0.An important basic result is the triangle equality:
1.4.1. Triangle Inequality. For all rational numbers a, b ∈ Q, we have
|a+ b| ≤ |a|+ |b|. (1.4.1)
Proof.We prove the triangle inequality by considering all possible cases for a andb according to the trichotomy law (P10):
a = 0, b ∈ Q. If a = 0 we have a+ b = b and |a| = a = 0, so|a|+ |b| = |b|. Both sides of (1.4.1) are then equal to |b|, so theinequality holds.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 67 / 624
Basic Tools and Definitions The Rational Numbers
The Triangle InequalityProof.We prove the triangle inequality by considering all possible cases for a andb according to the trichotomy law:
a > 0, b > 0. In this case, a+ b > 0 by (P11) so |a+ b| = a+ b.Furthermore, |a| = a and |b| = b, so both sides of (1.4.1) are equal toa+ b.
a < 0, b < 0. In this case, a+ b < 0 by (P11) (why?) so|a+ b| = −a− b. Furthermore, |a| = −a and |b| = −b, so both sidesof (1.4.1) are equal to −a− b.
a < 0, b > 0, a+ b ≥ 0. In this case, |a|+ |b| = b − a and|a+ b| = b + a. Since a < 0 and −a > 0 we have
|a+ b| = b + a < b + 0 = b < b + (−a) = b − a = |a|+ |b|
and the inequality holds.The case a < 0, b > 0, a+ b < 0 is left to the reader!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 68 / 624
Basic Tools and Definitions The Rational Numbers
The Triangle Inequality
As a corollary of the triangle inequality we obtain the reverse triangleinequality:
|a+ b| ≥∣∣|a| − |b|∣∣ for all a, b ∈ Q. (1.4.2)
For the proof, it is sufficient to apply the triangle inequality as follows:
|b| = |a+ b − a| ≤ |a+ b|+ |a|.
Rearranging gives|a+ b| ≥ |b| − |a|.
Interchanging a and b then yields (1.4.2).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 69 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Basic Concepts in Logic
Basic Concepts in Set Theory
The Natural Numbers and Mathematical Induction
The Rational Numbers
Roots, Bounds & Real Numbers
Complex Numbers
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 70 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
The Rational Numbers are Insufficient
While the rational numbers are closed with respect to addition andmultiplication, and furthermore are large enough to include “inverseelements” −a and a−1 for these operations, they are still “incomplete” inmore than one sense:
1. Algebraically: there is no inverse element for operation of squaring.Given b ∈ Q, it may not be possible to find a ∈ Q such that a2 = b.
2. Analytically: Given a sequence of rational numbers that come closerand closer to each other, there may not be a “limiting value” in Q.(We will discuss this later.)
3. Set theoretically: Given a bounded subset of Q, there may not be aleast upper or greatest lower bound for this set.
We will discuss i) and iii) in more detail now, and come back to ii) later,when we construct a suitable expansion of the rational numbers.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 71 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
The Square Root Problem
Reference Spivak, end of Chapter 2.
1.5.1. Theorem. There exists no number a ∈ Q such that a2 = 2.
Proof.Assume that a = p/q ∈ Q has the property that (p/q)2 = 2, wherep, q ∈ N. We further assume that p and q have no common divisor greaterthan 1. Then
p2 = 2q2,
so p2 is even. This implies that p is even, so p = 2k for some k ∈ N. Butthen
2q2 = 4k2 ⇒ q2 = 2k2 is even ⇒ q is even.
Thus p and q are both divisible by 2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 72 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Bounded Subsets of Q
Reference Spivak, beginning of Chapter 8.
1.5.2. Definition. We say that a set U ⊂ Q is bounded if there exists aconstant c ∈ Q such that
|x | ≤ c for all x ∈ U.
If this is not the case, we say that U is unbounded.Numbers c1 and c2 such that
c1 ≤ x ≤ c2 for all x ∈ U
are called lower and upper bounds for U, respectively.We say that a set U ∈ Q is bounded above if there exists an upper boundfor U, and bounded below if there exists a lower bound for U.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 73 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Bounded Subsets of Q
1.5.3. Examples. The set U = 1/n ∈ Q : n ∈ N∗ is bounded since |x | ≤ 1 for all
x ∈ U. Furthermore, the number −1 is a lower bound and 1 is anupper bound. Of course, there are many more upper and lowerbounds.
The set N ⊂ Q is bounded below (lower bound: 0), but not boundedabove. It is unbounded.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 74 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Maxima and Minima of Sets
1.5.4. Definition. Let U ⊂ Q be a subset of the rational numbers. We saythat a number x1 ∈ U is the minimum of U if
x1 ≤ x , for all x ∈ U
and we write x1 =: minU.Similarly, we say that x2 ∈ U is the maximum of U if
x2 ≥ x , for all x ∈ U
and write x2 =: maxU.A set that is not bounded below has no minimum, and a set that is notbounded above has no maximum. However, even a bounded set does notneed to have a maximum or a minimum.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 75 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Greatest Lower and Least Upper Bounds of Sets
1.5.5. Definition. We say that an upper bound c2 ∈ Q of a set U ⊂ Q isthe least upper bound of U if no c ∈ Q with c < c2 is an upper bound.We then write c2 =: supU.We say that a lower bound c1 ∈ Q is the greatest lower bound of U if noc ∈ Q with c > c1 is a lower bound. We then write c1 =: inf U.
1.5.6. Example. The set U = 1/n ∈ Q : n ∈ N∗ has
inf U = 0, supU = 1,
minU does not exist, maxU = 1.
In general, if maxU exists, supU = maxU, and if minU exists,inf U = minU.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 76 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Existence of Suprema and Infima
Not every bounded set in Q has an infimum or supremum: consider
U = x ∈ Q : 2 < x2 ≤ 4 ∧ x > 0
It is clear that supU = maxU = 2, but the minimum and even theinfimum do not exist (you will prove this in the assignments).On the other hand, if we add a postulate to our previous axioms, we canwiden the rational numbers to include solutions of x2 = 2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 77 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
The Real NumbersWe define the set of real numbers R as the smallest extension of therational numbers Q such that the following property holds:
(P13) If A ⊂ R, A = ∅ is bounded above, then there exists aleast upper bound for A in R.
We call all real numbers that are not rational irrational numbers.We will not immediately construct such a set of objects, but do so a littlelater in this lecture. For now, we stipulate that such a set exists, and wecall it the set of real numbers R. Furthermore, we can add and multiplyelements of R and in general all properties (P1)-(P12) remain valid forelements of R.Note that (P13) is sufficient to guarantee the existence of greatest lowerbounds; we need but consider −A = x ∈ R : − x ∈ A instead of A. Theleast upper bound of −A will yield the negative of the greatest lowerbound of A.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 78 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
The Square Root ProblemWe can now solve the square root problem:
1.5.7. Theorem. For every real number x > 0, there exists a uniquepositive real number y solving the equation y2 = x .
Proof.The proof is in two parts: existence and uniqueness, i.e., we showseparately that i) there exists a solution and ii) the solution is unique.Often, it is easier to show uniqueness than existence. Also, the uniquenessproof may yield helpful results that can be used for the (generally moredifficult) existence proof. Therefore, one often starts with proving theuniqueness.Assume that there are two numbers y1 > y2 > 0 such that y21 = y22 = x .Then
0 = y21 − y22 = (y1 − y2)︸ ︷︷ ︸>0
(y1 + y2)︸ ︷︷ ︸>0
> 0
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 79 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
The Square Root ProblemProof (continued).We now show the existence of a solution to the equation y2 = x . LetM := t ∈ R : t > 0 ∧ t2 > x. Then M is non-empty and bounded frombelow, so by (P13) it has a greatest lower bound y = infM. We willestablish that y2 = x by showing that y2 > x and y2 < x lead tocontradictions. This strategy essentially uses the trichotomy law (P10).Assume that y2 > x , so y > x
y . Then
0 <
(y − x
y
)2
= y2 − 2yx
y+
(x
y
)2
,
so thaty2 +
(x
y
)2
> 2yx
y= 2x .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 80 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
The Square Root ProblemProof (continued).Set s := 1
2(y + xy ). Then
s2 =1
4
(y2 + 2
x
yy +
(xy
)2)>
1
4(2x + 2x) = x
so s ∈ M. However,
s =1
2(y +
x
y) <
1
2(y + y) = y = infM
The proof that y2 < x leads to a contradiction is simpler and will bediscussed in the assignments.
Therefore, the real numbers include all square roots of positive numbers;we denote the positive solution to y2 = x by y =
√x . Similarly, we write
the positive solution to yn = x as y = n√x .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 81 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Subsets of the Real NumbersWe introduce some notation we will have occasion to use often later.1.5.8. Definition. Let a, b ∈ R with a < b. Then we define the followingspecial subsets of R, which we call intervals:
[a, b] := x ∈ R : (a ≤ x) ∧ (x ≤ b) = x ∈ R : a ≤ x ≤ b,[a, b) := x ∈ R : (a ≤ x) ∧ (x < b) = x ∈ R : a ≤ x < b,(a, b] := x ∈ R : (a < x) ∧ (x ≤ b) = x ∈ R : a < x ≤ b,(a, b) := x ∈ R : (a < x) ∧ (x < b) = x ∈ R : a < x < b.
Furthermore, for any a ∈ R we set
[a,∞) := x ∈ R : x ≥ a, (a,∞) := x ∈ R : x > a,(−∞, a] := x ∈ R : x ≤ a), (−∞, a) := x ∈ R : x < a
Finally, we set (−∞,∞) := R.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 82 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Subsets of the Real Numbers
1.5.9. Definition. We call an interval I open if I = (a, b), a ∈ R or a = −∞, b ∈ R or b =∞; closed if I = [a, b], a, b ∈ R; half-open if is neither open or closed.
We often denote intervals by capital letters I or J. If we say “Let I be anopen interval…” we mean “Let I ⊂ R be a set such that there existnumbers a, b ∈ R such that I = (a, b)…”
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 83 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Subsets of the Real NumbersMore generally, we have the following classification of points with respectto a set:1.5.10. Definition. Let A ⊂ R.
(i) We call x ∈ R an interior point of A if there exists some ε > 0 suchthat the interval (x − ε, x + ε) ⊂ A. The set of interior points of A isdenoted by intA.
(ii) We call x ∈ R an exterior point of A if there exists some ε > 0 suchthat the interval (x − ε, x + ε) ∩ A = ∅.
(iii) We call x ∈ R a boundary point of A if for every ε > 0(x − ε, x + ε) ∩ A = ∅ and (x − ε, x + ε) ∩ Ac = ∅. The set ofboundary points of A is denoted by ∂A.
(iv) We call x ∈ R an accumulation point of A if for every ε > 0 theinterval (x − ε, x + ε) ∩ A \ x = ∅.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 84 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Subsets of the Real Numbers
1.5.11. Example. For the interval A = [0, 1), intA = (0, 1), Any x ∈ R \ [0, 1] is an exterior point, ∂A = 0, 1, Any x ∈ [0, 1] is an accumulation point.
1.5.12. Example. For the set A =x ∈ R : x = 1
n , n ∈ N \ 0
, intA = ∅, Any x ∈ R \ (A ∪ 0) is an exterior point, ∂A = A ∪ 0, Only x = 0 is an accumulation point.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 85 / 624
Basic Tools and Definitions Roots, Bounds & Real Numbers
Subsets of the Real Numbers
We can hence define general open and closed sets:1.5.13. Definition.
(i) A set A ⊂ R is called open if all points of A are interior points, i.e., if
A = intA.
(ii) A set A ⊂ R is called closed if R \ A is open.(iii) The set
A := A ∪ ∂A
is called the closure of A. It is the smallest closed set that contains A.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 86 / 624
Basic Tools and Definitions Complex Numbers
Basic Concepts in Logic
Basic Concepts in Set Theory
The Natural Numbers and Mathematical Induction
The Rational Numbers
Roots, Bounds & Real Numbers
Complex Numbers
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 87 / 624
Basic Tools and Definitions Complex Numbers
The Complex NumbersReference Spivak, Chapter 24.
Although we have added the square roots (solutions to y2 − x = 0, x > 0)to the rational numbers, and also many other elements through theaddition of (P13), we have still not quite solved the problem of solutionsof quadratic equations: there is no solution y to y2 + 1 = 0 in the realnumbers. (This is expressed by saying that R is algebraically incomplete.)We again need to extend the real numbers to include solutions toequations such as y2 = −1. We follow the familiar procedure ofconsidering pairs of real numbers and define the complex numbers C as
C = (a, b) : a, b ∈ R = R2
From the point of view of sets, there is no difference between C and R2.We consider the real numbers R as a subset of C, writing (x , 0) ∈ C forx ∈ R.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 88 / 624
Basic Tools and Definitions Complex Numbers
The Complex Numbers
We define addition in C through
(a1, b1) + (a2, b2) := (a1 + a2, b1 + b2),
so in particular the sum of two real numbers is again a real number.We next define multiplication in an apparently strange way:
(a1, b1) · (a2, b2) := (a1a2 − b1b2, a1b2 + a2b1).
However, it turns out that this multiplication satisfies the axioms(P5)-(P8), and also (P9) together with addition. The neutral element is(1, 0) (as expected), but the inverse element is a bit more complicated.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 89 / 624
Basic Tools and Definitions Complex Numbers
The Complex NumbersFor z = (a, b) ∈ C \ 0 the multiplicative inverse is given by
z−1 =
(a
a2 + b2,−b
a2 + b2
),
i.e., zz−1 = (1, 0).With the previously defined multiplication we can find a solution to theequation z2 = −1:
(0, 1)2 = (0, 1) · (0, 1) = (0 · 0− 1 · 1, 0 · 1 + 0 · 1) = (−1, 0).
In fact, it is possible to prove that C is algebraically closed:
1.6.1. Theorem. Every polynomial equation of the form∑n
k=0 akxk = 0,
where a0, ... , an are fixed complex numbers, has precisely n complexsolutions x1, ... , xn ∈ C.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 90 / 624
Basic Tools and Definitions Complex Numbers
The Complex NumbersWe further define multiplication of complex numbers by real numbers inthe following way:
λ · (a, b) = (λa,λb), λ ∈ R, (a, b) ∈ C.
Note that is multiplication together with addition in C is distributive.We can then split any complex number into two components:
(a, b) = (a, 0) + (0, b) = a · (1, 0) + b · (0, 1).
The pair (1, 0) ∈ C corresponds to 1 ∈ R, while the pair (0, 1) ∈ C is oftendenoted by the letter i . Hence we usually write
C ∋ z = (a, b) = a · 1 + b · i = a+ bi , a, b ∈ R,
where a =: Re z is called the real part and b =: Im z the imaginary part ofa complex number z = a+ bi .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 91 / 624
Basic Tools and Definitions Complex Numbers
Visualizing Real Numbers
We visualize real numbers by drawing a single line, with an arrow pointingto the right, in the direction of increasing positive numbers:
-4 -3 -2 -1 0 1 2 3 4
We will identify the set of real numbers with this line so that a single realnumber corresponds to a “point” on the line. This is of courseproblematic; in how far this is justified, which assumptions enter into thisidentification and which precise axioms need to be stated falls into thepurview of analytic geometry. We will not discuss this further in thislecture, but simply assume a correspondence between points on a line andthe real numbers.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 92 / 624
Basic Tools and Definitions Complex Numbers
Visualizing Complex NumbersWe visualize complex numbers by drawing two perpendicular lines(“axes”), with arrows pointing to the right and upward. each representingthe real numbers. A complex number z = a+ bi = Re z + (Im z) · i isrepresented by a point in the plane whose orthogonal projection onto thehorizontal axis gives Re z , and whose orthogonal projection onto thevertical axis gives Im z .
-4 -3 -2 -1 1 2 3 4Re z
-2
-1
1
2
3
4
Im z
z = Re z + ä Im z
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 93 / 624
Basic Tools and Definitions Complex Numbers
The Complex NumbersWe note that we have no ordering relation on C; it makes no sense to saythat one complex number is bigger or smaller than the other! Therefore,properties (P10)-(P12) do not hold for a complex number.
1.6.2. Notation. Whenever we write x > 0, we automatically assume thatx ∈ R.
1.6.3. Definition. We define the modulus or absolute value of a complexnumber z = a+ bi by
|z | =√
a2 + b2 =√z · z .
Here z = a− bi is called the complex conjugate of z .This definition of the absolute value is called an extension of the absolutevalue from the real numbers to the complex numbers.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 94 / 624
Basic Tools and Definitions Complex Numbers
The Complex NumbersThe complex modulus has a geometric interpretation as distance:
-4 -3 -2 -1 1 2 3 4Re z
-2
-1
1
2
3
4
Im z
z = Re z + ä Im z
ÈzÈ = HRe zL2 + HIm zL2
In fact, the geometric distance between two complex numbers z1 and z2can is
ϱ(z1, z2) = |z1 − z2|.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 95 / 624
Basic Tools and Definitions Complex Numbers
The Complex Numbers
Although we cannot define the ordering < on the complex numbers, wecan nevertheless define the concept of bounded sets in C.
1.6.4. Definition. Let z0 ∈ C. Then we define the open ball of radiusR > 0 centered at z0 by
BR(z0) := z ∈ C : |z − z0| < R.
We say that a set Ω ⊂ C is bounded if there exists some R > 0 such thatΩ ⊂ BR(0).Of course, it makes no sense to talk of upper or lower bounds of sets.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 96 / 624
Basic Tools and Definitions Complex Numbers
The Complex Numbers
The open ball of radius R about z0 ∈ C has a geometric interpretation asa neighbourhood:
-4 -3 -2 -1 1 2 3 4Re z
-2
-1
1
2
3
4
Im z
z0
R
BR Iz0M
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 97 / 624
Real Functions, Convergence and Continuity
Part II
Real Functions, Convergence and Continuity
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 98 / 624
Real Functions, Convergence and Continuity
Functions and Maps
Sequences
Real Functions
Limits of Functions
Continuous Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 99 / 624
Real Functions, Convergence and Continuity Functions and Maps
Functions and Maps
Sequences
Real Functions
Limits of Functions
Continuous Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 100 / 624
Real Functions, Convergence and Continuity Functions and Maps
FunctionsReference Spivak, Calculus, Chapter 3.Our goal is to put the concept of a function on a sound mathematicalfooting. From school, we imagine a function to be an assignment ofvalues, e.g.,
x 7→ x2
where to every number x we associate the square of this number. Thisconcept of a function is very active: we take a number x (input) andreturn a number x2 (output). We are doing something with x . Of course,we have to specify which numbers x are acceptable as input (we call thisthe domain of the function) and in what set the output lies (this set issometimes called the codomain).We also need to make sure that every input value yields exactly one outputvalue. In summary, a function should be an object to which we associate
(i) a domain and co-domain(ii) a unique assignment
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 101 / 624
Real Functions, Convergence and Continuity Functions and Maps
DefinitionNote that the above is mostly a “wishlist” - we still have no properdefinition of what a function actually is. It turns out that defining a“unique assignment” is very difficult in mathematics. The idea of “doingsomething” does not lend itself to contradiction-free definitions. Inmodern mathematics, active concepts are always replaced by passive orstatic concepts that do not involve “actions”. The definition of a functionis the first, basic example of this.In the following definition, we will replace the idea of an assignment withthe idea of a set. Essentially, the assignment
x 7→ x2
is replaced with the set of all pairs
(x , x2).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 102 / 624
Real Functions, Convergence and Continuity Functions and Maps
Definition2.1.1. Definition. Let X and Y be sets and let P be a predicate withdomain X × Y . Let
f := (x , y) ∈ X × Y : P(x , y)
and assume that P has the property that
∀(x1,y1)∈f
∀(x2,y2)∈f
x1 = x2 ⇒ y1 = y2. (2.1.1)
Letdom f :=
x ∈ X : ∃
y∈Y: (x , y) ∈ f
Then we say that f is a function from dom f to Y . The set dom f ⊂ X iscalled the domain of f and Y is called the codomain of f . We also definethe range of f by
ran f :=y ∈ Y : ∃
x∈X: (x , y) ∈ f
.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 103 / 624
Real Functions, Convergence and Continuity Functions and Maps
Examples2.1.2. Example. Let X ,Y = Z and P(x , y) : x2 = y . Then
f = (x , y) ∈ Z2 : y = x2 = (0, 0), (−1, 1), (1, 1), (−2, 4), (2, 4), ....
The function f has domain
dom f =x ∈ Z : ∃
y∈Z: y = x2
= Z
and the range satisfies
ran f =y ∈ Z : ∃
x∈Z: y = x2
⊂ N.
By contrast, the predicate Q(x , y) : y2 = x does not give rise to a functionfrom Z to Z because
g = (x , y) ∈ Z2 : y2 = x = (0, 0), (1,−1), (1, 1), (4, 2), (4,−2), ...
does not satisfy (2.1.1): (4, 2) and (4,−2) are both in g , but 2 = −2.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 104 / 624
Real Functions, Convergence and Continuity Functions and Maps
Functions as AssignmentsThe previous example illustrates how our “wishlist” for the properties offunctions is fulfilled by Definition 2.1.1. Having a formal definition is goingto be useful whenever we need to check whether an object actually is afunction or not.In practice, however, we will still think of a function as an assignment(albeit implemented technically through a static set of pairs). In theexpression
f = (x , y) ∈ X × Y : P(x , y)
the predicate P is not arbitrary, but for every x ∈ dom f there must exist aunique value y for which P(x , y) is true. We denote this value by f (x)(here the letter f is used in a slightly different way; it does not denote aset), i.e., we write y = f (x). (Thus, P(x , f (x)) is a tautology.)We can think of a function as an assignment of f (x) ∈ Y to certain x ∈ Xsuch that P(x , f (x)) is a tautology, where we are given some sets X ,Yand a predicate with domain X × Y .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 105 / 624
Real Functions, Convergence and Continuity Functions and Maps
Notation
The idea of a function as an assignment, while mathematically difficult tohandle, is very intuitive, and in calculus we mostly think of functions inthis way. Therefore, we use the following notation to denote a function fwith domain Ω ⊂ X and range contained in Y :
f : Ω → Y , x 7→ f (x).
The above is the traditional notation in calculus for a function f . Itcontains all necessary information: the domain of f and the assignmentwhich makes P(x , f (x)) a tautology. (Note the different shapes of thearrows; they are also traditional.) An alternative notation is
f : Ω → Y , y = f (x).
Another word for function is also map or mapping.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 106 / 624
Real Functions, Convergence and Continuity Functions and Maps
Notation
2.1.3. Example. Instead of writing
f = (x , y) ∈ Z2 : y = x2
we write
f : Z→ Z, x 7→ x2
or
f : Z→ Z, f (x) = x2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 107 / 624
Real Functions, Convergence and Continuity Functions and Maps
Compositions of Functions
If X ,Y ,Z are sets, Ω ⊂ X , Σ ⊂ Y , and two maps f : Ω → Y , g : Σ → Zare given such that ran f ⊂ Σ = dom g , then we define the composition
g f : Ω → Z , x 7→ g(f (x)).
2.1.4. Example. Consider the function f : Z→ N, f (x) = 1 + x2 andg : Z \ 0 → Q, g(x) = 1/x . Then ran f ⊂ N \ 0 ⊂ dom g , so we candefine the composition
g f : Z→ Q, x 7→ 1
1 + x2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 108 / 624
Real Functions, Convergence and Continuity Sequences
Functions and Maps
Sequences
Real Functions
Limits of Functions
Continuous Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 109 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - DefinitionsReference Spivak, Calculus, Chapter 22.
We will first discuss a relatively elementary type of map, the so-called(infinite) sequences. A sequence is simply a map from the natural to thereal or complex numbers. In fact, we define
2.2.1. Definition. Let Ω ⊂ N. A map Ω → R is called a real sequence, amap Ω → C a complex sequence.
2.2.2. Notation. We denote the values of a sequence by an, i.e., a sequencemaps n 7→ an, n ∈ Ω ⊂ N. Instead of (n, an) : n ∈ Ω we denote asequence by any of the following,
(an)n∈Ω = (an) = a0, a1, a2, ... .
The values an are called terms of the sequence.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 110 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Examples
2.2.3. Examples. The first terms of the sequence of even numbers, (an) = (2n) are
(an) = 0, 2, 4, 6, 8, 10, ...
The definition an = 2n is called an explicit definition of the sequence. The Fibonacci sequence is defined by a0 = a1 = 1, an = an−1 + an−2.
This is called a recursive definition. A sequence that takes on only two values in an alternating manner is
called an alternatng sequence, e.g. an = (−1)n. The above are all integer sequences since their values are integers. A
non-integer sequence is, for example, (an)n∈N\0 = (1/n)n∈N\0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 111 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Convergence2.2.4. Definition. The real or complex sequence (an) : Ω → X , Ω ⊂ N,X = R or C, is said to converge with limit a ∈ X if
∀ε>0
∃N∈N
∀n>N
|an − a| < ε.
We then write “ limn→∞
an = a” or “an → a as n→∞.”
A sequence that does not converge (to any limit) is called divergent.
2.2.5. Remark. Definition 2.2.4 can alternatively be formulated as
limn→∞
an = a :⇔ ∀ε>0
∃N∈N
∀n>N
an ∈ Bε(a). (2.2.1)
where
Bε(a) = z ∈ X : |a− z | < ε, ε > 0, a ∈ X ,
for X = R or C is called an ε-neighbourhood of a. We hence say, “Forsufficiently large n, an is arbitrarily close to a.”
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 112 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Examples
2.2.6. Theorem. The sequence (an), an = 1n , n ∈ N \ 0, converges
towards a = 0.
Proof.We need to show that for any ε > 0 we can find an N ∈ N such that forall n > N we have
|an − a| =∣∣∣∣1n − 0
∣∣∣∣ = 1
n< ε.
We note that 1n < ε⇔ n > 1
ε . Therefore, given ε > 0, we choose anyN ∈ N with N > 1/ε. Then for all n > N we have n > 1/ε and hence1/n < ε. Thus |an − 0| = an < ε for all n > N, so by the definition ofconvergence we have
limn→∞
1
n= 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 113 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Examples
2.2.7. Example. The sequence (an), an = 2n + 1, n ∈ N, does notconverge.
Proof.Let a be any real number. We will show that an → a. This is equivalent toshowing that
∃ε>0
∀N∈N
∃n>N
|an − a| ≥ ε. (2.2.2)
We will take ε = 1 and show that for all N ∈ N there exists an n > N suchthat |an − a| ≥ 1. Let N be fixed and choose n > maxN, |a|. Then
|an − a| ≥∣∣|an| − |a|∣∣ = ∣∣2n + 1− |a|
∣∣ ≥ ∣∣1 + |a|∣∣ ≥ ε,which is what we wanted to show.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 114 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Divergence to Infinity
2.2.8. Definition. A real sequence (an) is called divergent to infinity if
∀C>0
∃N∈N
∀n>N
an > C .
It is called divergent to minus infinity if
∀C<0
∃N∈N
∀n>N
an < C .
2.2.9. Notation. We write “ limn→∞
an =∞” or “an →∞ as n→∞” in thefirst case, “ lim
n→∞an = −∞” or “an → −∞ as n→∞” in the second case.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 115 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Examples
2.2.10. Examples. The sequence (an), an = 2n + 1 diverges to infinity. Any sequence that diverges to infinity or minus infinity does not
converge. The sequence (an), an = (−1)n does not converge and does not
diverge to infinity or minus infinity.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 116 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - General Results
2.2.11. Definition. A sequence is bounded if its range is a bounded set.The range of a sequence is of course
ran(an) = an : n ∈ N.
Since
supa0, a1, ... ≤ supn∈N|an| and |infa0, a1, ...| ≤ sup
n∈N|an|,
it follows that a sequence is bounded if and only if supn∈N|an| <∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 117 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - General Results
2.2.12. Lemma. A convergent sequence is bounded.
Proof.Let an → a. Then there exists some N such that |an − a| < 1 for alln > N. By the triangle inequality, this means that
|an| = |an − a+ a| ≤ |an − a|+ |a| < 1 + |a|
for n > N. It follows that
|an| ≤ max|a1|, ... , |aN |, |a|+ 1 =: C
for all n ∈ N, so sup|an| ≤ C <∞ and the range is bounded.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 118 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - General ResultsThe following result looks harmless, even trivial, but is of great importancefor applications, proofs and later generalizations.2.2.13. Lemma.
1. The sequence (an) converges to a if and only if the sequence(bn) := (an − a) converges to zero, i.e.,
an → a ⇔ an − a→ 0
2. The sequence (an) converges to 0 if and only if the sequence(bn) = (|an|) converges to zero, i.e.,
an → 0 ⇔ |an| → 0
Proof.1. |an − a| < ε ⇔ |(an − a)− 0| < ε.2.∣∣|an| − 0
∣∣ < ε ⇔ |an| < ε ⇔ |an − 0| < ε
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 119 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - General Results
2.2.14. Lemma. A convergent sequence has precisely one limit.
Obviously a convergent sequence has a limit; the statement of the lemmais that it can not have more than one limit.
Proof.Let a = b and assume that an → a and an → b. Then for every ε > 0there exists some N(ε) such that |an − a|, |an − b| < ε/2 for all n > N(ε).Let ε = |a− b|/2 and choose n > N(|a− b|/2). Then
|a− b| = |a− an + an − b| ≤ |a− an|+ |an − b| < ε
2+ε
2=|a− b|
2,
which is a contradiction.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 120 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - General Results
2.2.15. Proposition. Let (an) and (bn) be convergent real or complexsequences with an → a and bn → b for some a, b ∈ C. Then
1. lim(an + bn) = a+ b ,2. lim(an · bn) = ab,3. lim an
bn= a/b if b = 0.
We will prove the first statement, the second statement will be part ofyour homework, while the last statement will be discussed in the recitationclass.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 121 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - General Results
Proof of 1.Let an → a, bn → b. We will show that an + bn → a+ b. Fix any ε > 0.Then for some Na ∈ N for all n > Na we have
|an − a| < ε
2.
Similarly, for some Nb ∈ N we have |bn − b| < ε/2 for all n > Nb. DefineN := maxNa,Nb. Then for all n > N we have |an − a| < ε/2 and|bn − b| < ε/2. Furthermore,
|an + bn − (a+ b)| ≤ |an − a|+ |bn − b| < ε
2+ε
2= ε. (2.2.3)
Hence for any ε > 0 we have found some N such that (2.2.3) holds for alln > N.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 122 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Some Applications
One of the most useful tools for showing the convergence of a sequence isthe so-called Squeze Theorem:
2.2.16. Theorem. Let (an), (bn) and (cn) be real sequences withan < cn < bn for sufficiently large n. Suppose that lim an = lim bn = a.Then (cn) converges and lim cn = a.
We use the phrase “(...) for sufficiently large n” here to say “there existssome N0 ∈ N such that for all n > N0 (...).” This is a common expressionin mathematics and will occur later in other contexts.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 123 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Some Applications
Proof of the Squeeze Theorem.Fix ε > 0. Then there exists a sufficiently large N ∈ N such that
a− ε < an and bn < a+ ε for all n > N.
Since an < cn < bn for sufficiently large n, this implies
a− ε < cn < a+ ε ⇔ |cn − a| < ε
for all n > N.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 124 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Some Applications
2.2.17. Corollary. Let (rn) be a real sequence with lim rn = 0 and let (zn)be a complex sequence with
|zn| < rn for sufficiently large n.
Then zn → 0.
This follows from the squeeze theorem with an = 0, bn = rn and cn = |zn|together with Lemma 2.2.13, part 2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 125 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Some Applications
2.2.18. Proposition. Let q ∈ C, |q| < 1. Then limn→∞
qn = 0.
For the proof, we will need a consequence of Bernoulli’s Inequality, whichwill be discussed in the assignments:
2.2.19. Lemma. For x > −1 and n ∈ N,
(1 + x)n > nx .
The proof of Bernoulli’s inequality (by induction) is part of your exercises.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 126 / 624
Real Functions, Convergence and Continuity Sequences
Sequences - Some Applications2.2.20. Proposition. Let q ∈ C, |q| < 1. Then lim
n→∞qn = 0.
Proof.The case q = 0 is trivial, so we now assume 0 < |q| < 1. Then
1
|q|=: 1 + τ for some τ > 0.
By Bernoulli’s inequality,(
1
|q|
)n
= (1 + τ)n > nτ . Therefore,
|qn| = |q|n < 1
nτ.
The sequence ( 1nτ ) converges to 0 as n→∞, so by Corollary 2.2.17 we
have qn → 0 as n→∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 127 / 624
Real Functions, Convergence and Continuity Sequences
Complex Sequences2.2.21. Lemma. Let (zn) be a complex sequence, zn = xn + iyn and letz = x + iy ∈ C. Then
zn → z ⇐⇒ (xn → x and yn → y).
Proof.We use that
maxa, b ≤√a2 + b2 ≤ |a|+ |b|
(see this by squaring both sides) and
|zn − z | = |xn − x + i(yn − y)| =√
(xn − x)2 + (yn − y)2,
which gives
max|xn − x |, |yn − y |
≤ |zn − z | ≤ |xn − x |+ |yn − y |.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 128 / 624
Real Functions, Convergence and Continuity Sequences
Complex Sequences
Proof (continued).It is then clear that
|zn − z | < ε ⇒ |xn − x | < ε ∧ |yn − y | < ε
and|xn − x | < ε
2∧ |yn − y | < ε
2⇒ |zn − z | < ε.
From this and the definition of convergence, the Lemma follows.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 129 / 624
Real Functions, Convergence and Continuity Sequences
Real SequencesAs we have seen, the convergence of complex sequences can be reduced tothe convergence of real sequences. This is very important, because one ofthe most useful techniques for showing convergence use the order relation“<”, which is defined for real, but not complex numbers.
2.2.22. Definition. A real sequence (an) is called increasing if an ≤ an+1 for all n ∈ N. decreasing if an ≥ an+1 for all n ∈ N. strictly increasing if an < an+1 for all n ∈ N. strictly decreasing if an > an+1 for all n ∈ N. monotonic if it is either increasing or decreasing.
2.2.23. Notation. If xn is increasing (decreasing) and xn → x , we writexn x (xn x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 130 / 624
Real Functions, Convergence and Continuity Sequences
Real Sequences - Boundedness & Monotonicity2.2.24. Theorem. Every monotonic and bounded (real) sequence (an) isconvergent. More precisely,
an supan : n ∈ N if (an) is increasing,an infan : n ∈ N if (an) is decreasing,
We will show the first statement only.Proof.Let (an) be an increasing and bounded sequence. Then the range
A := ran(an) = an : n ∈ N
is non-empty and bounded. By property (P13) it follows that the leastupper bound
supA =: σ <∞
exists.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 131 / 624
Real Functions, Convergence and Continuity Sequences
Order relations for Sequences in R
Proof (continued).For every ε > 0 there exists some number N such that σ− ε < aN . (If thiswere untrue, then σ − ε would be an upper bound for X , contradicting thedefinition of σ as least upper bound.) Since the sequence is increasing, itfollows that
σ − ε < aN ≤ an ≤ σ
for all n ≥ N. But this means that σ − an < ε for all n > N, so an → σ.The case where (an) is decreasing is dealt with analogously.Note that the proof has essentially used not only the ordering “<” (whichis defined for rational and real numbers), but also property (P13), which isonly defined for real numbers. The statement of the theorem is false forrational numbers.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 132 / 624
Real Functions, Convergence and Continuity Sequences
General Sequences - Subsequences
2.2.25. Definition. Let (an)n∈N be a sequence and let (nk)k∈N be a strictlyincreasing sequence of natural numbers. Then the composition
(ank )k∈N := (an)n∈N (nk)k∈N = (an1 , an2 , an3 , ...)
is called a subsequence of (an).
2.2.26. Example. If (an)n∈N = (a0, a1, a2, ...) is a sequence, then(a2k) = (a0, a2, a4, ...) is a subsequence of (an), since the map k 7→ 2k(the sequence (2k)k∈N) is strictly increasing.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 133 / 624
Real Functions, Convergence and Continuity Sequences
General Sequences - Subsequences
2.2.27. Definition. Let (an)n∈N be a sequence and A ⊂ N any infinite setof natural numbers. We define the subsequence
(an)n∈A
to be the composition of (an) and the sequence (nk), where n0 = minAand nk+1 > nk , nk ∈ A, for all k ∈ N (nk is the kth-smallest element ofA).
2.2.28. Example. If (an)n∈N is a sequence and if we denote the set of evennatural numbers by E , then
(a2k)k∈N = (an)n∈E .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 134 / 624
Real Functions, Convergence and Continuity Sequences
General Sequences - Subsequences
2.2.29. Lemma. Let (an) be a convergent sequence with limit a. Then anysubsequence of (an) is convergent with the same limit.
Proof.Let (ank ) be a subsequence of (an). Then we need to show that
∀ε>0
∃K∈N
∀k>K
|ank − a| < ε.
We know that∀
ε>0∃
N∈N∀
n>N|an − a| < ε.
Fix ε > 0 and choose a corresponding N. Since the sequence (nk) isstrictly increasing, there exists some K ∈ N such that nk > N for allk > K .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 135 / 624
Real Functions, Convergence and Continuity Sequences
Monotonic Subsequences of Real Sequences
2.2.30. Lemma. Every real sequence has a monotonic subsequence.
Proof.Let the sequence (an) be given and define
A := n ∈ N : an ≥ ak for all k > n.
Now if A is not finite, the subsequence
(an)n∈A
is decreasing and we have found a monotonic subsequence.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 136 / 624
Real Functions, Convergence and Continuity Sequences
Monotonic Subsequences of Real Sequences
Proof (continued).If
A = n ∈ N : an ≥ ak for all k > n.
is finite, then there exists some N such that no number n > N belongs toA. But that means that
∀n>N
∃l>n
al > an. (2.2.4)
Fix any number n0 > N. Next, in (2.2.4) set n = n0 and choose somel > n0 such that al > an0 . Denote this l by n1. Then, in (2.2.4) set n = n1and choose some l > n1 such that al > an1 . This procedure can berepeated to find a sequence of increasing numbers n0, n1, n2, ... such that(ank )k∈N is a strictly increasing subsequence of (an)n∈N.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 137 / 624
Real Functions, Convergence and Continuity Sequences
Accumulation Points2.2.31. Definition. Let (an) be a sequence. Then a number a is called anaccumulation point of (an) if
∀ε>0
∀N∈N
∃n>N|an − a| < ε.
2.2.32. Remark. If a sequence converges, then the limit is the onlyaccumulation point. Prove this explicitly for yourself!
2.2.33. Examples.1. The sequence (−1)n − 1/n has accumulation points 1 and −1, but
does not converge.2. The sequence (an) given by
an =
0 n evenn n odd
has the accumulation point 0, but does not converge.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 138 / 624
Real Functions, Convergence and Continuity Sequences
Accumulation Points2.2.34. Lemma. A number a is an accumulation point of a sequence (an)if and only if there exists a subsequence of (an) that converges to a.
Proof.(⇐) This case follows immediately from the definitions - do it yourself!(⇒) We assume that a is an accumulation point of (an) = (a1, a2, a3, ...).
By the definition of an accumulation point, there exists some ak suchthat |a− ak | < ε := 1; let n1 = k. Again, for ε = 1/2 and N = n1there exists some l > n1 such that |a− al | < 1/2; let n2 = l . Bychoosing ε = 1, 1/2, 1/3, 1/4, ... we hence iteratively find naturalnumbers nk , k ∈ N, such that (nk) is a strictly increasing sequenceand
|ank − a| < 1
k, k = 1, 2, 3, ... .
It follows that (ank ) is a subsequence of (an) that converges to a.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 139 / 624
Real Functions, Convergence and Continuity Sequences
Theorem of Bolzano-Weierstraß
We can now prove the following fundamental result:
2.2.35. Theorem of Bolzano-Weierstraß. Every bounded real sequence hasan accumulation point.
Proof.Let (an) be a bounded sequence of real numbers. By Lemma 2.2.30, thissequence has a monotonic subsequence, which is of course also bounded.By Theorem 2.2.24, this subsequence converges, so by Lemma 2.2.34, (an)has an accumulation point.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 140 / 624
Real Functions, Convergence and Continuity Sequences
Generalizing Convergence
As previously hinted, the concept of sequences and convergence onlyrequires two things:
A sequence of objects, realized by a map (an) : N→ M, where M issome arbitrary set and
The concept of “closeness,” so we can define convergence. We wantto express mathematically that a is a limit of (an) if an is arbitrarilyclose to a for sufficiently large n.
The first concept is already quite clear, and we don’t need to elaborate.We simply define a sequence in a set M as a map (an) : N→ M.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 141 / 624
Real Functions, Convergence and Continuity Sequences
Generalizing Convergence
For the second concept there are various generalizations, where the mostgeneral ideas are part of the mathematical field of Topology, which we willnot go into here. We will instead discuss a generalization of theneighborhoods
Bε(x) = y ∈ M : |x − y | < ε
where we replace |x − y | by a general “distance function”, called a metric.A metric will associate to two points x , y ∈ M their distance, which shouldbe a positive real number. Hence it will be a function defined not on M,but on the set of pairs M ×M, and it should satisfy certain conditions(such as being positive).We also want the metric to be symmetric (the distance from x to y shouldequal the distance from y to x), we want the distance from x to y to be 0if and only if x = y , and we want a form of the triangle inequality to hold.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 142 / 624
Real Functions, Convergence and Continuity Sequences
Generalizing ConvergenceWe hence allow any function that satisfies these conditions to be called adistance function, or metric. Formally, we state the following definition:
2.2.36. Definition. Let M be a set. A map ϱ : M ×M → R is called ametric if
(i) ϱ(x , y) ≥ 0 for all x , y ∈ M and ϱ(x , y) = 0 if and only if x = y .(ii) ϱ(x , y) = ϱ(y , x)
(iii) ϱ(x , z) ≤ ϱ(x , y) + ϱ(y , z).The pair (M, ϱ) is then called a metric space.
2.2.37. Examples.(i) M = (0, 1] ⊂ R, ϱ(x , y) = |x − y |,(ii) M = C, ϱ(z ,w) = max|Re z − Rew |, |Im z − Imw |,(iii) M = C, ϱ(z ,w) = |Re z − Rew |+ |Im z − Imw |
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 143 / 624
Real Functions, Convergence and Continuity Sequences
Examples of Metric Spaces2.2.38. Example. For M = C the New York City metric is given byϱ(x , y) = |x1 − y1|+ |x2 − y2|, where x = x1 + ix2 and y = y1 + iy2.
Extracted from Hammonds 1910 map, ”New York City and vicinity.” 1910. Wikipedia. Wikimedia Foundation. Web. 20 August 2013
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 144 / 624
Real Functions, Convergence and Continuity Sequences
Generalizing Convergence
We can hence define convergence of sequences in metric spaces (M, ϱ),where convergence of a sequence (an) : N→ M is determined by (2.2.1),
limn→∞
an = a :⇔ ∀ε>0
∃N∈N
∀n>N
an ∈ Bε(a).
where
Bε(a) = y ∈ M : ϱ(y , a) < ε, ε > 0, a ∈ M.
All results that do not use the properties of the real or complex numbersthen also apply to sequences in (M, ϱ), we just need to replace |x − y |everywhere with ϱ(x , y).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 145 / 624
Real Functions, Convergence and Continuity Sequences
Generalizing Convergence
In particular, we can define boundedness.
2.2.39. Definition. Let M be a set and (an) be any sequence in M. Fixx ∈ M. Then (an) is called bounded if
∃R>0
∀n∈N
an ∈ BR(x)
Note that this definition does not depend on the point x ; if (an) isbounded for some choice of x , then it is also bounded if some other y ∈ Mis used instead. We say that boundedness is well-defined.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 146 / 624
Real Functions, Convergence and Continuity Sequences
Cauchy SequencesWe now want to study sequences in metric spaces whose elements “groupclose together”, without necessarily converging. In particular, we want tocharacterize sequences, where the distance between the elements becomessmaller and smaller. Such sequences are called Cauchy squences.
2.2.40. Definition. A sequence (an) in a metric space (M, ϱ) is called aCauchy sequence if
∀ε>0
∃N∈N
∀m,n>N
ϱ(am, an) < ε.
2.2.41. Remark. Every convergent sequence is a Cauchy sequence. Thisfollows from
ϱ(an, am) ≤ ϱ(an, a) + ϱ(am, a).
where an → a. However, not every Cauchy sequence converges.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 147 / 624
Real Functions, Convergence and Continuity Sequences
Cauchy Sequences
2.2.42. Lemma. Every Cauchy sequence in a metric space (M, ϱ) isbounded.
Proof.Let (an) be a Cauchy sequence. We show that (an) is bounded. For ε = 1we can find some N such that ϱ(aN , an) < 1 for all n > N. Let x ∈ M besome fixed point. By the triangle inequality,
ϱ(an, x) ≤ ϱ(an, aN) + ϱ(aN , x) < 1 + ϱ(aN , x) for n > N
and
ϱ(an, x) ≤ maxϱ(a1, x), ... , ϱ(aN , x), ϱ(aN , x) + 1 =: C ,
hence an ∈ BC+1(x) for all n ∈ N and (an) is bounded.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 148 / 624
Real Functions, Convergence and Continuity Sequences
Cauchy Sequences2.2.43. Theorem. Every Cauchy sequence in R with the metric
ϱ : R× R→ R, ϱ(x , y) = |x − y |,
is convergent.
Proof.Let (an) be a Cauchy sequence, so in particular (an) is bounded. By thetheorem of Bolzano-Weierstraß, (an) has a convergent subsequence (ank )whose limit we denote by a, i.e., ank → a.We now prove that an → a. Let ε > 0 be fixed. Then we can fix some Nsuch that |an − am| < ε/2 for n,m > N. Due to the convergence of thesubsequence, we can also find some nk > N such that |ank − a| < ε/2.Then for any m > N we have
|am − a| ≤ |am − ank |+ |ank − a| < ε
2+ε
2= ε.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 149 / 624
Real Functions, Convergence and Continuity Sequences
Complete Metric SpacesSimilarly, one can show that every complex Cauchy sequence converges.The property that every Cauchy sequence converges is a very nice propertyof the metric space (M, ϱ), i.e., it depends on the metric ϱ as well as onthe set M.In fact, this property is so useful, that we give it a special name,
2.2.44. Definition. A metric space (M, ϱ) is called complete if everyCauchy sequence converges in M.
2.2.45. Examples. Let ϱ(x , y) = |x − y |. Then1. The spaces (R, ϱ) and (C, ϱ) are complete. (See recitation class for
(C, ϱ).)2. The space ([0, 1), ϱ) is incomplete. (Take an = 1− 1/n.)3. The space (Q, ϱ) is incomplete. (See exercises.)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 150 / 624
Real Functions, Convergence and Continuity Sequences
Complete Metric Spaces2.2.46. Example. Consider the metric ϱ : R× R→ R defined by
ϱ(x , y) =
∣∣∣∣ x
1 + |x |− y
1 + |y |
∣∣∣∣ .It can easily be checked that ϱ is actually a metric; the graph picturedbelow helps to see why ϱ(x , y) = 0 if and only if x = y , the symmetry andtriangle inequality are clear from the definition.
x
-1
1
y
y
x
x ¤ + 1
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 151 / 624
Real Functions, Convergence and Continuity Sequences
Complete Metric SpacesNow we claim that R with this metric is incomplete. In fact, take thesequence (an) given by an = n. We will first show that this is a Cauchysequence. Consider the sequence (bn) in R with the usual metric given by
bn =n
1 + n.
Since bn → 1 as n→∞ it is clear that (bn) is a Cauchy sequence in Rwith the usual metric, i.e., for any ε > 0 there exists an N > 0 such thatfor all n,m > N ∣∣∣∣ m
1 +m− n
1 + n
∣∣∣∣ < ε.
But the left-hand side is just ϱ(am, an), so we see that (an) is a Cauchysequence in (R, ϱ). However, it is easily seen that (an) does not convergeto any number a in (R, ϱ), so we have found a non-convergent Cauchysequence in (R, ϱ) and the metric space is hence incomplete.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 152 / 624
Real Functions, Convergence and Continuity Sequences
The Real Numbers
Reference Spivak, Calculus, 4th ed., Chapter 28, Problem 1.
Our aim is to construct the completion of an incomplete metric space by“adding” all the limits of Cauchy sequences to the space. We will illustratehow to do this with the rational numbers; the general procedure isanalogous.Given Q, we may consider the set of all sequences in Q that converge to alimit. Denote this set by Conv(Q). Each sequence (an) ∈ Conv(Q) isassociated uniquely to a number a ∈ Q, namely its limit. We can now saythat two sequences are equivalent if they have the same limit, i.e.,
(an) ∼ (bn) :⇔ limn→∞
an = limn→∞
bn. (2.2.5)
(This is called an equivalence relation.)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 153 / 624
Real Functions, Convergence and Continuity Sequences
Construction of the Real Numbers
We denote the set of all sequences with the same limit as a sequence (an)by [(an)]. Such a set is called a (equivalence) class and the set of allclasses is denoted Conv(Q)/ ∼.Since each rational number is represented by a class (why?) we see thatthe rational numbers may be identified with the set of all classes ofconvergent sequences:
Q ≃ Conv(Q)/ ∼ .
This is just a formal way of saying that the set of rational numberscorresponds to the set of all convergent sequences of rational numbers, ifany two sequences with the same limit are considered equivalent. Anyrational number corresponds to exactly one class of sequences andvice-versa.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 154 / 624
Real Functions, Convergence and Continuity Sequences
Construction of the Real Numbers
We can now consider a larger class of sequences, that of Cauchy sequencesof rational numbers, denoted by Cauchy(Q). Since every convergentsequence is a Cauchy sequence, Conv(Q) ⊂ Cauchy(Q).Furthermore, we say that two Cauchy sequences are equivalent not if theyhave the same limit (because they might not converge) but rather if theirdifference converges to zero:
(an) ∼ (bn) :⇔ limn→∞
(an − bn) = 0. (2.2.6)
Of course, (2.2.6) is equivalent to (2.2.5) for convergent sequences. Wenow have the larger set
Cauchy(Q)/ ∼ ⊃ Conv(Q)/ ∼ ≃ Q
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 155 / 624
Real Functions, Convergence and Continuity Sequences
Construction of the Real Numbers
The set Cauchy(Q)/ ∼ incorporates the rational numbers and by itsconstruction every Cauchy sequence (an) in Cauchy(Q)/ ∼ has a limit,namely precisely the object represented by the class [(an)]. We write
R := Cauchy(Q)/ ∼
and call this set the real numbers.It can be shown that all the operations of the rational numbers can beextended to R and that properties (P1) – (P12) continue to hold.Furthermore, in the set R property (P13) holds. (This can be shown byrelating least upper bounds to Cauchy sequences; the details are left toyou!)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 156 / 624
Real Functions, Convergence and Continuity Sequences
Construction of the Real Numbers2.2.47. Example. Every rational number has a finite decimal representation.We can think of a real number as having an “infinite decimalrepresentation.” (More details on this later.) For example, the sequence
1, 1.4, 1.41, ...
may converge to√2 if the following terms are chosen appropriately.
This “infinite decimal representation” is just the way that real numbers areintroduced in middle school. As another example, the sequences
(an) := (0.4, 0.49, 0.499, 0.4999, 0.49999, ...)
and(bn) := (0.5, 0.5, 0.5, 0.5, 0.5, ...)
are equivalent in the sense of (2.2.6), since|an − bn| = 10−(n+1) n→∞−−−→ 0.
Hence, 0.499999 ... and 0.5 are considered to represent the same number.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 157 / 624
Real Functions, Convergence and Continuity Real Functions
Functions and Maps
Sequences
Real Functions
Limits of Functions
Continuous Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 158 / 624
Real Functions, Convergence and Continuity Real Functions
Real Functions
Reference Spivak, Calculus, Chapters 3,4.
In the previous section, we have considered real and complex sequences,i.e., functions defined on a subset of the natural numbers. Now, we willinvestigate real functions, i.e., functions defined on a subset of R with realvalues, i.e., with range in the real numbers. Recall that we use the notation
f : Ω→ R, x 7→ f (x).
where Ω ⊂ R is the domain of f .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 159 / 624
Real Functions, Convergence and Continuity Real Functions
Real Functions
IMPORTANT: Remember that we denote a function by the symbol f , andthe values of the function at a some point x by f (x). It is important notto confuse these two notations; f is a function (set of pairs), while f (x) isa (real) number.Sometimes we will say (e.g.) “the function f (x) = x2” or even “thefunction x2”, but this is just short for the correct formulation “thefunction f with values given by f (x) = x2.”In this context, we consider the function f to have the maximal possibledomain, in other words,
dom f = x ∈ R : f (x) makes sense as a real number.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 160 / 624
Real Functions, Convergence and Continuity Real Functions
GraphsIn order to specify a unique function we always need to give
(i) the domain Ω and the space Y containing the range,(ii) the association (“mapping”) x 7→ f (x).
Although a function is actually defined through a set of pairs of elements,sometimes one prefers to think of a function as being given by (i) and (ii)above, and denotes by
Γ(f ) = (x , f (x)) ∈ X × Y : x ∈ dom f
the graph of the function f .If Ω ⊂ R and Y = R, we visualize the graph through the drawing of twoperpendicular axes, the horizontal of which represents the real numbersthat encompass the domain, the vertical representing the reals includingthe range. In this way, a point (x , y) ∈ R2 is represented by a point in the“coordinate system” of these graphs.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 161 / 624
Real Functions, Convergence and Continuity Real Functions
GraphsPlotting the graph of a function can be very useful for a quick overvew ofits properties. Compare the information conveyed by
f : R→ R, f (x) =1
1 + x + x2
and
-1
2
x
1
y
y =
1
1 + x + x2
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 162 / 624
Real Functions, Convergence and Continuity Real Functions
Graphs
In order to arrive at such a plot, we need to analyze the properties of thefunction. We are interested in finding
the intercepts with the axes, any “jumps” or “breaks” in the graph, points where the graph approaches a maximum or minimum value, values which are approached by the graph as |x | becomes very large.
Much of the rest of this term’s lecture will focus on such an analysis. Forcomplicated functions, we will literally need “every trick in the book.”
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 163 / 624
Real Functions, Convergence and Continuity Real Functions
Polynomial Functions
An important class of functions is constituted b the polynomial functions,which have the form
dom f = R, f (x) =n∑
k=0
ckxk , n ∈ N, c1, ... , cn ∈ R.
The natural number n is called the degree of the polynomial.Polynomial functions of degree 1 are straight lines, and can be uniquelydetermined by fixing two points through which they pass:Let (x1, y1) and (x2, y2) be two points in the plane R2. Then the straightline through these two points is given by
f (x) =x − x2x1 − x2
y1 +x − x1x2 − x1
y2 =y1 − y2x1 − x2
(x − x2) + y2
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 164 / 624
Real Functions, Convergence and Continuity Real Functions
Power FunctionsPolynomial functions of the form f (x) = xn, n ∈ N, are also called powerfunctions and can best be visualized through their graphs:
x
1
y
y = 1
y = x
y = x2
y = x3
y = x4
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 165 / 624
Real Functions, Convergence and Continuity Real Functions
Rational FunctionsQuotients of polynomial functions are called rational functions. They havethe form
f (x) =p(x)
q(x), p, q are polynomial functions.
and have domain dom f = x ∈ R : q(x) = 0.
x
y
y =
1
x + x2
- 1
y =
x2
7 + 7 x
y =
x
1 + x2
y =
1
x
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 166 / 624
Real Functions, Convergence and Continuity Real Functions
Piecewise FunctionsWe can define functions in a piecewise way by writing
f (x) =
g1(x) P1(x)... ...gn(x) P2(x)
where P1, ... ,Pn are predicates g1, ... , gn are suitable functions withdom gk = x ∈ R : Pk(x). Generally we require that for any given x ∈ Rat most one of the predicates is true. The domain of f is then given bydom f = x ∈ R : ∃
1≤k≤nPk(x).
2.3.1. Example.
f (x) =
x , −1 ≤ x < 1,
2− x , 1 ≤ x < 3,
dom f = [−1, 3).
-1 1 2 3x
-1
1
y
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 167 / 624
Real Functions, Convergence and Continuity Real Functions
Periodic FunctionsA function f : R→ R such that there exists some p ∈ R such that
f (x + p) = f (x) for all x ∈ R
is called periodic with period p.We can construct a periodic function as follows: First, we define the floorand ceiling of a number x ∈ R as
⌊x⌋ := maxn ∈ Z : n ≤ x, ⌈x⌉ := minn ∈ Z : n ≥ x,
respectively. We then define the “distance to the nearest integer” function,the function
· : R→ [0, 1/2], x := min|x − ⌈x⌉|, |x − ⌊x⌋|. (2.3.1)
(Here we have used a dot to indicate the position of the argument x whenevaluating · at x ∈ R.) Clearly, x + 1 = x for all x , so · isperiodic with period p = 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 168 / 624
Real Functions, Convergence and Continuity Real Functions
Periodic Functions
The graph of the function · is sketched below:
-4 -3 -2 -1 0 1 2 3
x
1
2
y
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 169 / 624
Real Functions, Convergence and Continuity Real Functions
Manipulating FunctionsGiven a function f : R→ R, we can stretch and translate the graph of fhorizontally by multiplying its argument with a number or adding aconstant to the argument:
x
y
y = f HxLy = f H2 xLy = f Hx2L
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 170 / 624
Real Functions, Convergence and Continuity Real Functions
Manipulating FunctionsGiven a function f : R→ R, we can stretch and translate the graph of fhorizontally by multiplying its argument with a number or adding aconstant to the argument:
x
y
y = f HxLy = f Hx+1Ly = f Hx-1L
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 171 / 624
Real Functions, Convergence and Continuity Real Functions
Manipulating FunctionsGiven a function f : R→ R, we can stretch and translate the graph of fvertically by multiplying its argument with a number or adding a constantto the value of f :
x
y
y = f HxL
y = 2 f HxL
y = f HxL 2
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 172 / 624
Real Functions, Convergence and Continuity Real Functions
Manipulating FunctionsGiven a function f : R→ R, we can stretch and translate the graph of fvertically by multiplying its argument with a number or adding a constantto the value of f :
x
y
y = f HxL
y = f HxL + 1
y = f HxL - 1
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 173 / 624
Real Functions, Convergence and Continuity Real Functions
Periodic FunctionsGiven the function · of (2.3.1), we would like to define the function ψhaving the graph shown below:
-10-9 -8 -7 -6 -5 -4 -3 -2 -1 1 2 3 4 5 6 7 8 9 10x
-1
1
y
Find an expression for ψ(x) using · !Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 174 / 624
Real Functions, Convergence and Continuity Real Functions
Weird Functions
Recall that we have defined the composition of maps. Now the functionφ : R \ 0 → R, φ(x) = 1/x has an interesting property: it maps theinterval (0, 1) to the interval (1,∞) bijectively, and the interval (1,∞) to(0, 1) bijectively. Furthermore, f (1) = 1. Therefore, the action of φ on thepositive real numbers can be considered as a “mirroring” at x = 1. Theaction on the negative real numbers is similar.If we compose a function g with φ, the graph of g φ shows thismirroring. We illustrate by taking the periodic function ψ that we definedon the previous slide.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 175 / 624
Real Functions, Convergence and Continuity Real Functions
Weird Functions
-10-9 -8 -7 -6 -5 -4 -3 -2 -1 1 2 3 4 5 6 7 8 9 10x
-1
1
y
The function ψ φ, domψ ϕ = R \ 0, graphed above, is not periodicany more. Something strange seems to go on near the origin.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 176 / 624
Real Functions, Convergence and Continuity Real Functions
Weird Functions
-1 1x
-1
1
y
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 177 / 624
Real Functions, Convergence and Continuity Real Functions
Weird Functions
-0.1 -0.05 0.05x
-1
1
y
The infinitely many peaks in (1,∞) of ψ have been crammed into theinterval (0, 1) by the composition. It becomes impossible to give a preciserendering of the graph near x = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 178 / 624
Real Functions, Convergence and Continuity Real Functions
Weird Functions
Strange functions such as the last one can be constructed quite easily. Forexample, it is impossible to give any precise idea of the graph of
f (x) =
1, x rational,0, x irrational.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 179 / 624
Real Functions, Convergence and Continuity Real Functions
Parity and Monotonicity
2.3.2. Definition. Let Ω ⊂ R and f : Ω → R. The f is said to be even if f (−x) = f (x) for all x ∈ Ω. odd if f (−x) = −f (x) for all x ∈ Ω. increasing if f (x) ≥ f (y) for all x > y , x , y ∈ Ω . strictly increasing if f (x) > f (y) for all x > y , x , y ∈ Ω. decreasing if f (x) ≤ f (y) for all x > y , x , y ∈ Ω. strictly decreasing if f (x) < f (y) for all x > y , x , y ∈ Ω . monotonic if f is either decreasing or increasing. strictly monotonic if f is either strictly decreasing or strictly
increasing.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 180 / 624
Real Functions, Convergence and Continuity Limits of Functions
Functions and Maps
Sequences
Real Functions
Limits of Functions
Continuous Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 181 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limit of a FunctionTaking our inspiration from sequences, we would like to formalize theconcept of the limit of a function. In other words, we would like to be ableto describe the behaviour of a function f near some point x0.In the case of sequences, the only interesting case was x0 =∞. While weare ultimately interested in limits of functions also for x0 ∈ R, this case canbe generalized to functions of a real variable in a straightforward manner:2.4.1. Definition. Let f be a real- or complex-valued function defined on asubset of R that includes some interval (a,∞), a ∈ R. Then f convergesto L ∈ C as x →∞, written
limx→∞
f (x) = L :⇔ ∀ε>0
∃C>0
∀x>C|f (x)− L| < ε. (2.4.1)
The limit of a function as x → −∞ is defined similarly.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 182 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limit of a Function
2.4.2. Example. We have
limx→∞
1
x= 0, lim
x→−∞
1
x= 0,
The analogy of the previous definition to that for sequences is clear.However, if we want to extend the concept of a limit to x0 ∈ R, we needto look at the essence of this definition:
We preset how close we want f (x) to be to L (“ ∀ε>0
”). Then if xis close enough to x0 =∞ (“ ∃
C>0∀
x>C”) we can achieve
|f (x)− L| < ε.
If we want to extend this to x0 ∈ R, we merely have to find another way ofsaying “if x is close enough to x0 ∈ R”.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 183 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limit of a FunctionThis is easily done; we express “if x is close enough to x0 ∈ R” by saying“if there exists some δ > 0 such that |x − x0| < δ.” We want this to holdfor all x , except possibly for x = x0. (Hence, “near x0” does not mean “atx0!”We also want f to be defined “near x0,” so we assume that x0 ∈ dom f .We can then finally write down our definition for the limit of a function atx0 ∈ R:2.4.3. Definition. Let f be a real- or complex-valued function defined on asubset Ω ⊂ R and let x0 ∈ Ω. Then the limit of f as x → x0 is equal toL ∈ C, written
limx→x0
f (x) = L :⇔ ∀ε>0∃
δ>0∀
x∈Ω\x0|x − x0| < δ ⇒ |f (x)− L| < ε.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 184 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limit of a Function2.4.4. Examples.
1. limx→0
x+12x+1 = 1,
2. limx→1
√x = 1.
As for sequences, the main method for finding limits is not directly by thedefinition, but rather through the following theorem:2.4.5. Theorem. Let f and g be real- or complex-valued functions andx0 ∈ dom f ∩ dom g such that lim
x→x0f (x) and lim
x→x0g(x) exist. Then
1. limx→x0
(f (x) + g(x)) = limx→x0
f (x) + limx→x0
g(x) ,
2. limx→x0
(f (x) · g(x)) =(limx→x0
f (x))(
limx→x0
g(x)),
3. limx→x0
f (x)
g(x)=
limx→x0
f (x)
limx→x0
g(x)if limx→x0 g(x) = 0.
These statements remain true if x0 = ±∞.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 185 / 624
Real Functions, Convergence and Continuity Limits of Functions
One-sided LimitsOccasionally, it may be useful to consider one-sided limits for some x0 ∈ R:2.4.6. Definition. Let f be a real- or complex-valued function defined on asubset Ω ⊂ R and let x0 ∈ Ω.Then the limit of f as x converges to x0 from above is equal to L ∈ C,
limxx0
f (x) = L :⇔ ∀ε>0∃
δ>0∀
x∈Ω\x00 < x − x0 < δ ⇒ |f (x)− L| < ε.
Analogously, the limit of f as x converges to x0 from below is equal toL ∈ C,
limxx0
f (x) = L :⇔ ∀ε>0∃
δ>0∀
x∈Ω\x00 < x0 − x < δ ⇒ |f (x)− L| < ε.
2.4.7. Remark. Clearly, f (x)→ L as x → x0 if and only if f (x)→ L asx x0 and f (x)→ L as x x0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 186 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of FunctionsIn a similar manner to our definition for sequences, we can also definedivergence to plus or minus infinity. The precise formulation is left as anexercise. We give some concrete examples2.4.8. Examples.
1. limx→0
1x does not exist; lim
x0
1x =∞, lim
x0
1x = −∞. Note that 1/x is not
defined at x = 0.2. Let f : R→ R be defined as
f (x) =
x
1+x2, −∞ < x ≤ 0,
1 + x , 0 < x <∞.
Then limx0
f (x) = 1, limx0
f (x) = 0 and limx→x0
f (x) exists for all x0 = 0.Note that f (0) = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 187 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of Functions using Sequences
There is a close relation between the limit of a function and the limits ofsequences:2.4.9. Theorem. Let f be a real- or complex-valued function defined on asubset Ω ⊂ R and let x0 ∈ Ω. Then
limx→x0
f (x) = L ⇔ ∀(an)
an∈Ω\x0
(an
n→∞−−−→ x0 ⇒ f (an)n→∞−−−→ L
)
A similar result holds for x0 = ±∞.This remarkable result means that the limits of functions can be describedentirely through sequences.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 188 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of Functions using Sequences
Proof.(⇒) Assume that limx→x0 f (x) = L. Let (an) be a sequence such that
an → x0. We will show that then f (an)→ L. Fix ε > 0. Then wechoose a δ > 0 such that
|x − x0| < δ ⇒ |f (x)− L| < ε.
Furthermore, we can choose an N ∈ N such that |an − x0| < δ forn > N. Then for n > N, |f (an)− L| < ε, so f (an)→ L.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 189 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of Functions using SequencesProof (continued).
(⇐) Assume that f (an)→ L for all sequences (an) with an → x0. Weprove that then f (x)→ L as x → x0 by contraposition. Assume thatf (x) → L as x → x0. Then
∃ε>0∀
δ>0∃
x∈Ω|x − x0| < δ ⇒ |f (x)− L| < ε
⇔ ∃ε>0∀
δ>0∃
x∈Ω|x − x0| < δ ∧ |f (x)− L| ≥ ε
Fix ε > 0. Then for every n ∈ N∗ there exists a number xn such that
|xn − x0| <1
n∧ |f (xn)− L| ≥ ε.
Clearly the sequence (xn) converges to x0, but f (xn) cannot convergeto L.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 190 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of Functions using Sequences2.4.10. Example. Let ψ denote the periodic function we studied earlier, asshown in the left graph below. We want to study lim
x→0ψ(1/x), shown
below right.
-10-9 -8 -7 -6 -5 -4 -3 -2 -1 1 2 3 4 5 6 7 8 9 10x
-1
1
y
-0.1 -0.05 0.05x
-1
1
y
We suspect that the limit does not exist. We consider the sequence (xn),xn = 1/(4n + 1). Then xn → 0 and ψ(1/xn) = ψ(4n + 1) = 1→ 1 asn→∞.For (yn) given by yn = 1/(4n + 3) we also have yn → 0 butψ(1/yn) = ψ(4n + 3) = −1→ −1. Therefore lim
x→0ψ(1/x) does not exist.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 191 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of Functions using Sequences
2.4.11. Example. Now consider the function f given by f (x) = xψ(1/x).We want to show that
limx→0
xψ(1/x) = 0.
Let (xn) be any sequence with xn → 0 as n→∞. Note that
|f (xn)| = |xn| · |ψ(1/xn)| ≤ |xn|n→∞−−−→ 0,
since |ψ(x)| ≤ 1 for all x ∈ R. It follows that
limn→∞|f (xn)| = 0,
so limx→0
f (x) = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 192 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of Functions using Sequences
-1 1x
-1
1
y
The function f given by f (x) = xψ(1/x); note that it is constant for|x | > 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 193 / 624
Real Functions, Convergence and Continuity Limits of Functions
Limits of Functions using Sequences
-0.1 -0.05 0.05x
y
Behavior near x = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 194 / 624
Real Functions, Convergence and Continuity Limits of Functions
Behavior of Functions
Hence limits serve to characterize the behavior of functions near points.However, our current techniques do not give us very detailedinformation.For example, consider the functions f , g , h : R→ R,
f (x) =√
x2 + 1, g(x) = x , h(x) = x2.
We expect f and g to behave “similarly” as x →∞, but h to have “muchlarger” values. However, at this point we can only say that the individuallimits do not exist and the functions diverge to infinity.Similarly, x2 should be “much smaller” than x as x → 0, even though bothlimits vanish.The main tools for characterizing such behavior are known as Landausymbols. There are two different “kinds”: big-O and little-O. We firstintroduce big-Oh.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 195 / 624
Real Functions, Convergence and Continuity Limits of Functions
The Big-O Landau Symbol2.4.12. Definition. Let f ,ϕ be real- or complex-valued functions defined ona subset Ω ⊂ R and let x0 ∈ Ω. We say that
f (x) = O(ϕ(x)) as x → x0
if and only if
∃C>0
∃ε>0
∀x∈Ω
|x − x0| < ε ⇒ |f (x)| ≤ C |ϕ(x)| (2.4.2)
2.4.13. Example. We have x + x2 = O(x) as x → 0: taking ε = 1 andC = 2, for all x ∈ R,
|x − 0| < 1 ⇒ |x + x2| ≤ |x |+ |x |2 = (1 + |x |)︸ ︷︷ ︸<2
|x |.
2.4.14. Remark. “f (x) = O(ϕ(x)) as x → x0” can be interpreted as“|f (x)| is not significantly larger than |ϕ(x)| when x is near x0.”
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 196 / 624
Real Functions, Convergence and Continuity Limits of Functions
The Big-O Landau SymbolWe can make an analogous definition for x →∞,2.4.15. Definition. Let f ,ϕ be real- or complex-valued functions defined ona subset Ω ⊂ R containing the interval (L,∞) for some L ∈ R. We saythat
f (x) = O(ϕ(x)) as x →∞
if and only if
∃C>0
∃M>L
x > M ⇒ |f (x)| ≤ C |ϕ(x)|
The statement that f (x) = O(ϕ(x)) as x → −∞ is defined similarly.2.4.16. Example. We have x + x2 = O(x2) as x →∞: taking M = 1 andC = 2, for all x ∈ R,
|x | > 1 ⇒ |x + x2| ≤ |x |+ |x |2 < 2|x |2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 197 / 624
Real Functions, Convergence and Continuity Limits of Functions
The Little-O Landau Symbol2.4.17. Definition. Let f ,ϕ be real- or complex-valued functions defined ona subset Ω ⊂ R and let x0 ∈ Ω. We say that
f (x) = o(ϕ(x)) as x → x0
if and only if
∀C>0
∃ε>0
∀x∈Ω\x0
|x − x0| < ε ⇒ |f (x)| < C |ϕ(x)| (2.4.3)
2.4.18. Example. We have x2 = o(x) as x → 0: for any C > 0 we can takeε = C , so that for all x ∈ R,
|x − 0| < ε = C ⇒ |x2| = |x | · |x | < C |x |.
2.4.19. Remark. “f (x) = o(ϕ(x)) as x → x0” can be interpreted as “|f (x)|is much smaller than |ϕ(x)| when x is near x0.”
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 198 / 624
Real Functions, Convergence and Continuity Limits of Functions
The Little-O Landau SymbolWe can make an analogous definition for x →∞,2.4.20. Definition. Let f ,ϕ be a real- or complex-valued functions definedon a subset Ω ⊂ R containing the interval (L,∞) for some L ∈ R. We saythat
f (x) = o(ϕ(x)) as x →∞
if and only if∀
C>0∃
M>Lx > M ⇒ |f (x)| < C |ϕ(x)|
The statement that f (x) = o(ϕ(x)) as x → −∞ is defined similarly.2.4.21. Example. We have 1/x2 = o(1/x) as x →∞: For any C > 0 takeM > 1/C . Then, for all x ∈ R,
x > M = 1/C ⇒ 1
x2≤ C
1
x
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 199 / 624
Real Functions, Convergence and Continuity Limits of Functions
More Examples for Landau Symbol NotationSome more examples are given below:2.4.22. Examples.
(i) 1
x2= O
(1
x
)as x →∞,
(ii) 1
x= O
(1
x2
)as x → 0,
(iii) 1
x2 + x= O
(1
x
)as x → 0,
(iv) 1
x2 + x= O
(1
x
)as x →∞,
(v) 1
x2 + x= o
(1
x3
)as x → 0,
(vi) 1
x2 + x= o(1) as x →∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 200 / 624
Real Functions, Convergence and Continuity Limits of Functions
Landau Symbols using Limits
In many practical cases, the Landau symbols can be calculated using limits:2.4.23. Theorem. Let f ,ϕ be a real- or complex-valued functions definedon an interval I ⊂ R and let x0 ∈ I . If there exists some C ≥ 0 such that
limx→x0
|f (x)||ϕ(x)|
= C , (2.4.4)
then f (x) = O(ϕ(x)) as x → x0.
Proof.Let f ,ϕ, I and x0 be given and suppose that (2.4.4) holds for some C ≥ 0.Then
∀ε>0∃
δ>0∀
x∈I\x0|x − x0| < δ ⇒
∣∣∣∣ |f (x)||ϕ(x)|− C
∣∣∣∣ < ε.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 201 / 624
Real Functions, Convergence and Continuity Limits of Functions
Landau Symbols using LimitsProof (continued).Choose ε = 1. Then there exists some δ > 0 such that |x − x0| < δ implies∣∣|f (x)| − C |ϕ(x)|
∣∣ < |ϕ(x)| for all x ∈ I \ x0,
i.e.,|f (x)| < (C + 1)|ϕ(x)|.
This is just the definition of f (x) = O(ϕ(x)) as x → x0.
2.4.24. Theorem. Let f ,ϕ be a real- or complex-valued functions definedon an interval I ⊂ R and let x0 ∈ I . Then
limx→x0
|f (x)||ϕ(x)|
= 0 ⇔ f (x) = o(ϕ(x)) as x → x0. (2.4.5)
This result will be proven in the assignments.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 202 / 624
Real Functions, Convergence and Continuity Limits of Functions
Usage of Landau SymbolsThe notation f (x) = O(ϕ(x)) as x → x0 is used to describe an absolutesize relation near x0 and at x0, to the effect that f is not significantlylarger than ϕ. For such size comparisons, it is useful to have the domain ofthe inequality (2.4.2) encompass the point x0 also.On the other hand, when using little-o, the convention is that only thebehaviour near x0 should enter into the statement that f is much smallerthan ϕ, so (2.4.3) does not encompass the point x0.This is what is responsible for the different statements of Theorems 2.4.23and 2.4.24, the first involving an implication, the second an equivalence.In fact, often the little-o symbol is defined in terms of the limit in (2.4.5)while the big-o symbol is defined in terms of (2.4.2); seehttp://mathworld.wolfram.com/LandauSymbols.html.Note also that the article on Landau symbols on Wikipedia is athttps://en.wikipedia.org/wiki/Big_O_notation and written exclusively from thepoint of view of computer science - little-o, so important in mathematics,is hardly mentioned at all.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 203 / 624
Real Functions, Convergence and Continuity Limits of Functions
Some Remarks on Landau SymbolsMore properly, “O(ϕ(x)) as x → x0” denotes a set: the set of all functionsf such that for some C ≥ 0 and some ε > 0, |f (x)| < C |ϕ(x)| for allx ∈ (x0 − ε, x0 + ε). A better notation for this set might be Ox0(ϕ) and wecould (perhaps more correctly) write
f ∈ Ox0(ϕ) instead of f (x) = O(ϕ(x)) as x → x0.
However, the second notation has become standard among physicists andmathematicians, and we write expressions like
(x + 1)4 = 1 + 4x + O(x2) meaning (x + 1)4 − (1 + 4x) = O(x2)
as x → 0. This is often more intuitive than writing
(x + 1)4 − (1 + 4x) ∈ O0(x2) or (x + 1)4 = 1 + 4x mod O0(x
2).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 204 / 624
Real Functions, Convergence and Continuity Limits of Functions
Some Remarks on Landau SymbolsWe do sacrifice the symmetry of the “=” sign, however. For example,
O(x3) = O(x2) as x → 0
means “any function f such that |f (x)| < C |x |3 for all |x | < ε for someε,C also satisfies |f (x)| < C ′|x |2 for all |x | < ε′ for some ε′,C ′.” Clearly,
O(x2)?= O(x3) as x → 0 is false.
Landau symbols can be combined with each other and with functions. Forexample, as x → 0,
c · O(xn) = O(xn), xnO(xm) = O(xn+m),
O(xn) + O(xm) = O(xmin(n,m)), O(xn)O(xm) = O(xn+m).
Further examples and applications will be given in the assignments.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 205 / 624
Real Functions, Convergence and Continuity Continuous Functions
Functions and Maps
Sequences
Real Functions
Limits of Functions
Continuous Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 206 / 624
Real Functions, Convergence and Continuity Continuous Functions
ContinuityWe have extensively discussed limits of functions, and also discovered thatthe values of a function at some point need not have anything to do withthe behavior of the function near that point.However, we would of course like to assume that “reasonable” functionsbehave in a consistent manner on their domain. We will therefore give aname to this reasonableness:2.5.1. Definition. Let Ω ⊂ R be any set and f : Ω → R be a functiondefined on Ω . Let x0 ∈ Ω. We say that f is continuous at x0 if
limx→x0
f (x) = f (x0).
If U ⊂ Ω, we say that f is continuous on U if f is continuous at everyx0 ∈ U.We say that f is continuous on its domain, or simply continuous, if f iscontinuous at every x0 ∈ Ω.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 207 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity2.5.2. Remark. For a function f to be continuous at a point x0, threeconditions have to be fulfilled:
(i) f needs to have a limit at x0 ( limx→x0
f (x) must exist);(ii) f needs to be defined at x0 (f (x0) must exist);(iii) the value of f must coincide with its limit (f (x0) = lim
x→x0f (x)).
2.5.3. Examples.1. The zig-zag function f (x) = ψ(1/x) is not continuous at x = 0
because the limit limx→0
f (x) does not exist (see Example 2.4.10).2. The function f (x) = xψ(1/x) is not continuous at x = 0 because f is
not defined at x = 0.3. The function
f (x) =
xψ(1/x) x = 0
0 x = 0
is continuous at x = 0 (see Example 2.4.11).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 208 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity
We can use the results that we have obtained previously concerning limitsof sequences and functions to express the continuity of f at x0 inequivalent ways:2.5.4. Theorem. Let Ω ⊂ R be any set and f : Ω → R be a functiondefined on Ω . Let x0 ∈ Ω. Then the following are equivalent:
1. f is continuous at x0;2. for any real sequence (an) with an → x0, lim
n→∞f (an) = f (x0);
3. ∀ε>0∃
δ>0∀
x∈dom f: |x − x0| < δ ⇒ |f (x)− f (x0)| < ε.
In fact, since we have discussed sequences in metric spaces, and we havean idea of distance in metric spaces, we can define continuity easily formaps between metric spaces. This is left to the exercises.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 209 / 624
Real Functions, Convergence and Continuity Continuous Functions
One-Sided ContinuityOccasionally it will be useful to consider one-sided continuity.2.5.5. Definition. Let Ω ⊂ R be any set and f : Ω → R be a functiondefined on Ω . Let x0 ∈ Ω. We say that f is continuous from the left orfrom below at x0 if
limxx0
f (x) = f (x0).
We say that f is continuous from the right or from above at x0 if
limxx0
f (x) = f (x0).
Of course, a function is continuous at a point if and only if it is continuousboth from above and from below. One-sided continuity can also becharacterized using monotonic sequences; the precise formulation is left toyou!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 210 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuous Extension2.5.6. Definition. Let Ω ⊂ R be any set and Ω ⊃ Ω . Suppose thatf : Ω → R and f : Ω → R are continuous functions. If f (x) = f (x) for allx ∈ Ω , we say that f is a continuous extension of f to Ω .
2.5.7. Examples.1. Let f : [0,∞)→ R, f (x) = √x and f : R→ R, f (x) =
√|x |. Then f
is a continuous extension of f .2. Let f : R \ 1 → R, f (x) = (x2 − 1)/(x − 1) and f : R→ R,
f (x) = x + 1. Then f is a continuous extension of f .
2.5.8. Remark. Suppose that Ω ⊂ R, x0 ∈ Ω and f : Ω \ x0 → R iscontinuous and has the property that lim
x→x0f (x) exists. Then f : Ω → R,
f (x) = limy→x
f (y), x ∈ Ω ,
defines the unique continuous extension of f to Ω .Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 211 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity
While continuous functions may still exhibit some curious behavior (we willstudy examples later), nevertheless continuity is sufficient as a hypothesisfor many theorems. In fact, calculus may be regarded as the branch ofmathematics studying continuous functions.For real functions f and g we can define their sum f + g and product f · gby
f + g : (dom f ) ∩ (dom g)→ R, (f + g)(x) := f (x) + g(x),
f · g : (dom f ) ∩ (dom g)→ R, (f · g)(x) := f (x) · g(x).
We thus define the sum/product of two functions by describing how f + gand f · g act on every x ∈ (dom f ) ∩ (dom g). This type of definition iscalled a pointwise definition.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 212 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity
2.5.9. Theorem. Let f and g be two real functions andx ∈ (dom f ) ∩ (dom g). Assume that both f and g are continuous at x .Then
(i) f + g is continuous at x and(ii) f · g is continuous at x .
Furthermore, if g(x) = 0, the function h defined by h(x) = 1/g(x) iscontinuous at x .Moreover, if f and g are real functions with x ∈ dom g , g(x) ∈ dom f , gis continuous at x and f is continuous at g(x), then the composition f gis continuous at x .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 213 / 624
Real Functions, Convergence and Continuity Continuous Functions
ContinuityProof.The first three assertions follow immediately from the correspondingtheorem for limits. We will prove the last assertion concerning thecomposition of f and g .Let ε > 0. Since f is continuous at g(x), we know that there exists someδ > 0 such that for all y ∈ dom f
|g(x)− y | < δ ⇒ |f (g(x))− f (y)| < ε.
We fix some such δ > 0. Now since g is continuous at x , there exists someδ > 0 such that for all z ∈ dom g
|x − z | < δ ⇒ |g(x)− g(z)| < δ.
Recall z ∈ dom(f g) if z ∈ dom g and g(z) ∈ dom f . Hence for allz ∈ dom(f g), if |x − z | < δ, then |f (g(x))− f (g(z))| < ε. This provesthat f g is continuous.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 214 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity
In fact, we have a stronger version of the last statement that does notpresume the continuity of f .2.5.10. Theorem. Let f , g be real functions such that lim
x→x0g(x) = L exists
and f is continuous at L ∈ dom f . Then
limx→x0
f (g(x)) = f (L).
The proof is similar to the previous proof and will be treated in yourrecitation classes next week.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 215 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity
The following lemma is useful in many proofs and also gives a first idea ofhow powerful continuity is.2.5.11. Lemma. Let Ω ⊂ R be some set, f : Ω → R a function that iscontinuous at some point x0 ∈ Ω and assume that f (x0) > 0. Then thereexists a δ > 0 such that f (x) > 0 for all x ∈ (x0 − δ, x0 + δ) ∩ Ω.
Proof.Since f is continuous at x0, for ε = f (x0)/2 we can find a δ > 0 such thatfor all x ∈ Ω
|x − x0| < δ ⇒ |f (x)− f (x0)| < f (x0)/2
⇒ f (x) > f (x0)/2 > 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 216 / 624
Real Functions, Convergence and Continuity Continuous Functions
The Bolzano Intermediate Value TheoremThis is the first of three important theorems that enable us to obtain hardinformation on the behavior of continuous functions.2.5.12. Theorem. Let a < b and f : [a, b]→ R be a continuous functionwith f (a) < 0 < f (b). Then there exists some x ∈ [a, b] such thatf (x) = 0.
Proof.We consider
A := x ∈ [a, b] : f (y) < 0 for y ∈ [a, x ].
Clearly A is bounded (a ≤ x ≤ b for all x ∈ A) and non-empty (a ∈ A).Therefore, A has a least upper bound a ≤ α ≤ b.We claim that in fact α < b. Note that f (b) > 0. By Lemma 2.5.11, weknow that there exists some δ > 0 such that f (x) > 0 for allx ∈ (b− δ, b]. Therefore, b− δ is an upper bound for A and hence α < b.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 217 / 624
Real Functions, Convergence and Continuity Continuous Functions
The Bolzano Intermediate Value Theorem
Proof (continued).Similarly, there exists a δ′ such that f (x) < 0 for all x ∈ [a, a+ δ′), so α isat least as large as a+ δ′. It follows that a < α < b.We will now show that f (α) = 0, by proving that f (α) > 0 and f (α) < 0lead to contradictions.First, suppose f (α) < 0. Then by Lemma 2.5.11, there exists a δ > 0 suchthat f (x) < 0 for x ∈ (α− δ,α+ δ). Since α = supA, for this δ thereexists some x0 ∈ A such that α− δ < x0 < α. (Otherwise, α− δ would bean upper bound for A, giving a contradiction.) This means that f (x) < 0for all x ∈ [a, x0]. Since f (x) < 0 for x ∈ (α− δ,α+ δ) and x0 > α− δ, itfollows that f (x) < 0 for x ∈ [a,α+ δ). But this contradicts theassumption that α is an upper bound for A.The proof for f (α) > 0 is analogous and left to you!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 218 / 624
Real Functions, Convergence and Continuity Continuous Functions
The Bolzano Intermediate Value Theorem
An immediate corollary is the following useful result:2.5.13. Bolzano Intermediate Value Theorem. Let a < b and f : [a, b]→ Rbe a continuous function. Then for y ∈ [minf (a), f (b), maxf (a), f (b)]there exists some x ∈ [a, b] such that y = f (x).
Proof.If f (a) = f (b), the statement is trivially true. Suppose that f (a) ≤ f (b).Given y , we apply Theorem 2.5.12 to the function g : [a, b]→ R,g(x) = f (x)− y . If f (a) > f (b), we apply Theorem 2.5.12 to the functiongiven by g(x) = f (a+ b − x)− y .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 219 / 624
Real Functions, Convergence and Continuity Continuous Functions
A Fixed Point Theorem
There are many applications of the intermediate value theorem. Mostinvolve choosing the function f to which the theorem is applied in a smartway. We give one such result here:2.5.14. Theorem. Let f : [a, b]→ R be a continuous function withran f ⊂ [a, b]. Then f has a fixed point, i.e., there exists some x ∈ [a, b]such that f (x) = x .This is an example of a fixed point theorem. (In fact, it is a special case ofBrouwer’s fixed point theorem.) Such results are extremely useful forproving existence and/or uniqueness of very general objects inmathematics.One of the most basic fixed point theorems is called the contractionmapping principle or Banach’s fixed point theorem and is treated in theassignments.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 220 / 624
Real Functions, Convergence and Continuity Continuous Functions
A Fixed Point Theorem
Proof.We assume that f (a) > a and f (b) < b, for otherwise a and b would befixed points. Then f (a)− a > 0 and f (b)− b < 0. Consider the functiong(x) = f (x)− x .Since g(a) > 0 and g(b) < 0, it follows that for some x0 ∈ [a, b]
0 = g(x0) = f (x0)− x0 ⇔ f (x0) = x0,
so x0 is a fixed point for f .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 221 / 624
Real Functions, Convergence and Continuity Continuous Functions
Boundedness of Continuous FunctionsIt turns out that continuity of a function implies its boundedness on closedand bounded domains; we will study this more closely.2.5.15. Lemma. Let Ω ⊂ R be some set and f : Ω → R a function that iscontinuous at some point x0 ∈ Ω. Then there exists a δ > 0 such that f isbounded above on (x0 − δ, x0 + δ) ∩ Ω .Basically, this lemma says that supf (x) : x ∈ Bδ(x0) exists for someδ > 0.
Proof.Since f is continuous at x0, for ε = 1 we can find a δ > 0 such that for allx ∈ Ω
|x − x0| < δ ⇒ |f (x)− f (x0)| < 1
⇒ |f (x)| < |f (x0)|+ 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 222 / 624
Real Functions, Convergence and Continuity Continuous Functions
Boundedness of Continuous Functions
We can now prove a more general theorem.2.5.16. Proposition. Let a < b and f : [a, b]→ R be a continuousfunction. Then f is bounded above.By this result, supf (x) : x ∈ [a, b] exists. Proposition 2.5.16 also impliesthat f is bounded below and hence bounded on [a, b].
Proof.We define
A := x ∈ [a, b] : f is bounded above on [a, x ].
Clearly A is bounded (a ≤ x ≤ b for all x ∈ A) and non-empty (a ∈ A).Therefore, A has a least upper bound a ≤ α ≤ b.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 223 / 624
Real Functions, Convergence and Continuity Continuous Functions
Boundedness of Continuous Functions
Proof (continued).We claim that in fact α = b. Suppose that α < b. By Lemma 2.5.15, weknow that there exists some δ > 0 such that f is bounded above on(α− δ,α+ δ). Since α is the least upper bound of A, there exists somex0 ∈ A such that α− δ < x0 < α. We conclude that f is bounded aboveon [a,α+ δ) and hence α+ δ/2 ∈ A. We have shown that f is bounded above on [a, x ] for every x < b. Notethat this does not automatically imply that f is bounded above on [a, b]!But since there exists a δ > 0 such that f is bounded above on (b − δ, b],we can conclude that f is bounded above on [a, b].
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 224 / 624
Real Functions, Convergence and Continuity Continuous Functions
Boundedness of Continuous Functions
The previous result can be significantly strengthened.2.5.17. Theorem. Let a < b and f : [a, b]→ R be a continuous function.Then there exists a y ∈ [a, b] such that f (x) ≤ f (y) for all x ∈ [a, b].Hence maxf (x) : x ∈ [a, b] exists. Colloquially, we say that “acontinuous functions attains its maximum”.Proof.We define
A := y ∈ R : y = f (x), x ∈ [a, b].
By Proposition 2.5.16, A is bounded and A is clearly non-empty(f (a) ∈ A). Therefore, A has a least upper bound α such that α ≥ f (x)for all x ∈ [a, b]. We will show that α = f (y) for some y ∈ [a, b].
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 225 / 624
Real Functions, Convergence and Continuity Continuous Functions
Boundedness of Continuous Functions
Proof (continued).We argue by contradiction: assume α = f (y) for all y ∈ [a, b]. Then
g : [a, b]→ R, g(x) =1
α− f (x)
is continuous on [a, b] but not bounded above, as we shall see.Since α is the least upper bound of A, for every ε > 0 there exists anx ∈ [a, b] such that α− f (x) < ε. But then for this x , g(x) > 1/ε. Sincewe can make ε as small as desired, g becomes as large as desired, so g isnot bounded above. But this contradicts Proposition 2.5.16.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 226 / 624
Real Functions, Convergence and Continuity Continuous Functions
Inverse FunctionsWe want to say a few words about the inverse of a function. First, we takethe opportunity to introduce some new terms:2.5.18. Definition. Let Ω , Ω ⊂ R and f : Ω → Ω a function. We say that fis
injective if f (x1) = f (x2)⇒ x1 = x2 for all x1, x2 ∈ Ω; surjective if for every y ∈ Ω there exists an x ∈ Ω such that f (x) = y
(i.e., if ran f = Ω); bijective if for every y ∈ Ω there exists a unique x ∈ Ω such that
f (x) = y (i.e., f is injective and surjective).If f : Ω → Ω is bijective we define the inverse of f , denoted by f −1 as
f −1 : Ω → Ω, f (x) 7→ x .
We then say that f is invertible on Ω.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 227 / 624
Real Functions, Convergence and Continuity Continuous Functions
Inverse Functions
If we regard f as a relation
f := (x , f (x)) : x ∈ Ω
we know that (x , y1) = (x , y2) implies y1 = y2 because f is a function. If fis injective, we also know that (x1, y) = (x2, y) implies x1 = x2. Switchingthe entries, we also know that (y , x1) = (y , x2) implies x1 = x2.Therefore, it follows that
f −1 := (f (x), x) : x ∈ Ω
is a function. Due to the surjectivity of f , we can write this set of pairs as
f −1 := (f (x), x) : f (x) ∈ Ω = (y , f −1(y)) : y ∈ Ω
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 228 / 624
Real Functions, Convergence and Continuity Continuous Functions
Inverse FunctionsA function is invertible if and only if it is bijective. But we would like tofind a criterion for bijectivity. If f is strictly monotonic (either increasingor decreasing) on I then, in particular, f (x) = f (y) for x = y . This shouldimply that f is injective and hence bijective in a suitable sense.
2.5.19. Theorem. Let a, b ∈ R ∪ −∞ ∪ ∞ such that a < b. Letf : (a, b)→ R be strictly increasing and continuous. Then
α := limxa
f (x) ≥ −∞, β := limxb
f (x) ≤ ∞,
exist and f : (a, b)→ (α,β) is bijective. Furthermore, the inverse functionf −1 is also continuous and strictly increasing and, furthermore,
limyα
f −1(y) = a, limyβ
f −1(y) = b. (2.5.1)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 229 / 624
Real Functions, Convergence and Continuity Continuous Functions
Inverse Functions
Below we will give a sketch of the proof. Here, “sketch” means that somedetails are left to you to work out.Sketch of Proof.Due to the intermediate value theorem, f takes on every value in (α,β) atleast once [why?], so f is surjective. Since f is strictly increasing it is alsoinjective. Hence the inverse function exists.Denote the inverse function f −1 by g . If y1, y2 ∈ (α,β) with y1 < y2, weclaim that g(y1) < g(y2). If this were not the case,
g(y1) ≥ g(y2) ⇒ f (g(y1) = y1 ≥ f (g(y2)) = y2,
contradicting the fact that f is strictly increasing. Hence g is strictlyincreasing.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 230 / 624
Real Functions, Convergence and Continuity Continuous Functions
Inverse Functions
Sketch of Proof (continued).We now prove that g is continuous from the left. Fix y0 ∈ (α,β) andchoose x0 ∈ (a, b) such that f (x0) = y0. Let (an) be an increasingsequence with an y0. Then the sequence (g(an)) is increasing [why?]and bounded by x0, since
an ≤ y0 ⇒ an ≤ f (x0) ⇒ f (g(an)) ≤ f (x0) ⇒ g(an) ≤ x0
where we have used that f is increasing. In fact, x0 is the least upperbound of g(an) [why?] and so, by Theorem 2.2.24, g(an)→ g(y0). Thisproves that g is continuous from the left. In a similar manner, we canprove that g is continuous from the right.It remains to show (2.5.1). This is left to you!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 231 / 624
Real Functions, Convergence and Continuity Continuous Functions
Inverse Functions
We have seen that a monotonic function on an interval is bijective; thefollowing result gives a converse to that statement.2.5.20. Theorem. Let I ⊂ R be an interval and Ω ⊂ R a set. If f : I → Ωis continuous and bijective, then f is strictly monotonic on I .
Proof.Let a0, b0 ∈ I with a0 < b0 be given. Then either
i) f (b0)− f (a0) > 0 or ii) f (b0)− f (a0) < 0
We will show that if i) is true, then i) holds for any other a1, b1 ∈ I witha1 < b1, i.e., that f is strictly increasing. The proof for the case ii) issimilar and left to the reader.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 232 / 624
Real Functions, Convergence and Continuity Continuous Functions
Inverse FunctionsProof (continued).For 0 ≤ t ≤ 1 define
xt := (1− t)a0 + ta1, yt := (1− t)b0 + tb1.
Then x0 = a0 and x1 = a1 and xt ∈ [a0, a1]. An analogous statement istrue for yt . Thus xt , yt ∈ dom f for all t ∈ [0, 1].Since a0 < b0 and a1 < b1, we have
xt < yt for t ∈ [0.1].
We define the function g : [0, 1]→ R, g(t) = f (yt)− f (xt). Since g is acomposition of continuous functions, it is continuous on [0, 1].Moreover, g(t) = 0 for all t ∈ [0, 1] since xt < yt and f is bijective.Therefore, either g > 0 or g < 0 on [0, 1]. But g(0) > 0 by i), so g(t) > 0for all t ∈ [0, 1]. This implies that i) also holds for a1 < b1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 233 / 624
Real Functions, Convergence and Continuity Continuous Functions
Image and Pre-Image of SetsWe take this opportunity to define some notation for future use:2.5.21. Definition. Let Ω ⊂ R and f : Ω → R. Then for any A ⊂ Ω andB ⊂ ran f
f (A) :=y ∈ R : ∃
x∈Af (x) = y
is called the image of A and
f −1(B) :=x ∈ Ω : ∃
y∈Bf (x) = y
is called the pre-image of B. Note that the symbol f −1(B) makes sensewhether or not f is invertible.
2.5.22. Example. Let f : R→ R, f (x) = x2. Then
f ([−1, 2]) = [0, 4] and f −1([1, 4]) = [−2,−1] ∪ [1, 2].
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 234 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity in Intervals
Up to now we have mainly discussed continuity at single points. Recallthat a function f was said to be continuous on an interval I if it iscontinuous at all points of the interval, i.e., f is continuous on I ⊂ dom fif and only if
∀x∈I∀
ε>0∃
δ>0∀y∈I|x − y | < δ ⇒ |f (x)− f (y)| < ε.
Note that we can exchange the first two quantifiers without changing theexpression, so we obtain
∀ε>0∀x∈I∃
δ>0∀y∈I|x − y | < δ ⇒ |f (x)− f (y)| < ε.
Here the δ > 0 essentially depends on the point x ∈ I ; given ε > 0, wewould be able to choose a different δ > 0 depending on x .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 235 / 624
Real Functions, Convergence and Continuity Continuous Functions
Continuity in Intervals
If, however, a function f is “well-behaved” enough such that we can usethe same δ > 0 for all x ∈ I , i.e., if in fact
∀ε>0∃
δ>0∀x∈I∀y∈I|x − y | < δ ⇒ |f (x)− f (y)| < ε.
we would want to give the function a special attribute.2.5.23. Definition. Let I ⊂ R be an interval and f : Ω → R a function withI ⊂ Ω . Then f is called uniformly continuous on I if and only if
∀ε>0∃
δ>0∀
x ,y∈I|x − y | < δ ⇒ |f (x)− f (y)| < ε.
Clearly, if f is uniformly continuous on I , then f is also continuous on I .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 236 / 624
Real Functions, Convergence and Continuity Continuous Functions
Uniform Continuity on Closed Intervals2.5.24. Theorem. Let f : Ω → R a function with I = [a, b] ⊂ Ω. If f iscontinuous on [a, b] then f is also uniformly continuous on [a, b].
Proof.Suppose that f is continuous but not uniformly continuous on I . Then
∃ε>0∀
δ>0∃
x ,y∈I|x − y | < δ ∧ |f (x)− f (y)| ≥ ε.
Denote this ε by ε0. Then for each δ = 1/n there exist numbers xn, yn ∈ Isuch that
|xn − yn| <1
n∧ |f (xn)− f (yn)| ≥ ε0.
We obtain two sequences (xn) and (yn) that are bounded (why?). By thetheorem of Bolzano-Weierstraß (xn) has a convergent subsequences (xnk ),say with limit ξ. Consider now the sequence (ynk ). It is also bounded andhence has a convergent subsequence (ynkj ), say with limit η.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 237 / 624
Real Functions, Convergence and Continuity Continuous Functions
Uniform Continuity on Closed Intervals
Proof (continued).Since (xnk ) converges to ξ, so does the subsequence (xnkj ). It follows thatwe have subsequences (xnkj ) and (ynkj ), converging to ξ and η,respectively. We will now drop the subscript j , writing simply (xnk ) and(ynk ) for these sequences. Since I is closed, both ξ and η must lie in I .By construction, |xnk − ynk | < 1
nk, so we see that ξ = η. However, then
xnk → ξ ∧ ynk → ξ ∧ |f (xnk )− f (ynk )| ≥ ε0 → 0.
which contradicts the continuity of f at ξ.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 238 / 624
Real Functions, Convergence and Continuity Continuous Functions
OutlookContinuity is an important basic concept in calculus, and so it should notbe surprising that there are several generalizations and furtherdevelopments of continuity. As an example, consider the following result:2.5.25. Corollary. Let Ω ⊂ R and f : Ω → R continuous. Suppose thatI ⊂ Ω is a closed interval. Then the image
f (I ) =y ∈ R : ∃
x∈If (x) = y
is also a closed interval.
Proof.The continuity of f on I implies that there exist α := infx∈I f (x) andβ := supx∈I f (x) so that f (I ) ⊂ [α,β]. Furthermore, by Theorem 2.5.17,α,β ∈ f (I ). Then by the Intermediate Value Theorem 2.5.13,[α,β] ⊂ f (I ), so f (I ) = [α,β].
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 239 / 624
Real Functions, Convergence and Continuity Continuous Functions
Outlook
Now a closed interval is a special case of the following:2.5.26. Definition. A closed and bounded set K ⊂ R is called compact.It is possible to prove the following:2.5.27. Theorem. Let K ⊂ R be a compact set and f : K → R becontinuous. Then f is uniformly continuous on K , f is bounded and has amaximum and minimum. Furthermore, the image f (K ) is also compact.This result will be proven in the next term, where it will be very useful tous in the proof of various generalizations of Theorems 2.5.17 and 2.5.24.It turns out that if we want to consider functions of several variables, thehigher-dimensional analogue of a closed interval is a compact set and thebasic results that we have proven here carry over to continuous functionson compact sets.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 240 / 624
Real Functions, Convergence and Continuity Continuous Functions
First Midterm Exam
The preceding material completes the first third of the course material. Itencompasses everything that will be the subject of the First MidtermExam.The exam date and time will be announced on SAKAI.No calculators or other aids will be permitted during the exam. A sampleexam with solutions has been uploaded to SAKAI. Please study it carefully,including the instructions on the cover page.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 241 / 624
Differential Calculus in One Variable
Part III
Differential Calculus in One Variable
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 242 / 624
Differential Calculus in One Variable
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 243 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 244 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Finer Analysis of FunctionsWe have seen that even functions whose definitions appear quitestraightforward can exhibit a lot of complexity in their behavior. When wefirst started introducing functions, our goal was to develop mathematicalmethods to describe and analyze this behavior. In this respect, thedefinition of limits and Landau symbols was a relatively modest step - wewere describing the values of the function.Of course, the behavior of a function is not just described by its numericalvalues. For example, the graphs of the functions f (x) = x2 andg(x) = 3x2 + x − 4 look quite different at x = 1, even thoughf (1) = g(1) = 1.The next step in analyzing a function is therefore to look beyond theactual values to the change of the values: Is the function monotonic in aneighborhood of a certain point? Do the values attain a maximum or aminimum? Or is the situation more complex than that?
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 245 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Linear Approximations
In order to answer (some of) these questions, it is useful to consider thesimplest non-constant type of function, the linear map
L : R→ R, L(h) = α · h for α ∈ R.
Observing the graph, it is immediately obvious whether a linear map isincreasing, strictly increasing, constant or (strictly) decreasing.We hence aim to describe the behavior of an arbitrary function f at apoint x0 by comparing it with a linear map L. In fact, we want to be ableto use a linear map as a linear approximation to f near x0. It will turn outthat if such an approximation exists it is unique and we can hence speak ofa linearisation of f .We will consider a basic example before giving the general definition.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 246 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Linear Approximation of the Square RootProblem: Calculate
√9.15 approximately.
Approach:√9 = 3 is known, write 9.15 = 9 + 0.15, where 0.15 is
considered “small”. Generalize: x = 9 and h = 0.15.Goal: Find a linear approximation of the square root
√x + h near √x .
What form does such a linear approximation take? We aim for√x + h ≈
√x + Lx · h (3.1.1)
where Lx ∈ R is a suitable constant that in general depends on x .Equation (3.1.1) is called a linear approximation because we use Lx · h (alinear function of h) to approximate
√x + h.
Our goal is now to find Lx , which will in general depend on x (but isindependent of h).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 247 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Linear ApproximationsWe now keep x > 0 fixed throughout the discussion. Instead of theapproximate identity (3.1.1) we write
√x + h =
√x + Lx · h + r(x , h) (3.1.2)
where r(x , h) is some remainder function which should become small whenh approaches zero. Let us formally solve for Lx :
√x + h =
√x + Lx · h + r(x , h)
⇔√x + h −
√x − r(x , h) = Lx · h
⇔ Lx =
√x + h −
√x
h− r(x , h)
h. (3.1.3)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 248 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Finding L
Since L is independent of h, we can let h approach zero on the right-handside of (3.1.3). Let us require that
r(x , h)
h→ 0 as h→ 0. (3.1.4)
The requirement (3.1.4) means that the remainder r(x , h) = o(h) ash→ 0, so the remainder is “much smaller” than the linear approximation.Then
Lx = limh→0
√x + h −
√x
h.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 249 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Linear Approximation of the Square RootLet us find Lx explicitly. For this, we use the following argument:
√x + h −
√x
h=
(√x + h −
√x)(√x + h +
√x)
h(√x + h +
√x)
=x + h − x
h(√x + h +
√x)
=1√
x + h +√x
Since x > 0, we can now easily see that
Lx = limh→0
√x + h −
√x
h=
1
limh→0
(√x + h +
√x)
=1
2√x
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 250 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Linear ApproximationsWe have shown that
√x + h =
√x +
h
2√x+ o(h)
To answer our original question:√9.15 ≈
√9 +
0.15
2√9= 3.025.
This is actually a good approximation. The exact (rounded) value is√9.15 = 3.024897.
We can generalize this example simply by substituting an arbitraryfunction f for the square root. However, it turns out that not all functionscan be approximated linearly at every point; this is a very nice property ofa function and hence deserves an explicit definition.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 251 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Differentiability
3.1.1. Definition. Let Ω ⊂ R be a set, x ∈ Ω an interior point of Ω andf : Ω → R a real function. Then we say that f is differentiable at x ifthere exists a linear map Lx such that for all sufficiently small h ∈ R
f (x + h) = f (x) + Lx(h) + o(h) as h→ 0. (3.1.5)
We say that f is differentiable on some open set U ⊂ Ω if f isdifferentiable at every point of U.We say that f is differentiable if its domain is an open set and f isdifferentiable at every point of the domain.
3.1.2. Lemma. For given f and x the linear map Lx in (3.1.5) is unique.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 252 / 624
Differential Calculus in One Variable Differentiation of Real Functions
DifferentiabilityThe map Lx : R→ R is a linear map, i.e., Lx(h) = αh for some α ∈ R. Itquickly becomes tedious to use two different symbols to describe the samemap, so we commit some slight abuse of notation: we will use the symbolLx to denote
1. the function Lx : R→ R as well as2. the number α ∈ R such that Lx(h) = α · h.
This means that we write
Lx : R→ R, Lx(h) = Lx · h.
This “double meaning” of Lx as a function and as a number may beconfusing at first, but will later seem very natural.Since by Lemma 3.1.2 the map Lx is unique, we may define the derivativeof f at x , denoted by f ′(x), by
f ′(x) = Lx .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 253 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Derivative
Proof of Lemma 3.1.2.Assume that two maps Lx and Mx exist such that
f (x + h) = f (x) + Lx(h) + o(h) = f (x) +Mx(h) + o(h) as h→ 0.
Then
Lx(h)−Mx(h) = (Lx −Mx)h = o(h) as h→ 0. (3.1.6)
But this mean that for all C > 0 there exists some ε > 0 such that for|h| < ε Since
|(Lx −Mx)h| < C |h| or, equivalently, |Lx −Mx | < C .
But this is possible only if Lx = Mx .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 254 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Examples of Differentiable Functions
3.1.3. Examples.(i) f : R→ R, f (x) = c has derivative f ′(x) = 0 for any x ∈ R.(ii) f : R→ R, f (x) = x has derivative f ′(x) = 1 for any x ∈ R.(iii) f : R→ R, f (x) = x2 has derivative f ′(x) = 2x for any x ∈ R.(iv) Let f : R→ R, f (x) = xn, n ∈ N. By the binomial formula,
(x + h)n =n∑
k=0
(n
k
)xn−khk = xn + nxn−1h + o(h) as h→ 0.
It follows thatf ′(x) = nxn−1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 255 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Examples of Differentiable Functions3.1.4. Lemma. The derivative of the function f : R \ 0 → R,f (x) = 1/x , is given by
f ′(x) = − 1
x2.
Proof.We first show that
1
1 + x= 1− x + o(x) as x → 0.
Indeed,
limx→0
1
x
(1
1 + x+ x − 1
)= lim
x→0
1
x
1 + (x − 1)(x + 1)
1 + x= lim
x→0
x
1 + x= 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 256 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Examples of Differentiable Functions
Proof (continued).Now
1
x + h=
1
x
1
1 + h/x=
1
x
(1− h
x+ o(h)
)=
1
x− 1
x2h + o(h)
as h→ 0. This implies that (1
x
)′= − 1
x2
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 257 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Geometric Interpretation of the DerivativeGeometrically, a linear approximation to a function can be visualised as atangent line to the graph of the function, where the slope of the tangent isequal to the derivative. In order to obtain the tangent to a graph at apoint geometrically, one considers secants through the fixed point andpoints on the graph progressively approaching the first.Let (x1, f (x1)) and (x2, f (x2)) be two points on the graph of a function f .Then the secant line through these two points is given by
S(x ; x1, x2) =f (x2)− f (x1)
x2 − x1(x − x1) + f (x1).
If we write x2 = x1 + h, the the secant line can be written as
S(x ; x1, x1 + h) =f (x1 + h)− f (x1)
h(x − x1) + f (x1).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 258 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Geometric Interpretation of the Derivative
We can use the slope, f (x1+h)−f (x1)h , to obtain an equivalent definition of
the derivative.3.1.5. Theorem. Let Ω be a set, x ∈ Ω an interior point and f : Ω → R afunction that is differentiable at x with derivative Lx = f ′(x). Then
f ′(x) = limh→0
f (x + h)− f (x)
h. (3.1.7)
Furthermore, if the limit in (3.1.7) exists for an interior point x ∈ Ω , thenf is differentiable at x and the derivative is given by (3.1.7).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 259 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Geometric Interpretation of the DerivativeProof.Assume first that f is differentiable at x . We will show that (3.1.7) holds.Since
f (x + h) = f (x) + f ′(x) · h + o(h) as h→ 0,
we have
limh→0
f (x + h)− f (x)
h= lim
h→0
f (x) + f ′(x) · h + o(h)− f (x)
h
= limh→0
f ′(x) · h + o(h)
h= f ′(x) + lim
h→0
o(h)
h
Since o(h) denotes a function g such that g(h)/h→ 0 as h→ 0, we seethat
limh→0
f (x + h)− f (x)
h= f ′(x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 260 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Geometric Interpretation of the DerivativeProof (continued).Now let
limh→0
f (x + h)− f (x)
h= Lx . (3.1.8)
We will show that then f (x + h) = f (x) + Lx · h + o(h), proving that f isdifferentiable at x and that f ′(x) = Lx . From (3.1.8) it follows that
f (x + h)− f (x)
h= Lx + r(x , h),
where the remainder r(x , h)→ 0 as h→ 0. We can rearrange thisequation to yield
f (x + h) = f (x) + Lxh + hr(x , h),
where hr(x , h)/h→ 0 as h→ 0. But this means hr(x , h) = o(h).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 261 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Physical Interpretation of the DerivativeIn physics one is often interested in the dependence of one physicalquantity on another, for example the dependence of position on time, or ofelectrical conductivity on temperature. Let us consider the first case,where the position x of a point mass on a line depends on time t.Here we write x(t) for the position at time t. The derivative x ′(t) thentells us the rate of change of the position x at time t: if x ′(t) > 0, then xis increasing at t, if x ′(0) < 0, x is decreasing at t.The rate of change of the position is called the (instantaneous) velocity,which is again a function of t, denoted by v(t) = x ′(t). If we define theaverage velocity as
distance travelledtime taken =
x2 − x1t2 − t1
we arrive at a similar formula to that of the secant through the points(t1, x1) and (t2, x2).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 262 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Physical Interpretation of the Derivative
As before, taking the limit t2 → t1 we obtain the instantaneous velocity as
v(t) = x ′(t) = lim∆t→0
x(t +∆t)− x(t)
∆t.
Similarly, the rate of change of the velocity is called the acceleration,
a(t) = v ′(t),
and the rate of change of the acceleration is called the jerk,
j(t) = a′(t).
Hence the acceleration is the derivative of the derivative of the position.We are thus led naturally to introduce the concept of higher derivatives.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 263 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Higher DerivativesConsider a differentiable real function f . Then for every point x ∈ dom fwe can find precisely one real number f ′(x). Thus we can regard the map
f ′ : dom f → R, x 7→ f ′(x)
as a real function, which we call the derivative of f . The function f ′ mayitself be differentiable; in this case we use the notation
(f ′)′(x) = f ′′(x)
to denote the derivative of f ′ at x , and call f ′′(x) the second derivative off at x . If f ′′(x) exists, we say that f is twice differentiable at x , and if f ′is differentiable, we say that f is twice differentiable.This can of course be developed further, so that we say that f is n timesdifferentiable at f if
(... ((f ′)′)′ ... )′︸ ︷︷ ︸n times
(x)
exists.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 264 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Higher DerivativesWe would hence have
x ′(t) = v(t), x ′′(t) = a(t), x ′′′(t) = j(t)
for the velocity, acceleration and jerk, respectively.3.1.6. Notation. For the first three derivatives, one writes f ′, f ′′ and f ′′′,respectively. For higher derivatives, one writes
f (4), f (5), f (6), ...
In physics, for time-dependent quantities only, a slightly different notationgoing back to Newton is often used. There one denotes the first threederivatives by dots instead of primes, i.e.,
x(t) = v(t), x(t) = a(t),...x (t) = j(t)
These dots are only used for the first three derivatives, because inmechanics higher derivatives very rarely occur (for instance, Newton’sequations of motion often only contain first and second derivatives).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 265 / 624
Differential Calculus in One Variable Differentiation of Real Functions
SmoothnessGeometrically, a function f is differentiable at x if we can find a tangent tothe graph of f at (x , f (x), i.e., if the limit of the slopes of secants exists.It is easy to find an example of a function where this is not the case at asingle point: the function
f (x) = |x |has derivative
f ′(x) =
1 x > 0
−1 x < 0
but the derivative is not defined for x = 0. In fact,
limh0
|0 + h| − |0|h
= limh0
h
h= 1
whilelimh0
|0 + h| − |0|h
= limh0
−hh
= −1
so f ′(0) = limh→0|0+h|−|0|
h does not exist.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 266 / 624
Differential Calculus in One Variable Differentiation of Real Functions
SmoothnessThe fact that f (x) = |x | is not differentiable at 0 is clearly visible by thesharp corner in the graph. As a first result we can note:3.1.7. Remark. A continuous function is not necessarily differentiable.However, if a function is differentiable, it is necessarily continuous:3.1.8. Lemma. Let f be a function that is differentiable at somex ∈ dom f . Then f is continuous at x .
Proof.Since f is differentiable at x , we can write
f (x + h) = f (x) + f ′(x)h + o(h).
In order to show that f is continuous at x we need to show thatlimy→x f (y) = f (x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 267 / 624
Differential Calculus in One Variable Differentiation of Real Functions
SmoothnessProof (continued).We can instead write
limy→x
f (y) = limh→0
f (x + h)
(why?) Now
limh→0
f (x + h) = limh→0
f (x) + limh→0
f ′(x)h + limh→0
o(h)
if all the limits on the right exist. Since limh→0o(h)h = 0 we have
limh→0
o(h) = limh→0
ho(h)
h= lim
h→0h limh→0
o(h)
h= 0.
Hence limh→0
f (x + h) = limh→0
f (x) so f is continuous.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 268 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Smoothness
This shows that the differentiable functions are a “special” or “nice”subset of the continuous functions. We often say that they are “smoother”than general continuous functions, alluding to the fact that they cannothave any sharp edges.Of course this can be generalized to higher derivatives; a function that istwice differentiable will be “smoother” than a merely once differentiablefunction. We make one obvious remark:3.1.9. Remark. If f is differentiable at x , f ′ is not necessarily continuous atx . A function whose derivative is also continuous (at x) is calledcontinuously differentiable (at x).We will later give an example of a function that is differentiable, but notcontinuously differentiable at x = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 269 / 624
Differential Calculus in One Variable Differentiation of Real Functions
SmoothnessWe have looked at an example of a function that is not differentiable at asingle point; there seems to be no reason why there should not be afunction that is continuous but not differentiable at any point. Anexample of such a function is the self-similar Van der Waerden function,pictured below.
0 1
8
1
4
3
8
1
2
5
8
3
4
7
81
0.0
0.1
0.2
0.3
0.4
0.5
0.6
We will later construct a modified version of this function (see Slides 432onwards).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 270 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Smoothness
The van der Waerden function is defined as an infinite sum of simplecontinuous functions. We will only later be able to study the definition ofthis function and prove its properties.We hence have a certain hierarchy of smoothness at a point x ∈ R:
arbitrary functions functions continuous at x
functions differentiable at x functions continuously differentiable at x
functions twice differentiable at x ...
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 271 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Smoothness
For an open interval I ⊂ R we have finer gradations of smoothness: arbitrary functions functions continuous on I
functions uniformly continuous on I
functions differentiable on I
functions continuously differentiable on I
functions differentiable on I with uniformly continuous derivative functions twice differentiable at x ...
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 272 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Differentiation
As usual, we will not always calculate derivatives by hand; rather we willcalculate some derivatives of basic functions and then obtain thederivatives of more complicated functions using general theorems.3.1.10. Lemma. Let f , g be functions on R, x ∈ dom f ∩ dom g andassume that f and g are both differentiable at x . Then
(f + g)′(x) = f ′(x) + g ′(x).
Furthermore,
(λf )′(x) = λf ′(x) for λ ∈ R.
This result is a simple consequence of the definitions.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 273 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Differentiation
Using Lemma 3.1.10 we can now calculate the derivative of anypolynomial. If p(x) =
∑nk=0 ckx
k , then
p′(x) =n∑
k=1
ckkxk−1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 274 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Product RuleWe may also find the derivative of the product of two functions:3.1.11. Lemma. Let f and g be real functions with x ∈ dom f ∩ dom gsuch that f and g are differeniable at x . Then
(f · g)′(x) = f ′(x)g(x) + g ′(x)f (x).
Proof.We just write out the definitions and use what we know about Landausymbols:
(f · g)(x + h) = f (x + h)g(x + h)
=(f (x) + f ′(x)h + o(h)
)(g(x) + g ′(x)h + o(h)
)= f (x)g(x) +
(f ′(x)g(x) + g ′(x)f (x)
)h + f ′(x)g ′(x)h2
+ o(h)[f ′(x)h + g ′(x)h + o(h)]
= f (x)g(x) +(f ′(x)g(x) + g ′(x)f (x)
)h + o(h)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 275 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Chain Rule
In order to calculate the derivatives of more complicated functions, weneed to find a formula for the derivative of the composition of functions.This result is commonly called the chain rule.3.1.12. Theorem. Let f , g be real functions and x ∈ R such thatx ∈ dom g , g(x) ∈ dom f are interior points of the domains of g and f ,respectively. Assume further that g is differentiable at x and f isdifferentiable at g(x). Then
(f g)′(x) = f ′(g(x)) · g ′(x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 276 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Chain Rule
Proof.By our assumed differentiability of g at x we have
(f g)(x + h) = f (g(x + h)) = f (g(x) + g ′(x)h + o(h))
as h→ 0. Writing H = g ′(x)h + o(h) and using the differentiability of fat g(x), we have
(f g)(x + h) = f (g(x) + H) = f (g(x)) + f ′(g(x))H + o(H).
NowH = g ′(x)h + o(h) = O(h) + O(h) = O(h)
as h→ 0, so o(H) = o(O(h)) = o(h) as h→ 0 (as proven theassignments).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 277 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Chain RuleProof (continued).We hence have
(f g)(x + h) = f (g(x)) + f ′(g(x))H + o(h)
= f (g(x)) + f ′(g(x))(g ′(x)h + o(h)) + o(h)
= f (g(x)) + f ′(g(x))g ′(x)h + o(h)
as h→ 0, so (f g)(x) = f ′(g(x))g ′(x).Using the chain rule, we can now calculate the derivative of many morefunctions, such as f (x) = x−n, n ∈ N or general quotients:3.1.13. Lemma. Let f and g be real functions with x ∈ dom f ∩ dom gsuch that f and g are differentiable at x and g(x) = 0. Then(
f
g
)′(x) =
f ′(x)g(x)− g ′(x)f (x)
g(x)2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 278 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Quotient RuleProof.Let h(x) = 1/x . Then f
g= f · (h g). Using the product rule,
(f
g
)′(x) = f ′(x) · (h g)(x) + f (x)(h g)′(x)
Now by Lemma 3.1.4 we have h′(x) = −1/x2, and by the chain rule weknow that
(h g)′(x) = h′(g(x))g ′(x),
so (f
g
)′(x) = f ′(x)
1
g(x)+ f (x)
(− 1
g(x)2
)g ′(x)
=f ′(x)g(x)− g ′(x)f (x)
g(x)2
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 279 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Inverse Function TheoremA simple consequence of the quotient rule is that (x−n)′ = (−n)x−n−1 forn ∈ N, n ≥ 1. In fact, we can now differentiate all rational functions(quotients of polynomials). However, there are still some functions we cannot yet differentiate, such as the square root function f (x) =
√x .
Since the square root function is defined as the inverse of g(x) = x2 forx > 0, it is perhaps more fruitful to consider derivatives of inversefunctions in general, rather than trying to obtain the derivative of everyfunction with a rational exponent separately.3.1.14. Theorem. Let I be an open interval and let f : I → R bedifferentiable and strictly monotonic. Then the inverse mapf −1 = g : f (I )→ I exists and is differentiable at all points y ∈ f (I ) forwhich f ′(g(y)) = 0. Furthermore,
g ′(y) =1
f ′(g(y)).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 280 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Inverse Function TheoremProof.Let y = f (x) and y + H = f (x + h). Then
y + H = f (x + h) = f (x) + f ′(x)h + o(h)
= y + f ′(g(y))h + o(h)
= y + (f ′(g(y)) + o(1))h
as h→ 0. Assuming that f ′(g(y)) = 0, this gives
h =1
f ′(g(y)) + o(1)H as h→ 0.
Note that h = g(y + H)− g(y) and g is continuous, so h→ 0 if H → 0.By the continuity of g we then have
h =1
f ′(g(y)) + o(1)H = O(H) as H → 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 281 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Inverse Function TheoremProof (continued).This allows us to use
y + H = f (x + h) = f (x) + f ′(x)h + o(h) = y + f ′(g(y))h + o(h)
as h→ 0 to deduce
h =1
f ′(g(y))H + o(h) as h→ 0,
=1
f ′(g(y))H + o(O(H)) as H → 0,
=1
f ′(g(y))H + o(H) as H → 0.
Then, as H → 0,
g(y + H) = x + h = g(y) +1
f ′(g(y))H + o(H)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 282 / 624
Differential Calculus in One Variable Differentiation of Real Functions
The Inverse Function TheoremThe main difficulty in the proof of the previous theorem was showing thatf −1 was differentiable; if we know this, then the derivative is computedfrom the chain rule, because
y = f (g(y)),
so1 = y ′ = f ′(g(y))g ′(y)
and hence g ′(y) = 1/f ′(g(y)).We are now able to differentiate all algebraic functions and their inverses.For example,
(√x)′ =
1
2√x.
In general, we can show that (xp)′ = pxp−1 for all rational numbers p.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 283 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Leibniz Notation
Next to the notation f ′ for the derivative of f there is also a notationgoing back to Leibniz, which writes
f ′ =dy
dx.
This is of course inspired by taking the derivative as a “differentialquotient”
f ′(x0) = lim∆x→0
∆y
∆x
where ∆x = x − x0 and ∆y = f (x)− f (x0). Leibniz notation has provento be extremely versatile and flexible, and has become extremely popularin mathematics.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 284 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Leibniz NotationThere are some fine points one should be aware of:
1. Leibniz notation does not differentiate between f ′(x) and f ′; both canbe written as df
dx.
2. Leibniz himself regarded df and dx as independent infinitesimalelements, called differentials and df
dxas the differential quotient. For
the moment, we are not going to treat these individually, but only inthe form of a fraction.
There is a way around the first problem: we can use the notation |x0 , read“at x0,” to denote a function evaluated at a specific point. For example,
f (x0) = f |x0 = f (x)|x=x0
are all notations for the same number. Hence
f ′(x0) =df
dx
∣∣∣∣x0
can be used to indicate the derivative evaluated at a specific point.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 285 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Leibniz NotationWhile it at first might look like this ambiguity of df /dx (does it indicate afunction or the value of a function?) is a disadvantage, it turns out thatprecisely this double meaning can be very useful to write out formulasefficiently. Furthermore, in advanced mathematics such a double meaningis often built into the statement, and a notation that refects this is veryuseful.In Leibniz notation, the chain rule for f g becomes
df
dx=
df
dg· dgdx
,
and the inverse function theorem can be writtendx
dy=
1dydx
.
While these formulae are useful mnemonics, they clearly need to beinterpreted correctly!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 286 / 624
Differential Calculus in One Variable Differentiation of Real Functions
Leibniz NotationAnother advantage of Leibniz notation is that it becomes natural to write
df
dx=
d
dxf ,
where d
dxis a so-called differential operator that takes a function and
returns its derivative. For example,d
dx(4x3) = 12x2.
For higher derivatives, we write
f ′′ =d2f
dx2, f ′′′ =
d3f
dx3, f (n) =
dnf
dxn.
This allows us to writed
dx
(d
dxx5)
=d2
dx2x5 = 20x3.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 287 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 288 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Extrema of Real Functions
Now that we know how to differentiate and have established the basicnotation, we can at last treat our basic objective: the analysis of thebehavior of functions. We begin by looking at extrema, i.e., maxima andminima of functions.3.2.1. Definition. Let f be a function and Ω ⊂ dom f . Then x ∈ Ω iscalled a (global) maximum point for f on Ω if
f (x) ≥ f (y) for all y ∈ Ω.
The number f (x) is called the maximum value of f on Ω.A minimum point/minimum value is defined similarly.We will use derivatives to characterize maximum and minimum points.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 289 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Extrema of Real Functions
3.2.2. Theorem. Let f be a function and (a, b) ⊂ dom f an open interval.If x ∈ (a, b) is a maximum (or minimum) point for f on (a, b) and if f isdifferentiable at x , then f ′(x) = 0.
Proof.We will consider the case of a maximum point only. Clearly, the geometricinterpretation of the derivative is well-suited to this theorem: we want toshow that the tangent at the maximum point is horizontal, i.e., has slopezero.Assume that f has a maximum at x . For sufficiently small h,x + h ∈ (a, b), so
f (x + h)− f (x) ≤ 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 290 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Extrema of Real Functions
Proof (continued).If h > 0, then
f (x + h)− f (x)
h≤ 0
and hencelimh0
f (x + h)− f (x)
h≤ 0.
Similarly,limh0
f (x + h)− f (x)
h≥ 0.
But since f is differentiable at x , these limits must be equal, so they mustboth be zero.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 291 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Extrema of Real Functions3.2.3. Definition. Let f be a real function and Ω ⊂ dom f . Then x ∈ Ω iscalled a local maximum (minimum) point for f on Ω if there exists someε > 0 such that x is a maximum (minimum) point for f on Ω ∩ Bε(x).
3.2.4. Remark. If f is defined on an open interval (a, b) and has a localmaximum (minimum) at x ∈ (a, b), and if f is differentiable at x , thenf ′(x) = 0.
3.2.5. Definition. A function f is said to have a critical point at x ∈ dom fif f ′(x) = 0. The value f (x) is then called a critical value of f .Therefore, when finding the (local and global) maxima and minima of afunction on a set Ω ⊂ dom f , we need to check
1. the critical points of f ,2. the boundary points of Ω,3. interior points of Ω where f is not differentiable.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 292 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Rolle’s Theorem
A basic result which will allow us to develop our analysis further is thefollowing, named after the French mathematician Rolle.3.2.6. Rolle’s Theorem. Let f be a real function and a < b ∈ R such that[a, b] ∈ dom f . Assume that f is continuous on [a, b] and differentiable on(a, b) and that f (a) = f (b). Then there exists a number x ∈ (a, b) suchthat f ′(x) = 0.
Proof.Since f is continuous, it will have a maximum and a minimum value on[a, b]. If either is in (a, b), then f ′ vanishes there by Theorem 3.2.2, so weare finished. Assume that both the maximum and the minimum are at a orb. Since f (a) = f (b), it follows that f is constant, so f ′(x) = 0 for any xin (a, b).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 293 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
The Mean Value TheoremRolle’s theorem may not look spectacular, but it already contains the proofof the powerful mean value theorem:3.2.7. Mean Value Theorem. Let f be a real function and a < b ∈ R suchthat [a, b] ∈ dom f . Assume that f is continuous on [a, b] anddifferentiable on (a, b). Then there exists a number x ∈ (a, b) such that
f ′(x) =f (b)− f (a)
b − a.
Proof.Let
h(x) = f (x)− f (b)− f (a)
b − a(x − a).
Then h(a) = h(b) = f (a) and by Rolle’s theorem there exists somex ∈ (a, b) such that h′(x) = 0.
Since h′(x) = f ′(x)− f (b)− f (a)
b − a, this proves the theorem.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 294 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
The Mean Value TheoremWe can now prove a seemingly “obvious” statement, which, however,requires the powerful mean value theorem.3.2.8. Corollary. Let f be a real function and I ⊂ dom f . Assume thatf ′ = 0 on I . Then f is constant on I .
Proof.Let a, b ∈ I , a < b. Then there exist some x ∈ (a, b) such that
f ′(x) =f (b)− f (a)
b − a.
But since f ′ = 0 on I , it follows that f (a) = f (b).
3.2.9. Corollary. Let f , g be real functions and I ⊂ dom f ∩dom g . Assumethat f ′ = g ′ on I . Then there exists some c ∈ R such that f = g + c .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 295 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
The Mean Value Theorem
The following corollary is of great importance for analyzing the behavior offunctions:3.2.10. Corollary. Let f be a real function and I ⊂ dom f . Assume thatf ′ > 0 on I . Then f is strictly increasing on I . If f ′ < 0 on I , f is strictlydecreasing on I .
Proof.Consider the case where f ′ > 0. Let a, b ∈ I , a < b. Then there existsome x ∈ (a, b) such that
f ′(x) =f (b)− f (a)
b − a> 0.
Since b > a this implies f (b) > f (a).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 296 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Maxima and Minima3.2.11. Theorem. Let f be a real function and x ∈ dom f such thatf ′(x) = 0. If f ′′(x) > 0, then f has a local minimum at x ; if f ′′(x) < 0,then f has a local maximum at x .
Proof.We know that
f ′(x + h) = f ′(x) + f ′′(x)h + o(h) =(f ′′(x) + o(1)
)h as h→ 0
Assume that f ′′(x) > 0. Then for sufficiently small h,
f ′(x + h) > 0 if h > 0,f ′(x + h) < 0 if h < 0.
This means that f is strictly increasing to the right of x and strictlydecreasing to the left of x . Thus f has a minimum at x .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 297 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Maxima and Minima
3.2.12. Theorem. Let f be a real function and x ∈ dom f such that f ′′(x)exists. If f has a local minimum at x , then f ′′(x) ≥ 0; if f has a localmaximum at x , then f ′′(x) ≤ 0.
Proof.Suppose f has a minimum at x . If f ′′(x) < 0, then f also has a maximumat x . But this means that f is constant, therefore f ′′(x) = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 298 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and ConcavityWe have seen how to find the extrema of real functions and also know howto determine intervals of monotonicity. However, an increasing functionmay exhibit one of two fundamentally different shapes:
1
x
y
y = x
y = x2
The graph of f (x) = x2 stays belowthe secant joining the origin withthe point (1, 1), while the graph ofg(x) =
√x stays above this line.
In fact, if we select any two pointson the graph of f and join themwith a line, the graph of f will re-main below these points. A corre-sponding statement is true for g .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 299 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and ConcavityThese observations motivate the following definition:3.2.13. Definition. Let Ω ⊂ R be any set and I ⊂ Ω an interval. A functionf : Ω → R is called strictly convex on I if for all a, x , b ∈ I with a < x < b,
f (x)− f (a)
x − a<
f (b)− f (a)
b − a.
We say that f is strictly concave on I if −f is strictly convex. If we replace“<” by “≤” above, f is simply called convex and −f concave.
3.2.14. Theorem. Let f : I → R be be strictly convex on I anddifferentiable at a, b ∈ I . Then
(i) For any h > 0 (h < 0) such that a+ h ∈ I , the graph of f over theinterval (a, a+ h) lies below the secant line through the points(a, f (a)), (a+ h, f (a+ h)).
(ii) The graph of f over all of I lies above the tangent line through thepoint (a, f (a)).
(iii) If a < b, then f ′(a) < f ′(b).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 300 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and ConcavityProof.
(i) Let a < x < a+ h, h > 0. The secant line through the points(a, f (a)) and (a+ h, f (a+ h)) is given by
ψa,a+h : R→ R, ψa,a+h(t) = f (a) +ma,a+h(t − a),
while the line through the points (a, f (a)) and (x , f (x)) is
ψa,x : R→ R, ψa,x(t) = f (a) +ma,x(t − a)
with
ma,a+h =f (a+ h)− f (a)
h, ma,x =
f (x)− f (a)
x − a.
Since f is strictly convex, ma,x < ma,a+h. This implies
f (x) = ψa,x(x) < ψa,a+h(x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 301 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and ConcavityProof.(ii) We show that the graph of f to the right of a lies above the tangent
line through a. A similar argument can be used for the graph of f tothe left of a. Let 0 < h1 < h2. Then, by the convexity of f ,
f (a+ h1)− f (a)
h1<
f (a+ h2)− f (a)
h2
It follows that the function
g(h) =f (a+ h)− f (a)
h
is strictly decreasing as h 0. Since limh→0
g(h) = f ′(a) we see that
f ′(a) <f (a+ h)− f (a)
h=: ma,a+h for h > 0. (3.2.1)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 302 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and ConcavityProof (continued).
Consider the secant line through the points (a, f (a)) and(a+ h, f (a+ h)), h > 0, given by
ψa,a+h : R→ R, ψa,a+h(t) = f (a) +ma,a+h(t − a).
The tangent line through the point (a, f (a)) is given by
Ta : [a, a+ h]→ R, Ta(t) = f (a) + f ′(a)(t − a).
Since f ′(a) < ma,a+h by (3.2.1) it follows that Ta(x) < ψa,h(x) fora < x ≤ a+ h. In particular,
f (a+ h) = ψa,a+h(a+ h) > Ta(a+ h).
Since this is true for any h > 0, the graph of f to the right of a liesabove the tangent line through a.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 303 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and Concavity
Proof (continued).(iii) By (ii), we know that the graph of the function stays above the
tangent at any point. In particular, f (b) is above the tangent linethrough (a, f (a)), so
f ′(a) < ma,b =f (b)− f (a)
b − a.
Similarly, considering the tangent line through (b, f (b)) we have thatf (a) is above this line, so
f ′(b) > mb,a =f (a)− f (b)
a− b=
f (b)− f (a)
b − a.
This implies f ′(a) < f ′(b).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 304 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and Concavity3.2.15. Lemma. Let I be an open interval, f : I → R differentiable and f ′
strictly increasing. If a, b ∈ I , a < b, and f (a) = f (b), thenf (x) < f (a) = f (b) for all x ∈ (a, b).
Proof.We first show that f (x) ≤ f (a) for all x ∈ (a, b). Suppose that thereexists some x ∈ (a, b) with f (x) > f (a). Then there exists some pointx0 ∈ (a, b) such that f (x0) is a maximum of f for [a, b]. Consequently, byTheorem 3.2.2, f ′(x0) = 0. However, the Mean Value Theorem 3.2.7 forthe interval [a, x0] shows the existence of some x1 ∈ (a, x0) such that
f ′(x1) =f (x0)− f (a)
x0 − a> 0.
Since x1 < x0 but f ′(x1) > f ′(x0), we obtain a contradiction to theassumption that f ′ is strictly increasing.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 305 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and ConcavityProof (continued).Now suppose there exists some x0 ∈ (a, b) with f (x0) = f (a). Sincef (x) ≤ f (a) for all x ∈ [a, b], x0 is a local maximum point for f and sof ′(x0) = 0.Since f ′ is strictly increasing, f can not be constant on [a, x0]. Thus, thereexists an x1 ∈ (a, x0) with f (x1) < f (x0). Applying the Mean ValueTheorem 3.2.7 to the interval [x1, x0] we find an x2 ∈ (x1, x0) such that
f ′(x2) =f (x0)− f (x1)
x0 − x1> 0.
This again contradicts the assumption that f ′ is increasing on I .The most important result in applications is the following:3.2.16. Theorem. Let I be an open interval, f : I → R differentiable and f ′
strictly increasing. Then f is strictly convex.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 306 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Convexity and ConcavityProof.Let a, b ∈ I with a < b. Set g : I → R,
g(x) = f (x)− f (b)− f (a)
b − a(x − a).
Then g ′ is strictly increasing and g(a) = g(b) = f (a). By Lemma 3.2.15,
g(x) < f (a) for x ∈ (a, b).
This impliesf (x)− f (b)− f (a)
b − a(x − a) < f (a)
orf (x)− f (a)
x − a<
f (b)− f (a)
b − a
for x ∈ (a, b). This shows that f is convex.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 307 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Curve Sketching
We now have all the tools we need to sketch the graph of a given function.In orer to perform a sketch, we first need to perform a curve discussion,where we explore the behavior of the curve at all points of its domain. Inparticular, we are interested in
1. the domain and range;2. the continuity and behavior near points of discontinuity;3. the behavior as x → ±∞; in particular asymptotes (straight lines
such that the values of the function approach the points on the lineas x → ±∞).;
4. local and global extrema;5. intervals where the function is increasing, decreasing or constant;6. inflection points, where the second derivative changes sign;7. other remarkable features of the curve.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 308 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Curve SketchingThe object of a sketch of a curve is to visualize the curve for the reader,and to convey all relevant information regarding the curve. Knowing whatto include in a sketch is just as important as knowing what not to include.First, here are some general guidelines:
Decide what area of the coordinate system you will need (for thegraph of a function, consider its domain and range). In most graphs,you should include the origin (0, 0) of the coordinate system.
Make sure your sketch is large enough; generally, it should be as wideas the page on which you are writing, and at least 8 cm high!
Use a ruler to draw the axes. If the sketch includes the origin of thecoordinate system, the axes should intersect there.
Add an arrowhead pointing right to the right end of the horizontalaxis (called the abscissa), and an arrowhead pointing upwards to thetop end of the vertical axis (called the ordinate).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 309 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Curve Sketching Label the axes (generally, “x” for the abscissa and “y” for the
ordinate, but you can use any other suitable variables), writing thelabels under the horizontal arrowhead and on the left of the verticalarrowhead.
You do not need to label the origin (0, 0). It is understood that theaxes intersect at (0, 0). Only if in your sketch the axes do notintersect at the origin should you label the intersection.
Sketch the curve so that all the space provided by the axes is used;for a graph of a function, the maximum should be near the top of theordinate and the minimum should be near the bottom of the ordinate.The domain of the function should be clearly visible in the sketch. Ifthe graph continues for x → ±∞, then draw the curve until you arejust above the ends of the abscissa - do not draw a curve further thanthe horizontal or vertical axes, and do not stop drawing before theends are reached.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 310 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Curve Sketching
Label the curve, e.g., “y = f (x)”, “f ”. You may use little arrows toindicate the curve.
Mark characteristic points of the curve: intersections with the axes,extrema, inflection points, other interesting points. Generally, it issufficient to mark these points on the axes; only in rare cases shouldyou resort to labeling a point (x , y) directly on the graph.
If you do not have exact values for some characteristic points,indicate them through suitable symbols (e.g., x1, x2, ...) and explainthese symbols briefly at the bottom of the graph.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 311 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Curve Sketching
Indicate asymptotes through dashed lines, and label horizontal andvertical asymptotes on the axes of the graph. A slant asymptoteshould be labeled as “y = mx + b” (perhaps using an arrow) in thesame way the graph of the function is labeled. If a horizontalasymptote lies on the abscissa (y = 0), or a vertical asymptote lies onthe ordinate (x = 0), then you should not try to draw the asymptote,but rather indicate through words and an arrow that there is anasymptote on these lines.
Title your sketch (e.g., “Graph of the function f ”).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 312 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Curve Sketching
Things you should generally not do: Mark points on the axes that do not correspond to characteristic
points of the graph; there is really no need to give a “scale” on theaxes.
Draw lines inside the graph; only use dashed lines for asymptotes. Ifyou feel that you need to connect points on the axes with points onthe graph, then your sketch is not clear enough!
All of the above points are guidelines; sometimes it may be necessary tobreak them and to do things differently. However, if you do so, be sure youhave a very good reason!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 313 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
Curve SketchingAs an example, let us sketch the function
I : [1,∞)→ R, I (x) =x − 1
(1 + x)2(3 + x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 314 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
The Cauchy Mean Value Theorem
3.2.17. Theorem. Let f , g be real functions and [a, b] ⊂ dom f ∩ dom g . Iff and g are continuous on [a, b] and differentiable on (a, b), then thereexists an x ∈ (a, b) such that(
f (b)− f (a))g ′(x) =
(g(b)− g(a)
)f ′(x).
Proof.We apply Rolle’s Theorem to
h(x) = f (x)(g(b)− g(a)
)− g(x)
(f (b)− f (a)
).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 315 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
L’Hôpital’s Rule
3.2.18. Theorem. Let f and g be real functions and b ∈ dom f ∩ dom gwith
limxb
f (x) = 0 and limxb
g(x) = 0.
Suppose that there exists a δ > 0 such that f and g are defined anddifferentiable on the interval (b, b+ δ) and g ′(x) = 0 for all x ∈ (b, b+ δ).Suppose further that the limit lim
xbf ′(x)/g ′(x) =: L exists. Then
limxb
f (x)
g(x)= L.
For the following proof, we will change the definition of f and g to ensurethat f (b) = g(b) = 0. This ensures that f and g are continuous at b, butdoes not change any of the statements of the theorem.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 316 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
L’Hôpital’s RuleProof.Let f and g be given as in the theorem with f (b) = g(b) = 0.Furthermore, choose δ > 0 small enough. Then for all x ∈ (b, b + δ) wehave g(x) = 0 by the mean value theorem. (Otherwise, g ′ = 0 somewherein (b, x), which is not allowed.)Fix ε > 0. Choose δ > 0 (perhaps decreasing the choice of δ above) suchthat ∣∣∣∣ f ′(ξ)g ′(ξ)
− L
∣∣∣∣ < ε for all ξ ∈ (b, b + δ).
Choose x ∈ (b, b + δ). Applying the Cauchy mean value theorem, there isa number αx in (b, x) such that
f (x)
g(x)=
f ′(αx)
g ′(αx).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 317 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
L’Hôpital’s Rule
Proof (continued).But then ∣∣∣∣ f (x)g(x)
− L
∣∣∣∣ < ε for all x ∈ (b, b + δ).
Thus f (x)/g(x)→ L as x b.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 318 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
L’Hôpital’s RuleAn analogous version of l’Hopital’s rule applies when considering limits atinfinity:3.2.19. Theorem. Let f and g be real functions such that for some C > 0the interval (C ,∞) ⊂ dom f ∩ dom g and
limx→∞
f (x) = 0 and limx→∞
g(x) = 0.
Suppose further that f and g are defined and differentiable on the interval(C ,∞) and g ′(x) = 0 for all x ∈ (C ,∞). Suppose further that the limitlimx→∞
f ′(x)/g ′(x) =: L exists. Then
limx→∞
f (x)
g(x)= L.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 319 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
L’Hôpital’s RuleProof.Fix ε > 0. By assumption, we can find (and then fix) a C > 0 such thatfor all y > C ∣∣∣∣ f ′(x)g ′(x)
− L
∣∣∣∣ < ε for x > y .
This implies, in particular, that g ′(x) = 0 for x > C . Choose any x > y .Then g(x) = g(y), for otherwise, by Rolle’s theorem, we would haveg ′(z) = 0 for some z ∈ (y , x). Furthermore, for any y > C and x > y wecan apply the Cauchy mean value theorem to the interval [y , x ] to findsome z ∈ (y , x) such that
f (y)− f (x)
g(y)− g(x)=
f ′(z)
g ′(z).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 320 / 624
Differential Calculus in One Variable Properties of Differentiable Real Functions
L’Hôpital’s Rule
Proof (continued).We deduce that for all y > C∣∣∣∣ f (y)− f (x)
g(y)− g(x)− L
∣∣∣∣ < ε for x > y .
The right-hand side does not depend on x , so we can let x →∞ and seethat ∣∣∣∣ f (y)g(y)
− L
∣∣∣∣ ≤ ε for y > C .
This proves that limx→∞
f (x)g(x) = L.
Further versions of l’Hôspital’s theorem will be proved in the recitationclasses.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 321 / 624
Differential Calculus in One Variable Vector Spaces
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 322 / 624
Differential Calculus in One Variable Vector Spaces
Vector SpacesWe have now developed many tools to analyze the functions that weknow, which essentially comprise
1. algebraic functions (quotients of polynomials),2. their inverse functions (root functions) and3. piece-wisely defined functions (e.g., f (x) = |x |).
Of course there are many more functions out there! However, in order tofind and properly define more functions, we need to mathematicallyorganize the “set of all functions” in such a way that we can provetheorems on classes of functions and are able, for example, to considersequences of functions.We have previously added the concept of distance to a set by defining ametric space (M, ϱ) consisting of a set M and a map ϱ : M ×M → R.However, this does not give us enough structure for our purposes, as wealso need to be able to at least add elements of M. Thus, we willintroduce a more suitable setting for our functions.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 323 / 624
Differential Calculus in One Variable Vector Spaces
Vector SpacesOur model for the concept of a vector space is based on R2. From thepoint of view of mathematics, R2 is simply the set of pairs (x1, x2) wherex1, x2 ∈ R. Every pair can also be interpreted as a point in the plane,which we also call R2. However, from a physical point of view we canidentify (x1, x2) with an arrow joining the origin to (x1, x2).
-4 -3 -2 -1 1 2 3 4x1
-2
-1
1
2
3
4
x2
Hx1,x2L
-4 -3 -2 -1 1 2 3 4
x1
-2
-1
1
2
3
4
x2
x1
x2
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 324 / 624
Differential Calculus in One Variable Vector Spaces
Vector Spaces
The classical (physicist’s) notation for a vector in R2 is
x =
(x1x2
).
We will keep the idea of writing(x1x2
)instead of (x1, x2) when it is useful to
do so, but we will drop the arrow in x and simply write x . It will always beclear from context whether x ∈ R or x ∈ R2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 325 / 624
Differential Calculus in One Variable Vector Spaces
Vector SpacesIn physics, vectors are used to indicate directional quantities such asvelocity or forces. In this context, position is also assumed to bedirectional. As is well-known (from school-level physics), vectors can beadded and one can consider multiples of vectors, as follows:
x + y =
(x1x2
)+
(y1y2
)=
(x1 + y1x2 + y2
), λ · x = λ
(x1x2
)=
(λx1λx2
),
where x , y ∈ R2 and λ ∈ R.However, there is no straightforward way to multiply two vectors in R2 andobtain a new vector in R2. And for physical as well as mathematicalapplications, that is often not necessary!Implementing the ideas of the previous discussion, we want to define avector space as a set of elements (“vectors”) that can be added togetheror multiplied by a (real or complex) number to obtain new elements of theset.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 326 / 624
Differential Calculus in One Variable Vector Spaces
Vector Spaces3.3.1. Definition. A triple (V , +, ·) is called a real vector space (or reallinear space) if
1. V is any set;2. +: V × V → V is a map (called addition) with the following
properties: (u + v) + w = u + (v + w) for all u, v ,w ∈ V (associativity), u + v = v + u for all u, v ∈ V (commutativity), there exists an element e ∈ V such that v + e = v for all v ∈ V
(existence of a unit element), for every v ∈ V there exists an element −v ∈ V such that
v + (−v) = e;3. · : R× V → V is a map (called scalar multiplication) with the
following properties: λ · (u + v) = λ · u + λ · v for all λ ∈ R, u, v ∈ V , (λ+ µ) · u = λ · u + µ · u for all λ,µ ∈ R, u ∈ V , (λµ) · u = λ · (µ · u) for all λ,µ ∈ R, u ∈ V .
If we replace R with C, we say that (V , +, ·) is a complex vector (or linear)space.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 327 / 624
Differential Calculus in One Variable Vector Spaces
Examples of Vector SpacesWe will often simply write V instead of (V , +, ·) if the operations · and +are understood.3.3.2. Examples.
1. The set of n-tuples (or n-dimensional vectors)Rn = x = (x1, ... , xn) : x1, ... , xn ∈ R is a real vector space withaddition defined by
x + y = (x1, ... , xn) + (y1, ... , yn)
:= (x1 + y1, ... , xn + yn), x , y ∈ Rn,
and scalar multiplication defined by
λx = λ · (x1, ... , xn) := (λx1, ... ,λxn), λ ∈ R, x ∈ Rn.
We will sometimes abbreviate (x1, ... xn) by writing (xk)nk=1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 328 / 624
Differential Calculus in One Variable Vector Spaces
Examples of Vector Spaces
2. The set Cn = z = (z1, ... , zn) : z1, ... , zn ∈ C is a complex vectorspace with addition defined by
w + z := (w1 + z1, ... ,wn + zn), w , z ∈ Cn, (3.3.1)
and scalar multiplication defined by
λz := (λz1, ... ,λzn), λ ∈ C, z ∈ Cn.
3. The set Cn is a real vector space if we define addition as in (3.3.1)and scalar multiplication
λz := (λz1, ... ,λzn), λ ∈ R, z ∈ Cn.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 329 / 624
Differential Calculus in One Variable Vector Spaces
Examples of Vector Spaces4. The set of real polynomials of degree less than or equal to n,
Pn =p : R→ R : p(x) =
n∑k=0
akxk , a0, ... , an ∈ R
is a real vector space with pointwise addition and scalar multiplication:
(p + q)(x) := p(x) + q(x), (λp)(x) := λp(x).
for p, q ∈ Pn, λ ∈ R.5. More generally, for any set M, the set of maps f : M → R (C) is a
real (real or complex) vector space with pointwise addition and scalarmultiplication
(f + g)(x) := f (x) + g(x), (λf )(x) := λf (x),
for f , g : M → R, λ ∈ R (C).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 330 / 624
Differential Calculus in One Variable Vector Spaces
Examples of Vector Spaces
6. Hence for M = N the set of all real (complex) sequences (an)n∈N is areal (real or complex) vector space.
7. The set of all functions f : R→ R is a real vector space.8. For any open set Ω ⊂ R the set of all continuous functions f : Ω → R
is a real vector space. We denote this set by C (Ω,R) or just C (Ω).9. For any open set Ω ⊂ R and k ∈ N the set of all k times continuously
differentiable functions f : Ω → R is a real vector space. We denotethis set by C k(Ω,R) or just C k(Ω).
10. For any open set Ω ⊂ R we define
C∞(Ω ,R) = C∞(Ω) = f : Ω → R : f ∈ C k(Ω ,R) for all k ∈ N,
which is also a real vector space.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 331 / 624
Differential Calculus in One Variable Vector Spaces
Vector Spaces
3.3.3. Remarks. It is easy to prove that the unit element e ∈ V has theproperties
∀λ∈R (C)
λ · e = e, ∀x∈V
0 · x = e.
Instead of e ∈ V , we will often denote the unit element by 0 ∈ V , usingthe same symbol as for 0 ∈ R. This will simplify notation and generallynot cause any confusion.There can only be one unit element in V : if 0, 0′ ∈ V both have theproperty that v + 0 = v = v + 0′, then 0 = 0′.Let −v be the inverse of v ∈ V , i.e., v + (−v) = 0. Then −v = (−1) · v .If λ · v = 0 for some v ∈ V and λ ∈ R (C), then either v = 0 or λ = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 332 / 624
Differential Calculus in One Variable Vector Spaces
Subspaces
In the above examples, it seems clear that some vector spaces are in asense “contained” in others. For example, any polynomial of degree ≤ n isalso a polynomial of degree ≤ n + 1, so Pn ⊂ Pn+1 as a set. Furthermore,if we apply the addition and scalar multiplication defined in Pn+1 to theelements in Pn, then these elements remain in Pn and, furthermore, thisaddition and scalar multiplication acts in the same way as thecorresponding operations defined in Pn. This motivates the followingdefinition:3.3.4. Definition. Let (V , +, ·) be a real or complex vector space. If U ⊂ Vand (U, +, ·) is also a vector space, then we say that (U, +, ·) is asubspace of (V , +, ·).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 333 / 624
Differential Calculus in One Variable Vector Spaces
Subspaces3.3.5. Example. The set L = (x1, x2) ∈ R2 : x1 + x2 = 0 gives a vectorspace (L, +, ·) with the usual component-wise addition and scalarmultiplication. We first need to check that + is actually a map L× L→ L,i.e., that x + y ∈ L if x , y ∈ L. Note that
x + y = (x1 + y1, x2 + y2)
so we need to check that (x1 + y1) + (x2 + y2) = 0. Since(x1 + y1) + (x2 + y2) = (x1 + x2) + (y1 + y2) = 0 + 0 = 0,
this is the case. Furthermore, the addition is associative and commutativefor x , y ∈ R2, so it will also have these properties for x , y ∈ L. The unitelement is e = (0, 0) ∈ L, and the inverse element to x is−x = (−x1,−x2) ∈ L (because −x1 + (−x2) = −(x1 + x2) = −0 = 0). Ina similar manner, we can check that scalar multiplication is a mapR× L→ L and satisfies all conditions to ensure that (L, +, ·) is a vectorspace.Since L ⊂ R2, (L, +, ·) is a subspace of (R2, +, ·).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 334 / 624
Differential Calculus in One Variable Vector Spaces
SubspacesFortunately, we don’t always have to go to as much trouble as we did inthe previous example to verify that a vector space is a subspace of anothervector space.3.3.6. Lemma. Let (V , +, ·) be a real (complex) vector space and U ⊂ V .If u1 + u2 ∈ U for u1, u2 ∈ U and λu ∈ U for all u ∈ U and λ ∈ R (C),then (U, +, ·) is a subspace of (V , +, ·).
Proof.We consider the case of a real vector space. Assume that u1 + u2 ∈ U foru1, u2 ∈ U and λu ∈ U for all u ∈ U and λ ∈ R. Commutativity andassociativity of + are assured since they have these properties for allelements of V . Furthermore, e = 0 · v for any v ∈ V , so taking someu ∈ U, we have e = 0 · u ∈ U by our hypotheses. In the same way,(−1) · u = −u for any u ∈ U. Thus U contains the neutral element andinverse elements for every u ∈ U. All other properties of the maps + and ·are inherited from their properties in (V , +, ·).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 335 / 624
Differential Calculus in One Variable Vector Spaces
SubspacesJust as we write V for (V , +, ·) when the operations are understood, wealso write U ⊂ V to indicate a subspace U of V .3.3.7. Examples.
1. C (R) ⊃ C 1(R) ⊃ · · · ⊃ C∞(R).2. Pn ⊂ C∞(R) for any n ∈ N.3. Denote the set of bounded sequences by
l∞ := (an)∞n=0 : supn∈N|an| <∞
and the set of sequences converging to 0 by
c0 := (an)∞n=0 : limn→∞
an = 0.
Both of these sets become vector spaces with component-wiseaddition and scalar multiplication, and c0 ⊂ l∞ as vector spaces.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 336 / 624
Differential Calculus in One Variable Vector Spaces
Normed Vector SpacesNow that we have introduced the concept of a vector space, we want todefine on vector spaces something akin to the modulus |x | for x ∈ R or theeuclidean length of a vector in R2,
|x | = |(x1, x2)| =√
x21 + x22 .
3.3.8. Definition. Let V be a real (complex) vector space. Then a map∥ · ∥ : V → R is called a norm if for all u, v ∈ V and all λ ∈ R (C),
1. ∥v∥ ≥ 0 for all v ∈ V and ∥v∥ = 0 if and only if v = 0,2. ∥λ · v∥ = |λ| · ∥v∥,3. ∥u + v∥ ≤ ∥u∥+ ∥v∥.
The pair (V , ∥ · ∥) is called a normed vector space or a normed linearspace.Clearly, any normed vector space can also be considered as a metric space:it is easy to check that
ϱ(x , y) := ∥x − y∥satisfies all the requirements of a metric.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 337 / 624
Differential Calculus in One Variable Vector Spaces
Normed Vector Spaces
3.3.9. Examples.
1. Rn with ∥x∥2 =( n∑j=1
x2i
)1/2,
2. Rn with ∥x∥p =( n∑j=1|xi |p
)1/pfor any p ∈ N \ 0,
3. Rn with ∥x∥∞ = max1≤k≤n
|xk |,
4. l∞ with ∥(an)∥∞ = supn∈N|an|,
5. c0 with ∥(an)∥∞ = supn∈N|an|,
6. C ([a, b]), [a, b] ⊂ R, with ∥f ∥∞ = supx∈[a,b]
|f (x)|
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 338 / 624
Differential Calculus in One Variable Vector Spaces
Sequences in Vector SpacesSince every normed vector space is also a metric space, we can considersequences in normed vector spaces without stating any new definitions.However, it may be useful to recall the definitions for the case of normedvector spaces.In the following, we assume that (V , ∥·∥) is a normed vector space.A sequence in a vector space V is a map (an) : N→ V . We say that (an)converges to a ∈ V if
∀ε>0
∃N∈N
∀n>N
: ∥an − a∥ < ε.
3.3.10. Example. Let V = R2 with ∥x∥ = max|x1|, |x2|. Then thesequence given by
an =
(n2
2 + n2,1
n
)n→∞−−−→ (1, 0).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 339 / 624
Differential Calculus in One Variable Sequences of Real Functions
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 340 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
We will interest us in particular in sequences of functions. Thus we willconsider a sequence (fn), where for every n ∈ N, fn is a function.As anexample, take the sequence of continuous functions (fn) given by
fn : [−1, 1]→ R, fn(x) =
√1
(n + 1)2+ x2.
For every n ∈ N, fn ∈ C ([−1, 1]) (fn is continuous on R). We endow C (R)with the norm sup
x∈[−1,1]|f (x)|. Then fn → f as n→∞, where f (x) = |x |.
We shall see and prove this below.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 341 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
-1 1
1
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 342 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of FunctionsNow it seems clear that fn → f , f (x) = |x |, as stated. Mathematically,there are two distinct points of view:
1. pointwise convergence: For every x ∈ [−1, 1],
fn(x)n→∞−−−→ f (x) :⇔ |fn(x)− f (x)| n→∞−−−→ 0
2. uniform convergence: Each fn is an element of the normed vectorspace C ([−1, 1]), and so is f . Then
fnn→∞−−−→ f :⇔ ∥fn − f ∥∞
n→∞−−−→ 0.
The second approach is more natural: we are considering a sequence offunctions fn, not values of functions fn(x), so we should consider theproperties of fn over all of [−1, 1], not just separately those of fn(x) forevery x ∈ [−1, 1].
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 343 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
It is obvious that uniform convergence implies pointwise convergence forall x . In our example, we will prove that
√1/(n + 1)2 + x2 → |x |
uniformly for x ∈ [−1, 1], i.e,
∥fn − f ∥∞ = supx∈[−1,1]
∣∣√1/(n + 1)2 + x2 − |x |∣∣ n→∞−−−→ 0.
Note that we can take the supremum over [0, 1] instead of [−1, 1], sincefn(−x) = fn(x) and |−x | = |x |. Furthermore, it is sufficient to show that(
supx∈[−1,1]
∣∣√1/(n + 1)2 + x2 − |x |∣∣)2
= supx∈[−1,1]
∣∣√1/(n + 1)2 + x2 − |x |∣∣2 n→∞−−−→ 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 344 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
Now∣∣√1/(n + 1)2 + x2 − |x |∣∣2 = 1
(n + 1)2+ 2x2 − 2x
√x2 + 1/(n + 1)2︸ ︷︷ ︸
< 0 if x > 0
.
It follows that the supremum is attained at x = 0 and
∥fn − f ∥2∞ = supx∈[−1,1]
∣∣√1/(n + 1)2 + x2 − |x |∣∣2 = 1
(n + 1)2n→∞−−−→ 0.
Thus fn → f uniformly (or as a sequence in (C ([−1, 1]), ∥ · ∥∞).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 345 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of FunctionsSummarizing, we can say that a sequence of continuous functions fn thatconverges uniformly in an interval [a, b] to some function f also convergespointwise to f :
supx∈[a,b]
|fn(x)− f (x)| n→∞−−−→ 0 ⇒ ∀x∈[a,b]
|fn(x)− f (x)| n→∞−−−→ 0
In general, we make the following definition:3.4.1. Definition. Let Ω ⊂ R and (fn) be a sequence of functionsfn : Ω → C. We say that the sequence (fn) converges pointwise to thefunction f : Ω → C if
∀x∈Ω|fn(x)− f (x)| n→∞−−−→ 0.
If f is the pointwise limit of (fn), we say that (fn) converges uniformly to fon Ω if
supx∈Ω|fn(x)− f (x)| n→∞−−−→ 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 346 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
A sequence of continuous functions fn that converges pointwise to somefunction f does not need to converge uniformly to f .3.4.2. Example. The sequence (fn),
fn : [0, 1]→ R, fn(x) =
1− nx , 0 ≤ x ≤ 1/n,
0, otherwise,
converges to
f : [0, 1]→ R, f (x) =
1, x = 0,
0, otherwise,
pointwise, but not uniformly, as we now show.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 347 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
Clearly, for every x ∈ (0, 1] there exists an N ∈ N such that fn(x) = 0 forall n > N, (Just take N = 1/x .) Furthermore, fn(0) = 1 for all n. Hencefn(x)→ f (x) for all x ∈ [0, 1].However,
supx∈[0,1]
|fn(x)− f (x)| ≥ |fn(1/(2n))− f (1/(2n))| = |1− n/(2n)− 0| = 1
2,
so supx∈[0,1]
|fn(x)− f (x)| → 0 as n→∞.
In general, when considering sequences of functions you should first findthe pointwise limit and then test whether the convergence is uniform.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 348 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
In the previous example, the sequence of continuous functions fnconverged to the discontinuous function pointwise, but not uniformly. Thisno accident. In fact, a uniformly convergent sequence of continuousfunctions will always converge to a continuous function:3.4.3. Theorem. Let [a, b] ⊂ R be a closed interval. Let (fn) be asequence of continuous functions defined on [a, b] such that fn(x)converges to some f (x) ∈ R as n→∞ for every x ∈ [a, b]. If thesequence (fn) converges uniformly to the thereby defined functionf : [a, b]→ R, then f is continuous.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 349 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
Proof.We need to show that f is continuous for all x ∈ [a, b]. We will here dealonly with x ∈ (a, b); the cases x = a and x = b are left to you.Let x ∈ (a, b). We will show that for any ε > 0 there exists a δ > 0 suchthat |h| < δ implies |f (x + h)− f (x)| < ε (for h so small thatx + h ∈ (a, b)). Fix ε > 0. Then there exists some N ∈ N such that
∥fn − f ∥∞ = supx∈[a,b]
|fn(x)− f (x)| < ε
3.
for all n > N. Since each fn is continuous on [a, b], there exists someδ > 0 such that |h| < δ implies
|fn(x)− fn(x + h)| < ε
3.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 350 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of FunctionsProof (continued).Then for |h| < δ we have
|f (x + h)− f (x)| ≤ |f (x + h)− fn(x + h)|+ |fn(x + h)− fn(x)|+ |fn(x)− f (x)|
<ε
3+ε
3+ε
3= ε.
3.4.4. Theorem. Let [a, b] ⊂ R be a closed interval and C ([a, b]) thevector space of continuous functions on [a, b], endowed with the metric
ϱ(f , g) = ∥f − g∥∞ = supx∈[a,b]
|f (x)− g(x)|.
Then the metric space (C ([a, b]), ϱ) is complete, i.e., every Cauchysequence in the space converges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 351 / 624
Differential Calculus in One Variable Sequences of Real Functions
Completeness of C ([a, b])Proof.Let (fn) be a Cauchy sequence in C ([a, b]). We will show that lim
n→∞fn(x)
exists for every x ∈ [a, b] and that exists. First, by definition, for everyε > 0 we have
supx∈[a,b]
|fn(x)− fm(x)| < ε
for n,m sufficiently large. But then, for every fixed x ∈ [a, b], we have
|fn(x)− fm(x)| < ε
for n,m sufficiently large. This implies that for every x ∈ [a, b] thesequence or real numbers (fn(x)) is Cauchy. Since the real numbers arecomplete, (fn(x)) converges. Hence we can define the pointwise limit.
f (x) := limn→∞
fn(x) for every x ∈ [a, b].
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 352 / 624
Differential Calculus in One Variable Sequences of Real Functions
Completeness of C ([a, b])
Proof (continued).We now show that the convergence of (fn) to f is uniform: fix ε > 0 andchoose N so that
supx∈[a,b]
|fn(x)− fm(x)| < ε for n,m > N.
Then for n > N we have
supx∈[a,b]
|f (x)− fn(x)| = supx∈[a,b]
limm→∞
|fm(x)− fn(x)|.
For any p > N we have
|fp(x)− fn(x)| ≤ supm>N|fm(x)− fn(x)|. (3.4.1)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 353 / 624
Differential Calculus in One Variable Sequences of Real Functions
Completeness of C ([a, b])
Proof (continued).We can let p →∞ in (3.4.1) and obtain
limm→∞
|fm(x)− fn(x)| ≤ supm>N|fm(x)− fn(x)|.
This implies that for all n > N
supx∈[a,b]
|f (x)− fn(x)| ≤ supx∈[a,b]
supm>N|fm(x)− fn(x)|
= supm>N
supx∈[a,b]
|fm(x)− fn(x)| < supm≥N
ε = ε,
so the convergence is uniform. By Theorem 3.4.3, f is continuous, sof ∈ C ([a, b]).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 354 / 624
Differential Calculus in One Variable Sequences of Real Functions
Sequences of Functions
We will be especially interested in a particular type of sequence offunctions, called power series. For example, we would like to find the limitof (fn) where
f0(x) = 1, f1(x) = 1 + x , f2(x) = 1 + x +x2
2!, ... , fn(x) =
n∑k=0
xk
k!.
Sequences which are formed by continually adding summands are calledseries, and we will have to start with studying their basic theory.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 355 / 624
Differential Calculus in One Variable Series
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 356 / 624
Differential Calculus in One Variable Series
ConvergenceA series is “a sequence together with the wish to sum.” However, this maynot always be possible.3.5.1. Definition. Let (an) be a sequence in a normed vector space(V , ∥ · ∥). Then we say that (an) is summable with sum s ∈ V if
limn→∞
sn = s, sn :=n∑
k=0
ak .
We call sn the nth partial sum of (an). We use the notation∞∑k=0
ak (3.5.1)
to denote not only s, but also the “procedure of summing the sequence(an).” We call (3.5.1) an infinite series and we say that the seriesconverges if (an) is summable. If (sn) does not converge, we say that∑
ak diverges.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 357 / 624
Differential Calculus in One Variable Series
The Geometric Series
3.5.2. Lemma. The geometric series∞∑k=0
zk , z ∈ C, converges to 1
1− zif
|z | < 1 and diverges if |z | > 1.
Proof.From
(1− z)n∑
k=0
zk =n∑
k=0
zk −n+1∑k=1
zk = 1− zn+1
we see thatn∑
k=0
zk =1− zn+1
1− z. (3.5.2)
As n→∞, |z |n+1 will diverge if |z | > 1 and converge to zero if|z | < 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 358 / 624
Differential Calculus in One Variable Series
The Koch Snowflake3.5.3. Example. The Koch snowflake is a simple example of a fractal curve.It is constructed as follows: one starts with an equilateral triangle:
One then successively adds an equilateral triangle to the middle third ofevery line segment.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 359 / 624
Differential Calculus in One Variable Series
The Koch Snowflake - First Iteration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 360 / 624
Differential Calculus in One Variable Series
The Koch Snowflake - Second Iteration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 361 / 624
Differential Calculus in One Variable Series
The Koch Snowflake - Third Iteration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 362 / 624
Differential Calculus in One Variable Series
The Koch Snowflake - Fourth Iteration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 363 / 624
Differential Calculus in One Variable Series
The Koch Snowflake - Fifth Iteration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 364 / 624
Differential Calculus in One Variable Series
The Koch Snowflake - Sixth Iteration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 365 / 624
Differential Calculus in One Variable Series
The Koch SnowflakeThe Koch snowflake is now defined as the limit as n→∞ of the nthiteration. The fractal nature of the curve becomes apparent when one triesto “zoom in” to a segment to analyze its structure:
This property is called self-similarity: any arbitrarily small segment of theKoch curve contains all the structural information of the entire curve.We are now interested in the total line length ln and the area An
circumscribed by the nth iteration of the Koch snowflake.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 366 / 624
Differential Calculus in One Variable Series
Length of the Koch Snowflake
It is easy to see that in every iteration the length of the curve increases by1/3: effectively, one adds 1/3 of the length of each individual linesegment, so
ln+1 =4
3ln.
Hence, if the initial triangle has side length l0,
ln =
(4
3
)n
l0
When n→∞, the length of the nth iteration diverges to infinity, so theKoch curve has infinite length.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 367 / 624
Differential Calculus in One Variable Series
Area enclosed by the Koch SnowflakeNote that an equilateral triangle of side length l has area A = (
√3/4)l2.
To the area of the initial triangle we add that of three triangles of lengthl0/3, giving
A0 =
√3
4l20 , A1 =
√3
4l20 + 3 ·
√3
4
(l03
)2
.
The first iteration consists of 12 line segments and thereafter, the nthiteration will have 4 times as many as the previous iteration. There will bea total of 3 · 4n line segments in the nth iteration and the length of eachsegment is l0/3
n. This means that the nth iteration, n ≥ 1, will have anarea of
An =
√3
4l20 +
n∑k=1
3 · 4k−1
√3
4
(l03k
)2
=
√3
4l20
(1 +
3
4
n∑k=1
(4
9
)k)
=3
4A0
(1
3+
n∑k=0
(4
9
)k)
n→∞−−−→ 3
4A0
(1
3+
1
1− 4/9
)=
8
5A0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 368 / 624
Differential Calculus in One Variable Series
Area enclosed by the Koch SnowflakeWe have
An =3
4A0
(1
3+
n∑k=0
(4
9
)k).
This is a geometric series, and when n→∞, we find that
limn→∞
An =3
4A0
(1
3+
1
1− 4/9
)=
8
5A0
Hence the Koch snowflake has 8/5 times the area of the original triangle.A finite area is hereby enclosed by a curve of infinite length.The theory of fractals owes much to the ideas of Benoit Mandelbrot, whopopularized the famous Mandelbrot fractal. Fractals play an importantrole in chaos theory and non-linear dynamics.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 369 / 624
Differential Calculus in One Variable Series
The Cauchy Criterion and its ConsequencesWe now return to the general theory of series. In particular, we want todevelop techniques to analyze the geometric series when |z | = 1.3.5.4. Cauchy Criterion. Let
∑ak be a series in a complete vector space
(V , ∥ · ∥). Then
∑ak converges ⇔ (sn)n∈N converges, sn =
n∑k=0
ak
⇔ (sn) is Cauchy⇔ ∀
ε>0∃
N∈N∀
m>n>N∥sm − sn∥ < ε
⇔ ∀ε>0
∃N∈N
∀m>n>N
∥∥∥ m∑k=n+1
ak
∥∥∥ < ε
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 370 / 624
Differential Calculus in One Variable Series
The Cauchy Criterion and its Consequences
3.5.5. Corollary. If the series∑∞
k=0 ak converges, then the sequenceak → 0 as k →∞. (Take m = n + 1 in the Cauchy Criterion.)
3.5.6. Corollary. If the series∑∞
k=0 ak converges, then the sequence (An)given by
An :=∞∑k=n
ak
converges to 0 as n→∞. (Let m→∞ in the Cauchy Criterion.)
3.5.7. Example. The harmonic series∞∑k=1
1k is divergent. (Since
|s2n − sn| ≥ 1/2, the sequence of partial sums (sn) is not Cauchy.)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 371 / 624
Differential Calculus in One Variable Series
Absolute Convergence
3.5.8. Example. The geometric series∞∑k=0
zk , z ∈ C, diverges if |z | = 1. Tosee this, note that
limk→∞|zk | = lim
k→∞|z |k = 1,
so the sequence zk does not converge to zero. By Corollary 3.5.5, theseries can not converge.
3.5.9. Definition. A series∑
ak in a normed vector space (V , ∥ · ∥) iscalled absolutely convergent if
∑∥ak∥ converges.
A sequence (ak) in a normed vector space (V , ∥ · ∥) is called absolutelysummable if
∑ak converges absolutely.
3.5.10. Theorem. An absolutely convergent series∑
ak in a completevector space (V , ∥ · ∥) is convergent.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 372 / 624
Differential Calculus in One Variable Series
Absolute Convergence
Proof.Let (ak) be a sequence with partial sum
sn =n∑
k=0
ak .
Then
∥sn − sm∥ =∥∥∥ n∑k=m+1
ak
∥∥∥ ≤ n∑k=m+1
∥ak∥. (3.5.3)
Since∑
ak converges absolutely,∑∥ak∥ is Cauchy, so for any ε > 0 there
exists N ∈ N such that∑n
k=m+1∥ak∥ < ε for all n,m > N. But by (3.5.3),this is also true for ∥sn − sm∥. Hence the sequence (sn) is Cauchy andconverges because (V , ∥ · ∥) is complete. Thus
∑ak converges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 373 / 624
Differential Calculus in One Variable Series
Absolute ConvergenceThe concept of absolute convergence is useful in many contexts, such as
(i) Alternating series: A series of real numbers of the form∑∞k=0(−1)kak with ak > 0 for all k is called alternating. The series
will converge absolutely if∞∑k=0
|(−1)kak | =∞∑k=0
ak
converges.(ii) Complex series: The series
∑∞k=0 zk with zk = ak + ibk converges
absolutely if the series∞∑k=0
|zk | =∞∑k=0
√a2k + b2k
converges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 374 / 624
Differential Calculus in One Variable Series
Absolute Convergence
(iii) Series of functions: The series∑∞
k=0 fk of continuous functionsfk ∈ C (I ,C), I = [a, b] ⊂ R a closed interval, converges absolutely ifthe series
∞∑k=0
∥fk∥∞ =∞∑k=0
supx∈I|f (x)|
converges.Since the real numbers are complete, absolute convergence in (i) impliesconvergence of the alternating series to some real number. Similarly,absolute convergence of the complex sequence in (ii) implies convergenceto some complex number. The most interesting case from the point ofview of calculus is (iii): since
(C (I ,C), ∥ · ∥∞
)is complete by Theorem
3.4.4, absolute convergence implies convergence to some continuousfunction f ∈ C (I ,C).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 375 / 624
Differential Calculus in One Variable Series
Series of Functions3.5.11. Example. Let (fn) be the sequence of functions
fn :
[−1
2,1
2
]→ R, fn(x) = xn.
Clearly, fn ∈ C ([−12 ,
12 ]) for all n ∈ N. We want to find out if (fn) is
summable, i.e., if the series∞∑n=0
fn (3.5.4)
converges to a function f ∈ C ([−12 ,
12 ]). This would be the case if the
series (3.5.4) were to converge absolutely. We have∞∑n=0
∥fn∥∞ =∞∑n=0
sup−1/2≤x≤1/2
|xn| =∞∑n=0
1
2n= 2 <∞.
Hence (3.5.4) converges absolutely and therefore also converges to afunction f ∈ C ([−1/2, 1/2]).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 376 / 624
Differential Calculus in One Variable Series
Series of FunctionsIn this simple example we can actually calculate the pointwise limit of theseries:
f (x) =∞∑n=0
fn(x) =∞∑n=0
xn =1
1− x
for all x ∈ [−12 ,
12 ]. In many applications, however, we will not be able to
calculate the pointwise limit so easily. We will return to a discussion oflimits of series of functions below.
3.5.12. Lemma. Let (V , ∥ · ∥) be a complete normed vector space and∑ak an absolutely convergent series. Then∥∥∥ ∞∑
k=0
ak
∥∥∥ ≤ ∞∑k=0
∥ak∥. (3.5.5)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 377 / 624
Differential Calculus in One Variable Series
The “Infinite” Triangle Inequality
Proof.The triangle inequality yields, for any n ∈ N,∥∥∥ n∑
k=0
ak
∥∥∥ ≤ n∑k=0
∥ak∥ ≤∞∑k=0
∥ak∥. (3.5.6)
Since the series converges absolutely, it also converges and∥∥∥ ∞∑k=0
ak
∥∥∥ =∥∥∥ limn→∞
n∑k=0
ak
∥∥∥ = limn→∞
∥∥∥ n∑k=0
ak
∥∥∥since the norm is continuous (i.e., bn → b implies ∥bn∥ → ∥b∥. Why?).Since the limit exists, the estimate (3.5.6) implies (3.5.5).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 378 / 624
Differential Calculus in One Variable Series
The Comparison TestIn the rest of this section we will use the phrase “for sufficiently large k”to mean “for k > k0 for some k0 ∈ N”. For example, the statement“1/k < 0.001 for sufficiently large k” should be interpreted in this way.Many of the following criteria apply to series of positive real numbers.They are therefore suitable for establishing the absolute convergence of aseries.3.5.13. Comparison Test. Let (ak) and (bk) be real-valued sequences with0 ≤ ak ≤ bk for sufficiently large k. Then∑
bk converges ⇒∑
ak converges.
3.5.14. Remark. In applications one also often uses the contrapositive: if0 ≤ ak ≤ bk for sufficiently large k, then∑
ak diverges ⇒∑
bk diverges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 379 / 624
Differential Calculus in One Variable Series
The Comparison TestProof.Let (ak) and (bk) be positive real sequences and denote by
Sn :=n∑
k=0
ak , sn :=n∑
k=0
bk , n ∈ N,
their partial sums. Since the series∑
bk converges, the sequence (sn)converges to
∑∞k=0 bk <∞ as n→∞.
Since ak ≤ bk for k > k0 form some k0 ∈ N, the partial sums of (ak) arebounded by
Sn =n∑
k=0
ak ≤ C +n∑
k=k0
bk ≤ C +∞∑k=0
bk
for some C > 0. Hence (Sn) is bounded above. Since (Sn) is obviouslyincreasing, the sequence converges. This means that
∑ak is
convergent.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 380 / 624
Differential Calculus in One Variable Series
The Comparison Test
3.5.15. Example. The series∞∑k=1
(−1)k
k2k(3.5.7)
converges. To see this, we note that
ak :=
∣∣∣∣(−1)kk2k
∣∣∣∣ = 1
k
1
2k≤(1
2
)k
=: bk .
Since the geometric series∑∞
k=0 zk converges if z = 1/2, we know that∑∞
k=0 bk converges. By the comparison test,∑∞
k=0 ak converges, so(3.5.7) is absolutely convergent. Since the real numbers are complete, italso converges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 381 / 624
Differential Calculus in One Variable Series
The Comparison Test
3.5.16. Example. The series∞∑k=1
k + 2
k(k + 1)(3.5.8)
diverges. To see this, we note that
bk :=k + 2
k(k + 1)≥ 1
k=: ak .
Since the harmonic series∑∞
k=11k diverges, so does
∑∞k=1
k+2k(k+1) . (See
Remark 3.5.14.)Many criteria for absolute convergence can be derived from thecomparison test. We will now study a few examples.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 382 / 624
Differential Calculus in One Variable Series
The Weierstraß M-test3.5.17. Weierstraß M-test. Let Ω ⊂ R and (fk) be a sequence of functionsdefined on Ω , fk : Ω → C, satisfying
supx∈Ω|fk(x)| ≤ Mk , k ∈ N (3.5.9)
for a sequence of real numbers (Mk). Suppose that∑
Mk converges.Then the limit
f (x) :=∞∑k=0
fk(x) exists for every x ∈ Ω.
Furthermore, the sequence (Fn) of partial sums
Fn(x) =n∑
k=0
fk(x)
converges uniformly to f .Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 383 / 624
Differential Calculus in One Variable Series
The Weierstraß M-testProof.For every x ∈ Ω, |fk(x)| ≤ Mk , so
∑∞k=0|fk(x)| converges by the
comparison test. Thus, for every x ∈ Ω, the series of numbers∑∞
k=0 fk(x)converges absolutely. Since the complex numbers are complete, we deducethat
f (x) :=∞∑k=0
fk(x) exists for every x ∈ Ω.
For any x ∈ Ω,
|f (x)− Fn(x)| =∣∣∣f (x)− n∑
k=0
fk(x)∣∣∣ = ∣∣∣ ∞∑
k=n+1
fk(x)∣∣∣
≤∞∑
k=n+1
|fk(x)| ≤∞∑
k=n+1
Mk .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 384 / 624
Differential Calculus in One Variable Series
The Weierstraß M-test
Proof (continued).Hence,
supx∈Ω
∣∣∣f (x)− n∑k=0
fk(x)∣∣∣ ≤ ∞∑
k=n+1
Mk .
By Corollary 3.5.6, the right-hand side converges to zero as n→∞, so wehave
supx∈Ω
∣∣∣f (x)− n∑k=0
fk(x)∣∣∣ n→∞−−−→ 0.
This shows the uniform convergence.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 385 / 624
Differential Calculus in One Variable Series
Uniform Convergence of Series of Functions
3.5.18. Definition. Let Ω ⊂ R and (fk) be a sequence of functions definedon Ω, fk : Ω → C. We say that the series
∞∑k=0
fk
converges uniformly (to a function f : Ω → C) if the sequence of partialsums Fn =
∑nk=0 fk converges uniformly to f .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 386 / 624
Differential Calculus in One Variable Series
Uniform Convergence of Series of Functions3.5.19. Example. Consider the sequence of monomials (fn),
fn :
[−1
2,1
2
]→ R, fn(x) =
1
nxn, n ∈ N \ 0.
Sincesup
x∈[− 12, 12]
|fn(x)| =1
n2n<
1
2n=: Mn
and∑
Mn converges, by the Weierstraß M-test the series∑
fk convergesuniformly to
f :
[−1
2,1
2
]→ R, f (x) =
∞∑n=1
1
nxn.
At this time we are not able to find a more explicit expression for f .However, by Theorem 3.4.3 we know that f is a continuous function on[−1/2, 1/2].
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 387 / 624
Differential Calculus in One Variable Series
The Root Test
A further consequence of the Comparison Test 3.5.13 is the following,quite fundamental criterion for convergence:3.5.20. Root Test. Let
∑ak be a series of positive real numbers ak ≥ 0.
(i) Suppose that there exists a q < 1 such that
k√ak ≤ q for all sufficiently large k.
Then∑
ak converges.(ii) Suppose that
k√ak > 1 for all sufficiently large k.
Then∑
ak diverges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 388 / 624
Differential Calculus in One Variable Series
The Root Test
Proof.(i) Suppose that we have a q < 1 such that k
√ak ≤ q for all k ≥ k0.
Then
ak ≤ qk for k ≥ k0.
Since the geometric series∑∞
k=k0qk converges for q < 1, the series∑∞
k=k0ak converges by the Comparison Test 3.5.13.
(ii) Suppose that k√ak ≥ 1 for all k ≥ k0. Then ak ≥ 1, so ak → 0 as
k →∞. By Corollary 3.5.5, the series∑
ak can not converge.
3.5.21. Remark. Note that the existence of a q < 1 so that k√ak < q is
crucial; this is not the same as requiring k√ak < 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 389 / 624
Differential Calculus in One Variable Series
The Root Test3.5.22. Example. The series
∞∑k=0
kzk
converges absolutely if |z | < 1 and diverges if |z | > 1. This is easily seenfrom
k
√|kzk | = k
√k · |z |.
Since k√k → 1 as k →∞, we see that for sufficiently large k ,
k
√|kzk | > 1 if |z | > 1
and
k
√|kzk | < 1 if |z | < 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 390 / 624
Differential Calculus in One Variable Series
The Root Test3.5.23. Example. The root test does not always yield a result: Consider theseries
∞∑k=1
1
kand
∞∑k=1
1
k2.
In both cases, k√ak < 1 for all k, but there does not exist a constant q < 1
such that k√ak < q. We have seen that the harmonic series
∞∑k=1
1
k
diverges. We will see below that the Euler series∞∑k=1
1
k2
converges.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 391 / 624
Differential Calculus in One Variable Series
The Root Test using Limits3.5.24. Root Test. Let ak be a sequence of positive real numbers ak ≥ 0.Then
limk→∞
k√ak < 1 ⇒
∞∑k=0
ak converges,
limk→∞
k√ak > 1 ⇒
∞∑k=0
ak diverges.
3.5.25. Remarks.(i) No statement is possible if lim
k→∞k√ak = 1.
(ii) If limk→∞
k√ak exists, it equals lim
k→∞k√ak . This will be the case in many
applications.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 392 / 624
Differential Calculus in One Variable Series
The Root Test using LimitsProof.Recall that for a bounded sequence (xk)
limk→∞
xk := limn→∞
supxn, xn+1, xn+2, ....
Suppose that limk→∞
k√ak =: p < 1. Then there exists an N ∈ N such that
∣∣sup n√an, n+1
√an+1, ... − p
∣∣ < 1− p
2for all n ≥ N.
This implies
sup n√an, n+1
√an+1, ... < p +
1− p
2=: q < 1
for all n ≥ N, so the series converges by the Root Test 3.5.20. This provesthe first statement. The proof of the second statement is analogous.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 393 / 624
Differential Calculus in One Variable Series
The Ratio TestThe Root Test is extremely powerful, but sometimes quite difficult tohandle. A more convenient criterion for convergence is provided by theratio test:3.5.26. Ratio Test. Let
∑ak be a series of strictly positive real numbers
ak > 0.(i) Suppose that there exists a q < 1 such that
ak+1
ak≤ q for all sufficiently large k.
Then∑
ak converges.(ii) Suppose that
ak+1
ak≥ 1 for all sufficiently large k.
Then∑
ak diverges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 394 / 624
Differential Calculus in One Variable Series
The Ratio Test
Proof.(i) Suppose that we have a q < 1 such that ak+1/ak ≤ q for all k ≥ k0.
Then
ak ≤ qak−1 ≤ q2ak−2 ≤ ak0 · qk−k0 for k ≥ k0.
Since the geometric series∑∞
k=k0qk converges for q < 1, the series∑∞
k=k0ak converges by the Comparison Test 3.5.13.
(ii) Suppose that akak−1≥ 1 for all k ≥ k0. Then ak ≥ ak−1, so ak → 0 as
k →∞. By Corollary 3.5.5, the series∑
ak can not converge.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 395 / 624
Differential Calculus in One Variable Series
The Ratio Test3.5.27. Example. Consider the series
∞∑n=0
4n
n!.
Applying the root test would be quite tedious, because we don’t know hown√n! behaves. However,
4n+1
(n + 1)!
n!
4n=
4
n≤ 4
5
if n ≥ 5, so the series converges.The ratio test is easier to apply than the root test. However, it is formallynot as powerful: there are series whose convergence or divergence can notbe determined using the ratio test, but where the root test shows theconvergence. An example will be given in the assignments.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 396 / 624
Differential Calculus in One Variable Series
The Ratio Test using Limits
3.5.28. Ratio Test. Let (ak) be a sequence of strictly positive real numbersak > 0. Then
limk→∞
ak+1
ak< 1 ⇒
∞∑k=0
ak converges,
limk→∞
ak+1
ak> 1 ⇒
∞∑k=0
ak diverges.
The proof is left to you; note the inferior limit in the condition fordivergence!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 397 / 624
Differential Calculus in One Variable Series
The Ratio Comparison Test
We will now try to derive a finer version of the ratio test that will enableus to distinguish the divergence of the harmonic series from theconvergence of the Euler series. A first step is the following criterion:3.5.29. Ratio Comparison Test. Let (ak) and (bk) be sequences of strictlypositive real numbers ak , bk > 0. Suppose that
∑bk converges. If
ak+1
ak≤ bk+1
bkfor sufficiently large k,
then∑
ak converges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 398 / 624
Differential Calculus in One Variable Series
The Ratio Comparison Test
Proof.The hypothesis ak+1
ak≤ bk+1
bkis equivalent to
ak+1
bk+1≤ ak
bk.
Hence the sequence (ak/bk) is decreasing and positive (since ak , bk > 0for all k).Thus (ak/bk) converges and is therefore bounded: there existssome M > 0 such that
akbk
< M ⇔ ak < M · bk .
Since∑
bk converges by assumption, so does∑
(Mbk). But then∑
akconverges by the Comparison Test 3.5.13.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 399 / 624
Differential Calculus in One Variable Series
The p-SeriesWe are interested in studying the so-called p-series,
∞∑k=1
1
kp, p ∈ R.
Clearly, for p ≤ 0 the series diverges, because it is not the sum of asequence converging to zero.The root and ratio tests do not yield information on the convergence, sowe will have to treat this series separately. First, we prove a preliminaryresult.3.5.30. Lemma. Denote by sn(p) the nth partial sum of the p-series. Forp > 0,
1 +1
2p−1sn(p)−
1
2p< s2n(p) < 1 +
1
2p−1sn(p)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 400 / 624
Differential Calculus in One Variable Series
The p-Series
Proof.By definition,
s2n(p) = 1 +1
2p+
1
3p+ · · ·+ 1
(2n)p
= 1 +( 1
2p+
1
4p+ · · ·+ 1
(2n)p
)+( 1
3p+
1
5p+ · · ·+ 1
(2n − 1)p
)> 1 +
1
2psn(p) +
( 1
4p+
1
6p+ · · ·+ 1
(2n)p
)where we have used that p > 0 in the last step. Thus
s2n(p) > 1 +1
2psn(p)−
1
2p+
1
2psn(p) = 1 +
1
2p−1sn(p)−
1
2p.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 401 / 624
Differential Calculus in One Variable Series
The p-Series
Proof (continued).Similarly,
s2n(p) = 1 +( 1
2p+
1
4p+ · · ·+ 1
(2n)p
)+( 1
3p+
1
5p+ · · ·+ 1
(2n − 1)p
)< 1 +
1
2psn(p) +
( 1
2p+
1
4p+ · · ·+ 1
(2n − 2)p
)+
1
(2n)p
< 1 +2
2psn(p) = 1 +
1
2p−1sn(p)
We can now state our main result:
3.5.31. Theorem. The p-series∞∑k=0
1kp converges when p > 1 and diverges
when p ≤ 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 402 / 624
Differential Calculus in One Variable Series
The p-SeriesProof.We only need to consider the case p > 0. Assume first that 0 < p ≤ 1 andthat the p-series converges, i.e., lim
n→∞sn(p) =: s(p) exists. Then the
estimate of Lemma 3.5.30 gives
1 +1
2p−1s(p)− 1
2p=
2p − 1
2p+
2
2ps(p) ≤ s(p).
and hence0 <
2p − 1
2p≤ 2p − 2
2ps(p) ≤ 0
which is a contradiction. Thus the series diverges for p ≤ 1. Now forp > 1 we have
sn(p) < s2n(p) < 1 + 21−psn(p),
so sn(p) < 1/(1− 21−p). Since (sn(p)) is increasing, the sequence ofpartial sums converges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 403 / 624
Differential Calculus in One Variable Series
Raabe’s TestWe can use our previous result to obtain a finer version of the ratio test:3.5.32. Raabe’s Test. Let
∑ak be a series of positive real numbers
ak ≥ 0. Suppose that there exists a number p > 1 such thatak+1
ak≤ 1− p
kfor sufficiently large k.
Then the series∑
ak converges.
3.5.33. Example. The series∞∑n=1
(2n)!
4n(n + 1)!n!
is convergent by Raabe’s test.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 404 / 624
Differential Calculus in One Variable Series
Raabe’s Test
Proof.We first observe that if p > 1 and 0 ≤ x < 1, then
1− px ≤ (1− x)p. (3.5.10)
At x = 0 we have equality, while the derivative of the left-hand side is lessthan the derivative of the right-hand side for x ∈ (0, 1). We now apply(3.5.10) with x = 1/k to our hypothesis: for sufficiently large k,
ak+1
ak≤ 1− p
k≤ (1− 1/k)p =
(k − 1
k
)p
=bk+1
bk,
where bk := 1/(k − 1)p. Since∑
bk converges, it follows from the RatioComparison Test 3.5.29 that
∑ak converges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 405 / 624
Differential Calculus in One Variable Series
Absolutely and Conditionally Convergent Series
All of the tests that we have discussed essentially test series for absoluteconvergence. In complete vector spaces, this implies ordinary convergence.But what is more, absolutely convergent series have properties that seriesthat converge without being absolutely convergent do not have.3.5.34. Definition. A series in a normed vector space (V , ∥ · ∥) is calledconditionally convergent if it is convergent, but not absolutely convergent.
3.5.35. Example. We will see later that∑
(−1)k 1k is conditionally
convergent.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 406 / 624
Differential Calculus in One Variable Series
Rearrangements of Terms in Series
For the proof of the following two theorems we refer to the text book(Spivak, Chapter 23, Theorems 7 & 8).3.5.36. Theorem. Assume that
∑ak is an absolutely convergent series in a
complete normed vector space. If the summands of the series arerearranged, the new series
∑bj , bj = ak(j), k : N→ N bijective, converges
absolutely with the same sum as∑
ak .In particular, if
∑ak and
∑bk are absolutely convergent, then∑
(ak + bk) is absolutely convergent and equal to∑
ak +∑
bk .Contrast this with the following result:3.5.37. Theorem. Let
∑ak be a conditionally convergent series of real
numbers. Then for any α ∈ R there exists a rearrangement bj = ak(j),k : N→ N bijective, of
∑ak such that
∑bj = α.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 407 / 624
Differential Calculus in One Variable Series
The Leibniz TheoremFor this reason, we are mostly interested in absolutely convergent series.However, we give one result concerning conditionally convergent series.3.5.38. Leibniz Theorem. Let
∑αk be a complex series whose partial
sums are bounded but need not converge. Let (ak) be a decreasingconvergent sequence with limit zero, ak 0. Then the series∑
αkak converges.
A typical application of the Leibniz Theorem 3.5.38 are alternating series,for which αk = (−1)k . In particular, the Leibniz series
∞∑k=1
(−1)k
k
converges by the Leibniz Theorem.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 408 / 624
Differential Calculus in One Variable Series
Summation by Parts3.5.39. Lemma. Let (ak) and (bk) denote complex sequences. Then form > n
m∑k=n+1
ak(bk − bk−1) = am+1bm − an+1bn −m∑
k=n+1
(ak+1 − ak)bk .
(3.5.11)
Proof.m∑
k=n+1
ak(bk − bk−1) =m∑
k=n+1
akbk −m∑
k=n+1
akbk−1
=m∑
k=n+1
akbk −m−1∑k=n
ak+1bk
=m∑
k=n+1
(ak − ak+1)bk + am+1bm − an+1bn
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 409 / 624
Differential Calculus in One Variable Series
The Leibniz Theorem
Proof of Theorem 3.5.38.Denote by (sn) the sequence of partial sums of (αk),
sn =n∑
k=0
αk .
For k ≥ 1, αk = sk − sk−1. Hence, applying (3.5.11),n∑
k=0
αkak =n∑
k=1
(sk − sk−1)ak + α0a0
= α0a0 + snan+1 − a1α0 −n∑
k=1
(ak+1 − ak)sk
We will show that the limit as n→∞ of the right-hand side exists.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 410 / 624
Differential Calculus in One Variable Series
The Leibniz Theorem
Proof of Theorem 3.5.38 (continued).Since (ak) is decreasing, ak − ak+1 > 0 and therefore
n∑k=1
|ak+1 − ak | =n∑
k=1
(ak − ak+1) = a1 − an+1
for all n ∈ N \ 0. We further assume |sk | ≤ M for some M > 0. Thus
|(ak+1 − ak)sk | ≤ M|ak+1 − ak |.
Since∑|ak+1 − ak | converges, the Comparison Test 3.5.13 implies that∑
(ak+1 − ak)sk is absolutely convergent. Since C is complete, the series∑(ak+1 − ak)sk converges.
Furthermore, since |sk | ≤ M, snan+1 → 0 as n→∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 411 / 624
Differential Calculus in One Variable Series
The Cauchy Product and Convolution of Sequences
3.5.40. Theorem. Let∑
ak and∑
bk be absolutely convergent series.Then the Cauchy product
∑ck given by
ck :=∑
i+j=k
aibj
converges absolutely and∑
ck =(∑
ak
)(∑bk
).
3.5.41. Remark. If a = (ak) and b = (bk) are two absolutely summablesequences, the sequence
a ∗ b := (ck), ck :=∑
i+j=k
aibj ,
is called the convolution of a and b.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 412 / 624
Differential Calculus in One Variable Series
The Cauchy Product and Convolution of SequencesProof.Let
An :=n∑
k=0
ak , Bn :=n∑
k=0
bk , Cn :=n∑
k=0
ck .
We will show that limn→∞|Cn − AnBn| = 0. Now
|Cn − AnBn| =∣∣∣ n∑k=0
∑i+j=k
aibj −n∑
i ,j=0
aibj
∣∣∣ ≤ ∑i+j>n0≤i ,j≤n
|aibj | =: Rn
If i + j > n, at least one of i and j will be greater than n/2. Thus
Rn ≤∑
n/2≤i≤n
|ai |n∑
j=0
|bj |+n∑
i=0
|ai |∑
n/2≤j≤n
|bj |
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 413 / 624
Differential Calculus in One Variable Series
The Cauchy Product and Convolution of Sequences
Proof (continued).We have
Rn ≤∑n/2≤i
|ai |︸ ︷︷ ︸→0
·n∑
j=0
|bj |︸ ︷︷ ︸bounded
+n∑
i=0
|ai |︸ ︷︷ ︸bounded
·∑n/2≤j
|bj |︸ ︷︷ ︸→0
→ 0 as n→∞.
This implies that∑
ck =(∑
ak
)(∑bk
). Furthermore,
∑ck is
absolutely convergent, because∑|ck | ≤
(∑|ak |)(∑
|bk |)<∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 414 / 624
Differential Calculus in One Variable Real and Complex Power Series
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 415 / 624
Differential Calculus in One Variable Real and Complex Power Series
Formal Power Series3.6.1. Definition. For any complex sequence (ak), the expression
∞∑k=0
ak zk
is called a (formal) complex power series.
3.6.2. Definition. Let∑
ak zk and
∑bk z
k be two complex power series.Then we define their sum
∞∑k=0
ak zk +
∞∑k=0
bk zk :=
∞∑k=0
(ak + bk) zk
and their (Cauchy) product(∑ak z
k)(∑
bk zk):=(∑
(a ∗ b)k zk)
where a = (ak), b = (bk) and ∗ denotes convolution of sequences.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 416 / 624
Differential Calculus in One Variable Real and Complex Power Series
Convergent Power Series
3.6.3. Remark. We have defined the formal power series without referenceto what zn actually is, and we have introduced no notion of convergence.Hence a formal power series is little different from a sequence.Our goal is to eventually treat a power series as a polynomial of infinitedegree. We will analyze the convergence of complex power series for z ∈ C.
3.6.4. Definition. A formal power series∑
ak zk is said to be is (absolutely)
convergent at z0 ∈ C if the series∞∑k=0
akzk0 converges (absolutely).
3.6.5. Remark. For power series that are absolutely convergent at z0 theaddition and multiplication of Definition 3.6.2 coincide with the additionand multiplication of absolutely convergent series.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 417 / 624
Differential Calculus in One Variable Real and Complex Power Series
Convergent Power Series3.6.6. Definition and Theorem. Let
∑akz
k be a complex power series.Then there exists a unique number ϱ ∈ [0,∞] such that
(i)∑
akzk is absolutely convergent if |z | < ϱ,
(ii)∑
akzk diverges if |z | > ϱ.
This number is called the radius of convergence of the power series. It isgiven by Hadamard’s formula,
ϱ =
1
lim k√
|ak |, 0 < lim
k→∞k√|ak | <∞,
0, limk→∞
k√|ak | =∞,
∞, limk→∞
k√|ak | = 0.
(3.6.1)
3.6.7. Remark. No information is given about the convergence of∑
akzk
if |z | = ϱ. In these cases the series may converge or diverge, depending onthe point z .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 418 / 624
Differential Calculus in One Variable Real and Complex Power Series
Radius of Convergence
3.6.8. Example.∞∑k=0
(−1)k
k zk has ϱ = 1. The series converges for z = 1 anddiverges for z = −1.
Proof.It is sufficient to prove that ϱ defined by Hadamard’s formula (3.6.1) hasthe properties (i) and (ii) (Clearly, no other number can have theseproperties.)We first prove (i) Let ϱ > 0 and |z | < ϱ. We can assume that ϱ <∞ (thecase ϱ =∞ is left to the reader). Then
limk→∞
k
√|akzk | = |z | lim
k→∞k√|ak | =
|z |ϱ< 1,
so∑
akzk converges absolutely by the root test.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 419 / 624
Differential Calculus in One Variable Real and Complex Power Series
Radius of Convergence
Proof (continued).Next we prove (ii) Let |z | > ϱ. We consider two cases:
ϱ = 0:Then limk→∞
k√|ak | =∞, so
limk→∞
k
√|akzk | = |z | lim
k→∞k√|ak | =∞,
so∑
akzk diverges.
0 < ϱ <∞:Then
limk→∞
k
√|akzk | = |z | lim
k→∞k√|ak | =
|z |ϱ> 1.
so∑
akzk diverges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 420 / 624
Differential Calculus in One Variable Real and Complex Power Series
Radius of Convergence
3.6.9. Lemma. Let∑
akzk be a complex power series with radius of
convergence ϱ. Then the series∑
kakzk−1 has the same radius of
convergence ϱ.
Proof.It is sufficient to show that the series∑
kakzk = z ·
∑kakz
k−1
has radius of convergence ϱ. We consider here only the case 0 < ϱ <∞.The cases ϱ = 0 and ϱ =∞ are left to you. In that case,
ϱ =1
limk→∞
k√|ak |
.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 421 / 624
Differential Calculus in One Variable Real and Complex Power Series
Radius of ConvergenceProof (continued).Suppose that |z | < ϱ. Then
1 >|z |ϱ
= limk→∞
k
√|ak ||z |k = lim
k→∞k
√|kakzk |/k.
This implies that there exists a q < 1 such that
k
√|kakzk |/k ≤ q and, hence, |kakzk | ≤ kqk
for sufficiently large k. By Example 3.5.22 the series∑
kqk converges, sothe Comparison Test 3.5.13 shows that
∑kakz
k converges absolutely.Similarly, if |z | > ϱ, then lim
k→∞k√|kakzk |/k > 1 and |kakzk | > kpk for
some p ≥ 1. This shows that∑
kakzk diverges. Therefore, the radius of
convergence of∑
kakzk must be ϱ.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 422 / 624
Differential Calculus in One Variable Real and Complex Power Series
Uniform Convergence of Power Series
Recall from Definition 3.5.18 that a series∑∞
k=0 fk of functions fk : Ω→ Cconverges uniformly to a function f if
supx∈Ω
∣∣∣f (x)− n∑k=0
fk(x)∣∣∣ n→∞−−−→ 0.
In that case, the function f is continuous by Theorem 3.4.3. We nowapply these general results to power series, where
fk(x) = akxk .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 423 / 624
Differential Calculus in One Variable Real and Complex Power Series
Uniform Convergence of Power Series3.6.10. Lemma. If
∑akz
k is a complex power series with radius ofconvergence ϱ, then for any R < ϱ the series converges uniformly onBR(0) = z : |z | < R.
Proof.Let 0 < R < ϱ be fixed. Then
supz∈BR(0)
|akzk | < |ak |Rk =: Mk
Now∞∑k=0
Mk =∞∑k=0
|ak |Rk
converges, because the point R < ϱ lies within the radius of convergenceof the series and the series convergences absolutely within its radius ofconvergence. By the Weierstraß M-test 3.5.17, the series
∑akz
k offunctions akz
k : BR(0)→ C converges uniformly on BR(0).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 424 / 624
Differential Calculus in One Variable Real and Complex Power Series
Continuity of Power Series3.6.11. Corollary. A power series
∑∞k=0 akx
k with radius of convergence ϱdefines a continuous function
f : Bϱ(0)→ C, f (z) =∞∑k=0
akzk .
Proof.Let z ∈ C have modulus |z | < ϱ. Define R := |z |+ (ϱ− |z |)/2 < ϱ. Thenthe power series converges uniformly on BR(0) and is hence continuous onBR(0), which includes the point z .
3.6.12. Example. The power series∑∞
k=0(−1)kxk converges uniformly on[−R,R] for any 0 < R < 1. However, it does not converge uniformly on(−1, 1). (Why?) However, the series defines a continuous function on theinterval (−1, 1).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 425 / 624
Differential Calculus in One Variable Real and Complex Power Series
Differentiability of Power Series3.6.13. Remark. The definition of the derivative can be extended tofunctions of a single complex variable; in particular, if Ω ⊂ C is an openset, we say that a function f : Ω→ C is differentiable at z ∈ Ω if thereexists a number f ′(z) ∈ C such that
f (z + h) = f (z) + f ′(z)h + o(h), h→ 0.
3.6.14. Theorem. The real or complex power series f (z) :=∑
akzk with
radius of convergence ϱ defines a differentiable function f : Bϱ(0)→ C.Furthermore,
f ′(z) =∑
kakzk−1 (3.6.2)
where the series has the same radius of convergence as f .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 426 / 624
Differential Calculus in One Variable Real and Complex Power Series
Differentiability of Power SeriesProof.The statement that the series (3.6.2) has the same radius of convergenceas f was established in Lemma 3.6.9. Define
g(z) :=∞∑k=1
kakzk−1
and set
f (z) = SN(z) + EN(z), SN(z) :=N∑
k=0
akzk , EN(z) :=
∞∑k=N+1
akzk .
Fix z0 ∈ Br (0) with r < ϱ. We want to show that for any ε > 0 thereexists a δ > 0 so that∣∣∣∣ f (z0 + h)− f (z0)
h− g(z0)
∣∣∣∣ < ε whenever |h| < δ.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 427 / 624
Differential Calculus in One Variable Real and Complex Power Series
Differentiability of Power Series
Proof (continued).This is enough to show that f is differentiable at z0 and f ′(z0) = g(z0).Fix ε > 0 and choose some δ such that |z0 + h| < r if h ∈ Bδ(z0).
f (z0 + h)− f (z0)
h− g(z0) =
(SN(z0 + h)− SN(z0)
h− S ′
N(z0)
)+(S ′N(z0)− g(z0)
)+
EN(z0 + h)− EN(z0)
h
We will show that each of the three terms on the right can be made smallso that the modulus of the difference on the left is less than ε. We startwith ∣∣∣∣EN(z0 + h)− EN(z0)
h
∣∣∣∣ ≤ ∞∑k=N+1
|ak ||h|∣∣(z0 + h)k − zk0
∣∣.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 428 / 624
Differential Calculus in One Variable Real and Complex Power Series
Differentiability of Power Series
Proof (continued).Using that for k ∈ N \ 0 and a, b ∈ C,
ak − bk = (a− b)k−1∑i=0
aibk−1−i
and |z0|, |z0 + h| < r , we have∣∣∣∣EN(z0 + h)− EN(z0)
h
∣∣∣∣ ≤ ∞∑k=N+1
k|ak |rk−1. (3.6.3)
Since the series∑
k |ak |rk converges, we can choose N1 large enough sothat the right-hand side of (3.6.3) is less than ε/3 for N > N1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 429 / 624
Differential Calculus in One Variable Real and Complex Power Series
Differentiability of Power Series
Proof (continued).Since lim
N→∞S ′N(z0) = g(z0) we can also find N2 > 0 such that
|S ′N(z0)− g(z0)| <
ε
3
for N > N2. Let us now fix N > max(N1,N2). Finally, because SN isdifferentiable (a polynomial), we can find a δ > 0 small enough so that∣∣∣∣SN(z0 + h)− SN(z0)
h− S ′
N(z0)
∣∣∣∣ < ε
3
for |h| < ε.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 430 / 624
Differential Calculus in One Variable Real and Complex Power Series
Some Applications
An application of our theoretical developments on power series is theability to define a decimal expansion for any real number a ∈ R. We wantto write
a = a0.a1a2a3 ...
where a0 ∈ Z and a1, a2, ... ∈ 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. We realize this bydefining
a = a0 +∞∑k=1
ak10−k .
How this can be used as a definition for the set of real numbers is sketchedout in Chapter 29, Exercise 2 of the textbook.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 431 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden Function
We will now construct the continuous and nowhere-differentiable Van derWaerden function. We use the “distance to the nearest integer” functiondefined in (2.3.1),
· : R→ [0, 1/2], x := min|x − ⌈x⌉|, |x − ⌊x⌋|.
The function is sketched below:
-4 -3 -2 -1 0 1 2 3
x
1
2
y
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 432 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden FunctionWe now define a sequence of functions (fn)n∈N by
fn(x) := 10−n10nx.
The graphs of the first few functions are plotted below:
0 1
x
1
2
y
y = f0 HxL
0 1
x
1
20
y
y = f1 HxL
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 433 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden Function
We then setW (x) :=
∞∑n=0
fn(x).
From the definition of the fn it is immediate that
supx∈R|fn(x)| =
1
2· 10−n
so by the Weierstraß M-Test 3.5.17 we see that W defines a continuousreal function on R.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 434 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden Function
An approximation to the function W is plotted below:
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 435 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden FunctionA closer look at the graph on the interval (0, 1/10) shows the fractalnature of the function:
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 436 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden Function3.6.15. Theorem. The Van der Waerden function
W : R→ R, W (x) =∞∑n=0
10−n10nx
is continuous but not differentiable at any point of its domain.
Proof.Due to the periodicity of W , it suffices to show the assertion for pointsa ∈ (0, 1]. We have already established the continuity of W , so it remainsto show that W is not differentiable anywhere. We will do this by fixinga ∈ (0, 1] and exhibiting a sequence (hm) with hm → 0 as m→∞ suchthat
limm→∞
W (a+ hm)−W (a)
hm(3.6.4)
does not exist.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 437 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden Function
Proof (continued).Suppose that a has the decimal expansion
a = 0.a1a2a3 ...
where am ∈ 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, m ∈ N \ 0. We then define
hm :=
−10−m if am = 4 or am = 9,
10−m otherwise.
Then
W (a+ hm)−W (a)
hm=
∞∑n=0
±10m−n(10n(a+ hm) − 10na
)(3.6.5)
where the “±” indicates a “+” or “−” sign, depending on m.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 438 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden FunctionProof (continued).The series in (3.6.5) is actually finite, because for n ≥ m the term10nhm ∈ N and then 10n(a+ hm) = 10na. When n < m we have
10na = A+ 0.an+1an+2 ... am−1amam+1 ... , (3.6.6)10n(a+ hm) = A+ 0.an+1an+2 ... am−1(am ± 1)am+1 ... (3.6.7)
where A ∈ N and the “±” again depends on m. Our choice of the minussign when am = 9 is essential to ensure that (3.6.5) holds.Now suppose that the non-integer part on the right of (3.6.6), 10na− A,is less than 1/2. Then also the non-integer part on the right of (3.6.7) willbe less than 1/2. (For this to be true even when m = n + 1, it is essentialthat we choose the minus sign for hm when am = 4.) It follows that
10n(a+ hm) − 10na = ±10n−m when n < m. (3.6.8)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 439 / 624
Differential Calculus in One Variable Real and Complex Power Series
The Van der Waerden FunctionProof (continued).It is not difficult to see that (3.6.8) also holds when 10na− A ≥ 1/2. Wehence have
10m−n(10n(a+ hm) − 10na
)= ±1 when n < m.
Then (3.6.5) becomes
W (a+ hm)−W (a)
hm=
m∑n=0
±1 =: Sm.
Now the sequence (Sm) satisfies
|Sm+1 − Sm| = 1,
for all m ∈ N. Thus, it is not a Cauchy sequence and can not converge. Itfollows that the limit (3.6.4) does not exist.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 440 / 624
Differential Calculus in One Variable The Exponential Function
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 441 / 624
Differential Calculus in One Variable The Exponential Function
The Exponential FunctionWe can now introduce one of the most fundamentally important functionsin calculus:3.7.1. Definition. We define the exponential function
exp: C→ C, exp z :=∞∑n=0
zn
n!. (3.7.1)
We similarly define its restriction to the real numbers,
exp: R→ R, exp x :=∞∑n=0
xn
n!. (3.7.2)
The series (3.7.1) converges for all z ∈ C (ϱ =∞), so it defines acontinuous function for all z ∈ C (x ∈ R).We will now investigate the various properties of the exponential function.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 442 / 624
Differential Calculus in One Variable The Exponential Function
The Exponential FunctionThe functional equation for the exponential function is
(exp x)(exp y) = exp(x + y), x , y ∈ C,
which can be shown using the Cauchy product as follows.
(exp x)(exp y) =( ∞∑n=0
xn
n!
)( ∞∑m=0
ym
m!
)=
∞∑n=0
((xkk!
)∗(ykk!
))n
=∞∑n=0
∑l+m=n
1
l!m!x lym
=∞∑n=0
1
n!
∑l+m=n
n!
l!m!x lym
=∞∑n=0
1
n!(x + y)n
= exp(x + y)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 443 / 624
Differential Calculus in One Variable The Exponential Function
The Exponential Function
Note that exp(0) = 1 from the series representation
exp(0) = 1 + 0 +02
2!+ · · · = 1.
We next show that exp(z) = 0 for all z ∈ C. Assume that exp(z0) = 0 forsome z0 ∈ C. Then
1 = exp(0) = exp(z0 − z0) = exp(z0) exp(−z0) = 0 · exp(−z0) = 0,
which is impossible. Since exp(0) = 1 and exp is continuous, we candeduce from the intermediate value theorem that
exp(x) > 0 for all x ∈ R.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 444 / 624
Differential Calculus in One Variable The Exponential Function
The Exponential FunctionWe can also calculate the derivative of the exponential function:
exp(x+h) = exp(x) exp(h) = exp(x)(1+h+o(h)
)= exp(x)+exp(x)h+o(h),
soexp′(x) = exp(x).
In fact, it turns out that exp is the only function satisfying the initial valueproblem
y ′(x) = y(x), y(0) = 1. (3.7.3)
Here the equation on the left is called a differential equation; its solution isa function y such that the derivative of y is equal to y . The secondequation is called an initial condition, because it specifies the value of y atfor some x0 ∈ R.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 445 / 624
Differential Calculus in One Variable The Exponential Function
The Exponential FunctionIf an initial value problem has a unique solution, it can of course be usedto define this solution. Thus we could use (3.7.3) to define exp, if we wereable to prove that there exists a unique solution to (3.7.3). Since we havealready defined exp, this is not really necessary. Nevertheless, it isinstructive to prove the uniqueness of the solution.Assume that two functions f , g both solve (3.7.3). Then(
f (x) exp(−x))′= (f ′(x)− f (x)) exp(−x) = 0 =
(g(x) exp(−x)
)′Now by Corollary 3.2.9 f (x) exp(−x) = g(x) exp(−x) + C for someC ∈ R. Setting x = 0 we see
C = f (0) exp(0)− g(0) exp(0) = 1 · 1− 1 · 1 = 0,
so f (x) exp(−x) = g(x) exp(−x) or (multiplying with exp(x)) f = g .Thus exp is the only function satisfying (3.7.3).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 446 / 624
Differential Calculus in One Variable The Exponential Function
The Exponential Function
The function exp: R→ R satisfies
exp′′ = exp′ = exp > 0,
so the real exponential function is increasing and convex. There are nolocal extrema. Furthermore,
exp(x) > 1 + x for x > 0,
so limx→∞
exp(x) =∞. Since
1 = exp(0) = exp(x − x) = exp(x) exp(−x) ⇔ exp(−x) = 1/ exp(x),
we see that limx→−∞
exp(x) = limx→∞
exp(−x) = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 447 / 624
Differential Calculus in One Variable The Exponential Function
The Exponential FunctionWe thus obtain the following plot of the exponential function:
x
1
y
y = exp HxL
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 448 / 624
Differential Calculus in One Variable The Exponential Function
Compound InterestWe now analyze a problem that will turn out to have an intrinsicconnection to the exponential function and the Euler number. Considerthe question of compound interest. If money is left in a bank savingsaccount for one year, the bank will return (1 + z) times the originalamount after the year is over. (For example, at 5% interest, z = 0.05.)However, if the money is withdrawn after half a year, tthe bank will return(1 + z/2) times the investment. If this money is then left with the bankfor another half-year, the total amount returned is(
1 +z
2
)2> 1 + z
times the original investment. Thus it seems advantageous to withdrawmoney after the nth part of a year and replace it in the savings accountimmediately. Repeating this n times, one earns(
1 +z
n
)ntimes the original investment.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 449 / 624
Differential Calculus in One Variable The Exponential Function
Compound InterestOne would perhaps like to choose n as large as possible. The questionarises: will the return approach a limit? The answer is given by thefollowing proposition:
3.7.2. Proposition. Let z ∈ C. Then limn→∞
(1 +
z
n
)n= exp z
In real life, banks have taken this into account, and publish the effectiveannual interest rate, also called the Annual Equivalent Rate (AER) whichtakes into account the compound interest.Proof.From the functional equation for the exponential function we have
exp(z)−(1 +
z
n
)n=(exp(zn
))n︸ ︷︷ ︸
=:an
−(1 +
z
n
)n︸ ︷︷ ︸
=:bn
.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 450 / 624
Differential Calculus in One Variable The Exponential Function
Compound Interest
Proof (continued).Since
an − bn = (a− b)n−1∑k=0
akbn−1−k
︸ ︷︷ ︸=:ϱ
we analyze first a− b:
a− b = expz
n−(1 +
z
n
)=
z
n
∞∑k=1
1
(k + 1)!
(zn
)k=
z
n· o(1)
as n→∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 451 / 624
Differential Calculus in One Variable The Exponential Function
Compound InterestProof (continued).Furthermore,
|ϱ| ≤n−1∑k=0
∣∣∣exp z
n
∣∣∣k ∣∣∣1 + z
n
∣∣∣n−1−k
≤n−1∑k=0
(exp|z |n
)k (1 +|z |n
)n−1−k
≤n−1∑k=0
(exp|z |n
)k (exp|z |n
)n−1−k
= n
(exp|z |n
)n−1
≤ n exp(|z |)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 452 / 624
Differential Calculus in One Variable The Exponential Function
The Euler NumberProof (continued).Hence ∣∣∣exp(z)− (1 + z
n
)n∣∣∣ ≤ |a− b| · |ϱ| ≤ |z |nn exp(|z |) · o(1)→ 0
as n→∞.
3.7.3. Definition. The number
e := exp(1)
is called the Euler number.It can be proven that e is irrational and transcendental (not the solution ofan algebraic equation). An approximate value is
e ≈ 2.7182818284590
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 453 / 624
Differential Calculus in One Variable The Exponential Function
The Euler Number3.7.4. Lemma. The Euler number is the monotonic limit of the sequencein Proposition 3.7.2, i.e.,(
1 +1
n
)n
e as n→∞.
Proof.By the binomial theorem,(
1 +1
n
)n
=n∑
k=0
n!
k!(n − k)!nk=
n∑k=0
1
k!
n
n
n − 1
n...
n − k + 1
n
=n∑
k=0
1
k!
(1− 1
n
)...
(1− k − 1
n
)
<
n+1∑k=0
1
k!
(1− 1
n + 1
)...
(1− k − 1
n + 1
)=
(1 +
1
n + 1
)n+1
.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 454 / 624
Differential Calculus in One Variable The Exponential Function
The Euler Number
3.7.5. Proposition. For any rational number q ∈ Q, eq = exp(q).
Proof.We first show that en = exp(n) for n ∈ N by induction. For n = 0 we havee0 = 1 = exp(0). Furthermore,
exp(n + 1) = exp(n) · exp(1) = en · e = en+1.
Moreover, for n ∈ N we have exp(n) exp(−n) = 1, so
exp(−n) = 1
exp(n)=
1
en=: e−n.
Thus the statement holds for all n ∈ Z.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 455 / 624
Differential Calculus in One Variable The Exponential Function
The Euler Number
Proof (continued).Now let x = p/q with p ∈ Z and q ∈ N \ 0. Then(
expp
q
)q
= exp p = ep
and thusexp
p
q=
q√ep =: ep/q.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 456 / 624
Differential Calculus in One Variable The Exponential Function
Real Exponents, LogarithmThe exponential function is continuous (even C∞) on R, and for rational xcoincides with ex . We do not yet have a definition for ex when x is real,so it is logical to define
ex := exp x for x ∈ R
We say that exp x is a continuous extension of ex to the real numbers. Itis automatically the only such extension.We have seen that the function exp: R→ R+ (R+ := x ∈ R : x > 0) isincreasing and hence bijective. Thus there exists an inverse function,which we call the (natural) logarithm and denote by ln : R+ → R.
3.7.6. Lemma. The logarithm satisfies the functional equation
ln(u · v) = ln(u) + ln(v).
for all u, v ∈ R+
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 457 / 624
Differential Calculus in One Variable The Exponential Function
Real Exponents, LogarithmProof.Let u = ex , v = ey . Then
ln(u · v) = ln(ex · ey ) = ln(ex+y ) = x + y = ln(u) + ln(v).
In a similar manner we can use the properties of the exponential functionto prove
ln1
u= − ln(u), ln(up/q) =
p
qln(u) (p ∈ Z, q ∈ N \ 0).
This motivates the following definition:3.7.7. Definition. For all a > 0 and x ∈ R we define
ax := ex ln(a).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 458 / 624
Differential Calculus in One Variable The Exponential Function
Real Exponents, LogarithmWe immediately obtain
ln ax = x ln a, (ax)y = e ln[(ax )y ] = eyx ln a = axy , ax+y = axay
and so on for all a > 0 and x , y ∈ R.3.7.8. Remark. Since
ex = 1 + x +x2
2!+ ... ,
we see that ex > xn/n! for any n and x > 0. In particular,limx→∞
ex/xn =∞ for any n ∈ N:
ex
xn>
1
(n + 1)!
xn+1
xn=
x
(n + 1)!x→∞−−−→∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 459 / 624
Differential Calculus in One Variable The Exponential Function
Real Exponents, Logarithm3.7.9. Theorem. For any α > 0 we have
limx→∞
ln x
xα= 0, lim
x0xα ln x = 0.
Proof.Letting y = ln x , we have
limx→∞
ln x
xα=
1
αlimy→∞
αy
eαy=
1
αlimt→∞
t
et= 0
Furthermore, with y = 1/x ,
limx0
xα ln x = limy→∞
1
yαln
1
y= − lim
y→∞
ln y
yα= 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 460 / 624
Differential Calculus in One Variable The Exponential Function
Real Exponents, Logarithm
3.7.10. Corollary. Since the exponential function is continuous, we see thatfor n ∈ N
n√n = n1/n = e
1nln n n→∞−−−→ e0 = 1.
Finally, we note that by the inverse function theorem the logarithm isdifferentiable on R+ and
(ln y)′ =1
(ex)′|x=ln y=
1
ex |x=ln y=
1
y.
Hence ln ∈ C∞(R+).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 461 / 624
Differential Calculus in One Variable The Exponential Function
The LogarithmWe thus obtain the following plot of the logarithmic function:
1x
y
y = ln HxL
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 462 / 624
Differential Calculus in One Variable The Trigonometric Functions
Differentiation of Real Functions
Properties of Differentiable Real Functions
Vector Spaces
Sequences of Real Functions
Series
Real and Complex Power Series
The Exponential Function
The Trigonometric Functions
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 463 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Trigonometric FunctionsWe first note that we define
ez := exp z for z ∈ C.
We then introduce the well-known trigonometric cosine and sine functionscos, sin : R→ R by
cos(x) := Re e ix =e ix + e−ix
2=
∞∑k=0
(−1)k x2k
(2k)!,
sin(x) := Im e ix =e ix − e−ix
2i=
∞∑k=0
(−1)k x2k+1
(2k + 1)!.
The equatione ix = cos(x) + i sin(x)
is sometimes called the Euler relation.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 464 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Trigonometric FunctionsWe can immediately prove some useful properties of the sine and cosine: itfollows from the definition that
cos(0) = 1, sin(0) = 0, sin2(x) + cos2(x) = |e ix |2 = e ixe−ix = 1.
From
e i(x+y) = e ixe iy = (cos(x) + i sin(x))(cos(y) + i sin(y))
=(cos(x) cos(y)− sin(x) sin(y)
)+ i(cos(x) sin(y) + sin(x) cos(y)
)= cos(x + y) + i sin(x + y)
we obtain the addition relations for the sine and cosine:
cos(x + y) = cos(x) cos(y)− sin(x) sin(y),
sin(x + y) = cos(x) sin(y) + sin(x) cos(y).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 465 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Trigonometric FunctionsWe can verify immediately that the sine and cosine functions aredifferentiable, since
cos(x + h) = cos(x) cos(h)− sin(x) sin(h)
= cos(x)(1− h2/2! + o(h2))− sin(x)(h − h3/3! + o(h3))
= cos(x)− sin(x)h + o(h)
and
sin(x + h) = cos(x) sin(h) + sin(x) cos(h)
= cos(x)(h − h3/3! + o(h3)) + sin(x)(1− h2/2! + o(h2))
= sin(x) + cos(x)h + o(h)
so we haved
dxcos x = − sin x ,
d
dxsin x = cos x
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 466 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Trigonometric Functions
It follows from these formulas that the sine and cosine functions bothsatisfy the differential equation
y ′′ + y = 0,
where the sine function also satisfies the initial condition y(0) = 0 and thecosine function satisfies y ′(0) = 0. In fact,
y(x) = a cos x + b sin x (3.8.1)
satisfies the general initial value problem
y ′′ + y = 0, y(0) = a, y ′(0) = b for any a, b ∈ R. (3.8.2)
We will show that (3.8.1) is in fact the only solution to (3.8.2).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 467 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Trigonometric Functions
3.8.1. Lemma. Suppose that y : R→ R is twice differentiable and that
y ′′ + y = 0, y(0) = 0, y ′(0) = 0. (3.8.3)
Then y(x) = 0 for all x ∈ R.
Proof.Multiplying (3.8.3) with y ′, we obtain
0 = y ′y ′′ + yy ′ =1
2
((y ′)2 + y2
)′,
so (y ′)2 + y2 is constant. Since y(0) = y ′(0) = 0, it follows that(y ′(x))2 + y(x)2 = 0 for all x , so y(x) = 0 for all x ∈ R.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 468 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Trigonometric Functions3.8.2. Theorem. Suppose that y : R→ R is twice differentiable and that forsome a, b ∈ R
y ′′ + y = 0, y(0) = a, y ′(0) = b. (3.8.4)Then
y(x) = a cos x + b sin x .
Proof.Let f be any solution to (3.8.4) and g(x) = f (x)− a cos x − b sin x .Theng solves
g ′′ + g = 0, g(0) = 0, g ′(0) = 0.
By Lemma 3.8.1, g(x) = 0 for all x , so
f (x) = a cos x + b sin x .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 469 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Trigonometric FunctionsWe can hence conclude:
If y ′′ + y = 0 and y(0) = 1, y ′(0) = 0, then y(x) = cos x , If y ′′ + y = 0 and y(0) = 0, y ′(0) = 1, then y(x) = sin x .
This property will be quite useful later.Now consider the cosine function. We know that cos(0) = 1 > 0 andcos2 x + sin2 x = 1. Therefore the cosine function has a local maximum atx = 0. Furthermore, at least for small ε > 0, cos x > 0 on [0, ε). Since(sin x)′ = cos x , we see that sin x is strictly increasing on [0, ε) andsin x > 0 on (0, ε). Since
cos′(x) = − sin(x),
the cosine is decreasing on [0, ε) and will continue to decrease untilsin x = 0. From cos2 x + sin2 x = 1 we see that if sin x = 0, then|cos x | = 1. Since the cosine is less than 1 and decreasing on (0, ε), it willcontinue to decrease until cos x = −1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 470 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Number π
Hence there exists at least one x0 > 0 such that cos x0 = 0. We define
π := 2 ·minx ∈ R+ : cos(x) = 0.
(Exercise: why is this a minimum and not just an infimum?) It will turnout that π is an irrational and transcendental number which is identical tothe diameter of the unit circle. Numerically,
π ≈ 3.141 592 653 589 793 238 462 643 383 279 502 189.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 471 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Number πWe thus have cos(π/2) = 0 and |sin(π/2)| = 1. Since sin(0) = 0 and thesine function is increasing until cos x = 0, i.e., for 0 ≤ x ≤ π/2, we seethat sin(π/2) = 1. It follows thatcos(π) = cos(π/2 + π/2) = cos(π/2) cos(π/2)− sin(π/2) sin(π/2) = −1,sin(π) = cos(π/2) sin(π/2) + sin(π/2) cos(π/2) = 0,
cos(2π) = cos(π) cos(π)− sin(π) sin(π) = 1,
sin(2π) = cos(π) sin(π) + sin(π) cos(π) = 0.
This impliescos(x + 2π) = cos(x) cos(2π)− sin(x) sin(2π) = cos(x),
sin(x + 2π) = cos(x) sin(2π) + sin(x) cos(2π) = sin(x),
so the sine and cosine functions are periodic with period 2π. Furthermore,sin(x + π/2) = cos(x) sin(π/2) + sin(x) cos(π/2) = cos x ,
so the functions are simply translations of each other.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 472 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Graph of Sine and CosineLastly, we note that directly from the definition one sees that
sin(−x) = − sin(x), cos(−x) = cos(x)
for all x ∈ R.We are now able to graph the functions:
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 473 / 624
Differential Calculus in One Variable The Trigonometric Functions
Trigonometric Interpretation of Sine and CosineOf course, the sine and cosine functions are already familiar fromelementary trigonometry, where they were defined in right-angled trianglesas
COS θ :=Adjacent sideHypotenuse , SIN θ :=
Opposite sideHypotenuse , 0 <θ < 90.
In order to prove that sin x = SIN θ and cos x = COS θ for suitablex = x(θ), one proceeds as follows:
1. Using geometrical methods, one proves that for fixed length of sidesof the triangle,
limθ→0
COS θ = 1, limθ→0
SIN θ = 0,
and thus extends the definition of the geometric sine and cosine tozero accordingly.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 474 / 624
Differential Calculus in One Variable The Trigonometric Functions
Trigonometric Interpretation of Sine and Cosine
2. Using geometrical methods, one proves that
COS(θ + ϕ) = COS θ COSϕ− SIN θ SINϕ,
SIN(θ + ϕ) = COS θ SINϕ+ SIN θ COSϕ.
whenever 0 ≤ θ,ϕ, θ + ϕ < 90.3. Using geometrical methods, one further shows (See Spivak, Ch. 15.
Ex. 27) that
limθ→0
1− COS θ
θ= 0, lim
θ→0
θ − SIN θ
θ= 0.
4. From the above, one deduces (COS θ)′ = −SIN θ and(SIN θ)′ = COS θ.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 475 / 624
Differential Calculus in One Variable The Trigonometric Functions
Trigonometric Interpretation of Sine and Cosine6. It follows that SIN θ and COS θ satisfy
(SIN θ)′′ + (SIN θ) = 0 and SIN(0) = 1, SIN′(0) = 0, (COS θ)′′ + (COS θ) = 0 and COS(0) = 0, COS′(0) = 1.
Since the solution to the corresponding initial value problems are unique,COS θ = cos θ, SIN θ = sin θ. Now θ is a number between 0 and 90, and
limθ→90
SIN θ = 1, limθ→90
COS θ = 0.
A degree θ is defined as the 360th part of the circumference of a circle;this is clearly an arbitrary definition, and we should find what therelationship between θ and x is, where x is scaled so that lim
x→π/2sin x = 1.
It is clear that θ = 360x/(2π). This is just a linear re-scaling of thecoordinate plane to accomodate the “artificial” idea of dividing a circleinto 360 parts. We will dispense with this and always assume that ourcoordinates are not rescaled in such a way.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 476 / 624
Differential Calculus in One Variable The Trigonometric Functions
More Trigonometric FunctionsWe further define trigonometric functions such as the tangent, cotangent,secant and cosecant by
tan x :=sin x
cos x, cot x :=
cos x
sin x, sec x :=
1
cos x, csc x :=
1
sin x,
Except for the tangent, we will not use these frequently.
-5 Π
2-2 Π -
3 Π
2-Π -
Π
2
Π
2Π
3 Π
22 Π
x
y
y = tan HxL
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 477 / 624
Differential Calculus in One Variable The Trigonometric Functions
Inverse Trigonometric FunctionsThe inverses to the various trigonometric functions are called the arc sine,arc cosine arc tangent etc. and denoted
arcsin x , arccos x , arctan x ,
respectively. In each case, the ranges of the inverses have to be chosencarefully, as the original functions are not injective for all x . For example,one commonly defines
arcsin : [−1, 1]→ [−π/2,π/2],arccos : [−1, 1]→ [0,π],
arctan: R→ (−π/2,π/2),
but other definitions are of course possible and sometimes useful (inparticular, for the arc tangent).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 478 / 624
Differential Calculus in One Variable The Trigonometric Functions
Inverse Trigonometric FunctionsWe can use the inverse function theorem to calculate the deivatives of theinverse trigonometric functions. For this, we first note that
(cos x)′ = − sin x = −√
1− cos2 x ,
(sin x)′ = cos x =√
1− sin2 x ,
(tan x)′ =
(sin x
cos x
)′=
1
cos2 x= 1 + tan2 x .
Then
(arccos x)′ =1
(cos y)′|y=arccos x=
−1√1− cos2 y |y=arccos x
=−1√1− x2
,
(arcsin x)′ =1
(sin y)′|y=arcsin x=
1√1− sin2 y |y=arcsin x
=1√
1− x2,
(arctan x)′ =1
(tan y)′|y=arctan x=
1
1 + tan2 y |y=arctan x=
1
1 + x2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 479 / 624
Differential Calculus in One Variable The Trigonometric Functions
Some Further Properties of the Sine FunctionThe sine (and cosine) functions have some interesting properties whichallow them to be used as study cases for continuity and differentiability.First, note that
limx→0
sin1
xdoes not exist. This can be proven similarly as for the periodic function ψstudied in the section on continuty.
x
-1
1
y
y = sin H1xL
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 480 / 624
Differential Calculus in One Variable The Trigonometric Functions
Some Further Properties of the Sine FunctionNow it is easy to prove that the function
f (x) =
x sin 1
x x = 0
0 x = 0
is continuous at x = 0.
x
-1
1
y
y = x sin H1xL
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 481 / 624
Differential Calculus in One Variable The Trigonometric Functions
Properties of the Sine Function
The function
f (x) =
x2 sin 1
x x = 0
0 x = 0
is even differentiable: For x = 0,
f ′(x) = 2x sin1
x− cos
1
x.
Furthermore, f (0 + h) = f (h) = f (0) + o(h), so f ′(0) = 0. However, f ′ isnot continuous at 0, since lim
x→0f ′(x) does not exist.
We thus finally have an example of a function that is differentiable but notcontinuously differentiable!
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 482 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Most Beautiful Theorem
Since e iθ = cos θ + i sin θ, we observe that e iπ = −1, or
e iπ + 1 = 0.
This wonderful formula, which relates the most important constants incalculus with each other has been voted the most beautiful theorem ofmathematics in a survey by the magazine The Mathematical Intelligencer.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 483 / 624
Differential Calculus in One Variable The Trigonometric Functions
Polar Coordinates in the Complex PlaneAn important observation is that every complex number z (except 0) inthe complex plane has a unique representation through the distance of zto the origin and the angle it makes with the positive real axis:
-4 -3 -2 -1 1 2 3 4Re z
-2
-1
1
2
3
4
Im z
z
r = ÈzÈ
Θ = arctan
Im z
Re z
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 484 / 624
Differential Calculus in One Variable The Trigonometric Functions
Polar Coordinates in the Complex Plane
Thus every z ∈ C \ 0 uniquely determines a pair (r , θ) given by
r = |z | =√
(Re z)2 + (Im z)2, θ =
arctan Im z
Re z , Re z > 0,
π + arctan Im zRe z , Re z < 0.
Conversely, given (r , θ) ∈ R+ × [0, 2π) we obtain a unique number
z = r cos θ + ir sin θ = r · e iθ ∈ C \ 0.
We call this the polar representation of a complex number z . While thecartesian representation
z = a+ bi
for a, b ∈ R is useful for addition and subtraction, the polar representationis more suited to multiplication, division and exponentiation.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 485 / 624
Differential Calculus in One Variable The Trigonometric Functions
Polar Coordinates in the Complex Plane
The product and quotient of two complex numbers z1, z2 ∈ C \ 0 can beexpressed through their polar representation as
z1 · z2 = (r1eiθ1)(r2e
iθ2) = r1r2ei(θ1+θ2),
z1z2
=r1e
iθ1
r2e iθ2=
r1r2e i(θ1−θ2).
In particular, we see that
zn = rne inθ for n ∈ Z.
This allows us to obtain the nth roots of a complex number, somethingthat was not easily done for complex numbers in cartesian representation.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 486 / 624
Differential Calculus in One Variable The Trigonometric Functions
Polar Coordinates in the Complex PlaneFor 0 = z = re iθ we clearly have an nth root given by
z1/n = r1/ne iθ/n,
since (z1/n)n = z . We would, however, expect to be able to find n distinctroots. The key to finding them is to note that
e i(θ+2π) = cos(θ + 2π) + i sin(θ + 2π) = cos θ + i sin θ = e iθ.
Since z = re iθ = re i(θ+2π), another root is given by
z1/n = r1/ne i(θ/n+2π/n).
Similarly, since z = re iθ = re i(θ+k2π) for k ∈ Z, we have n nth roots
r1/ne iθ/n, r1/ne i(θ+2π)/n, r1/ne i(θ+2·2π)/n, ... , r1/ne i(θ+(n−1)·2π)/n.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 487 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Hyperbolic Trigonometric FunctionsWe conclude this part by introducing a final family of functions, thehyperbolic trigonometric functions:We define the hyperbolic sine and hyperbolic cosine, sinh, cosh: C→ C, by
sinh(x) :=ex − e−x
2, cosh(x) :=
ex + e−x
2
A comparison with the definition of the sine and cosine functionsimmediately shows that
sinh(ix) = i sin(x), cosh(ix) = cos x .
From the definition, we see that
cosh(x) + sinh(x) = ex .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 488 / 624
Differential Calculus in One Variable The Trigonometric Functions
The Hyperbolic Trigonometric Functions
Similarly to their trigonometric counterparts, we have
cosh(−x) = cosh(x), sinh(−x) = − sinh(x),
the hyperbolic cosine and sine are even and odd functions, respectively.We also see that
cosh2(x)− sinh2(x) =
(ex + e−x
2
)2
−(ex − e−x
2
)2
=1
4
(e2x + 2 + e−2x − e2x + 2− e−2x
)= 1
This last relation explains the origin of the term hyperbolic: if x = cosh tand y = sinh t, then x2 − y2 = 1, which is the equation for a hyperbola.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 489 / 624
Differential Calculus in One Variable The Trigonometric Functions
Second Midterm Exam
The preceding material completes the second third of the course material.It encompasses everything that will be the subject of the Second MidtermExam.The exam date and time will be announced on SAKAI.No calculators or other aids will be permitted during the exam. A sampleexam with solutions has been uploaded to SAKAI. Please study it carefully,including the instructions on the cover page.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 490 / 624
Integral Calculus in One Variable
Part IV
Integral Calculus in One Variable
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 491 / 624
Integral Calculus in One Variable
Notions of Integration
Practical Integration
Applications of Integration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 492 / 624
Integral Calculus in One Variable Notions of Integration
Notions of Integration
Practical Integration
Applications of Integration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 493 / 624
Integral Calculus in One Variable Notions of Integration
Areas under CurvesWe now turn to a question that seems unrelated to our previousinvestigations: How can we determine the area between the graph of afunction and the abscissa?
1x
1
y
y x2
Area = ?
The sketch at left shows the graphof f : [0, 1] → R, f (x) = x2. Whileit is possible to determine the areabounded by the graph through ele-mentary geometrical means, we aimto find a general procedure so thatwe can discuss general functions, e.g.,f (x) = ex .
At the outset, we have not even defined what an “area” is supposed to be,so we must first lay some groundwork.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 494 / 624
Integral Calculus in One Variable Notions of Integration
Areas under CurvesSuppose f : [a, b]→ R is a constant function, i.e., there exists some c ∈ Rsuch that f (x) = c for all x ∈ [a, b]. We would then naturally consider the“area bounded by the abscissa and the graph of f ” to be equal toc · (b − a):
a bx
c
y
y c
Area c Hb - aL
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 495 / 624
Integral Calculus in One Variable Notions of Integration
Areas under CurvesWe can generalize this to the case where f : [a, b]→ R is “piecewiseconstant”:
a a1 bx
c1
c2
y
y f HxL
Area c2 Hb - a1L + Ha1 - aL c1
For our intuitive concept of what the “area” is, the value of f (a1) isirrelevant: we could have f (a1) = c1 or f (a1) = c2 or some completelydifferent value. The “area” should remain the same.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 496 / 624
Integral Calculus in One Variable Notions of Integration
Step FunctionsOur first step is to give a proper definition of a “piecewise constant”function and the area “under” its graph.4.1.1. Definition. A partition P of an interval [a, b] ⊂ R is a tuple ofnumbers P = (a0, ... , an) such that
a = a0 < a1 < ... < an = b.
4.1.2. Definition. A function φ : [a, b]→ R is called a step function withrespect to a partition P = (a0, ... , an) if there exist numbers yi ∈ R,i = 1, ... , n, such that
φ(t) = yi whenever ai−1 < t < ai (4.1.1)
for i = 1, ... , n. We denote the set of all step functions by Step([a, b])
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 497 / 624
Integral Calculus in One Variable Notions of Integration
Step Functions
4.1.3. Remarks.1. We don’t care how φ is defined at the points ai !2. We call φ simply a step function if there exists some partition P with
respect to which it is a step function.
4.1.4. Example. The function
φ : [0, 1]→ R, f (x) =
0 0 ≤ x < 1
2 ,12 x = 1
2 ,
1 12 < x ≤ 1,
is a step function. In particular, it is a step function with respect toP = (0, 12 , 1) or any other suitable partition, e.g., P = (0, 12 ,
34 , 1).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 498 / 624
Integral Calculus in One Variable Notions of Integration
Sum of Step Functions4.1.5. Definition. Let P be a partition of an interval [a, b]. We call apartition R of [a, b] a sub-partition of P if R ⊃ P, where we naturallyregard P and R as sets instead of tuples.
4.1.6. Remark. If φ : [a, b]→ R is a step function with respect to apartition P of [a, b], then it as also a step function with respect to anysub-partition R of P.We can define the sum of two step functions as follows:Given a step function φ (w.r.t. a partition P) and a step function ψ (w.r.t.a partition Q), we first define the partition R := P ∪ Q of [a, b] whichcontains all points of P and Q, ordered appropriately from smallest tolargest.Then R ⊃ P and R ⊃ Q, i.e., R is a sub-partition of both P and Q.Hence φ and ψ are step functions with respect to R and we can defineφ+ ψ by adding φ(x) and ψ(x) on each interval induced by R = P ∪ Q.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 499 / 624
Integral Calculus in One Variable Notions of Integration
Sum of Step Functions
Since multiplying step functions by a number does not present anyproblems, we have proven the following:4.1.7. Proposition. If φ,ψ are step functions on [a, b] and λ,µ ∈ R, thenλφ+ µψ is a step function on [a, b].
4.1.8. Remark. This also shows that Step([a, b]) is a vector space.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 500 / 624
Integral Calculus in One Variable Notions of Integration
Integral of a Step Function4.1.9. Definition and Theorem. Let φ : [a, b]→ R be a step function ofthe form (4.1.1) with respect to some partition P. Then
IP(φ) := (a1 − a0)y1 + · · ·+ (an − an−1)yn (4.1.2)is independent of the choice of the partition P. We define the integral of fas ∫ b
aφ := IP(φ)
for any partition P with respect to which φ is a step function.
4.1.10. Remark. It is common to use the notation∫ b
aφ(x) dx :=
∫ b
aφ.
The motivation underlying this notation will remain mysterious for now.However, it is an intuitive notation in the same sense that writinglimx→x0 f (x) is more intuitively friendly that limx0 f .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 501 / 624
Integral Calculus in One Variable Notions of Integration
Integral of a Step Function
Proof.Let φ be a step function w.r.t. a partition P and also w.r.t. some otherpartition Q. Let R = P ∪ Q be a common sub-partition of both P and Q.We will show that
IP(φ) = IR(φ) = IQ(φ).
Let c ∈ R \ P and suppose that ak < c < ak+1 for some k = 1, ... , n. Weadd c to P, denoting the resulting partition by
Pc = (a0, ... , ak , c, ak+1, ... , an).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 502 / 624
Integral Calculus in One Variable Notions of Integration
Integral of a Step Function
Proof (continued).Then φ(t) = yk for ak < t < ak+1, so
IP(φ) = (a1 − a0)y1 + ... + (ak+1 − ak)yk + ... + (an − an−1)yn
= (a1 − a0)y1 + ... + (c − ak)yk + (ak+1 − c)yk
+ ... + (an − an−1)yn
= IPc (φ).
Repeating this procedure, we eventually have IP(φ) = IR(φ). In the sameway we show that IQ(φ) = IR(φ).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 503 / 624
Integral Calculus in One Variable Notions of Integration
Properties of the Integral of a Step FunctionIt is easy to see that ∣∣∣∣∫ b
aφ
∣∣∣∣ ≤ (b − a) supx∈[a,b]
|φ(x)|
for a step function φ, since, given a partition P,
|IP(φ)| ≤n∑
i=1
(ai − ai−1) · |yi | ≤ max1≤i≤n
|yi |n∑
i=1
(ai − ai−1)
= (b − a) supx∈[a,b]
|φ(x)|. (4.1.3)
Furthermore, it is not difficult to show that∫ b
a(λφ+ µψ) = λ
∫ b
aφ+ µ
∫ b
aψ (4.1.4)
for step functions φ,ψ and λ,µ ∈ R.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 504 / 624
Integral Calculus in One Variable Notions of Integration
Bounded Functions4.1.11. Definition. Let I ⊂ R be an interval. We say that a functionf : I → R is bounded if
∥f ∥∞ := supx∈I|f (x)| <∞. (4.1.5)
The set of all bounded functions is denoted L∞(I ).
4.1.12. Remarks.1. It is easy to check that L∞(I ) (endowed with point-wise addition and
multiplication) is a vector space and that (4.1.5) defines a norm onL∞(I ).
2. If I = [a, b] is a closed interval, every continuous function is bounded,so C ([a, b]) ⊂ L∞([a, b]).
3. If I = [a, b] is a closed interval, any step function on [a, b] takes ononly a finite number of real values, so is automatically bounded.Hence, Step([a, b]) ⊂ L∞([a, b]).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 505 / 624
Integral Calculus in One Variable Notions of Integration
Area under the Graph of Bounded Functions
We restrict ourselves (for now) to bounded functions on an interval [a, b].The area under the graph of a bounded function (if it exists) can then notbe larger than the product of (a− b) and the bound of the function, i.e., itmust be finite.However, not all bounded functions may have an intuitive “area” undertheir graph. As an example, consider the Dirichlet function
χ : [0, 1]→ R, χ(x) =
1 for x rational,0 for x irrational.
It is not clear, what the value of the “area” might be: 1? 0? undefined?Given such an example, we will have to approach the concept of area verycarefully.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 506 / 624
Integral Calculus in One Variable Notions of Integration
Area under the Graph of Bounded FunctionsSuppose a function f ∈ L∞([a, b]) can be approximated uniformly by stepfunctions, i.e., it is the uniform limit of a sequence of step functions. Thenwe might hope that the area under the step functions approaches the areaunder f :
a bx
y
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 507 / 624
Integral Calculus in One Variable Notions of Integration
Regulated Functions
It turns out that this approach works for functions that can beapproximated uniformly by step functions. We give the class of thesefunctions a specific name:4.1.13. Definition. A function f ∈ L∞([a, b]) is said to be regulated if forany ε > 0 there exists a step function φ such that
supx∈[a,b]
|f (x)− φ(x)| < ε. (4.1.6)
Equivalently, a regulated function f is a function such that there exists asequence of step functions (φn) which converges uniformly to f . (To seethis, we simply define φn as the step function satisfying (4.1.6) withε = 1/n.)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 508 / 624
Integral Calculus in One Variable Notions of Integration
Example of a Regulated Function4.1.14. Example. The function f : [0, 1]→ R, f (x) = x2 is regulated. Tosee this, choose the sequence of step functions (φn) on [0, 1] given by
φn(x) =k2
n2for k − 1
n< x ≤ k
n, k = 1, ... , n,
and φn(0) = 0. Then
∥φn − f ∥∞ = supx∈[0,1]
|φn(x)− f (x)| = maxk=1,...,n
supk−1n
<x≤ kn
|φn(x)− f (x)|
= maxk=1,...,n
supk−1n
<x≤ kn
∣∣∣∣k2n2 − x2∣∣∣∣
= maxk=1,...,n
∣∣∣∣k2n2 − (k − 1)2
n2
∣∣∣∣ = maxk=1,...,n
∣∣∣∣2k − 1
n2
∣∣∣∣ = 2n − 1
n2.
It follows that ∥φn − f ∥∞ → 0 as n→∞, so f ∈ Reg([a, b]).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 509 / 624
Integral Calculus in One Variable Notions of Integration
Continuous Functions are Regulated4.1.15. Theorem. The continuous functions on the interval [a, b] areregulated, i.e., C ([a, b]) ⊂ Reg([a, b]).
Proof.We need to show that any continuous function on the interval [a, b] canbe regarded as the uniform limit of step functions. Let f ∈ C ([a, b]) begiven. By Theorem 2.5.24 f is also uniformly continuous, i.e.,
∀ε>0∃
δ>0∀
x ,y∈[a,b]|x − y | < δ ⇒ |f (x)− f (y)| < ε. (4.1.7)
Now fix ε > 0 and choose some δ > 0 according to (4.1.7). Define apartition P = (a0, ... , an) of [a, b] such that aj − aj−1 = (a− b)/n < δ,j = 1, ... , n, by choosing n large enough. Define φ : [a, b]→ R byφ(b) = f (b) and
φ(t) = f (ai−1) for ai−1 ≤ t < ai , i = 1, ... , n.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 510 / 624
Integral Calculus in One Variable Notions of Integration
Continuous Functions are Regulated
Proof (continued).Then φ ∈ Step([a, b]) and
∥f − φ∥∞ = supt∈[a,b]
|f (t)− φ(t)|
= max1≤i≤n
supt∈[ai−1,ai ]
|f (t)− φ(t)|
= max1≤i≤n
supt∈[ai−1,ai ]
|f (t)− f (ai−1)| < ε
Thus for every ε > 0 we can find a function φ with ∥f − φ∥∞ < ε.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 511 / 624
Integral Calculus in One Variable Notions of Integration
Piecewise Continuous Functions
4.1.16. Definition. Let I ⊂ R be an interval, a1, ... , an ⊂ I a finite set ofpoints and f : I → C a function such that
(i) f : I \ a1, ... , an → C is continuous and(ii) For all ai , i = 1, ... , n, both
limxai
f (x) and limxai
f (x)
exist.Then f is said to be piecewise continuous. We denote the set of piecewisecontinuous functions on I by PC(I ).
4.1.17. Theorem. The piecewise continuous functions on the interval [a, b]are regulated, i.e., PC([a, b]) ⊂ Reg([a, b]).The proof is left to the assignments.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 512 / 624
Integral Calculus in One Variable Notions of Integration
Integral of a Regulated FunctionWe now define the integral of a regulated function as follows:4.1.18. Definition and Theorem. Let f ∈ Reg([a, b]) and (φn) a sequencein Step([a, b]) converging uniformly to f . Then the regulated integral of f ,defined by ∫ b
af := lim
n→∞
∫ b
aφn (4.1.8)
exists and does not depend on the choice of (φn).
Proof.We first prove that the limit (4.1.8) exists. Since (φn) converges to funiformly, (φn) is a Cauchy sequence in C ([a, b]) with the ∥ · ∥∞ norm.This implies that the sequence (
∫ ba φn) is Cauchy, since∣∣∣∣∫ b
aφn −
∫ b
aφm
∣∣∣∣ = ∣∣∣∣∫ b
a(φn − φm)
∣∣∣∣ ≤ (b − a)∥φn − φm∥∞.
by (4.1.4) and (4.1.3).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 513 / 624
Integral Calculus in One Variable Notions of Integration
Integral of a Regulated FunctionProof.Since the real numbers are complete, this implies that the sequence(∫ ba φn) converges. Hence, the limit (4.1.8) exists.
Furthermore, let (φn) and (ψn) be two sequences of step functionsconverging uniformly to f . Then∣∣∣∣∫ b
aφn −
∫ b
aψn
∣∣∣∣ = ∣∣∣∣∫ b
a(φn − ψn)
∣∣∣∣≤ (b − a)∥φn − ψn∥∞≤ (b − a)∥φn − f ∥∞ + (b − a)∥ψn − f ∥∞n→∞−−−→ 0.
This shows that both sequences yield the same limit (4.1.8), i.e., theintegral of f does not depend on the choice of the sequence of stepfunctions.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 514 / 624
Integral Calculus in One Variable Notions of Integration
Example of a Regulated Integral4.1.19. Example. We have seen in Example 4.1.14 that the functionf : [0, 1]→ R, f (x) = x2 is the uniform limit of the sequence of stepfunctions (φn) on [0, 1] given by
φn(x) =k2
n2for k − 1
n< x ≤ k
n, k = 1, ... , n,
and φn(0) = 0. Now it is easy to see that the integral of the stepfunctions, by Definition 4.1.9, is given by∫ 1
0φn =
n∑k=1
1
n· k
2
n2=
1
n3
n∑k=1
k2 =n(n + 1)(2n + 1)
6n3.
It follows that ∫ 1
0f = lim
n→∞
∫ 1
0φn =
1
3.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 515 / 624
Integral Calculus in One Variable Notions of Integration
Properties of the Regulated IntegralWe can verify that properties (4.1.4) and (4.1.3) also hold for theregulated integral. Let f , g ∈ Reg([a, b]) and (φn), (ψn) be sequences inStep([a, b]) converging uniformly to f and g , respectively. Then∣∣∣∣∫ b
af
∣∣∣∣ = ∣∣∣∣ limn→∞
∫ b
aφn
∣∣∣∣ = limn→∞
∣∣∣∣∫ b
aφn
∣∣∣∣ ≤ (b − a) limn→∞
supx∈[a,b]
|φn(x)|
≤ (b − a) supx∈[a,b]
|f (x)|. (4.1.9)
In the same way,
λ
∫ b
af + µ
∫ b
ag = λ lim
n→∞
∫ b
aφn + µ lim
n→∞
∫ b
aψn = lim
n→∞
∫ b
a(λφn + µψn)
=
∫ b
a(λf + µg)
since the sequence (λφn + µψn) converges uniformly to λf + µg .Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 516 / 624
Integral Calculus in One Variable Notions of Integration
Properties of the Regulated IntegralWe also note that if a < c < b, then for f ∈ L∞([a, b]),∫ b
af =
∫ c
af +
∫ b
cf . (4.1.10)
This can easily be seen by including the point c in any partition of stepfunctions approximating f .The set of regulated functions is closed and the regulated integral iscontinuous on this set. More precisely, we have the following result:4.1.20. Theorem. Let (fn) be a sequence of regulated functions such thatfn → f uniformly for some f ∈ L∞, i.e., ∥f − fn∥∞ → 0. Then f isregulated and ∫ b
afn
n→∞−−−→∫ b
af . (4.1.11)
The proof is left to the assignments.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 517 / 624
Integral Calculus in One Variable Notions of Integration
Non-Regulated FunctionsHowever, not all bounded functions are regulated:4.1.21. Example. Consider the function
f : [0, 1]→ R, f (x) =
1 if x = 1/n for some n ∈ N \ 0,0 otherwise.
Now any step function φ must be constant (say, equal to c ∈ R) in theinterval (0, δ) for some δ > 0. This interval will always contain both pointswhere x = 1/n and points where x = 1/n, no matter how small δ ischosen. Then
supx∈[0,1]
|f (x)− φ(x)| ≥ supx∈(0,δ)
|f (x)− c | ≥ max|1− c |, |c |
≥ 1
2
so that (4.1.6) can not be satisfied. Hence, f is not regulated.For technical reasons, however, it is desirable to extend the notion ofintegral to cover even non-regulated functions such as the one above.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 518 / 624
Integral Calculus in One Variable Notions of Integration
Extending the Regulated IntegralLet us attempt another approach at defining the area under a curve: theregulated integral was based on approximating the function f uniformlythrough step functions and then defining the area under f to be the limitof the areas of step functions.
In contrast, we will now try to ap-proximate the area under f , with-out attempting to approximate fat all.We will do this by consideringstep functions that are larger thanf (i.e., φ(x) ≥ f (x) for all x)and step functions that are smallerthan f (ψ(x) ≤ f (x) for all x), asillustrated at right.
a bx
y
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 519 / 624
Integral Calculus in One Variable Notions of Integration
Properties of the IntegralIf the area between “upper” and “’lower” step functions can be made assmall as desired, we will have successfully approximated the area of fwithout necessarily needing to approximate f . We will now formalize theseideas.We have proven (or can easily check) that the regulated integral has thefollowing properties:
(i) The integral is linear, i.e.,∫(λf + µg) = λ
∫f + µ
∫g for λ,µ ∈ R;
(ii) The integral is positive, i.e., if f > 0 on [a, b], then∫f > 0;
(iii) The integral of a constant function f = c , c ∈ R, on [a, b] is given by∫f = c · (b − a);
(iv) The integral does not depend on the values of a function on finitesets. If f (x) = 0 for all but a finite set of x ∈ [a, b], then
∫f = 0.
Suppose there is a class of bounded functions on [a, b] that includes theregulated functions and all those functions for which it is possible to definean integral with the properties (i) - (iv) above.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 520 / 624
Integral Calculus in One Variable Notions of Integration
Upper and Lower Step FunctionsLet f ∈ L∞([a, b]) be a function for which an integral with properties (i) -(iv) exists. Since f is bounded, there exists a step function u such thatu ≥ f on [a, b]. Thus ∫ b
af ≤
∫ b
au
by properties (i) and (ii) (since u − f > 0 and∫ ba (u − f ) =
∫ ba u −
∫ ba f ).
Denote by Uf the set of all such “upper” step functions. Then∫ b
af ≤ inf
u∈Uf
∫ b
au.
This infimum exists because the step functions are bounded below by thelower bound for f , and properties (ii) and (iii) guarantee the existence of alower bound for the integral. We define the upper area of f as
I (f ) := infu∈Uf
∫ b
au.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 521 / 624
Integral Calculus in One Variable Notions of Integration
Upper and Lower Step FunctionsSimilarly, if Lf denotes the set of all step functions v such that v ≤ f , then∫ b
af ≥ sup
v∈Lf
∫ b
av =: I (f ).
Thus, if it is possible to define an integral of the bounded function f , then
I (f ) ≤∫ b
af ≤ I (f ).
Even if it is not possible to define an integral of f , we may still define Ufand Lf and it remains true that
I (f ) ≤ I (f ).
Clearly, for an arbitrary bounded f the supremum and infimum may differ;but if they are equal, then their common value must also be the value of∫ ba f . We can use this to define
∫ ba f by this common value, if it exists.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 522 / 624
Integral Calculus in One Variable Notions of Integration
The Darboux Integral4.1.22. Definition. Let [a, b] ⊂ R be a closed interval and f a bounded realfunction on [a, b]. Let Uf denote the set of all step functions u on [a, b]such that u ≥ f and Lf the set of all step functions v on [a, b] such thatv ≤ f . The function f is then said to be Darboux-integrable if
I (f ) = supv∈Lf
∫ b
av = inf
u∈Uf
∫ b
au = I (f ).
In this case, the Darboux integral of f ,∫ ba f , is defined to be this common
value.
4.1.23. Remark. A bounded function f ∈ L∞([a, b]) is Darboux-integrableif and only if for every ε > 0 there exist step functions uε and vε such thatvε ≤ f ≤ uε and ∫ b
auε −
∫ b
avε ≤ ε.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 523 / 624
Integral Calculus in One Variable Notions of Integration
Example for the Darboux Integral of a Function4.1.24. Example. Consider the function f : [0, 1]→ R, f (x) = x2. For anyn ∈ N, the step function defined by
un(x) =k2
n2for k − 1
n< x ≤ k
n, k = 1, ... , n,
and un(0) = 0 is an upper step function for f , while the step function
vn(x) =(k − 1)2
n2for k − 1
n< x ≤ k
n, k = 1, ... , n,
and un(0) = 0 is a lower step function for f . Then∫ 1
0un =
n∑k=1
1
n· k
2
n2=
1
n3
n∑k=1
k2 =n(n + 1)(2n + 1)
6n3,
∫ 1
0vn =
n∑k=1
1
n· (k − 1)2
n2=
1
n3
n−1∑k=1
k2 =n(n − 1)(2n − 1)
6n3.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 524 / 624
Integral Calculus in One Variable Notions of Integration
Example for the Darboux Integral of a FunctionNow
infu∈Uf
∫ 1
0u ≤ lim
n→∞
∫ 1
0un =
1
3,
supv∈Lf
∫ 1
0v ≥ lim
n→∞
∫ 1
0vn =
1
3.
Since infu∈Uf
∫ 10 u ≥ sup
v∈Lf
∫ 10 v , this implies that
infu∈Uf
∫ 1
0u = sup
v∈Lf
∫ 1
0v =
1
3.
By the definition of the Darboux integral,∫ 1
0f = 1/3.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 525 / 624
Integral Calculus in One Variable Notions of Integration
The Darboux Integral of a Non-Regulated Function4.1.25. Example. The function
f : [0, 1]→ R, f (x) =
1 if x = 1/n for some n ∈ N \ 0,0 otherwise.
of Example 4.1.21 is not regulated, but it is Darboux-integrable. To seethis, note that v0(x) = 0 defines a lower step function v0 ∈ Lf , while forany n,
un(x) =
1 0 ≤ x ≤ 1/n,
f (x) otherwisedefines an upper step function un ∈ Uf . As in the previous example,
infu∈Uf
∫ 1
0u ≤ lim
n→∞
∫ 1
0un =
1
n= 0
and supv∈Lf
∫ 10 v ≥
∫ 10 v0 = 0, yielding the common value
∫ 10 f = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 526 / 624
Integral Calculus in One Variable Notions of Integration
A regulated function is Darboux-integrable4.1.26. Theorem. If f ∈ Reg([a, b]), then f is Darboux-integrable and theDarboux integral coincides with the regulated integral.
Proof.Let f ∈ Reg([a, b]). We will show that for every ε > 0 there exist upperand lower step functions uε and vε such that
∫ ba uε −
∫ ba vε ≤ ε.
Fix ε > 0. Since f is regulated, there exists a step function φ such thatsup
x∈[a,b]|f (x)− φ(x)| < ε/[2(b − a)]. Then
uε := φ+ε
2(b − a), vε := φ− ε
2(b − a).
are upper and lower step functions for f and∫ b
auε −
∫ b
avε ≤ (b − a) sup
x∈[a,b]|uε(x)− vε(x)| = ε.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 527 / 624
Integral Calculus in One Variable Notions of Integration
A regulated function is Darboux-integrable
Proof (continued).This shows that f is Darboux integrable.To show that the Darboux integral is equal to the regulated integral, wewill show that the limit of the integrals of step functions converginguniformly to f is just the Darboux integral. Let (φn) be a sequence of stepfunctions such that
supx∈[a,b]
|f (x)− φn(x)| ≤1
2n.
Then
un := φn +1
n, vn := φn −
1
n.
define upper and lower step functions for f .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 528 / 624
Integral Calculus in One Variable Notions of Integration
A regulated function is Darboux-integrableProof (continued).The Darboux integral satisfies∫ b
avn ≤
∫ b
af ≤
∫ b
aun.
Since
limn→∞
∫ b
avn = lim
n→∞
∫ b
aφn − lim
n→∞
b − a
n= lim
n→∞
∫ b
aφn,
limn→∞
∫ b
aun = lim
n→∞
∫ b
aφn + lim
n→∞
b − a
n= lim
n→∞
∫ b
aφn.
it follows that ∫ b
af = lim
n→∞
∫ b
aφn.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 529 / 624
Integral Calculus in One Variable Notions of Integration
A regulated function is Darboux-integrable
Hence, the Darboux integral is an extension of the regulated integral forfunctions f : [a, b]→ R. We can easily verify that the Darboux integral isbounded and additive:4.1.27. Proposition. The Darboux integral of Riemann-integrable functionsf and g on [a, b] satisfies ∣∣∣∣∫ b
af
∣∣∣∣ ≤ (b − a) supx∈[a,b]
|f (x)|,
λ
∫ b
af + µ
∫ b
ag =
∫ b
a(λf + µg).
The proof is left as an exercise.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 530 / 624
Integral Calculus in One Variable Notions of Integration
Tagged Partitions and Riemann SumsThere is another approach to the integral that is most often seen inundergraduate textbooks.A tagged partition (P , Ξ) on an interval [a, b] consists of a partitionP = x0, ... , xn together with numbers Ξ = ξ1, ... , ξn such that eachξk ∈ [xk−1, xk ]. The mesh size of P is defined as
m(P) := maxk=1,...,n
(xk − xk−1)
A step function φ ∈ Step([a, b]) for a function f ∈ L∞([a, b]) with respectto (P, Ξ) is given by
φ(x) = f (ξk) for xk−1 < x < xk , k = 1, ... , n,where as usual φ(xk) can be defined in an arbitrary manner. The sum∫ b
aφ :=
n∑k=1
f (ξk)(xk − xk−1) (4.1.12)
is then called a Riemann sum for f .Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 531 / 624
Integral Calculus in One Variable Notions of Integration
Riemann SumsThe Riemann sum will hopefully approach the area under the graph of f asthe partitions become finer. Below, we have taken partitions of equal sizeand tags lying in the midpoints of the subintervals, but the subintervallengths and tag locations should not influence the limit:
a bx
y
Given a function f , the choice offiner and finer partitions gives Rie-mann sums of more and more val-ues of f , weighted by the meshsize.In the limit, one may regard a se-quence of such Riemann sums asapproaching the “sum of all valuesof f in the interval [a, b].”
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 532 / 624
Integral Calculus in One Variable Notions of Integration
The Riemann Integral4.1.28. Definition. Let [a, b] ⊂ R be a closed interval and f a boundedreal function on [a, b]. Then f is Riemann-integrable with integral∫ b
af ∈ R
if for every ε > 0 there exists a δ > 0 such that for any tagged partition(P , Ξ) on [a, b] with mesh size m(P) < δ∣∣∣∣ n∑
k=1
f (ξk)(xk − xk−1)−∫ b
af
∣∣∣∣ < ε.
Definition 4.1.28 is quite difficult to work with in practice, so it isfortunate that the following result holds:4.1.29. Theorem. A bounded real function is Riemann-integrable on [a, b]if and only if it is Darboux-integrable. Moreover, the values of theintegrals coincide.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 533 / 624
Integral Calculus in One Variable Notions of Integration
Equivalence of the Darboux and Riemann Integrals
Proof.Suppose that f is Riemann integrable. Fix ε > 0. Then there exists apartition P = (x0, ... , xn) such that∣∣∣∣ n∑
k=1
f (ξk)(xk − xk−1)−∫ b
af
∣∣∣∣ < ε
4.
for any choice of ξk ∈ [xk−1, xk ]. On this partition P we can define upperand lower step functions for f by
u(x) := supξ∈[xk−1,xk ]
f (ξ), v(x) := infξ∈[xk−1,xk ]
f (ξ)
for x ∈ (xk−1, xk). By choosing ξk appropriately, we can ensure that thecorresponding Riemann sum differs from the integral of u by less than ε/4.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 534 / 624
Integral Calculus in One Variable Notions of Integration
Equivalence of the Darboux and Riemann Integrals
Proof (continued).This ensures that∣∣∣∣∫ b
au −
∫ b
af
∣∣∣∣ ≤ ∣∣∣∣∫ b
au −
n∑k=1
f (ξk)(xk − xk−1)
∣∣∣∣+
∣∣∣∣ n∑k=1
f (ξk)(xk − xk−1)−∫ b
af
∣∣∣∣<ε
2.
Applying the same argument for the lower step function, we find∣∣∣∣∫ b
au −
∫ b
av
∣∣∣∣ < ∣∣∣∣∫ b
au −
∫ b
af
∣∣∣∣+ ∣∣∣∣∫ b
av −
∫ b
af
∣∣∣∣ < ε. (4.1.13)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 535 / 624
Integral Calculus in One Variable Notions of Integration
Equivalence of the Darboux and Riemann Integrals
Proof (continued).Since such step functions u and v can be found for any ε > 0, this showsthat f is Darboux-integrable (see Remark 4.1.23). Furthermore, since forany choice of partition∫ b
av ≤
n∑k=1
f (ξk)(xk − xk−1) ≤∫ b
au
we see that the Darboux integral equals the Riemann integral.The proof that a Darboux-integrable function is Riemann-integrable is leftas an exercise.
4.1.30. Remark. The Riemann integral emphasizes the interpretation of theintegral as a sum or series over a “continuous parameter range”.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 536 / 624
Integral Calculus in One Variable Notions of Integration
Insufficiency of the Darboux/Riemann IntegralFrom the definition of a step function, we can immediately see that theregulated integral of a function f ∈ L∞([a, b]) such that f (x) = 0 at allexcept a finite number of points in [a, b] vanishes. For example, thefunction f (x) = 0 for x = 0, f (0) = 1 is integrable on [0, 1] and theregulated integral vanishes.On the other hand, the function
f : [0, 1]→ R, f (x) =
1 if x = 1/n for some n ∈ N \ 0,0 otherwise.
(4.1.14)
does not have a regulated integral. Since f vanishes “at almost everypoint of the interval”, we would like the integral of f to be defined and tobe zero, but the notion of the regulated integral is not sufficient for this.However, we were able to extend the concept of an integral to the Darbouxintegral and as we have seen, the Darboux integral of f is indeed zero.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 537 / 624
Integral Calculus in One Variable Notions of Integration
Insufficiency of the Darboux/Riemann IntegralHowever, there are still functions that seem to vanish “always everywhere”but whose Darboux integral is not defined. The basic example is theDirichlet function, which we have already mentioned earlier.4.1.31. Example. The Dirichlet function
χ : [0, 1]→ R, χ(x) =
1 for x rational,0 for x irrational.
(4.1.15)
is not Darboux-integrable.The reason for this is simple: any lower step function v must satisfyv(x) ≤ 0 while an upper step function must satisfy u(x) ≥ 1 for all xexcept a finite number of exceptions (at the “jumps” between the steps).Hence,
infu∈Uχ
∫ 1
0u ≥ 1 > 0 ≥ sup
v∈Lχ
∫ 1
0v
and the two bounds can not have a common value.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 538 / 624
Integral Calculus in One Variable Notions of Integration
Extensions of the Darboux/Riemann IntegralThere exist extensions of the Darboux and Riemann integrals that allow usto define an integral even of functions such as χ. While we lack the timeto go into details on these, two deserve to be mentioned:
(i) The Lebesgue integral (dating from around 1917) is the mostcommonly seen extension. It may be regarded as a modification ofthe Darboux integral in which step functions are replaced withso-called simple functions that are basically step functions with an“infinite number” of steps.
(ii) The gauge integral, also called the Henstock-Kurzweil integral, is afairly recent formulation (developed in the 1980s) that modifies theRiemann integral to require the mesh size of a tagged partition todepend on the specific tag within each subinterval. Hence, inDefinition 4.1.28 m(P) < δ is replaced by the conditionxk+1 − xk < δ(ξk) for a gauge function δ : [a, b]→ (0,∞).It turns out that the Henstock-Kurzweil integral is more general thanthe Lebesgue integral.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 539 / 624
Integral Calculus in One Variable Notions of Integration
Conclusion
In fact, there is no significant benefit in the introduction of the Darbouxand Riemann integrals since:
All functions we will encounter in practice are at least piecewisecontinuous and therefore regulated.
The Darboux integral is not general enough to treat a very wide classof functions that includes, e.g., the Dirichlet function. If it werenecessary for us to treat such functions, we would need to develop aneven more general concept.
The Darboux and Riemann integrals were mainly introduced to illustratesome basic ideas on how to implement the seemingly simple question ofdefining the “area under a function”. In practice, it is entirely sufficient toconsider all integrals as referring to the regulated integral.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 540 / 624
Integral Calculus in One Variable Practical Integration
Notions of Integration
Practical Integration
Applications of Integration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 541 / 624
Integral Calculus in One Variable Practical Integration
The Fundamental Theorem of CalculusThe derivation in the previous chapter can also be applied tocomplex-valued functions with no change in the arguments. We henceforthassume that the regulated integral has been established for functionsf : [a, b]→ C.4.2.1. Theorem. Let f : [a, b]→ C be continuous and set
F : [a, b]→ C, F (x) :=
∫ x
af .
Then F is differentiable on (a, b) andF ′(x) = f (x), x ∈ (a, b).
While this theorem can be proved for general regulated functions, we arecontent with the (relatively simple) proof for continuous functions. It maybe immediately applied to piecewise continuous functions by using(4.1.10). In practice, we will not often come across integrals of functionsthat are not at least piecewise continuous.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 542 / 624
Integral Calculus in One Variable Practical Integration
The Fundamental Theorem of CalculusProof.Let h be sufficiently small. Then for x , x + h ∈ (a, b) we have
F (x + h) =
∫ x+h
af =
∫ x
af (t) dt +
∫ x+h
xf (t) dt = F (x) +
∫ x+h
xf (t) dt
= F (x) + hf (x) +
∫ x+h
x
(f (t)− f (x)
)dt.
Now
limh→0
1
|h|
∣∣∣∫ x+h
x
(f (t)− f (x)
)dt∣∣∣ ≤ lim
h→0
|h||h|
supx≤t≤x+h
|f (t)− f (x)|
= limh→0
supx≤t≤x+h
|f (t)− f (x)| = 0
Hence the integral is o(h) as h→ 0 and F ′(x) = f (x).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 543 / 624
Integral Calculus in One Variable Practical Integration
The Fundamental Theorem of Calculus
4.2.2. Definition. Let Ω ⊂ R be an open set and f : Ω→ C. We say thatF : Ω→ C is a primitive, indefinite integral or anti-derivative of f ifF ′ = f . We write
F =
∫f =
∫f (t) dt.
4.2.3. Lemma. Let f ∈ C ([a, b],C) and F be a primitive of f . Then∫ b
af = F (b)− F (a).
This fundamental result establishes that integration can be regarded as asort of “inverse differentiation.”
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 544 / 624
Integral Calculus in One Variable Practical Integration
The Fundamental Theorem of Calculus
Proof.We note that by Theorem 4.2.1
d
dx
(∫ x
af − F (x)
)= f (x)− f (x) = 0,
so there exists a c ∈ C such that∫ x
af − F (x) = c
Letting x → a, we see that c = −F (a). This implies∫ x
af = F (x)− F (a).
Letting x → b we obtain the result.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 545 / 624
Integral Calculus in One Variable Practical Integration
Practical IntegrationWe can hence immediately calculate various integrals by guessing theirprimitives. For example, for α = 1,∫ b
axα dx =
1
α+ 1xα+1
∣∣∣x=b− 1
α+ 1xα+1
∣∣∣x=a
=:1
α+ 1xα+1
∣∣∣ba.
For short, we often write the right-hand side as∫ b
axα dx =
1
α+ 1xα+1
∣∣∣ba=[ 1
α+ 1xα+1
]ba.
Similarly, we “see” that∫eαx dx =
1
αeαx , α = 0,∫
1
xdx =
∫dx
x= ln|x |, x = 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 546 / 624
Integral Calculus in One Variable Practical Integration
Substitution RuleHowever, not all integrals are obvious. For example,∫
ln x dx = x ln x − x ,
∫dx√1 + x2
= Arsinh x .
Since we integrate by “inverse differentiation,” we should expect some helpfrom the theorems that govern differentiation of products andcompositions of functions.The following theorem is a direct result of the chain rule of differentiation.4.2.4. Substitution Rule. Let f ∈ Reg([α,β]) and g : [a, b]→ [α,β]continuously differentiable. Then∫ b
a(f g)(x)g ′(x) dx =
∫ g(b)
g(a)f (y) dy .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 547 / 624
Integral Calculus in One Variable Practical Integration
Substitution RuleProof.Let F be a primitive of f . Then (F g)′ = (f g)g ′ and∫ b
a(f g)(x)g ′(x) dx =
∫ b
a(F g)′ dx
= (F g)(x)∣∣ba= F (x)
∣∣g(b)g(a)
=
∫ g(b)
g(a)f (y) dy .
4.2.5. Remark. Using the same proof, we see that the substitution rule canbe applied to indefinite integrals (primitives):∫
(f g)(x)g ′(x) dx =
∫f (y) dy
∣∣∣y=g(x)
.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 548 / 624
Integral Calculus in One Variable Practical Integration
Substitution Rule
4.2.6. Example. We want to calculate∫
dx
λ2 + x2for λ = 0. We write
∫dx
λ2 + x2=
∫1
λ2dx
1 + (x/λ)2.
We now set f (x) = 1/(1 + x2), g(x) = y = x/λ. Then g ′(x) = 1/λ, sowe have∫
dx
λ2 + x2=
1
λ
∫1
λ
dx
1 + (x/λ)2
=1
λ
∫g ′(x)f (g(x)) dx =
1
λ
∫f (y) dy
∣∣∣y=g(x)
=1
λ
∫dy
1 + y2
∣∣∣y=x/lambda
=1
λarctan y
∣∣∣y=x/λ
=1
λarctan
x
λ.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 549 / 624
Integral Calculus in One Variable Practical Integration
Substitution Rule
The following “short cut” using Leibniz notation works very well with thesubstitution rule: Set y = g(x), so g ′(x) = dy
dx . Then we substitute y forg(x) and dy for g ′(x) dx .
4.2.7. Example. We calculate∫
xe−x2 dx . We set y = −x2. Then
dy
dx= −2x ⇔ dy = −2x dx ⇔ −1
2dy = x dx .
Thus ∫xe−x2 dx = −1
2
∫ey dy
∣∣∣y=−x2
= −1
2ey |y=−x2 = −
1
2e−x2 .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 550 / 624
Integral Calculus in One Variable Practical Integration
Integration by PartsAnalogously to the previous result, we obtain a technique for integrationbased on the product rule of differentiation.4.2.8. Theorem. Let f , g ∈ C 1([a, b],C). Then∫ b
af ′(x)g(x) dx = f (x)g(x)
∣∣∣ba−∫ b
af (x)g ′(x) dx .
Proof.This follows from the fact that fg is a primitive of f ′g + g ′f .4.2.9. Example.∫
ln x dx =
∫1︸︷︷︸
f ′(x)
· ln x︸︷︷︸g(x)
dx = x ln x︸ ︷︷ ︸f (x)g(x)
−∫
x︸︷︷︸f (x)
· 1
x︸︷︷︸g ′(x)
dx
= x ln x − x
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 551 / 624
Integral Calculus in One Variable Practical Integration
The Area of a DiscWe now calculate the area of a disc of radius R. In fact, we calculate thearea of a quarter-disc by integrating the function f given byf (x) =
√R2 − x2 from 0 to R:
Rx
R
y
y = f HxL = R2- x2
t
R
Substituting x = R sin t,dx = R cos t dt, we have∫ R
0
√R2 − x2 dx
=
∫ π/2
0
√R2 − R2 sin2 t R cos t dt
= R2
∫ π/2
0cos2 t dt
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 552 / 624
Integral Calculus in One Variable Practical Integration
The Area of a DiscWe are now left with calculating
∫ π/2
0cos2 t dt.
Using integration by parts:∫ π/2
0cos2 t dt = sin t cos t
∣∣∣π/20
+
∫ π/2
0sin2 t dt
= 0 +
∫ π/2
0(1− cos2 t) dt = π/2−
∫ π/2
0cos2 t dt
⇒∫ π/2
0cos2 t dt =
π
4
or using trigonometry and substitution:∫ π/2
0cos2 t dt =
1
2
∫ π/2
0(cos(2t) + 1) dt =
π
4+
1
2
∫ π/2
0cos(2t) dt
=π
4+
1
4
∫ π
0cos(s) ds =
π
4+
1
4sin s
∣∣∣π0=π
4
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 553 / 624
Integral Calculus in One Variable Practical Integration
The Area of a Disc
Hence the area of a disc of radius R is 4 · π4R2 = πR2, as we know from
elementary geometry.Integration is an art, not a skill like differentiation. There are many more“tricks” and “techniques” for calculating integrals, some of which you willencounter in the assignments and recitation classes.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 554 / 624
Integral Calculus in One Variable Practical Integration
Improper IntegralsAn integral ∫ b
af (t) dt
is called “improper” if the domain of integration is unbounded i.e., a = −∞ or b =∞
and/or the integrand f is unbounded on (a, b) or otherwise not regulated.
4.2.10. Definition. Assume that b ≤ ∞ and that f : [a, b)→ C is regulatedon any closed subinterval [a, x ], x < b. Then∫ b
af (t) dt
is called an improper integral and is said to converge or exist if
limxb
∫ x
af (t) dt =: I
exists.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 555 / 624
Integral Calculus in One Variable Practical Integration
Improper Integrals4.2.10. Definition. (continued). The number I ∈ C is then called the valueof the improper integral and we write
I =
∫ b
af (t) dt.
Similarly, we define ∫ b
af (t) dt = lim
xa
∫ b
xf (t) dt
(if the limit exists) in the case where b <∞. In general, we define theintegral∫ b
af (t) dt :=
∫ c
af (t) dt +
∫ b
cf (t) dt, −∞ ≤ a < b ≤ ∞
for any c ∈ R such that both limits on the right exist.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 556 / 624
Integral Calculus in One Variable Practical Integration
Improper Integrals
4.2.11. Example. Consider the integral∫ ∞
1
dt
tα. Then for α = 1
∫ x
1
dt
tα=
1
1− α(x1−α − 1)
x→∞−−−→
1
α−1 if α > 1
∞ if α < 1
Thus the integral converges for α > 1 and diverges for α < 1. We also seethat in the case α = 1,∫ x
1
dt
t= ln x − ln 1
x→∞−−−→∞,
so the integral diverges for α = 1.
In the same way we can show that∫ 1
0
dt
tαconverges if α < 1 and diverges
if α ≥ 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 557 / 624
Integral Calculus in One Variable Practical Integration
Cauchy Property of FunctionsWe would like to find a criterion for the convergence of improper integrals.As a preliminary step, we establish the following convergence criterion.4.2.12. Theorem. Let I ⊂ R be an interval, a ∈ I ⊂ R ∪ −∞,∞ andF : I → C. Then the following statements are equivalent:
a) The limit limx→a
F (x) exists.b) The function F satisfies the Cauchy property,
∀ε>0∃
δ>0∀
x ,y∈I: |x − a| < δ ∧ |y − a| < δ ⇒ |F (x)− F (y)| < ε.
(4.2.1)
when a ∈ R, or
∀ε>0
∃R>0
∀x ,y∈I
: ± x ,±y > R ⇒ |F (x)− F (y)| < ε. (4.2.2)
when a = ±∞.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 558 / 624
Integral Calculus in One Variable Practical Integration
Cauchy Property of Functions
Proof.We will only consider the case a ∈ R. The case a = ±∞ is left to you.
a)⇒ b) Fix ε > 0. If the limit exists (call it L) we can find δ > 0such that for all x , y ∈ I
|x − a| < δ ⇒ |F (x)− L| < ε/2.
Then |F (x)− F (y)| ≤ |F (x)− L|+ |F (y)− L| implies b). b)⇒ a) We suppose that (4.2.1) holds and show that lim
x→aF (x) exists
by considering sequences (xn) with xn → a.Let (xn) be given and suppose that xn → a. Then b) implies that(F (xn)) is a complex Cauchy sequence (why? prove this!) and thissequence converges because C is complete.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 559 / 624
Integral Calculus in One Variable Practical Integration
Cauchy Property of Functions
Proof (continued).
Next, suppose that (xn) and (yn) are two such sequences. Define
(zn) = (x1, y1, x2, y2, ...)
Then by the same argument as for (xn), the limit of F (zn) exists.Since the limit is the only accumulation point of (F (zn)),
L := limn→∞
F (zn) = limn→∞
F (xn) = limn→∞
F (yn)
so limx→a
F (x) exists.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 560 / 624
Integral Calculus in One Variable Practical Integration
Cauchy Criterion for FunctionsThe main application of the Cauchy property is in the study of improperintegrals. For example, we have the following corollary of Theorem 4.2.12.4.2.13. Cauchy Criterion. Let a ∈ R and f : [a,∞)→ R be integrable onevery interval [a, x ], x ∈ R. The improper integral∫ ∞
af (x) dx
converges if and only if
∀ε>0
∃R>0
∀x ,y>R
∣∣∣∣∫ y
xf (t) dt
∣∣∣∣ < ε.
Proof.Apply Theorem 4.2.12 to F : [a,∞)→ R, F (x) =
∫ xa f (t) dt, noting that
F (y)− F (x) =
∫ y
af (t) dt −
∫ x
af (t) dt =
∫ y
xf (t) dt.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 561 / 624
Integral Calculus in One Variable Practical Integration
The Dirichlet Integral4.2.14. Example. The Dirichlet integral∫ ∞
0
sin x
xdx (4.2.3)
is convergent.
Π 3 Π 5 Π 7 Π 9 Π
x
1
y
y
sinHxL
x
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 562 / 624
Integral Calculus in One Variable Practical Integration
The Dirichlet IntegralTo prove that (4.2.3) converges, we consider
I (x , y) :=
∫ y
x
sin t
tdt, x < y , (4.2.4)
which we can rewrite as
I (x , y) =
∫ x+π
x
sin t
tdt −
∫ y+π
y
sin t
tdt +
∫ y+π
x+π
sin τ
τdτ .
Substituting t = τ − π in the last integral, we obtain
I (x , y) =
∫ x+π
x
sin t
tdt −
∫ y+π
y
sin t
tdt −
∫ y
x
sin t
t + πdt. (4.2.5)
Adding (4.2.4) to (4.2.5), we obtain
2I (x , y) =
∫ x+π
x
sin t
tdt −
∫ y+π
y
sin t
tdt + π
∫ y
x
sin t
t(t + π)dτ . (4.2.6)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 563 / 624
Integral Calculus in One Variable Practical Integration
The Dirichlet IntegralSince |sin(t)/t| ≤ 1/t and |sin(t)/(t(t + π))| ≤ 1/t2 we obtain from(4.2.6) that
2|I (x , y)| ≤ 2π
x+ π
∫ y
x
dt
t2<
2π
x+ π
∫ ∞
x
dt
t2=
2π
x+π
x.
Fix ε > 0. Then choose R > 3π/(2ε). Then for all x , y > R we have∣∣∣∣∫ y
x
sin t
tdt
∣∣∣∣ = |I (x , y)| < ε.
By the Cauchy Criterion 4.2.13, the integral (4.2.3) converges.4.2.15. Remark. The sine cardinal, sinc : R→ R,
sinc(x) :=
sin xx x = 0,
1 x = 0,
plays an important role in signal processing. We will later be able to provethat the value of the integral (4.2.3) is π/2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 564 / 624
Integral Calculus in One Variable Practical Integration
Comparison Test for Improper IntegralsAnother important consequence of Theorem 4.2.12 is the following:4.2.16. Comparison Test. Let I ⊂ R and f : I → C, g : I → [0,∞).Suppose that |f (t)| ≤ g(t) for t ∈ I and
∫I g(t) dt converges. Then∫
I f (t) dt also converges.
Proof.We treat the case I = [a, b] only. Let F (x) =
∫ xa f , G (x) =
∫ xa g for
a < x < b. Then
|F (x)− F (y)| =∣∣∣∫ x
af −
∫ y
af∣∣∣ = ∣∣∣∫ x
yf∣∣∣
≤∫ x
y|f | ≤
∫ x
yg = G (x)− G (y). (4.2.7)
Since limxb
G (x) exists, G satisfies the Cauchy criterion (4.2.1). From theestimate (4.2.7), F also satisfies (4.2.1), so lim
xbF (x) exists.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 565 / 624
Integral Calculus in One Variable Practical Integration
Absolute and Conditional Convergence
4.2.17. Definition. Let I ⊂ R and f : I → C. We say that the improperintegral
∫I f (t) dt converges absolutely if the improper integral
∫I |f (t)| dt
converges.If∫I f (t) dt converges but is not absolutely convergent, we say that the
integral is conditionally convergent.
4.2.18. Remark. An immediate consequence of the Comparison Test 4.2.16is that every absolutely convergent integral is also convergent.
4.2.19. Example. The Dirichlet integral (4.2.3) is conditionally convergent,i.e., ∫ ∞
0
|sin x |x
dx
diverges.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 566 / 624
Integral Calculus in One Variable Practical Integration
Absolute and Conditional ConvergenceFor any R > 0 ∫ R
0
|sin x |x
dx ≥∫ nRπ
0
|sin x |x
dx ,
where nR ∈ N is the largest integer such that nRπ ≤ R, i.e., nR = ⌊R/π⌋.We write ∫ nRπ
0
|sin x |x
dx =
nR∑k=1
∫ kπ
(k−1)π
|sin x |x
dx
and use ∫ kπ
(k−1)π
|sin x |x
dx ≥ 1
kπ
∫ kπ
(k−1)π|sin x | dx =
2
kπ
to see that ∫ R
0
|sin x |x
dx ≥ 1
π
nR∑k=1
1
k.
Since the harmonic series diverges, it follows that limR→∞
∫ R0
|sin x |x dx =∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 567 / 624
Integral Calculus in One Variable Practical Integration
Euler Gamma FunctionThe Euler gamma function is defined through an improper integral,
Γ : R+ → R, Γ (t) : =
∫ ∞
0z t−1e−z dz , t > 0. (4.2.8)
The gamma function plays an important role in many parts ofmathematics, physics and engineering. For example, standard probabilisticdistributions that describe system failure are based on the gamma function(this will be discussed in detail in the course Ve401 Probabilistic Methodsin Engineering).We will first show that the integral (4.2.8) converges absolutely for everyt > 0. We write∫ ∞
0z t−1e−z dz =
∫ 1
0z t−1e−z dz +
∫ ∞
1z t−1e−z dz .
For z > 0 we have |z t−1e−z | < z t−1 and for t > 0 we know thatt − 1 > −1 and
∫ 10 z t−1 dz converges. Hence the first integral converges
absolutely.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 568 / 624
Integral Calculus in One Variable Practical Integration
Euler Gamma FunctionNext we consider ∫ ∞
1z t−1e−z dz . (4.2.9)
Recall that ex > xn/n! for x > 0 and any n ∈ N. Hence (setting x = z/2,n = ⌈t⌉ = mink ∈ Z : k > t)
z t−1e−z/2 =1
z1−tez/2<
2⌈t⌉⌈t⌉!z1−tz⌈t⌉
< C (t) for z > 1.
Hence z t−1e−z < C (t)e−z/2. Since∫ ∞
1e−z/2 dz = lim
x→∞
∫ x
1e−z/2 dz = lim
x→∞(2e−1/2 − 2e−x/2) =
2√e
converges, we see that (4.2.9) and hence Γ (t) converges absolutely.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 569 / 624
Integral Calculus in One Variable Practical Integration
Properties of the Gamma Function4.2.20. Lemma. For any t > 0 we have
Γ (t + 1) = tΓ (t).
Before we discuss the proof, we note that for improper integrals, theconcept of integration by parts generalizes to∫ b
af ′(x)g(x) dx = f (x)g(x)
∣∣ba−∫ b
af (x)g ′(x) dx . (4.2.10)
where all the integrals are improper and
f (x)g(x)∣∣ba:= lim
xbf (x)g(x)− lim
xaf (x)g(x).
All the above limits are of course assumed to exist for (4.2.10) to hold.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 570 / 624
Integral Calculus in One Variable Practical Integration
Properties of the Gamma Function
Proof of Lemma 4.2.20.We have
Γ (t + 1) =
∫ ∞
0z te−z dz = −e−zz t
∣∣∞0
+ t
∫ ∞
0z t−1e−z dz = tΓ (t)
Lemma 4.2.20 implies
n! = Γ (n + 1) for n ∈ N.
The Gamma function is hence a continuous extension of the factorialfunction to the positive real numbers. It is also the only continuousextension such that ln Γ is a convex function on R+. (You will checkthat Γ is even differentiable in the assignments.)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 571 / 624
Integral Calculus in One Variable Practical Integration
Euler Gamma FunctionThe Gamma function is, by its definition, only defined for t > 0. However,it turns out that one can find an extension of Γ (t) to values t > −1,t = 0, as follows: Set
F1(t) :=Γ (t + 1)
t.
Clearly, F1 defines a function on (−1,∞) \ 0. If t > 0, we haveΓ (t + 1) = tΓ (t), so
F1(t) =Γ (t + 1)
t=
tΓ (t)
t= Γ (t)
for t > 0. Hence F1 is an extension of Γ to values t > −1, t = 0. In asimilar manner, an extension F2 of F1 can be found that is defined on(−2,∞) \ −1, 0. This process can be continued indefinitely to achievean extension of the Gamma function to x ∈ R : x = −n, n ∈ N. Thisextension is also denoted Γ (t).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 572 / 624
Integral Calculus in One Variable Practical Integration
Euler Gamma FunctionBy defining the Gamma function for complex t ∈ C it can be proven thatthe above extension defines a continuous function for all t ∈ C except thenegative real integers, thereby justifying the definitions of F1, F2 etc.
-3 -2 -1 1 2 3 4 5x
-10
-5
5
10
15
y
y = G HxL = à0
¥
zx - 1 e-z âz
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 573 / 624
Integral Calculus in One Variable Applications of Integration
Notions of Integration
Practical Integration
Applications of Integration
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 574 / 624
Integral Calculus in One Variable Applications of Integration
Integrals and Series
From the preceding discussion of improper integrals, it seems that thereare obvious analogies between functions and sequences:
functions f ←→ sequences (an)
integral∫ ∞
1f ←→ series
∞∑n=1
an
Comparison Test 4.2.16 ←→ Comparison Test 3.5.13
and many more. But can we use these analogies in a practical manner tohelp solve problems?
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 575 / 624
Integral Calculus in One Variable Applications of Integration
Integral Test for Series
4.3.1. Integral Test. Let m ∈ N and f : [m,∞)→ [0,∞) be a decreasingfunction, i.e., f (x) ≥ f (y) for m ≤ x ≤ y , such that
∫ xm f (t) dt exists for
every x > m. Then∫ ∞
mf (x) dx <∞ ⇔
∞∑n=m
f (n) <∞. (4.3.1)
Proof.Define the functions
g : [m,∞)→ R, g(x) = f (n) for x ∈ [n − 1, n), n ∈ N, n ≥ m,
h : [m,∞)→ R, h(x) = f (n − 1) for x ∈ [n − 1, n), n ∈ N, n ≥ m.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 576 / 624
Integral Calculus in One Variable Applications of Integration
Integral Test for SeriesProof (continued).For any N ∈ N, the functions g |[m,N] and h|[m,N] are just step functionsand we find∫ N
mg =
N∑n=m+1
f (n),
∫ N
mh =
N∑n=m+1
f (n − 1) =N−1∑n=m
f (n)
Since f is decreasing, we have g(x) ≤ f (x) ≤ h(x) for all x ∈ [m,∞) andso
N∑n=m+1
f (n) =
∫ N
mg ≤
∫ N
mf ≤
∫ N
mh =
N−1∑n=m
f (n)
Letting N →∞, we obtain∞∑
n=m+1
f (n) ≤∫ ∞
mf ≤
∞∑n=m
f (n).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 577 / 624
Integral Calculus in One Variable Applications of Integration
Integral Test for Series4.3.2. Example. The series
∞∑n=1
ln(n)
nα
converges if and only if α > 1. Note that f : [1,∞)→ R, f (x) = x−α ln xis decreasing for xα > e:
f ′(x) = x−α−1(1− α ln x)
We apply the integral test for m > e1/α. Suppose that α = 1. Integratingby ports, ∫
ln x
xαdx =
ln x
(1− α)xα−1− 1
1− α
∫1
xαdx
= −1 + (α− 1) ln x
xα−1(α− 1)2
so it is easy to see that∫∞3
ln xxα dx converges if and only if α > 1. The
case α = 1 can be handled similarly.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 578 / 624
Integral Calculus in One Variable Applications of Integration
Sequences of FunctionsRecall from Theorem 4.1.20 that if (fn) is a sequence of functionsfn ∈ C ([a, b]) such that fn n→∞−−−→
∥ · ∥∞f then
∫ b
afn
n→∞−−−→∫ b
af .
We can apply this to give the following practical result:4.3.3. Theorem. Let [a, b] ⊂ R and (fn) be a sequence of functions wherefn ∈ C 1([a, b]). Suppose that
f ′nn→∞−−−→∥ · ∥∞
g for some g ∈ C ([a, b])
and that for some x0 ∈ [a, b] the sequence (fn(x0)) converges. Then (fn)converges uniformly to a function f ∈ C 1([a, b]) such that f ′ = g .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 579 / 624
Integral Calculus in One Variable Applications of Integration
Sequences of Functions
Proof.For n ∈ N we have
fn(x) =
∫ x
af ′n + cn, cn = fn(a).
Let x = x0 as in the theorem. Then
cn = fn(x0)−∫ x0
af ′n
n→∞−−−→ limn→∞
fn(x0)−∫ x0
ag =: c .
Hence for any x ∈ [a, b] we have
fn(x)n→∞−−−→
∫ x
ag + c =: f (x),
so we have pointwise convergence to a function f with f ′ = g .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 580 / 624
Integral Calculus in One Variable Applications of Integration
Sequences of Functions
Proof (continued).We will show that the convergence is actually uniform:
∥fn − f ∥∞ ≤ supx∈[a,b]
∣∣∣∫ x
af ′n −
∫ x
ag∣∣∣+ |cn − c|
≤ supx∈[a,b]
|x − a| supt∈[a,x]
|f ′n(t)− g(t)|+ |cn − c |
= |b − a| · ∥f ′n − g∥∞ + |cn − c |.
Since cn → c and f ′n → g uniformly, we are finished.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 581 / 624
Integral Calculus in One Variable Applications of Integration
Series of FunctionsTheorem 4.3.3 can be expressed for series of functions in the following way:4.3.4. Corollary. Let [a, b] ⊂ R and (fn) be a sequence of functions,fn ∈ C 1([a, b]), n ∈ N, such that
∞∑n=0
fn =: f
converges pointwise on [a, b] and∞∑n=0
f ′n
converges uniformly on [a, b]. Then the converge to f is actually uniform,f is differentiable and
f ′ =∞∑n=0
f ′n.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 582 / 624
Integral Calculus in One Variable Applications of Integration
Differentiation and Integration of Power SeriesLet
f : (−ϱ, ϱ)→ R, f (x) =∞∑n=0
anxn
be the function defined by a power series with radius of convergence ϱ.Recall from Lemma 3.6.10 that a power series converges uniformly withinany closed and bounded set within its radius of convergence. Then
(i) f is continuous on (−ϱ, ϱ) by Theorem 3.4.3.(ii) f is differentiable by Theorem 3.6.14 and
f ′(x) =d
dx
∞∑n=0
anxn =
∞∑n=0
d
dx
(anx
n)
=∞∑n=1
nanxn−1, x ∈ (−ϱ, ϱ).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 583 / 624
Integral Calculus in One Variable Applications of Integration
Differentiation and Integration of Power Series(iii) f may be integrated componentwise by Corollary 4.3.4,∫ x
0f (y) dy =
∫ x
0
( ∞∑n=0
anyn)dy =
∞∑n=0
∫ x
0
(any
n)dy
=∞∑n=0
ann + 1
xn+1, x ∈ (−ϱ, ϱ).
In summary:Within the radius of convergence we can “interchange”differentiation and summation as well as integration andsummation of a power series.
The analogous statements hold for power series “centered at x0 ∈ R”,
f : (x0 − ϱ, x0 + ϱ)→ R, f (x) =∞∑n=0
an(x − x0)n. (4.3.2)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 584 / 624
Integral Calculus in One Variable Applications of Integration
Coefficients of Power SeriesFor the power series
f : (x0 − ϱ, x0 + ϱ)→ R, f (x) =∞∑n=0
an(x − x0)n.
we havea0 = f (x0).
Moreover, f ′(x) = 1 · a1 + 2a2(x − x0)2−1 + ... , so
a1 =f ′(x0)
1
and, in general,
an =f (n)(x0)
n!, n ∈ N. (4.3.3)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 585 / 624
Integral Calculus in One Variable Applications of Integration
Taylor’s TheoremHence, a power series
f : (x0 − ϱ, x0 + ϱ)→ R, f (x) =∞∑n=0
an(x − x0)n
has coefficients given by
an =f (n)(x0)
n!, n ∈ N.
We are now interested in the converse question: If I ⊂ R is an interval andf ∈ C∞(R) a smooth function, does the power series
∞∑n=0
an(x − x0)n, an =
f (n)(x0)
n!, n ∈ N.
exist for x0 ∈ I? Does it converge to f (x) for all x?Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 586 / 624
Integral Calculus in One Variable Applications of Integration
Taylor’s Theorem4.3.5. Taylor’s Theorem. Let I ⊂ R an open interval and f ∈ C k(I ). Letx ∈ I and y ∈ R such that x + y ∈ I . Then for all p ≤ k,
f (x + y) = f (x) +1
1!f ′(x)y + · · ·+ 1
(p − 1)!f (p−1)(x)yp−1 + Rp (4.3.4)
with the remainder term
Rp :=
∫ 1
0
(1− t)p−1
(p − 1)!f (p)(x + ty)yp dt.
Proof.We prove the theorem by induction in p. For p = 1, (4.3.4) becomes
f (x + y)− f (x) =
∫ 1
0f ′(x + ty) · y dt =
∫ x+y
xf ′(z) dz ,
which is just the fundamental theorem of calculus.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 587 / 624
Integral Calculus in One Variable Applications of Integration
Taylor’s TheoremProof (continued).In order to prove that (4.3.4) for p implies (4.3.4) for p + 1, we show
Rp =1
p!f (p)(x)yp + Rp+1.
Integrating by parts, we have
Rp =
∫ 1
0
(1− t)p−1
(p − 1)!f (p)(x + ty)yp dt
= − (1− t)p
p!f (p)(x + ty)yp
∣∣∣∣1t=0
+
∫ 1
0
(1− t)p
p!
d
dtf (p)(x + ty)yp dt
=1
p!f (p)(x)yp +
∫ 1
0
(1− t)p
p!f (p+1)(x + ty)yp+1 dt
=1
p!f (p)(x)yp + Rp+1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 588 / 624
Integral Calculus in One Variable Applications of Integration
Lagrange RemainderIf we set x = x0 and y = x − x0 in (4.3.4),
f (x) =
p−1∑n=0
f (n)(x0)
n!(x − x0)
n + Rp (4.3.5)
for f ∈ Cp(I ), with
Rp =
∫ 1
0
(1− t)p−1
(p − 1)!f (p)(tx + (1− t)x0) dt · (x − x0)
p
=
∫ x
x0
(x − τ)p−1
(p − 1)!f (p)(τ) dτ (4.3.6)
This is known as the Lagrange form of the remainder. The remainder termsatisfies the estimate
|Rp| ≤1
(p − 1)!sup
y∈[x0,x]|f (p)(y)| · |x − x0|p (4.3.7)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 589 / 624
Integral Calculus in One Variable Applications of Integration
Analytic Functions and Taylor SeriesWe will call
Tf ;p,x0(x) := f (x)− Rp+1 =
p∑n=0
f (n)(x0)
n!(x − x0)
n (4.3.8)
the Taylor polynomial of degree p of f at x0. The series
limp→∞
Tf ;p,x0(x) =∞∑n=0
f (n)(x0)
n!(x − x0)
n (4.3.9)
is called the Taylor series of f at x0.4.3.6. Definition. Let I ⊂ R be an open interval and f ∈ C∞(I ). We saythat f is real-analytic or just analytic at x0 ∈ I if there exists aneighborhood Bε(x0) = (x0 − ε, x0 + ε) ⊂ I such that
f (x) =∞∑n=0
f (n)(x0)
n!(x − x0)
n
for all x ∈ Bε(x0).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 590 / 624
Integral Calculus in One Variable Applications of Integration
Taylor Series4.3.7. Example. Let f : (−1,∞), f (x) = 1/(1 + x). Letting x0 = 0, wehave
f (x) =
p−1∑n=0
f (n)(0)
n!xn + Rp.
We can easily check thatf (n)(x) = (1 + x)−1−n · (−1)nn!,
so f (n)(0) = n!(−1)n and
f (x) =
p−1∑n=0
f (n)(0)
n!xn + Rp =
p−1∑n=0
(−1)nxn + Rp.
Thus the Taylor polynomial of degree p for f at x0 = 0 is given by
Tf ;p,0(x) =
p∑n=0
(−1)nxn
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 591 / 624
Integral Calculus in One Variable Applications of Integration
Taylor SeriesWe will show that f actually has a Taylor series at x0 = 0,
f (x) =∞∑n=0
(−1)nxn, for x ∈ (−1, 1). (4.3.10)
First, note that the power series in (4.3.10) has radius of convergenceϱ = 1, so the right-hand side is well-defined.We now prove that the Taylor series actually converges to f , i.e., for anyfixed x ∈ (−1, 1) Rp → 0 as p →∞. This is easy to show if x ≥ 0. Inthat case, we can use the estimate (4.3.7) to see that
|Rp| ≤1
(p − 1)!sup
y∈[x0,x]|f (p)(y)| · |x − x0|p
=1
(p − 1)!sup
y∈[0,x]|1 + y |−1−pp!|x |p = p|x |p p→∞−−−→ 0.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 592 / 624
Integral Calculus in One Variable Applications of Integration
Taylor SeriesHowever, if −1 < x < 0 we have to use a more careful estimate. In thatcase, the Lagrange remainder (4.3.6) gives
Rp =
∫ x
0
(x − τ)p−1
(p − 1)!f (p)(τ) dτ = −p
∫ −|x |
0
(|x |+ τ)p−1
(1 + τ)p+1dτ
= p
∫ |x |
0
(|x | − τ)p−1
(1− τ)p+1dτ
Then
|Rp| ≤ p supτ∈[0,|x |]
∣∣∣∣ |x | − τ1− τ
∣∣∣∣p−1 ∫ |x |
0
1
(1− τ)2dτ
= p|x |p−1
∣∣∣∣1− 1
1− |x |
∣∣∣∣ p→∞−−−→ 0.
Thus we have proven that for every x ∈ (−1, 1) the formal Taylor series off actually converges to f . The convergence is uniform in [−R,R] for everyR < 1 (by the general theory of power series).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 593 / 624
Integral Calculus in One Variable Applications of Integration
Taylor SeriesA 10-term approximation (green) to f (x) = 1/(1 + x) (blue):
-1 1
t
1
x
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 594 / 624
Integral Calculus in One Variable Applications of Integration
Taylor Series
4.3.8. Remark. The Taylor series for the previous example could have beenobtained in an easier way: using the geometric series expansion, we have
1
1 + x=
1
1− (−x)=
∞∑n=0
(−x)n
whenever |x | < 1. Since the Taylor series of a function f at x0 is unique(the coefficients are determined by (4.3.3)) it doesn’t matter how weobtain a series expansion - it is always the Taylor series.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 595 / 624
Integral Calculus in One Variable Applications of Integration
Taylor Series
4.3.9. Remark. In the previous example, the Taylor series converged onlyfor |x | < 1. This was not surprising, since the function has a singularity atx = −1. However, the function
f : R→ R, f (x) =1
1 + x2
has a series expansion at x0 = 0 given by
f (x) =1
1 + x2=
1
1− (−x2)=
∞∑n=0
(−1)nx2n, |x | < 1.
The function evidently has no poles in R, but nevertheless the power seriesonly has radius of convergence ϱ = 1. In this case, the poles that limit theradius of convergence are hidden in the complex plane, at x = ±i .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 596 / 624
Integral Calculus in One Variable Applications of Integration
Taylor SeriesWe can use the term-by-term integrability of power series to obtain seriesfor functions where this would otherwise be difficult:4.3.10. Example. Consider the function
f : R→ R, f (x) = arctan x .
To find the Taylor series (4.3.9) directly, we would need to calculatederivatives:
f ′(x) =1
1 + x2, f ′′(x) =
−2x(1 + x2)2
,
f ′′′(x) =−2
(1 + x2)2+
2 · (2x)2
(1 + x2)3
and it is obvious that obtaining higher-order derivatives becomes evermore complicated.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 597 / 624
Integral Calculus in One Variable Applications of Integration
Taylor SeriesHowever, we can calculate
arctan x =
∫ x
0
dy
1 + y2=
∫ x
0
∞∑n=0
(−1)ny2n dy =∞∑n=0
(−1)n∫ x
0y2n dy
=∞∑n=0
(−1)n
2n + 1x2n+1.
The radius of convergence of the power series is ϱ = 1, and the aboveequalities hold by Corollary 4.3.4 for all x ∈ (−1, 1). So far, so good.However, the situation at x = 1 is now quite interesting: both arctan 1 and∑∞
n=0(−1)n
2n+1 converge (the latter by the Leibniz theorem). If they wereequal, we would have
arctan 1 =π
4= 1− 1
3+
1
5− 1
7+− ... ,
which is known as the Leibniz series.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 598 / 624
Integral Calculus in One Variable Applications of Integration
Abel’s Limit Theorem
The justification for the Leibniz series is in fact provided by the followingtheorem, which will be proved in the assignments:4.3.11. Abel’s Limit Theorem. Let f : [0, 1)→ R be defined by
f (x) =∞∑k=0
akxk ,
where∑
ak converges. Then∑
akxk converges uniformly on [0, 1], so
f ∈ C ([0, 1]) and∞∑k=0
ak = limx1
∞∑k=0
akxk .
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 599 / 624
Integral Calculus in One Variable Applications of Integration
Abel SummabilityAbel’s Limit Theorem gives rise to another consideration: if a series∑∞
k=0 ak converges, then the power series∞∑k=0
akxk
has radius of convergence ϱ ≥ 1 and certainly
limx1
∞∑k=0
akxk =
∞∑k=0
ak .
We can hence generalize the concept of summability of sequences, bydefining the sum to be the limit of the power series; we define the Abelsum
A –∞∑n=0
an := limx1
∞∑n=0
anxn
for any complex sequence (an) for which the right-hand side exists.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 600 / 624
Integral Calculus in One Variable Applications of Integration
Abel Summability4.3.12. Remark. If
∑∞k=0 ak converges,
∞∑k=0
ak = A –∞∑n=1
an.
But we also have sequences whose series does not converge in the classicalsense, such as the following:4.3.13. Example. (an) with an = (−1)n. From Example 4.3.7 we know that
A –∞∑n=0
(−1)n = limx1
∞∑n=0
(−1)nxn = limx1
1
1 + x=
1
2
so the sequence (1,−1, 1,−1, 1, ...) is Abel summable with sum 1/2. Wehence write
1− 1 + 1− 1 + 1−+ · · · = 1
2(4.3.11)
where the left-hand side is a classically divergent series.Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 601 / 624
Integral Calculus in One Variable Applications of Integration
Abel SummabilityIn the assignments you will show that
A –∞∑n=1
n(−1)n+1 = 1− 2 + 3− 4 + 5−+ · · · = 1
4.
These types of divergent series were historically analyzed by Euler andothers through naive methods, which we briefly sketch here.Let us revisit (4.3.11). If we write S = 1− 1 + 1− 1 + 1− 1 +− ...observe that
1− S = 1− (1− 1 + 1− 1 + 1− 1 +− ...)
= 1− 1 + 1− 1 + 1−+ ... = S
Hence we can deduce that S = 1− S or
S = 1− 1 + 1− 1 + 1− 1 +− ... = 1/2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 602 / 624
Integral Calculus in One Variable Applications of Integration
Divergent SeriesIn a similar way we can tackle a more interesting series: let
S = 1 + 2 + 3 + 4 + 5 + 6 + 7 + ...
Then−3S = (S − 2 · 2S) = 1 + 2 + 3 + 4 + ...− 2(2 + 4 + 6 + 8 + ...)
= 1− 2 + 3− 4 + 5− 6 +− ...
We can further write−3S = 1− 2 + 3− 4 + 5− 6 +− ...
= 1− (2− 3 + 4− 5 +− ...)
= 1− (1− 2 + 3− 4 +− ...)− (1− 1 + 1− 1 +− ...)
= 1 + 3S − 1/2
Together, this gives
S = 1 + 2 + 3 + 4 + 5 + 6 + 7 + ... = − 1
12. (4.3.12)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 603 / 624
Integral Calculus in One Variable Applications of Integration
Divergent SeriesIt may seem strange that a divergent series yields a “sum” which isnegative. However, mathematicians in the 17th century did believe that“beyond positive infinity” lay the negative numbers.It is remarkable that this concept is even relevant in physics today: forexample, physical system of atoms can have a “negative (absolute)temperature” and then be “hotter” than any such system with a “positivetemperature”.Reference Max-Planck-Gesellschaft. “A temperature below absolute zero: Atomsat negative absolute temperature are the hottest systems in the world.”ScienceDaily, 4 Jan. 2013.http://www.sciencedaily.com/releases/2013/01/130104143516.htm.
The actual sum (4.3.12) occurs in quantum field theory and string theory,so that it is not “nonsense” as you may think. Let us try to justify the sumin a more rigorous fashion.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 604 / 624
Integral Calculus in One Variable Applications of Integration
Riemann Zeta FunctionIt turns out that the series 1 + 2 + 3 + 4 + 5 + 6 + ... is notAbel-summable: the limit
A –∞∑n=1
n = limx1
∞∑n=1
nxn = limx1
x
(1− x)2
does not exist! To justify (4.3.12), a more sophisticated approach isnecessary.The Riemann zeta function is defined as
ζ(s) =∞∑n=1
1
ns.
Note that the right-hand side is just the p-series, so we know that ζ(s)exists for s > 1. However, ζ can be extended to a function defined onR \ 1. This extension is analogous to, but more complicated than, theextension of the Euler Gamma function (see Slide 572).
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 605 / 624
Integral Calculus in One Variable Applications of Integration
Riemann Zeta FunctionWe denote the extension by ζ also; a sketch is shown below:
1 2s
1
Π2
6
y
y ΖHsL
For example, ζ(2) =∑ 1
n2= π2
6 (see the exercises).Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 606 / 624
Integral Calculus in One Variable Applications of Integration
Riemann Zeta Function
Now some divergent series might be considered as special cases of the zetafunction. For example,
ζ(0) =∞∑n=1
1
n0=
∞∑n=1
1 = 1 + 1 + 1 + 1 + ... ,
ζ(−1) =∞∑n=1
1
n−1=
∞∑n=1
n = 1 + 2 + 3 + 4 + ...
We see that the series in (4.3.12) is given by ζ(−1). Let us analyze thegraph of the zeta function more closely in this region.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 607 / 624
Integral Calculus in One Variable Applications of Integration
Riemann Zeta Function
-8 -6 -4 -2 -1s
-1
12
-1
2
y
y ΖHsL
Observe that ζ(0) = −1/2 and ζ(−1) = −1/12! Thus, it (4.3.12) doesmake sense if we regard the summation by extending the zeta function.This summation procedure is called zeta function regularization.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 608 / 624
Integral Calculus in One Variable Applications of Integration
Riemann Zeta FunctionIn order to calculate ζ(−1), we need to understand more about theextension of the zeta function. In particular, we would need to prove thatthe extension of the zeta function satisfies the functional equation
ζ(1− s) =2Γ (s)
(2π)scos(π2s)ζ(s) (4.3.13)
where Γ is the Euler Gamma function. If we set s = 2 in (4.3.13) weobtain
ζ(−1) = 2
4π2Γ (2) cos(π)ζ(2) = − 1
12.
The proof of (4.3.13) is today usually performed using methods ofcomplex analysis. An elementary (but complicated) proof, retracing Euler’soriginal calculations, is available on SAKAI.Reference B. Cais, Divergent series: why 1 + 2 + 3 + · · · = −1/12, retrieved fromhttp://www.math.mcgill.ca/bcais/papers.html
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 609 / 624
Integral Calculus in One Variable Applications of Integration
A Deceptively Simple Power SeriesConsider the power series
∞∑n=1
nn
n!zn. (4.3.14)
The radius of convergence can be found from
limn→∞
(n + 1)n+1n!
nn(n + 1)!= lim
n→∞
(1 +
1
n
)n
= e
(see Proposition 3.7.2), soϱ =
1
e.
We are interested in the convergence at ±1/e. Let us discuss first theseries
∞∑n=1
nn
n!en. (4.3.15)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 610 / 624
Integral Calculus in One Variable Applications of Integration
Elementary Estimate on the FactorialTo analyze the convergence of (4.3.15), we need an estimate on thegrowth of n!. We may obtain an elementary estimate as follows: Let
Sn = ln(n!) =n∑
k=1
ln k.
Then it is easy to see that∫ n+1
1ln x dx > Sn >
∫ n+1
2ln(x − 1) dx =
∫ n
1ln x dx . (4.3.16)
1 2 3 4 5 6 7x
lnH2L
lnH3L
lnH4LlnH5LlnH6LlnH7L
y
y lnHxL
y lnHx - 1L
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 611 / 624
Integral Calculus in One Variable Applications of Integration
Elementary Estimate on the Factorial
Integrating (4.3.16), we immediately obtain
lnnn
en−1< Sn < ln
(n + 1)n+1
en
or, since Sn = ln(n!),
nn
en−1< n! <
(n + 1)n+1
en. (4.3.17)
Noting that (1 + 1/n)n < e by Lemma 3.7.4, we have (n + 1)n < enn andobtain
e(ne
)n< n! < e(n + 1)
(ne
)n. (4.3.18)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 612 / 624
Integral Calculus in One Variable Applications of Integration
Elementary Estimate on the Factorial
4.3.14. Remark. From (4.3.18) and the squeeze theorem, we findn√n!
nn→∞−−−→ 1
e.
This relation is useful when applying the root test to series with factorialcoefficients.We can now analyze the convergence of (4.3.14) at x = ϱ = 1/e. Theestimate (4.3.18) gives
1
n!
(ne
)n>
1
n + 1
1
e
so (4.3.15) diverges by comparison with the harmonic series.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 613 / 624
Integral Calculus in One Variable Applications of Integration
Elementary Estimate on the Factorial
We now turn to the convergence of (4.3.14) at z = −1/e, i.e., theconvergence of
∞∑n=1
(−1)n nn
n!en. (4.3.19)
We would like to apply the Leibniz criterion to show the convergence of(4.3.19). However, while it is not difficult to show that the sequence (an),
an =nn
n!en
is decreasing, it does not follow that it converges to zero. The estimate(4.3.18) only yields an < e for all n ∈ N. Thus, we need a finer estimateon the behavior of n!.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 614 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s Formula
We say that two functions f , g : R→ R are asymptotically equal asx →∞, writing f (x) ∼ g(x) as x →∞, if
limx→∞
f (x)
g(x)= 1.
This implies thatlimx→∞
g(x)− f (x)
g(x)= 0,
i.e., the relative difference between f (x) and g(x) converges to zero. Wenow prove a famous result on the asymptotic behavior of the factorialwhich has important applications in probability theory and statistics.
4.3.15. Stirling’s Formula. n! ∼√2πn
(ne
)nas n→∞.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 615 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s FormulaProof.We will consider the sequence (an) given by
an :=n!√
2n(ne
)nand first show that it converges to some C ∈ R. Write
bn := ln an.
Then, after some elementary transformations,
bn − bn+1 =2n + 1
2ln
n + 1
n− 1. (4.3.20)
We want to show that the right-hand side is positive, so that (bn) isdecreasing.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 616 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s FormulaProof (continued).We will work with a series expansion for ln n+1
n , n ∈ N \ 0. The mostimmediate way to obtain such an expansion would be to use the Taylorseries
ln(1 + x) = x − 1
2x2 +
1
3x3 − 1
4x4 +− ... , |x | < 1. (4.3.21)
However, this leads to an alternating series, which will not be useful inestimating the RHS of (4.3.20):
lnn + 1
n= ln
(1 +
1
n
)=
∞∑k=1
(−1)k+1
k
1
nk
Instead, we might use that
− ln(1− x) = x +1
2x2 +
1
3x3 +
1
4x4 + ... , |x | < 1. (4.3.22)
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 617 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s FormulaProof (continued).Then
lnn + 1
n= − ln
n
n + 1= − ln
(1− 1
n + 1
)=
∞∑k=1
1
k
1
(n + 1)k
=1
n + 1+
∞∑k=2
1
k
1
(n + 1)k.
However, without additional estimates, this is not sufficient to show thatthe RHS of (4.3.20) is positive. We need to use a more sneaky expansion.Note that (4.3.21) and (4.3.22) give
ln1 + x
1− x= ln(1 + x)− ln(1− x) = 2
∞∑k=0
x2k+1
2k + 1.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 618 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s Formula
Proof (continued).We then have
lnn + 1
n= ln
1 + 12n+1
1− 12n+1
= 2∞∑k=0
1
2k + 1
1
(2n + 1)2k+1
=2
2n + 1+ 2
∞∑k=1
1
2k + 1
1
(2n + 1)2k+1
Inserting this into (4.3.20), we have
bn − bn+1 =2n + 1
2ln
n + 1
n− 1 =
∞∑k=1
1
2k + 1
1
(2n + 1)2k> 0.
It follows that the sequence (bn) is decreasing.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 619 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s FormulaProof (continued).Observe that
bn − bn+1 <
∞∑k=1
1
(2n + 1)2k=
1
(2n + 1)21
1− 1(2n+1)2
=1
4
1
n(n + 1).
Hence,
b1 − bn = (b1 − b2) + (b2 − b3) + · · ·+ (bn−1 − bn)
<1
4
n−1∑k=1
1
k(k + 1)
<1
4
∞∑k=1
1
k(k + 1)=
1
4
∞∑k=1
(1
k− 1
k + 1
)=
1
4.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 620 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s Formula
Proof (continued).so
bn > b1 −1
4=
e√2− 1
4
and (bn) is bounded below. Hence, (bn) converges. Since the exponentialfunction is continuous, we have
limn→∞
an = limn→∞
ebn = elim
n→∞bn,
so (an) converges, i.e.,
C := limn→∞
n!√2n(ne
)nexists.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 621 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s FormulaProof (continued).We now use the Wallis product formula,
π
2= lim
n→∞
24n(n!)4
((2n)!)2(2n + 1)
(which is proven in the assignments). Then
π
2= lim
n→∞
24n
(2n + 1)
(√2n(ne
)n)4 ( n!√2n(ne
)n)4×(
1√4n(2ne
)2n)2(√4n (2ne )2n(2n)!
)2
= C 2 limn→∞
n2
n(2n + 1)=
C 2
2.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 622 / 624
Integral Calculus in One Variable Applications of Integration
Stirling’s Formula
Proof (continued).It follows that C =
√π and hence
limn→∞
n!√2n(ne
)n =√π,
completing the proof.We can now see that there exists a C > 0 such that
an =nn
n!en<
C√n
for all n large enough, i.e., the sequence (an) converges to zeromonotonically. Thus, the power series (4.3.14) converges for z = −1/e.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 623 / 624
Integral Calculus in One Variable Applications of Integration
Final Exam
The preceding material completes the final third of the course material. Itencompasses everything that will be the subject of the Final Exam.The exam date and time will be announced on SAKAI.No calculators or other aids will be permitted during the exam. A sampleexam with solutions has been uploaded to SAKAI. Please study it carefully,including the instructions on the cover page.
Dr. Hohberger (UM-SJTU JI) Functions of a Single Variable Fall 2015 624 / 624