holonomic versus nonholonomic constraints

Karlstads universitet 651 88 Karlstad

Tfn 054-700 10 00 Fax 054-700 14 60

[email protected] www.kau.se

Faculty of Technology and Science

Mattias Flygare

Holonomic versus nonholonomicconstraints

PhysicsC-level thesis

Date: 2012-05-30

Supervisor: Jürgen Fuchs

Examiner: Claes Uggla

Abstract

Courses in analytical mechanics for undergraduate students are often limited

to treatment of holonomic constraints, which are constraints on coordinates. The

concept of nonholonomic constraints, constraints on velocities, is usually only men-

tioned brie�y and it is easy to get a wrongful idea of what they are and how to treat

them. This text explains and compares the methods of deriving the Euler-Lagrange

equations and the consequences when imposing di�erent kinds of constraints. One

way to properly treat both holonomic and nonholonomic constraints is given, pin-

pointing the di�culties and common errors. Along the way, the treatment in local

coordinates is also put in more modern terms, in the language of di�erential geom-

etry, which is the language most commonly used in modern texts on the subject.

2

Acknowledgements

My biggest thanks to my supervisor Jürgen Fuchs, always interested and always

interesting! I would also like to thank Jonatan Andersson for our long discussions

and all your helpful comments. Thanks also to David, So�e, Jessica and Joakim

for your willingness to listen and help. Last but not least, thank you to my wife

Anna-Lena and my two sons Sixten and Alexander!

3

Contents

1 Introduction 5

2 Mathematical tools 9

2.1 Linear and a�ne functions . . . . . . . . . . . . . . . . . . . . . . . . . . 9

2.2 Projection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

2.3 Manifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

2.4 Tangent vectors, spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

2.5 Fibre bundles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

2.6 Di�erential k-forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

2.7 Wedge product between di�erential forms . . . . . . . . . . . . . . . . . 13

2.8 Exterior derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

2.9 Jacobi-Lie bracket . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

2.10 Ehresmann connection . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

3 D'Alembert's principle 15

4 Constraints 17

4.1 Holonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

4.2 Nonholonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . 18

5 Structure of the equations of motion 22

5.1 Imposing the constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

5.2 Obtaining the equations of motion . . . . . . . . . . . . . . . . . . . . . 22

5.3 Constrained Lagrangian . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

5.4 Interpretation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

5.5 Conservation laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25

5.6 An Ehresmann connection . . . . . . . . . . . . . . . . . . . . . . . . . . 26

5.7 Curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

6 An example: The falling disc 32

6.1 Writing the Lagrangian . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

6.2 Change of coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

6.3 Nonholonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . 34

6.4 Equations of motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36

6.5 Interpretation and stable solutions . . . . . . . . . . . . . . . . . . . . . 37

7 Results and conclusion 39

4

1 Introduction

In the late 17th century, the German mathematician Gottfried Leibniz introduced the

concept of vis viva, the living force, to be the mass times the square of the velocity, mv2.

This is today referred to as kinetic energy (di�ering only by the unimportant factor

2)[11]. He also replaced Isaac Newton's concept of force with the work of the force, later

replaced by an even more basic quantity, the work function. This marks the starting

point of a new branch of mechanics, often called analytical mechanics, where the vector

quantities of force and momentum are no longer important, but instead these two scalar

quantities, the kinetic energy T and the work function U completely describe the motion

of the system.

Through the work of Leonard Euler, Joseph-Louis Lagrange and others in the second

half of the 18th century, a new formalism was developed, called Lagrangian mechanics.

Euler and Lagrange were the �rst to discover and formulate the principle of least action,

stating that given a start point and an end point, assuming the energy of the system is

conserved, a time-integral of the kinetic energy over any possible path between the start

and end points has a stationary value, and that nature �will choose� the path with the

smallest of these values. Thus, to �nd the actual motion of the system one only needed

to �nd the minimum value for this integral. This was achieved by using calculus of

variations, a �eld of mathematics that deals with maximizing or minimizing functionals,

analogous to �nding extremal points of functions in ordinary calculus.

In 1834 and 1835, William Rowan Hamilton published two papers titled On a General

Method in Dynamics where he was able to generalize the principle of least action to

include nonconservative systems. The integrand in the action integral was no longer the

kinetic energy alone, but the di�erence between kinetic energy T and potential energy V ,

the most fundamental quantity in Lagrangian mechanics, later named the Lagrangian,

L = T − V. (1.1)

This variational principle was named Hamilton's principle. Numerous other formulations

of the principle of least work were also developed (by Lagrange, Euler, Jacobi) but they

are all similar and all originate from the principle of least action. These principles

all lead to the equations of motion in the Lagrangian formalism, the Euler-Lagrange

equationsd

dt

∂L

∂qj− ∂L

∂qj= 0, (1.2)

for any generalized coordinate qj where j = 1, ..., n and n is the total number of coordi-

nates.

There is however another way to arrive at these equations, and that is d'Alembert's

principle, sometimes called the Lagrange-d'Alembert principle, introduced 1743 in Traité

de Dynamique by Jean-Baptiste le Rond d'Alembert. This principle does not rely on the

least action principle at all, but instead starts with Newton's second law and applies the

principle of virtual work, stating that a system is in equilibrium if and only if the total

virtual work (the work done by a force along a spacial displacement when time is kept

�xed) of all applied forces vanish. The principle thus assumes that all the work done by

5

constraint forces cancel. The new addition by d'Alembert (startlingly simple and still

very important) to this principle was to take the right hand side of Newton's second

law, the mass times the acceleration, and move it to the left hand side and treat it as

if it were a force, called the force of inertia. In this way, the principle of virtual work

could be extended to include systems with non-trivial dynamics and not only statical

systems.

The d'Alembert principle also leads to the Euler-Lagrange equations and it is actually

the most general principle of them all, and indeed it can quite easily be mathematically

reformulated as Hamilton's principle. Hamilton's principle (and the same goes for all

the mentioned least action principles) is thus equivalent to d'Alembert's principle in all

applications where both can be applied [11]. The latter half of the previous statement is

very important but sometimes forgotten. To get a quick appreciation of how the two

principles di�er we �rst need to collect some more information.

As previously mentioned, in analytical mechanics, the quantities of importance are

the kinetic energy and the work function. The forces that gives rise to a virtual work

are commonly classi�ed as conservative or nonconservative, or sometimes as potential

or nonpotential. A more general classi�cation and a terminology used in [11], divides

the forces into two categories:

1. Monogenic. Forces that are derivable from a scalar function U called the work

function, of the coordinates, the coordinate velocities and time explicitly.

2. Polygenic. Forces that are not monogenic, but when it is still relevant to talk

about the virtual work done by the force (such as frictional forces).

Conservative forces are now the special case when U does not depend on time explicitly.

Potential forces are then the special case of conservative forces but when the work

function neither depends on time explicitly, nor the coordinate velocities. The negative

of such a work function is what we normally call a scalar potential.

Hamilton's variational principle actually has many advantages over d'Alembert's

principle, in areas such as optimal control and when dealing with systems of curvilinear

coordinates, although it turns out that Hamilton's principle is invalid when dealing

with polygenic forces while d'Alembert's still works. To see what this means, let us also

classify forces of constraint into two categories:

1. Holonomic constraints. Integrable constraints, meaning that given some con-

straints depending on time-derivatives of coordinates, these constraints can be

integrated as to express the constraints in only the coordinates themselves, a ter-

minology �rst introduced by Heinrich Hertz in 1894.

2. Nonholonomic constraints. Constraints that are not holonomic. These might

be constraints that are expressed in terms of coordinate velocities that are not

derivable from coordinate constraints (thereby unintegrable) or constraints not

given as an equation at all.

There is also a connection between the two above classi�cations; Holonomic con-

straints give rise to monogenic forces and nonholonomic constraints give rise to polygenic

6

forces[11]. From this we can immediately deduce the important fact that Hamilton's

principle is applicable only to systems subject to holonomic constraints and that for

treatment of nonholonomic constraints only d'Alembert's principle is applicable.

Probably due to the elegance and wide areas of use of Hamilton's principle on systems

with holonomic constraints there has sometimes arisen some confusion when nonholo-

nomic systems are to be treated. One way to do this properly was �rmly established

already in 1900 in [10] but still today one can �nd books for students in analytical me-

chanics, as pointed out by [1] and [6], where the topic is treated in an uncareful manner,

such as when generality is claimed for results applying only to holonomic constraints,

or the author may simply have misunderstood how the treatment of nonholonomic con-

straints di�ers from holonomic.

Although there still is some confusion remaining, there are numerous publications

where the importance of using the d'Alembert principle is stressed and the main dif-

ferences are pointed out. One of the most active contemporary author within the �eld

is A.M. Bloch (for example in [1], [12], [3] or [2]), but there are also other authors (for

example W.S. Koon and J.E. Marsden in [9] and M.R. Flannery in [6]).

The main purpose of this text is to create a reasonably straight path through the

process of obtaining equations of motion for systems with both holonomic and nonholo-

nomic constraints, starting with the d'Alembert principle, deriving the Euler-Lagrange

equations and then applying constraints, both holonomic and nonholonomic. By do-

ing this in a very straightforward and transparent way (the ambition was to include

almost all equations and calculations) it becomes fairly easy to pinpoint the di�erences

between the two types of constraints, and how they appear explicitly in the equations

when imposing the constraints.

A large part of the more recent literature in the �eld is written, not in terms of local

coordinates, but in terms of di�erential geometry. Since this text is intended for last

year undergraduate level and up, one of the bi-purposes of the text was to open the

door (but never completely walk through it) to describing mechanical systems in the

coordinate free language of geometry.

Finally the formulas developed are put to use on the mechanical system of a rolling

disc (such as a coin) on a horizontal plane, a good example of how rolling constraints

leads to linear nonholonomic constraint equations. The example has (of course) been

treated before in a similar way, but is included to illustrate the process of obtaining

the equations of motion by the process developed in the foregoing sections and it also

demonstrates some of the di�erences between holonomic and nonholonomic systems.

By pure necessity, and sometimes to avoid confusing the reader more than necessary,

there are some restrictions to generality in the text. To begin with we only treat sys-

tems where there are no nonpotential forces acting, which in this context means that

we only consider forces that arise from constraints. We also only treat systems where

the constraints are not depending on time explicitly (so called scleronomic systems, as

opposed to rheonomic, constraints that are depending on time explicitly). The nonholo-

nomic constraints we treat are all linear in the velocities, disregarding both constraints

that also have a term depending only on the coordinates (a�ne constraints) and also

7

completely nonlinear constraints.

Many of these restrictions can quite easily be lifted so that the results can be extended

in generality. In these cases the reader will be referred to external literature where this is

treated. In other cases, such as not treating completely nonlinear constraints, it is simply

outside the scope of either principle mentioned in this text to treat such constraints.

This text draws heavily from the above mentioned authors and follows closely their

argumentations and derivations, sometimes �lling in missing equations and sometimes

adding comments or conclusions that help tie together all loose ends. While no single

detail in this text is completely new, its main contribution is to gather all these loose

ends and tie them together in a way that cannot be located in the literature and that

hopefully provides the reader with new or refreshed knowledge and understanding of

Lagrangian mechanics.

8

2 Mathematical tools

In this section, de�nitions and explanations of some of the mathematical tools and

concepts used in this text are given. Some clarifying examples are also included to make

some of the more abstract concepts a bit more concrete. The section can be read straight

through, used as a reference or skipped altogether depending on the reader.

To read more about these mathematical tools and concepts, the reader is referred to

for instance [1] and [15].

2.1 Linear and a�ne functions

De�nition 1. A function f : Rn → Rm is said to be linear if

1. f(X+Y) = f(X) + f(Y), ∀X,Y ∈ Rn,

2. f(aX) = af(X), ∀X ∈ Rn,∀a ∈ R.

De�nition 2. A function g : Rn → Rm is said to be a�ne if there exists a linear

function f : Rn → Rm and a vector b ∈ Rm such that

g(X) = f(X) + b, ∀X ∈ Rn (2.1)

In plainer language, an a�ne transformation is a linear transformation followed by

a translation.

2.2 Projection

De�nition 3. A projection P is a linear transformation

P : Rn → Rn (2.2)

such that if u ∈ Rn and v = P (u) then

P (v) = v. (2.3)

In other words, if P is applied to a vector that is already transformed by P , the

result is the same vector. P is thus said to be idempotent, which is to say that

P 2 = P. (2.4)

2.3 Manifolds

To describe the positional state of a given mechanical system we �nd a minimum number

n of generalized coordinates, not necessarily coordinates in position space but any quan-

tity that describes the position of the system. All possible values of these coordinates

make up a set of points Q. This set may not resemble Euclidean space as a whole, but

locally (on a small enough scale) there exists a homeomorphism from Q to Rn. All theselocal charts to Rn overlap smoothly with each other so that the whole set of points can

be described in terms of these charts. Q is then called a di�erentiable manifold. More

explicitly, the de�nition is:

9

De�nition 4. A di�erentiable manifold of dimension n is a set Q together with a

�nite or countably in�nite set of subsets Uα ⊂ Q and with one-to-one mappings ϕα :

Uα → Rn such that

1. The union of all Uα is Q,

2. If I = Uα∩Uβ is a nonempty intersection of Uα and Uβ, then there exists a smooth

function ϕαβ : ϕα(I)→ ϕβ(I).

Figure 1: An example of a manifold with mappings to local coordinates

A simple illustration is to see the manifold as the earth and each chart as one of the

pages of an atlas. Although two di�erent pages may be scaled and rotated di�erently,

in the areas where they overlap it is easy to �nd points that correspond to each other.

Trivial cases of manifolds are when Q is Rn itself. An example of a non-trivial

manifold of dimension one is the circle S1. The generalized coordinate is usually the

angle θ which varies from 0 to 2π. This example also illustrates the need for more than

one chart to describe the manifold, since the coordinate values 0 and 2π describe the

same point on the circle.

Another example of a manifold is the two dimensional sphere S2, illustrated by �gure

1, which consists of all the points lying on the surface of a three dimensional ball, which is

a useful manifold to describe a spherical pendulum. Here the two generalized coordinates

would typically be the usual spherical coordinates θ and φ, with θ ranging from 0 to 2π

and φ from 0 to π. Also in this example we have several values of coordinates describing

the same point.

These examples of some possible manifolds are included to provide a better feeling

for what manifolds are.

2.4 Tangent vectors, spaces

Let c(t) and d(t) be smooth curves on the manifold Q that go through a point q ∈ Q so

that

c(0) = d(0) = q. (2.5)

10

Now assume a map ϕ from an open subset containing q to Rn so that ϕ(c(t)) and ϕ(d(t))

describe curves on the chart corresponding to the curves on the manifold. The curves c

and d are said to be N -tangent at q if

di

dtiϕ(c(t))

∣∣∣∣t=0

=di

dtiϕ(d(t))

∣∣∣∣t=0

, ∀i = 1, ..., N. (2.6)

In this text we only consider N = 1. Nth-order tangency is useful for instance when

considering constraints given in higher order time derivatives of the coordinates. Further

information can be found on the topic in [5, p. 566-570] and [4, p. 2785-2787].

Now we de�ne the tangent vector u to a manifold Q of dimension n at a point q to

be an n-dimensional vector with the components given by

ui =d

dt(ϕ(c(t)))i

∣∣∣∣t=0

, ∀i = 1, ..., n. (2.7)

In short, the tangent vector is the time derivative of the curve once it is mapped to

local coordinates.

If we take the span of all the tangent vectors of all possible curves through a point

q ∈ Q we get the tangent space of Q at the point q, denoted TqQ which is the vector

space Rn.If we now take all points in Q and bundle them together with all the attached tangent

spaces, we get the tangent bundle TQ. The tangent bundle is thus a 2n-dimensional

manifold that is suited to describe a system completely, and for example, since the

Lagrangian L is a function of both position and velocity, it is de�ned on TQ.

The tangent space to a point on a circle is simply the tangent line to the circle through

that point. The tangent bundle would then be the circle itself with lines coming out

of each point. This can be visualized as an in�nite cylinder with the manifold at the

bottom and the cylinder itself being made up by all the tangent spaces.

Figure 2: The tangent lines of a circle are placed over the point of tangency in a non-

intersecting manner. All tangent lines are �nally joined smoothly to make up the cylin-

der.

The tangent space to a point lying on a sphere is a tangent plane. The bundle in

this case has four dimensions so it is hard to visualize.

Mathematically, TQ, Q and the TqQ's form a �bre bundle, with each tangent space

TqQ called a �bre. In section 2.5 we explore this concept further.

11

2.5 Fibre bundles

A �bre bundle consists of four pieces of data, three topological spaces, such as manifolds,

B, T, F and a continuous surjection

π : T → B, (2.8)

where B is called the base space, T is called the total space and F is called the �bre.

The map π is called the bundle projection of the bundle which locally is a continuous

surjection which projects any element of T to B. For every q ∈ T there exists an open

neighbourhood U ⊂ B of the projection π(q) such that the inverse image π−1(U) is

homeomorphic to the product space U × F . The bundle projection πq at each point

q ∈ Q should also be a submersion, that is to say that the di�erential dπq should be a

surjection for every q ∈ Q. The bundle projection is thus also often called the submersion

of the �bre bundle.

We can think of B as a subspace of T and π as a projection to only the base variables.

Since π is not a bijection there is no inverse, but we can still speak of the �bre, de�ned

for any element r ∈ B as

π−1(r) = {q ∈ T : π(q) = r}, (2.9)

or in words, the set of points in T whose projection through π equals r ∈ B. This set

is called the �bre at r and when π−1(r) is homeomorphic to F for all r ∈ B, the totalspace, the base space, the �bre and the submersion π together form the �bre bundle.

Fibre bundles where the spaces B, T and F are vector spaces are called vector bundles.

A bundle where T = B × F is called a trivial bundle.

To see that the tangent bundle TQ is a �bre bundle, we call TQ the total space and

the con�guration manifold Q the base space and de�ne a submersion

π : TQ→ Q (2.10)

that locally projects each element in TQ onto the corresponding q ∈ Q.An example of a trivial bundle is again the circle S1 with each �bre a copy of the real

line R, so that the total space is simply S1×R. An example of a nontrivial �bre bundle

is the same base space, the circle S1, but instead of having the �bres make up a cylinder

we have them make up a Möbius strip, a sort of twisted cylinder such that going one

revolution around the strip reverses the sign. This �bre bundle locally resembles the

product space S1 × R but di�ers from it when considering what lies above the entire

circle [14]. Both these examples can be seen in �gure 3.

12

Figure 3: To the left a trivial �bre bundle involving a cylinder and to the right a nontrivial

bundle involving a Möbius strip.

2.6 Di�erential k-forms

A k-form α on a di�erentiable manifold Q is a smooth map from the products space

TQ×· · ·×TQ with k factors of TQ, to R and at any point q ∈ Q de�nes an alternating

multilinear map αq to R,αq : TqQ× · · · × TqQ︸︷︷︸

k factors

→ R, (2.11)

where αq is linear.

For example, a one-form can locally be expressed as

αq =

n∑i=1

fi(q)dqi (2.12)

for any q ∈ Q and some functions fi, where dqi are the exterior derivatives (section 2.8)

of each coordinate.

A two-form βq is alternating bilinear (or anti-symmetric). This means that

βq(X,Y) = −βq(Y,X), ∀X,Y ∈ TqQ. (2.13)

It can locally be expressed as

βq =

n∑i=1

n∑j=1

gij(q)dqi ∧ dqj (2.14)

for any q ∈ Q and some anti-symmetric matrix of functions gij . The symbol ∧ denotes

the wedge product which will be explained in section 2.7.

2.7 Wedge product between di�erential forms

The wedge product of a k-form α and an l-form β is a (k + l)-form denoted α ∧ β and

has the property

α ∧ β = (−1)klβ ∧ α. (2.15)

In particular, the wedge product α ∧ β between two one-forms α and β is a two-form

whose value at a point q ∈ Q is an alternating bilinear form de�ned by

(α ∧ β)q(X,Y) = α(X)β(Y)− β(Y)α(X). (2.16)

13

2.8 Exterior derivative

The exterior derivative of a function f(q1, ..., qn) is given in local coordinates by

df =

n∑i=1

∂f

∂qidqi. (2.17)

If a function is considered to be a di�erential zero-form, then for any k-form α, with k

a natural number, the exterior derivative is

d(α ∧ β) = (dα ∧ β) + (−1)k(α ∧ dβ). (2.18)

In local coordinates, the exterior derivative of a one-form expressed as

αq =

n∑i=1

fi(q)dqi, (2.19)

is the sum

dαq =

n∑i=1

n∑j=1

∂fi∂qj

dqj ∧ dqi. (2.20)

Comparing with (2.14) we see that the exterior derivative of a one-form is a two-form.

In the same way, the exterior derivative of a zero-form is a one-form. In general, if k is a

natural number, a (k+1)-form that is the exterior derivative of a k-form is called exact

and a k-form whose exterior derivative is zero is called closed. It can also be shown that

all exact k-forms are also closed.

2.9 Jacobi-Lie bracket

Any vector �eld on a manifold Q can be regarded as a di�erential operator acting on

smooth functions on Q. The Jacobi-Lie bracket is then de�ned as follows;

De�nition 5. The Jacobi-Lie bracket (sometimes called the Lie bracket) [X,Y] of

two vector �elds X and Y acting on a function f is then de�ned by

[X,Y](f) := X(Y(f))−Y(X(f)). (2.21)

In local coordinates, if {q1, q2, ..., qn} are coordinates, and { ∂∂q1 ,

∂∂q2 , ...,

∂∂qn } denotes

the basis for vector �elds, then

[X,Y] :=

n∑i=1

n∑j=1

(Xj ∂Y

i

∂qj− Y j ∂X

i

∂qj

)∂

∂qi. (2.22)

2.10 Ehresmann connection

De�nition 6. An Ehresmann connection A is a vector-valued one-form on Q that

satis�es:

1. A is vertical valued: Aq : TqQ→ Vq is a linear map for each point q ∈ Q.

2. A is a projection: A(u) = u, ∀u ∈ Vq.

Vq is called the vertical space, which is a subspace of TqQ, and it is determined by the

�bre bundle submersion π. If Tqπ is the projection from TqQ to the tangent space of the

base space of a speci�c �bre bundle, then Vq is the kernel of Tqπ.

14

3 D'Alembert's principle

In this section the equations of motion for a mechanical system with n independent

generalized coordinates will be derived using the d'Alembert principle, similarly to how

it is done in [1].

Starting from Newtons second law and splitting up the forces into applied forces Fiand constraint forces fci according to

Fi + fci = pi, (3.1)

we take the sum of the virtual work (the work done by a force upon a virtual displacement

δri, a displacement only in the coordinate but not in time, all consistent with the forces

and constraints at the time of the displacement)∑i

(Fi − pi) · δri +∑i

fci · δri = 0. (3.2)

Now we make our �rst assumption, which is that the virtual work of all constraint forces

vanishes (the virtual displacements must all satisfy any constraint equations). This is

true for many (but far from all) mechanical systems, such as rigid bodies. By applying

this assumption to (3.2) we arrive at∑i

(Fi − pi) · δri = 0. (3.3)

This is called d'Alembert's principle. Since the δri are, in general, not independent of

each other we switch to the generalized coordinates by the transformation

ri = ri(q1, q2, ..., qn; t) (3.4)

with n independent qj 's. The indices of q are given as an upper index to conform with

mathematical conventions and are not to be confused with powers of q. Introducing T

as the kinetic energy and V as the potential energy we can now de�ne the Lagrangian

L(q, q; t) = T − V, (3.5)

which now leads us to the following equation, equivalent to (3.3);

n∑j=1

[d

dt

(∂L

∂qj

)− ∂L

∂qj− F (NP )

j

]δqj = 0, (3.6)

where F (NP )j are all applied nonpotential forces acting on the system. For simplicity we

will hereafter assume that

F(NP )j = 0, ∀j = 1, ..., n, (3.7)

so that (3.6) becomesn∑j=1

[d

dt

(∂L

∂qj

)− ∂L

∂qj

]δqj = 0. (3.8)

15

When all n coordinates in (3.8) are independent of each other, the terms of the sum

in (3.8) must all equal zero individually, so the n equations needed to describe the

motion of the system are immediately obtained from (3.8). These are the Euler-Lagrange

equations,d

dt

(∂L

∂qj

)− ∂L

∂qj= 0, ∀j = 1, ..., n. (3.9)

16

4 Constraints

In section 3 we saw that (3.8) gives us (3.9) when all coordinates are independent, or

unconstrained. In this section we explore how to treat systems with constraints.

4.1 Holonomic constraints

We now consider systems where some of the coordinates are dependent of some of the

others (holonomic constraints), the dependence being described by some functions1

fa = fa(q), a = 1, ..., p, p < n, (4.1)

and the equations

fa(q) = 0, ∀a = 1, ..., p. (4.2)

In some cases it might be most practical to use these equations to eliminate all

dependent variables in the Lagrangian directly, and thus end up with a system of n− pindependent variables which can then be treated by (3.8).

In other cases it may be di�cult to perform such eliminations, or the forces pro-

duced by these constraints might be of interest. In such cases the constraints can be

incorporated into (3.8) and even be included in the Lagrangian itself. By d'Alembert's

principle, all the virtual displacements δq in (3.8) must satisfy the constraint equations.

In practice, this condition can be written as an equation by �rst di�erentiating (4.2),

giving the p equations

dfa =

n∑j=1

∂fa

∂qjdqj = 0, (4.3)

so when taking a virtual displacement (time is kept �xed) we get

n∑j=1

∂fa

∂qjδqj = 0. (4.4)

After multiplying each of these sums with a function

λa = λa(t), (4.5)

which are called Lagrange multipliers, we may subtract the sum of all variations of the

constraints from the left hand side of (3.8) to obtain

n∑j=1

[d

dt

(∂L

∂qj

)− ∂L

∂qj−

p∑a=1

λa(t)∂fa

∂qj(q)

]δqj = 0. (4.6)

Now we denote the dependent coordinates with

s1, s2, ..., sp (4.7)

and the independent ones with

r1, r2, ..., rn−p (4.8)

1The constraint functions may also be dependent on time explicitly but for simplicity we disregard

this case. The following results may be extended to such cases, see [6].

17

and choose the functions λa such that

d

dt

(∂L

∂sb

)− ∂L

∂sb−

p∑a=1

λa∂fa

∂sb= 0, ∀b = 1, ..., p. (4.9)

Equation (4.6) now reduces to

n−p∑α=1

[d

dt

(∂L

∂rα

)− ∂L

∂rα−

p∑a=1

λa∂fa

∂rα

]δrα = 0 (4.10)

where the δr's are all independent and arbitrary. Each term must therefore individually

vanish, which gives

d

dt

(∂L

∂rα

)− ∂L

∂rα=

p∑a=1

λa∂fa

∂rα, ∀α = 1, ..., n− p. (4.11)

Together with the original constraint equations, expressed in the new labelling of coor-

dinates, as

fa(s1, s2, ..., sp, r1, r2, ..., rn−p) = 0 (4.12)

we now have n independent equations that completely describe the motion of the system.

In addition we also get, as a bonus, the constraint forces given by∑a=1

λa∂fa

∂rα, ∀α = 1, ..., n− p. (4.13)

This also enables us, since the fa's are independent of q, to introduce the modi�ed

Lagrangian L∗, de�ned by

L∗ := L+

p∑a=1

λafa (4.14)

so that (4.6) becomesn∑j=1

[d

dt

(∂L∗

∂qj

)− ∂L∗

∂qj

]δqj = 0. (4.15)

This indeed shows that the holonomic constraint equations can be added to the La-

grangian with Lagrange multipliers and then use (4.15) to get the constrained equations

of motion. Equation (4.15) is sometimes referred to as d'Alembert's generalized principle,

and what this shows is that it does not matter whether we impose the constraints before

or after we take variations of the Lagrangian. This is however only true for holonomic

constraints, as we shall see in the next section.

4.2 Nonholonomic constraints

The treatment of constraint equations in which only the generalized coordinates and

time are present is well known and fairly straightforward. However, in some systems the

constraints are not expressible only in terms of coordinates, but also with coordinate

velocities (nonholonomic constraints). Consider constraints that can be expressed by p

functions of the form

ga = ga(q, q; t), a = 1, ..., p, p < n, (4.16)

18

and the equations

ga(q, q; t) = 0, ∀a = 1, ..., p. (4.17)

In general there is no guarantee that constraints of this type are treatable with the

d'Alembert principle. In particular, when the ga's are nonlinear in the velocities we

can't take variations of the constraints that are linear in the δq's. In many mechanical

systems, however, the constraints only depend on position or are linear in velocities.

Therefore we now focus on the interesting special case where

ga(q, q) =

n∑i=1

Aai (q)qi, (4.18)

commonly occurring, for example in systems with rolling constraints. We again disregard

explicit time dependence and now also disregard constraints with an additional term

involving only the coordinates (a�ne constraints, see section 2.1). The following results

can be extended to include also a�ne constraints, as seen in [1, p. 219-220].

Multiplying (4.18) with dt we get

ga(q, q)dt =

n∑i=1

Aai (q)dqi, (4.19)

and, when taking a virtual displacement (that is when dt = 0 and dqi becomes δqi), in

accordance with the d'Alembert principle we have

n∑i=1

Aai δqi = 0, ∀a = 1, ..., p. (4.20)

Multiplying with λa as before, we may now write

n∑j=1

[d

dt

(∂L

∂qj

)− ∂L

∂qj−

p∑a=1

λaAai

]δqj = 0. (4.21)

and proceed as in the holonomic case, denoting dependent and independent coordinates

with (4.7) and (4.8) and choosing the λa's appropriately to get

d

dt

(∂L

∂rα

)− ∂L

∂rα=

p∑a=1

λaAaα, ∀α = 1, ..., n− p, (4.22)

which is the analogue to equation (4.11) for holonomic constraints.

In (4.22) the functions Aaα have been reordered in the lower index so that

Aa1 , Aa2 , ..., A

an−p (4.23)

corresponds to the independent coordinate velocities that were relabelled as

r1, r2, ..., rn−p. (4.24)

Equation (4.22) gives the correct equations of motion in terms of Lagrange multipli-

ers, and we may now pose the question if it is possible to include the constraint functions

19

in the Lagrangian, thus imposing the constraints before taking the variation, to form an

equivalent generalized principle as we saw was possible in the holonomic case.

As before, we take the modi�ed Lagrangian to be

L∗ = L+

p∑a=1

µaga, (4.25)

with µa instead of λa to distinguish these functions from the ones used previously. Now

we get∂L∗

∂qj=∂L

∂qj+

p∑a=1

µaAaj , (4.26)

with time derivative

d

dt

∂L∗

∂qj=

d

dt

∂L

∂qj+

p∑a=1

(µaAaj + µa

n∑i=1

∂Aaj∂qi

qi

), (4.27)

and∂L∗

∂qj=∂L

∂qj+

p∑a=1

n∑i=1

µa∂Aai∂qj

qi. (4.28)

Using this L∗ in (4.15), combined with (4.27) and (4.27) gives us

n∑j=1

[d

dt

∂L

∂qj− ∂L

∂qj+

p∑a=1

µaAaj +

p∑a=1

n∑i=1

µa(∂Aaj∂qi− ∂Aai∂qj

)qi

]δqj = 0. (4.29)

Comparing (4.29) with (4.21) it becomes apparent that

µa(t) = −λa(t) (4.30)

and that the termp∑a=1

n∑i=1

µa(∂Aaj∂qi− ∂Aai∂qj

)qi (4.31)

needs to vanish in order to obtain the correct equations of motion (4.22). Since this

procedure works for holonomic constraints it might be expected that if the constraint

functions ga are simply the total time derivatives of some other functions depending

only on the coordinates, in other words, the constraints are really holonomic2, then

this term is zero for every j. It turns out that this is indeed the case, as shown in [6].

Generally these terms are not zero however, which shows that this procedure is generally

an incorrect approach.

The failure of obtaining the correct equations of motion by including them in the

Lagrangian also provides an explanation as to why Hamilton's variational principle does

not work when dealing with nonholonomic constraints, since Hamilton's principle o�ers

no option to impose the constraints after taking the variation. In fact, imposing the

constraints after taking the variations on the Lagrangian is crucial for nonholonomic

mechanical systems and shows that imposing constraints before taking the variation,

2These kind of constraints (when the linear velocity constraints are integrable, so that they can be

expressed in terms of only coordinates) are sometimes called semi-holonomic.

20

however useful in its proper context, is allowed only for holonomic systems. This �proper

context� has been pointed out by several authors on the subject ([1, p. 208], [3, p. 325])

to be optimal control systems, whereas the correct approach for mechanics is the one

given by equations (4.18) through (4.22).

21

5 Structure of the equations of motion

We now explore in more detail the structure of the equations determined by (4.21) of

nonholonomic systems with constraints linear in the velocities. The following passage

follows closely the path of [3, p. 325-326] but with added explanations and equations.

5.1 Imposing the constraints

We consider a con�guration manifold (section 2.3) Q with local coordinates

q1, q2, ..., qn (5.1)

constrained by a nonintegrable (nonholonomic), linear in velocities, constraint distribu-

tion D on Q that is given by a linear subset of the tangent space (section 2.4) TqQ at

each point q ∈ Q.Assume there are p constraint equations and that there is a choice of coordinates3

q = (r, s), (r ∈ Rn−p, s ∈ Rp) (5.2)

such that s represents the dependent generalized velocities and r the independent ones.

The constraints can now be described locally by the p equations

sa = −n−p∑α=1

Aaα(r, s)rα, ∀a = 1, ..., p. (5.3)

5.2 Obtaining the equations of motion

After taking the variations of the curves in con�guration space as before, we get

n∑j=1

(d

dt

∂L

∂qj− ∂L

∂qj

)δqj = 0. (5.4)

Now we impose the constraints. Because of (5.3), and the d'Alembert principle, the

variations must satisfy the equations

δsa +

n−p∑α=1

Aaα(r, s)δrα = 0, ∀a = 1, ..., p. (5.5)

Substituting q with r and s in (5.4)

n−p∑α=1

(d

dt

∂L

∂rα− ∂L

∂rα

)δrα +

p∑a=1

(d

dt

∂L

∂sa− ∂L

∂sa

)δsa = 0 (5.6)

and combining with (5.5)

n−p∑α=1

(d

dt

∂L

∂rα− ∂L

∂rα

)δrα +

p∑a=1

(d

dt

∂L

∂sa− ∂L

∂sa

)(−n−p∑α=1

Aaαδrα

)= 0. (5.7)

3According to [1, p. 217] this choice of coordinates is always possible.

22

Rearranging, we get

n−p∑α=1

[(d

dt

∂L

∂rα− ∂L

∂rα

)−

p∑a=1

Aaα

(d

dt

∂L

∂sa− ∂L

∂sa

)]δrα = 0. (5.8)

Since all the δrα are independent we can write(d

dt

∂L

∂rα− ∂L

∂rα

)=

p∑a=1

Aaα

(d

dt

∂L

∂sa− ∂L

∂sa

), ∀α = 1, ..., n− p. (5.9)

This is one way to eliminate the Lagrange multipliers and a comparison with equation

(4.22) shows us that the multipliers here are given by

λa =d

dt

∂L

∂sa− ∂L

∂sa, ∀a = 1, ..., p. (5.10)

Together with the constraint equations

sa = −n−p∑α=1

Aaα(r, s)rα, ∀a = 1, ..., p (5.11)

we now have n equations that are the complete equations of motion.

5.3 Constrained Lagrangian

Now we eliminate the s dependence in the Lagrangian L, thus getting a �constrained

Lagrangian�

Lc(r, s, r) := L(r, s, r, s(r, s, r)) (5.12)

where the components of s(r, s, r) are given by (5.11).

Then we have

∂L

∂rα=∂Lc∂rα

+

p∑b=1

n−p∑β=1

∂Abβ∂rα

rβ∂L

∂sb, ∀α = 1, ..., n− p (5.13)

∂L

∂sa=∂Lc∂sa

+

p∑b=1

n−p∑β=1

∂Abβ∂sa

rβ∂L

∂sb, ∀a = 1, ..., p (5.14)

and∂L

∂rα=∂Lc∂rα

+

p∑b=1

Abα∂L

∂sb, ∀α = 1, ..., n− p. (5.15)

The time derivative of (5.15) is

d

dt

∂L

∂rα=

d

dt

∂Lc∂rα

+

p∑b=1

n−p∑β=1

[∂Abα∂rβ

−p∑a=1

Aaβ∂Abα∂sa

]rβ∂L

∂sb+

p∑a=1

Aaαd

dt

∂L

∂sa. (5.16)

Inserting (5.13), (5.14) and (5.16) into (5.9), the left hand side becomes

d

dt

∂Lc∂rα

+

p∑b=1

n−p∑β=1

[∂Abα∂rβ

−p∑a=1

Aaβ∂Abα∂sa

]rβ∂L

∂sb+

+

p∑a=1

Aaαd

dt

∂L

∂sa− ∂Lc∂rα−

p∑b=1

n−p∑β=1

∂Abβ∂rα

rβ∂L

∂sb(5.17)

23

and the right hand side becomes

p∑a=1

Aaαd

dt

∂L

∂sa−

p∑a=1

Aaα∂Lc∂sa−

p∑a=1

p∑b=1

n−p∑β=1

Aaα∂Abβ∂sa

rβ∂L

∂sb. (5.18)

The term∑pa=1A

aαddt

∂L∂sa cancels out and by grouping all terms with rβ ∂L

∂sbwe write

d

dt

∂Lc∂rα− ∂Lc∂rα

+

p∑a=1

Aaα∂Lc∂sa

=

=

p∑b=1

n−p∑β=1

[(∂Abβ∂rα

− ∂Abα∂rβ

)+

p∑a=1

(Aaβ

∂Abα∂sa

−Aaα∂Abβ∂sa

)]rβ∂L

∂sb(5.19)

or

d

dt


+

p∑a=1

Aaα∂Lc∂sa

= −p∑b=1

n−p∑β=1

Bbαβ rβ ∂L

∂sb, ∀α = 1, ..., n− p (5.20)

where

Bbαβ :=

(∂Abα∂rβ

−∂Abβ∂rα

)+

p∑a=1

(Aaα

∂Abβ∂sa

−Aaβ∂Abα∂sa

). (5.21)

5.4 Interpretation

In section 5.3, the equations of motion were rewritten in terms of the constrained La-

grangian Lc. To be able to make further interpretations we try to rewrite once more,

following the general path of [9, p. 24].

By writing the constraint equations as p one-forms (section 2.6) we have

ωb = dsb +

n−p∑α=1

Abα(r, s) drα, ∀b = 1, ..., p. (5.22)

Taking the exterior derivative (section 2.8) of the ωb's, we get the two-forms

dωb =

n−p∑α=1

n−p∑β=1

∂Abα∂rβ

drβ ∧ drα +

p∑a=1

∂Abα∂sa

dsa ∧ drα . (5.23)

Using the constraints in the form

dsb = −n−p∑α=1

Abα(r, s)drα (5.24)

we can eliminate ds:

dωb =

n−p∑α=1

n−p∑β=1

(∂Abα∂rβ

−p∑a=1

Aaβ∂Abα∂sa

)drβ ∧ drα. (5.25)

Now, using the fact that the wedge product is alternating bilinear (section 2.7) we have

drβ ∧ drα(r, δr) = rβδrα − δrβ rα (5.26)

24

and thus

dωb(r, δr) =

n−p∑α=1

n−p∑β=1

(∂Abα∂rβ

−p∑a=1

Aaβ∂Abα∂sa

)rβδrα−

−n−p∑α=1

n−p∑β=1

(∂Abα∂rβ

−p∑a=1

Aaβ∂Abα∂sa

)rαδrβ . (5.27)

Changing the role of the summation variables α and β in the second term we get

dωb(r, δr) =

n−p∑α=1

n−p∑β=1

[∂Abα∂rβ

−∂Abβ∂rα

+

p∑a=1

(Aaα

∂Abβ∂sa

−Aaβ∂Abα∂sa

)]rβδrα (5.28)

or

dωb(r, δr) =

n−p∑α=1

n−p∑β=1

Bbαβ rβδrα, ∀b = 1, ..., p. (5.29)

So the variation of the constrained Lagrangian Lc can be written as

δLc = −n−p∑α=1

[d

dt


+

p∑a=1

Aaα∂Lc∂sa

]δrα =

p∑b=1

∂L

∂sbdωb(r, δr). (5.30)

This equation isolates the e�ects of the constraints and also shows that when dω = 0 the

right hand side of (5.29) vanishes. This is the case when the constraints are integrable

(holonomic), that is when the one-form ω is closed (section 2.8), once again con�rming

what can be seen in equation (4.29). This equation thus also provides an alternate way

to obtain the equations of motion for holonomic systems, especially when the constraints

are initially given as functions linear in velocities, by substituting the constraints into

the Lagrangian and setting the variation of Lc to zero [1]. In cases where the constraints

truly are nonholonomic however, there are additional forces generated by the constraints,

given by the right hand side of (5.30). In sections 5.6 and 5.7 we shall explore how these

terms can be treated in terms of geometry.

5.5 Conservation laws

It is customary to de�ne an associated momentum (often called the conjugate momen-

tum) to each generalized coordinate. The conjugate momentum of a coordinate rk is

de�ned as

pk :=∂L

∂rk. (5.31)

This also means that for a holonomic system the equations of motion are given by (4.11)

in the new form of conjugate momenta;

pα −∂L

∂rα=

p∑a=1

λa∂fa

∂rα, ∀α = 1, ..., n− p. (5.32)

Now, if the Lagrangian L along with all the constraint equations f is independent of a

certain coordinate rk, then rk is said to be cyclic. For such a coordinate, (5.32) becomes

pk = 0, (5.33)

25

which means that

pk = constant, (5.34)

or, it is said to be conserved. Indeed, �the generalized momentum conjugate to a cyclic

coordinate is conserved� is stated in [7, p. 56] to be a general conservation theorem and

there are several other examples (for instance in [8, p. 568]) where similar statements

imply that this conservation law always holds. However, this is only true for holonomic

constraints, as we shall see below.

If we instead look at the equations of motion that we derived for nonholonomic

constraints of the form (5.3), the conjugate momentum is now

pk :=∂Lc∂rk

, (5.35)

so when inserting this into (5.20) we get

pα −∂Lc∂rα

= −p∑b=1

n−p∑β=1

Bbαβ rβ ∂L

∂sb−

p∑a=1

Aaα∂Lc∂sa

, ∀α = 1, ..., n− p. (5.36)

In this context, if rk is cyclic, then all the quantities L, Lc, Bbαβ and all the constraint

functions Aaα are independent of rk. If we also assume, as is often the case, that the

dependent coordinates s are also cyclic, then (5.36) becomes

pk = −p∑b=1

n−p∑β=1

Bbkβ rβ ∂L

∂sb, (5.37)

and we can clearly see, even with this last assumption, that the conjugate momentum of a

cyclic variable is generally not conserved for nonholonomic systems. Further discussion

of this fact can be found for instance in [17] and [3].

5.6 An Ehresmann connection

We now try to formulate the results derived so far in section 5 in geometrical terms.

The idea of an Ehresmann connection (section 2.10) is to think of the tangent space

TqQ at the point q ∈ Q as a the sum of a horizontal part and a vertical part, where the

horizontal part of the space is exactly the subspace of TqQ de�ned by the constraints.

Using our previous notation of r's and s's, the basis of TqQ is

B =

{∂

∂r1,∂

∂r2, ...,

∂

∂rn−p,∂

∂s1,∂

∂s2, ...,

∂

∂sp

}(5.38)

and any vector u ∈ TqQ can be written as

u =

r1

r2

...

rn−p

s1

s2

...

sp

=

n−p∑α=1

rα∂

∂rα+

p∑a=1

sa∂

∂sa, (5.39)

26

where the rα's and sa's are to be interpreted only as a notation for the values of the

elements of u and are not to be confused with the time derivatives of the elements of

any speci�c vector in the corresponding position space.

We now de�ne a �bre bundle (section 2.5) with total space Q and base space that

we will call R, that is locally the space of the r-variables. Thus we have a submersion

π that locally is the projection

r1

r2

...

rn−p

s1

s2

...

sp

7−→

r1

r2

...

rn−p

0

0...

0

. (5.40)

The tangent space of the base space at each point is then the space

TqR = span

{∂

∂r1,∂

∂r2, ...,

∂

∂rn−p

}. (5.41)

The projection from TqQ to TqR is called Tqπ and the kernel of Tqπ is what we call the

vertical space, de�ned as

Vq := span

{∂

∂s1,∂

∂s2, ...,

∂

∂sp

}. (5.42)

The space TqR is now a possible choice of horizontal space in order to have the tangent

space TqQ be a sum of the vertical and horizontal parts. This choice of horizontal space

does not, however, ful�l our second wish, that the horizontal space also is the subspace

of TqQ de�ned by our constraints. Instead we de�ne the map

A : TqQ→ Vq (5.43)

by

Aq(u) :=

p∑b=1

ωb(u)∂

∂sb. (5.44)

Inserting (5.22) in (5.44) we get

Aq(u) =

p∑b=1

(sb +

n−p∑α=1

Abαrα

)∂

∂sb. (5.45)

27

Written in the basis B, the resulting vector is

a := Aq(u) =

0

0...

0

s1 +n−p∑α=1

A1αrα

s2 +n−p∑α=1

A2αrα

...

sp +n−p∑α=1

Apαrα

(5.46)

so that

Aq(a) =

p∑b=1

(sb +

n−p∑α=1

Abαrα

)∂

∂sb= a. (5.47)

Aq is a projection (section 2.2) from TqQ to the vertical space Vq, or in other words, Aqis an Ehresmann connection.

The horizontal space Hq is now given by the kernel of Aq,

Hq = ker(Aq) = {u ∈ TqQ : Aq(u) = 0}, (5.48)

or more explicitly, Hq is spanned by all vectors in TqQ that satisfy

sa = −n−p∑α=1

Aaαrα, ∀a = 1, ..., p. (5.49)

This means that even if TqQ is the total space for velocity vectors at a point q, any

allowed velocity lies in the subspace Hq, determined by the Ehresmann connection that

was de�ned using the constraint equations.

To sum up, the vertical projection of a vector u ∈ TqQ, written

uV := A(u), (5.50)

is given by the map

r1

r2

...

rn−p

s1

s2

...

sp

7−→

0

0...

0

s1 +n−p∑α=1

A1αrα

s2 +n−p∑α=1

A2αrα

...

sp +n−p∑α=1

Apαrα

(5.51)

28

and the horizontal projection, written uH, is given by

r1

r2

...

rn−p

s1

s2

...

sp

7−→

r1

r2

...

rn−p

−n−p∑α=1

A1αrα

−n−p∑α=1

A2αrα

...

−n−p∑α=1

Apαrα

. (5.52)

From this it is easily checked that adding up the two projections gives us back the

original vector and that the constrained Lagrangian can be written as

Lc(r, s, r) = L(q, qH). (5.53)

5.7 Curvature

The curvature B of the connection A, with respect to two vector �elds X and Y on Q,

is de�ned as

B(X,Y) := −A([XH,YH]) (5.54)

where the bracket on the right hand side of (5.54) is the Jacobi-Lie bracket (section 2.9).

In local coordinates, let X and Y be described by

X =

X1

X2

...

Xn

=

r1x

r2x...

rn−px

s1x

s2x...

spx

, Y =

Y 1

Y 2

...

Y n

=

r1y

r2y...

rn−py

s1y

s2y...

spy

. (5.55)

The horizontal projections of the vector �elds, written in the basis B as before, are then

XH =

n−p∑β=1

rβx∂

∂rβ−

p∑a=1

n−p∑β=1

Aaβ(r, s)rβx

∂

∂sa(5.56)

and

YH =

n−p∑α=1

rαy∂

∂rα−

p∑a=1

n−p∑α=1

Aaα(r, s)rαy

∂

∂sa. (5.57)

29

Let XkH and Y kH denotes the k'th row of XH and YH respectively. Now we have

[XH,YH]k =

n−p∑β=1

rβx∂

∂rβY kH −

p∑a=1

n−p∑β=1

Aaβ rβx

∂

∂saY kH −

n−p∑α=1

rαy∂

∂rαXk

H+

+

p∑a=1

n−p∑α=1

Aaαrαy

∂

∂saXk

H, ∀k = 1, ..., n. (5.58)

The derivatives in (5.58) are

∂Y kH∂rβ

=

0 , for k = 1, ..., n− p

−n−p∑α=1

∂Ak−(n−p)α

∂rβrαy , for k = (n− p+ 1), ..., n,

(5.59)

∂Y kH∂sa

=

0 , for k = 1, ..., n− p

−n−p∑α=1

∂Ak−(n−p)α

∂sa rαy , for k = (n− p+ 1), ..., n,(5.60)

∂XkH

∂rα=

0 , for k = 1, ..., n− p

−n−p∑β=1

∂Ak−(n−p)β

∂rα rβx , for k = (n− p+ 1), ..., n(5.61)

and

∂XkH

∂sa=

0 , for k = 1, ..., n− p

−n−p∑β=1

∂Ak−(n−p)β

∂sa rβx , for k = (n− p+ 1), ..., n.(5.62)

Since [XH,YH]k = 0 for all k = 1, ..., n− p we write b = k − (n− p) so that

[XH,YH] =

p∑b=1

n−p∑α=1

n−p∑β=1

[∂Abβ∂rα

− ∂Abα∂rβ

+

p∑a=1

(Aaβ

∂Abβ∂sa

−Aaα∂Abα∂sa

)]rβx r

αy

∂

∂sb(5.63)

or

[XH,YH] = −p∑b=1

n−p∑α=1

n−p∑β=1

Bbαβ rβx rαy

∂

∂sb. (5.64)

Finally, recall that rβx = Xβ and rαy = Y α so, �nally, we get

[XH,YH] = −p∑b=1

n−p∑α=1

n−p∑β=1

BbαβXβY α

∂

∂sb. (5.65)

Since this vector �eld is already in the vertical space Vq, and since A is a projection to

Vq, we also know that

A([XH,YH]) = [XH,YH]. (5.66)

Also, equations (5.25) and (5.26) imply that

dωb(XH,YH) =

n−p∑α=1

n−p∑β=1

BbαβXβY α, ∀b = 1, ..., p. (5.67)

30

The conclusion is that the curvature B of the Ehresmann connection A, acting on two

arbitrary vectors X and Y on Q, can be written as

B(X,Y) =

p∑b=1

dωb(XH,YH)∂

∂sb(5.68)

and that it can be seen as a measure of the failure of the constraints to be integrable [1,

p. 108]. The local expression for curvature is then given by

B(X,Y)b =

n−p∑α=1

n−p∑β=1

BbαβXβY α, ∀b = 1, ..., p, (5.69)

with the coe�cients Bbαβ given by (5.21).

31

6 An example: The falling disc

Now we try to make use of the method developed in the previous sections in an example, a

model of a rolling homogenous disc, such as a coin, of negligible thickness on a horizontal

plane. A similar, but not identical, model was explored as early as 1892 by Vierkandt

in [16] and further, for example, in [1] and [18].

6.1 Writing the Lagrangian

We assume a homogeneous disc of radius R and mass m. We neglect all frictional forces

and assume that the disc is not allowed to slip as it rolls and rotates on the surface.

Rolling without slipping is of course an unrealistic idealization that is commonly adopted

for these types of systems, but for our purpose in this text it is deemed to be �realistic

enough�. The disc's centre of mass is positioned at the coordinates

X,Y, Z. (6.1)

The rotation of the disc in its rolling direction measured counter clockwise by the angle

θ, (6.2)

the rotation about the disc's z-axis measured from the x-axis counter clockwise by

φ, (6.3)

and the inclination angle, measured by

ψ. (6.4)

These generalized coordinates are illustrated by �gure 4.

Figure 4: A rolling disc on a �at surface

32

The kinetic energy of the system is

T =1

2m(X2 + Y 2 + Z2) +

1

2(I1θ

2 + I2φ2 + I3ψ

2) (6.5)

with I1, I2 and I3 the moments of inertia of the disc about each corresponding axis from

where we measure the rotation, here given by

I1 =1

2mR2, (6.6)

corresponding to the change in rotation measured by θ,

I2 =1

4mR2, (6.7)

corresponding to φ, and

I3 =1

4mR2, (6.8)

corresponding to the angle ψ. A more detailed description on how these values can be

calculated can be found in any book on basic mechanics, such as [13, p. 663-690].

The potential energy is

V = mgZ, (6.9)

so that we obtain the Lagrangian on TQ:

L =m

2

[X2 + Y 2 + Z2 +

1

2R2θ2 +

1

4R2φ2 +

1

4R2ψ2 − 2gZ

]. (6.10)

6.2 Change of coordinates

Since rolling without slipping is better expressed as a function of the coordinates at the

point of contact between the disc and the surface, we introduce

x, y (6.11)

for this purpose, the coordinates X,Y and Z can now be expressed as functions of these

new coordinates and the angles φ and ψ. We now have

X = x−R sinφ sinψ, (6.12)

Y = y −R cosφ sinψ (6.13)

and

Z = R cosψ. (6.14)

What this actually means is that the position of the centre of mass is known as soon

as we know the point of contact, the heading angle and the inclination of the disc.

These constraints are all holonomic so we can eliminate them directly by inserting the

equations into the Lagrangian. To be able to do this we also need the time derivatives,

X = x−R(cosφ sinψφ+ sinφ cosψψ

), (6.15)

Y = y −R(sinφ sinψφ− cosφ cosψψ

), (6.16)

33

and

Z = −R sin(ψ)ψ (6.17)

We also calculate

X2 = x2 − 2xR(cosφ sinψφ+ sinφ cosψψ

)+

+R2(cos2 φ sin2 ψφ2 + 2 cosφ sinφ cosψ sinψφψ + sin2 φ cos2 ψψ2

), (6.18)

Y 2 = y2 − 2yR(sinφ sinψφ− cosφ cosψψ

)+

+R2(sin2 φ sin2 ψφ2 − 2 cosφ sinφ cosψ sinψφψ + cos2 φ cos2 ψψ2

)(6.19)

and

Z2 = R2 sin2 ψψ2. (6.20)

Then we have

X2 + Y 2 + Z2 = x2 + y2 +R2ψ2 +R2 sin2 ψφ2−

− 2R[φ sinψ(x cosφ+ y sinφ) + ψ cosψ(x sinφ− y cosφ)

]. (6.21)

Now insert (6.14) and (6.21) into the Lagrangian to get

L =m

2

[x2 + y2 +

5R2

4ψ2 +R2

(1

4+ sin2 ψ

)φ2 +

R2

2θ2 − 2gR cosψ − F

], (6.22)

where

F = 2R[φ sinψ(x cosφ+ y sinφ) + ψ cosψ(x sinφ− y cosφ)

]. (6.23)

In the language of geometry we now have the con�guration manifold

Q = R2 × S1 × S1 × S1+ (6.24)

where

[x, y]T ∈ R2 (6.25)

and S1 is the one-sphere, the set of all points on a circle, and it is therefore the natural

space for each of the angles θ and φ. The set of points S1+ is de�ned as

S1+ := {x, y ∈ R | x2 + y2 = 1, y ≥ 0}, (6.26)

and is the set of points of the upper half of a circle, which is to be used as the space for

the angle ψ.

6.3 Nonholonomic constraints

The condition of rolling without slipping can be described by the equations

x = R cosφθ (6.27)

and

y = R sinφθ. (6.28)

34

To be able to write these constraints in the language of the approach we have devel-

oped, we let

s = [x, y]T (6.29)

and

r = [θ, φ, ψ]T (6.30)

so that (6.27) and (6.28) are written as

s1 = R cos(r2)r1 (6.31)

and

s2 = R sin(r2)r1. (6.32)

This is exactly the form of (5.3) with

A11(r, s) = −R cosφ, (6.33)

A21(r, s) = −R sinφ (6.34)

and with all the other A's zero.

Now we can use (5.21) to calculate the curvature terms:

B112 =

∂A11

∂φ=

∂

∂φ(−R cosφ) = R sinφ, (6.35)

B212 =

∂A21

∂φ=

∂

∂φ(−R sinφ) = −R cosφ, (6.36)

B121 = −∂A

11

∂φ= − ∂

∂φ(−R cosφ) = −R sinφ (6.37)

and

B221 = −∂A

21

∂φ= − ∂

∂φ(−R sinφ) = R cosφ, (6.38)

with all other B's zero.

Remembering that the constrained Lagrangian is the Lagrangian where all veloci-

ties ful�l the nonholonomic constraint equations, we insert (6.27) and (6.28) into the

Lagrangian. First we see that when inserting the constraints into the function F we get

F = 2R2[φ sinψ(cos2 φ+ sin2 φ)θ + ψ cosψ(cosφ sinφ− sinφ cosφ)θ

]=

= 2R2 sinψθφ, (6.39)

and then we see that

x2 + y2 = R2 cos2(φ)θ2 +R2 sin2(φ)θ2 = R2θ2 (6.40)

so the constrained Lagrangian becomes

Lc = L(q, qH) = mR2

[3

4θ2 +

(1

8+

1

2sin2 ψ

)φ2 − sinψθφ+

5

8ψ2 − g

Rcosψ

]. (6.41)

Since∂Lc∂s1

=∂Lc∂x

= 0 (6.42)

35

and∂Lc∂s2

=∂Lc∂y

= 0, (6.43)

equation (5.20) becomes

d

dt


+

2∑b=1

3∑β=1

Bbαβ rβ ∂L

∂sb= 0, α = 1, 2, 3 (6.44)

with

[r1, r2, r3]T = [θ, φ, ψ]T (6.45)

and

[s1, s2]T = [x, y]T . (6.46)

6.4 Equations of motion

Now we calculate the necessary derivatives to insert into (6.44):

Pθ =d

dt

∂Lc

∂θ= mR2

(3

2θ − φ sinψ − cosψφψ

), (6.47)

Pφ =d

dt

∂Lc

∂φ= mR2

[(1

4+ sin2 ψ

)φ+ sin(2ψ)φψ − sinψθ − cosψθψ

], (6.48)

Pψ =d

dt

∂Lc

∂ψ= mR2 5

4ψ, (6.49)

∂Lc∂ψ

= mR2

[1

2sin(2ψ)φ2 − cosψθφ+

g

Rsinψ

](6.50)

and∂Lc∂θ

=∂Lc∂φ

= 0. (6.51)

We also need

∂L

∂x= mx−mR

(φ sinψ cosφ+ ψ cosψ sinφ

)(6.52)

and∂L

∂y= my −mR

(φ sinψ sinφ− ψ cosψ cosφ

). (6.53)

Now we can calculate

B112φ

∂L

∂x+B2

12φ∂L

∂y= mRφ(xsinφ− ycosφ)−mR2 cosψφψ (6.54)

and

B121θ

∂L

∂x+B2

21θ∂L

∂y= mR2 cosψθψ −mRθ(xsinφ− ycosφ). (6.55)

Inserting into (6.44) gives us

mR2

[3

2θ − φ sinψ − 2 cosψφψ

]+mRφ (x sinφ− y cosφ) = 0, (6.56)

mR2

[(1

4+ sin2 ψ

)φ+ sin(2ψ)φψ − sinψθ

]−mRφ (x sinφ− y cosφ) = 0, (6.57)

36

and

mR2

[5

4ψ − 1

2sin(2ψ)φ2 + cosψθφ− g

Rsinψ

]= 0. (6.58)

Using the constraint equations (6.27) and (6.28) we see that

(x sinφ− y cosφ) = Rθ (cosφ sinφ− sinφ cosφ) = 0, (6.59)

so now we arrive at the �nal equations of motion

θ − 2

3

[φ sinψ + 2 cosψφψ

]= 0, (6.60)

φ+sin(2ψ)φψ − sinψθ(

14 + sin2 ψ

) = 0, (6.61)

and

ψ − 4

5

[1


g

Rsinψ

]= 0. (6.62)

Equations (6.60), (6.61) and (6.62) give the complete motion of the system.

6.5 Interpretation and stable solutions

In this example, θ and φ are cyclic coordinates (L, Lc, Aaα and Bbαβ are all independent

of θ and φ), so we can check whether their associated momenta are conserved, with the

help of equation (5.37). We have

Pθ = mR2 cosψφψ (6.63)

and

Pφ = −mR2 cosψθψ (6.64)

so here we see that the conjugate momenta are not conserved. This is consistent with

the statements made in section 5.5.

It is interesting to explore some special solutions to these equations. For instance

we can set θ = 0 and φ = 0, so that the rolling velocity and turning velocity are both

constant. Both equations (6.60) and (6.61) now give

φψ = 0, (6.65)

which tells us that we either have a constant direction of rolling φ or a constant tilt

angle ψ. If we assume the direction is constant, equation (6.62) then gives the solution

for ψ as the second order nonlinear di�erential equation

ψ − 4g

5Rsinψ = 0. (6.66)

Looking at equation (6.66) we see that the acceleration of the tilt angle ψ will always

have the same sign as the angle ψ itself. It follows that the only (unstable) equilibrium

solution in this case is when ψ = 0.

37

On the other hand, if we assume the tilt angle is constant, then equation (6.62)

provides an equation for the possible constant values of θ and φ,

φ =θ

2 sinψ±

√θ2

4 sin2 ψ− g

R cosψ, (6.67)

which in turn gives the inequality

∣∣∣θ∣∣∣ ≥√

4g sin2 ψ

R cosψ, (6.68)

which gives the lower limit of the rolling velocity θ as a function of the constant value

of ψ.

Another example is to set ψ = 0, so that the tilt angle is constant. Here, equation

(6.60) shows that

θ =2

3sinψφ (6.69)

while equation (6.61) gives us

θ =

(14 + sin2 ψ

)sinψ

φ. (6.70)

This can only be true if both θ and φ are zero or if

sin2 ψ = −3

4, (6.71)

which of course has no real solutions. Thus we must assume that

θ = φ = 0, (6.72)

so now we are back to our �rst case again, with ψ constant.

In summary we can say that constant velocities for rolling and turning gives possible

solutions with the tilt angle either constant or nonconstant, but if we start from an

assumption of constant tilt angle, then we only have solutions where both turning and

rolling velocities are constant.

In both of these examples there are possible solutions where the equations (6.63) and

(6.64) predict conserved conjugate momenta for both θ and/or φ, which is also what we

would expect for this model of a rolling disc.

38

7 Results and conclusion

In this section the main results and conclusions will be reviewed.

We started from Newton's second law and used d'Alembert's principle to arrive at

the equationn∑j=1

[d

dt

∂L

∂qj− ∂L

∂qj

]δqj = 0. (7.1)

We argued that when all n generalized coordinates are independent of each other,

we get the Euler-Lagrange equations

d

dt

∂L

∂qj− ∂L

∂qj= 0, ∀j = 1, ..., n. (7.2)

We also saw that when some of the coordinates are dependent of each other, their

dependency expressed by p functions fa, the constraints can be included in the Euler-

Lagrange equations, when labelling the (n − p) independent variables with rα, to give

us the equations

d

dt

∂L

∂rα− ∂L

∂rα=

p∑a=1

λa∂fa

∂rα, ∀r = 1, ..., n− p. (7.3)

We also showed that, when the constraints are holonomic, they can be added to the

Lagrangian to form a modi�ed Lagrangian L∗ so that the constraints in e�ect can be

imposed before taking the variations to the Lagrangian, consistent with the approach

for dealing with constraints via Hamilton's principle.

Next we showed that this procedure fails when applied to nonholonomic constraint,

and that the proper equations of motion only are obtained by imposing the constraints

after taking the variations, as can only be done by d'Alembert's principle. The equations

when using Lagrange multipliers on linear nonholonomic constraints given by

ga(q, q) :=

n∑i=1

Aai (q)qi = 0, (7.4)

are the equations

d

dt

∂L

∂rα− ∂L

∂rα=

p∑a=1

λaAaα, ∀α = 1, ..., n− p. (7.5)

In particular, we showed that when the constraint equations were written in the form

sa = −n−p∑α=1

Aaα(r, s)rα, ∀a = 1, ..., p, (7.6)

we derived the Lagrange multipliers to be

λa =d

dt

∂L

∂sa− ∂L

∂sa, ∀a = 1, ..., p, (7.7)

so that the equations of motion become

d

dt

∂L

∂rα− ∂L

∂rα=

p∑a=1

(d

dt

∂L

∂sa− ∂L

∂sa

)Aaα, ∀α = 1, ..., n− p. (7.8)

39

Further, we introduced the concept of the constrained Lagrangian Lc, which is the

Lagrangian de�ned only on the horizontal space of TQ, or more explicitly, de�ned only

on the subset of TQ de�ned by the constraint equations. In terms of the constrained

Lagrangian we derived the equations of motion to be

d

dt


+

p∑a=1

Aaα∂Lc∂sa

= −p∑b=1

n−p∑β=1

Bbαβ rβ ∂L

∂sb, ∀α = 1, ..., n− p (7.9)

where

Bbαβ :=

(∂Abα∂rβ

−∂Abβ∂rα

)+

p∑a=1

(Aaα

∂Abβ∂sa

−Aaβ∂Abα∂sa

). (7.10)

By rewriting the right hand side to be in terms of the exterior derivative of the two

form ω, which was the constraint �function�, we could show that the results were con-

sistent with holonomic constraints and that these extra terms involving Bbαβ , generated

by the nonholonomic constraint forces, could be seen, from a geometric perspective, as

curvature terms for a speci�c choice of connection.

This connection, called an Ehresmann connection, is a projection de�ned on the tan-

gent space TqQ of each point q ∈ Q and it is a powerful way to describe the constraints,

without using local coordinates.

We also showed that the conjugate momenta of a cyclic variable is generally not

conserved for nonholonomic systems, as it is in the holonomic case.

Finally we showed how to use some of the above mentioned results in an example.

The equations of motion for a �coin�, rolling without slipping on a plane surface, were

calculated to be

θ − 2

3

[φ sinψ + 2 cosψφψ

]= 0, (7.11)

φ+sin(2ψ)φψ − sinψθ(

14 + sin2 ψ

) = 0, (7.12)

and

ψ − 4

5

[1


g

Rsinψ

]= 0. (7.13)

The equations obtained were also consistent with the earlier conclusions that conju-

gate momenta are normally not conserved for nonholonomic constraints.

In conclusion it should also be said that, while being a fairly instructive example

showing how to use the developed equations, the example of the falling disc that was

considered in section 6 is arguably not very hard to solve even without the help of the

above formulas, simply with the use of Newtonian mechanics. There are of course other

examples when the Newton approach becomes too cumbersome or impractical and where

the above approach would o�er more of an advantage. More advantages still are to be

gained by developing the geometrical approach even further than was done in this text,

many of which are explored in the source literature.

40

References

[1] A.M. Bloch. Nonholonomic Mechanics and Control. Interdisciplinary applied math-

ematics: Systems and control. Springer, 2003.

[2] A.M. Bloch, P. Krishnaprasad, J. Marsden, and R. Murray. Nonholonomic me-

chanical systems with symmetry. Archive for Rational Mechanics and Analysis,

136:21�99, 1996.

[3] A.M. Bloch, J.E. Marsden, and D.V. Zenkov. Nonholonomic dynamics. Notices of

the AMS, 52(3):324�333, 2005.

[4] H. Cendra, A. Ibort, M. de León, and D. Martín de Diego. A generalization of

Chetaev's principle for a class of higher order nonholonomic constraints. Journal

of Mathematical Physics, 45(7):2785�2801, 2004.

[5] M. Crampin, W. Sarlet, and F. Cantrijn. Higher-order di�erential equations and

higher-order lagrangian mechanics. Mathematical Proceedings of the Cambridge

Philosophical Society, 99(03):565�587, 1986.

[6] M.R. Flannery. The enigma of nonholonomic constraints. American Journal of

Physics, 73(3):265�272, 2005.

[7] H. Goldstein, C.P. Poole, and J.L. Safko. Classical Mechanics. Addison Wesley,

2002.

[8] J.V. José and E.J. Saletan. Classical Dynamics: A Contemporary Approach. Cam-

bridge University Press, 1998.

[9] W.S. Koon and J.E. Marsden. The hamiltonian and lagrangian approaches to the

dynamics of nonholonomic systems. Reports on Mathematical Physics, 40:21�62,

1997.

[10] D. J. Korteweg. Über eine ziemlich verbreitete unrichtige Behandlungsweise eines

Problems der rollenden Bewegung, über die Theorie dieser Bewegung, und ins-

besondere über kleine rollende Schwingungen um eine Gleichgewichtslage. Nieuw

archief voor wiskunde, 24:130�161, 1900.

[11] C. Lanczos. The Variational Principles of Mechanics. Mathematical expositions.

Dover Publications, 1970.

[12] C-M. Marle. Various approaches to conservative and nonconservative nonholonomic

systems. Reports on Mathematical Physics, 42:211�229, 1998.

[13] J.L. Meriam and L.G. Kraige. Engineering Mechanics: Dynamics. Engineering

Mechanics. Wiley, 2008.

[14] R. Penrose. The Road to Reality: A Complete Guide to the Laws of the Universe.

Vintage Series. Vintage Books, 2007.

41

[15] R. Sjamaar. Manifolds and di�erential forms. Lecture notes from Cornell University,

2006.

[16] A. Vierkandt. Dritter Abschnitt: Das Rollen und Gleiten einer ebenen Fläche,

insbesondere einer homogenen Kreisscheibe, auf der Horizontalebene unter dem

Ein�uss der Schwere. Monatshefte für Mathematik, 3:117�134, 1892.

[17] D. V. Zenkov. Linear conservation laws of nonholonomic systems with symme-

try. Discrete and Continuous Dynamical Systems, supplementary volume:963�972,

2003.

[18] D.V. Zenkov, A.M. Bloch, and J.E. Marsden. The energy-momentum method

for the stability of non-holonomic systems. Dynamics and Stability of Systems,

13(2):123�165, 1998.

42

holonomic versus nonholonomic constraints

Documents