specialrelativityandclassicalfieldtheoryteaching.hayehinrichsen.de/lecturenotes/sr-resume.pdf ·...

306
Special Relativity and Classical Field Theory Résumé Short Summary Prof. Dr. Haye Hinrichsen Faculty for Physics and Astronomy University of Würzburg Summer term 2021 Version: 16. April 2021

Upload: others

Post on 25-Jan-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

  • Special Relativity and Classical Field Theory

    RésuméShort Summary

    Prof. Dr. Haye HinrichsenFaculty for Physics and Astronomy

    University of WürzburgSummer term 2021

    Version: 16. April 2021

  • Section 1.1

    Groups, Vector Space, and Linear Algebra

  • Groups

    A group G is a set of elements g ∈ G (typically transformations)with the properties

    I Associativity: ∀a,b,c∈G :

    (a ◦ b) ◦ c = a ◦ (b ◦ c)

    I Identity element: There exists an element e ∈ G such that

    e ◦ a = a ◦ e = a.

    I Inverse element: ∀a ∈ G , there exists an a−1 such that

    a ◦ a−1 = a−1 ◦ a = e.

    A group is called commutative or Abelian ⇔ a ◦ b = b ◦ a

  • Linear vector spaces

    A linear vector space V over R is a set of elements v ∈ Vequipped with the operations ’+’ and ’·’:

    I V is a commutative group under addition.I Identity element of scalar multiplication: 1v = v.

    Compatibility laws:I Compatibility with scalar multiplication: a(bv) = (ab)v.I Distributivity with respect to vectors: a(u + v) = au + av.I Distributivity with respect to scalars: (a + b)v = av + bv.

    Special element: The neutral element for ’+’, called zero vector ~0.

    So far we have no information about the length of a vector.

  • Representation of vectorsLinear dependence:

    {v1, . . . , vn} linear dependent ⇔ ∃{ai} :∑n

    i=1 aivi = 0.

    Basis:

    {e1, . . . , ed} ⊂ V is called basis if every v ∈ V can berepresented as a linear combination

    v =d∑

    i=1v iei

    I d is the dimension.I v1, v2, . . . , vd ∈ R are the components (with upper indices)

    which depend on the basis.I Einstein sum convention: v = v iei .

  • Covector spaceLine vectors versus column vectors

    A linear form α is a linear map α : V → R.

    I Linearity under vector addition: α(u + v) = α(u) + α(v)I Linearity under scalar multiplication: α(au) = aα(u)

    In order to specify a linear form we only have to know how it mapsthe basis vectors:

    α(u) = α(uiei ) = ui α(ei )︸ ︷︷ ︸=:αi

    = αiui

    α1, . . . , αd are the components of the linear form (lower indices).

  • Covector space – Dual basisThe set of all possible linear forms constitute a vector space V ∗,called dual space or covector space.

    I Addition of linear forms

    α + β : [α + β](u) := α(u) + β(u)

    I Scalar multiplication of linear forms

    aα : [aα](u) := aα(u)

    For each basis {e1, . . . , ed} of V there exists a corresponding dualbasis {e1, . . . , ed} of V ∗ defined by the ’holy relation’

    ei (ej) = δij ={1 if i = j0 if i 6= j

  • Covector spaceRepresentation of linear forms by components

    ei (ej) = δij ={1 if i = j0 if i 6= j

    Using this associated co-basis of V ∗, we can represent α by

    α = αiei ,

    where αi are the components of the linear form.

    I Vector components with upper indices are calledcontravariant.

    I Components of linear forms with lower indices are calledcovariant.

  • Basis and co-basis: An example

  • Recall

    SUMMARY

    I Vectors u ∈ V have contravariant components ui withupper indices.

    I Linear forms α ∈ V ∗ have covariant components αi withlower indices.

    I Einstein sum convention: Always add over pairs ofoppositely oriented indices.

    I Contraction: α(u) = αiui

  • Basis transformationsTwo basis systems {e1, . . . , ed} and {ẽ1, . . . , ẽd} in the same vector space V

    Since each basis vector can be represented in the respective otherbasis, we can write

    ei = ẽjAj i ẽj = ekBkj .

    where B is the inverse of A.

    I An abstract vector v ∈ V remains invariant, but itscomponents transform as

    ṽ j = Aj ivi vk = Bkj ṽ j .

    I An abstract linear form α ∈ V ∗ remains invariant, but itscomponents transform as

    αj = α̃iAi j α̃l = αkBkl

  • TensorsLinear machines with many connections

    A tensor T of rank (p, q) is a multilinear map that maps p linearforms α(1), . . . , α(p) and q vectors v(1), . . . , v(q) onto a number:

    α(1), . . . , α(p), v(1), . . . , v(q) 7→ T(α(1), . . . , α(p), v(1), . . . , v(q))

    Special cases:I Tensors of rank (0,0) are scalars.I Tensors of rank (1,0) are vectors.I Tensors of rank (0,1) are linear forms.I Tensors of rank (1,1) are linear maps.

  • Tensor representation and transformation

    Representation in a basis:

    T(α(1), . . . , α(p), v(1), . . . , v(q)) = Ti1,...,ip

    j1,...,jqα(1)i1 · · ·α

    (p)ip v

    j1(1) · · · v

    jq(q)

    whereT i1,...,ipj1,...,jq = T(e

    i1 , . . . , eip , ej1 , . . . , ejq )

    Behavior under basis transformations:

    T̃ k1,...,kpl1,...,lq = Ak1

    i1 · · ·Akp

    ipTi1,...,ip

    j1,...,jqBj1

    l1 · · ·Bjq

    lq

    Folklore: A tensor is something which transforms in each upperindex like a vector and in each lower index like a linear form.

  • Scalar productAn additional structure that is not part of the vector space axioms.

    A scalar product is a map g : V ⊗ V → R with the properties:

    I Bilinearity (= linear in both arguments)I Symmetry: g(u, v) = g(v,u).I Positive definite: g(v, v) ≥ 0 and g(v, v) = 0⇔ v = 0

    Representation of the scalar product:

    gij := g(ei , ej). g(u, v) = gijuiv j .

    The matrix gij isI realI symmetricI positive eigenvalues.

  • Scalar product – PropertiesTransformation behavior: g is a (0,2)-tensor, i.e., it is invariantunder basis transformations, while its representation changes as

    g̃kl = gijB i kBjl ,

    Induced norm:||u|| =

    √g(u,u)

    Induced distance measure:

    d(u, v) = ||u− v||

    Example: Euclidean metric:

    gij = g(ei , ej) = δij ⇒ g(u, v) = gijuiv j =d∑

    i=1uiv i

  • PseudometricA scalar product that is NOT positive definite

    I g(v, v) can be positive or negative.I g(v, v) can be zero for v 6= ~0.

    Simplest example in 2d:

    g(u, v) = u1v1−u2v2

    gij =(1−1

    )

    The set of vectors v for which g(v, v) = 0 is called light cone.

    The sorted signs of the eigenvalues of g is called the signature

  • Mapping vectors to linear forms and vice versaMusical isomorphism

    Flat: The scalar product can turn vectors into linear forms:

    [ : u ∈ V 7→ α ∈ V ∗ : α(v) := g(v,u)

    In components this means to lower the index: ui := αi = gij uj

    Induced scalar product on V ∗

    g∗(α, β) := g(u, v) g ij := g∗(ei , ej) ⇒ g ijgjl = δil

    Sharp: The induced scalar product turns forms back into vectors:

    ] : α ∈ V ∗ 7→ u ∈ V : u(β) = β(u) := g∗(α, β)

    In components this means to raise the index:uj = g jk uk = g jk αk

  • Why upper and lower indices?

    I It helps us to distinguish vectors and linear forms.I We can switch freely from contravariant to covariant indices

    and vice versa by raising or lowering the index:

    uj = g jk uk , ui = gij uj

    I The Einstein sum convention allows us to write the scalarproduct as

    g(u, v) = uivi = uiv i .

    There is no need to use upper and lower indices.It is just a matter of convenience.

  • Summary

    Object FormulaBasis of V {e1, e2, . . .}Dual basis of V ∗ {e1, e2, . . .} ei (ej ) = δijVector representation v = v ieiLinear form representation α = αjej

    Metric tensor gij = g(ei , ej ) gij g jk = δkiInverse metric tensor g jk = g∗(ej , ek )Lowering indices ui = gij uj Tij = gikgjl T kl

    Raising indices uj = g jkuk T ij = g ikg jl TklScalar product 〈u, v〉 = g(u, v) = gij ui v j = g ij ui vj = ui vj = ui v j

    Basis transformation ẽj = ekBk j , ei = ẽj Aj

    i , Bk

    j Aj

    i = δki

    Transformation of vectors ṽ j = Aj i vi , vk = Bk j ṽ j

    Transformation of forms α̃l = αkBk l , αk = α̃i Ai

    j

    Transformation of metric g̃kl = gij Bi kBj

    l , g̃kl = Ak i Al j g ij

  • WARNING

    What you should never do:

    Never raise or lower the index of a basis vector of form:

    �����

    ��XXXXXXXei = g ijej

    The reason is that ej ∈ V while ei ∈ V ∗.

    This does not make sense.

    It is like comparing apples with bananas.

  • Section 1.2

    Symmetries of Metric Spaces

  • Coordinate SystemsNot to be confused with vector components!

    Definition:

    I A coordinate system is a set of d functions

    x i : V → R : v 7→ x i (v)

    labeled by i = 1, . . . , d , where d is the dimension.

    I The coordinates describe the vector uniquely, that is, the setof coordinate functions has to be invertible.

    I The coordinate functions have to be smooth and differentiablebut they can be nonlinear.

  • Examples of Coordinate Systems

    Grid lines are curves where one of the coordinates is constant.

    In Cartesian curvilinear coordinate systems the coordinates areidentical with the components of the position vector:

    v(x1, . . . , xd ) = x iei ⇒ ei :=∂v∂x i

  • TransformationsActive and passive transformations

    I Active transformations move the object physically:

    I Passive transformations move the coordinate system:

    I In both cases the coordinate changes as x 7→ x̃ = x + aIn this sense active and passive transformations are equivalent.

  • Transformations of functions

    Active transformation Passive transformation

    In both cases:f (x) 7→ f̃ (x) = f (x − a)

    Applying an active transformation to a functionmeans to apply the inverse transformation to its

    argument.

  • Coordinate transformationsNot to be confused with basis transformations

    A coordinate transformation is a set of d functions

    x̃ i : {x1, . . . , xd} 7→ x̃ i (x1, . . . , xd )

    with i = 1, . . . , d . These maps may be nonlinear.

    Example:x : {r , φ} 7→ x(r , φ) = r cosφ

    y : {r , φ} 7→ y(r , φ) = r sinφ

    Special case: Linear coordinate transformations:

    x̃ i = Λi jx j + s i ⇒ Λi j =∂x̃ i∂x j

  • What is an Algebra?

    An algebra consists of

    I a set of symbols, say A,B,C ,I words formed from symbols, like ABABCA,I a space of linear combinations of words, such as 2AB + 7BA,I certain algebraic rules (commutation relations), restricting the

    size of the word space.

    Example: The Heisenberg algebra:

    Symbols: 1, x i , and ∂/∂x j

    Algebraic relations: ∂∂x j x

    i = δij

    The algebraic relations should hold in any coordinate system.

  • Transformation behavior of partial derivativesStarting point: The Heisenberg algebra holds in both coordinatesystems:

    ∂xk xi = δik ,

    ∂x̃k x̃i = δik .

    Conclusion: This requires derivatives to transform inverselycompared to the coordinates.

    Notation:

    ∂i :=∂

    ∂x i ∂i := ∂

    ∂xiThen we can write:

    ∂ix j = ∂̃i x̃ j = δji ∂i = g ij∂j

    The partial derivative with respect to a component with upperindex transforms like a component with lower index, and vice versa.

  • Lie groups

    Lie groups are continuous groups with infinitely many groupelements, parameterized by continuous parameters, which can beTaylor-expanded around the neutral element.

    Example: Group of translations in Rn :

    Ta : x 7→ x + a

    Example: Special orthogonal group SO(2)(x̃1x̃2

    )=(

    cosφ − sinφsinφ cosφ

    )(x1x2

    )

  • Example: Lie group of translations in 1dConsider an infinitesimaltranslation of a function bythe distance � to the right:

    f (x) 7→ f̃ (x) = f (x − �)

    This may be Taylor-expanded:

    f̃ (x) ≈(1− � ddx

    )f (x)

    Consider a finite translation:

    f̃ (x) = f (x − a) = limN→∞

    (1− aN

    ddx)N

    f (x) = exp(−a ddx

    )f (x)

    or in higher dimensions:

    f̃ (x) = e−a·∇f (x) = e−ai∂i f (x).

  • Examples: SO(2) and SO(3)

    I SO(2) has only one generatorwith the Lie algebra L2 = −1:

    L =(0 −11 0

    )⇒ Rφ = exp

    (φL)

    =(

    cosφ − sinφsinφ cosφ

    )

    I SO(3) has three generators with Lie algebra [Li , Lj ] = �ijkLk :

    L1 =

    0 0 00 0 −10 1 0

    , L2 = 0 0 10 0 0−1 0 0

    , L3 =0 −1 01 0 00 0 0

    R~φ = exp

    (φiLi

    )

  • Isometries

    I An isometry is a coordinate transformation that, wheninterpreted as an active transformation, preserves angles andlengths, i.e., a metric-preserving map.

    I In practice, an isometry leaves the components of the metrictensor invariant:

    gkl = g̃kl , gkl = g̃kl .

    I Proof: see Lecture Notes.

  • Section 2.1

    Lorentz Transformations

  • From Galilei to Lorentz InvarianceThe Galilei transformation reads:

    x→ x̃ = x− vtt → t̃ = t

    In matrix form: t̃x̃ỹz̃

    =

    1 0 0 0−vx 1 0 0−vy 0 1 0−vz 0 0 1

    txyz

    or (

    t̃x̃

    )=(

    1 0−v 1

    )(tx

    )

    I The transformation is linear.I The time t = t̃ is not modified.

  • Broken Galilei invariance in Electrodynamics

    Maxwell equations → Wave equation in vacuum:

    �B = 0 , �E = 0

    Here ’quabla’ is the d’Alembert operator:

    � := ∇2 − 1c2∂2

    ∂t2 with c =1

    √µ0ε0

    ≈ 3× 108 m/s

    � is not invariant under Galilei transformations

    �̃ = � +2∂t(v · ∇)− (v · ∇)2︸ ︷︷ ︸symmetry-breaking terms

  • Broken Galilei invariance in Electrodynamics

    The broken Galilei invariance comes as no surprise:I Velocities u = ẋ transform as

    u 7→ ũ = u− v

    where v is the relative velocity between the inertial systems.I Electrodynamics singles out a particular velocity, namely c.I Galilei invariance forbids to single out any velocity

    (since c 7→ c + v)

    Obvious solution:I There must be a medium, called ether.I The wave equation is only valid in the rest frame of the ether.

    But the Michelson-Morley shows that there is no ether.

  • Hendrik Antoon LorentzIf electrodynamics is not invariant under Galilei, is it invariant under something else?

    I Galilei transformation:(t̃x̃

    )=(

    1 0−v 1

    )(tx

    )

    I Try instead a general linear transformation:(t̃x̃

    )=(A BC D

    )(tx

    )

    I Find A,B,C ,D in such a way that � = �̃.

    Solution (see lecture notes):

    ⇒ A = D , C = Bc2 , AD − BC = 1.

  • The Lorentz transformationThe origin x̃ = 0 of S̃, when seen from S, moves with velocity

    v = xt = −CD .

    So we have four equations with four unknowns A,B,C ,D:

    A = D = 1√1− v2/c2

    , B = − vc2√1− v2/c2

    , C = − v√1− v2/c2

    Define the dimensionless Lorentz factor

    γ := 1√1− β2

    , β := vc

    Final form of the Lorentz transformation

    ⇒(t̃x̃

    )= γ

    (1 −v/c2−v 1

    )(tx

    )

  • The Lorentz transformation

    Use ct instead of t:(ct̃x̃

    )=(

    γ −βγ−βγ γ

    )(ctx

    );

    (ctx

    )=(

    γ +βγ+βγ γ

    )(ct̃x̃

    )

    4-vectors: x =

    ctxyz

    , A point in R1+3 is called event.

    Remark:The Lorentz transformation can also be derived in the traditionalway by analyzing a sequence of events (see lecture notes).

  • The Lorentz transformation as a Lie groupConsider infinitesimal Lorentz transformation with β � 1.Expand Lorentz transformation to linear order in β:(

    ctx

    )≈(

    1 +β+β 1

    )(ct̃x̃

    )=[1 + β

    (1

    1

    )︸ ︷︷ ︸

    =:λ

    ](ct̃x̃

    )

    ⇒ x = (1 + βλ) x̃ + O(β2)

    Make transformation finite using the exponential function:

    x = exp(θλ) x̃

    This leads to (exercise)(ctx

    )=(

    cosh θ sinh θsinh θ cosh θ

    )︸ ︷︷ ︸

    =Λ(θ)

    (ct̃x̃

    )

  • RapidityThe ’angle’ of the Lorentz transformation

    Compare the two representation:(ctx

    )=(

    γ +βγ+βγ γ

    )(ct̃x̃

    )(ctx

    )=(

    cosh θ sinh θsinh θ cosh θ

    )(ct̃x̃

    )One can show that the rapidity θ is given by

    θ = tanh−1(

    vc

    ).

  • Additivity of the Rapidityand the Relativistic Addition Theorem of Velocities

    Subsequent Lorentz transformations are additive in the rapidity(just in the same way as 2d rotations are additive in the angle)

    Λ(θ) = Λ(θ1)Λ(θ2) ⇒ θ = θ1 + θ2

    Proof: see lecture notes.

    Because of θ = tanh−1( v

    c)we can rewrite this as

    vc = tanh

    [tanh−1

    ( v1c)

    + tanh−1( v2

    c)]

    This boils down to the relativistic addition theorem for velocities:

    β = β1 + β21 + β1β2⇒ v = v1 + v21 + v1v2c2

    .

  • Lorentz boosts in 3+1 dimensions

    A Lorentz boost does not affect the degrees of freedomperpendicular to the boost direction.

    For example, a Lorentz boost in x direction does not modify y andz components:

    ctxyz

    =

    cosh θ sinh θsinh θ cosh θ

    11

    ︸ ︷︷ ︸

    =Λx (θ)

    ct̃x̃ỹz̃

  • Section 2.2

    Minkowski Spacetime

  • Minkowski Space R1+3

    Position vectors have components with upper indicesctxyz

    =x0x1x2x3

    using Greek indices running from 0 to 3:

    x = xµeµ = cte0 + xe1 + ye2 + ze3.

    Covectors have components with lower indices, for example thegradient:

    ∇ = eν∂ν(∂0, ∂1, ∂2, ∂3

    )=(

    1c∂∂t ,

    ∂∂x ,

    ∂∂y ,

    ∂∂z

    )

  • Scalar product: The Minkowski metricThe metric for zero gravity

    The Minkowski metric is so special that we use ηµν instead of gµν :

    ηµν =

    −1

    +1+1

    +1

    This metric is self-inverse:

    ηµν = ηµν

    Raising and lowering indices in Special Relativityamounts simply to changing the sign in the zeroth

    component.

  • Light cone

    I xµxµ > 0: Spacelike distance: The events cannot be causally connected.I xµxµ = 0: Lightlike distance: The events can be connected by a light ray.I xµxµ < 0: Timelike distance: The events can be connected causally.

  • Wave equation in Minkowski spaceWave equation

    �Φ = 0 ⇒ ∂µ∂µΦ = (∇2 − 1c2∂2t )Φ = 0

    Plane-wave ansatz:

    Φ(x) = eik·x = eikµxµ where k =(ω/c~k

    )

    Inserting the ansatz into the wave equation:

    �Φ = −kµkµ︸ ︷︷ ︸=0

    Φ

    Dispersion relation of light:

    kµkµ = 0 ⇒ ω = ±kc

  • Adding mass: The Klein-Gordon equation

    Klein-Gordon equation:(�−M2

    )ψ = 0

    Insert plane wave:Ψ(x) = eik·x

    ⇒ The wave vector k has to lie on the mass shell:

    kµkµ = −M2 ,

    Dispersion relation:ω

    c = ±√M2 − k2.

  • Mass shell

  • Isometries of the Minkowski spaceThe isometries are transformations between intertial systems

    Recall: An isometry is a coordinate transformation x 7→ x̃ (activeor passive) which leaves the metric tensor invariant:

    gµν = g̃µν , gµν = g̃µν

    In general the transformation is nonlinear, the metric tensor isposition-dependent (like in polar coordinates), and thetransformation matrix reads:

    Λρµ(x) =∂x̃ρ∂xµ = ∂µx̃

    ρ

    The isometry condition then reads:

    g̃µν(x) = gρτ (x) Λρµ(x) Λτν(x) , g̃µν(x) = Λµρ(x) Λντ (x) gρτ (x)

    In Special Relativity (zero gravity), where gµν = ηµν , theisometries are elements of the Poincaré group.

  • Zero gravity isometries: The Poincaré group and itssubgroups

  • Exercise: The Lie group SO(3)

    Consider the isometry condition for the Euclidean metric gij = δij :

    1 = Λ1ΛT ⇒ ΛΛT = 1.

    I These are the orthogonal transformations O(3).I Adding the condition det(Λ) = 1 we get the SO(3)

    Consider infinitesimal SO(3) transformations:

    Λ = 1 + �λ ⇒ λ+ λT = 0

    I The generators λ have to be antisymmetric.

  • Exercise: The Lie group SO(3)

    Any antisymmetric 3×3 matrix can be written as a linearcombination of the three generators:

    λ(12) =

    0 1 0−1 0 00 0 0

    , λ(13) = 0 0 10 0 0−1 0 0

    , λ(23) =0 0 00 0 10 −1 0

    For example the generator λ(12) generates a rotation in thex1-x2-plane around the z-axis :

    exp(ϕλ(12)

    )=

    cosϕ sinϕ− sinϕ cosϕ1

    = R(12)(ϕ)Similarly λ(13) and λ(23) generate rotations around the x , y axes.

  • SO(3) Angular momentum algebra

    λ(12) =

    0 1 0−1 0 00 0 0

    , λ(13) = 0 0 10 0 0−1 0 0

    , λ(23) =0 0 00 0 10 −1 0

    In quantum physics, generators are renamed and multiplied by −i :

    L1 =

    0 0 00 0 −i0 i 0

    , L2 = 0 0 i0 0 0−i 0 0

    , L3 = 0 −i 0i 0 0

    0 0 0

    These three generators form the angular momentum operator ~Lobeying the angular momentum algebra

    [Li , Lj ] = i�ijkLk .

    Finite rotations:R(~φ) = ei~L·~φ.

    ~φ is the vector defining the rotational axis.

  • The SO+(3, 1) and its Lie algebracombining rotations and Lorentz boosts

    Definition of the generators:

    [λ(αβ)]µν = δµαηβν − δµβηαν

    Commutation relations:

    [λ(αβ), λ(γδ)] = −(ηαγλ(βδ) − ηαδλ(βγ) + ηβδλ(αγ) − ηβγλ(αδ)

    )Matrix representation:

    λ(10) =

    11 λ(20) = 1

    1

    λ(30) = 1

    1

    λ(12) =

    1−1

    λ(13) = 1

    −1

    λ(23) =

    1−1

  • The SO+(3, 1) and its Lie algebra

    I Rotations:

    exp(φλ(12)

    )=

    1

    cosφ sinφ− sinφ cosφ

    1

    I Lorentz boosts:

    exp(θλ(01)

    )=

    cosh θ sinh θsinh θ cosh θ

    11

  • Rotationsmove things on circles

  • Lorentz boostsmove things on hyperbolas

  • Lorentz boosts

    I Straight lines remain straight.I The light cone is mapped onto itself.

  • Redefining the presenceThere is no notion of simultaneity any more

    I The present cannotchange my past.

    I I cannot change thepresent in the future.

  • Section 3.1

    Relativistic Mechanics – Equations of Motion

  • Mass points and world lines

    In Relativity, we have the same notion of pointlike particles as inNewtonian Mechanics, but

    I In Minkowski space, even if the particle is at rest, it proceedsin time, hence the mass point moves on a line, the so-calledworld line.

    I The effective mass of the particle depends on its velocity anddiverges as the particle velocity approaches the speed of light.

    I Therefore, m always denotes the rest mass of the particle.

  • Worldlines are timelike

    Events along a worldline have tobe causally connected.

    ⇒ Tangent vector has to be timelike.

    ⇒ Worldlines are confined to lightcone.

  • Worldline parameterizationA curve can be parameterized freely (gauge)

    From the mathematical point of view, a worldline is aparameterized curve of the form

    C : [a, b]→ R3+1 : λ 7→ x(λ) =

    x0(λ)x1(λ)x2(λ)x3(λ)

    constrained by the condition that the tangent vector is timelike:

    dxµ(λ)dλ

    dxµ(λ)dλ < 0

  • ReparameterizationChanging the gauge

    Two parameterizations C(λ) and C̃(τ) describe the same curveif there is a function f : R→ R such that

    τ = f (λ) , C(λ) = C̃(f (λ)).

    The corresponding tangent vectors differ by a factor of df / dλ:

    dxµ(λ)dλ =

    dx̃µ(τ)dτ

    df (λ)dλ .

    ⇒ The tangent vectors differ in their squared lengths:( dxµ(λ)dλ

    dxµ(λ)dλ

    )=( dx̃µ(τ)

    dτdx̃µ(τ)

    dτ)( df (λ)

    dλ)2.

  • Eigenzeit (proper time) parameterizationThe eigenzeit is the time the particle experiences by itself

    I We can always choose a special parameterization such that

    dxµ(τ)dτ

    dxµ(τ)dτ = −c

    2.

    τ is called proper time (eigenzeit).I The proper time τ of an object is defined as the time

    displayed by a clock attached to the object.I τ is a scalar since it is invariant under basis transformations.

    I Remark: Choosing the eigenzeit parameterization is like“fixing a gauge”.

  • Relativistic 4-velocityEven when sleeping, we move in time...

    The relativistic velocity or 4-velocity of a mass point is defined by

    u := ẋ = dxdτ

    • τ is a scalar, x is a 4-vector, so ẋ has to be a 4-vector as well.

    The squared norm of the velocity always equals the speed of light:

    u2 = uµuµ = −c2

    If we apply a Lorentz transformation we get the components

    uµ =(γcγ~v

    )= 1√

    1− v2c2

    cvxvyvz

  • Relativistic 4-momentumequals rest mass times relativistic velocity

    The relativistic 4-momentum of a free particle is defined by

    p := mu = mẋ

    Since m is Lorentz-invariant, p transforms like a 4-vector.

    I Components:

    pµ =(E/c~p

    )where E = γmc2 , ~p = γm~v

    I Squared norm:

    pµpµ = m2uµuµ = −m2c2

  • E = mc2

    The squared-norm condition

    pµpµ = −m2c2

    translates into the energy-momentum relation

    E 2 = m2c4 + ~p2c2.

    Resting particle:E = ±mc2

    Taylor expansion:

    E = ±(mc2 + p

    2

    2m −p4

    8m3c2 +O(p6/c4)

    )

  • 4-accelerationVery simple definition:

    a := u̇ = ẍ = d2x

    dτ2

    Components (see exercise)

    a =(

    γγ̇cγ2~a + γγ̇~v

    ).

    Remarks:I In an instantaneously co-moving inertial frame we have

    a =(0~a

    )

    I The 4-acceleration is the curvature vector of a worldline.

  • Principle of Least ActionRecall lecture on Classical Mechanics

    Action of a particle in a potential on trajectory q(t):

    S[q] =∫ t2

    t1L(q, q̇) dt where L = T − V = 12mq̇

    2 − V (q)

    Realized trajectory in Nature ⇔ Minimal action.

    0 = δS =∫ t2

    t1

    (∂L(q, q̇)∂q δq +

    ∂L(q, q̇)∂q̇ δq̇

    )Here δq and δq̇ are not independent, perform partial integration:

    0 = δS =∫ t2

    t1

    (∂L(q, q̇)∂q −

    ddt∂L(q, q̇)∂q̇

    )δq ⇒ ∂L

    ∂q−ddt∂L∂q̇ = 0.

  • Relativistic action of a free point particleThe action is simply the length of the worldline

    Heuristic Principle of Simplicity:

    Nature always selects the simplest solution compatible withthe required symmetries.

    The relativistic action should be invariant under frame changes:

    Simplest solution:The action is defined as the relativistic length of the world line:

    S = −mc∫

    cds

    where ds =√−dxµ dxµ .

  • !! Never fix the gauge before carrying out the variation !!Proposed action of a free particle:

    S = −mc∫

    cds

    We know that ds = c dτ , so the action issimply the elapsed eigenzeit:

    S = −mc2∫ τBτA

    dτ = −mc2(τB − τA).

    This is wrong!

    I The ending points (ctA, xA) and (ctB, xB), but the eigenzeitdifference varies for each of the trajectories over which wewant to minimize.

    I We won’t get meaningful Lagrange equations of motion.

  • Solution:

    For each of the trajectories, choose a pa-rameterization such that the parameters atthe ending points coincide, say λA and λB:

    S = −mc∫

    cds = −mc

    ∫ λBλA

    dsdλ dλ

    =∫ λBλA−mc

    √− dx

    µ

    dλdxµdλ︸ ︷︷ ︸

    =L(x,ẋ)

    This gives us directly the Lagrange function for a free particle

    L(x, ẋ) = −mc√−ηµν ẋµẋν

  • Derivation of the equations of motionThe variation of the action is given by

    δS =∫ λBλA

    (∂L(x, ẋ)∂xµ δx

    µ + ∂L(x, ẋ)∂ẋµ δẋ

    µ)

    Principle of least action ⇒ δS = 0.δxµ and δẋµ are not independent ⇒ Partial integration:

    0 = δS =[∂L(x, ẋ)

    ∂ẋµ δxµ]λBλA︸ ︷︷ ︸

    =0

    +∫ λBλA

    (∂L(x, ẋ)∂xµ −

    ddλ

    ∂L(x, ẋ)∂ẋµ

    )︸ ︷︷ ︸

    =0

    δxµ dλ

    Variation δxµ is arbitrary, hence we obtain the relativisticEuler-Lagrange equations

    ∂L(x, ẋ)∂xµ −

    ddλ

    ∂L(x, ẋ)∂ẋµ = 0 .

  • Canonically conjugated momentumis defined as the derivative of the Lagrange function

    Define relativistic canoncially conjugated momentum:

    pµ = ∂L∂ẋµ

    , pµ =∂L∂ẋµ

    This allows us to rewrite the EOM’s as

    ṗµ = ∂L∂xµ

    Free particle:

    pρ = −mc∂√−ηµν ẋµẋν∂ẋρ =

    mc ẋρ√−ηµν ẋµẋν

  • Fixing the gaugeThe momentum does not depend on the parameterization

    Consider reparameterization:

    λ 7→ λ̃ = f (λ) , ddλ 7→d

    dλ̃= 1f ′(λ)

    ddλ .

    The Lagrange function is not invariant under reparameterization:

    L 7→ L̃ = −mc√−ηµν ˜̇xµ ˜̇xν =

    −mc√−ηµν ẋµẋν|f ′(λ)| .

    But the conjugated momentum is invariant (f ′(λ) cancels out):

    pρ = ∂L∂ẋρ

    = mc ẋρ√

    −ηµν ẋµẋν

    At this point we can choose eigenzeit parameterization!

    p = mu .

  • Relativistic 4-forceNatural definition:

    f = ddτ p = ṗ = ma .

    Components:

    f =

    γc dEdtγ~F

    Something is strange here:

    p(τ = 0) =

    mc000

    , f =a000

    ⇒ p(τ) =mc + aτ

    000

    ⇒ The object is resting, but gaining weight!

  • “Non-weight-gaining” forceshave to be “perpendicular” on the world line.

    Mass conservation:

    ||p||2 = pµpµ = −m2c2 ⇒ddt pµp

    µ = 2ṗµpµ = 0

    This leads to:f · u = fµuµ = 0

    Such a mass-conserving force is of the form

    f =(γc~F · ~vγ~F

    ).

  • Charged particle in a given electromagnetic fieldThe electromagnetic 4-potential

    Recall: The electromagnetic fields ~E and ~B can be derived from ascalar potential φ and a vector potential ~A as follows:

    ~E (~x , t) = −∇φ(~x , t)− ∂~A(~x , t)∂t ,

    ~B(~x , t) = ∇× ~A(~x , t) .

    In Special Relativity, we combine the potentials in a 4-potential A:

    Aµ(x) =

    1cφ(~x , t)~A(~x , t)

    The SI unit is:

    [A] = Vsm

  • Charged particle in a given electromagnetic fieldThe Action of a charged particle in an electromagnetic field

    Heuristic principle of simplicity:Add a term to the Lagrange function L that is as simple as possible:

    Lfree → Lem = Lfree + e A(x) · ẋ︸ ︷︷ ︸scalar

    I The extra term has to be translation-invariant and scalar.I e is a coupling constant.

    I L has the dimension of an energy: [L] = J = VAs.I e has the dimension of an electric charge:

    [e] = [L][ẋ][A] = As = C

  • Charged particle in a given electromagnetic fieldThe Action of a charged particle in an electromagnetic field

    L(x, ẋ) = −mc√−ηµν ẋµẋν︸ ︷︷ ︸

    free particle

    + eAν ẋν︸ ︷︷ ︸electromagnetic

    Put this into the equations of motion:

    ∂L(x, ẋ)∂xµ −

    ddτ

    ∂L(x, ẋ)∂ẋµ = 0

    I First term:∂L(x, ẋ)∂xµ = e

    ∂Aν∂xµ ẋ

    ν = e(∂µAν)ẋν

    I Second term:ddτ

    ∂L(x, ẋ)∂ẋµ =

    ddτ(mẋµ + eAµ(x(τ))

    )= mẍµ + e(∂νAµ)ẋν

  • Charged particle in a given electromagnetic fieldThe field tensor

    ⇒ Equations of motion:

    mẍµ = e (∂µAν − ∂νAµ) ẋν

    We define the electromagnetic tensor:

    Fµν = ∂µAν − ∂νAµ

    Equations of motion:

    mẍµ = e Fµν ẋν

  • Tensor of the electromagnetic fieldDetermine the matrix elements

    1. Take equations of motion

    mẍµ = e Fµν ẋν

    2. Consider non-relativistic limit v � c, where ddτ ≈ddt :

    mẍi = e(Fi0c + Fijv j

    )3. Compare this expression with formula for Lorentz force:

    m~a = e(~E + ~v × ~B

    )4. ⇒ Determine components of the antisymmetric tensor Fµν .

  • Tensor of the electromagnetic field

    Result:

    With lower indices:

    Fµν =

    0 −Ex/c −Ey/c −Ez/c

    Ex/c 0 Bz −ByEy/c −Bz 0 BxEz/c By −Bx 0

    With upper indices:

    Fµν =

    0 Ex/c Ey/c Ez/c

    −Ex/c 0 Bz −By−Ey/c −Bz 0 Bx−Ez/c By −Bx 0

    Remark: You may find different conventions in various textbooks.

  • Electromagnetic Gauge Invariance

    A gauge is an non-physical redundancy in ourdescription of physical reality.

    Claim:Electrodynamics is gauge-invariant under gauge transformations

    Aµ → õ = Aµ + ∂µf .

    where f (x) is an arbitrary scalar function.

    Consistency check:The equations of motion mẍµ = e Fµν ẋν are in fact invariant:

    Fµν → F̃µν = ∂µAν + ∂µ∂ν f − ∂νAµ − ∂ν∂µf = Fµν

  • Electromagnetic Gauge Invarianceworks also nicely on the level of the Lagrangian:

    Every Lagrangian has a redundancy by adding

    L(x, ẋ) → L̃(x, ẋ) = L(x, ẋ)+ dfdλ = L(x, ẋ)+(∂µf )ẋµ

    because

    S → S̃ =∫ λBλA

    (L(x(λ), ẋ(λ)) + dfdλ

    )= S + f (λB)− f (λA)︸ ︷︷ ︸

    const

    .

    ⇒ δS = δS̃

    Compare this with:

    L(x, ẋ) = −mc√−ηµν ẋµẋν + eAµẋµ

    The redundancy is equivalent to replacing Aµ → Aµ + 1e∂µf .

  • Section 3.2

    Noether Theorem

  • Noether TheoremRecall nonrelativistic Noether Theorem

    I Let q(t) be a solution of the Lagrange equations.I Let qs(t) be a continuous transformation of the solution with

    no change at s = 0.

    I This transformation is called symmetry transformation if allqs(t) minimize the action (to first order in s).

    ⇔ ∂∂s L(qs , q̇s)

    ∣∣∣s=0

    = dΛdt

    Claim: For a symmetry transformation, the Noether charge

    Q(t) = p∂qs(t)∂s

    ∣∣∣s=0− Λ(t)

    is conserved in time.

  • Relativistic Noether Theorem

    I Let x(τ) be a solution of the Lagrange equations.I Let xs(τ) be a continuous transformation of the solution with

    no change at s = 0.

    I This transformation is called symmetry transformation if allxs(τ) minimize the action (to first order in s).

    ⇔ ∂∂s L

    (xs(τ), ẋs(τ)

    )∣∣∣s=0

    = dΛ(τ)dτ

    Claim: For a symmetry transformation, the Noether charge

    Q(τ) = pµ(τ)∂xµs (τ)∂s

    ∣∣∣s=0− Λ(τ)

    is conserved along the worldline of the particle.

  • Applications of the Noether TheoremTranslations

    Move everything by a constant 4-vector f:

    x → xs = x + sf ⇒ xµ → xµs = xµ + sf µ

    I For a free particle the Lagrangian depends only on ẋ.I The Lagrangian does not depend on s, hence Λ = 0.

    Now apply Noether theorem:I

    Q = ∂L(xs , ẋs)∂ẋµs

    ∂xµs∂s

    ∣∣∣s=0

    = pµf µ.

    I For a translation in direction f = eν :

    Qν = pµ(eν)µ = pν

  • Applications of the Noether TheoremRotations / Lorentz transformations

    Consider a rotation / Lorentz-transformation in the (α, β)-plane:

    x → xs = (1 + sλ(αβ))x

    where s is the angle / rapidity.I The Lagrangian is scalar, i.e., invariant under SO+(1, 3).I Λ = 0.

    xµ → xµs =(δµν + s [λ(αβ)]µν

    )xν

    ẋµ → ẋµs =(δµν + s [λ(αβ)]µν

    )ẋν

    ⇒ ∂sL =∂L∂xµ

    ∂xµs∂s +

    ∂L∂ẋµ

    ∂ẋµs∂s =

    ( ∂L∂xµ x

    ν + ∂L∂ẋµ ẋ

    ν)

    [λ(αβ)]µν = 0

    ⇒ Qαβ = pαxβ − pβxα

  • Section 4.1

    Classical Scalar Field

  • Concept of classical fields

    In Special Relativity, a classical field lives in space-time, e.g.

    φ : R3+1 → R : x 7→ φ(x)

    The field is the solution of a field equation, e.g., the Klein-Gordon equ.:(�−M2

    )φ = 0

  • Lagrange formalism for Classical Fields

    I Principle of Least Action: The solution of the fieldequations minimizes the action S[φ].

    I The action is a functional, mapping a function onto R.

    I The field lives not only on a line, but in the whole space-time.I The action can be written as an integral over a Lagrange

    density:S =

    ∫d4x L

    I The Lagrange density should only depend on the field and itsfour partial derivatives

    S[φ] =∫

    d4x L(φ(x),∇φ(x)

    )

  • Variational calculus for fields

    Consider an infinitesimal variation of the field

    φ(x) → φ(x) + χ(x)

    with |χ(x)| � |φ(x)| which vanishes on ∂Ω.

    S[φ]→ S[φ+ χ] =∫

    Ωd4x L

    (φ(x) + χ(x), ∇φ(x) +∇χ(x)

    )=∫

    Ωd4x

    [L(φ,∇φ) + ∂L(φ,∇φ)

    ∂φχ+ ∂L(φ, ∇φ)

    ∂(∂µφ)∂µχ

    ]δS = S[φ+ χ]− S[φ] ≈

    ∫Ω

    d4x[∂L(φ,∇φ)

    ∂φχ+ ∂L(φ,∇φ)

    ∂(∂µφ)∂µχ

    ]How can we integrate by parts?

  • Variational calculus for fields

    Recall Gauss divergence theorem in R3:∫V~∇ · ~F dV =

    ∮∂V

    ~F · ~n

    A similar theorem holds in Minkowski spaceR1+3 for any differentiable 4-vector field F:∫

    Ω∂µFµ d4x =

    ∮∂Ω

    Fµ dnµ

  • Variational calculus for fieldsPartial integration

    Gauss divergence theorem in R1+3.∫Ω∂µFµ d4x =

    ∮∂Ω

    Fµ dnµ

    Now replaceFµ := ∂L

    ∂(∂µφ)χ

    and apply product rule:

    ∫Ω

    ([∂µ

    ∂L∂(∂µφ)

    ]χ+ ∂L

    ∂(∂µφ)[∂µχ

    ])=∮∂Ω

    ∂L∂(∂µφ)

    χ dnµ︸ ︷︷ ︸zero since χ=0 on ∂Ω

    This is like partial integration in the variational integral.

  • Variational calculus for fieldsField equations

    δS ≈∫

    Ωd4x

    [∂L(φ,∇φ)∂φ

    χ+ ∂L(φ,∇φ)∂(∂µφ)

    ∂µχ]

    Partial integration:

    δS ≈∫

    Ωd4x

    [∂L(φ,∇φ)∂φ

    − ∂µ∂L(φ,∇φ)∂(∂µφ)

    ]︸ ︷︷ ︸

    =0

    χ

    Field equations:

    ⇒ ∂L(φ,∇φ)∂φ

    − ∂µ∂L(φ,∇φ)∂(∂µφ)

    = 0

  • Action for the wave equationWhich action S gives us the wave equation

    �φ = 0 ?

    We have no potential, therefore:

    �φ = ∂µ∂µφ ∝ ∂µ∂L(∇φ)∂(∂µφ)

    ⇒ ∂µφ ∝ ∂L∂(∂µφ)

    The simplest action would be:

    L(∇φ) = −12(∂νφ)(∂νφ).

    Note that

    L = 12c2 (∂tφ)2︸ ︷︷ ︸

    Ekin

    − 12(~∇φ)2︸ ︷︷ ︸Epot

    = Ekin − Epot

  • Action for the Klein Gordon equationNow add another ’potential’ V to the action:

    L(φ,∇φ) = −12(∂νφ)(∂νφ)−V (φ).

    Explicitely:

    L(φ,∇φ) = 12c2 (∂tφ)2︸ ︷︷ ︸

    =Ekin

    − 12(~∇φ)2−V (φ)︸ ︷︷ ︸

    =Epot

    ,

    ⇒ Equations of motion:∂L∂φ− ∂µ

    ∂L∂(∂µφ)

    = −V ′(φ) +�φ = 0.

    Choose potential of the form V (φ) = 12M2φ2:

    L(φ,∇φ) = −12(∂νφ)(∂νφ)− 12M

    2φ2.

    (�−M2

    )φ = 0

  • Scalar field as a “wave machine”

    (�−M2

    )φ =

    (∇2 − 1c2

    ∂2

    ∂t2 −m2c2~2

    )φ = 0

  • Vector fieldsI Vector fields are vector-valued fields, for example Φ(x) ∈ RN .I Such vector fields have N components Φa(x) labelled by Latin

    indices a = 1, . . . ,N.I Usually they come with a symmetry among the components.

    Example:

    I Free O(N) vector model:

    L(Φ,∇Φ) = −12

    N∑a=1

    (∂νΦa)(∂νΦa)−12M

    2N∑

    a=1

    ΦaΦa

    I Interacting O(N) vector model with Φ4-coupling:

    L(Φ,∇Φ) = −12

    N∑a=1

    (∂νΦa)(∂νΦa)−12M

    2N∑

    a=1

    ΦaΦa−124λ

    N∑a,b=1

    ΦaΦaΦbΦb

  • Noether theorem for fieldsA coordinate transformation can be interpreted as a changing field

    x→ x̃ = x− �f(x)⇒ xµ → x̃µ = xµ − �f µ(x)

    φ(x) = φ̃(x− �f(x)

    )' φ̃(x)− �(∂µφ̃)f µ(x)︸ ︷︷ ︸

    =: �∆φ(x)

    In short:

    φ̃(x) ' φ(x) + �∆φ(x).

  • Noether theorem for fields

    φ̃(x) ' φ(x) + �∆φ(x).

    This transformation is called a symmetry transformation if theLagrangian changes only by a gauge transformation

    δL ' �∂µΛµ

    If we manage to determine Λ, the Noether current is given by

    jµ := πµ∆φ − Λµ

    where πµ = ∂L∂(∂µφ) is the conjugate field momentum.

    This current is conserved via continuity equation

    ∂µjµ(x) = 0.

  • Noether theorem for fieldsGeneralization

    Noether theorem for several fields ψ1, ψ2, . . . , ψN :

    δψi (x) = �∆ψi (x)

    jµ(x) =( N∑

    i=1πµψi (x)∆ψi (x)

    )− Λµ(x) with πµψi (x) =

    ∂L∂(∂µψi )

    Noether theorem for vector fields A(x) with components Aµ(x):

    δAµ(x) = �∆ µA (x)

    jµ(x) = πµν (x)∆ νA (x)− Λµ(x) with πµν (x) =∂L

    ∂(∂µAν)

    In both cases:∂µjµ(x) = 0

  • Noether theorem for fieldsTranslation

    Move coordinates (space & time) infinitesimallyby a constant vector �a:

    x̃ = x + �a

    Equivalently, the fields change as

    δφ(x) = �∆φ(x) = �(∂µφ(x)

    )aµ.

    One can show that in this case

    Λ = La

    The corresponding Noether current is

    jµ := πµ∆φ − Λµ = πµ(∂ρφ)aρ − Laµ

    and we have as usual the continuity equation

    ∂µjµ(x) = 0

  • The energy momentum tensorof a field is the Noether current related to translation

    Choose translation in the direction of the basis vector a := eν :

    aρ = δρν

    Then the Noether current carries two indices:

    jµν = πµ(∂ρφ)δρν − Lδµν = πµ∂νφ− δµνL .

    We call it the energy-momentum tensor:

    T µν = πµ∂νφ− δµνL

    which obeys the continuity equation

    ∂µT µν = 0.

  • Energy momentum tensor

    T µν = πµ∂νφ− δµνL ∂µT µν = 0.

    Names used in the literature:I Stress-energy tensorI Stress-energy-momentum tensorI Energy-momentum tensor

    Why do we need a tensor to describe energy and momentum?Why not a simple (position-dependent) 4-vector?

    Answer:A field is not a single particle, it is a soup of many particles.

  • Energy momentum tensor - InterpretationExercise: Consider cloud of dust

    I Density of particles ρ(~x , t)I Local average velocity ~v(~x , t)I ⇒ Particle current

    ~j(~x , t) = ρ(~x , t)~v(~x , t)

    Now consider test volume V , containing mass M.Apply Gauss divergence theorem:

    dMdt =

    ddt

    ∫V

    dV ρ(~x) = −∮∂V

    d~n ·~j = −∫

    VdV ~∇ ·~j(~x).

    Compare integrands ⇒ 3D Continuity equation:

    ddt ρ(

    ~x) = −~∇ ·~j(~x).

  • Energy momentum tensor - InterpretationDefine 4-current for dust:

    ddt ρ(

    ~x) = −~∇ ·~j(~x).

    I Define 4-current

    jµ(x) =(cρ(~x , t)~j(~x , t)

    ).

    I Rewrite continuity equation

    ∂µjµ = 0.

  • Energy momentum tensor - InterpretationInterpretation of the 4-current

    jµ(x) =(cρ(~x , t)~j(~x , t)

    ).

    I Zeroth component:The differential mass dM contained in infinitesimal volumedV is

    dM = 1c j0 dx dy dz

    I First component:The differential mass dM passing the area dy dz in time dt is

    dM = j1 dy dz dt

    The content of an infinitesimal 3-volume dA dt canbe interpreted as the flux of this content through

    the surface dA within the time span dt.

  • Energy momentum tensor - InterpretationNormal vectors in momentum space

    I A timelike vector dn ∝ e0 is normal on dx dy dz .I A spacelike vector dn ∝ e1 is normal on dy dz dt.I A spacelike vector dn ∝ e2 is normal on dx dz dt.I A spacelike vector dn ∝ e3 is normal on dx dy dt.

    dM = j · dn = jµ dnµ

  • Energy momentum tensor - InterpretationThe matrix elements

  • Section 5.1

    Electrodynamics - Geometric Content

  • ElectrodynamicsMaxwell equations

    Name Integral equation Differential equation

    Gauß law for ~E∮∂V

    ~E · d~A = 1�0

    ∫Vρ dV ~∇ · ~E = 1

    �0ρ

    Gauß law for ~B∮∂V

    ~B · d~A = 0 ~∇ · ~B = 0

    Faraday’s law∮∂A

    ~E · d~l − ddt

    ∫A

    ~B · d~A = 0 ~∇× ~E + ∂~B∂t = 0

    Ampère’s law∮∂A

    ~B · d~l = µ0∫

    A

    ~J · d~A ~∇× ~B − µ0�0∂~E∂t = µ0

    ~J

    +µ0�0ddt

    ∫A

    ~E · d~A

  • Electrodynamics is nothing but a ring

    A ring is attached to each point in space-time.

  • Electrodynamics is nothing but a ring

    The ring is not attached in a single point.Rather each point of space-time is associated with a ring.

  • Electrodynamics is nothing but a ring

    Think of yourself as having a position on the ring.

  • Electrodynamics is nothing but a ring

    For characterizing the position, we have to define coordinates.

  • Electrodynamics is nothing but a ringThe Circe Group U(1)

    I The circle group U(1) is the

    group of translations on a circle.

    I The U(1) is an Abelian Lie group.

    I Representation:• Angular: ϕ→ ϕ̃ = (ϕ+ ∆ϕ)mod2π

    • Complex: z → z̃ = ze i∆ϕ

  • Electrodynamics is U(1)

    If space-time was one-dimensional,the U(1) rings would form a tube.

  • Electrodynamics is U(1)

    Living on the tube, you cannot say where you are,but there is a clear notion of “moving straight”.

  • Electrodynamics is U(1)

    We can ’twist’ the tube physically.Electromagnetism is the story of tube twists.

  • Electrodynamics is U(1)

    IMORTANT:If the tube is twisted by force, the trajectories of

    “moving straight” would follow the twist.

  • Electrodynamics is U(1)

    In terms of the coordinate ϕ, one cannot decidewhether the tube is physically twisted or whether the

    origins φ = 0 are just chosen in a fancy way.

  • Local gauge transformations

    A gauge transformation is nothing butthe change of the local coordinate sys-tems on the U(1) circles:

    ϕ(x) 7→ ϕ̃(x) = ϕ(x) + θ(x)

    Equivalently with z = eiϕ:

    z(x) 7→ z̃(x) = z(x)eiθ(x)

  • Dreaming of a unified electrodynamics......would be to include ϕ as a fifth dimension!

    Consider φ as an additional dimension:

    ~x =

    ctxyzϕ

    , gµν =−1 ∗

    +1 ∗+1 ∗

    +1 ∗∗ ∗ ∗ ∗ ∗

    This is known as Kaluza-Klein theory.

    I Very cool, but difficult.I Requires knowledge of differential geometry.

    Most people consider the electromagnetic field as a separate thing.

  • How electrodynamics is usually done...Slice 5d space into a 4+1-dimensional fiber bundle

    I We have to install a map that relates the fibers.I We pay a price: The map will be gauge-dependent.

  • How electrodynamics is usually done...

    I Choose a gauge (coordinate system) in the U(1) circles.I Choose a position x in space-time.I Walk “straight” in direction eµ and see at which rate ϕ(x)

    changes. Call this quantitity Aµ(x) (up to a proportionalityfactor).

    ⇒ dϕ ∝ A(x) · dx = Aµ(x) dxµ

    The proportionality constant turns out to be the electric charge:

    dϕ = e~A(x) · dx = e

    ~Aµ(x) dxµ

    The Aµ(x) are the connection coefficients which tell us how to “go straight”. By construction, they depend on the chosen gauge.

  • Behavior of A(x) under gauge transformationsConnection of two circles at infinitesimal distance dx:

    dϕ = e~A(x) · dx

    Gauge transformation:ϕ(x) 7→ ϕ̃(x) = ϕ(x) + θ(x)

    Transformation behavior of the infinitesimal angle difference:

    dϕ(x) 7→ dϕ̃(x) = dϕ(x) +∇θ(x) · dx

    Gauge transformation of the connection:e~A(x) · dx 7→ e

    ~Ã(x) · dx = e

    ~A(x) · dx +∇θ(x) · dx

    ⇒ A 7→ Ã = A + ~e∇θ.

  • Electromagnetism - Gauge transformations

    A 7→ Ã = A + ~e∇θ.

    In coordinates:

    Aµ(x) → õ(x) = Aµ(x) +~e ∂µθ(x)

    Usually, the factor ~e is suppressed by setting θ(x) =e~ f (x):

    Aµ 7→ Aµ + ∂µf

    In a gauge transformation, the electromagnetic 4-vectorpotential A changes by a gradient of a scalar function.

  • Detection of twists / curvature

    Intrinsic curvature:

    In two dimesions, curvaturemanifests itself in the factthat the angular sum in atriangle is larger than π.

    Idea: Study mismatcheson closed contours

  • Detection of twists in U(1) tubes

    Walking on a closed contour means to walk on a torus.

  • Detection of twists in U(1) tubesFind mismatch along a closed tube

    Generally, a twist leads to a mismatch if we “go straight”all the way along a closed contour.

  • Compute the twist along a closed contour

    I For simplicity set

    e/~ = 1.

    I Compute the fourcontributions via∫

    dϕ =∫Aµ dxµ.

    I Add them up.

  • Quantitative computation of the twist

    Consider connection A with components Ax ,Ay .

    For simplicity let coordinate origin be in the center.

    Compute contribution on A→ B:

    ∆ϕA→B =∫ +�−�

    Ax (x ,−�) dx

    Insert Taylor expansion:

    A(x , y) = A(0, 0) + x∂xA(0, 0) + y∂yA(0, 0) +O(x2, y2)

    ⇒ ∆ϕA→B '∫ +�−�

    (Ax (0, 0) + x∂xAx (0, 0)− �∂yAx (0, 0)

    )dx

    ⇒ ∆ϕA→B ' 2�Ax (0, 0)− 2�2∂yAx (0, 0) + O(�3)

  • Quantitative computation of the twist

    Add up the four contributions:

    ∆ϕA→B = 2�Ax (0, 0)− 2�2∂yAx (0, 0) + O(�3)∆ϕB→C = 2�Ay (0, 0) + 2�2∂xAy (0, 0) + O(�3)∆ϕC→D = −2�Ax (0, 0)− 2�2∂yAx (0, 0) + O(�3)∆ϕD→A = −2�Ay (0, 0) + 2�2∂xAy (0, 0) + O(�3) .

    Altogether the resulting change is

    ∆ϕ = ∆ϕA→B + ∆ϕB→C + ∆ϕC→D + ∆ϕD→A= 4�2

    (∂xAy (0, 0)− ∂yAx (0, 0)

    )+ O(�3) .

  • The electromagnetic field tensor

    ∆ϕ = 4�2(∂xAy (0, 0)− ∂yAx (0, 0)

    )+ O(�3) .

    ⇒ Fxy := lim�→0

    ∆ϕ4�2 = ∂xAy − ∂yAx .

    ⇒ Fµν := lim�→0

    ∆ϕ4�2 = ∂µAν − ∂νAµ .

  • Electromagnetic field tensor in 3+1 dimensions

    Fµν := ∂µAν − ∂νAµ .

    Fµν =

    0 −Ex/c −Ey/c −Ez/c

    Ex/c 0 Bz −ByEy/c −Bz 0 BxEz/c By −Bx 0

  • Electrodynamics in a 2+1-dimensional world

    Fµν =

    0 −Ex/c −Ey/cEx/c 0 BEy/c −B 0

  • The homogeneous Maxwell equationsare purely geometric

  • Homogeneous Maxwell equations

    I Each face of the cube contributeswith 4�2Fµν , evaluted at thecenter of the respective face.

    I Consider e.g. the front face (0,1):

    4�2F01(x + �e2) ' 4�2(1 + �∂2)F01(x).

    front: F01(x + �e2) ' (1 + �∂2)F01(x)rear: − F01(x− �e2) ' (−1 + �∂2)F01(x)top: F12(x + �e0) ' (1 + �∂0)F12(x)

    bottom: − F12(x− �e0) ' (−1 + �∂0)F12(x)right: − F02(x + �e1) ' (−1− �∂1)F02(x)left: F02(x− �e1) ' (1− �∂1)F02(x)

    ⇒ 2�(∂0F12(x)− ∂1F02(x) + ∂2F01(x)

    )= 0

  • Homogeneous Maxwell equationsin 1+3 dimenions

    Orientation of the cube Differential equation

    txy ∂0F12(x)− ∂1F02(x) + ∂2F01(x) = 0

    txz ∂0F13(x)− ∂1F03(x) + ∂3F01(x) = 0

    tyz ∂0F23(x)− ∂2F03(x) + ∂3F02(x) = 0

    xyz ∂1F23(x)− ∂2F13(x) + ∂3F12(x) = 0

  • Homogeneous Maxwell equations

    Shortcut: Cyclic bracket notations:I Round parenthesis where all permutations are added

    symmetrically with a positive sign:

    T(µν...) =1n!∑σ∈Sn

    Tσ{µν...}

    I Square brackets where all permuations are addedantisymmetrically with the sign of the permutation:

    T[µν...] =1n!∑σ∈Sn

    sign(σ)Tσ{µν...}

  • Shortest form of the homogeneous Maxwell equations:

    ∂[µFνρ] = 0.

    There are three spatio-temporal ones:

    ∂[0F12] = 0 ⇒ ∂tBz + ∂xEy − ∂yEx = 0∂[0F13] = 0 ⇒ −∂tBy + ∂xEz − ∂zEx = 0∂[0F23] = 0 ⇒ ∂tBx + ∂yEz − ∂zEy = 0

    ⇒ ∂t~B + ~∇× ~E = 0

    and a single equation among the three spatial indices:

    ∂[1F23] = 0 ⇒ ∂xBx + ∂yBy + ∂zBz = 0

    ⇒ ~∇ · ~B = 0

  • The dual field tensor ?F

    I Define the fully antisymmetric Levy-Civita symbols in 1+3dimenions:

    �µνρτ :=

    1 if (µνρτ) is an even permutation of (0123)−1 if (µνρτ) is an odd permutation of (0123)0 if (µνρτ) is not a permutation of (0123)

    together with�µνρτ := −�µνρτ

    I Define the dual field tensor ?F with the components

    ?Fµν = �µνρτF ρτ , ?Fµν = �µνρτFρτ

  • The dual field tensor ?FThe dual field tensor has the components

    ?Fµν =

    0 Bx By Bz−Bx 0 Ez/c −Ey/c−By −Ez/c 0 Ex/c−Bz Ey/c −Ex/c 0

    and

    ?Fµν =

    0 −Bx −By −BzBx 0 Ez/c −Ey/cBy −Ez/c 0 Ex/cBz Ey/c −Ex/c 0

    With this tensor, the homogeneous Maxwell equations can bewritten in a continuity-equation-like form as

    ∂µ?Fµν = 0

  • Section 5.2

    Electrodynamics - Physical Content

  • Action of the electromagnetic field

    Apply heuristic principle of simplicity.

    Search for the simplest action that is compatible with symmetries

    I Poincaré invariance (Lorentz + rotation + translation):

    ⇒ Action should be a Lorentz scalar

    I Gauge invariance:

    ⇒ Action should depend on F, not on A

    Fµµ: too simple FµνFµν : this works!

  • Action of the electromagnetic field

    SEM =∫

    d4x L(A,∇A)

    L(A,∇A) = − 14µ0FµνFµν

    Rewrite Lagrangian in A and ∇A:

    L(A,∇A) = − 12µ0

    ((∂µAν)(∂µAν)− (∂µAν)(∂νAµ)

    )Rewrite Lagrangian in the physical field ~E and ~B:

    L = 14µ0

    ( 2c2~E 2 − 2~B2

    )= 12

    (�0~E 2 −

    1µ0~B2)

  • Action of the electromagnetic field

    L(A,∇A) = − 14µ0FµνFµν

    The Lagrangian is just the sum of the squares of the twist in eachpossible orientation of the loop.

  • Second set of Maxwell equations without sources

    Lagrangian of the electromagnetic field without sources:

    L(A,∇A) = − 12µ0

    ((∂µAν)(∂µAν)− (∂µAν)(∂νAµ)

    )Euler-Lagrange equations of motion:

    −∂ρ∂L

    ∂(∂ρAσ)= 0

    ⇒ 1µ0∂ρ(∂ρAσ − ∂σAρ

    )= 0

    Short form:∂ρF ρσ = 0

  • Second set of Maxwell equations without sources

    ∂ρF ρσ = 0 with Fµν =

    0 Ex/c Ey/c Ez/c

    −Ex/c 0 Bz −By−Ey/c −Bz 0 Bx−Ez/c By −Bx 0

    For σ = 0:

    −1c (∂xEx + ∂yEy + ∂zEz) = 0 ⇒ div~E = 0

    For σ = 1, 2, 3:

    1c2∂tEx − ∂yBz + ∂zBy = 01c2∂tEy + ∂xBz − ∂zBx = 01c2∂tEz − ∂xBy + ∂yBx = 0

    ⇒ rot~B − µ0�0 ∂t~E = 0

  • Electromagnetic waves in vacuum

    −∂ρ∂L

    ∂(∂ρAσ)= 0 ⇒ 1

    µ0∂ρ(∂ρAσ − ∂σAρ

    )= 0

    Wave equation, valid in any gauge:

    �Aµ − ∂µ ∂ρAρ = 0

    We can simplify the wave equation by choosing a particular gauge:

    Chose Lorenz gauge: ∂µAµ = 0

    ⇒ �Aµ = 0

    The wave equation �A = 0 is only validin the Lorenz gauge ∂µAµ = 0.

  • Electromagnetic field with charged sourcesCharges are twist generators

    jµ(x) =

    cρ(~x , t)j1(~x , t)j2(~x , t)j3(~x , t)

    .

  • Electromagnetic field with charged sourcesInduced twists in the vicinity

  • Easiest way to couple the charged source to the EM field

    L(A,∇A) = − 14µ0FµνFµν + Aµjµ.

    Attention: This term should be gauge-invariant!

    Inhomogeneous Maxwell equations:

    ∂ρF ρσ = −µ0jσ

    Inhomogeneous Maxwell equations old-fashioned:

    div~E = 1�0ρ0 , rot~B − µ0�0 ∂t~E = µ0~J

    Wave equation in Lorenz gauge:

    �Aµ = −µ0jµ

  • Gauge invariance of the coupling term

    L(A,∇A) = − 14µ0FµνFµν + Aµjµ.

    Aµ → õ = Aµ + ∂µf ⇒ L → L̃ = L+ (∂µf )jµ

    The solution must not change under gauge transformations!

    ⇒ S̃ − S = −∫

    d4x (∂µf )jµ = 0

    Partial integration

    S̃ − S =∫

    d4x f ∂µjµ = 0

    Charge conservation: ∂µjµ = 0 ⇒ ∂tρ(~x , t) +∇~j(~x , t) = 0

    Charge conservation follows from gauge invariance.

  • Energy-momentum tensor of the electromagnetic field

    L = − 14µ0FµνFµν + Aµjµ.

    The conjugate momentum can be shown to be given by:

    ∂L∂(∂ρAσ)

    = Fσρ.

    The energy-momentum tensor is defined as

    T µν =∂L

    ∂(∂µAρ)∂νAρ − δµνL

    ⇒ T µν = F ρµ∂νAρ +14FρσF

    ρσδµν

  • Section 5.3

    Electrodynamics using differential forms

    Extra section, see lecture notes for further details

  • Section 5.4

    Electromagnetically charged fields

  • Real-valued scalar fieldMassless spring chain → Wave equation

    L = −12(∂νφ)(∂νφ) = 12c2 (

    ∂t φ)2︸ ︷︷ ︸

    Ekin

    − 12(∇φ)2︸ ︷︷ ︸

    Epot

  • Real-valued scalar fieldMassive spring chain → Klein-Gordon equation

    L = −12(∂νφ)(∂νφ)− 12M

    2φ2

  • Attaching U(1)-circles: Complex-valued scalar fieldMassless spring chain

    Epot =12(∇Re[φ]

    )2 + 12(∇Im[φ])2

  • Complex-valued scalar field

    L = 12c2( ∂∂tRe[φ]

    )2 − 12(∇Re[φ])2 − 12M2Re[φ]2+ 12c2

    ( ∂∂t Im[φ]

    )2 − 12(∇Im[φ])2 − 12M2Im[φ]2

    Simplify expression:

    L = 1c2( ∂∂t φ

    ∗)( ∂∂t φ

    )−(∇φ∗

    )(∇φ)−M2φ∗φ

    Final form:L = −(∂νφ

    )∗(∂νφ

    )−M2φ∗φ

  • Global and local gauge invariance

    Massive complex scalar field:

    L = −(∂νφ)∗(∂νφ

    )−M2φ∗φ

    This Lagrangian is invariant under global gauge transformations:

    φ(x)→ eiθφ(x)(θ = const

    )

    It is not invariant under local gauge transformations

    φ(x)→ eiθ(x)φ(x)(θ(x) 6= const

    )⇒ We have to modify the Lagrangian to make in gauge-invariant!

  • Restore local gauge invarianceModify the derivative

    The ordinary derivativegenerates translations:

    exp(a∂x )f (x) = f (x + a)

    exp(aµ∂µ)f (x) = f (x + a)

    ⇒ We should modify thederivative in such a waythat it also goes straightinside the tube!

  • Covariant derivative

    Begin with a real-valued field and perform the following gaugetransformation:

    φ(x)→ eiϕ(x)φ(x)

    Modify the partial derivative by the following steps:

    I First revert the gauge transformation.I Then apply the usual partial derivative.I Finally restore the gauge transformation.

    ⇒ Dµ := eiϕ(x)∂µe−iϕ(x)

    ⇒ Dµ = ∂µ − i(∂µϕ(x)

    )

  • Covariant derivative

    Dµ = ∂µ − i(∂µϕ(x)

    )Recall that

    dϕ(x) = e~Aµ(x) dxµ

    where e is the elementary charge. This means that

    ⇒ ∂µϕ(x) =e~Aµ(x)

    Hence the covariant derivative reads:

    Dµ = ∂µ −ie~

    Aµ .

  • Minimal coupling by covariant derivativeCooking recipe

    I Take the Lagrangian:

    L = −12(∂νφ)(∂νφ)− 12M

    2φ2

    I Complexify it:

    L = −(∂νφ)∗(∂νφ

    )−M2φ∗φ

    I Replace partial by covariant derivative:

    L = −(Dνφ)∗(Dνφ)−M2φ∗φ.

    I Add the Lagrangian of the free electromagnetic field:

    L = − 14µ0FµνFµν − (Dνφ

    )∗(Dνφ)−M2φ∗φ.

  • Minimal coupling: Solving the equations of motion

    L = − 14µ0FµνFµν − (Dνφ

    )∗(Dνφ)−M2φ∗φ.Rewrite:

    (Dνφ)∗(Dνφ) = (∂νφ∗ + ie~ Aνφ∗

    )(∂νφ− ie

    ~Aνφ

    )= (∂νφ∗)(∂νφ) +

    ie~

    (Aνφ∗(∂µφ)− Aνφ(∂νφ∗)

    )+ e

    2

    ~2AνAνφ∗φ

    Simply solve the Lagrange equations of motion:

    ∂L∂φ− ∂µ

    ∂L∂(∂µφ)

    = 0

    ∂L∂φ∗− ∂µ

    ∂L∂(∂µφ∗)

    = 0

    ∂L∂Aν − ∂µ

    ∂L∂(∂µAν)

    = 0

  • Section 6.1

    Relativistic quantum mechanics - without spin

  • Quantum mechanics of spinless particlesRecall the non-relativistic Schrödinger equation

    Starting point for “deriving” the free Schrödinger equation is

    E = p2

    2m .

    Then substitute according to the correspondence rules:

    E → i~∂t ~p → −i~~∇

    Let the relation act on a complex-valued wave function ψ(~x , t):

    i~∂tψ(x, t) = −~2~∇2

    2m ψ(x, t) .

    Interpretation: |ψ(x, t)|2 = ψ∗(x, t)ψ(x, t) is the probabilitydensity to find a particle at x at time t. This requires normalization∫

    ddx ψ∗(x, t)ψ(x, t) = 1.

  • Galilei invariance of the Schrödinger equationUnder Galilei transformation x→ x̃ = x− vt a function T (x)transforms as

    T̃ (x̃) = T (x)

  • Galilei invariance of the Schrödinger equation

    I Recall Galilei transformation:

    x→ x̃ = x− vt , t̃ = t , ∂̃t = ∂t + v · ∇ , ∇̃ = ∇ .

    I Assume “naive” transformation behavior of the wave function:

    ψ̃(x̃, t̃) = ψ(x, t) .

    I Insert both into the Schrödinger equation:

    −i~∂̃t ψ̃(x̃, t̃) = −~2∇̃2

    2m ψ̃(x̃, t̃)

    ⇒ −i~(∂t + v · ∇)ψ(x, t) 6= −~2∇2

    2m ψ(x, t)

    ...violates form invariance!

  • Galilei invariance of the Schrödinger equation

    This is not so surprising:I A resting particle has a constant wave function.I A moving particle has a wave function with oscillations in

    space and time.ψ̃(x̃, t̃) = ψ(x, t) cannot work .

    Possible solution: We have to add some kind of waviness

    ψ̃(x̃, t̃) = ei θ(x ,t)ψ(x, t)

    with a real function θ(x, t).I This preserves the transformation of the probability density

    ψ̃∗(x̃, t̃)ψ̃(x̃, t̃) = ψ∗(x, t)ψ(x, t)

  • Galilei invariance of the Schrödinger equationBoost of the wave function

    ψ̃(x̃, t̃) = ei θ(x ,t)ψ(x, t)

    Choose phase function θ(x, t) such that the Schrödinger equationis form-invariant under Galilei transformations:

    Exercise:

    ψ̃(x̃, t̃) = exp(− imv · x

    ~+ imv

    2t2~

    )ψ(x, t)

    = exp(− imv · x̃

    ~− imv

    2t̃2~

    )ψ(x̃ + vt̃, t̃)

    This boosted wave function obeys the transformed wave equation

    −i~∂̃t ψ̃(x̃, t̃) = −~2∇̃2

    2m ψ̃(x̃, t̃) .

  • Klein-Gordon equation

    Relativistic replacement rule:

    p→ −i~∂ ⇔ pµ → −i~∂µ .

    Recall that

    pµ =

    E/cpxpypz

    , pµ =−E/cpxpy

    pz

    T

    , ∂µ =

    1c ∂t∂x∂y∂z

    T

    This is compatible with E → i~∂t , ~p → −i~~∇ .

    Insert into energy-momentum relation pµpµ = −m2c2:

    ⇒ −~2∂µ∂µφ(x) = −m2c2φ(x) ⇒ (�−M2)φ(x) = 0

    where M = mc~ is the mass parameter.

  • Klein-Gordon equationThe problem of negative probability currents

    φ∗(�−M2)φ− φ(�−M2)φ∗ = 0.⇒ φ∗∂µ∂µφ− φ∂µ∂µφ∗ = 0.

    Because of∂µ(φ∗∂µφ) = (∂µφ∗)(∂µφ) + φ∗(∂µ∂µφ)∂µ(φ∂µφ∗) = (∂µφ)(∂µφ∗) + φ(∂µ∂µφ∗)

    this can be rewritten as

    ∂µ(φ∗∂µφ− φ∂µφ∗︸ ︷︷ ︸4-current jµ

    ) = ∂µjµ = 0 .

    This is a nothing but a relativistic continuity equation of the form

    − 1c2 ∂t( φ∗∂tφ− φ∂tφ∗︸ ︷︷ ︸

    probability density ρc2

    ) +∇ · ( φ∗∇φ− φ∇φ∗︸ ︷︷ ︸probability current −~j

    ) = 0

    or in short ∂tρ+∇~j = 0.

  • Klein-Gordon equationThe problem of negative probability currents

    − 1c2 ∂t( φ∗∂tφ− φ∂tφ∗︸ ︷︷ ︸

    probability density ρc2

    ) +∇ · ( φ∗∇φ− φ∇φ∗︸ ︷︷ ︸probability current −~j

    ) = 0

    I The probability density in the KG equation is no longerpositive definite!

    I This reflects the existence of antimatter, but needs a deeperunderstanding.

  • Section 6.2

    Dirac equation

  • Dirac equationMotivation

    Warmup exercise: Klein-Gordon equation in 0+1 dimensions:( 1c2∂

    2t + M2

    )φ(t) = 0.

    Define χ(t) = iMc ∂tφ(t) and split into two equations:

    i 1c ∂tχ(t)−Mφ(t) = 0i 1c ∂tφ(t)−Mχ(t) = 0 .

    Define vectorψ(t) =

    (φ(t)χ(t)

    )Rewrite system of first-order differential equations:

    i(0 11 0

    )1c ∂t

    (φ(t)χ(t)

    )−M

    (φ(t)χ(t)

    )= 0

  • Dirac equationWarmup exercise

    i(0 11 0

    )︸ ︷︷ ︸

    =:γ

    1c ∂t

    (φ(t)χ(t)

    )−M

    (φ(t)χ(t)

    )= 0

    In short: (iγ 1c ∂t −M

    )ψ(t) = 0.

    Iterate two times:

    M2ψ(t) = − 1c2γ2∂2t ψ(t)

    Each component separately should obey the Klein-Gordon equation( 1c

    2∂2t + M2)ψ = 0

    ⇒ γ2 = 1

  • Dirac equationWarmup exercise

    (iγ 1c ∂t −M

    )ψ(t) = 0 with γ2 = 1

    γ2 = 1 is the Dirac algebra in this simple case.

    Representations:I γ = ±1

    I γ = σx =(0 11 0

    )

    I γ = σy =(0 −ii 0

    )

    I γ = σz =(1 00 −1

    )

  • Full Dirac equationin 1+3 dimensions

    (iγµ∂µ −M

    )ψ(x) = 0.

    γ0, γ1, γ2, γ3 are four operators acting in anauxiliary k-dimensional space, called spinor space.

    Shortcut notation: Fermi dagger:

    /∂ = γµ∂µ = γµ∂µ.

    ⇒ (i/∂ −M)ψ = 0.

  • Dirac algebra

    Iterate the Dirac equation i/∂ψ = Mψ two times:

    M2ψ(x) = −/∂/∂ψ(x) = −γµγν∂µ∂νψ(x).

    Each component obeys the Klein-Gordon equation

    −γµγν∂µ∂νψ(x) = �ψ(x) = ηµν∂µ∂νψ(x)

    I Comparing both sides it seems that γµγν = −ηµν .I However, the partial derivatives commute, leading to the

    weaker condition γµγν + γνγµ = −2ηµν1k

    Dirac algebra:

    {γµ, γν} = −2ηµν1k

  • Representation of the γ-matricesWarmup exercise: The Pauli algebra

    {σi , σj} = 2δij1k . (i , j = x , y)

    Category Monomial NumberScalar S: 1 1Vector V: σx , σy 2Pseudoscalar P: σxσy = iσz 1

    total number: M=4

    Therefore, we need at least 2× 2 matrices. And in fact, this iswhat we get:

    σx =(0 11 0

    ), σy =

    (0 −ii 0

    )

  • Representation of the γ-matricesNow consider the Pauli algebra

    {γµ, γν} = −2ηµν1k

    Category Monomial #Scalar S: 1 1Vector V: γ0, γ1, γ2, γ3 4Tensor T: γ0γ1, γ0γ2, γ0γ3, γ1γ2, γ1γ3, γ2γ3 6Pseudovector A: γ0γ1γ2, γ0γ1γ3, γ0γ2γ3, γ1γ2γ3 4Pseudoscalar P: γ0γ1γ2γ3 1

    total number: M=16

    ⇒ So we need at least 4× 4 matrices.

  • The γ5 matrix

    Define:γ5 := iγ0γ1γ2γ3.

    This is analogous to σz in the Pauli case.

    (Why not γ4?)

    Properties:I γ5 is Hermitean, i.e., (γ5)† = γ5.I (γ5)2 = γ5γ5 = 1, therefore the eigenvalues of γ5 are ±1.I γ5 anticommutes with all other γ-matrices:{γ5, γµ} = γ5γµ + γµγ5 = 0.

  • Dirac representationThe standard choice

    γ0 =(1

    −1

    ), γj =

    (σj

    −σj

    ), γ5 =

    (1

    1

    ),

    Explicitely:

    γ0 =

    1 1 −1−1

    γ1 = 11−1−1

    γ2 =

    −iii− i

    γ3 = 1 −1−1

    1

    and γ5 = iγ0γ1γ2γ3 =

    1 111

    .

  • Proof of the Dirac representation

    We can write the γ-matrices in Dirac representation as tensorproducts of Pauli matrices:

    γ0 = σz ⊗ 12 , γj = iσy ⊗ σj (j = 1, 2, 3)

    This allows us to verify the Dirac algebra:

    1. {γ0, γ0} = 2(γ0)2 = 2(σz)2 ⊗ 1 = 2122. {γ0, γj} = (σzi σy )⊗ σj + (iσyσz)⊗ σj = i{σz , σy} ⊗ σj = 03. {γj , γk} = (iσy )2 ⊗ {σj , σk} = −2δjk 14

    ⇒ {γµ, γν} = −2ηµν 14.

  • Weyl and Majorana representation

    Weyl representation:

    γ0 =(

    1

    1

    ), γj =

    (σj

    −σj

    ), γ5 =

    (−1

    1

    ),

    Majorana representation:

    γ0 =(

    σ2

    σ2

    ), γ1 =

    (iσ3

    iσ3

    ), γ2 =

    (−σ2

    σ2

    )

    γ3 =(−iσ1

    −iσ1

    ), γ5 =

    (σ2

    −σ2

    ),

  • Spinor space

    I All the representations introduced above are four-dimensional.I The purpose of this internal space is to provide a realization

    of the anticommutation relations {γµ, γν} = −2ηµν .I This internal space is called spinor space.

    I A spinor is a vector living in spinor space.I A spinor field ψ(x) is a vector field mapping the Minkowski

    space to the spinor space.

    Internal spinor space 6= Minkowski space

  • Form-invariance of the Dirac equation under Lorentztransformation

    I The Dirac equation (iγµ∂µ −M

    )ψ(x) = 0

    is expected to be invariant under Lorentz transformations

    xµ → x̃µ = Λµνxν , ∂µ → ∂̃µ = Λ νµ ∂νI Obviously this requires the Dirac γ-matrices to transform like

    a 4-vectorγµ → γ̃µ = Λµνγν .

    I This makes only sense if the transformed γ̃-matrices obeyagain the Dirac algebra, namely, {γ̃µ, γ̃ν} = −2ηµν .

    can be proven easily!

  • Covariant spinor transformation

    What we have shown so far:

    If ψ(x) is a solution of the Dirac equation(iγµ∂µ −M

    )ψ = 0

    and ifx̃µ = Λµνxν and γ̃µ = Λµνγν ,

    then ψ̃(x̃) = ψ(x) is also solution of(i γ̃µ∂̃µ −M

    )ψ̃ = 0 .

    But for form-invariance, we would like to have the same Diracequation with the same γ-matrices in the moving frame.

  • Covariant spinor transformation

    I Idea: Instead of transforming γ̃µ = Λµνγν , we transform thewave function by

    ψ̃(x̃) = Sψ(x) ,

    where S is a 4× 4 matrix in spinor space.I This leads us to the condition(

    iγµ∂̃µ −M)Sψ =

    (iγµΛ νµ ∂ν −M

    )Sψ = 0 .

    I This is equivalent to (see lecture notes)

    Λρνγν = S−1γρS.

    Unfortunately, this is quadratic in S...

  • Covariant spinor transformationCalculation of the spinor transformation matrix S

    If you have a nonlinear relation between elements of Liegroups, try it infinitesimally and get a linear relation.

    Consider infinitesimal Lorentz transformation:

    Λµν = δµν + �(αβ)λ(αβ)µν +O(�2)

    Assume that spinor transformation has the same form:

    S = 1 + i2 �(αβ)σ(αβ) +O(�2)

    Then the inverse is super-trivial:

    S−1 = 1− i2 �(αβ)σ(αβ) +O(�2)

    Insert all this into the equation above:

    Λµνγν = S−1γµS

  • Covariant spinor transformationCalculation of the spinor transformation matrix S

    Insert this into Λµνγν = S−1γµS:

    ⇒ γµ + �(αβ)λ(αβ)µνγν +O(�2) =

    =(1− i2 �

    (αβ)σ(αβ) +O(�2))

    ︸ ︷︷ ︸S−1

    γµ(1 + i2 �

    (αβ)σ(αβ) +O(�2))

    ︸ ︷︷ ︸S

    Compare to first order in �:

    ⇒ �(αβ)λ(αβ)µνγν =

    [γµ ,

    i2 �

    (αβ)σ(αβ)]

    This is now a linear system of equations!Linear equations are solvable.

  • Covariant spinor transformationCalculation of the spinor transformation matrix S

    Intuitive approach: Recall the defintion of the generators[λ(αβ)]µν = δµαηβν − δ

    µβηαν

    and insert them into the linear system of equations:

    γµσ(αβ) − σ(αβ)γµ = −2i(δµαηβν − δ

    µβηαν

    )γν

    ⇒ σ(αβ) =i2 [γα, γβ] =

    {iγαγβ if α 6= β0 otherwise

    Proof: Trivial for α = β. For α 6= β we have to show that

    i(γµγαγβ − γαγβγµ

    )= −2i

    (δµαγβ − δµβγα

    ).

    Now we use {γµ, γν} = −2ηµν1. If µ 6= α and µ 6= β we have {γα, γµ} ={γβ , γµ} = 0, hence both sides are zero. If µ = α then

    γαγαγβ − γα γβγα︸︷︷︸=−γαγβ

    = {γα, γα}︸ ︷︷ ︸=−2

    γβ = −2γβ .

    The proof for µ = β is analogous.

  • Covariant spinor transformationCalculation of the spinor transformation matrix S

    Remark: We have done a very common trick in Theoretical Physics.Given a non-linear relation between transformations Λ and S, we studythem infinitesimally to first order in �. This gives a linear relationbetween the corresponding generators λ and σ.

    To get the relation between the (non-infinitesimal) fulltransformations, we have to exponentiate the generators:

    Λ = exp( ∑

    0≤α

  • Covariant spinor transformation

    Example: Rotation O(3):

    Λ(12)(θ) =

    1

    cos θ sin θ− sin θ cos θ

    1

    goes together with

    S(12)(θ) =

    eiθ/2

    e−iθ/2eiθ/2

    e−iθ/2

    For a rotation by 2π = 360◦ we get Λ = 1 but S = −1.

  • Covariant spinor transformation acting on fieldsRecall Lorentz transformation acting on a scalar field

    Under a translation a scalar field φ : Rd → R changes as

    x→ x− a⇒ φ(x)→ φ(x + a)

    ⇒ φ(xµeµ)→ φ((xµ + aµ)eµ) = eaµ∂µφ(xµeµ)

    Similarly, for a Lorentz boost or a rotation we have

    x→ Λ−1x⇒ φ(x)→ φ(Λx)

    ⇒ φ(xµeµ)→ φ(Λµνxνeµ) = e???φ(xµeµ)

    What has to be put in the exponential?

  • Covariant spinor transformation acting on fieldsRecall SO+(1, 3) transformation acting on a scalar field

    φ(xµeµ)→ φ(Λµνxνeµ) = e???φ(xµeµ)

    Consider infinitesimal transformation Λ = 1 + �(αβ)λ(αβ):

    φ(xµeµ)→ φ(xµeµ + �(αβ)λαβµνx

    νeµ)

    This is like a small (x-dependent) translation:

    φ(xµeµ)→(1 + �(αβ)λ(αβ)µνx

    ν∂µ)φ(xµeµ)

    Exponentiate it:

    φ(x)→ exp(θ(αβ)λ(αβ)

    µνxν∂µ

    )φ(x).

  • Covariant spinor transformation acting on fieldsExample: Rotation SO(3)

    Consider for example λ(12):

    φ(x)→ exp(φ(−x∂y + y∂x

    ))φ(x).

    If you like quantum mechanics, rewrite it as

    φ(x)→ exp(− i~φ[−i~(x∂y − y∂x︸ ︷︷ ︸

    Lz

    )])φ(x).

    whith the orbital angular momentum

    Lz = ~r × ~p = −i~(x∂y − y∂x )

  • Covariant spinor transformation acting on fieldsHow a Lorentz transformation acts on a spinor field

    If the scalar field φ(x) is replaced by a 4-component spinor fieldψ(x), then the transformation of the internal degrees of freedomhas to be included as well:

    ψ(x)→ exp(− i~θ(αβ)

    [i~λ(αβ)µνx

    ν∂µ︸ ︷︷ ︸orbital

    −~2σ(αβ)︸ ︷︷ ︸spin

    ])ψ(x)

    Example: Rotation around the z-axis:

    ψ(x)→ exp(− i~

    (Lz + Sz︸ ︷︷ ︸=Jz

    ))ψ(x)

    operator acting on eigenvalues

    Lz = i~λ(12)µνxν∂µ Minkowski space n~

    Sz = −~2σ(12) Spinor space ±~2

  • Plane-wave solutions and their interpretation

    ψ(x) = u(p)ei~ ε p·x

    I u(p) is a 4-component “amplitude”, called spinor.I ε is a dimensionless constant to be determined later.

    Insert into Dirac equation (iγµ∂µ −M)ψ = 0 with M = mc/~:(−1~εγµpµ −M

    )u(p) = 0

    This gives the eigenvalue problem

    −εγµpµ u(p) = mc u(p)

  • Plane-wave solutions and their interpretationAnalyze eigenvalue problem −εγµpµ u(p) = mc u(p)

    −γµpµ =

    E/c 0 −pz −p−0 E/c −p+ pzpz p− −E/c 0p+ −pz 0 −E/c

    .Eigenvalues:

    ±√E 2/c2 − p+p− − p2z = ±

    √E 2/c2 − p2

    Compare with E 2 = p2c2 + m2c4:

    �1 = �2 = +1 (solutions with positive energy)�3 = �4 = −1 (solutions with negative energy)

  • Plane-wave solutions and their interpretationAnalyze eigenvalue problem −εγµpµ u(p) = mc u(p)

    Eigenvectors for � = 1 in Dirac representation:

    u1(p) =

    E/c + mc

    0pzp+

    , u2(p) =

    0E/c + mc

    p−−pz

    ,Eigenvectors for � = −1:

    u3(p) =

    pzp+

    E/c + mc0

    , u4(p) =

    p−−pz0

    E/c + mc

    .

  • Vecolity-momentum problem of negative-energy solutions

    First consider positive energy � = +1:

    ψ(x) = u(p) ei~pµx

    µ.

    For simplicity consider wave in x -direction with py = pz = 0:

    I In which direction does the wave move?Determine velocity of a “wave crest” with phase zero:

    0 = pµxµ = −Ec ct+pxx ⇒ x =Epx t ⇒ v = E/px > 0.

    I What is the momentum P = −i~∇ ?

    Pψ = −i~∇ei~pµx

    µ = −i~∇ei~ (−Et+~p·~x) = +~pψ

    For positive energy, ~v and ~p point in the same direction.

  • Vecolity-momentum problem of negative-energy solutions

    Now consider negative energy � = −1:

    ψ(x) = u(p) e−i~pµx

    µ.

    For simplicity consider wave in x -direction with py = pz = 0:

    I In which direction does the wave move?Determine velocity of a “wave crest” with phase zero:

    0 = pµxµ = −Ec ct+pxx ⇒ x =Epx t ⇒ v = E/px > 0.

    I What is the momentum P = −i~∇ ?

    Pψ = −i~∇e−i~pµx

    µ = −i~∇e−i~ (−Et+~p·~x) = −~pψ

    For negative energy, ~v and ~p point in opposite directions.

  • Vecolity-momentum problem of negative-energy solutions

    Remember: Relativistic particles with negative energy have the pa-radoxical property that velocity and momentum point in oppositedirections.

    Problems:I Particle transfers a momentum opposite to its own flight

    direction, contradicting with our everyday experience.I Negative pressure.I Systems with an unlimited negative energy are

    thermodynamically unstable.

  • Dirac hole theory

    At T = 0 a physical system selects the state of lowest energy.Therefore, all modes with negative energy have to be occupied.I The Dirac sea is phy-

    sically unobservable.I The total energy of

    the sea is set to zero.I Absence of negative

    energy and negativemomentum = presenceof positive energyand positive momentum.

    ⇒ Concept of an antiparticle

  • Dirac hole theory

    I If the particles carry an additional electrical charge, thecorresponding anti-particles carry the opposite charge. Forexample, the antiparticles of electrons (e−) are calledpositrons (e+) since they carry a positive charge.

    I The bare Dirac equation describes a two-particle theory(particle and antiparticle). However, Dirac’s hole theoryimplicitly uses the scenario of a multi-particle theory obeyingthe Pauli principle.

    I Particles and antiparticles recombine to radiation, i.e.,e+e− → 2γ.

    I The opposite reaction γ → e+e− is only possible in thepresence of an atom, for example a crystal lattice, providedthat the energy of the photon is high enough.

  • Feynman-Stückelberg Interpretation

  • Probability conservation in the Dirac equationLet ~∂ be a derivative acting to the right:(

    iγµ~∂µ −M)ψ = 0.

    Adjoint Dirac equation

    ψ†(−i(γµ)† ~∂µ −M

    )= 0 .

    The adjoint Dirac matrices are given by

    γ0 = (γ0)† , γk = −(γk)† (k = 1, 2, 3)

    ⇒ (γµ)† = γ0γµγ0

    ψ†(−iγ0γµγ0 ~∂µ −M

    )= 0

    ⇒ ψ†(iγ0γµ ~∂µγ0 + M

    )= 0

    ∣∣∣ γ0γ0 = −η00 = 1⇒ ψ†

    (iγ0γµ ~∂µγ0 + γ0Mγ0

    )= 0

    ⇒ ψ†γ0(iγµ ~∂µ + M

    )γ0 = 0

  • Probability conservation in the Dirac equationDefine the adjoint spinor:

    ψ̄ := ψ†γ0

    Then the Dirac equation and its adjoint read

    (iγµ~∂µ −M

    )ψ = 0 ψ̄

    (iγµ ~∂µ + M

    )= 0.

    Probabilty current and continuity equation

    ψ̄γµ( ~∂µ + ~∂µ)ψ = 0 ⇒ ∂µ(ψ̄γµψ︸ ︷︷ ︸

    =jµ

    )= 0.

    jµ := ψ̄γµψ =(ρ1c~j

    ).

  • Probability conservation in the Dirac equation

    ∂µjµ = 0.

    In terms of the probability density

    ρ = ψ̄γ0ψ = ψ†ψ

    and the probability 3-current ~j with

    (~j)k = cψ̄γkψ = cψ†γ0γkψ

    this continuity equation can be written in the usual form as

    ∂tρ = ~∇ ·~j .

  • Action for the Dirac equationfor electrically neutral fermions

    We can guess the action:

    S =∫

    d4x ψ̄(x)(iγµ∂µ −M

    )ψ(x).

    ψ̄γµψ transforms as a 4-vector ⇒ S is a scalar.

    S depends linearly on two (independent) fields ψ and ψ̄:

    I Variation with respect to ψ̄:

    δS =∫

    d4x δψ̄(x)(iγµ∂µ −M

    )ψ(x)︸ ︷︷ ︸

    =0

    ⇒(iγµ∂µ −M

    )ψ(x) = 0.

    I Variation with respect to ψ (needs partial integration):

    δS =∫

    d4x ψ̄(x)(iγµ∂µ −M

    )︸ ︷︷ ︸=0

    δψ(x) ⇒ ψ̄(x)(iγµ ~∂µ+M

    )= 0.

  • Dirac equation: Adding an electric field

    Although the Dirac equation involves the imaginary unit, itdescribes electrically neutral fermions.Recall Majorana representation:

    γ0 =(

    σ2

    σ2

    ), γ1 =

    (iσ3

    iσ3), γ2 =

    (−σ2

    σ2

    )γ3 =

    (−iσ1

    −iσ1), γ5 =

    (σ2

    −σ2),

    Then the Dirac equation (iγµ∂µ −M)ψ = 0 reads:−∂1 ∂3 0 ∂0 − ∂2∂3 ∂1 −∂0 + ∂2 00 ∂0 + ∂2 −∂1 ∂3

    −∂0 − ∂2 0 ∂3 ∂1

    ψ = Mψ.⇒ This equation is real. There is no U(1) circle.

  • Dirac equation: Adding an electric fieldComplexification

    I Complexify ψ(x) in order to implementing a U(1) circle.I Replace derivative by covariant derivative:

    ∂µ → Dµ = ∂µ −ie~Aµ

    ⇒ Dirac equation of a charged fermion in an electromagnetic field:

    (iγµ(∂µ −

    ie~Aµ)−M

    )ψ = 0.

    Substitute pµ = −i~∂µ and M = mc/~:(γµ(pµ − eAµ

    )︸ ︷︷ ︸Min. coupling

    +mc)ψ = 0 .

  • Charged Dirac equation: Probability conservation

    Dirac equation and adjoint Dirac equation:(iγµ(~∂µ − ie~ Aµ)−M

    )ψ = 0

    ∣∣∣ ψ̄(iγµ( ~∂µ + ie~ Aµ)+ M)

    = 0

    Multiply from left and right and add them:

    ψ̄(iγµ(~∂µ − ie~ Aµ)−M

    )ψ = 0

    ψ̄(iγµ(~∂µ +

    ie~Aµ)

    + M)ψ = 0

    —————————————-

    iψ̄γµ( ~∂µ + ~∂µ)ψ = 0

    ∂µjµ = 0 , jµ = ψ̄γµψ

  • Charged Dirac equation: ActionSimply replace derivative by the covariant derivative and add thecontribution of the electromagnetic field:

    S =∫

    d4x[ψ̄(x)

    (iγµ(∂µ −

    ie~Aµ)−M

    )ψ(x) − 14µ0

    FµνFµν].

    I Varying with respect to ψ̄ gives the charged Dirac equation:(iγµ(∂µ −

    ie~Aµ)−M

    )ψ = 0

    I Varying with respect to A gives the inhomogeneous Maxwellequations

    ∂L∂Aµ

    − ∂ν∂L

    ∂(∂νAµ)= 0

    with∂νF νµ = −µ0

    e~ψ̄γµψ︸ ︷︷ ︸

    =jµ.

    As we have shown above, this current is conserved, reflectingcharge conservation.

  • Charge conjugationReflections:

    in space: parityin time: time reversalin the circle: charge conjugation

    Charge conjugation: ψ(x)→ ψ∗(x)

    ψ∗(x) obeys the complex-conjugated Dirac equation:

    (−i(γµ)∗∂µ −M)ψ∗ = 0

    Charge conjugation is a symmetry if and only if −(γµ)∗ is again avalid representation of the Dirac algebra. If so, it differs from γµ byunitary transformation (see exercise):

    (−γµ)∗ = C†γµC

  • Chirality

    Chirality refers to the eigenvalues of γ5 = iγ0γ1γ2γ3.

    Define the projectorsP± :=

    12(1± γ5

    )with the usual properties P2± = P±, P+P− = 0, and P+ + P− = 1.

    If the fermion is massless, obeying the massless Dirac equation

    γµ∂µψ = 0

    then the so-called chiral spinors

    ψR = ψ+ := P+ψ , ψL = ψ− := P−ψ

    are also solutions of the Dirac equation. ψR is the right-handedspinor, ψL is the left-handed spinor.

  • ChiralityThe most suitable representation is the Weyl representation with

    γ5Weyl =(−1

    1

    )

    Here the Dirac equation takes the form:

    (iγµ∂µ −M)ψ =(

    −M i(∂0 + ~σ · ~∇)i(∂0 − ~σ · ~∇) −M

    )(ψLψR

    )= 0

    The mass terms −M mix the two components. For M = 0 theydecouple into two equations, the so-called Weyl equations:

    i(∂0 − ~σ · ~∇)ψL = 0 ,i(∂0 + ~σ · ~∇)ψR = 0

    This is also interesting if the mass is small (for example neutrino oscillations).

  • Section 7.1

    Non-relativistic Hamilton formalism

  • Lagrange is fine.Why then do we need another formalism?

    I Different viewpoint, first-order differential equations.I Much closer to quantum mechanics than Lagrange.I Quantization straight-forward by replacing momenta by

    derivatives and replacing Poisson brackets with commutators.I Possibly a good investment for