Written in May 2026, backdated to when the work happened. This post is a reflection, not a contemporaneous journal entry.

Classical Field Theory: A Comprehensive Reference

Scalar fields, gauge theory, spinors, and the Standard Model Lagrangian; everything you need before QFT.

The Lagrangian mechanics document introduced the basic framework of classical field theory. The special relativity and tensors document built the mathematical language. This document combines them into the real subject: a complete classical treatment of the field theories that underlie modern physics.

By the end, you should understand:

  • Why spontaneous symmetry breaking leads to Goldstone bosons (and to the Higgs mechanism)
  • Why the photon exists at all; not as a postulate but as a consequence of local U(1)U(1) symmetry
  • How Yang-Mills theory generalizes QED to describe the strong and weak forces
  • Why the Dirac equation automatically predicts antimatter and spin
  • What the Standard Model Lagrangian actually says, term by term

That’s the bridge into quantum field theory proper.


Table of Contents

  1. The Setup: What We’re Building
  2. Real Scalar Fields
  3. Complex Scalar Fields and Global U(1)
  4. Spontaneous Symmetry Breaking and Goldstone Bosons
  5. Gauge Symmetry: Why Local Matters
  6. Scalar Electrodynamics and Minimal Coupling
  7. Non-Abelian Gauge Theory (Yang-Mills)
  8. Spinor Representations of the Lorentz Group
  9. The Dirac Equation
  10. Quantum Electrodynamics as a Classical Field Theory
  11. The Higgs Mechanism
  12. The Standard Model Lagrangian
  13. Appendix: Formulas and Conventions

1. The Setup: What We’re Building

Every fundamental theory of physics is specified by a Lagrangian (density). From this single object; a Lorentz-invariant combination of fields and their derivatives; everything else follows: equations of motion (via Euler-Lagrange), conservation laws (via Noether), interactions, and ultimately (after quantization) scattering amplitudes and particle physics.

The rules for constructing fundamental Lagrangians are surprisingly constrained:

  1. Lorentz invariance; the Lagrangian must be a Lorentz scalar
  2. Locality; it’s a function of fields and their derivatives at a single spacetime point (no action-at-a-distance)
  3. Hermiticity / reality; L\mathcal{L} is real (so the action is real)
  4. Gauge symmetries; if present, the Lagrangian respects them
  5. Renormalizability; for quantum field theory, typically only operators up to mass dimension 4 in 4D spacetime (this is a quantum criterion, but it massively constrains classical writing)
  6. Discrete symmetries; depending on the theory, may respect P, C, T, or specific combinations

These rules plus particle content typically narrow the Lagrangian to a handful of possibilities, often just one. The Standard Model Lagrangian is essentially the Lagrangian you get from requiring Lorentz invariance, gauge invariance under SU(3)×SU(2)×U(1)SU(3) \times SU(2) \times U(1), and the observed particle content.

With those rules in mind, let’s build.

Natural Units Reminder

Throughout: =c=1\hbar = c = 1, metric ημν=diag(+,,,)\eta_{\mu\nu} = \text{diag}(+,-,-,-), Einstein summation implicit.


2. Real Scalar Fields

The simplest possible field: a single real number at every spacetime point. No direction, no internal structure, just a value.

ϕ(x)R\phi(x) \in \mathbb{R}

The Free Lagrangian

L=12(μϕ)(μϕ)12m2ϕ2\mathcal{L} = \tfrac{1}{2}(\partial_\mu \phi)(\partial^\mu \phi) - \tfrac{1}{2} m^2 \phi^2

Each term is a Lorentz scalar. The first is a “kinetic” term (involves derivatives); the second is a “mass” term. The factors of 1/21/2 are conventional, chosen to make the equations of motion look clean.

Euler-Lagrange

From the general field equation μ(L/(μϕ))L/ϕ=0\partial_\mu(\partial \mathcal{L}/\partial(\partial_\mu \phi)) - \partial \mathcal{L}/\partial \phi = 0:

L(μϕ)=μϕ,Lϕ=m2ϕ\frac{\partial \mathcal{L}}{\partial(\partial_\mu \phi)} = \partial^\mu \phi, \qquad \frac{\partial \mathcal{L}}{\partial \phi} = -m^2 \phi

So:

μμϕ+m2ϕ=0\partial_\mu \partial^\mu \phi + m^2 \phi = 0

(+m2)ϕ=0\boxed{(\Box + m^2)\phi = 0}

This is the Klein-Gordon equation. It’s the relativistic wave equation for a spin-0 particle of mass mm.

Plane-Wave Solutions

Try ϕ(x)=eikx=ei(k0tkx)\phi(x) = e^{-ik \cdot x} = e^{-i(k^0 t - \vec k \cdot \vec x)}:

ϕ=kμkμϕ=(k02k2)ϕ\Box \phi = -k^\mu k_\mu \phi = -(k^{0 2} - |\vec k|^2)\phi

So Klein-Gordon requires:

kμkμ=m2k02=k2+m2k^\mu k_\mu = m^2 \quad \Longleftrightarrow \quad k^{0 2} = |\vec k|^2 + m^2

This is exactly the relativistic energy-momentum relation if we identify k0=Ek^0 = E and k=p\vec k = \vec p. Klein-Gordon plane waves describe particles of mass mm.

General solution: a superposition of plane waves with both positive and negative k0k^0:

ϕ(x)=d3k(2π)32ωk[a(k)eikx+a(k)e+ikx]\phi(x) = \int \frac{d^3 k}{(2\pi)^3 \, 2\omega_k}\left[a(\vec k)\, e^{-i k \cdot x} + a^*(\vec k)\, e^{+i k \cdot x}\right]

where ωk=+k2+m2\omega_k = +\sqrt{|\vec k|^2 + m^2} is the positive energy. In classical field theory this is a real-valued solution; in the quantum version, aa and aa^* get promoted to annihilation/creation operators.

The Canonical Momentum

π(x)=Lϕ˙=ϕ˙=0ϕ\pi(x) = \frac{\partial \mathcal{L}}{\partial \dot \phi} = \dot\phi = \partial^0 \phi

This is what gets quantized into the commutator [ϕ^(x,t),π^(y,t)]=iδ3(xy)[\hat\phi(\vec x, t), \hat\pi(\vec y, t)] = i\delta^3(\vec x - \vec y).

Adding Interactions

The simplest interacting scalar theory adds a ϕ4\phi^4 term:

L=12(μϕ)(μϕ)12m2ϕ2λ4!ϕ4\mathcal{L} = \tfrac{1}{2}(\partial_\mu \phi)(\partial^\mu \phi) - \tfrac{1}{2} m^2 \phi^2 - \frac{\lambda}{4!}\phi^4

The ϕ4\phi^4 term is the simplest Lorentz-invariant interaction for a real scalar that’s renormalizable (mass dimension 4). The equation of motion becomes:

(+m2)ϕ=λ6ϕ3(\Box + m^2)\phi = -\frac{\lambda}{6}\phi^3

Nonlinear. This theory; called ”ϕ4\phi^4 theory”; is the simplest interacting field theory and the standard pedagogical example in QFT. It’s also a prototype for the Higgs sector (just with a different potential, as we’ll see).


3. Complex Scalar Fields and Global U(1)

A complex scalar has two real components:

ϕ(x)C\phi(x) \in \mathbb{C}

Equivalently ϕ=(ϕ1+iϕ2)/2\phi = (\phi_1 + i\phi_2)/\sqrt{2} for two real fields ϕ1,ϕ2\phi_1, \phi_2.

Lagrangian

L=(μϕ)(μϕ)m2ϕϕ\mathcal{L} = (\partial_\mu \phi^*)(\partial^\mu \phi) - m^2 \phi^* \phi

(Note: no factor of 1/2 in front. That’s because ϕ\phi and ϕ\phi^* are treated as independent fields in the variational principle, so there’s an implicit factor-of-2 doubling compared to the real case.)

Equations of Motion

Varying with respect to ϕ\phi^* (treating ϕ\phi as independent):

(+m2)ϕ=0(\Box + m^2)\phi = 0

and similarly for ϕ\phi^*. Two complex Klein-Gordon equations; or equivalently, two real ones for ϕ1\phi_1 and ϕ2\phi_2.

The U(1) Symmetry

The Lagrangian is invariant under the transformation

ϕeiαϕ,ϕeiαϕ\phi \to e^{i\alpha} \phi, \qquad \phi^* \to e^{-i\alpha}\phi^*

for any constant real α\alpha. The two factors cancel in products like ϕϕ\phi^*\phi and (ϕ)(ϕ)(\partial \phi^*)(\partial \phi).

This is a global symmetry; α\alpha is the same at every spacetime point. It’s called U(1)U(1) because the set of transformations {eiα}\{e^{i\alpha}\} forms the group U(1)U(1) (unit circle in complex plane).

Noether Current

Apply Noether’s theorem. The infinitesimal variation is δϕ=iαϕ\delta \phi = i\alpha \phi, δϕ=iαϕ\delta \phi^* = -i\alpha \phi^*. The Noether current (derived in the Lagrangian doc):

jμ=L(μϕ)δϕ+L(μϕ)δϕj^\mu = \frac{\partial \mathcal{L}}{\partial(\partial_\mu \phi)}\delta\phi + \frac{\partial \mathcal{L}}{\partial(\partial_\mu \phi^*)}\delta\phi^*

=(μϕ)(iαϕ)+(μϕ)(iαϕ)=iα[ϕ(μϕ)ϕ(μϕ)]= (\partial^\mu \phi^*)(i\alpha\phi) + (\partial^\mu \phi)(-i\alpha \phi^*) = i\alpha[\phi^*(\partial^\mu \phi) - \phi(\partial^\mu \phi^*)]

Dropping the arbitrary α\alpha factor (conventional), and also dropping the ii to make the current real:

jμ=i[ϕ(μϕ)ϕ(μϕ)]j^\mu = i[\phi^*(\partial^\mu \phi) - \phi(\partial^\mu \phi^*)]

You can verify directly: μjμ=0\partial_\mu j^\mu = 0 using the Klein-Gordon equation.

The conserved charge Q=d3xj0Q = \int d^3 x \, j^0 is what we call electric charge (up to overall factor). So electric charge conservation ultimately comes from a U(1)U(1) symmetry of the Lagrangian; a beautiful instance of Noether’s theorem.

Interpretation

In quantum theory, the two complex components encode particles and antiparticles. A complex scalar describes a charged particle (like a π+\pi^+) together with its antiparticle (π\pi^-). A real scalar (like a π0\pi^0, approximately) has no such internal structure; it’s its own antiparticle.


4. Spontaneous Symmetry Breaking and Goldstone Bosons

Now we modify the potential. Consider a complex scalar with Lagrangian:

L=(μϕ)(μϕ)V(ϕϕ)\mathcal{L} = (\partial_\mu \phi^*)(\partial^\mu \phi) - V(\phi^*\phi)

with:

V(ϕϕ)=μ2ϕϕ+λ(ϕϕ)2V(\phi^*\phi) = -\mu^2 \phi^*\phi + \lambda (\phi^*\phi)^2

Here μ2>0\mu^2 > 0 and λ>0\lambda > 0. Note the wrong sign on the mass-like term: the coefficient of ϕϕ\phi^*\phi is negative, whereas a normal mass term would be positive.

The “Mexican Hat” Potential

Plot VV as a function of ϕ|\phi|. At ϕ=0|\phi| = 0, we’re at a local maximum, not a minimum. The minimum is at:

dVd(ϕϕ)=μ2+2λϕϕ=0    ϕϕ=μ22λv22\frac{dV}{d(\phi^*\phi)} = -\mu^2 + 2\lambda \phi^*\phi = 0 \implies \phi^*\phi = \frac{\mu^2}{2\lambda} \equiv \frac{v^2}{2}

So the minimum is a ring in the complex ϕ\phi plane, at ϕ=v/2|\phi| = v/\sqrt{2} where v=μ/λv = \mu/\sqrt{\lambda} is the vacuum expectation value.

The potential is rotationally symmetric (invariant under U(1)U(1)), but the minimum is not a single point; it’s a circle, and the system has to pick one point on that circle.

Spontaneous Symmetry Breaking

The U(1)U(1) symmetry of the Lagrangian is still exact. But the ground state (vacuum) is not symmetric; once the field picks a direction on the circle, you can’t transform to another without doing work.

This is spontaneous symmetry breaking: the laws are symmetric, the vacuum is not.

Classical analogy: a ball rolling on a rotationally symmetric Mexican-hat surface. The surface doesn’t favor any direction. But the ball eventually settles at one point, and from then on, the system as a whole (ball + surface) has broken the symmetry.

Expanding Around the Vacuum

Let’s choose the vacuum at ϕ=v/2\phi = v/\sqrt{2} (real, no imaginary part). Write the field as small fluctuations:

ϕ(x)=12[v+η(x)+iξ(x)]\phi(x) = \frac{1}{\sqrt{2}}[v + \eta(x) + i\xi(x)]

where η\eta and ξ\xi are real fields. Substitute into the Lagrangian and expand. The algebra is tedious but instructive; the result is:

L=12(η)2+12(ξ)2μ2η2+(cubic and quartic terms)+const\mathcal{L} = \tfrac{1}{2}(\partial \eta)^2 + \tfrac{1}{2}(\partial \xi)^2 - \mu^2 \eta^2 + (\text{cubic and quartic terms}) + \text{const}

The key observations:

  • The field η\eta (fluctuation along the radial direction; up the side of the hat) has a mass term: mη2=2μ2m_\eta^2 = 2\mu^2
  • The field ξ\xi (fluctuation along the tangential direction; around the rim) has no mass term at all

Goldstone’s Theorem

The massless field ξ\xi is a Goldstone boson. Goldstone’s theorem says: for every spontaneously broken continuous symmetry, there is a massless scalar particle in the spectrum.

Why? Because fluctuations along the broken-symmetry direction cost no energy; you’re moving along the valley, where the potential is flat. “No energy cost” translates to “no mass term.”

More formally: broken symmetry generators produce massless particles. The number of Goldstone bosons equals the number of broken generators.

Generalization

For a broken symmetry group GHG \to H (where HH is the unbroken subgroup), the number of Goldstone bosons is dimGdimH\dim G - \dim H. In our U(1)U(1) example: dimU(1)dim{trivial}=10=1\dim U(1) - \dim\{\text{trivial}\} = 1 - 0 = 1 Goldstone. Matches.

Physical Examples

  • Pions in QCD are (approximately) Goldstone bosons of broken chiral symmetry. They’re not exactly massless (the symmetry is only approximate), so pions are pseudo-Goldstones.
  • Phonons in solids are Goldstone bosons of broken translational symmetry.
  • Magnons in ferromagnets are Goldstones of broken rotational symmetry.

Goldstone’s theorem is a deep statement about broken continuous symmetries, with applications well beyond particle physics.

Foreshadowing

In a moment, we’ll make the U(1)U(1) symmetry local (gauge). When a gauge symmetry is spontaneously broken, Goldstone’s theorem gets modified: instead of a massless Goldstone boson, the would-be Goldstone gets “eaten” by the gauge boson, giving it a mass. This is the Higgs mechanism. We’ll work it through in section 11.


5. Gauge Symmetry: Why Local Matters

Global vs. Local Symmetry

A global symmetry transforms the field the same way everywhere: ϕeiαϕ\phi \to e^{i\alpha}\phi with α\alpha a constant.

A local (or gauge) symmetry allows α\alpha to depend on spacetime: ϕ(x)eiα(x)ϕ(x)\phi(x) \to e^{i\alpha(x)}\phi(x).

Global symmetries are relatively tame; they yield conservation laws via Noether. Local symmetries are much stronger: they constrain the form of the Lagrangian itself, forcing new fields into existence.

The Problem

Start with the complex scalar Lagrangian L=(μϕ)(μϕ)m2ϕϕ\mathcal{L} = (\partial_\mu \phi^*)(\partial^\mu \phi) - m^2 \phi^*\phi and try to make it locally invariant. Under ϕeiα(x)ϕ\phi \to e^{i\alpha(x)}\phi:

μϕeiαμϕ+i(μα)eiαϕ\partial_\mu \phi \to e^{i\alpha} \partial_\mu \phi + i(\partial_\mu \alpha) e^{i\alpha}\phi

The extra term i(μα)eiαϕi(\partial_\mu \alpha)e^{i\alpha}\phi breaks the invariance of the kinetic term. Under local U(1)U(1), the Lagrangian is not invariant.

The Fix: the Covariant Derivative

Introduce a new four-vector field Aμ(x)A_\mu(x), transforming as

AμAμ+1eμαA_\mu \to A_\mu + \frac{1}{e}\partial_\mu \alpha

and define the covariant derivative:

Dμϕ(μ+ieAμ)ϕD_\mu \phi \equiv (\partial_\mu + ie A_\mu)\phi

Check: under the combined local U(1)U(1) transformation (on ϕ\phi) and the above shift (on AμA_\mu):

Dμϕ(μ+ieAμ+iμα)(eiαϕ)D_\mu \phi \to (\partial_\mu + ieA_\mu + i\partial_\mu \alpha)(e^{i\alpha}\phi)

=eiαμϕ+i(μα)eiαϕ+ieAμeiαϕ+i(μα)eiαϕ(1)= e^{i\alpha}\partial_\mu \phi + i(\partial_\mu \alpha)e^{i\alpha}\phi + ieA_\mu e^{i\alpha}\phi + i(\partial_\mu\alpha)e^{i\alpha}\phi \cdot (-1)

Wait; let me redo this carefully. Under ϕϕ=eiαϕ\phi \to \phi' = e^{i\alpha}\phi and AμAμ=Aμ+(1/e)μαA_\mu \to A'_\mu = A_\mu + (1/e)\partial_\mu \alpha:

Dμϕ=(μ+ieAμ)(eiαϕ)D'_\mu \phi' = (\partial_\mu + ieA'_\mu)(e^{i\alpha}\phi)

=μ(eiαϕ)+ie[Aμ+(1/e)μα]eiαϕ= \partial_\mu(e^{i\alpha}\phi) + ie[A_\mu + (1/e)\partial_\mu \alpha] e^{i\alpha}\phi

=eiαμϕ+i(μα)eiαϕ+ieAμeiαϕ+i(μα)eiαϕ= e^{i\alpha}\partial_\mu \phi + i(\partial_\mu \alpha)e^{i\alpha}\phi + ieA_\mu e^{i\alpha}\phi + i(\partial_\mu \alpha)e^{i\alpha}\phi

Hmm, the i(μα)eiαϕi(\partial_\mu \alpha) e^{i\alpha}\phi terms add, not cancel. Let me reconsider the sign in the AμA_\mu transformation. The right choice is:

AμAμ1eμαA_\mu \to A_\mu - \frac{1}{e}\partial_\mu \alpha

Then:

Dμϕ=eiαμϕ+i(μα)eiαϕ+ieAμeiαϕi(μα)eiαϕD'_\mu \phi' = e^{i\alpha}\partial_\mu \phi + i(\partial_\mu \alpha)e^{i\alpha}\phi + ieA_\mu e^{i\alpha}\phi - i(\partial_\mu \alpha)e^{i\alpha}\phi

=eiα[μϕ+ieAμϕ]=eiαDμϕ= e^{i\alpha}[\partial_\mu \phi + ieA_\mu \phi] = e^{i\alpha} D_\mu \phi

The covariant derivative transforms covariantly; just like ϕ\phi itself, with the same phase. This is the whole point.

Summary: the Gauge Structure

Under a local U(1)U(1) transformation with parameter α(x)\alpha(x):

ϕeiαϕ\phi \to e^{i\alpha}\phi

AμAμ1eμαA_\mu \to A_\mu - \frac{1}{e}\partial_\mu \alpha

Dμϕeiα(Dμϕ)D_\mu \phi \to e^{i\alpha}(D_\mu \phi)

The covariant derivative transforms like the field. Therefore (Dμϕ)(Dμϕ)(D_\mu \phi)^* (D^\mu \phi) is gauge invariant, and so is ϕϕ\phi^*\phi. We can write a gauge-invariant Lagrangian:

Lmatter=(Dμϕ)(Dμϕ)m2ϕϕλ(ϕϕ)2\mathcal{L}_{\text{matter}} = (D_\mu \phi)^*(D^\mu \phi) - m^2 \phi^*\phi - \lambda (\phi^*\phi)^2

What About AμA_\mu Itself?

The gauge field AμA_\mu needs its own kinetic term. From the tensor doc, we know the gauge-invariant combination is:

Fμν=μAννAμF_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu

Under the gauge transformation AμAμ(1/e)μαA_\mu \to A_\mu - (1/e)\partial_\mu \alpha:

FμνFμν(1/e)(μνανμα)=FμνF_{\mu\nu} \to F_{\mu\nu} - (1/e)(\partial_\mu \partial_\nu \alpha - \partial_\nu \partial_\mu \alpha) = F_{\mu\nu}

(Partial derivatives commute.) So FμνF_{\mu\nu} is gauge invariant, and 14FμνFμν-\tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} is the correct kinetic term.

Can AμA_\mu Have a Mass?

A mass term for AμA_\mu would be 12mA2AμAμ\tfrac{1}{2} m_A^2 A_\mu A^\mu. But under the gauge transformation, AμAμA_\mu A^\mu is not invariant. A naive mass term breaks gauge symmetry.

So gauge symmetry forces gauge bosons to be massless. The photon is massless because electromagnetism has gauge symmetry.

(Exception: if the gauge symmetry is spontaneously broken; the Higgs mechanism; mass can sneak in without explicitly breaking the symmetry. Section 11.)

The Full Classical Theory

Putting it all together:

L=(Dμϕ)(Dμϕ)m2ϕϕλ(ϕϕ)214FμνFμν\mathcal{L} = (D_\mu \phi)^*(D^\mu \phi) - m^2 \phi^*\phi - \lambda(\phi^*\phi)^2 - \tfrac{1}{4} F_{\mu\nu}F^{\mu\nu}

This is a complete, gauge-invariant, Lorentz-invariant, renormalizable theory describing a complex scalar field coupled to an Abelian gauge field. It’s called scalar electrodynamics.

The Moral

Gauge symmetry isn’t a convenience or a mathematical trick. It’s a principle that demands the existence of gauge fields. Start with a free complex scalar and insist on local U(1)U(1) invariance; a new field AμA_\mu must exist, with specific transformation law and specific coupling. The photon is not postulated; it is the unique consequence of demanding local U(1)U(1).

This generalizes. Every force in the Standard Model comes from gauging a symmetry:

  • U(1)EMU(1)_{\text{EM}} → photon
  • SU(2)LSU(2)_L → W and Z bosons
  • SU(3)CSU(3)_C → gluons

Gauge symmetry is the principle that generates all known non-gravitational forces. General relativity, similarly, can be understood as gauging translations (though this is technically more subtle).


6. Scalar Electrodynamics and Minimal Coupling

Let’s unpack what we just wrote.

The Lagrangian Expanded

LSQED=(Dμϕ)(Dμϕ)V(ϕϕ)14FμνFμν\mathcal{L}_{\text{SQED}} = (D_\mu \phi)^*(D^\mu \phi) - V(\phi^*\phi) - \tfrac{1}{4} F_{\mu\nu}F^{\mu\nu}

Expanding the covariant derivative:

(Dμϕ)(Dμϕ)=(μϕieAμϕ)(μϕ+ieAμϕ)(D_\mu \phi)^*(D^\mu \phi) = (\partial_\mu \phi^* - ieA_\mu \phi^*)(\partial^\mu \phi + ieA^\mu \phi)

=(μϕ)(μϕ)+ieAμ[ϕ(μϕ)ϕ(μϕ)]+e2AμAμϕϕ= (\partial_\mu \phi^*)(\partial^\mu \phi) + ieA^\mu[\phi^*(\partial_\mu \phi) - \phi(\partial_\mu \phi^*)] + e^2 A_\mu A^\mu \phi^*\phi

Three terms:

  • Free scalar kinetic term
  • Current-gauge field coupling: AμjμA^\mu \cdot j_\mu where jμ=ie[ϕ(μϕ)ϕ(μϕ)]j_\mu = ie[\phi^*(\partial_\mu\phi) - \phi(\partial_\mu\phi^*)]
  • “Seagull” interaction e2A2ϕ2e^2 A^2 \phi^2; unique to scalar QED (fermion QED has no such direct term)

Minimal Coupling

The prescription “replace μ\partial_\mu with DμD_\mu” is called minimal coupling. It’s the simplest way to couple a charged field to a gauge field, and it’s minimal in the sense of using only the fewest derivatives and powers of fields. Standard Model couplings are all minimal.

Equations of Motion

Varying with respect to AμA^\mu:

μFμν=Jν\partial_\mu F^{\mu\nu} = J^\nu

where JνJ^\nu is the matter current; now including pieces from both the scalar field and its interaction with AμA^\mu. Maxwell’s equations, with a specific source.

Varying with respect to ϕ\phi^*:

DμDμϕ+Vϕ=0D_\mu D^\mu \phi + \frac{\partial V}{\partial \phi^*} = 0

This is Klein-Gordon with covariant derivatives. In the presence of an electromagnetic field, a charged scalar satisfies this equation, not the ordinary Klein-Gordon.

Charge Quantization

In classical scalar QED, the charge ee is just a parameter; any value is allowed. Quantum mechanics, through Dirac’s argument about magnetic monopoles, suggests that charges would be quantized in units of ee if monopoles existed. This argument remains theoretical, since monopoles haven’t been observed; but in the Standard Model, charge quantization is built in via the gauge structure (hypercharge assignments).


7. Non-Abelian Gauge Theory (Yang-Mills)

Everything we just did for U(1)U(1) generalizes to larger symmetry groups; but with crucial new features. This is where the strong and weak forces come from.

Non-Abelian Groups

U(1)U(1) is abelian: any two elements commute (eiαeiβ=eiβeiαe^{i\alpha} e^{i\beta} = e^{i\beta} e^{i\alpha}). Physical gauge groups used in the Standard Model include:

  • SU(2)SU(2): 2×2 unitary matrices with determinant 1. Three generators. Used for weak isospin.
  • SU(3)SU(3): 3×3 unitary matrices with determinant 1. Eight generators. Used for color.

These groups are non-abelian: matrix multiplication doesn’t commute.

Matter Fields in Non-Abelian Gauge Theory

The matter field is now a multiplet; not a single complex scalar, but a vector of complex scalars transforming as a representation of the group. For SU(N)SU(N), the simplest is the fundamental representation: an NN-component complex vector.

ϕ=(ϕ1ϕN)\phi = \begin{pmatrix} \phi_1 \\ \vdots \\ \phi_N \end{pmatrix}

Under a gauge transformation:

ϕ(x)U(x)ϕ(x)\phi(x) \to U(x)\phi(x)

where U(x)U(x) is an N×NN\times N unitary matrix that can vary in spacetime.

The Lie Algebra

Near the identity, group elements can be written:

U(x)=exp(igαa(x)Ta)1+igαaTa+U(x) = \exp\left(i g \alpha^a(x) T^a\right) \approx 1 + ig\alpha^a T^a + \cdots

where TaT^a (a=1,,N21a = 1, \ldots, N^2 - 1 for SU(N)SU(N)) are the generators of the group. They’re Hermitian matrices. For SU(2)SU(2), Ta=σa/2T^a = \sigma^a/2 (Pauli matrices over 2). For SU(3)SU(3), Ta=λa/2T^a = \lambda^a/2 (Gell-Mann matrices over 2).

Key property: generators don’t commute:

[Ta,Tb]=ifabcTc[T^a, T^b] = i f^{abc} T^c

The numbers fabcf^{abc} are the structure constants of the group. For SU(2)SU(2), fabc=ϵabcf^{abc} = \epsilon^{abc} (Levi-Civita). For SU(3)SU(3), fabcf^{abc} is more complicated but specific.

The Gauge Field

For non-abelian groups, you need one gauge field per generator:

Aμa(x),a=1,,N21A_\mu^a(x), \quad a = 1, \ldots, N^2-1

SU(2)SU(2) has three: Wμ1,Wμ2,Wμ3W^1_\mu, W^2_\mu, W^3_\mu (the weak gauge bosons, before mixing).

SU(3)SU(3) has eight: the eight gluons.

Often package into a matrix-valued field:

Aμ(x)Aμa(x)TaA_\mu(x) \equiv A_\mu^a(x) T^a

The Covariant Derivative

Dμϕ=(μ+igAμ)ϕ=(μ+igAμaTa)ϕD_\mu \phi = (\partial_\mu + ig A_\mu)\phi = (\partial_\mu + igA_\mu^a T^a)\phi

where gg is the coupling constant (analog of ee).

The Transformation Law of AμA_\mu

For the covariant derivative to transform simply (DμϕU(x)DμϕD_\mu \phi \to U(x) D_\mu \phi), the gauge field must transform as:

AμUAμU1+ig(μU)U1A_\mu \to U A_\mu U^{-1} + \frac{i}{g}(\partial_\mu U) U^{-1}

The first term is a matrix conjugation (reflecting that AμA_\mu transforms in the adjoint rep of the group); the second is the analog of the inhomogeneous μα\partial_\mu \alpha shift from U(1)U(1).

For an infinitesimal transformation U=1+igαaTaU = 1 + ig\alpha^a T^a:

δAμa=μαa+gfabcαbAμc\delta A_\mu^a = \partial_\mu \alpha^a + g f^{abc}\alpha^b A_\mu^c

The second term; involving structure constants; has no U(1)U(1) analog. It says the gauge field itself transforms non-trivially under the symmetry. This will have dramatic consequences.

The Field Strength

The obvious generalization μAννAμ\partial_\mu A_\nu - \partial_\nu A_\mu doesn’t transform nicely by itself in the non-abelian case. The correct gauge-covariant generalization:

Fμνa=μAνaνAμa+gfabcAμbAνcF_{\mu\nu}^a = \partial_\mu A_\nu^a - \partial_\nu A_\mu^a + g f^{abc} A_\mu^b A_\nu^c

Or in matrix form:

Fμν=μAννAμ+ig[Aμ,Aν]F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu + ig[A_\mu, A_\nu]

The extra term ig[Aμ,Aν]ig[A_\mu, A_\nu] (involving the commutator) is the crucial new feature.

Gauge Boson Self-Interactions

Here’s where things get genuinely different from electromagnetism. Expand 14FμνaFaμν-\tfrac{1}{4} F^a_{\mu\nu} F^{a\,\mu\nu} in the Lagrangian. Schematically:

FμνaFaμν(A)2+gAA2+g2A4F^a_{\mu\nu} F^{a\,\mu\nu} \sim (\partial A)^2 + g \partial A \cdot A^2 + g^2 A^4

The first term gives kinetic terms for AaA^a. But the other two terms describe gauge bosons interacting with each other.

  • Three-gluon vertex (schematically gAA2g \partial A \cdot A^2)
  • Four-gluon vertex (g2A4g^2 A^4)

Gluons; unlike photons; carry color charge, and hence interact with each other directly. This is the mathematical root of asymptotic freedom and confinement in QCD. It is also why non-abelian gauge theories are so much richer than QED.

The Yang-Mills Lagrangian

LYM=14FμνaFaμν\mathcal{L}_{\text{YM}} = -\tfrac{1}{4} F^a_{\mu\nu} F^{a\,\mu\nu}

Despite looking similar to the Maxwell Lagrangian, this theory is radically different from QED:

  • Nonlinear: the gauge fields interact with each other
  • Has a dimensionless coupling gg that runs with energy
  • Exhibits asymptotic freedom (coupling decreases at high energy)
  • Confinement (coupling diverges at low energy, if there are no Higgs-like fields)
  • Much harder to solve, both classically and in quantum theory

The Standard Model uses Yang-Mills with gauge groups SU(3)C×SU(2)L×U(1)YSU(3)_C \times SU(2)_L \times U(1)_Y.


8. Spinor Representations of the Lorentz Group

This is the hardest section. Up to this point we’ve worked with scalar fields (Lorentz invariant) and vector fields (transforming with the matrix Λμν\Lambda^\mu{}_\nu). Fermions transform in a third way; as spinors; that has no analog in pre-relativistic physics.

Take this section slowly. The machinery here is genuinely new.

The Lorentz Algebra

The Lorentz group’s Lie algebra has six generators: three for rotations (JiJ^i) and three for boosts (KiK^i). Commutation relations:

[Ji,Jj]=iϵijkJk[J^i, J^j] = i\epsilon^{ijk} J^k

[Ji,Kj]=iϵijkKk[J^i, K^j] = i\epsilon^{ijk} K^k

[Ki,Kj]=iϵijkJk[K^i, K^j] = -i\epsilon^{ijk} J^k

The minus sign in the last one; boosts don’t close under commutation; their product is a rotation; is what makes the Lorentz group non-compact and distinguishes it from the rotation group.

The Trick: Complex Combinations

Define:

Ai=12(Ji+iKi),Bi=12(JiiKi)A^i = \tfrac{1}{2}(J^i + iK^i), \qquad B^i = \tfrac{1}{2}(J^i - iK^i)

These satisfy:

[Ai,Aj]=iϵijkAk,[Bi,Bj]=iϵijkBk,[Ai,Bj]=0[A^i, A^j] = i\epsilon^{ijk} A^k, \quad [B^i, B^j] = i\epsilon^{ijk} B^k, \quad [A^i, B^j] = 0

Two independent copies of the rotation algebra. So the complexified Lorentz algebra factorizes into SU(2)×SU(2)SU(2) \times SU(2).

Representations of SU(2)SU(2) are labeled by a non-negative half-integer jj (spin). Representations of the Lorentz group are labeled by a pair (jA,jB)(j_A, j_B).

The Fundamental Representations

  • (0,0)(0, 0): scalar. One component.
  • (12,0)(\tfrac{1}{2}, 0): left-handed Weyl spinor. Two complex components.
  • (0,12)(0, \tfrac{1}{2}): right-handed Weyl spinor. Two complex components.
  • (12,12)(\tfrac{1}{2}, \tfrac{1}{2}): vector. Four components.
  • (12,12)(\tfrac{1}{2}, \tfrac{1}{2}) appears as the rank-2 antisymmetric tensor, spin-1 massive field, etc.

The two Weyl representations are the building blocks for all spinor physics. Left-handed and right-handed Weyl spinors transform differently under the Lorentz group. This is the mathematical fact behind the observation that the weak interaction treats left-handed and right-handed fermions asymmetrically.

Weyl Spinors Explicitly

A left-handed Weyl spinor ψL\psi_L is a two-component complex object. Under a Lorentz transformation with rotation parameters θ\vec\theta and boost parameters η\vec\eta:

ψLeiσθ/2+ση/2ψL\psi_L \to e^{-i\vec\sigma \cdot \vec\theta/2 + \vec\sigma \cdot \vec\eta/2}\psi_L

where σ=(σ1,σ2,σ3)\vec\sigma = (\sigma^1, \sigma^2, \sigma^3) are the Pauli matrices. Note: the boost generator has no ii; boosts act differently from rotations.

A right-handed Weyl spinor transforms with the opposite sign on the boost:

ψReiσθ/2ση/2ψR\psi_R \to e^{-i\vec\sigma \cdot \vec\theta/2 - \vec\sigma \cdot \vec\eta/2}\psi_R

Under rotations, both ψL\psi_L and ψR\psi_R transform the same way. Under boosts, they transform oppositely.

The Double Cover

A 2π2\pi rotation, which you’d think returns you to the same state, sends ψψ\psi \to -\psi for a spinor. You need a 4π4\pi rotation to truly come back. This is because spinors are representations not of the Lorentz group itself, but of its double cover; which for the Lorentz group is SL(2,C)SL(2, \mathbb{C}).

This ”- sign under 2π2\pi rotation” is the experimental signature of spin-½ particles. It’s why Fermi-Dirac statistics work the way they do, and why electrons can’t all pile into the same ground state.

Dirac Spinors

A Dirac spinor ψ\psi combines a left-handed and a right-handed Weyl spinor into a four-component object:

ψ=(ψLψR)\psi = \begin{pmatrix} \psi_L \\ \psi_R \end{pmatrix}

This is the representation used in standard Dirac theory. It’s reducible; it’s the direct sum of two Weyl representations; but for a massive particle, the two halves are coupled by the mass term, making the Dirac description natural.

Gamma Matrices

To write Lorentz-invariant equations involving spinors, we need matrices that connect the different spinor components. The gamma matrices γμ\gamma^\mu (four of them, μ=0,1,2,3\mu = 0, 1, 2, 3) are defined by the anticommutation relations:

{γμ,γν}γμγν+γνγμ=2ημν1\{\gamma^\mu, \gamma^\nu\} \equiv \gamma^\mu \gamma^\nu + \gamma^\nu \gamma^\mu = 2\eta^{\mu\nu} \mathbb{1}

This is the Clifford algebra of Minkowski space. Any set of 4×4 matrices satisfying these relations works; different choices are called “representations” or “bases.”

Chiral (Weyl) Basis

A convenient choice:

γ0=(0110),γi=(0σiσi0)\gamma^0 = \begin{pmatrix} 0 & \mathbb{1} \\ \mathbb{1} & 0 \end{pmatrix}, \qquad \gamma^i = \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix}

(4×4 matrices built from 2×2 blocks, with σi\sigma^i the Pauli matrices.) In this basis, the Dirac spinor naturally splits into its left- and right-handed components.

The Chirality Operator

Define:

γ5iγ0γ1γ2γ3\gamma^5 \equiv i\gamma^0 \gamma^1 \gamma^2 \gamma^3

Properties:

  • (γ5)2=1(\gamma^5)^2 = 1
  • {γ5,γμ}=0\{\gamma^5, \gamma^\mu\} = 0
  • Eigenvalues ±1\pm 1

In the chiral basis:

γ5=(1001)\gamma^5 = \begin{pmatrix} -\mathbb{1} & 0 \\ 0 & \mathbb{1} \end{pmatrix}

So γ5\gamma^5 distinguishes left-handed (γ5=1\gamma^5 = -1) from right-handed (γ5=+1\gamma^5 = +1) components. Projection operators:

PL=12(1γ5),PR=12(1+γ5)P_L = \tfrac{1}{2}(1 - \gamma^5), \qquad P_R = \tfrac{1}{2}(1 + \gamma^5)

extract the left- and right-handed parts: ψL=PLψ\psi_L = P_L \psi, ψR=PRψ\psi_R = P_R \psi.

Dirac Adjoint

For spinor Lagrangians to be Lorentz scalars, we need a specific combination of complex conjugation and multiplication by γ0\gamma^0:

ψˉψγ0\bar\psi \equiv \psi^\dagger \gamma^0

Then ψˉψ\bar\psi \psi (a product of a row and column of spinors) is a Lorentz scalar, and ψˉγμψ\bar\psi \gamma^\mu \psi is a Lorentz four-vector. These are the invariants we can put in the Lagrangian.


9. The Dirac Equation

With spinors in hand, we can write the field equation for a free spin-½ particle.

The Derivation

Dirac’s challenge (1928): write a Lorentz-invariant wave equation that is first-order in derivatives, so that particle energies are not ±\pm square roots (as in Klein-Gordon).

The guess: (iγμμm)ψ=0(i\gamma^\mu \partial_\mu - m)\psi = 0.

Check by squaring. Act on the equation with (iγνν+m)(i\gamma^\nu \partial_\nu + m):

(iγνν+m)(iγμμm)ψ=γνγμνμψm2ψ(i\gamma^\nu \partial_\nu + m)(i\gamma^\mu \partial_\mu - m)\psi = -\gamma^\nu \gamma^\mu \partial_\nu \partial_\mu \psi - m^2 \psi

Using γνγμ=12({γν,γμ}+[γν,γμ])\gamma^\nu \gamma^\mu = \tfrac{1}{2}(\{\gamma^\nu, \gamma^\mu\} + [\gamma^\nu, \gamma^\mu]) and that μν\partial_\mu \partial_\nu is symmetric (so the antisymmetric part drops):

γνγμνμψ=ημνμνψ=ψ-\gamma^\nu \gamma^\mu \partial_\nu \partial_\mu \psi = -\eta^{\mu\nu}\partial_\mu \partial_\nu \psi = -\Box \psi

So (iγνν+m)(iγμμm)ψ=(+m2)ψ(i\gamma^\nu\partial_\nu + m)(i\gamma^\mu \partial_\mu - m)\psi = -(\Box + m^2)\psi.

If ψ\psi satisfies the Dirac equation, it automatically satisfies Klein-Gordon. Good; the right relativistic dispersion E2=p2+m2E^2 = p^2 + m^2 is built in.

The Dirac Equation

(iγμμm)ψ=0\boxed{(i\gamma^\mu \partial_\mu - m)\psi = 0}

Or using the “Feynman slash” notation \slashedγμμ\slashed{\partial} \equiv \gamma^\mu \partial_\mu:

(i\slashedm)ψ=0(i\slashed{\partial} - m)\psi = 0

The Dirac Lagrangian

LDirac=ψˉ(i\slashedm)ψ\mathcal{L}_{\text{Dirac}} = \bar\psi(i\slashed{\partial} - m)\psi

Varying with respect to ψˉ\bar\psi gives the Dirac equation. Varying with respect to ψ\psi gives the conjugate equation:

iμψˉγμ+mψˉ=0i\partial_\mu \bar\psi \gamma^\mu + m\bar\psi = 0

The Spin Comes for Free

The Dirac equation doesn’t postulate spin. It falls out automatically because ψ\psi has four components and the Lorentz transformation law for spinors includes the Pauli matrices. Solutions turn out to come in spin-up and spin-down varieties; two spin states per particle.

Antimatter Falls Out Too

Plane-wave solutions to the Dirac equation come in two types, corresponding to positive and negative frequency. Dirac initially interpreted the negative-energy solutions as a “sea” of filled states, with antimatter being holes in the sea. The modern interpretation: negative-frequency solutions describe antiparticles.

A free electron has two spin states. An antiparticle (positron) also has two spin states. Total: four states per momentum, matching the four components of the Dirac spinor. The mathematical structure of the Dirac equation; dictated by Lorentz invariance and first-order derivatives; requires antiparticles. Dirac’s prediction of the positron is one of the most beautiful examples of mathematics anticipating experiment.

Plane-Wave Solutions

Try ψ(x)=u(p)eipx\psi(x) = u(p) e^{-ip\cdot x} with pμ=(E,p)p^\mu = (E, \vec p):

(\slashedpm)u(p)=0(\slashed{p} - m) u(p) = 0

This is an algebraic equation for the four-component spinor u(p)u(p). Solutions exist when p2=m2p^2 = m^2 (on shell). For each such pp, there are two linearly independent solutions corresponding to the two spin states.

Similarly, antiparticles: ψ(x)=v(p)e+ipx\psi(x) = v(p) e^{+ip\cdot x} with (\slashedp+m)v(p)=0(\slashed{p} + m) v(p) = 0, also two spin states.

Chirality and the Dirac Equation

Project onto left- and right-handed components:

i\slashedψL=mψRi\slashed{\partial} \psi_L = m \psi_R

i\slashedψR=mψLi\slashed{\partial} \psi_R = m \psi_L

The mass term couples ψL\psi_L and ψR\psi_R! For a massless particle (m=0m = 0), left- and right-handed components decouple, and you have two independent Weyl equations. This is exactly what’s needed for neutrinos in the original Standard Model (massless and left-handed only).

For massive particles, ψL\psi_L and ψR\psi_R are coupled, and the relevant description is the full Dirac equation.

Helicity vs. Chirality

Two closely related but distinct concepts:

  • Helicity: spin projected along momentum direction. Frame-dependent for massive particles.
  • Chirality: the eigenvalue of γ5\gamma^5. Frame-independent but not conserved unless m=0m = 0.

For massless particles, chirality = helicity. For massive particles, they coincide only at very high energies.


10. Quantum Electrodynamics as a Classical Field Theory

Now we have all the pieces to write QED at the classical level.

Coupling Dirac Fermions to EM

The Dirac Lagrangian has a global U(1)U(1) symmetry: ψeiαψ\psi \to e^{i\alpha}\psi. Promote to local: ψeieQα(x)ψ\psi \to e^{ieQ\alpha(x)}\psi where QQ is the electric charge in units of ee (for the electron, Q=1Q = -1).

Replace μ\partial_\mu with the covariant derivative Dμ=μ+ieQAμD_\mu = \partial_\mu + ieQA_\mu.

The QED Lagrangian

LQED=ψˉ(i\slashedDm)ψ14FμνFμν\boxed{\mathcal{L}_{\text{QED}} = \bar\psi(i\slashed{D} - m)\psi - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu}}

Expanding \slashedD=γμ(μ+ieQAμ)\slashed{D} = \gamma^\mu(\partial_\mu + ieQ A_\mu):

LQED=ψˉ(i\slashedm)ψeQψˉγμψAμ14FμνFμν\mathcal{L}_{\text{QED}} = \bar\psi(i\slashed{\partial} - m)\psi - eQ\bar\psi \gamma^\mu \psi A_\mu - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu}

Three terms:

  • Free Dirac Lagrangian
  • Interaction: eQAμψˉγμψ-eQA_\mu \bar\psi \gamma^\mu \psi. The quantity jμ=ψˉγμψj^\mu = \bar\psi\gamma^\mu \psi is the electromagnetic current; the electron couples to the photon through this current.
  • Free Maxwell Lagrangian

That’s QED at the classical level. Every precision prediction of QED; anomalous magnetic moments, Lamb shift, pair production; is derived by quantizing this Lagrangian.

The QED Vertex

The interaction term has exactly one photon and two fermion lines. In quantum theory, this becomes the QED vertex: an electron line emits or absorbs a photon, nothing else. Every QED Feynman diagram is built from this single vertex.

Equations of Motion

For ψ\psi: (i\slashedDm)ψ=0(i\slashed{D} - m)\psi = 0 (covariant Dirac equation).

For AμA_\mu: μFμν=eQψˉγνψ=jν\partial_\mu F^{\mu\nu} = eQ \bar\psi \gamma^\nu \psi = j^\nu (Maxwell’s equations with Dirac current as source).

These are coupled, nonlinear, and in general impossible to solve exactly. QED calculations proceed perturbatively in ee (or equivalently α=e2/4π\alpha = e^2/4\pi), generating the Feynman diagram expansion.

Gauge Invariance Check

ψˉγμψ\bar\psi \gamma^\mu \psi is a Noether current; it’s automatically conserved (μjμ=0\partial_\mu j^\mu = 0 on the equations of motion). This is necessary for Maxwell’s equations to be consistent: μνFμν=0\partial_\mu \partial_\nu F^{\mu\nu} = 0 identically by antisymmetry, so the source jνj^\nu must be conserved.

Current Structure

The form ψˉγμψ\bar\psi \gamma^\mu \psi is a vector current; it transforms as a Lorentz four-vector. Its time component is the charge density ψˉγ0ψ=ψψ\bar\psi\gamma^0\psi = \psi^\dagger \psi.

QED’s interaction is purely vectorial; the same coupling for left- and right-handed electrons. Parity is conserved in QED.

Compare: the weak interaction uses ψˉγμ(1γ5)ψ\bar\psi \gamma^\mu(1 - \gamma^5)\psi; a V-A current (vector minus axial vector); which couples only to left-handed fermions. This is why the weak interaction violates parity.


11. The Higgs Mechanism

We return to spontaneous symmetry breaking, now with a gauge symmetry. The results are dramatically different.

The Setup: U(1) Gauge Theory with Scalar

Take scalar QED with a Mexican-hat potential:

L=(Dμϕ)(Dμϕ)V(ϕϕ)14FμνFμν\mathcal{L} = (D_\mu \phi)^*(D^\mu \phi) - V(\phi^*\phi) - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu}

with V=μ2ϕϕ+λ(ϕϕ)2V = -\mu^2 \phi^*\phi + \lambda(\phi^*\phi)^2, so the minimum is at ϕ=v/2|\phi| = v/\sqrt{2}.

Choice of Vacuum

Choose (say) ϕ0=v/2\phi_0 = v/\sqrt{2}, real. The U(1)U(1) symmetry is broken.

Parametrize Fluctuations

A clever parametrization: write

ϕ(x)=12[v+h(x)]eiθ(x)/v\phi(x) = \frac{1}{\sqrt{2}}[v + h(x)] e^{i\theta(x)/v}

Here hh is a real field (fluctuation along the radial direction) and θ\theta is a real field (fluctuation along the tangential/phase direction). The Goldstone field is θ\theta.

The Gauge Choice That Eliminates θ\theta

In a gauge theory, we’re allowed to make a gauge transformation. Choose the gauge α(x)=θ(x)/(ev)\alpha(x) = \theta(x)/(ev). Then:

ϕ(x)eiα(x)ϕ(x)=12[v+h(x)]\phi(x) \to e^{-i\alpha(x)}\phi(x) = \frac{1}{\sqrt{2}}[v + h(x)]

After this gauge transformation, ϕ\phi is purely real. The Goldstone θ\theta has vanished from ϕ\phi.

But: the gauge field transforms too. Working through:

AμAμ+1evμθA_\mu \to A_\mu + \frac{1}{ev}\partial_\mu \theta

So θ\theta reappears, absorbed into AμA_\mu.

This gauge choice; where we “eat” the Goldstone; is called unitary gauge.

What Happens to the Lagrangian

Substitute ϕ=(v+h)/2\phi = (v + h)/\sqrt{2} (real) into the Lagrangian and expand. The kinetic term (Dμϕ)(Dμϕ)(D_\mu \phi)^*(D^\mu \phi) produces:

(Dμϕ)(Dμϕ)=12(μh)(μh)+12e2(v+h)2AμAμ(D_\mu \phi)^*(D^\mu \phi) = \tfrac{1}{2}(\partial_\mu h)(\partial^\mu h) + \tfrac{1}{2} e^2 (v + h)^2 A_\mu A^\mu

Expand the second term:

12e2(v+h)2AμAμ=12e2v2AμAμ+e2vhAμAμ+12e2h2AμAμ\tfrac{1}{2} e^2(v + h)^2 A_\mu A^\mu = \tfrac{1}{2}e^2 v^2 A_\mu A^\mu + e^2 v h A_\mu A^\mu + \tfrac{1}{2}e^2 h^2 A_\mu A^\mu

The first term is a mass term for AμA_\mu!

mA2=e2v2m_A^2 = e^2 v^2

The gauge boson has acquired a mass; despite the fact that explicit mass terms for AμA_\mu are forbidden by gauge invariance. The mass came from the gauge boson’s coupling to the Higgs VEV.

Counting Degrees of Freedom

Before symmetry breaking:

  • Complex scalar ϕ\phi: 2 degrees of freedom
  • Massless gauge field AμA_\mu: 2 degrees of freedom (transverse polarizations)
  • Total: 4

After symmetry breaking (unitary gauge):

  • Real Higgs field hh: 1 degree of freedom
  • Massive gauge field AμA_\mu: 3 degrees of freedom (transverse + longitudinal)
  • Total: 4

The would-be Goldstone boson became the longitudinal polarization of the massive gauge boson. “The gauge boson ate the Goldstone.”

Physical Consequences

The photon remains massless because the electromagnetic U(1)U(1) is not broken; only a different U(1)U(1) is, which isn’t electromagnetism.

The W and Z get their masses from the Higgs VEV via the analogous mechanism in SU(2)L×U(1)YSU(2)_L \times U(1)_Y.

MW=12gv,MZ=12g2+g2vM_W = \tfrac{1}{2} g v, \qquad M_Z = \tfrac{1}{2}\sqrt{g^2 + g'^2} v

The physical Higgs hh is the remaining scalar. Its mass comes from the shape of the potential:

mh2=2μ2=2λv2m_h^2 = 2\mu^2 = 2\lambda v^2

Measured at the LHC: mh125m_h \approx 125 GeV, with v246v \approx 246 GeV.

Fermion masses come from Yukawa couplings to the Higgs:

LYukawa=yfψˉϕψ\mathcal{L}_{\text{Yukawa}} = -y_f \bar\psi \phi \psi

When ϕ\phi gets its VEV, this becomes yfv/2ψˉψ-y_f v/\sqrt{2} \cdot \bar\psi \psi; a mass term mf=yfv/2m_f = y_f v/\sqrt{2}. Every fermion’s mass comes from its coupling to the Higgs field.

The Bigger Picture

The Higgs mechanism is the one known way to give gauge bosons masses without destroying gauge invariance. Without it, the Standard Model is mathematically sick above ~1 TeV (cross-sections violate unitarity). The 2012 Higgs discovery confirmed not just one more particle, but the consistency of the whole theoretical framework.

It’s worth pausing to appreciate what’s happening here. By positing one scalar field with the Mexican-hat potential and coupling it to the SU(2)×U(1)SU(2) \times U(1) gauge fields with the right charges, we get:

  • Photon: massless (good, reality)
  • W, Z: massive with specific predicted masses
  • Electron, quarks, etc.: masses proportional to their Yukawa couplings
  • A new scalar boson (the Higgs) with a specific mass from λ\lambda and vv

Every one of these predictions has been experimentally verified. The Standard Model is astonishingly tight.


12. The Standard Model Lagrangian

Here it is, piece by piece.

Gauge Structure

GSM=SU(3)C×SU(2)L×U(1)YG_{\text{SM}} = SU(3)_C \times SU(2)_L \times U(1)_Y

  • SU(3)CSU(3)_C: color (QCD). 8 gluons GμaG_\mu^a, coupling gsg_s.
  • SU(2)LSU(2)_L: weak isospin, acts only on left-handed fermions. 3 bosons WμaW_\mu^a, coupling gg.
  • U(1)YU(1)_Y: weak hypercharge. 1 boson BμB_\mu, coupling gg'.

After electroweak symmetry breaking, the physical gauge bosons are:

  • Photon AμA_\mu (massless)
  • Wμ±W^\pm_\mu (mass MWM_W)
  • ZμZ_\mu (mass MZM_Z)
  • 8 gluons GμaG_\mu^a (massless)

Matter Fields

Three generations of quarks and leptons, each in specific representations:

Left-handed quarks in (3,2)(3, 2) of SU(3)×SU(2)SU(3) \times SU(2) with hypercharge Y=+1/6Y = +1/6:

QL=(uLdL)Q_L = \begin{pmatrix} u_L \\ d_L \end{pmatrix}

Right-handed up-type quarks in (3,1)(3, 1), Y=+2/3Y = +2/3: uRu_R.

Right-handed down-type quarks in (3,1)(3, 1), Y=1/3Y = -1/3: dRd_R.

Left-handed leptons in (1,2)(1, 2), Y=1/2Y = -1/2:

LL=(νLeL)L_L = \begin{pmatrix} \nu_L \\ e_L \end{pmatrix}

Right-handed charged leptons in (1,1)(1, 1), Y=1Y = -1: eRe_R.

(No right-handed neutrinos in the original Standard Model. They’re added in extensions to give neutrinos mass.)

Three copies of this pattern for the three generations.

Higgs Field

Complex scalar doublet under SU(2)SU(2), hypercharge Y=+1/2Y = +1/2:

ϕ=(ϕ+ϕ0)\phi = \begin{pmatrix} \phi^+ \\ \phi^0 \end{pmatrix}

The Lagrangian

LSM=Lgauge+Lfermion+LHiggs+LYukawa\mathcal{L}_{\text{SM}} = \mathcal{L}_{\text{gauge}} + \mathcal{L}_{\text{fermion}} + \mathcal{L}_{\text{Higgs}} + \mathcal{L}_{\text{Yukawa}}

Gauge term:

Lgauge=14GμνaGaμν14WμνaWaμν14BμνBμν\mathcal{L}_{\text{gauge}} = -\tfrac{1}{4} G^a_{\mu\nu} G^{a\,\mu\nu} - \tfrac{1}{4} W^a_{\mu\nu} W^{a\,\mu\nu} - \tfrac{1}{4} B_{\mu\nu} B^{\mu\nu}

Kinetic terms for the gauge fields (with self-interactions for the non-abelian ones).

Fermion kinetic + gauge coupling:

Lfermion=ψψˉi\slashedDψ\mathcal{L}_{\text{fermion}} = \sum_\psi \bar\psi i \slashed{D}\psi

summed over ψ{QL,uR,dR,LL,eR}\psi \in \{Q_L, u_R, d_R, L_L, e_R\} for each generation. The covariant derivative DμD_\mu contains all gauge couplings appropriate to each field’s representation. For the left-handed quark doublet:

DμQL=(μigsGμaTCaigWμaTaigYQLBμ)QLD_\mu Q_L = \left(\partial_\mu - i g_s G_\mu^a T_C^a - i g W_\mu^a T^a - i g' Y_{Q_L} B_\mu\right) Q_L

Each fermion couples to gauge fields according to its charges under each group.

Higgs:

LHiggs=(Dμϕ)(Dμϕ)V(ϕ)\mathcal{L}_{\text{Higgs}} = (D_\mu \phi)^\dagger(D^\mu \phi) - V(\phi)

V(ϕ)=μ2ϕϕ+λ(ϕϕ)2V(\phi) = -\mu^2 \phi^\dagger \phi + \lambda(\phi^\dagger \phi)^2

Mexican hat.

Yukawa:

LYukawa=yeLˉLϕeRydQˉLϕdRyuQˉLϕ~uR+h.c.\mathcal{L}_{\text{Yukawa}} = -y_e \bar{L}_L \phi e_R - y_d \bar{Q}_L \phi d_R - y_u \bar{Q}_L \tilde\phi u_R + \text{h.c.}

(with ϕ~=iσ2ϕ\tilde\phi = i\sigma^2 \phi^* for giving mass to up-type quarks); one Yukawa coupling per fermion, per generation. The yy‘s are matrices when you include generation mixing, giving rise to CKM.

Parameter Count

The Standard Model has roughly 19 free parameters that must be measured:

  • 3 gauge couplings: gs,g,gg_s, g, g'
  • 6 quark masses: mu,md,mc,ms,mt,mbm_u, m_d, m_c, m_s, m_t, m_b
  • 3 charged lepton masses: me,mμ,mτm_e, m_\mu, m_\tau
  • 4 CKM parameters: 3 angles + 1 phase
  • 2 Higgs: vv, λ\lambda (or equivalently mhm_h, vv)
  • 1 strong CP phase: θQCD\theta_{\text{QCD}} (consistent with zero)

Extensions with neutrino masses add more parameters (Majorana or Dirac masses + PMNS matrix).

What’s Not Here

Gravity; cannot be added to this Lagrangian in a renormalizable way. Requires a separate theoretical framework (general relativity classically; quantum gravity remains unsolved).

Dark matter; no Standard Model candidate.

Neutrino masses; require extension.

Baryon asymmetry; SM CP violation is insufficient to explain it.

Understanding the Structure

Everything you know about particle physics is in this Lagrangian:

  • Photon-electron coupling? Comes from LˉLi\slashedDLL\bar L_L i \slashed{D} L_L expanded with ee-photon piece.
  • W boson? Comes from the ϕDμϕ\phi^\dagger D_\mu \phi kinetic term after electroweak symmetry breaking.
  • Neutron beta decay? Comes from a W boson exchange between a d-quark current and a lepton current, both drawn from Lfermion\mathcal{L}_{\text{fermion}}.
  • Quark-gluon interaction? Comes from QˉLi\slashedDQL\bar Q_L i\slashed{D} Q_L with the gluon piece of DD.
  • Higgs to bottom-quark decay? Comes from ybQˉLϕbRy_b \bar Q_L \phi b_R expanded around the Higgs VEV.

Every vertex in every Feynman diagram is in this Lagrangian. This one expression; maybe a single page of equations; encodes the deepest current understanding of matter.


Appendix: Formulas and Conventions

Notation Summary

SymbolMeaning
ϕ\phiScalar field
ψ\psiDirac spinor field
ψˉ=ψγ0\bar\psi = \psi^\dagger \gamma^0Dirac adjoint
AμA_\muGauge field (general)
FμνF_{\mu\nu}Field strength (abelian)
FμνaF^a_{\mu\nu}Field strength (non-abelian)
DμD_\muCovariant derivative
γμ\gamma^\muDirac gamma matrices
γ5\gamma^5Chirality operator
\slashedA=γμAμ\slashed{A} = \gamma^\mu A_\muFeynman slash
TaT^aGroup generators
fabcf^{abc}Structure constants

Key Lagrangians

Real scalar (Klein-Gordon):

L=12(ϕ)212m2ϕ2\mathcal{L} = \tfrac{1}{2}(\partial\phi)^2 - \tfrac{1}{2}m^2\phi^2

Complex scalar with U(1)U(1):

L=(ϕ)(ϕ)m2ϕϕ\mathcal{L} = (\partial\phi^*)(\partial\phi) - m^2\phi^*\phi

Maxwell (photon):

L=14FμνFμν\mathcal{L} = -\tfrac{1}{4}F_{\mu\nu}F^{\mu\nu}

Dirac (free fermion):

L=ψˉ(i\slashedm)ψ\mathcal{L} = \bar\psi(i\slashed{\partial} - m)\psi

Scalar QED:

L=(Dμϕ)(Dμϕ)V(ϕϕ)14F2\mathcal{L} = (D_\mu \phi)^*(D^\mu\phi) - V(\phi^*\phi) - \tfrac{1}{4}F^2

QED:

L=ψˉ(i\slashedDm)ψ14FμνFμν\mathcal{L} = \bar\psi(i\slashed{D} - m)\psi - \tfrac{1}{4}F_{\mu\nu}F^{\mu\nu}

Yang-Mills:

L=14FμνaFaμν\mathcal{L} = -\tfrac{1}{4}F^a_{\mu\nu}F^{a\,\mu\nu}

Gauge Transformation Rules

Abelian U(1)U(1):

ϕeieα(x)ϕ\phi \to e^{ie\alpha(x)}\phi

AμAμμαA_\mu \to A_\mu - \partial_\mu \alpha

Dμ=μ+ieAμD_\mu = \partial_\mu + ieA_\mu

Non-abelian (infinitesimal):

ϕϕ+igαaTaϕ\phi \to \phi + ig\alpha^a T^a \phi

δAμa=μαa+gfabcαbAμc\delta A_\mu^a = \partial_\mu \alpha^a + g f^{abc}\alpha^b A_\mu^c

Dμ=μ+igAμaTaD_\mu = \partial_\mu + ig A_\mu^a T^a

Gamma Matrix Identities

{γμ,γν}=2ημν\{\gamma^\mu, \gamma^\nu\} = 2\eta^{\mu\nu}

(γ0)=γ0,(γi)=γi(\gamma^0)^\dagger = \gamma^0, \quad (\gamma^i)^\dagger = -\gamma^i

γμγμ=4\gamma^\mu \gamma_\mu = 4

γμγνγμ=2γν\gamma^\mu \gamma^\nu \gamma_\mu = -2\gamma^\nu

γμγνγργμ=4ηνρ\gamma^\mu \gamma^\nu \gamma^\rho \gamma_\mu = 4\eta^{\nu\rho}

Tr(γμγν)=4ημν\text{Tr}(\gamma^\mu \gamma^\nu) = 4\eta^{\mu\nu}

Tr(γμγνγργσ)=4(ημνηρσημρηνσ+ημσηνρ)\text{Tr}(\gamma^\mu \gamma^\nu \gamma^\rho \gamma^\sigma) = 4(\eta^{\mu\nu}\eta^{\rho\sigma} - \eta^{\mu\rho}\eta^{\nu\sigma} + \eta^{\mu\sigma}\eta^{\nu\rho})

Tr(odd number of γ’s)=0\text{Tr}(\text{odd number of } \gamma\text{'s}) = 0

Chirality Projectors

PL=12(1γ5),PR=12(1+γ5)P_L = \tfrac{1}{2}(1 - \gamma^5), \quad P_R = \tfrac{1}{2}(1 + \gamma^5)

PL+PR=1,PL2=PL,PR2=PR,PLPR=0P_L + P_R = 1, \quad P_L^2 = P_L, \quad P_R^2 = P_R, \quad P_L P_R = 0

Common Bilinears

QuantityTransformation
ψˉψ\bar\psi\psiscalar
ψˉγ5ψ\bar\psi \gamma^5 \psipseudoscalar
ψˉγμψ\bar\psi \gamma^\mu \psivector
ψˉγμγ5ψ\bar\psi \gamma^\mu \gamma^5 \psiaxial vector
ψˉσμνψ\bar\psi \sigma^{\mu\nu}\psiantisymmetric tensor

(with σμν=i2[γμ,γν]\sigma^{\mu\nu} = \tfrac{i}{2}[\gamma^\mu, \gamma^\nu])

Standard Model Quantum Numbers

FieldSU(3)SU(3)SU(2)SU(2)U(1)YU(1)_YQ=T3+YQ = T_3 + Y
QL=(uL,dL)Q_L = (u_L, d_L)32+1/6(+2/3, −1/3)
uRu_R31+2/3+2/3
dRd_R31−1/3−1/3
LL=(νL,eL)L_L = (\nu_L, e_L)12−1/2(0, −1)
eRe_R11−1−1
ϕ=(ϕ+,ϕ0)\phi = (\phi^+, \phi^0)12+1/2(+1, 0)

Closing Note

What you have now is the complete classical foundation of modern particle physics. You can:

  • Read a Standard Model Lagrangian and understand each term
  • See why the photon is massless, why gauge bosons exist, why the Higgs field must be there
  • Understand that spin, antimatter, and parity violation are built into the mathematics
  • Recognize that what looks like “just a Lagrangian” is the result of three deep principles; Lorentz invariance, gauge symmetry, and renormalizability; working together to produce an almost unique theory

The next step is quantum field theory proper. That means:

  1. Canonical quantization of fields. Fields ϕ\phi, ψ\psi, AμA_\mu become operator-valued; their Fourier coefficients become creation and annihilation operators; states live in Fock space.

  2. The path integral formulation. An alternative (and in many ways cleaner) approach where quantum amplitudes are integrals of eiS/e^{iS/\hbar} over all field configurations.

  3. Perturbation theory and Feynman rules. Derive systematic rules for computing amplitudes from the Lagrangian, turning Feynman diagrams into actual calculations.

  4. Loop diagrams and renormalization. Deal with the infinities that appear in loop calculations by absorbing them into a few measured physical parameters.

  5. Specific computations. Cross sections, decay rates, running couplings, the full machinery of QED/QCD/electroweak predictions.

Peskin & Schroeder is the standard textbook; Srednicki and Schwartz are alternatives with different emphases. A rigorous QFT course is typically a full academic year. You now have the prerequisites to start it seriously; not skimming, but actually working through problems and following the derivations.

Whenever you’re ready, we can either:

  • Begin QFT proper (a much larger undertaking, likely several documents)
  • Go back and fill in any gaps you feel in what we’ve already covered
  • Work through specific calculations in this framework (deriving QED from symmetry principles, computing the Higgs mechanism in detail, etc.)
  • Branch into a different area entirely (statistical mechanics, general relativity, condensed matter)

You’ve built a foundation that most people never reach. The hardest conceptual work is done; from here it’s refinement and application.