QFT document 3: quantizing a gauge field, where redundancy becomes a feature and photons emerge as massless spin-1 excitations.
Documents 1 and 2 handled scalar and Dirac fields; relatively straightforward to quantize because their classical degrees of freedom are “physical” (no redundancy). The electromagnetic field is different. The classical has four components per spacetime point, but only two correspond to physical photon polarizations. Gauge invariance is what makes the extra components unphysical; and also what makes naive quantization fail.
This document develops three approaches to quantizing electromagnetism, each with different tradeoffs. By the end, you’ll have the photon propagator (which looks simple but hides subtleties) and the machinery needed to write down QED as a genuine interacting theory.
Prerequisites
- The scalar and Dirac quantization documents (documents 1 and 2)
- Classical field theory: the Lagrangian formulation of Maxwell’s equations, gauge invariance
- Covariant tensors: , four-potential , index gymnastics
Conventions
Same as documents 1 and 2:
- Metric
Table of Contents
- The Problem: Too Many Components
- Counting Physical Degrees of Freedom
- Approach 1: Coulomb Gauge
- Approach 2: Gupta-Bleuler Quantization
- Approach 3: Path Integral Preview (Faddeev-Popov)
- The Photon Propagator
- Polarization States and Helicity
- Gauge Fixing as Lagrange Modification
- Coupling to Matter: QED as a Field Theory
- Masslessness and Gauge Invariance
- Physical Content and What’s Next
- Appendix: Formulas and Identities
1. The Problem: Too Many Components
The Setup
Classical electromagnetism is described by the Maxwell Lagrangian:
with . The equations of motion (for a source ) are:
Naive Canonical Quantization Fails
Let’s try to proceed as with the scalar field. Conjugate momentum:
Immediately we hit a problem:
The momentum conjugate to is identically zero; not just small, not just weakly conserved, but structurally zero. So the canonical commutation relation cannot be imposed; it would read , which is inconsistent.
is not a dynamical degree of freedom. It’s a Lagrange multiplier; the equation of motion for gives Gauss’s law , a constraint, not a time evolution equation.
Gauge Redundancy
The deeper issue: even if we only had three components, that would still be too many. The gauge transformation
leaves and hence the physics unchanged. So at the classical level, itself is not observable; only gauge-invariant combinations are. We have a 1-parameter family of fields all describing the same physics.
Why This Matters
In ordinary quantum mechanics, we don’t worry about “unphysical” degrees of freedom because there aren’t any; every position and momentum is a genuine observable. The Dirac field in document 2 also had no redundancy; is well-defined (up to a global phase, which doesn’t count as redundancy).
But for gauge fields, we have genuine redundancy. If we don’t handle it, three things go wrong:
- Negative norm states. The timelike component has “wrong-sign” commutation relations, producing states with . Probabilities don’t make sense.
- Propagator singularity. The naive propagator has an un-invertible matrix structure; there’s no way to write it down cleanly.
- Wrong degree of freedom count. We need 2 polarizations for the photon, but naive quantization gives 4.
Gauge fixing is the procedure that resolves all three.
Three Approaches
Historically and pedagogically, there are three main approaches:
- Coulomb gauge: choose ; explicit, physical, but breaks manifest Lorentz invariance
- Gupta-Bleuler (Lorenz gauge): keep Lorentz invariance but work in an indefinite-metric Hilbert space, imposing subsidiary conditions to pick out physical states
- Path integral / Faddeev-Popov: the modern approach; preview here, full development later
Each has its uses. Physical results are the same; intermediate details differ dramatically.
2. Counting Physical Degrees of Freedom
Before diving into any approach, let’s count what we should get.
Starting Point
has 4 components at each spacetime point. If each were independent, a massless vector field would have 4 polarization states per momentum.
Gauge Redundancy
Gauge transformation is a one-parameter family of redundancies per spacetime point. This kills 1 degree of freedom.
Gauss’s Law Constraint
is a Lagrange multiplier, not dynamical. Its equation of motion (Gauss’s law) is a constraint on the physical states. This kills another degree of freedom.
Net Count
physical polarizations per momentum. ✓
These are the two transverse polarizations; the familiar horizontal and vertical polarizations of light, or equivalently left- and right-circular. In relativistic language, they correspond to helicity .
Why Massive Vectors Differ
A massive spin-1 particle (like the W or Z boson) has three polarization states: two transverse plus one longitudinal. The longitudinal mode exists because massive vectors aren’t gauge-invariant in the same way.
The massless limit involves the longitudinal mode decoupling from physical processes; one of those subtle “kinematic” effects that secretly involves the Higgs mechanism in the Standard Model. Massless means gauge-invariant means 2 polarizations. Massive means 3 polarizations, and the third has to come from somewhere (the Higgs).
3. Approach 1: Coulomb Gauge
The most physical approach: pick coordinates that explicitly separate physical and unphysical parts.
The Gauge Choice
Impose
This is the Coulomb gauge (also called transverse or radiation gauge). It doesn’t completely fix the gauge; you can still do time-dependent transformations that shift ; but combined with the boundary conditions at infinity (fields vanish), it uniquely determines the potential.
The Physical Degrees of Freedom
With , the vector potential is purely transverse. Fourier-decompose:
with (photons are massless).
The polarization vectors for are transverse:
and normalized: .
This is manifestly 2 polarizations per momentum; exactly what we wanted.
Commutation Relations
Canonical quantization on the transverse modes:
All others vanish. Same as scalar field commutators, labeled by polarization.
Hamiltonian
One-photon states:
have energy ; the dispersion relation of a massless particle. ✓
The Coulomb Interaction
In Coulomb gauge, is not a dynamical field. Its equation of motion is the Gauss’s law constraint:
This is the instantaneous Coulomb interaction. For a system of charges, the Hamiltonian contains a term:
That is, action at a distance, apparently instantaneous!
This looks like it violates causality, but in the full theory, radiation effects (retarded interactions through transverse photons) cancel the causality violation. Coulomb gauge is not manifestly Lorentz-invariant, but the physical predictions are.
Pros and Cons
Pros:
- Only physical degrees of freedom; manifestly positive-norm
- Explicit 2-photon counting
- Intuitive (transverse = radiation; = static Coulomb potential)
Cons:
- Not manifestly Lorentz-covariant; Lorentz transformations are “hidden”
- Instantaneous Coulomb term in the Hamiltonian looks superluminal (although physics isn’t)
- Awkward for relativistic calculations involving virtual photons
- The polarization vectors don’t transform as a four-vector
For practical QED calculations, Coulomb gauge is often cumbersome. We need a covariant approach.
4. Approach 2: Gupta-Bleuler Quantization
Sacrifice positive-norm Hilbert space (temporarily) to get manifest Lorentz covariance.
The Modified Lagrangian
Start with:
The second term; the gauge-fixing term; breaks gauge invariance explicitly. The parameter is a free choice (different values correspond to different gauges). Common choices:
- : Feynman gauge (simplest for calculations)
- : Landau gauge (manifestly transverse)
Equations of Motion
Vary with respect to :
In Feynman gauge ():
wait, that’s not quite right. Let me be careful. The equations of motion from the modified Lagrangian work out to:
In Feynman gauge , this simplifies beautifully:
Each component of obeys the same equation as a massless scalar field. That makes quantization easy.
Canonical Quantization in Feynman Gauge
Treat each component of as an independent field, like four massless scalars. Mode expansion:
Now runs from 0 to 3; four polarizations, corresponding to the four components of .
The polarization vectors :
- : timelike,
- : transverse spatial
- : longitudinal (along )
The Sign Problem
The commutation relations:
Note the minus sign from the metric tensor; specifically, the (timelike) mode has:
This is a wrong-sign commutator; the same disaster we saw when we tried commutators for fermions in document 2!
Consequence: states with timelike photons have negative norm:
Can’t be a probability.
The Gupta-Bleuler Subsidiary Condition
The fix: rather than making all states “physical,” define physical states by a subsidiary condition:
where is the positive-frequency part (containing only annihilation operators). This requires physical states to be annihilated by the Gauss’s law operator in the appropriate sense.
Physical states are superpositions of transverse photons. The timelike and longitudinal modes appear in physical states only in specific combinations (roughly, equal numbers of both) that have net zero norm contribution.
The physical Hilbert space:
- States containing only transverse photons have positive norm
- States with equal numbers of timelike and longitudinal photons have zero norm
- Other combinations are excluded by the subsidiary condition
Why This Works
The physical observables (cross-sections, decay rates) depend only on the transverse photons. The timelike and longitudinal photons are “ghosts” in the sense of unphysical degrees of freedom; they must be carried along for Lorentz covariance, but they don’t contribute to physical probabilities.
This is the Gupta-Bleuler approach (1950). Mathematically fiddly, but Lorentz-covariant.
Pros and Cons
Pros:
- Manifestly Lorentz-covariant at every step
- Simple propagator (Feynman gauge: )
- Four-polarization structure makes calculations algorithmic
Cons:
- Indefinite-metric Hilbert space (non-standard, conceptually uncomfortable)
- Only works for abelian gauge theory (QED); fails for Yang-Mills
- Hides the geometric content of gauge invariance
For QED, Gupta-Bleuler works. For QCD or electroweak theory, you need the path integral approach.
5. Approach 3: Path Integral Preview (Faddeev-Popov)
The modern approach, which we’ll develop fully when we get to path integrals.
The Idea
In the path integral, you sum over all field configurations weighted by . If the action is gauge-invariant, you’re over-counting: every physical configuration is represented infinitely many times (by all its gauge-equivalent partners).
The fix: insert a factor into the path integral that picks one representative from each gauge orbit; a procedure called gauge fixing. The price of doing this consistently is the introduction of Faddeev-Popov determinants (1967), which for non-abelian theories produce anticommuting scalar fields called ghosts.
The Result for QED (Preview)
For QED, the Faddeev-Popov procedure in a general gauge gives:
exactly the gauge-fixed Lagrangian from section 4. The ghosts decouple for abelian theories (they don’t interact with anything), so they play no physical role in QED.
For Yang-Mills (later document), ghosts are essential; they interact with gauge bosons through the structure constants and contribute to loop diagrams.
Pros and Cons
Pros:
- Most general; works for any gauge theory
- Manifestly Lorentz-covariant
- Geometric interpretation
- Essential for non-abelian theories
Cons:
- Requires path integral machinery (later doc)
- Introduces unphysical ghost fields
We’ll revisit this properly when we do path integrals. For now, just know the results of Faddeev-Popov and Gupta-Bleuler agree for QED, and the Gupta-Bleuler approach gives the same photon propagator as path-integral gauge-fixing.
6. The Photon Propagator
Definition
In Feynman Gauge
Same structure as a massless scalar propagator, but with the extra tensor structure reflecting the vector indices.
In momentum space:
Remarkably simple; and that simplicity is the main reason Feynman gauge is the standard choice for QED calculations.
In General Gauge
Setting recovers Feynman gauge. Setting gives Landau gauge (transverse propagator). Setting gives Yennie gauge. Different choices are convenient for different calculations.
Gauge-independence of physical results: Any physical observable (cross-section, decay rate) computed with the propagator must be independent of . If you compute a cross-section and get a -dependent answer, you made a mistake. This is a useful consistency check.
The Pole Structure
The photon propagator has a pole at ; exactly where a massless particle should have its mass shell. The residue of the pole encodes the photon’s interactions; gauge invariance ensures that only transverse polarizations contribute to physical processes.
Comparison Table
Propagators from documents 1-3, in momentum space (Feynman gauge for the photon):
| Field | Propagator |
|---|---|
| Real scalar | |
| Dirac fermion | |
| Photon |
The common structure: times the appropriate tensor factor for the spin. This pattern generalizes to any free particle.
7. Polarization States and Helicity
Two Physical Polarizations
For a photon with momentum (WLOG along ), the two physical polarizations are:
Linear polarizations:
- (polarization along )
- (polarization along )
Circular polarizations (eigenstates of angular momentum along ):
- (right-circular, helicity )
- (left-circular, helicity )
For a photon moving along , helicity is the spin projection along . Right-circular light has helicity (spin aligned with motion); left-circular has .
Why Not Helicity 0?
A massive spin-1 particle can have helicity . A massless one is forced to have helicity only. The absence of helicity 0 is why gauge symmetry is needed; without it, the helicity-0 mode would propagate, and you’d have a massless vector with three polarizations, which doesn’t make sense physically.
This is the content of “massless vector = gauge field”: massless = 2 polarizations = need gauge invariance to kill the third.
Photon as Force Carrier
When we couple the photon field to matter, photons exchanged between particles mediate the electromagnetic force. Off-shell (“virtual”) photons have and can carry non-transverse polarizations; on-shell (“real”) photons always have and transverse polarizations.
The distinction matters for calculations: internal photon lines in Feynman diagrams are off-shell (use propagator); external photon lines are on-shell (use polarization vectors satisfying ).
Polarization Sum
When computing cross-sections involving external photons, we often sum over final photon polarizations:
The and terms drop out for physical processes because of gauge invariance (Ward identity, which we’ll meet in document 4). So effectively:
for practical cross-section calculations. This replacement is one of the most commonly used shortcuts in QED.
8. Gauge Fixing as Lagrange Modification
Let’s step back and appreciate what gauge fixing really is.
The Principle
The Lagrangian is gauge-invariant. It has a redundancy; physically equivalent configurations are counted multiple times. To quantize, we need to either:
- Explicitly pick physical coordinates (Coulomb gauge)
- Add a non-invariant term to the Lagrangian that breaks the redundancy (covariant gauges)
Adding the term is an example of the second approach. The term is not gauge-invariant; it would naively be a disaster. But the point is: it’s picking one gauge from the family, and gauge-invariant quantities (physical observables) are independent of which gauge we chose.
Different Gauges for Different Problems
- Coulomb gauge; good for nonrelativistic QED (atomic physics), bound-state problems
- Feynman gauge; simplest propagator, standard for perturbative QED
- Landau gauge; manifestly transverse, useful when transversality matters
- Axial gauge ( or similar); no ghosts, but highly non-covariant
- Light-cone gauge (); used in QCD factorization theorems
Each has strengths. The ultimate test: all must give the same gauge-invariant answers.
BRST as the Modern View
A modern approach (Becchi-Rouet-Stora-Tyutin, 1975-1976) identifies a fermionic symmetry of the gauge-fixed Lagrangian; the BRST symmetry; that encodes what’s left of gauge invariance after fixing. Physical states are those annihilated by the BRST charge.
BRST is elegant and powerful, especially for non-abelian theories. We won’t develop it fully here, but be aware it exists and provides the most sophisticated framework for handling gauge theories.
9. Coupling to Matter: QED as a Field Theory
Now we have all the pieces. Putting fermions and photons together:
The QED Lagrangian
with (where is the electric charge in units of ; for the electron).
Expanding:
The last term is the interaction Lagrangian:
where is the electron’s electromagnetic current.
The QED Vertex
The interaction is a product of three fields: , , and . Each corresponds to a line in a Feynman diagram. The interaction term describes the vertex where these three lines meet:
- Two fermion lines (one , one )
- One photon line ()
- Coupling constant (in momentum space)
This single vertex is the entirety of QED interactions. Every QED process; , Compton scattering, the anomalous magnetic moment, the Lamb shift, everything; is built from combining copies of this vertex with propagators.
What We’ve Built
Three types of “lines” in QED Feynman diagrams:
| Line | Corresponds to |
|---|---|
| Electron (solid, arrowed) | Dirac fermion propagator |
| Photon (wavy) | Photon propagator |
| Vertex | QED interaction |
External lines represent real particles (on-shell). Internal lines represent virtual particles (off-shell). Loops involve integrating over internal momenta.
This is the graphical representation of the perturbative expansion we’ll develop in document 4.
Coupling Strength
The QED coupling is:
The small value of is why perturbation theory works so well in QED. Each extra vertex in a diagram adds a factor of , so higher-order diagrams are suppressed by powers of .
10. Masslessness and Gauge Invariance
A key feature: the photon is massless. This is tied deeply to gauge invariance.
Why No Mass Term?
A mass term for the photon would look like . Under a gauge transformation :
Not invariant. A photon mass breaks gauge symmetry explicitly.
Consequence: Gauge invariance forces the photon to be massless. Experimentally, the photon mass is bounded above by eV (astonishingly stringent). As far as we know, it’s exactly zero; consistent with exact gauge symmetry.
Exceptions: The Higgs Mechanism
The W and Z bosons are massive, even though they’re also gauge bosons (of ). They get their masses through the Higgs mechanism; spontaneous symmetry breaking in a way that doesn’t explicitly break gauge invariance.
The photon remains massless because the particular combination of and that survives the Higgs mechanism is still an unbroken gauge symmetry. The electromagnetic is exact; the weak is spontaneously broken.
This is why the photon is massless but W and Z are heavy; the Higgs picks the combination.
Long-Range Force
A massless force carrier gives rise to a long-range force ( potential). Electromagnetism has infinite range; because the photon is massless.
Compare: the weak force has range m because the W and Z have masses around 80-90 GeV. The strong force is complicated (confinement means gluons don’t propagate asymptotically) but at short distances it’s long-range like QED.
Goldstone’s Theorem and Gauge Bosons
A related fact from the classical field theory document: when a continuous global symmetry is spontaneously broken, you get a massless Goldstone boson. When a gauge symmetry is spontaneously broken, the would-be Goldstone is “eaten” by the gauge boson, which becomes massive. The photon stays massless because its gauge symmetry isn’t broken.
The photon’s masslessness, long range, and exact gauge invariance are all parts of the same story.
11. Physical Content and What’s Next
What We’ve Accomplished
- Recognized the problem of gauge redundancy; fewer physical degrees of freedom than naive field components
- Developed three approaches: Coulomb gauge, Gupta-Bleuler, Faddeev-Popov (preview)
- Derived the photon propagator in Feynman gauge:
- Identified the two physical polarizations and their helicity content
- Assembled the QED Lagrangian with the interaction vertex
- Understood why gauge invariance implies massless photons
The Three Free Fields Are Done
Documents 1-3 complete the free-field story:
| Spin | Field | Statistics | Special features |
|---|---|---|---|
| 0 | Scalar | Boson | Commutators, Fock space |
| 1/2 | Dirac | Fermion | Anticommutators, Pauli exclusion |
| 1 (massless) | Photon | Boson | Gauge fixing, 2 polarizations |
We can extend to higher spins (spin 3/2 Rarita-Schwinger, spin 2 graviton) and massive vectors (W, Z), but these all follow the patterns we’ve established.
What Comes Next
Document 4: Interacting Fields and Perturbation Theory. Until now, we’ve only quantized free fields. The interaction term couples them together, but we haven’t yet developed the tools to handle interactions. Document 4 introduces:
- The interaction picture of quantum mechanics
- Dyson’s formula for time evolution
- Wick’s theorem for contracting field products
- The LSZ reduction formula connecting correlation functions to scattering amplitudes
These are the mathematical prerequisites to actually computing anything.
Document 5: Feynman Diagrams and Tree-Level QED. With the machinery in place, we finally compute. Classic processes like , Compton scattering, and Møller scattering. Feynman rules derived from first principles. Trace technology. Cross-sections extracted from amplitudes.
This is where QFT becomes physics rather than mathematical framework.
The Big Picture So Far
Quantum field theory, assembled from:
- Relativistic field Lagrangians (from the classical field theory document)
- Canonical quantization procedure (promoting fields to operators)
- Commutators or anticommutators depending on spin (spin-statistics)
- Gauge fixing for gauge fields
- Particles as excitations of fields; creation and annihilation operators
- Propagators encoding the two-point correlation functions
And the coming ingredients:
- Perturbation theory for interactions (document 4)
- Feynman diagrams as graphical representation (document 5)
- Loop integrals and regularization (document 6)
- Renormalization (document 7)
- Path integrals (documents 9-10)
- Yang-Mills and the Standard Model (documents 11-12)
You’re a quarter of the way through. Keep going.
Appendix: Formulas and Identities
The QED Lagrangian
Gauge Fixing Terms
| Name | Features | |
|---|---|---|
| 1 | Feynman | Simplest propagator |
| 0 | Landau | Manifestly transverse |
| 3 | Yennie | Sometimes used in bound-state calculations |
Photon Propagator
Feynman gauge:
General :
Polarization Vectors
Transverse polarizations for along :
Circular polarizations (helicity eigenstates):
Satisfying and .
Polarization Sum
For on-shell external photons:
These extra terms vanish when contracted with physical amplitudes (Ward identity), so effectively:
The QED Vertex
In position space, the interaction is . In momentum space (for Feynman rules):
For the electron (): . For other charged fermions, use appropriate .
Fine-Structure Constant
In natural units with : .
Feynman Rules Summary (QED)
From the Lagrangian, the Feynman rules for QED:
| Element | Rule |
|---|---|
| Fermion line (internal) | |
| Photon line (internal, Feynman gauge) | |
| Fermion-photon vertex | |
| External electron | or |
| External positron | or |
| External photon | or |
| Loop momentum | |
| Fermion loop | Extra factor |
These will be derived rigorously in document 5. For now, they’re a preview of where we’re heading.
Ward Identity (Preview)
A crucial identity from QED, following from gauge invariance:
where is an amplitude with one external photon of momentum . Ward identities ensure that gauge-dependent propagator terms drop out of physical predictions and that the photon only has 2 physical polarizations.
We’ll develop Ward identities properly in later documents. Their existence is what makes QED calculationally tractable.
Closing Note
Document 3 completes the free-field trilogy. With scalars, fermions, and photons in hand, we have the ingredients for QED; the most precisely tested theory in physics.
Key Takeaways
Gauge redundancy is real. has 4 components but the physics has only 2. Gauge fixing is how we handle this tension while maintaining computability.
Multiple gauge choices exist. Coulomb, Feynman, Landau, axial, light-cone; each is appropriate for different problems. Physical results don’t depend on the choice.
The photon propagator is clean. In Feynman gauge, . The structure matches what you’d expect for a massless vector field.
Gauge invariance forces masslessness. The photon is massless because electromagnetism has exact gauge invariance. The W and Z are massive because the electroweak gauge invariance is spontaneously broken.
QED is now assembled as a Lagrangian. We have the fermion kinetic term, photon kinetic term, gauge fixing, and interaction vertex. What’s missing is the machinery to actually compute things; perturbation theory.
Where We’re Going
The next document is the computational heart of QFT: how do you actually calculate things in an interacting theory? The answer is perturbation theory, and specifically the Dyson expansion plus Wick’s theorem plus the LSZ reduction formula. All three are genuinely beautiful pieces of mathematics that connect the operator formalism we’ve developed to actual scattering amplitudes.
After that, Feynman diagrams; the graphical representation of these perturbative calculations. And then we finally compute cross-sections for real processes.
You’ve built the foundation. The building starts going up.
Ghosts pending.