QFT document 7: the systematic procedure that turns infinite loop integrals into finite physical predictions; and reveals that “bare” and “physical” parameters are different things.
Document 6 exposed the problem: loop integrals in QFT are generally divergent, and naive perturbation theory gives infinite answers for physical observables. Document 6 also gave us regularization; tools for making the divergences finite and manipulable.
This document completes the story. Renormalization is the procedure that systematically absorbs divergences into redefinitions of the parameters of the theory (mass, coupling, field strength), leaving finite, well-defined predictions for all physical observables.
Historically, this was one of the most conceptually difficult developments in physics. Dirac famously complained that renormalization was “sweeping infinities under the rug.” What changed the view was Wilson’s reformulation in the 1970s, which showed that renormalization isn’t a trick but a deep statement about effective field theories and scales. Document 8 will develop Wilson’s picture fully. This document focuses on the procedure: how you actually do renormalization in practice.
Prerequisites and Conventions
- QFT documents 1-6
- Familiarity with loop integrals, dimensional regularization, and the three one-loop QED diagrams
- Same conventions: mostly-minus metric,
Table of Contents
- The Problem Restated
- Bare vs. Renormalized: The Key Distinction
- Counterterms: The Mechanical Procedure
- Renormalization Conditions
- On-Shell Renormalization
- Minimal Subtraction ()
- QED Renormalization at One Loop
- The Ward-Takahashi Identity:
- Renormalization to All Orders: Consistency
- Power Counting and Renormalizability
- Non-Renormalizable Theories as EFTs
- Physical Meaning: What Is Renormalization Really Doing?
- Appendix: Renormalization Formulas
1. The Problem Restated
After one-loop calculations, we have divergent expressions:
Electron self-energy:
Vacuum polarization:
Vertex correction:
(With specific coefficients from document 6.)
These are all poles (in dimensional regularization). Something needs to be done to extract physical predictions.
The Key Observation
Notice what the divergences aren’t: they’re not new operator structures. The self-energy has the structure and ; the same structures as in the original Lagrangian. The vacuum polarization has the gauge-invariant structure , same structure as the photon kinetic term . The vertex correction is , same structure as the QED interaction .
Every divergence matches an operator already in the Lagrangian. This is the miracle of renormalizable theories; the divergent corrections have the same form as terms already there. We can therefore absorb them by redefining the coefficients of those terms.
2. Bare vs. Renormalized: The Key Distinction
The Two Sets of Parameters
Introduce two different sets of parameters:
Bare parameters; the ones appearing in the “original” Lagrangian. Let’s call them , , and let the bare fields be , . These are the symbolic quantities in the Lagrangian.
Renormalized (or physical) parameters; what you actually measure. Let’s call them , , , (no subscript).
The bare and renormalized quantities are related by:
where are renormalization constants, and is the mass counterterm. Each is 1 at tree level and receives corrections at each loop order.
Why Two Sets?
Here’s the conceptual point: the bare parameters are formal; they’re just symbols in the Lagrangian, and their numerical values depend on the regularization scheme. The renormalized parameters are what you actually measure in experiments.
In the regularized theory (say, dimensional regularization with finite ):
- Bare parameters , are finite
- Renormalized parameters , are also finite
- The relationship between them contains terms that diverge as
As you take the regulator away ():
- The renormalized parameters remain finite (they’re what experiments measure)
- The bare parameters must diverge in a specific way to compensate
- This divergence is exactly what absorbs the loop divergences
The divergence isn’t a problem; it’s a feature. The bare parameters were never physical to begin with. Only the renormalized parameters need to be finite.
Mental Model
Imagine a theory where the electron’s “raw” mass (the in the Lagrangian) is actually infinite, but surrounded by a cloud of virtual particles that contribute a subtraction, leaving the observed mass finite. The divergence in is the “correction from the virtual cloud.”
This picture is qualitatively right but quantitatively misleading; the actual picture requires Wilson’s effective-field-theory view (document 8). But it captures the idea: the bare parameters and renormalized parameters are different things, and only the renormalized ones are measurable.
A Technical Note
In dimensional regularization, the renormalization constants take the form:
The pole is divergent as . The finite part depends on the renormalization scheme (below).
3. Counterterms: The Mechanical Procedure
Splitting the Lagrangian
Write the Lagrangian in terms of bare fields, then substitute the bare-to-renormalized relations:
With , , , :
Now write :
The first piece is the “physical” Lagrangian; identical to the original, but with renormalized parameters and fields. The second piece is the counterterm Lagrangian; extra vertices and corrections whose job is to cancel the loop divergences.
Counterterm Feynman Rules
Each counterterm corresponds to a new Feynman rule. At order :
-
Electron counterterm : an “insertion” in an electron propagator line that contributes (after simplification).
-
Photon counterterm : an insertion in a photon propagator that contributes .
-
Vertex counterterm : an extra vertex with factor .
(Different textbook conventions give these with slightly different signs or factors; I’m using Peskin-like conventions.)
The Key Idea
The counterterms are chosen such that they cancel the poles in the one-loop calculations. Since the counterterms have the same form as the loop corrections (both give contributions with , , , structures), they can match pole-for-pole.
The counterterm coefficients are therefore divergent ( poles) in a specific way that mirrors the loop divergences.
Order-by-Order
- At tree level: , . Just the original Lagrangian.
- At one loop: .
- At two loops: receives further corrections at .
- At loops: at .
Each order of perturbation theory requires its own adjustments to the counterterms.
4. Renormalization Conditions
There’s something still unfixed: which finite parts do the counterterms absorb? The poles are forced (to cancel the divergences), but the finite parts can be anything. Different choices give different renormalization schemes.
The Choice to Make
The renormalization conditions are a set of rules fixing the finite parts of the counterterms. They’re typically defined by specifying the values of physical (renormalized) parameters at specific kinematic points.
Common Schemes
On-shell (OS) scheme: Define parameters by their “on-mass-shell” values. The renormalized mass equals the physical pole mass; the renormalized coupling equals the coupling measured at zero momentum transfer (Thomson limit in QED). Most physical, but technically more complex.
Minimal subtraction (MS): Absorb only the pole (and nothing else) into the counterterms. The finite parts stay in the physical amplitude. Simplest, but the “renormalized coupling” is a formal parameter, not directly measured.
Modified minimal subtraction (): Absorb the pole plus specific “universal” finite pieces () that always accompany the pole in dim reg. Standard in modern calculations.
Momentum subtraction (MOM): Define the coupling by matching to the amplitude at specific momenta. Used in some QCD contexts.
Physical Equivalence
Important: All schemes give the same physical predictions. The “coupling at a given scale” differs between schemes, but observable quantities (cross sections, decay rates) are scheme-independent; once you express everything in the same scheme consistently.
The scheme is like a gauge in gauge theory; a choice that simplifies calculations but doesn’t affect physics.
Why Multiple Schemes Exist
Different schemes are convenient for different problems:
- On-shell: good for theories with few masses, gives a “physical” interpretation
- : good for high-energy calculations where masses are neglected; standard in QCD
- MOM: useful when there are specific kinematic regions you want to emphasize
Modern practice: compute in , convert to on-shell (or other schemes) when comparing to specific experiments.
5. On-Shell Renormalization
The most physically intuitive scheme. We’ll use QED to illustrate.
The Renormalization Conditions (On-Shell)
Condition 1: Pole of the electron propagator at .
The full electron propagator in the interacting theory is:
Where is the electron self-energy (including all loop corrections). On-shell, we demand:
The first condition says the pole is at the physical mass (not some shifted location). The second says the residue of the pole is 1; no wave function modification at the pole.
Condition 2: Photon propagator pole at .
The full photon propagator includes vacuum polarization:
(Schematic; see document 6.) On-shell condition: residue at is 1, i.e.:
This ensures the photon is exactly massless and has unit residue at the pole.
Condition 3: Coupling at (Thomson limit).
The QED vertex with all corrections is . At on-shell electrons and zero photon momentum, we demand:
The coupling at zero momentum transfer equals the classical electron charge; the Thomson limit defines .
Determining Counterterms
These three conditions determine the three counterterms (and implicitly ) uniquely at each order in perturbation theory.
At One Loop: The Result
Working through the one-loop QED diagrams with these conditions (I won’t do the full calculation; Peskin Chapter 10 does):
(Values are scheme-dependent; these use on-shell conditions.)
Note: at one loop (and all orders; this is the Ward-Takahashi identity, section 8).
Physical Consequence
With these renormalization conditions:
- The electron pole is at the measured mass (by definition)
- The photon is massless (by gauge invariance + condition)
- The coupling is defined at
All physical quantities computed from the theory are finite and match experiments.
6. Minimal Subtraction ()
The dimensionally-regularized version of renormalization. Simpler to use in practice, especially for high-energy calculations.
The Prescription
MS scheme: Each counterterm absorbs exactly the pole, nothing else:
scheme: Absorb the pole plus a universal finite piece that always accompanies poles in dim-reg integrals. Define by rescaling the dimensional-regularization scale:
Then absorbing the pole gets rid of the factors too.
The resulting scheme is very clean; coefficients are just the leading-log parts of physical quantities.
Comparison to On-Shell
In on-shell, the mass is the physical pole mass; directly measurable.
In , the mass is a formal parameter that depends on the renormalization scale . It’s not directly measurable but is useful for calculation.
Conversion formula (one-loop):
(For QCD masses; analogous for other theories.)
Why Is Standard
Simpler calculations. In , you don’t have to evaluate complicated kinematic functions at specific points. Just subtract the pole.
Systematic. Every quantity in is defined by a clear rule. No human judgement required.
RG-friendly. The renormalization group equations (document 8) take their cleanest form in .
Good for high energies. When particle masses are negligible compared to the relevant energy scale, on-shell subtraction gets clumsy while remains simple.
Modern particle physics papers overwhelmingly report results in , with conversion to on-shell for specific comparisons.
An Important Point
In , the “renormalized coupling” is explicitly -dependent. The running coupling; the fact that the measured strength of the interaction depends on the probe energy; is encoded in this -dependence. Document 8 will make this the central concept.
7. QED Renormalization at One Loop
Let me assemble the full story with explicit formulas.
The Renormalized Lagrangian
One-Loop Counterterm Values (On-Shell)
| Parameter | Divergent part | Physical interpretation |
|---|---|---|
| Electron field-strength renormalization | ||
| Photon field-strength renormalization | ||
| Vertex renormalization | ||
| Mass counterterm |
The Bare Coupling
From :
Expanding to first order in the small :
With (Ward-Takahashi):
This is the bare coupling in terms of the renormalized one. The divergence is absorbed by the bare coupling becoming formally infinite.
Physical Amplitudes
Every physical amplitude, computed using:
- The renormalized Lagrangian
- Plus counterterms
- Matched at one-loop
gives a finite result. All poles from loop diagrams are canceled by matching poles from counterterms.
Explicit Example: Electron Self-Energy
Full electron self-energy at one loop:
Where:
- is the one-loop integral (divergent)
- is the counterterm contribution
By the on-shell condition, these two pieces sum to zero at , and the renormalized self-energy vanishes at the pole. For , you get a finite, calculable correction.
This gives the electron propagator its full structure: free propagator + small finite corrections from loops, all well-defined.
8. The Ward-Takahashi Identity:
A remarkable relation that makes QED cleaner than generic theories.
The Statement
The coupling renormalization equals the electron wave function renormalization.
Physical Content
From our formula , if :
The electron charge is only affected by the photon field strength renormalization. The electron’s self-energy and vertex modifications don’t contribute to charge renormalization; they cancel exactly.
Why This Is True
The Ward-Takahashi identity:
relates the vertex function to the propagator. It comes from gauge invariance of the underlying theory.
At (zero photon momentum), setting :
But taking the derivative with respect to :
This relates the residue of the vertex at zero momentum to the derivative of the inverse propagator; which, on-shell, determines . The conclusion: the vertex renormalization at equals the wave function renormalization. Hence .
Universal Consequence
This is important because it means:
The electron charge renormalization depends only on the photon self-energy.
So if you measured charge renormalization from scattering (electron vertex correction) vs. scattering (electron self-energy) vs. Thomson scattering (photon polarization), you’d get the same answer. This is a consistency check that QED passes beautifully.
Why Universal Charge Screening?
Here’s the remarkable physical consequence of : every charged particle in QED is screened by exactly the same factor (coming from ). The electron charge, the muon charge, the W-boson charge; all are renormalized identically.
This is why electric charge is exactly conserved: the one-loop corrections to charge are universal. If this weren’t true, different particles would have different renormalized charges, and charge wouldn’t be a consistent concept.
In Non-Abelian Theories
In non-abelian gauge theories (Yang-Mills), there are multiple analogous identities called Slavnov-Taylor identities. They’re more complex (because of the self-interactions of gauge bosons and the presence of ghosts) but play a similar role; ensuring gauge invariance of the renormalized theory.
This will come up in document 11 when we quantize Yang-Mills.
9. Renormalization to All Orders: Consistency
The one-loop story is clean. What about higher orders?
The Question
At two loops, you get new divergent diagrams. Do the counterterms from one loop plus additional two-loop counterterms absorb the new divergences?
The Answer: Yes, If Renormalizable
For a renormalizable theory like QED, at every order in perturbation theory:
- New divergences appear
- All new divergences have the same operator structures as the original Lagrangian
- They can be absorbed by adjusting the same counterterms order-by-order
Schematically:
At each order, the new contribution is determined by matching renormalization conditions.
The Non-Trivial Check
The non-trivial statement is that this works; that the counterterm structure is consistent at all orders. This was shown by Dyson (1949), ‘t Hooft and Veltman (for gauge theories, 1972), and many others.
The key technical results:
BPHZ theorem (Bogoliubov-Parasiuk-Hepp-Zimmermann): a rigorous proof that the counterterm procedure is consistent and produces finite results at all orders for renormalizable theories.
‘t Hooft-Veltman theorem: gauge theories like QED are renormalizable, with the counterterm structure preserving gauge invariance at all orders.
These results are reasonably technical (BPHZ requires careful handling of “nested” and “overlapping” divergences). The bottom line is: renormalization isn’t a trick that happens to work at one loop. It’s a systematic procedure that works to arbitrarily high orders.
Nested vs. Overlapping Divergences
At two loops and beyond, some divergences come from “inner” loops that are themselves divergent, plus “outer” loops that contain the inner ones. Handling these systematically requires care:
Nested divergence: A sub-diagram that’s divergent, appearing inside a larger diagram. Overlapping divergence: Two sub-diagrams that share lines, both divergent.
The BPHZ theorem essentially says: subtract inner divergences first (using the inner counterterms), then subtract the remaining outer divergence. This forest formula systematically handles all cases.
The Consistency Principle
The reason this all works is that the counterterm structure has exactly the same symmetries as the original Lagrangian. Lorentz invariance, gauge invariance, renormalizability; all are preserved. New divergences can’t appear with structures that weren’t there from the start.
In theories with less symmetry, renormalization can fail or require additional counterterms. This is why gauge invariance is so precious; it constrains the divergence structure.
10. Power Counting and Renormalizability
The Dimension of Operators
In spacetime dimensions, each field has a natural mass dimension (from requiring the action to be dimensionless):
| Field | Dimension |
|---|---|
| Scalar | 1 |
| Fermion | 3/2 |
| Gauge field | 1 |
Each derivative has dimension 1.
An operator has some dimension . In the Lagrangian , the coefficient has dimension (since the Lagrangian itself has dimension 4).
Three Classes of Operators
-
Relevant (): coupling has positive mass dimension. At low energies, these dominate. Examples: (mass term), (mass term).
-
Marginal (): dimensionless coupling. Equally important at all scales (classically). Examples: , .
-
Irrelevant (): coupling has negative mass dimension. Suppressed at low energies. Examples: , .
Power Counting for Renormalizability
A theory with only relevant + marginal couplings is power-counting renormalizable. All divergences can be absorbed into terms already in the Lagrangian.
A theory with any irrelevant couplings is non-renormalizable. Divergences appear in arbitrarily high-dimension operators, requiring infinitely many counterterms.
QED Example
QED has:
- : relevant + marginal
- : marginal
- : marginal
No irrelevant operators. QED is renormalizable.
Theory
- : marginal
- : relevant
- : marginal ()
Renormalizable.
Standard Model
The Standard Model contains only relevant and marginal operators. This is why it’s renormalizable and predictive.
Gravity
General relativity has coupling where is the Planck mass; negative mass dimension. This makes the theory (as a QFT) non-renormalizable. Quantum gravity effects at high energies require new physics (string theory, loop quantum gravity, etc.).
11. Non-Renormalizable Theories as EFTs
Even though non-renormalizable theories can’t be UV-complete, they’re extremely useful as effective field theories (EFTs); valid below some cutoff scale.
The EFT Framework
An EFT is a theory valid below some UV cutoff . The Lagrangian contains:
Each higher-dimension operator is suppressed by appropriate powers of the cutoff. At energies , the irrelevant operators are suppressed by and can be treated as small corrections.
Examples
Fermi theory of weak interactions: is non-renormalizable. But it’s the low-energy limit of the electroweak theory (valid below the boson mass). The cutoff is , and the higher-dimension Fermi operator is suppressed appropriately.
Chiral perturbation theory: low-energy QCD, valid below the chiral symmetry breaking scale. Contains non-renormalizable interactions but is predictive at energies GeV.
General relativity: an EFT valid below the Planck scale. Quantum corrections calculable order-by-order in , even though the theory isn’t UV-complete.
Renormalization in EFTs
EFTs can be renormalized order by order in the EFT expansion. At each order, finitely many operators contribute, and their coefficients are either matched to experiments or computed by matching to a UV-complete theory (if known).
This framework is extraordinarily powerful. It decouples short-distance physics (encoded in the operator coefficients) from long-distance dynamics (handled by the EFT). The Wilsonian view of renormalization (document 8) formalizes this.
The Lesson
Renormalizability isn’t a sacred property required of fundamental theories. Nature seems to be described, at energies below the Planck scale, by a renormalizable theory (the Standard Model); but that’s an empirical fact, not a logical necessity. Non-renormalizable theories are fine, as long as you interpret them as EFTs with a specified cutoff.
The “true” laws of physics may well be non-renormalizable from the perspective of our present theories. What we call renormalizable is simply “the leading terms in an EFT valid at accessible energies.”
12. Physical Meaning: What Is Renormalization Really Doing?
Let me step back and try to convey what’s genuinely happening.
The Traditional View (Pre-Wilson)
Renormalization was originally viewed as an ad hoc procedure: subtract infinite constants, pretend you’ve defined a finite theory, hope the subtractions are consistent. Many physicists (including Dirac) found this unsatisfying.
The Wilson View (Post-Wilson)
Wilson’s 1970s reformulation changed this completely. The key idea:
Every QFT should be understood as an effective theory at an energy scale . The Lagrangian at scale contains all operators compatible with the symmetries, with coefficients that may or may not be small.
Integrating out high-energy modes (going from scale to scale ) changes the coefficients of operators in a calculable way. This is the renormalization group flow.
At low energies (much less than any cutoff), only relevant and marginal operators dominate. Irrelevant operators are suppressed by .
What Renormalization Means
In the Wilsonian picture:
- Start with a Lagrangian at some high scale with specific coefficients (these are the “bare” parameters).
- Integrate out modes between and some lower scale . This produces a modified Lagrangian with renormalized coefficients.
- The modified coefficients depend on ; this dependence is the running.
- Physical predictions for observables at scale are computed using the renormalized coefficients.
The divergences appear because we’re asking what happens as ; sending the cutoff to infinity. The bare parameters then have to run to specific divergent values to keep low-energy physics fixed.
Renormalization is the change of variables from bare (UV-scale) parameters to renormalized (low-scale) ones. The infinities are artifacts of a particular parameterization, not physical phenomena.
Why It’s Deep
This reinterpretation connects QFT to critical phenomena in statistical mechanics (second-order phase transitions):
- The RG flow for QFT ↔ the RG flow for critical systems
- Universality classes in stat mech ↔ universality of low-energy EFTs
- Relevant/irrelevant/marginal operators apply to both
The mathematical structure; operators flowing under coarse-graining; is identical. Document 8 will develop this parallel properly.
Practical Consequence
For practical calculations, all this structure means:
- Divergences are handled systematically by counterterms
- Physical predictions are finite
- Running couplings can be computed and compared to experiment (for QED, the running of ; for QCD, the running of ; etc.)
- The theory has full predictive power, at the cost of inputting a finite number of parameters from experiment
This is how modern particle physics works. The Standard Model has roughly 20 free parameters (masses, couplings, mixing angles). Everything else is computed. The agreement with experiment is spectacular.
13. Appendix: Renormalization Formulas
QED Renormalization Constants
Bare-to-renormalized relations:
One-Loop On-Shell Values (Feynman Gauge)
Renormalization Conditions (On-Shell)
Electron:
Photon:
Vertex:
Scheme
Running coupling in :
Operator Dimensions in 4D
Renormalizable interactions in 4D: .
Further Reading
- Peskin & Schroeder, Chapter 10: standard treatment of renormalization
- Schwartz, Chapters 21-22: clean modern presentation
- Srednicki, Chapters 51-55: careful and systematic
- Weinberg, Vol. 1, Chapters 10-12: rigorous foundations
- Collins, Renormalization: the deep technical reference
Problems
-
Show at one loop by explicit calculation, not just citing the Ward-Takahashi identity. This is Peskin 10.2.
-
Compute the one-loop from the vacuum polarization in scheme, and show it gives the leading-log running of .
-
Write down all possible dimension- operators invariant under QED’s symmetries (Lorentz, gauge, CPT). Verify that they’re exactly the terms in the QED Lagrangian.
-
For a fictional theory, show that loop divergences generate operators , so the theory is non-renormalizable.
-
Using the renormalized Lagrangian, compute the one-loop electron-muon scattering amplitude at . Show all divergences cancel between loop and counterterm contributions.
Closing Note
Renormalization completes the conceptual journey from “here’s a QFT Lagrangian” to “here’s a finite prediction.” The procedure:
- Write down bare parameters and counterterms
- Compute loop diagrams (divergent)
- Compute counterterm diagrams (finite coefficients, to be determined)
- Impose renormalization conditions (on-shell, , or other)
- Extract finite physical predictions
Along the way, you’ve seen:
- Bare vs. renormalized parameters; only the latter are physical
- Counterterms; corrections to the Lagrangian that absorb divergences
- Renormalization conditions; define the finite parts of counterterms
- Ward-Takahashi identity (); makes QED charge renormalization universal
- Renormalizability; a technical property requiring only finitely many counterterms
- Non-renormalizable theories as EFTs; still useful, just at finite energies
What’s Next
Document 8 develops the renormalization group; the beautiful modern framework that shows renormalization isn’t just a procedure but a deep structural property of QFT. We’ll see running couplings, beta functions, asymptotic freedom, and Wilson’s effective-theory picture. The RG connects QFT to statistical mechanics, critical phenomena, and the structure of physics at different energy scales.
The RG is where renormalization stops being a calculational necessity and becomes physics.