QFT document 18: non-perturbative quantum field theory. Tunneling between vacua, the -angle, the BPST instanton, resolution of the problem via instantons, dilute instanton gas, and the large-N expansion as an alternative non-perturbative tool.
Every QFT calculation we’ve done has been perturbative: expand in a small coupling, compute Feynman diagrams, sum to some order. Beautiful when it works. But there are physical phenomena that cannot be accessed this way at all, because they’re suppressed by ; zero to all orders in perturbation theory but present non-perturbatively.
These include:
- Tunneling between vacua (QCD has infinitely many vacua labeled by integers)
- The mass (explained by anomaly + instantons, not perturbation theory)
- Baryon number violation (sphalerons in electroweak theory)
- Gluon condensates and chiral condensates (non-perturbative order parameters)
- Confinement (in full rigor, not accessible perturbatively)
This document develops the main tools for non-perturbative QFT: instantons (localized Euclidean field configurations that give tunneling amplitudes) and the large- expansion (a systematic expansion in where is the rank of a gauge group, revealing structure invisible in perturbation theory).
Prerequisites
- Documents 1-17, especially 11 (Yang-Mills), 15 (ChPT), 17 (anomalies)
- Path integrals in Euclidean signature
Conventions
- Euclidean signature throughout for instanton calculations (indicated explicitly when relevant)
- Mostly-minus Minkowski metric otherwise
- Gauge coupling , fine structure
Table of Contents
- Why Non-Perturbative?
- Tunneling in Quantum Mechanics: A Warmup
- The Yang-Mills Vacuum
- The BPST Instanton
- Topological Charge and the -Vacuum
- Instantons and the Problem
- The Dilute Instanton Gas
- Instantons in Other Theories
- Sphalerons and Baryon Number Violation
- The Large- Expansion
- Large- QCD Predictions
- Holographic Connection (Brief)
- Appendix: Non-Perturbative Reference
1. Why Non-Perturbative?
The Perturbative Series Doesn’t Converge
Take any QFT. Expand an observable in powers of coupling: . This series is typically asymptotic, not convergent. It’s useful for , after which terms grow.
The non-convergence isn’t a mathematical accident; it reflects physics. Non-perturbative effects of order lie beyond the asymptotic series. No amount of summing perturbative terms reveals them.
What’s Invisible Perturbatively
Perturbative QCD tells you nothing about:
- Why quarks are confined (no analytic proof exists from first principles)
- The value of (comes from RG, technically perturbative, but the physics it represents is non-perturbative)
- The chiral condensate
- The QCD vacuum structure (including )
- The mass
- Baryon number violation at high temperature
- Glueballs, exotic hadrons
All of these require non-perturbative methods; either instantons, large-, lattice QCD, or holography.
The Two Main Tools
Instantons: Euclidean solutions to the classical equations of motion that give tunneling amplitudes. They’re localized in spacetime and carry topological charge.
Large-: Replace with and take (with fixed). The theory simplifies dramatically; only planar diagrams contribute at leading order. Provides a different non-perturbative handle.
Lattice QCD is another major tool, but it’s numerical rather than analytical. Instantons and large- are what this document covers.
2. Tunneling in Quantum Mechanics: A Warmup
Before attacking field theory, let’s understand tunneling in quantum mechanics. The key ideas transfer directly.
The Double-Well Potential
Consider a particle in the potential:
Two degenerate minima at , barrier in between at height . Classically, a particle at stays there. Quantum mechanically, it can tunnel to .
The WKB Tunneling Amplitude
The standard result from QM:
where is the “Euclidean action” of the tunneling path:
For our double-well: .
The tunneling amplitude is exponentially suppressed in . It’s pure non-perturbative.
The Euclidean Path Integral Picture
The same result comes from the Euclidean path integral. The classical equation of motion in Euclidean time:
(Note the sign flip from Minkowski: .) This is the equation of motion in the inverted potential .
The “bounce” or “instanton” solution: starts at at , tunnels through to at . Plugging into the Euclidean action gives exactly from WKB.
Key insight: quantum tunneling in Minkowski time = classical motion in Euclidean time with inverted potential.
Why This Generalizes to Field Theory
The same mathematical structure; Euclidean classical solutions giving tunneling amplitudes; works in field theory. The potential is replaced by the action functional, and the classical solution is a field configuration, not just a particle trajectory.
The prefactor in front of requires care (fluctuations around the instanton), but the leading exponential is the same.
3. The Yang-Mills Vacuum
Vacuum Redundancy
In a Yang-Mills theory with gauge group , the “vacuum” isn’t unique. Gauge transformations relate physically equivalent configurations. The true vacuum is the space of physically distinct configurations.
Topology of Gauge Configurations
Consider the pure Yang-Mills vacuum with zero field strength: , so (pure gauge).
At spatial infinity, to have finite action, , which requires constant. But “constant” can be any element of . The 3-sphere at spatial infinity maps into :
The Key Topology:
For (with ), the third homotopy group is:
This means: maps from to are classified by an integer . Two maps with different cannot be continuously deformed into each other.
Vacuum Sectors
The integer labels distinct vacuum sectors . Gauge transformations labeled by map:
where . Small gauge transformations (continuously connected to identity) have winding 0 and don’t change . “Large” gauge transformations shift by the winding number.
The -Vacuum
Physical states must be eigenstates of gauge transformations. A linear combination:
is an eigenstate of large gauge transformations (shifts by 1):
Different labels physically different theories. is a parameter of QCD, like or quark masses.
Instantons as Tunneling
Tunneling between different vacuum sectors and is mediated by instantons: Euclidean field configurations with topological charge 1.
is the instanton action, of order . Tunneling amplitude ; exponentially small at weak coupling.
This is why the -vacuum is a non-perturbative phenomenon. Perturbatively, it’s invisible. But tunneling (instantons) makes it real.
4. The BPST Instanton
The Equation to Solve
The Euclidean Yang-Mills action:
(Using Lie-algebra indices, .)
We want localized solutions with finite action. The key identity:
where . Since , we have:
where is the topological charge:
Wait, let me reorganize. The topological charge is:
And then the bound becomes:
The bound is saturated when : self-dual or anti-self-dual field strengths.
Self-Duality Condition
For , we need self-dual () configurations. The equation:
is first-order (simpler than the full second-order Yang-Mills equations). Solutions to this automatically extremize the action.
The BPST Solution (Belavin-Polyakov-Schwarz-Tyupkin, 1975)
The simplest self-dual solution, in gauge theory:
where:
- is the position of the instanton (a free parameter)
- is the size of the instanton (a free parameter)
- is the ‘t Hooft symbol (a specific combination of Pauli matrices and Kronecker deltas)
The ‘t Hooft Symbol
The ‘t Hooft symbol is defined by:
It encodes the mapping from spacetime to gauge space. The anti-self-dual version is (differs in signs for the components).
Computing the Field Strength
Plugging the BPST into :
This is manifestly self-dual (because is self-dual by construction).
The field strength is localized: it peaks at with size and falls off as at large distance.
The Instanton Action
Plugging into the action:
Computing the integral:
Let me just cite the result. The standard result is:
Independent of the position and size . The instanton has a universal action determined only by the topology.
Topological Charge
Computing for BPST:
exactly. This is the topological charge; the same integer that labels the class of the gauge transformation at infinity.
Instanton Zero Modes
The BPST solution depends on parameters ; 5 collective coordinates in :
- 4 from position
- 1 from size
Plus 3 from “orientation” (rotating the color direction of the instanton). Total: 8 zero modes for BPST instanton.
More generally, for -instanton configurations, the number of collective coordinates is .
These zero modes must be handled as moduli in the path integral; you integrate over them rather than Gaussian-integrating around the instanton.
Multi-Instanton Solutions
For topological charge , you can superpose instantons (if they’re well-separated). The -instanton moduli space has complicated structure, studied extensively (ADHM construction).
5. Topological Charge and the -Vacuum
Physical Meaning of
The -term in the Euclidean action:
Adding this to the Yang-Mills action:
In Minkowski signature:
Why Can Seem Trivial (But Isn’t)
Naively, is a total derivative. Plugging in, the -term is:
Looks like it shouldn’t affect physics. But the boundary term picks up nontrivial values on topologically-nontrivial configurations (instantons), so contributes to those sectors.
Physical Consequences of
The -term:
- Preserves P, C, CP for or (mod )
- Violates CP for generic
- Contributes to the neutron EDM: e·cm
Experimentally: e·cm, implying .
This tiny value of is mysterious; why so small? This is the strong CP problem.
Proposed Solutions to Strong CP
Peccei-Quinn mechanism (axion): Introduce a new symmetry, explicitly broken by instantons. The Goldstone of spontaneous breaking is the axion , which dynamically drives .
Nelson-Barr models: CP is a symmetry of the high-energy theory, spontaneously broken by complex Higgs VEVs. The effective becomes calculable.
Massless up quark: If , is physically unobservable (anomaly absorbs it into the quark phase). But measurements give , so this is disfavored.
None is confirmed. Axion searches are the most active program.
: Special Point
At , CP is preserved (spontaneously broken in some sectors). Strong-coupling physics at is subtle: in SUSY, there are “oblique confinement” phases; in QCD at , lattice studies show unusual behavior.
The exact value of matters for QCD dynamics, and is a distinguished point.
6. Instantons and the Problem
Recap of the Problem
From documents 15 and 17: the symmetry of massless QCD is spontaneously broken but doesn’t produce a light Goldstone (the is 958 MeV, too heavy). The anomaly explains this quantum-mechanically.
But we can be more specific: the anomaly is zero in perturbation theory. It comes from non-perturbative effects. Specifically, instantons.
The Mechanism
The anomalous current divergence:
The integral on an instanton background is for .
Integrating:
So on a instanton, the axial charge changes by (with 2 coming from each of the 3 quark flavors contributing 2 units).
Instantons transfer axial charge between left and right, breaking invariance of correlators.
The ‘t Hooft Vertex
Consider a correlator of right-handed quarks and left-handed antiquarks. ‘t Hooft showed this gets a non-zero vacuum expectation value from instantons:
This is the famous ‘t Hooft effective vertex; a -fermion operator generated by instantons.
Physically: instantons induce a multi-fermion interaction that explicitly breaks .
Impact on Mass
Through the ‘t Hooft vertex, the meson (the would-be Goldstone) gets a mass. The calculation is intricate, but the final result (Witten-Veneziano):
where is the topological susceptibility; the vacuum fluctuation of topological charge, which is non-zero precisely because of instantons.
Quantitatively: , giving MeV. Observed: 958 MeV. Close, with the difference attributed to higher-order corrections in .
The Instanton-Anomaly Connection
The anomaly says the symmetry could be broken. Instantons are the mechanism that actually breaks it. The topological susceptibility measures how much instantons break it.
Without instantons, the axial anomaly might still be there as a formal result, but the physical consequences (heavy ) require explicit non-perturbative contributions.
Why Perturbation Theory Misses This
In perturbation theory, is a total derivative that integrates to zero (for topologically-trivial backgrounds). So perturbatively, and would be massless.
Only with non-perturbative instanton contributions does become non-zero. The mass is a direct measurement of non-perturbative QCD structure.
7. The Dilute Instanton Gas
The Approximation
For small instantons (size typical scale), instantons are localized and don’t interact much. Multi-instanton configurations can be approximated as a dilute gas of independent instantons.
The total partition function:
where counts instantons, anti-instantons, and we sum over all their positions, sizes, orientations.
The Single-Instanton Contribution
For a single instanton:
The fluctuation determinant around the instanton gives a specific prefactor. ‘t Hooft computed it for :
The comes from dimensional analysis of the collective coordinate measure.
The IR Problem
The integral over is divergent at large . At size , the coupling becomes large, and the instanton calculation breaks down. Large instantons probe strong coupling, where the dilute-gas approximation fails.
What Instantons Give Us
Despite the IR issues, instantons provide:
- mass (calculable in dilute-gas + ChPT matching)
- Axial charge non-conservation (multi-fermion operators)
- Vacuum structure (the -vacuum physics)
- Baryon number violation rate (electroweak sphalerons at finite T)
- Specific calculable effects in SUSY QCD: exact results for certain correlators
For observables dominated by small instantons (e.g., high-energy processes), the dilute-gas approximation works quantitatively. For observables sensitive to strong coupling (like confinement itself), it fails.
Instantons vs. Perturbation Theory
Instantons contribute at order , completely invisible to perturbation theory which expands in . The “gap” between perturbative and non-perturbative effects is exponential.
For (high-energy QCD): . Utterly negligible.
For (low-energy): . Small but non-negligible.
For : . Starting to matter.
Instantons become quantitatively important only at strong coupling.
8. Instantons in Other Theories
Abelian Instantons
In 4D abelian gauge theory (), there are no instantons; the topology is trivial ().
But in 2D: , so there are 2D vortices/instantons. Important for 2D gauge theory and superconductors.
Gravitational Instantons
In Euclidean general relativity, gravitational instantons are finite-action Einstein space configurations. Examples:
- Eguchi-Hanson: self-dual (or anti-self-dual) with specific boundary topology
- Taub-NUT: different asymptotic structure
- Black hole instantons (Euclidean Schwarzschild): describes black hole pair creation
Gravitational instantons contribute to quantum gravity calculations (e.g., in path integral quantum gravity).
Supersymmetric Instantons
In SUSY gauge theories, instantons have special properties:
- Exact cancellations between bosonic and fermionic fluctuations
- Exact non-perturbative results for certain correlators
- Seiberg-Witten theory: instantons give exact low-energy dynamics of SUSY
The Nekrasov partition function captures all-instanton contributions in 4D SUSY gauge theories.
Yang-Mills Instantons in Higher Dimensions
In dimensions, there are generalizations of instantons (solutions to generalized self-duality equations). These play roles in M-theory and supergravity compactifications.
9. Sphalerons and Baryon Number Violation
The Problem in the Standard Model
In the Standard Model, baryon number and lepton number are classically conserved. But the gauge theory has instantons (and related “sphaleron” configurations), and these violate and quantum-mechanically:
(And similarly for .) Here is the number of generations.
For : instantons cause and simultaneously. So is conserved, but isn’t.
Rate at Zero Temperature
At , the rate of such processes is per year for current values. Essentially zero; the proton is stable on cosmological timescales.
Rate at High Temperature (Sphalerons)
At high , tunneling gives way to thermal excitation over the barrier. The relevant configurations are sphalerons (saddle points of the classical action at finite temperature).
The sphaleron rate:
(up to logarithms), in the symmetric phase of electroweak theory. This is not exponentially suppressed; it’s a polynomial function of .
Implications for the Early Universe
At GeV (above the electroweak scale), sphalerons are in equilibrium. Any baryon number produced earlier gets washed out.
At GeV (electroweak scale), sphalerons freeze out as the Higgs acquires its VEV and generates gauge boson masses. If there’s a first-order electroweak phase transition, this provides out-of-equilibrium conditions where CP violation can create a baryon asymmetry; electroweak baryogenesis.
For SM: the transition is a crossover, not first-order, and CP violation is too small. So SM electroweak baryogenesis doesn’t work. New physics is needed.
Sphalerons as Thermally Excited Instantons
A sphaleron is a static classical configuration, not a tunneling solution. But physically, the sphaleron is what an instanton “becomes” at finite temperature; the saddle point that determines the barrier-crossing rate.
At : pure instanton tunneling, rate .
At : combination of tunneling and thermal excitation, increasing rate.
At : purely thermal barrier-crossing, rate .
The interpolation is a rich topic in thermal field theory.
10. The Large- Expansion
The Idea
Take gauge theory with colors. Expand in (rather than in ). At fixed ‘t Hooft coupling , take .
Why does this help? Because:
- Only planar diagrams contribute at leading order
- The theory simplifies dramatically
- Many qualitative features emerge cleanly
For QCD, isn’t large. But surprisingly, many predictions work reasonably well; suggesting large- captures real physics.
Double-Line Notation
Gluons have color indices: with . In terms of fundamental indices: with .
Think of a gluon as a “quark-antiquark pair in color space”: one fundamental index and one anti-fundamental. Draw diagrams with double lines: gluon = two oriented lines, one for each color index.
Counting Diagrams
For a diagram with vertices, edges (propagators), faces (closed color loops):
- Each vertex: factor
- Each edge: factor (propagator)
- Each face: factor (trace over color indices going around)
Total: .
For planar diagrams (can be drawn on a sphere without crossings): Using Euler: .
A diagram’s contribution scales as:
For planar (): the net -power is where is the genus.
Planar diagrams (genus 0): . Leading.
Non-planar diagrams (genus ): . Suppressed by .
The ‘t Hooft Limit
In the ‘t Hooft limit: with fixed.
At this limit:
- Only planar diagrams survive
- The theory has a (formal) string-like expansion in
- Large- QCD is a “master theory” governing the ‘t Hooft expansion
Large- Simplifications
Many simplifications emerge at large :
Meson masses are . Mesons don’t depend on (at leading order).
Meson decay constants are . Couplings , so decays .
Baryon masses are . Baryons contain quarks, so their mass scales.
Zweig rule violations . Diagrams that require breaking a quark loop are suppressed.
is a Goldstone. In the large- limit, the anomaly is suppressed by , so becomes massless. This is the Witten-Veneziano formula context.
Connected correlators factorize. .
The Planar Limit Is Solvable in Some Cases
In 2D: Yang-Mills in 2D is exactly solvable in the planar limit (Gross-Witten transition, Douglas-Kazakov transition).
In 4D: not exactly solvable, but many structural features can be computed.
AdS/CFT correspondence (next section) relates the planar limit of certain gauge theories to string theory in AdS; making the large- limit tractable via holography.
11. Large- QCD Predictions
Meson Spectrum
At large :
- Meson masses are -independent (at leading order)
- There are infinitely many mesons at each quantum number (pions, kaon-like states, etc., at higher masses)
- Decay widths are , so mesons are narrow
Finite- corrections: meson widths , mass shifts .
Zweig’s Rule (OZI Rule)
At large , meson decays that require disconnected diagrams (breaking a quark loop) are suppressed by .
Example: decay. The is nearly pure , so decay into requires annihilation of the pair via gluons; a disconnected topology.
Experimentally: , tiny compared to naive expectations. This is the OZI rule in action.
Large- predicts: OZI violations . Matches observation.
Baryon Properties
Baryons at large contain quarks and have mass . Their interactions are . At , baryons have specific structure (protons, neutrons) whose properties scale predictably.
Nucleon-nucleon coupling from one-pion exchange: consistent with large- scaling.
Baryon decuplet structure: Skyrme’s idea that baryons are solitons in a pion effective theory emerges naturally at large . Predicts specific relations between baryon masses and their excitations (quantitatively successful).
Glueball Spectrum
Glueballs (bound states of pure glue) exist in large- QCD, with definite quantum numbers. Large- predicts:
- Mass hierarchy: lightest, next, etc.
- Widths suppressed by
- Mixing with quark-mesons of similar quantum numbers suppressed at large
Lattice QCD confirms these predictions qualitatively. Glueball masses predicted near lattice measurements.
Confinement and Flux Tubes
In large- QCD, quark confinement is rigorously obtained. The quark-antiquark potential grows linearly with separation, due to a flux tube of color field lines.
The flux tube tension const at large (not scaling with ). Its thickness .
At : lattice QCD confirms this, with , a phenomenologically important value.
Predictions at
Remarkably, many large- predictions work quantitatively at , suggesting that corrections are “small enough” for most quantities.
This is why large- is more than a theoretical curiosity; it captures real aspects of QCD.
12. Holographic Connection (Brief)
The AdS/CFT Correspondence
Maldacena (1997) conjectured that 4D super Yang-Mills with gauge group is dual to Type IIB string theory on .
At large and large ‘t Hooft coupling : the gauge theory is dual to classical supergravity in AdS.
Why This Matters for Non-Perturbative QFT
Strong-coupling gauge theory becomes weak-coupling supergravity. Calculations that are impossible in perturbative QFT become tractable via classical gravity.
Computations have been done for:
- Transport coefficients in QGP-like plasmas: , matching QGP observations
- Mass spectra of strongly-coupled theories
- Confinement-deconfinement transitions in dual geometries
- Quark-antiquark potentials in various theories
- Many-body dynamics at finite temperature and density
Limitations
AdS/CFT is exact for SYM, but only a conjecture for other theories. For real QCD, no exact dual is known. But AdS/QCD models provide quantitative phenomenology by building dual geometries that match experimental data.
Relation to Large-
AdS/CFT is most tractable in the large- limit. The planar diagram expansion of the gauge theory maps onto a string theory perturbation expansion in .
In this sense, AdS/CFT is a deep generalization of large- QCD; extending it from gauge theories to string theory, with stringy corrections computable as corrections.
Full Treatment Would Require Own Documents
Holography is a vast topic. Full treatment requires:
- String theory basics
- Supergravity in AdS
- The AdS/CFT dictionary (operators ↔ fields, boundary conditions ↔ sources)
- Applications to condensed matter, nuclear physics
These are Option E topics. For now, mentioning the connection is enough.
13. Appendix: Non-Perturbative Reference
Key Formulas
Yang-Mills instanton action:
Topological charge:
Instanton rate (dilute gas):
-term:
Witten-Veneziano:
Topological susceptibility:
The BPST Instanton
Large- Scalings
| Quantity | Scaling |
|---|---|
| Meson mass | |
| Meson decay constant | |
| Meson-meson interaction | |
| Baryon mass | |
| OZI violation | |
| anomaly | |
| Glueball mass | |
| Glueball width |
Further Reading
- Coleman, Aspects of Symmetry (1985): classic lectures including instantons
- Polyakov, Gauge Fields and Strings (1987): monographs on non-perturbative methods
- Shifman, Advanced Topics in Quantum Field Theory: comprehensive textbook
- ‘t Hooft, Computation of the Quantum Effects Due to a Four-Dimensional Pseudoparticle (1976): original ABJ-anomaly-to-instantons paper
- Witten, Instantons, the Quark Model, and the 1/N Expansion (1979): large- QCD foundation
- Rubakov, Classical Theory of Gauge Fields: non-perturbative gauge theory textbook
- Manton & Sutcliffe, Topological Solitons: covers instantons, monopoles, etc.
Problems
-
Verify that the BPST instanton is self-dual by computing and explicitly.
-
Compute the BPST instanton’s topological charge by direct integration.
-
Derive the dilute instanton gas partition function, identifying the density expansion.
-
Using the ‘t Hooft vertex, estimate the contribution to the mass from a dilute instanton gas. Compare with Witten-Veneziano.
-
For Yang-Mills, show that the planar expansion generates only diagrams that can be drawn on a sphere.
-
Compute the meson decay constant in terms of and at large . Show it scales as .
-
At what value of do instanton effects become “large” (say, 10% of perturbative) in QCD? What physical scale does this correspond to?
Closing Note
Non-perturbative QFT is where the deepest physics lives. Perturbation theory is the toolkit for weak coupling, but many physical phenomena; confinement, tunneling, vacuum structure, baryon number violation, mass; are inherently non-perturbative.
What We Covered
- Instantons: Euclidean tunneling solutions with topological charge. The BPST solution, its moduli, its physics.
- -vacuum: the parameter labeling distinct QCD theories. Strong CP problem and axion.
- Instanton effects: mass, ‘t Hooft vertex, anomaly realization. How perturbatively-invisible physics becomes dominant.
- Dilute instanton gas: the approximation scheme that makes instanton calculations tractable.
- Sphalerons: thermal analogs of instantons, giving electroweak baryogenesis possibilities.
- Large-: expansion in , planar limit, qualitative predictions for QCD.
- Holography (brief): how large- connects to string theory via AdS/CFT.
What’s Next
Document 19 covers solitons, monopoles, and dualities; classical field configurations that extend in space (not just spacetime events like instantons). Kinks, vortices, ‘t Hooft-Polyakov monopoles, BPS states, Dirac quantization, Montonen-Olive duality.
Together with document 18, this covers the main non-perturbative structures in QFT: events in spacetime (instantons) and objects in space (solitons).
After that, depending on interest:
- Option E: Beyond Standard Model topics; SUSY, strings, holography in depth, quantum gravity
Or sideways into:
- Topological phases of matter in depth
- Entanglement and quantum information in QFT
- Integrability and exactly solvable models
- Advanced mathematical aspects (index theorems, cohomology, etc.)
You have extensive foundations now. Many research-level topics are accessible from here.