QFT document 10: Grassmann variables, anticommuting numbers, and how to put fermions inside the path integral.
Document 9 developed path integrals for bosonic fields. But all matter is fermionic, and no complete QFT framework can ignore fermions. The challenge: fermions anticommute, and ordinary c-number integration variables commute. If we want path integrals over fermion fields, we need a new kind of number.
Those new numbers are Grassmann variables: anticommuting numbers, or a-numbers. They’re strange at first sight (a number that squares to zero? that anticommutes with itself?) but the algebra is perfectly well-defined and maps beautifully onto fermionic physics.
Once we have Grassmann integration, the fermionic path integral is essentially identical in structure to the bosonic one: same generating functionals, same perturbation theory, same Feynman rules. But with one crucial difference: Gaussian integrals over Grassmann variables give determinants, not inverse determinants. This flip (the determinant moving from the denominator to the numerator) is what encodes fermionic statistics at the level of the path integral, and it has profound consequences.
Prerequisites and Conventions
- QFT documents 1-9 (especially documents 2 and 9)
- Linear algebra (determinants, matrix exponentials)
- Same conventions: mostly-minus metric $(+,-,-,-)$, natural units $\hbar = c = 1$
Table of Contents
- Why We Need Anticommuting Numbers
- Grassmann Numbers: The Algebra
- Calculus on Grassmann Variables
- Grassmann Gaussian Integrals
- The Fermionic Path Integral
- Generating Functional for Fermions
- The Fermion Propagator from the Path Integral
- Integrating Out Fermions: Effective Actions
- Perturbation Theory with Fermions
- The Sign of the Fermion Loop
- Applications: Yukawa, QED, and Beyond
- The Sign Problem and Lattice QCD
- Appendix: Grassmann Algebra Reference
1. Why We Need Anticommuting Numbers
The Problem
In document 2, we quantized the Dirac field using anticommutators instead of commutators. The spin-statistics theorem forced this: any attempt to use commutators produced either negative-energy states or negative-norm vectors.
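Now we want to write the Dirac field theory as a path integral:
$$Z = \int \mathcal{D}\bar\psi\,\mathcal{D}\psi\,\exp\left[i\int d^4x\,\bar\psi\,(i\gamma^\mu\partial_\mu - m)\,\psi\right]$$
The integration variables $\psi$ and $\bar\psi$ are “classical” fields corresponding to the Dirac field operator. But classical fields that correspond to fermionic quantum operators must satisfy something unusual: they should anticommute.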
Why? Because when you derive the path integral from canonical quantization (as in document 2, section 2), the “classical” fields appearing in the integrand are eigenvalues of the field operators. Eigenvalues of anticommuting operators must themselves anticommute; otherwise the path integral won’t reproduce the right anticommutation structure.
The Requirement
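We need integration variables $\psi_a(x)$, $\bar\psi_b(y)$ satisfying:
$$\{\psi_a(x),\psi_b(y)\} = 0,\qquad \{\bar\psi_a(x),\bar\psi_b(y)\} = 0,\qquad \{\psi_a(x),\bar\psi_b(y)\} = 0$$
These anticommutators hold at the level of classical variables, not operators: the right-hand sides are exactly zero, with no delta function. The variables multiply each other anticommutatively.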
Such quantities are called Grassmann numbers (after Hermann Grassmann, 1844).
The Payoff
Once we have Grassmann numbers, the fermionic path integral looks almost identical to the bosonic one, with Gaussian integrals giving determinants instead of inverse determinants. Everything about fermion physics falls out.
2. Grassmann Numbers: The Algebra
Definition
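A Grassmann number is an object $\theta$ satisfying:
$$\theta^2 = 0$$
More generally, a set of Grassmann numbers $\theta_1, \dots, \theta_n$ satisfies:
$$\{\theta_i, \theta_j\} \equiv \theta_i\theta_j + \theta_j\theta_i = 0$$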
Properties
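Nilpotency. Since $\theta^2 = 0$, any power higher than one vanishes. So a function of a single Grassmann number can have at most two terms:
$$f(\theta) = a + b\,\theta$$
where $a, b$ are ordinary (commuting) numbers. Higher powers all vanish.
Linear independence. For $n$ Grassmann variables $\theta_1, \dots, \theta_n$, the most general function has $2^n$ terms (each $\theta_i$ either appears or doesn’t):
$$f = f_0 + \sum_i f_i\,\theta_i + \sum_{i<j} f_{ij}\,\theta_i\theta_j + \cdots + f_{12\cdots n}\,\theta_1\theta_2\cdots\theta_n$$
The final term (with all $\theta$’s present) is called the “top form.”
Commutation with ordinary numbers. Grassmann variables commute with ordinary (c-number) variables:
$$a\,\theta = \theta\,a$$
for ordinary numbers $a$.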
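The properties above (anticommutation, nilpotency, at most two terms in a function of one variable) can be checked mechanically. The following is a minimal sketch, not a library: the class name `Grassmann`, the helper `mul_monomials`, and the dictionary representation (sorted index tuple mapped to coefficient) are all choices made here for illustration.

```python
from itertools import product


def mul_monomials(a, b):
    """Multiply two monomials given as sorted tuples of generator indices.

    Returns (sign, sorted_tuple); sign is 0 if a generator repeats (theta^2 = 0).
    """
    if set(a) & set(b):
        return 0, ()
    seq = list(a + b)
    sign = 1
    # Bubble sort: every swap of two adjacent generators flips the sign.
    for i in range(len(seq)):
        for j in range(len(seq) - 1 - i):
            if seq[j] > seq[j + 1]:
                seq[j], seq[j + 1] = seq[j + 1], seq[j]
                sign = -sign
    return sign, tuple(seq)


class Grassmann:
    """Element of a Grassmann algebra stored as {sorted index tuple: coefficient}."""

    def __init__(self, terms=None):
        self.terms = {k: v for k, v in (terms or {}).items() if v != 0}

    @classmethod
    def generator(cls, i):
        return cls({(i,): 1})

    @classmethod
    def number(cls, c):
        return cls({(): c})

    def __add__(self, other):
        out = dict(self.terms)
        for k, v in other.terms.items():
            out[k] = out.get(k, 0) + v
        return Grassmann(out)

    def __mul__(self, other):
        if not isinstance(other, Grassmann):
            other = Grassmann.number(other)
        out = {}
        for (a, ca), (b, cb) in product(self.terms.items(), other.terms.items()):
            sign, mono = mul_monomials(a, b)
            if sign:
                out[mono] = out.get(mono, 0) + sign * ca * cb
        return Grassmann(out)

    def __rmul__(self, c):
        # Ordinary numbers commute with everything.
        return self * c

    def __eq__(self, other):
        return self.terms == other.terms


theta1, theta2, theta3 = (Grassmann.generator(i) for i in (1, 2, 3))
zero = Grassmann()

anticommutator = theta1 * theta2 + theta2 * theta1  # {theta1, theta2}
square = theta1 * theta1                            # theta1^2
f = Grassmann.number(2) + 3 * theta1                # f = a + b*theta
even = theta1 * theta2                              # product of two generators
```

Here `anticommutator` and `square` both reduce to the zero element, `f * f` has no $\theta^2$ term, and the even product `even` commutes with `theta3` while single generators do not commute with each other.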
Two Kinds of Numbers
It’s crucial to distinguish:
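- c-numbers (commuting numbers): ordinary real or complex numbers. $ab = ba$.
- a-numbers (anticommuting numbers) = Grassmann numbers. $\theta_i\theta_j = -\theta_j\theta_i$.
Products of Grassmann numbers can be either kind:
- An even number of $\theta$’s multiplied together behaves like a c-number (it commutes with everything, though it is still nilpotent): $(\theta_1\theta_2)\,\theta_3 = \theta_3\,(\theta_1\theta_2)$
- An odd number of $\theta$’s is an a-number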
This parity (even vs. odd) determines the statistics.
Complex Grassmann Numbers
For fermion fields, we need complex Grassmann variables. Define $\theta$ and its independent “complex conjugate” $\bar\theta$ (strictly, they’re independent complex Grassmann variables, not actually related by complex conjugation). They satisfy:
$$\{\theta, \theta\} = \{\bar\theta, \bar\theta\} = \{\theta, \bar\theta\} = 0$$
Functions of Grassmann Fields
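A “Grassmann field” $\psi(x)$ is a Grassmann variable at each spacetime point $x$. Different points give different Grassmann variables, all anticommuting with each other.
An arbitrary functional of a Grassmann field has the structure:
$$F[\psi] = f_0 + \int d^4x\, f_1(x)\,\psi(x) + \int d^4x\, d^4y\, f_2(x,y)\,\psi(x)\psi(y) + \cdots$$
where the coefficients $f_n$ are ordinary functions. The anticommutation of $\psi$’s at different points is respected, so only the antisymmetric part of $f_2(x,y)$ contributes, and similarly at higher orders.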
3. Calculus on Grassmann Variables
Differentiation
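Define differentiation of Grassmann functions by:
$$\frac{\partial}{\partial\theta_i}\,1 = 0, \qquad \frac{\partial}{\partial\theta_i}\,\theta_j = \delta_{ij}$$
Since $\theta_i\theta_j = -\theta_j\theta_i$, the Leibniz rule takes a modified form:
$$\frac{\partial}{\partial\theta_i}(\theta_j\theta_k) = \delta_{ij}\,\theta_k - \theta_j\,\delta_{ik}$$
Note the minus sign in the second term. The rule is: $\partial/\partial\theta_i$ is itself an a-number, so it anticommutes with Grassmann variables and picks up a minus sign each time it moves past one on its way to the variable it acts on.
Standard Grassmann derivative (left-derivative):
$$\frac{\partial}{\partial\theta_i}(\theta_j\theta_i) = -\theta_j \qquad (j \neq i)$$
That is, moving $\partial/\partial\theta_i$ to act on $\theta_i$ past an intervening Grassmann variable $\theta_j$ picks up a minus sign. This convention is called the left-derivative.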
Integration
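This is where Grassmann calculus gets strange. Define Grassmann integration by:
$$\int d\theta\, 1 = 0, \qquad \int d\theta\, \theta = 1$$
Integration is equivalent to differentiation for Grassmann variables: $\int d\theta\, f(\theta) = \partial f/\partial\theta$. This is perhaps the weirdest feature of the algebra.
From these two rules, any Grassmann integral can be computed. For a function $f(\theta) = a + b\theta$:
$$\int d\theta\,(a + b\,\theta) = b$$
Only the $\theta$-term in the integrand contributes. The constant piece integrates to zero.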
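One standard way to see why the rules $\int d\theta\,1 = 0$ and $\int d\theta\,\theta = 1$ are natural is to check that they make the integral invariant under a constant shift $\theta \to \theta + \eta$, the Grassmann analog of $\int dx\, f(x+c) = \int dx\, f(x)$. The worked check below is a sketch; $\eta$ is an auxiliary Grassmann number introduced here for illustration.

```latex
% Shift invariance for f(\theta) = a + b\theta, with \eta a constant Grassmann number
\begin{align}
\int d\theta\, f(\theta + \eta)
  &= \int d\theta\,\bigl(a + b\,\theta + b\,\eta\bigr) \\
  &= \underbrace{\int d\theta\,(a + b\,\eta)}_{=\,0 \text{ (no } \theta \text{ present)}}
     \;+\; b\int d\theta\,\theta \\
  &= b \;=\; \int d\theta\, f(\theta).
\end{align}
```

The shift only changes the $\theta$-independent part of the integrand, which integrates to zero, so the result is unchanged.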
Multi-Variable Integration
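For multiple Grassmann variables:
$$\int d\theta_n \cdots d\theta_2\, d\theta_1\;\theta_1\theta_2\cdots\theta_n = 1$$
The ordering of $d\theta$’s matters and is conventional (I’m using one common convention, in which the innermost integral is done first). Different orderings may differ by signs.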
Change of Variables
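Ordinary integration: $\int dx\, f(x) = \int dy\, J\, f(x(y))$ where $J = |\partial x/\partial y|$ is the Jacobian.
Grassmann integration reverses this:
$$\int d\theta\, f(\theta) = \int d\theta'\, J^{-1}\, f(\theta(\theta')), \qquad J = \frac{\partial\theta}{\partial\theta'}$$
The Jacobian appears inversely. This is because “integration” for Grassmann variables is more like differentiation than integration.
Specifically, for a linear change of variables $\theta'_i = A_{ij}\theta_j$:
$$d\theta'_1 \cdots d\theta'_n = (\det A)^{-1}\, d\theta_1 \cdots d\theta_n$$
(With appropriate sign conventions.)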
This inverse-Jacobian property is what leads to determinants (rather than inverse determinants) in Gaussian Grassmann integrals.
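The inverse Jacobian can be seen already for a single variable. The sketch below rescales $\theta' = a\,\theta$ with $a$ an ordinary nonzero number (a choice made here for illustration) and demands that the defining rule $\int d\theta\,\theta = 1$ hold in both sets of variables.

```latex
% Single-variable rescaling \theta' = a\theta
\begin{align}
1 \;=\; \int d\theta'\,\theta' \;=\; \int d\theta'\,(a\,\theta)
\quad\text{and}\quad
1 \;=\; \int d\theta\,\theta
\qquad\Longrightarrow\qquad
d\theta' = \frac{1}{a}\,d\theta .
\end{align}
% Compare the commuting case x' = a x, where dx' = a\,dx.
```

For $n$ variables the same argument applied to the top form $\theta'_1\cdots\theta'_n = (\det A)\,\theta_1\cdots\theta_n$ gives the factor $(\det A)^{-1}$ in the measure.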
A Simple Example
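Consider $\int d\theta_1\, d\theta_2\;\theta_1\theta_2$, doing the innermost integral ($d\theta_2$) first. Anticommute $\theta_2$ next to its measure, then use $\int d\theta\,\theta = 1$ twice:
$$\int d\theta_1\, d\theta_2\;\theta_1\theta_2 = -\int d\theta_1\, d\theta_2\;\theta_2\theta_1 = -\int d\theta_1\,\theta_1 = -1$$
Alternatively, changing the order of the measure:
$$\int d\theta_2\, d\theta_1\;\theta_1\theta_2 = \int d\theta_2\,\theta_2 = +1$$
The conventions matter. The practical rule: an integral is nonzero only if every $d\theta_i$ in the measure is matched by exactly one $\theta_i$ in the integrand; each matched pair contributes a factor of 1, and the overall sign is fixed by the anticommutations needed to bring each $\theta_i$ next to its $d\theta_i$.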
The Fundamental Integration Formula
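For $n$ real Grassmann variables (with $n$ even) and an antisymmetric matrix $A$:
$$\int d\theta_1 \cdots d\theta_n\,\exp\left(-\tfrac12\,\theta_i A_{ij}\theta_j\right) = \mathrm{Pf}(A)$$
up to a convention-dependent sign, where the Pfaffian satisfies $\mathrm{Pf}(A)^2 = \det A$. This is for real Grassmann variables (a single copy). For complex Grassmann variables (our case), the relevant formula is the Gaussian integral of the next section.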
4. Grassmann Gaussian Integrals
The Key Formula
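For complex Grassmann variables $\theta_i, \bar\theta_i$ (think of them as $\psi(x), \bar\psi(x)$ at different points) with a bilinear form $\bar\theta_i M_{ij}\theta_j$:
$$\int \prod_i d\bar\theta_i\, d\theta_i\,\exp\left(-\bar\theta_i M_{ij}\theta_j\right) = \det M$$
Compare to bosonic Gaussian integration (complex $\phi_i$):
$$\int \prod_i d\phi_i^*\, d\phi_i\,\exp\left(-\phi_i^* M_{ij}\phi_j\right) \propto \frac{1}{\det M}$$
(Up to factors of $2\pi$.) The crucial difference: $\det M$ for fermions, $1/\det M$ for bosons.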
Why the Difference?
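For ordinary (commuting) integration variables, the Gaussian integral in $n$ dimensions is:
$$\int d^nx\,\exp\left(-\tfrac12\,x^T A\,x\right) = \frac{(2\pi)^{n/2}}{\sqrt{\det A}}$$
because each eigenvalue $a_i$ of $A$ squeezes the integrand to a width of order $1/\sqrt{a_i}$ in that direction, so the integral is proportional to $\prod_i a_i^{-1/2}$.
For Grassmann variables, integration is essentially differentiation. Expanding the exponential in a Taylor series, only the $n$-th order term contributes (because higher powers vanish by Grassmann nilpotency, and lower powers don’t have enough fields to match all $2n$ of the $d\theta$’s and $d\bar\theta$’s in the measure).
Computing this specific term:
$$\frac{(-1)^n}{n!}\left(\bar\theta_i M_{ij}\theta_j\right)^n = \left[\sum_{\tau}\mathrm{sgn}(\tau)\prod_i M_{i\tau(i)}\right]\theta_1\bar\theta_1\cdots\theta_n\bar\theta_n$$
Reordering the $\theta$’s and $\bar\theta$’s picks up signs. After the dust settles, the coefficient of $\theta_1\bar\theta_1\cdots\theta_n\bar\theta_n$ (the only surviving Grassmann structure) is the sum over permutations $\tau$ shown in brackets, which is exactly $\det M$.
So the integral equals $\det M$, not $1/\det M$.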
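The claim that $\int \prod_i d\bar\theta_i\, d\theta_i\, \exp(-\bar\theta_i M_{ij}\theta_j) = \det M$ can be checked by brute force for a small matrix: expand the exponential in an explicit Grassmann algebra and read off the coefficient of the top form. This is a sketch under stated assumptions; the helper names (`mul_monomials`, `gmul`, `grassmann_gaussian`) and the index convention ($\theta_i \to 2i$, $\bar\theta_i \to 2i+1$, with $\int d\bar\theta\, d\theta\;\theta\bar\theta = 1$) are choices made here, not something fixed by the text.

```python
import math

import numpy as np


def mul_monomials(a, b):
    """Product of two sorted index tuples: (sign, sorted tuple), sign 0 if any index repeats."""
    if set(a) & set(b):
        return 0, ()
    seq = list(a + b)
    sign = 1
    for i in range(len(seq)):
        for j in range(len(seq) - 1 - i):
            if seq[j] > seq[j + 1]:
                seq[j], seq[j + 1] = seq[j + 1], seq[j]
                sign = -sign  # each adjacent swap of generators flips the sign
    return sign, tuple(seq)


def gmul(x, y):
    """Multiply two algebra elements stored as {sorted index tuple: coefficient}."""
    out = {}
    for a, ca in x.items():
        for b, cb in y.items():
            sign, mono = mul_monomials(a, b)
            if sign:
                out[mono] = out.get(mono, 0.0) + sign * ca * cb
    return out


def grassmann_gaussian(M):
    """Berezin integral of exp(-thetabar_i M_ij theta_j) over all thetabar_i, theta_i."""
    n = M.shape[0]
    # Assumed convention: theta_i -> index 2i, thetabar_i -> index 2i+1, so the sorted
    # top monomial is theta_0 thetabar_0 theta_1 thetabar_1 ... and integrates to 1.
    X = {}
    for i in range(n):
        for j in range(n):
            sign, mono = mul_monomials((2 * i + 1,), (2 * j,))
            X[mono] = X.get(mono, 0.0) - sign * M[i, j]
    total, power = {(): 1.0}, {(): 1.0}
    for k in range(1, n + 1):  # the series stops at order n by nilpotency
        power = gmul(power, X)
        for mono, c in power.items():
            total[mono] = total.get(mono, 0.0) + c / math.factorial(k)
    return total.get(tuple(range(2 * n)), 0.0)


rng = np.random.default_rng(0)
M = rng.normal(size=(3, 3))
berezin = grassmann_gaussian(M)
determinant = np.linalg.det(M)
print(berezin, determinant)
```

The two printed numbers agree to floating-point precision. The cost grows exponentially with the matrix size, so this is only a consistency check for small $n$, not a way to compute determinants.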
Source Terms
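With source terms (Grassmann-valued $\eta_i$, $\bar\eta_i$), the generating function is:
$$\int \prod_i d\bar\theta_i\, d\theta_i\,\exp\left(-\bar\theta_i M_{ij}\theta_j + \bar\eta_i\theta_i + \bar\theta_i\eta_i\right) = \det M\,\exp\left(\bar\eta_i\,(M^{-1})_{ij}\,\eta_j\right)$$
This is the fermionic analog of the bosonic result $\propto (\det M)^{-1}\exp\left(J_i^*\,(M^{-1})_{ij}\,J_j\right)$.
The exponent $\bar\eta\,M^{-1}\eta$, with $M^{-1}$ the matrix inverse, gives the propagator structure. Hitting with derivatives $\partial/\partial\bar\eta_i$ and $\partial/\partial\eta_j$ pulls down factors of $(M^{-1})_{ij}$.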
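The source formula follows from completing the square, exactly as in the bosonic case. The sketch below uses matrix notation ($\bar\theta M\theta \equiv \bar\theta_i M_{ij}\theta_j$) and introduces shifted variables $\theta'$, $\bar\theta'$ for illustration; it relies on the fact that a constant Grassmann shift leaves the measure unchanged.

```latex
\begin{align}
-\bar\theta M\theta + \bar\eta\,\theta + \bar\theta\,\eta
  &= -\bigl(\bar\theta - \bar\eta M^{-1}\bigr)\,M\,\bigl(\theta - M^{-1}\eta\bigr)
     + \bar\eta\,M^{-1}\eta, \\[4pt]
\int \prod_i d\bar\theta_i\, d\theta_i\;
  e^{-\bar\theta M\theta + \bar\eta\theta + \bar\theta\eta}
  &= e^{\bar\eta M^{-1}\eta}
     \int \prod_i d\bar\theta'_i\, d\theta'_i\; e^{-\bar\theta' M\theta'}
   \;=\; \det M\; e^{\bar\eta M^{-1}\eta},
\end{align}
% with \theta' = \theta - M^{-1}\eta and \bar\theta' = \bar\theta - \bar\eta M^{-1}.
% The factor e^{\bar\eta M^{-1}\eta} is even in Grassmann variables, so it can be
% pulled outside the integral without any sign.
```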
The Importance
The fact that fermion integration gives $\det M$ (rather than $1/\det M$) has two huge consequences:
1. Fermion loops come with a minus sign. This is the Feynman rule we stated in document 5 without derivation.
2. Integrating out fermions produces a functional determinant that modifies the effective action. In a theory of fermions coupled to a background field $A$, integrating out the fermions gives $\det M[A]$, which contains all the back-reaction of the fermions on the background.
5. The Fermionic Path Integral
The Setup
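For the Dirac field, the action is:
$$S = \int d^4x\,\bar\psi\,(i\gamma^\mu\partial_\mu - m)\,\psi$$
The path integral is:
$$Z = \int \mathcal{D}\bar\psi\,\mathcal{D}\psi\; e^{iS[\bar\psi,\psi]}$$
where $\psi$ and $\bar\psi$ are now independent Grassmann-valued fields (i.e., at each point and for each spinor index, they’re independent Grassmann variables).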
The Euclidean Version
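Wick-rotate $t \to -i\tau$. The Euclidean Dirac action becomes:
$$S_E = \int d^4x_E\,\bar\psi\,(\gamma^E_\mu\partial_\mu + m)\,\psi$$
(with appropriate sign conventions for Euclidean gamma matrices). The path integral is:
$$Z = \int \mathcal{D}\bar\psi\,\mathcal{D}\psi\; e^{-S_E}$$
This is a Grassmann Gaussian integral with the bilinear form $M = \gamma^E_\mu\partial_\mu + m$.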
Evaluating the Free Fermion Path Integral
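By the Grassmann Gaussian formula:
$$Z = \det\left(\gamma^E_\mu\partial_\mu + m\right)$$
(up to normalization). This is the fermion functional determinant.
In momentum space, $\partial_\mu \to i p_\mu$, and:
$$\det\left(\gamma^E_\mu\partial_\mu + m\right) = \prod_p \det\left(i\gamma^E_\mu p_\mu + m\right) = \prod_p\,(p^2 + m^2)^2$$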
This product is formally infinite and requires regularization, but can be made sense of using zeta function regularization, lattice discretization, or dimensional regularization.
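For a single momentum mode, the $4\times4$ spinor determinant $\det(i\gamma^E_\mu p_\mu + m) = (p^2 + m^2)^2$ is easy to confirm numerically. The sketch below assumes one particular Hermitian representation of the Euclidean gamma matrices (built from Pauli matrices with `numpy.kron`); any representation satisfying $\{\gamma_\mu, \gamma_\nu\} = 2\delta_{\mu\nu}$ would do, and the names `gammas`, `clifford_ok`, and `dirac_det` are illustrative.

```python
import numpy as np

sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2, dtype=complex)

# One possible Hermitian choice: gamma_k = sigma_y (x) sigma_k, gamma_4 = sigma_x (x) 1.
gammas = [np.kron(sy, s) for s in (sx, sy, sz)] + [np.kron(sx, I2)]


def clifford_ok(g):
    """Check {gamma_mu, gamma_nu} = 2 delta_{mu nu} for all pairs."""
    return all(
        np.allclose(g[m] @ g[n] + g[n] @ g[m], 2 * (m == n) * np.eye(4))
        for m in range(4)
        for n in range(4)
    )


def dirac_det(p, m):
    """Spinor-space determinant of (i p-slash + m) for Euclidean momentum p."""
    pslash = sum(p_mu * g for p_mu, g in zip(p, gammas))
    return np.linalg.det(1j * pslash + m * np.eye(4))


rng = np.random.default_rng(1)
p = rng.normal(size=4)
m = 0.7
lhs = dirac_det(p, m)
rhs = (p @ p + m**2) ** 2
print(lhs, rhs)
```

The determinant comes out real (up to rounding) and equal to $(p^2+m^2)^2$, which is the per-mode factor in the infinite product above.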
Including Sources
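Using the Gaussian formula (in Minkowski signature, with source terms $\bar\eta\psi + \bar\psi\eta$ added to the Lagrangian):
$$Z[\bar\eta,\eta] = Z[0,0]\,\exp\left[-\int d^4x\, d^4y\;\bar\eta(x)\,S_F(x-y)\,\eta(y)\right]$$
where $S_F(x-y)$ is the fermion Feynman propagator. Again up to signs and factors of $i$ from conventions.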
6. Generating Functional for Fermions
The Definition
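As for the bosonic case, introduce Grassmann source terms $\eta(x)$, $\bar\eta(x)$:
$$Z[\bar\eta,\eta] = \int \mathcal{D}\bar\psi\,\mathcal{D}\psi\,\exp\left[i\int d^4x\left(\mathcal{L} + \bar\eta\,\psi + \bar\psi\,\eta\right)\right]$$
Correlators via Functional Derivatives
Now the derivatives are Grassmann derivatives (which pick up signs):
$$\langle 0|T\,\psi(x_1)\bar\psi(x_2)|0\rangle = \frac{1}{Z[0,0]}\left(-i\frac{\delta}{\delta\bar\eta(x_1)}\right)\left(+i\frac{\delta}{\delta\eta(x_2)}\right)Z[\bar\eta,\eta]\,\Big|_{\eta=\bar\eta=0}$$
Two-Point Function
For the free theory:
$$\langle 0|T\,\psi(x)\bar\psi(y)|0\rangle = S_F(x-y)$$
This matches the result from canonical quantization in document 2. The Feynman propagator emerges from the path integral via functional differentiation.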
The Direction of Arrows
A careful comment: the ordering of Grassmann derivatives matters because $\eta$ and $\bar\eta$ anticommute. Getting the right number of minus signs requires tracking the order. This is one of the places where fermion calculations are slightly more tedious than bosonic ones.
In practice, Feynman-diagram conventions are set up to handle this automatically. As long as you follow them, the signs work out.
7. The Fermion Propagator from the Path Integral
Let me derive the Feynman propagator explicitly from the path integral, to ground the formalism.
The Free Fermion Generating Functional
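From section 5:
$$Z[\bar\eta,\eta] = Z[0,0]\,\exp\left[-\int d^4x\, d^4y\;\bar\eta(x)\,S_F(x-y)\,\eta(y)\right]$$
Computing the Two-Point Function
Take Grassmann derivatives:
$$\langle 0|T\,\psi(x)\bar\psi(y)|0\rangle = \frac{1}{Z[0,0]}\left(-i\frac{\delta}{\delta\bar\eta(x)}\right)\left(+i\frac{\delta}{\delta\eta(y)}\right)Z[\bar\eta,\eta]\,\Big|_{\eta=\bar\eta=0}$$
(The $+i$, rather than $-i$, on the second derivative accounts for anticommutation of Grassmann derivatives: $\delta/\delta\eta$ must pass $\bar\psi$ to reach $\eta$ in the source term $\bar\psi\eta$.)
Applying the derivatives to the exponent:
The derivative $\delta/\delta\eta(y)$ brings down a factor $+\int d^4z\,\bar\eta(z)\,S_F(z-y)$ (the sign flips because the derivative passes $\bar\eta$). Then $\delta/\delta\bar\eta(x)$ brings down $S_F(x-y)$. The signs and $i$’s work out to give:
$$\langle 0|T\,\psi(x)\bar\psi(y)|0\rangle = S_F(x-y) = \int\frac{d^4p}{(2\pi)^4}\,\frac{i\,(\gamma^\mu p_\mu + m)}{p^2 - m^2 + i\epsilon}\,e^{-ip\cdot(x-y)}$$
Exactly the Dirac propagator from document 2. ✓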
The Grassmann Structure
Notice: the path integral formalism never mentions “creation operators” or “anticommutators” explicitly. Everything comes out of the Grassmann algebra of the integration variables. The fermionic statistics (Pauli exclusion, antisymmetric correlators, minus signs for fermion loops) all emerge naturally from Grassmann rules.
This is the elegance of the path integral: physics is encoded in the mathematical structure of the integration variables.
8. Integrating Out Fermions: Effective Actions
One of the most useful techniques in modern QFT.
The Setup
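Suppose you have a theory with fermions coupled to some bosonic field $\phi$ (could be a scalar, a gauge field, whatever):
$$S[\bar\psi,\psi,\phi] = S_\psi[\bar\psi,\psi,\phi] + S_\phi[\phi]$$
where $S_\psi$ depends on both fermions and bosons, $S_\phi$ depends only on bosons.
Integrating Out the Fermions
The path integral is:
$$Z = \int \mathcal{D}\phi\,\mathcal{D}\bar\psi\,\mathcal{D}\psi\; e^{\,iS_\psi[\bar\psi,\psi,\phi] + iS_\phi[\phi]}$$
If the fermion dependence is bilinear (i.e., $S_\psi = \int d^4x\,\bar\psi\,M[\phi]\,\psi$ for some operator $M[\phi]$ depending on $\phi$), then we can do the fermion integral exactly:
$$\int \mathcal{D}\bar\psi\,\mathcal{D}\psi\; e^{\,i\int d^4x\,\bar\psi M[\phi]\psi} = \det M[\phi]$$
(up to a $\phi$-independent constant). Now we have a bosonic path integral:
$$Z = \int \mathcal{D}\phi\;\det M[\phi]\;e^{\,iS_\phi[\phi]}$$
The Effective Action
Define the effective action:
$$e^{\,iS_{\text{eff}}[\phi]} = \det M[\phi]\;e^{\,iS_\phi[\phi]} \quad\Longrightarrow\quad S_{\text{eff}}[\phi] = S_\phi[\phi] - i\ln\det M[\phi]$$
(The $-i$ comes from the $e^{iS}$ weight in the path integral: writing $\det M = e^{\,i(-i\ln\det M)}$ puts the determinant into the exponent alongside $iS_\phi$.)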
The fermionic effects are now encoded in the determinant. This is a non-local functional of , but it’s well-defined and computable (perturbatively or on the lattice).
Example: QED Effective Lagrangian
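Consider QED with the electron field $\psi$ and the photon $A_\mu$. Integrate out the electron:
$$\int \mathcal{D}\bar\psi\,\mathcal{D}\psi\,\exp\left[i\int d^4x\,\bar\psi\,(i\gamma^\mu D_\mu - m)\,\psi\right] = \det\left(i\gamma^\mu D_\mu - m\right)$$
where $D_\mu = \partial_\mu + ieA_\mu$. Taking the log:
$$S_{\text{eff}}[A] = -\frac14\int d^4x\,F_{\mu\nu}F^{\mu\nu} - i\,\mathrm{Tr}\ln\left(i\gamma^\mu D_\mu - m\right)$$
Expanding this gives the Heisenberg-Euler effective Lagrangian: the effective action for the photon field alone, after integrating out the electrons.
At lowest (quadratic) order in $A_\mu$, this gives the vacuum polarization contribution. At quartic order in the photon field, for slowly varying fields, it gives the famous Heisenberg-Euler result:
$$\mathcal{L}_{\text{HE}} = -\frac14 F_{\mu\nu}F^{\mu\nu} + \frac{\alpha^2}{90\,m^4}\left[\left(F_{\mu\nu}F^{\mu\nu}\right)^2 + \frac74\left(F_{\mu\nu}\tilde F^{\mu\nu}\right)^2\right] + \cdots$$
with $\tilde F^{\mu\nu} = \tfrac12\epsilon^{\mu\nu\rho\sigma}F_{\rho\sigma}$ and $m$ the electron mass.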
This is the leading non-linear correction to Maxwell electrodynamics from QED. It predicts phenomena like:
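- Light-by-light scattering: two photons can scatter off each other (forbidden in classical Maxwell theory, which is linear). Evidence was reported by ATLAS at the LHC in 2017, in ultra-peripheral heavy-ion collisions.
- Schwinger pair production: strong electric fields can spontaneously create electron-positron pairs. Theoretical threshold: $E_c = m_e^2c^3/(e\hbar) \approx 1.3\times10^{18}$ V/m. Not yet reached in the lab, but predicted.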
- Vacuum birefringence: the vacuum in strong magnetic fields becomes birefringent (different speeds for different polarizations). Important near neutron stars.
All of these come from integrating out the electron from QED.
Physical Interpretation
Integrating out heavy fields gives you an effective theory for the light fields. The effective Lagrangian has higher-dimension operators suppressed by powers of the heavy mass. This is the EFT machinery from the RG document, made explicit through Grassmann path integrals.
Any time you have a theory with both light and heavy degrees of freedom, you can systematically integrate out the heavy ones and get an effective description in terms of the light ones. This is how we get:
- Chiral perturbation theory (below the chiral symmetry breaking scale, quarks and gluons are traded for an effective theory of pions)
- The Fermi theory of weak interactions (integrate out W bosons)
- The Euler-Heisenberg Lagrangian (integrate out electrons; valid for photon energies below the electron mass)
The trick works in reverse too: you can integrate out the light fields (via background field methods) to study effective Lagrangians for heavy fields.
9. Perturbation Theory with Fermions
The formalism for perturbation theory is nearly identical to the bosonic case.
The Setup
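For QED with interaction $\mathcal{L}_{\text{int}} = -e\,\bar\psi\gamma^\mu\psi\,A_\mu$:
$$Z = \int \mathcal{D}A\,\mathcal{D}\bar\psi\,\mathcal{D}\psi\; e^{\,iS_0}\,\exp\left[-ie\int d^4x\,\bar\psi\gamma^\mu\psi\,A_\mu\right]$$
where $S_0$ is the free action. Expand in powers of $e$. Each term pulls down factors of $\bar\psi\gamma^\mu\psi\,A_\mu$. Contracting these with external and internal lines gives Feynman diagrams.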
What’s Different
Two key differences from the bosonic case:
1. Fermion lines have directed arrows. The $\psi$ and $\bar\psi$ are different Grassmann fields, so contractions must respect the direction.
2. Closed fermion loops get a minus sign. This comes from the determinant structure: every closed fermion loop is a term in the expansion of $\ln\det M = \mathrm{Tr}\ln M$, and because fermions give $\det M$ where bosons give $1/\det M$, each loop carries a relative factor of $-1$.
Sign Tracking
Careful sign tracking is tedious but mechanical. The rules:
- Each closed fermion loop: factor of $-1$
- Order of spinor factors along a fermion line: follows the arrow backwards (as in canonical Feynman rules)
- Exchange of external fermion legs: factor of $-1$ per swap (reflecting antisymmetry)
These rules emerge automatically from Grassmann combinatorics. You don’t have to memorize them separately; they fall out of keeping track of signs when commuting Grassmann variables past each other.
An Example: The Vacuum Polarization Revisited
The vacuum polarization diagram has a closed fermion loop (electron-positron pair). In the canonical treatment, we imposed the minus sign ad hoc. In the path integral treatment:
- Closed fermion loop corresponds to a trace over spinor indices of the product of propagators
- The trace originates from Grassmann integration: $\ln\det M = \mathrm{Tr}\ln M$ (up to factors)
- The overall minus sign comes from the anticommutations needed to reorder the Grassmann fields so that the loop closes
So the minus sign we used in document 6 is a consequence of Grassmann integration, not an extra rule we had to add.
Loops of Fermions are Actually Determinants
Another way to see the minus sign: each closed fermion loop corresponds to a determinant contribution in $Z$. When we expand the determinant perturbatively, the resulting Feynman diagrams come with $-1$ for fermion loops.
This is consistent with: determinants are products of eigenvalues, and expanding $\ln\det(M_0 + V) = \ln\det M_0 + \mathrm{Tr}\ln(1 + M_0^{-1}V)$ in powers of $V$ gives a sum over loops, each with specific signs.
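The loop expansion of the determinant can be illustrated with a finite matrix. In the sketch below, a small random matrix `K` stands in for $M_0^{-1}V$ (a free propagator times an interaction vertex; this stand-in is an assumption for illustration only). The series $\mathrm{Tr}\ln(1+K) = \sum_{n\ge1} \frac{(-1)^{n+1}}{n}\mathrm{Tr}\,K^n$ is a sum over single closed loops with $n$ insertions; fermions exponentiate it with a plus sign ($\det$), a complex boson with a minus sign ($1/\det$), which is the relative $-1$ per loop.

```python
import numpy as np

rng = np.random.default_rng(2)
K = 0.05 * rng.normal(size=(4, 4))  # stands in for M0^{-1} V; small so the series converges

exact = np.log(np.linalg.det(np.eye(4) + K))

# Tr ln(1 + K) = sum_{n>=1} (-1)^(n+1)/n Tr K^n : one closed loop with n insertions per term
series = 0.0
Kn = np.eye(4)
for n in range(1, 41):
    Kn = Kn @ K
    series += (-1) ** (n + 1) / n * np.trace(Kn)

fermion_factor = np.exp(+series)  # det(1 + K): what a Grassmann Gaussian integral gives
boson_factor = np.exp(-series)    # 1/det(1 + K): what a complex bosonic Gaussian gives
print(exact, series)
```

The truncated series reproduces $\ln\det(1+K)$, and the fermionic and bosonic factors are exact inverses of each other: every loop term enters the two exponents with opposite sign.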
10. The Sign of the Fermion Loop
Let me make the sign rule concrete with a calculation.
Simplest Case: Electron Self-Energy
The one-loop electron self-energy has no closed fermion loops; just a virtual photon being emitted and reabsorbed on a single fermion line. No minus sign.
Vacuum Polarization
The vacuum polarization has one closed fermion loop. Let me trace through the sign.
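In the path integral, the relevant correlator is:
$$\langle j^\mu(x)\, j^\nu(y)\rangle$$
where $j^\mu = \bar\psi\gamma^\mu\psi$.
Expanding (connected part):
$$\langle \bar\psi(x)\gamma^\mu\psi(x)\;\bar\psi(y)\gamma^\nu\psi(y)\rangle = -\,\mathrm{tr}\left[\gamma^\mu\,S_F(x-y)\,\gamma^\nu\,S_F(y-x)\right]$$
where the trace is over spinor indices and runs around the two propagators in the loop. The minus sign comes from anticommuting $\bar\psi(x)$ past $\psi(x)\,\bar\psi(y)\,\psi(y)$ (three Grassmann fields, hence an odd number of sign flips) in the derivation.
Explicitly:
$$\langle \bar\psi_a(x)\,\psi_d(y)\rangle = -\langle \psi_d(y)\,\bar\psi_a(x)\rangle = -\left[S_F(y-x)\right]_{da}$$
Note the argument order is swapped from the “usual” convention $\langle\psi\bar\psi\rangle$, and the indices are swapped. This gives the minus sign automatically.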
Practical Rule
For Feynman diagram calculations, the rule is simple: every closed fermion loop gets an extra factor of $-1$.
This applies to:
- Vacuum polarization (1 fermion loop → factor of $-1$)
- $e^+e^-$ annihilation followed by pair production (no closed loops at tree level → no minus sign)
- The “penguin” diagrams (depending on how loops close)
- Higher-order diagrams with multiple loops (factor of $(-1)^n$ for $n$ loops)
11. Applications: Yukawa, QED, and Beyond
Yukawa Theory
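A scalar and a fermion coupled via a Yukawa interaction:
$$\mathcal{L} = \frac12(\partial_\mu\phi)^2 - \frac12 m_\phi^2\phi^2 + \bar\psi\,(i\gamma^\mu\partial_\mu - m)\,\psi - g\,\phi\,\bar\psi\psi$$
The Yukawa coupling is the simplest interaction between a scalar and fermion. Present in the Standard Model (Higgs-fermion couplings).
In the path integral, this theory is:
$$Z = \int \mathcal{D}\phi\,\mathcal{D}\bar\psi\,\mathcal{D}\psi\,\exp\left[i\int d^4x\,\mathcal{L}\right]$$
Integrating out the fermions gives:
$$Z = \int \mathcal{D}\phi\;\det\left(i\gamma^\mu\partial_\mu - m - g\phi\right)\,e^{\,iS_\phi[\phi]}$$
where $S_\phi$ is just the bosonic part.
For a constant background field $\phi = \phi_0$, the determinant can be computed and gives corrections to the scalar mass and self-interactions. This is how the Higgs boson’s mass receives quantum corrections from Yukawa interactions, which is the origin of the famous hierarchy problem.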
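For a constant background $\phi_0$ the fermion simply has a shifted mass $M(\phi_0) = m + g\phi_0$, so the determinant reduces to the free one with $m \to M(\phi_0)$. The sketch below works in Euclidean signature and uses the per-mode result $\det(i\gamma^E_\mu p_\mu + M) = (p^2 + M^2)^2$; the symbol $V_\psi$ for the fermionic contribution to the effective potential and the spacetime volume factor $\mathcal{V}T$ are notation introduced here.

```latex
\begin{align}
\ln\det\bigl(\gamma^E_\mu\partial_\mu + M(\phi_0)\bigr)
  &= \sum_p \ln\bigl(p^2 + M(\phi_0)^2\bigr)^2
   \;=\; 2\,\mathcal{V}T\int\frac{d^4p}{(2\pi)^4}\,\ln\bigl(p^2 + M(\phi_0)^2\bigr), \\[4pt]
Z_\psi = e^{-\mathcal{V}T\,V_\psi(\phi_0)}
  \quad&\Longrightarrow\quad
V_\psi(\phi_0) = -2\int\frac{d^4p}{(2\pi)^4}\,\ln\bigl(p^2 + (m + g\phi_0)^2\bigr).
\end{align}
% Compare a real scalar of mass m_s, which contributes
% +\tfrac12 \int d^4p/(2\pi)^4 \ln(p^2 + m_s^2): the opposite sign is the
% determinant-versus-inverse-determinant difference again.
```

The integral is divergent and must be regularized; expanding in $\phi_0$ then yields the mass and self-interaction corrections mentioned above.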
Higgs-Fermion Couplings
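In the Standard Model, the Higgs couples to fermions via:
$$\mathcal{L}_{\text{Yukawa}} = -\frac{y_f}{\sqrt2}\,(v + h)\,\bar f f$$
(for each fermion species $f$, so that $m_f = y_f v/\sqrt2$). Integrating out the Higgs (at energies well below its mass) gives an effective theory with four-fermion interactions suppressed by $1/m_h^2$. These are distinct from the Fermi interaction, whose strength $G_F$ comes from integrating out the $W$ boson; the Yukawa matrices instead fix the fermion masses and, after diagonalization, the CKM mixing matrix.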
The top quark Yukawa coupling is the largest. Its loop contributions to the Higgs mass are what make the hierarchy problem acute; without cancellations (like supersymmetry), the Higgs mass would naturally be near the Planck scale rather than 125 GeV.
QED: Revisiting
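For QED, the gauge symmetry constrains interactions. The path integral:
$$Z = \int \mathcal{D}A\,\mathcal{D}\bar\psi\,\mathcal{D}\psi\,\exp\left[i\int d^4x\left(-\frac14 F_{\mu\nu}F^{\mu\nu} + \bar\psi\,(i\gamma^\mu D_\mu - m)\,\psi\right)\right]$$
with $D_\mu = \partial_\mu + ieA_\mu$. Integrating out the electrons (at leading order in the coupling, this is just the one-loop vacuum polarization):
$$Z = \int \mathcal{D}A\,\exp\left[i\int d^4x\,\mathcal{L}_{\text{HE}}\right]$$
where $\mathcal{L}_{\text{HE}}$ is the Heisenberg-Euler Lagrangian mentioned in section 8. This effective Lagrangian describes how photons scatter off each other at low energies (below $m_e$).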
Electroweak Theory
In the electroweak theory (part of the Standard Model), fermions, gauge bosons, and the Higgs all interact. The path integral is a complicated multi-field integral, but the Grassmann structure for fermions remains the same.
Integrating out fermions in various limits gives effective theories for bosons, and vice versa. This is how weak-boson exchange (at energies below $M_W$) becomes the Fermi four-fermion interaction.
Lattice Fermion Theories
Putting fermions on a Euclidean lattice is notoriously tricky because of fermion doubling: naive lattice fermion actions give too many species ($2^4 = 16$ in four dimensions, i.e. 15 extra copies!). This is a consequence of a theorem (Nielsen-Ninomiya) that forbids having a single chiral fermion on a lattice without breaking some nice property.
Workarounds:
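- Wilson fermions: add a term that gives the doublers masses of order the inverse lattice spacing, so they decouple in the continuum limit, at the price of explicitly breaking chiral symmetry.
- Staggered fermions: spread the spinor components over neighboring lattice sites, reducing the 16 species to 4.
- Domain-wall/Overlap fermions: modern approaches preserving chiral symmetry (approximately or in a lattice-modified form), at higher computational cost.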
Each has trade-offs. The difficulties are why lattice QCD took decades to become genuinely predictive.
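The doubler count and the effect of a Wilson term can be seen by looking at the corners of the Brillouin zone. The sketch below assumes the standard naive momentum-space operator $i\sum_\mu\gamma_\mu\sin p_\mu$ in lattice units ($a = 1$), which vanishes wherever every $\sin p_\mu = 0$, and a Wilson term of the form $r\sum_\mu(1-\cos p_\mu)$; the function names and the choice $r = 1$ are illustrative.

```python
import itertools
import math


def naive_kinetic_sq(p):
    """Sum over mu of sin^2(p_mu): zero exactly where the naive massless operator vanishes."""
    return sum(math.sin(x) ** 2 for x in p)


def wilson_term(p, r=1.0):
    """Momentum-dependent mass r * sum_mu (1 - cos p_mu), in lattice units (a = 1)."""
    return r * sum(1.0 - math.cos(x) for x in p)


# sin(p_mu) = 0 within the Brillouin zone only at p_mu = 0 or pi, so check those corners.
corners = list(itertools.product((0.0, math.pi), repeat=4))
naive_zeros = [p for p in corners if naive_kinetic_sq(p) < 1e-20]
still_massless = [p for p in naive_zeros if abs(wilson_term(p)) < 1e-12]

print(len(naive_zeros), "massless species for the naive action")
print(len(still_massless), "massless species once the Wilson term is added")
```

All sixteen corners are zeros of the naive operator, while the Wilson term vanishes only at $p = 0$: a corner with $k$ components equal to $\pi$ picks up a mass $2rk$ in lattice units, which diverges in the continuum limit.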
12. The Sign Problem and Lattice QCD
A subtle problem that limits lattice simulations.
The Issue
In Euclidean path integrals for bosons, $e^{-S_E}$ is real and positive, so the integrand acts like a probability density. Monte Carlo methods can sample it efficiently.
After integrating out fermions, the integrand for bosons contains $\det M[\phi]$, which can be negative or even complex. If it’s not positive, you can’t treat it as a probability measure.
This is the sign problem (or “fermion sign problem”), and it’s a major obstacle to certain classes of lattice calculations.
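Why a non-positive determinant is so damaging can be illustrated with a deliberately simple toy, invented here for illustration and not a model of lattice QCD: one Gaussian variable $\phi$ per site, weighted by a complex factor $1 + e^{\mu + i\phi}$ standing in for a fermion determinant. A common workaround is to sample with $|\det|$ and fold the phase into the observable; the quantity that controls the cost is the average phase $Z/Z_{|\det|}$, which for independent sites is the single-site value raised to the number of sites.

```python
import numpy as np

mu = 2.0  # toy "chemical potential"; hypothetical parameter of this illustration
phi = np.linspace(-12.0, 12.0, 48001)
dphi = phi[1] - phi[0]
gauss = np.exp(-0.5 * phi**2)
det = 1.0 + np.exp(mu + 1j * phi)  # toy complex "determinant" for a single site

Z = np.sum(gauss * det) * dphi             # full partition function (complex weight)
Z_pq = np.sum(gauss * np.abs(det)) * dphi  # phase-quenched: |det| is a valid weight
avg_phase_one_site = (Z / Z_pq).real


def avg_phase(n_sites):
    """Average phase for n independent sites: the single-site value to the n-th power."""
    return avg_phase_one_site**n_sites


for n in (1, 10, 50):
    print(n, avg_phase(n))
```

The single-site average phase is below one, so the signal shrinks exponentially with the number of sites while the statistical noise does not. In this toy that is exact by construction; in real theories the exponential decay with volume is the generic expectation.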
When Does It Occur?
The sign problem is mild or absent for:
- Pure gauge theories (no fermions)
- QCD at zero chemical potential and zero $\theta$-angle
- Some supersymmetric theories
The sign problem is severe for:
- QCD at finite chemical potential (finite baryon density; relevant for neutron stars, quark-gluon plasma)
- Real-time evolution in quantum field theory
- Many condensed matter systems (Hubbard model away from half-filling)
Consequences
Lattice QCD is extremely successful for equilibrium properties at zero chemical potential (hadron spectrum, form factors, etc.) but struggles with:
- The QCD phase diagram at nonzero baryon density
- Real-time dynamics of the quark-gluon plasma
- The equation of state of neutron stars
These are among the most pressing frontiers of lattice QCD research.
Possible Solutions
Various ideas exist:
- Complex Langevin dynamics: reformulate the problem with complexified variables
- Lefschetz thimble: deform the contour of integration to find manifolds where the problem is mild
- Tensor network methods: a completely different approach to representing many-body quantum states
- Quantum simulation: use quantum computers to simulate fermionic systems directly
None of these have fully solved the problem, but progress is steady.
A Quantum Gravity Analog
Interestingly, the sign problem has an analog in quantum gravity; the Euclidean path integral for gravity has unstable modes (the conformal mode is unbounded), making the standard Wick rotation problematic. Various proposals have emerged (Lorentzian path integrals, causal dynamical triangulations, specific saddle points) but no fully satisfactory resolution.
These technical problems may be pointing at deeper issues. If our regulated QFTs still have sign problems, maybe our foundations need rethinking.
13. Appendix: Grassmann Algebra Reference
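Basic Rules
$$\{\theta_i, \theta_j\} = 0, \qquad \theta_i^2 = 0, \qquad a\,\theta_i = \theta_i\,a \quad (a \text{ an ordinary number})$$
Integration
$$\int d\theta\, 1 = 0, \qquad \int d\theta\,\theta = 1, \qquad \int d\theta\,(a + b\,\theta) = b$$
Multi-Variable Integration
For $n$ Grassmann variables:
$$\int d\theta_n \cdots d\theta_1\;\theta_1 \cdots \theta_n = 1$$
Gaussian Integrals
Real Grassmann:
$$\int d\theta_1 \cdots d\theta_n\,\exp\left(-\tfrac12\,\theta_i A_{ij}\theta_j\right) = \pm\,\mathrm{Pf}(A)$$
(Pfaffian of the antisymmetric matrix $A$; the sign depends on the ordering convention for the measure.)
Complex Grassmann:
$$\int \prod_i d\bar\theta_i\, d\theta_i\,\exp\left(-\bar\theta_i M_{ij}\theta_j\right) = \det M$$
With sources:
$$\int \prod_i d\bar\theta_i\, d\theta_i\,\exp\left(-\bar\theta_i M_{ij}\theta_j + \bar\eta_i\theta_i + \bar\theta_i\eta_i\right) = \det M\,\exp\left(\bar\eta_i\,(M^{-1})_{ij}\,\eta_j\right)$$
Differentiation
$$\frac{\partial}{\partial\theta_i}\,\theta_j = \delta_{ij}, \qquad \frac{\partial}{\partial\theta_i}(\theta_j\theta_k) = \delta_{ij}\,\theta_k - \theta_j\,\delta_{ik}$$
Fermion Path Integral
$$Z = \int \mathcal{D}\bar\psi\,\mathcal{D}\psi\,\exp\left[i\int d^4x\,\bar\psi\,(i\gamma^\mu\partial_\mu - m)\,\psi\right]$$
Free-Fermion Generating Functional
$$Z[\bar\eta,\eta] = Z[0,0]\,\exp\left[-\int d^4x\, d^4y\;\bar\eta(x)\,S_F(x-y)\,\eta(y)\right]$$
Two-Point Function
$$\langle 0|T\,\psi(x)\bar\psi(y)|0\rangle = \frac{1}{Z[0,0]}\left(-i\frac{\delta}{\delta\bar\eta(x)}\right)\left(+i\frac{\delta}{\delta\eta(y)}\right)Z[\bar\eta,\eta]\,\Big|_{\eta=\bar\eta=0} = S_F(x-y)$$
Fermion Feynman Propagator
$$S_F(x-y) = \int\frac{d^4p}{(2\pi)^4}\,\frac{i\,(\gamma^\mu p_\mu + m)}{p^2 - m^2 + i\epsilon}\,e^{-ip\cdot(x-y)}$$
The Fermion Loop Sign
Each closed fermion loop in a Feynman diagram contributes an overall factor of $-1$.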
Further Reading
- Peskin & Schroeder, Sections 9.5, 9.6: clean introduction to Grassmann path integrals
- Srednicki, Chapters 43-45: rigorous treatment
- Zinn-Justin, Quantum Field Theory and Critical Phenomena: encyclopedic reference
- Cvetic, Fermion Integrals: mathematical physics perspective
- Wipf, Statistical Approach to Quantum Field Theory: good for the lattice connection
Problems to Work
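1. Verify that $\theta^2 = 0$ (the basic Grassmann identity) follows from $\{\theta_i, \theta_j\} = 0$.
2. Compute $\int d\bar\theta\, d\theta\; e^{-\bar\theta a\theta}$ for an ordinary number $a$. Show it equals $a$ (the “determinant” of a $1\times1$ matrix).
3. Derive the formula $\int \prod_i d\bar\theta_i\, d\theta_i\; e^{-\bar\theta M\theta} = \det M$ for a $2\times2$ case: explicitly compute $\int d\bar\theta_1\, d\theta_1\, d\bar\theta_2\, d\theta_2\,\exp\left(-\bar\theta_i M_{ij}\theta_j\right)$.
4. Compute $\det(\gamma^E_\mu\partial_\mu + m)$ formally in momentum space. Show it’s the product over momenta of $(p^2 + m^2)^2$ (for a single Dirac fermion; four spinor components).
5. Expand the Heisenberg-Euler Lagrangian to show the $\alpha^2/m^4$ dependence for the low-energy photon-photon interaction.
6. Integrate out a heavy fermion from a Yukawa theory and obtain the resulting effective Lagrangian for the scalar.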
Problem 4 is particularly instructive; it shows how fermion functional determinants relate to the free propagator, and reveals the structure of Euclidean QFT cleanly.
Closing Note
Fermionic path integrals complete the basic QFT toolkit. Every particle of matter can now be described via path integrals: bosons through ordinary numbers, fermions through Grassmann variables. The formalism is entirely parallel, with one key difference: fermion Gaussian integrals give $\det M$ rather than $1/\det M$, and this difference ripples through to give all of fermionic statistics: Pauli exclusion, minus signs for fermion loops, antisymmetry of multi-fermion wave functions.
What We’ve Gained
- A systematic treatment of fermions in the path integral formalism
- The technology to integrate out fermions and derive effective Lagrangians (EFTs)
- Clean derivation of fermion loop signs from Grassmann combinatorics (no longer an ad hoc rule)
- Tools for computing non-perturbative quantities like effective potentials and Heisenberg-Euler Lagrangians
- A foundation for understanding subtle issues like anomalies (which show up in the fermionic measure) and the sign problem
What’s Next
With both bosonic (doc 9) and fermionic (doc 10) path integrals, we have the complete machinery for quantizing interacting field theories. The next major hurdle: non-abelian gauge theories.
The canonical approach to Yang-Mills requires Gupta-Bleuler-like tricks that get complicated quickly. The path integral approach, using Faddeev-Popov gauge fixing, is dramatically cleaner. The “ghost” fields that Faddeev-Popov introduces are themselves Grassmann-valued (despite being scalars, not fermions); a beautiful application of the machinery we just developed.
Document 11: Yang-Mills quantization with the Faddeev-Popov procedure. This is where the literal ghosts of Ghostbusters fame make their appearance.
Document 12: The complete Standard Model as a QFT. Everything together; gauge bosons, fermions, Higgs, anomalies; the crown jewel.
Two more documents to go. You’re almost there.