The Dirac equation is a relativistically covariant first-order field equation for a charged spinor field. But by itself it is not yet the full quantum field theory of spin-21 matter.
In quantum field theory, the spinor field ψ(r,t) is not the state vector of one electron. It is a field. After quantization, it becomes an operator-valued field acting on a Fock space containing states with arbitrary numbers of particles and antiparticles.
So the real task is not only to write the Dirac equation. The real task is to construct the Hamiltonian operator whose Heisenberg equation of motion gives the Dirac equation:
iℏ∂t∂ψ^=[ψ^,H^].
To build Dirac quantum field theory, we need several steps:
construct Lorentz scalars and four-vectors from spinors,
write a Lorentz-scalar Lagrangian density,
find the canonical momentum field,
derive the Hamiltonian,
diagonalize the Hamiltonian using normal modes,
decide whether the modes must be quantized bosonically or fermionically,
derive the momentum and charge operators.
The main result is sharp: a spin-21 Dirac field cannot be quantized consistently with bosonic commutation relations. Bosonic quantization makes the Hamiltonian unbounded from below. Fermionic anti-commutation relations fix the sign problem and give a stable theory. This is one route to the spin-statistics connection for the Dirac field.
The quantized field has two kinds of excitations. The a-operators create particles, and the c-operators create antiparticles. Both have positive energy, but they carry opposite charge.
Learning Objectives
Explain why the Dirac equation alone is not yet a full quantum field theory.
Construct Lorentz scalars and four-vectors from left- and right-handed two-spinors.
Understand why ψˉ=ψ†γ0 is needed.
Write the Dirac Lagrangian density.
Derive the canonical momentum field of the Dirac field.
Derive the Hamiltonian from the Dirac Lagrangian.
Expand the Dirac field in normal modes using eigenspinors u±(k) and v±(k).
Understand the momentum-space Dirac Hamiltonian as a 4×4 matrix eigenvalue problem.
Explain why bosonic quantization gives an unstable Hamiltonian.
Explain why fermionic quantization gives a Hamiltonian bounded from below.
Derive the momentum and charge operators of the free Dirac field.
Prerequisite Knowledge
Left-handed and right-handed Lorentz two-spinors
Dirac equation in two-spinor and four-spinor notation
Gamma matrices
Pauli matrices
Lorentz covariance
Canonical field theory
Noether charges
Creation and annihilation operators
Bosonic commutators and fermionic anti-commutators
1. Scalars from two-spinors
Let η be a left-handed Lorentz two-spinor and let ξ be a right-handed Lorentz two-spinor.
Under an infinitesimal rotation, both transform with the same spin-21 matrix:
η→η′=(1+2iϵ⋅σ)η,ξ→ξ′=(1+2iϵ⋅σ)ξ.
Therefore the mixed product
ξ†η
is invariant under rotations:
ξ†η→ξ′†η′=ξ†(1+2iϵ⋅σ)†(1+2iϵ⋅σ)η=ξ†η
to first order in ϵ.
Under boosts, left- and right-handed spinors transform oppositely:
η→η′=(1+21ϵ⋅σ)η,ξ→ξ′=(1−21ϵ⋅σ)ξ.
So
ξ†η→ξ′†η′=ξ†(1−21ϵ⋅σ)†(1+21ϵ⋅σ)η=ξ†η
again to first order. Thus
ξ†η is a Lorentz scalar.
Since
η†ξ=(ξ†η)∗,
it is also a Lorentz scalar.
This is the first key structural point: Lorentz scalars are naturally made by pairing a left-handed spinor with a right-handed spinor.
2. Four-vectors from two-spinors
Let vμ=(v0,v) be a covariant four-vector. The combinations
ξ~:=(v0+σ⋅v)η
and
η~:=(v0−σ⋅v)ξ
transform respectively as right-handed and left-handed spinors.
Since left-right scalar products are Lorentz scalars, the quantities
This is a contravariant Lorentz four-vector built symmetrically from the left- and right-handed parts.
The insertion of γ0 is essential. For Lorentz spinors, time and space are not treated like four Euclidean directions. The object ψ†ψ is not a Lorentz scalar. The correct scalar uses γ0.
This motivates the standard Dirac adjoint:
ψˉ:=ψ†γ0.
With this notation,
ψˉψ=ψ†γ0ψ
is a scalar, and
ψˉγμψ=ψ†γ0γμψ
is a four-vector.
Another common abbreviation is Feynman slash notation. For any four-vector Aμ,
A:=Aμγμ.
So, for example,
∂:=γμ∂μ.
4. Why the Dirac equation is not the full quantum theory
The Dirac equation in four-spinor notation is
iℏγμ∂μψ=Mcψ.
This is a relativistically covariant field equation, but it is not yet the full quantum field theory.
In the Schrödinger picture, the Hamiltonian evolves the quantum state:
iℏdtd∣Ψ⟩=H^∣Ψ⟩.
Here ∣Ψ⟩ is a state vector in the full Hilbert space of the theory. It is not the spinor field ψ(r,t).
In the Heisenberg picture, operators evolve according to
iℏ∂t∂A^=[A^,H^].
For the Dirac field operator, this equation should reproduce the Dirac equation:
iℏ∂t∂ψ^=[ψ^,H^]=cγ0(−iℏγ⋅∇+Mc)ψ^.
So the central question is:
What is H^, and what is ψ^ as an operator?
5. The Dirac Lagrangian density
A relativistic field theory must have a Lorentz-scalar Lagrangian density. The natural scalar terms built from ψ are
ψˉγμ∂μψ
and
ψˉψ.
After a convenient rescaling of the field, the free Dirac Lagrangian density can be written as
L=iℏcψˉγμ∂μψ−Mc2ψˉψ.
Using ψˉ=ψ†γ0, this is
L=iℏcψ†γ0γμ∂μψ−Mc2ψ†γ0ψ.
The Euler-Lagrange equation of this Lagrangian is the Dirac equation.
6. Canonical momentum field
The canonical momentum conjugate to ψ is
Πψ=∂(∂tψ)∂L.
Since
∂0=c1∂t,
the time-derivative part of the Lagrangian is
iℏcψ†γ0γ0∂0ψ=iℏψ†∂tψ.
Therefore
Πψ=iℏψ†.
This first-order structure is different from scalar field theory, where the canonical momentum is proportional to a time derivative of the field. For the Dirac field, the canonical momentum is directly proportional to ψ†.
7. The Dirac Hamiltonian
The Hamiltonian is obtained by the Legendre transform
H=∫d3r(Πψ∂t∂ψ−L).
Substituting the Dirac Lagrangian gives
H=∫d3r(−iℏcψ†γ0γ⋅∇ψ+Mc2ψ†γ0ψ).
Equivalently, using Π=iℏψ†,
H=∫d3r(cΠγ0γ⋅∇ψ−iℏMc2Πγ0ψ).
This Hamiltonian gives the Dirac equation as the canonical equation of motion for ψ.
8. Normal-mode expansion of the Dirac field
To quantize the theory, we need to diagonalize the Hamiltonian.
For a scalar Klein-Gordon field, Fourier transformation almost immediately diagonalizes the free Hamiltonian. For a Dirac field, Fourier transformation alone is not enough because ψ has four spinor components.
Substituting the normal-mode expansion into the Hamiltonian gives
H=∫d3kΨ†γ0(ℏcγ⋅k+Mc2)Ψ.
Using the positive- and negative-energy eigenspinors gives
H=s=±∑∫d3kE(k)(as∗(k)as(k)−cs(k)cs∗(k)).
The minus sign in the c-sector is the crucial problem. It is not a harmless convention. It determines which quantization rule gives a stable Hamiltonian.
13. Why bosonic quantization fails
Suppose we quantize bosonically:
as→a^s,as∗→a^s†,cs→c^s,cs∗→c^s†,
with canonical commutation relations.
Then the Hamiltonian contains
−E(k)c^s(k)c^s†(k).
For bosons,
c^c^†=c^†c^+1.
So this term becomes
−E(k)(c^†c^+1).
The energy decreases without bound as more c-quanta are added. Therefore bosonic quantization gives a Hamiltonian that is unbounded from below.
Bosonic quantization of the Dirac field is unstable.
14. Fermionic quantization fixes the Hamiltonian
Now quantize fermionically, using anti-commutation relations. For fermions,
ψ^ destroys a particle or creates an antiparticle.
For the electron field, this means that ψ^ destroys electrons and creates positrons. The adjoint field performs the opposite operations.
This is why the Dirac equation should not be interpreted as a single-particle Schrödinger equation. In quantum field theory, it is the field equation obeyed by an operator-valued spinor field.
Worked Example 1: Why ψˉψ is the scalar mass term
A naïve expression like
ψ†ψ
is positive-looking, but it is not a Lorentz scalar. It treats the spinor components as if boosts were unitary rotations, which they are not.
The correct scalar is
ψˉψ=ψ†γ0ψ.
Writing ψ=(η,ξ)T, this becomes
ψˉψ=ξ†η+η†ξ.
This couples left-handed and right-handed spinors, exactly as a Lorentz scalar should. That is why the Dirac mass term is
−Mc2ψˉψ.
Worked Example 2: Why fermionic quantization fixes the sign problem
The diagonalized classical Hamiltonian contains
E(k)(a∗a−cc∗).
Bosonic quantization gives
E(k)(a^†a^−c^c^†)=E(k)(a^†a^−c^†c^−1),
which becomes arbitrarily negative as more c-quanta are added.
Fermionic quantization gives
c^c^†=1−c^†c^,
so
E(k)(a^†a^−c^c^†)=E(k)(a^†a^+c^†c^)−E(k).
The remaining negative term is only a constant vacuum-energy shift, not an instability. The excitation spectrum is positive.
Intuition
The Dirac equation becomes a proper quantum field theory only after the Hamiltonian is built and quantized.
The essential logic is:
ψ is a field, not a one-particle state.
Lorentz covariance forces the use of ψˉ=ψ†γ0.
The scalar Lagrangian gives a first-order Hamiltonian.
Fourier transformation leaves a 4×4 spinor matrix to diagonalize.
The matrix has positive- and negative-energy eigenspinors.
The negative-energy sector produces a dangerous sign.
Fermionic anti-commutation relations turn that sign into a stable antiparticle sector.
The result is a field whose excitations are fermions with positive energy. The two operator families create particles and antiparticles of opposite charge.
Common Mistakes
Treating ψ(r,t) as the wavefunction of one electron.
Forgetting that the QFT state vector lives in Fock space, not in spinor-component space.
Using ψ†ψ as a Lorentz scalar mass term.
Forgetting the definition ψˉ=ψ†γ0.
Thinking Fourier transformation alone diagonalizes the Dirac Hamiltonian.
Ignoring the spinor eigenvalue problem at each momentum.
Missing the negative sign in the c-sector of the diagonalized classical Hamiltonian.
Thinking bosonic quantization is a harmless alternative.
Forgetting that fermionic anti-commutation relations are what make the Hamiltonian bounded from below.
Forgetting that particles and antiparticles have opposite charge but positive energy.
Short Summary
Dirac quantum field theory starts by constructing Lorentz scalars and four-vectors from spinors. A left-right product such as ξ†η is a scalar, while combinations like
(η†η,η†ση)
and
(ξ†ξ,−ξ†σξ)
are four-vectors. In four-spinor notation, this becomes the standard Dirac adjoint
ψˉ=ψ†γ0,
so that ψˉψ is a scalar and ψˉγμψ is a four-vector.
The free Dirac Lagrangian is
L=iℏcψˉγμ∂μψ−Mc2ψˉψ.
Its canonical momentum field is
Πψ=iℏψ†,
and its Hamiltonian is
H=∫d3r(−iℏcψ†γ0γ⋅∇ψ+Mc2ψ†γ0ψ).
In momentum space, diagonalization requires solving