The derivation of these results for the Dirac theory is identical to that for the KG theory [review the arguments which led from Eq. (4.73) to Eq. (4.78)]. Each of these two expressions for differs from its KG counterpart [which is (4.76b) for the first and (4.78) for the second] only in the sign of the negative energy term. And here, as in the KG theory, the propagation of the negative energy states backward in time is interpreted as the propagation of the corresponding antiparticle states forward in time (recall Fig. 4.3).
The difference in sign of the negative energy contributions to the KG and Dirac expressions for /(2) will appear again in field theory. In that discussion the sign difference will come from the fact that Dirac particles satisfy FermiDirac statistics (i.e., their field operators anticommute) and that when the time ordering of the interactions is changed, as it is for the negative energy states, there is an extra minus sign for fermions.
We now investigate the nonrelativistic limit of the Dirac equation. As we did for the KleinGordon equation, we will work out the expansion to order (v/c)2 ~ (p/m)2x leading terms. In making our estimates, we assume all potentials V° and V to be of the same order as the kinetic energy term (justified by the virial theorem). Since all of these leading terms are of order p2/m, we want all terms up to order p4/m3.
Assume a positive energy solution of the form where E = m + T. Then, using the Dirac equation, the coupled equations for \{r) and 77(7) become
In the nonrelativistic limit, T, \p\, and all components of VM = e.AM are assumed to be very much smaller than m. Hence, the second of the two equations (5.52) shows that the lower components of the Dirac spinor are very much smaller than the upper components, and therefore the equations are solved approximately by eliminating the lower components, as we did for the KG equation. However, if we proceed directly by solving the lower equation for 77 and substituting the solution into the equation for x< we obtain
Since T is of the same order as V°, which is ~ p2/m, it is necessary to expand the denominator of the second term if we want to collect all terms of order p4/m3.
Tx = V0x + <r(p V)v (2m + T)n = V°r] + (T ■ (p  V) x ■
This expansion gives
4 7TI2<T
Note the presence of the energy T in the last term on the righthand side. This means that the effective Hamiltonian defined by Eq. (5.54) is dependent on the energy, and an energydependent Hamiltonian leads to many complications which should be avoided, if possible. The explicit dependence on the energy should be eliminated. Since the T dependence occurs only in the highest order term, it might seem that it could be removed by replacing it by an estimate obtained from the solution of the lower order equation, i.e.,
However, this method will not give a unique answer because T is a number and commutes with <r • (p — V), while V°, part of the above estimate for T, does not. It is better to attack the problem from a different direction.
A better method, known as the FoldyWouthuysen (FW) transformation [FW 50], is to transform the equations to a new form in which the offdiagonal elements of the Hamiltonian are so small that the leading order estimate of the lower components (which does not depend on the energy T) is sufficient to get the effective Hamiltonian to the desired order of accuracy. For example, in this problem where we want the Hamiltonian to order p4/m3, it would be sufficient to reduce the offdiagonal elements to order p2/m. If they were that small, the leading contribution from the lower components would be of order p2/m2, and their contribution to the equation for x would therefore be of order p4/rri3, sufficient for our purposes. In the KG case treated in the last chapter, the offdiagonal elements were initially that small, so we were able to get the desired result immediately. Here, the offdiagonal elements of the Dirac equation are of (larger) order p, so the simplest approach did not work.
To prepare for the application of the FW transformation, return to the matrix equations (5.52), and write them in terms of Dirac matrices
The offdiagonal terms are those involving the Dirac matrices a, and they are large (of order m°). We want to transform the equation so that they are of order m~l. Then, when the equation is solved, T will not enter into the m~3 term.
The equation will be transformed using a general unitary transformation constructed from the Dirac matrices. Since the large offdiagonal terms we wish to
Hence, choosing A = ~ gives
•^offdiag — a ■ V , which is O (m1) by assumption.
With these approximations, the coupled equations (5.52) become
where only the largest (leading) terms have been retained in every element but H'n, which is yet to be reduced. We may now neglect Trf in the second equation, giving
Noting that the large
The remaining task is to reduce H'n using A = terms proportional to m occur in the combination (1 + /3), which makes no contribution to the H'n matrix element, we have, to O (m~3),
where the first three terms on the RHS are the expansion of A V°A, the first two in the second line are the expansion of the contributions from Ua(p — V)U'"1, and the last is the combined contribution from Um0U~l. To further reduce these terms we will use the identity
Using this identity gives
=■ 2p2 p ■ V  V p  ier ■ (p x V)  ier ■ (V x p) = iP  V)2 +P2 ~ V2 ~ tr ■ [V x V] = (pV)2+p2V2etrB , (5.68)
where the use of square brackets will mean that p or V operates only within the brackets. Note the new term describing a magnetic moment interaction, which
Darwin term
8m2 v 8m2 8m2 w
Because of the 63(r), this term is nonzero for Sstates only. Physically, it comes from quantum fluctuations in the position of the electron, referred to as Zitterbewegung (jittering motion), which make the electron sensitive to the average potential in the vicinity of its average position. The average of the potential is proportional to V20 ~ <53(r), and this accounts for the general structure of the Darwin term.
where S = <r/2 is the electron spin operator. This term is due to the interaction of the electron's magnetic moment with the magnetic field it sees due to its motion and automatically includes the Thomas precession, which reduces the result naively expected by a factor of 2. It is zero in Sstates, because L = 0.
The Darwin term contributes only to L = 0 states, and the spin orbit term only to states where L ^ 0, but when both corrections are taken into account, the spin orbit splitting is given by a single formula which depends only on the principal quantum number and the total angular momentum j of the state,
2 n2
The first term is the familiar nonrelativistic result, and the second is the fine structure, which splits states with the same n but different j. In the next chapter we will show that the exact solutions of the Dirac equation also predict levels which depend on n and j only. This gives a good account of the main features of the hydrogen atom spectrum, but the additional ¿dependent Lamb shift can only be explained by field theory, as we discussed in Chapter 3.
Zeeman Effect (Dirac)
The full Zeeman effect comes from two terms. The orbital part is the same as the result obtained from the KG equation and was calculated in Eq. (4.60). The result is
¿(P.A+A.PHZTL . Combining this with the spin part, —eB ■ cr/2m, gives
Note the factor of 2 for the electron's intrinsic gyromagnetic ratio. This factor has no classical explanation but was discovered empirically before the Dirac equation was discovered. Its automatic appearance in the Dirac theory is one of its major successes and provides the only "explanation" for this effect that we have.
The Dirac space is fourdimensional but is otherwise an abstract space unrelated to physical spacetime. To discuss the Lorentz transformation (LT) of a Dirac wave function, the Dirac equation, or a Dirac matrix element requires that we first construct a representation of each Lorentz transformation on the Dirac space and then show that the wave functions and matrix elements transform in such a way that the Dirac equation is invariant in form and the matrix elements transform as scalars, fourvectors, or tensors, depending on their structure. In this section the properties of the Lorentz group will be reviewed, and in the next two sections the representation of the Lorentz group on the Dirac space will be worked out and the construction and transformation of Dirac matrix elements will be discussed.
In Sec. 2.1 we discussed how Lorentz transformations change the spacetime coordinates. Any transformation which leaves the metric tensor invariant is, by definition, a LT. In the matrix notation, Eq. (2.8), this was written
The set of all transformations which satisfy this constraint form a group, which is called the homogeneous Lorentz group. The four group properties are easily demonstrated:
• If Ai and A2 are members of the group, then AXA2 is also, because
• The multiplication law (matrix multiplication in this case) is associative:
• There exists an identity A = 1 which is a Lorentz transformation.
• For each A, there exists an inverse A1 because
and hence det A = ±1, and since it is not zero, A1 exists. Multiplying Eq. (2.8) by (AT) 1 from the left and A1 from the right gives
G = (A_1)t GA"1 showing that A1 is a Lorentz transformation.
Figure 5.3 illustrates this continuity by showing the four classes as disconnected regions, with a continuous distribution of transformations within each region (class). The figure and the above equations show that to study the homogeneous Lorentz group, it is sufficient to study the group of continuous transformations L\ and the two discrete transformations T and P.
The complex LT's must also have detA = ±1, but the restriction (5.78) on Aoo no longer holds [because (Aj0)2 need no longer be positive]. Therefore
Label 
Properties 
Class 
Continuity with 
Aoo > 1 detA = +1 
orthochronous, proper restricted group 
1  
Aoo < 1 detA = +1 
nonorthochronous, proper 
TP  
Ll 
Aoo > 1 detA = 1 
orthochronous, improper 
P 
Li 
Aoo < 1 detA = 1 
nonorthochronous improper 
T 
5.8 THE LORENTZ GROUP
the complex LT's separate into only two classes, depending on the sign of the determinant, and the transformations in L\ and L\j_ can now be connected by a continuous path. As an example of such a "path," consider the transformations
cos 6 
¿sin0  
COS0 
 sinö  
sin 0 
cos 8  
i sin# 
which depend on the continuous parameter 9. These transformations satisfy (2.8) for all values of 9 and hence map out a continuous path of transformations in the space of complex LT's. By varying 9 continuously from 0 to 7r, we are able to connect the transformations 1 and 1. This fact will be of crucial importance to our discussion of the PCT theorem in Sec. 8.7.* Infinitesimal Transformations in l\ Consider the real LT's in the subgroup L\. Because they can be continuously connected to the identity, they can be written where 9 is a number and A is said to be the generator of the transformation A. [This is assumed without proof. It is a general property of a continuous group.] The structure of the group can be inferred from the structure of the generators A. To study this structure, it is sufficient to consider those transformations for v hich 9 = e is infinitesimally small. In this case, the transformations can be expanded and only the first order terms retained, so that It is easy to determine the structure of A from this equation, which looks like A31 A41 \

Post a comment