Spin waves via HP bosons


We use the Holstein-Primakoff (HP) boson representation of spin operators, with $n_b = b^\dagger b$ the boson number,
$$\begin{eqnarray} S^+ &=& (\sqrt{2 S - n_b}) b,\cr S^- &=& b^\dagger (\sqrt{2 S - n_b}),\cr S^z &=& S - n_b, \end{eqnarray}$$
in order to understand spin waves around both ferromagnetic and antiferromagnetic ground states. For the antiferromagnetic case, these provide a way to expand systematically around the (effectively classical) large-$S$ limit, as will be seen. We follow Auerbach chapter 11 mostly.
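As a quick sanity check of the HP representation, the sketch below builds $S^+$, $S^-$, and $S^z$ as matrices on the physical boson states $n_b = 0, \ldots, 2S$ and tests the commutator $[S^+, S^-] = 2 S^z$. The names `hp_spin_matrices`, `matmul`, and `max_abs_diff` are illustrative helpers introduced here, not anything from Auerbach, and the choice $S = 3/2$ is arbitrary.

```python
import math

def matmul(a, b):
    """Multiply two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def hp_spin_matrices(two_s):
    """Return (S^+, S^-, S^z) in the boson number basis |0>, ..., |2S>.

    The basis index n is the boson number n_b; two_s is the integer 2S.
    """
    dim = two_s + 1
    s = two_s / 2
    sp = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        # S^+ |n> = sqrt(2S - (n - 1)) * sqrt(n) |n - 1>
        sp[n - 1][n] = math.sqrt((two_s - n + 1) * n)
    # S^- is the Hermitian conjugate (here: transpose) of S^+
    sm = [[sp[j][i] for j in range(dim)] for i in range(dim)]
    sz = [[(s - n) if n == m else 0.0 for m in range(dim)] for n in range(dim)]
    return sp, sm, sz

def max_abs_diff(a, b):
    """Largest elementwise difference between two matrices."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

two_s = 3  # spin S = 3/2, an arbitrary test value
sp, sm, sz = hp_spin_matrices(two_s)
pm, mp = matmul(sp, sm), matmul(sm, sp)
dim = two_s + 1
commutator = [[pm[i][j] - mp[i][j] for j in range(dim)] for i in range(dim)]
two_sz = [[2 * x for x in row] for row in sz]
print("max |[S+, S-] - 2 S^z| =", max_abs_diff(commutator, two_sz))
```

The square root $\sqrt{2S - n_b}$ makes $S^-$ annihilate the state with $n_b = 2S$, which is why the $2S+1$ physical states close under the spin algebra.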

First let's do ferromagnetic spin waves again. Our approach is going to be to expand the square roots in the HP relations in powers of $1/S$; this is justified as long as the spin fluctuations are relatively small,
$$\begin{equation} \langle n_b \rangle \ll 2 S, \end{equation}$$
which in principle we should check at the end of the calculation. This is basically a check to see whether spin waves are so strong that in fact they destroy the long-range order, as indeed happens in 1D for the antiferromagnet.

Suppose that the ordered ground state is in the ${\hat z}$ direction, and expand the Hamiltonian as
$$\begin{eqnarray} H &=& - |J| \sum_{\langle ij \rangle} S_i \cdot S_j \cr &=& -S^2 |J| N z / 2 - |J| \sum_{\langle ij \rangle} \left[ S b^\dagger_i \sqrt{1 - n_i/2S} \sqrt{1 - n_j/2 S} b_j -\frac{1}{2}S (n_i+n_j)+\frac{1}{2} n_i n_j\right]. \end{eqnarray}$$
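Here $z$ is the coordination number, and the sum in the bracketed term runs over both orientations of each bond (each bond appears twice, which accounts for the factors of $\frac{1}{2}$). We now expand the square roots and keep terms proportional to $S^2$, $S$, and $S^0$, but not $S^{-1}$. This gives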
$$\begin{eqnarray} H &\approx& -S^2 |J| N z / 2 + H_1 + H_2 + O(1/S)\cr H_1 &=& \sum_{\bf k} \omega_{\bf k} b^\dagger_{\bf k} b_{\bf k}\cr H_2 &=& \frac{|J|}{4} \sum_{\langle ij\rangle} \left[b^\dagger_i b^\dagger_j (b_i - b_j)^2 + (b^\dagger_i - b^\dagger_j)^2 b_i b_j\right]. \end{eqnarray}$$
We are not going to do anything with the quartic terms in $H_2$, since these are smaller than those in $H_1$ if the boson occupancy is small. Here the bosonic spin wave operators are
$$\begin{equation} b_{\bf k} = {1 \over \sqrt{N}} \sum_{i} e^{-i {\bf k}\cdot {\bf x}_i} b_i \end{equation}$$
and their energies are
$$\begin{equation} \omega_{\bf k} = S |J| z \left( 1 - z^{-1} \sum_{j,\langle ij \rangle} e^{i ({\bf x}_j - {\bf x}_i) \cdot {{\bf k}}} \right). \end{equation}$$
Here the sum is over all the $z$ nearest neighbors of one particular site, just as in a single-band tight-binding model. For the cubic lattice, for example, this just becomes
$$\begin{equation} \omega_{\bf k} = S |J| (6 -2 \cos(k_x a)-2 \cos(k_y a) - 2 \cos(k_z a)). \end{equation}$$
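In Auerbach's notation, with $\gamma_{\bf k} = z^{-1} \sum_{j,\langle ij \rangle} e^{i ({\bf x}_j - {\bf x}_i) \cdot {\bf k}}$, the above is written as (with $z=6$ for the cubic lattice)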
$$\begin{equation} \omega_{\bf k} = S |J| z (1 - \gamma_{\bf k}). \end{equation}$$

Again, this is what we would get for a tight-binding model, but with an offset so that zero $k$ corresponds to zero energy. For the cubic lattice, we have at small ${\bf k}$
$$\begin{equation} \omega_{\bf k} \approx S |J| a^2 k^2. \end{equation}$$
This soft mode is the Goldstone boson corresponding to broken spin rotational invariance. We will see that the antiferromagnetic spin wave is rather different: it has multiple zero-energy points for the cubic lattice, rather than just the single point ${\bf k}=0$, and also has a linear dispersion relation near these points, rather than a quadratic one.
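To see how quickly the quadratic form takes over, here is a minimal sketch comparing the cubic-lattice dispersion $S|J|z(1-\gamma_{\bf k})$ with $S|J|a^2k^2$ along one direction. The function names and the values $S=1/2$, $|J|=1$, $a=1$ are assumptions made for illustration only.

```python
import math

def gamma_cubic(kx, ky, kz, a=1.0):
    """Nearest-neighbour factor gamma_k for the simple cubic lattice (z = 6)."""
    return (math.cos(kx * a) + math.cos(ky * a) + math.cos(kz * a)) / 3.0

def omega_fm(kx, ky, kz, S=0.5, J=1.0, a=1.0):
    """Ferromagnetic spin-wave energy S|J|z(1 - gamma_k) with z = 6."""
    return S * abs(J) * 6 * (1.0 - gamma_cubic(kx, ky, kz, a))

def omega_fm_small_k(kx, ky, kz, S=0.5, J=1.0, a=1.0):
    """Long-wavelength form S|J| a^2 k^2."""
    return S * abs(J) * a ** 2 * (kx ** 2 + ky ** 2 + kz ** 2)

# Shrink a fixed wavevector toward k = 0 and compare the two expressions.
for scale in (1.0, 0.3, 0.1, 0.01):
    k = (0.5 * scale, 0.3 * scale, 0.2 * scale)
    ratio = omega_fm(*k) / omega_fm_small_k(*k)
    print(f"scale {scale:5.2f}: exact / quadratic = {ratio:.6f}")
```

The ratio approaches 1 from below as ${\bf k} \to 0$, since the next term in the expansion of the cosines is negative.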

Before moving on to the antiferromagnet, let's look at how the ferromagnetic order is reduced by spin-wave excitations for $T>0$. The change in the magnetization per site at finite temperature is given by
$$\begin{equation} \Delta m_0 = {1 \over N} \langle S^z_{tot} \rangle - S = -\langle n_i \rangle = - {1 \over N} \sum_{\bf k} n_{\bf k}. \end{equation}$$
The occupation number $n_{\bf k}$ is just given by Bose-Einstein statistics,
$$\begin{equation} n_{\bf k} = {1 \over e^{\omega_{\bf k} / T} - 1}. \end{equation}$$

In order to understand the effect of temperature, let $k_0$ be an "infrared cutoff": some small momentum that we will keep as a lower bound when converting the sum over spin-wave modes to an integral. The important physics will be that the behavior under removing this cutoff is strongly dimensionality-dependent, which tells us something about the destruction of order at finite temperature. We also introduce a larger momentum ${\bar k}>k_0$, with the idea that below this momentum the quadratic form $\omega_{\bf k} \approx S |J| a^2 k^2$ is valid.
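Using the low-energy form of the Bose factor, $n_{\bf k} \approx T/\omega_{\bf k}$ for $\omega_{\bf k} \ll T$, setting $a=1$, and dropping constant angular factors, the sum then becomes
$$\begin{equation} \Delta m_0 = -\int^{\bar k}_{k_0} {dk\,k^{d-1} \over (2 \pi)^d} {T \over |J| S k^2} - N^{-1} \sum_{|k|>{\bar k}} {1 \over e^{\omega_{\bf k} / T}-1}. \end{equation}$$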
The second part is independent of $k_0$ and finite, so we ignore it. We want instead to focus on the asymptotic behavior of the first part for small $k_0$. In one dimension it diverges as $-T/(k_0 |J| S)$, while in two dimensions it diverges logarithmically, as $T (\log k_0)/(|J| S)$; in three dimensions it stays finite as $k_0 \to 0$.
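The dependence on the cutoff can be checked numerically. The sketch below evaluates $\int_{k_0}^{\bar k} dk\, k^{d-1}/k^2$ with the prefactors $T/(|J|S)$ and $(2\pi)^{-d}$ stripped off, for a few values of $k_0$; the function name `infrared_integral`, the choice ${\bar k}=1$, and the midpoint rule on a logarithmic grid are all choices made here for illustration.

```python
import math

def infrared_integral(d, k0, kbar=1.0, steps=20000):
    """Midpoint-rule estimate of int_{k0}^{kbar} dk k^(d-1) / k^2 on a log grid.

    Prefactors T/(|J| S) and (2 pi)^(-d) are omitted; only the k0 dependence
    is of interest.
    """
    log_lo, log_hi = math.log(k0), math.log(kbar)
    h = (log_hi - log_lo) / steps
    total = 0.0
    for i in range(steps):
        k = math.exp(log_lo + (i + 0.5) * h)
        total += k ** (d - 3) * k * h  # dk = k d(log k)
    return total

for d in (1, 2, 3):
    values = [infrared_integral(d, k0) for k0 in (1e-2, 1e-4, 1e-6)]
    print(f"d = {d}:", ", ".join(f"{v:.4g}" for v in values))
```

In $d=1$ the result grows like $1/k_0$, in $d=2$ like $\log(1/k_0)$, and in $d=3$ it saturates, which is the dimensionality dependence described above.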

These divergences mean that the spin-wave approximation, which required the boson occupation to be a small perturbation of the ordered state ($\langle n_b \rangle \ll 2S$), is not justified at finite temperature in 1D and 2D: even at low temperature the computed reduction of the magnetization diverges. The physical interpretation is that the Heisenberg model has no long-range order at finite temperature in 1D and 2D, in agreement with the Mermin-Wagner theorem (cf. Phys 212). You can show that the reduction of the moment in 3D is proportional to $T^{3/2}$, so that the magnetic order is stable up to some nonzero temperature.
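The $T^{3/2}$ behavior in 3D can be checked with a short numerical sketch. It makes two simplifying assumptions that hold only at low temperature: the quadratic form $\omega_{\bf k} = D k^2$ (with $D = S|J|a^2$ and $a=1$) is used for all ${\bf k}$, and the momentum integral is extended beyond the Brillouin zone, where the Bose factor is exponentially small. The function name `delta_m_3d` and the cutoff at $\omega = 40\,T$ are choices made here.

```python
import math

def delta_m_3d(T, D=1.0, steps=20000):
    """Continuum estimate of -Delta m_0 in 3D for omega_k = D k^2.

    Integrates k^2 / (2 pi^2) times the Bose factor over 0 < k < k_max
    using the midpoint rule.
    """
    k_max = math.sqrt(40.0 * T / D)  # Bose factor is of order e^-40 beyond this
    h = k_max / steps
    total = 0.0
    for i in range(steps):
        k = (i + 0.5) * h
        total += k * k / math.expm1(D * k * k / T) * h
    return total / (2.0 * math.pi ** 2)

for T in (0.01, 0.04, 0.16):
    print(f"T = {T:5.2f}: -Delta m_0 = {delta_m_3d(T):.6e}")
print("ratio for T -> 4T:", delta_m_3d(0.04) / delta_m_3d(0.01))
```

Quadrupling $T$ multiplies the result by $4^{3/2} = 8$, and no infrared cutoff is needed because the integrand is finite as $k \to 0$ in three dimensions.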

Now we consider the antiferromagnetic case. We assume a bipartite lattice: a lattice that can be divided into two sublattices $A$ and $B$ so that every bond connects one site from $A$ and one site from $B$. Then the classical ground state is just all spins up on $A$ and all down on $B$, modulo a global rotation. It is useful to rotate the spin quantization axis between sublattices, so that zero bosons at every site corresponds to this classical Néel state: then on sublattice $B$ we have
$$\begin{equation} {\tilde S}^z_j = -S^z_j,\quad {\tilde S}^x_j = S^x_j,\quad {\tilde S}^y_j = - S^y_j. \end{equation}$$
These satisfy the same commutation relation as the original spin operators and therefore can be represented by HP bosons. In this new representation, the Hamiltonian is
$$\begin{equation} H = -|J| \sum_{\langle ij \rangle} S^z_i {\tilde S}^z_j + \frac{|J|}{2}\sum_{\langle ij \rangle} (S^+_i {\tilde S}^+_j + S^-_i {\tilde S}^-_j). \end{equation}$$
(Because of the rotation on one sublattice, the above has $++$ and $--$ terms instead of combinations of one raising and one lowering operator.) Consider one pair of sites: substituting HP bosons and ignoring quartic and $O(1/S)$ terms leaves
$$\begin{equation} -|J| S^z_i {\tilde S}^z_j + \frac{|J|}{2} (S^+_i {\tilde S}^+_j + S^-_i {\tilde S}^-_j) \approx -|J| S^2 + |J| S (n_i+n_j) + |J| S (b_i b_j +b^\dagger_i b^\dagger_j). \end{equation}$$
We can rewrite these in terms of $b_{\bf k}$ operators as before: the sum of $n_i$ over sites equals the sum of $n_{\bf k}$ over all ${\bf k}$ (Parseval's theorem), and the $bb$ and $b^\dagger b^\dagger$ terms pick up the same factor $\gamma_{\bf k}$ as the hopping terms did for the ferromagnet:
$$\begin{eqnarray} H &=& - S^2 |J| N z / 2 + H_1 \cr H_1 &=& |J| S z \sum_{\bf k} \left[ b^\dagger_{\bf k} b_{\bf k} + {\gamma_{\bf k} \over 2} (b_{\bf k} b_{-{\bf k}} + b^\dagger_{{\bf k}} b^\dagger_{-{\bf k}}) \right]. \end{eqnarray}$$
Here we see that $H_1$ is a bit more complicated: it is still just quadratic in the bosonic operators, but has some terms that change the overall number of bosons. Does this remind you of anything? We use the same Bogoliubov transformation to diagonalize this Hamiltonian that we used to get the $\gamma$ operators in superconductivity, with the difference being that now we are dealing with bosonic operators. The basis change we want is to new spin-wave operators $\alpha_{\bf k}$:
$$\begin{eqnarray} \alpha_{\bf k} &=& \cosh \theta_{\bf k} b_{\bf k} - \sinh \theta_{\bf k} b^\dagger_{-{\bf k}} \cr b_{\bf k} &=& \cosh \theta_{\bf k} \alpha_{\bf k} + \sinh \theta_{\bf k} \alpha^\dagger_{-{\bf k}}. \end{eqnarray}$$
The operators $\alpha$ are still bosonic: for example,
$$\begin{equation} [\alpha_{\bf k},\alpha^\dagger_{\bf k}] = \cosh^2 \theta_{\bf k} [b_{\bf k},b^\dagger_{\bf k}]+\sinh^2 \theta_{\bf k} [b^\dagger_{-{\bf k}},b_{-{\bf k}}]= 1, \end{equation}$$
and the $[\alpha,\alpha]$ and $[\alpha^\dagger,\alpha^\dagger]$ commutators vanish.

In terms of the new $\alpha$ operators, we have
$$\begin{eqnarray} H_1 &=& |J| S z \sum_{\bf k} \Big[ (\cosh 2 \theta_{\bf k} + \gamma_{\bf k} \sinh 2 \theta_{\bf k}) \alpha^\dagger_{\bf k} \alpha_{\bf k} \cr &&+ \frac{1}{2} (\sinh 2 \theta_{\bf k} + \gamma_{\bf k} \cosh 2 \theta_{\bf k}) (\alpha^\dagger_{\bf k} \alpha^\dagger_{-{\bf k}} + \alpha_{\bf k} \alpha_{-{\bf k}})+\sinh^2 \theta_{\bf k} + {\gamma_{\bf k} \over 2} \sinh 2 \theta_{\bf k} \Big]. \end{eqnarray}$$
We can now choose the $\theta_{\bf k}$ to make the non-boson-number-conserving terms vanish:
$$\begin{equation} \tanh 2 \theta_{\bf k} = - \gamma_{\bf k}, \end{equation}$$
so
$$\begin{eqnarray} H_1 &=& \sum_{\bf k} \omega_{\bf k} \left(\alpha^\dagger_{\bf k} \alpha_{\bf k}+\frac{1}{2}\right) - {|J| S z N \over 2},\cr \omega_{\bf k} &=& |J| S z \sqrt{1 - \gamma_{{\bf k}}^2}. \end{eqnarray}$$
This is different from the ferromagnet in two important ways: the function $\omega_{\bf k}$ is linear near its minima rather than quadratic, and quantum fluctuations cause a reduction in the ground state energy of order $S$:
$$\begin{equation} \Delta E = \frac{1}{2} \sum_{\bf k} |J| S z (\sqrt{1 - {\gamma_{\bf k}}^2} - 1). \end{equation}$$
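The algebra of the Bogoliubov step is easy to check numerically. The sketch below evaluates, for a few values of $\gamma_{\bf k}$, the three coefficients that multiply $|J|Sz$ in $H_1$ once $\tanh 2\theta_{\bf k} = -\gamma_{\bf k}$ is imposed, and then estimates $\Delta E/N$ for the square lattice ($z=4$) by averaging over a grid of ${\bf k}$ points. The function names, the $64 \times 64$ grid, and the choice of the square lattice are assumptions made for illustration; the points $\gamma_{\bf k} = \pm 1$ are excluded from the first check because $\theta_{\bf k}$ diverges there.

```python
import math

def bogoliubov_coefficients(gamma):
    """Coefficients of H_1 / (|J| S z) for one k with tanh(2 theta) = -gamma."""
    theta = 0.5 * math.atanh(-gamma)
    diagonal = math.cosh(2 * theta) + gamma * math.sinh(2 * theta)
    anomalous = 0.5 * (math.sinh(2 * theta) + gamma * math.cosh(2 * theta))
    constant = math.sinh(theta) ** 2 + 0.5 * gamma * math.sinh(2 * theta)
    return diagonal, anomalous, constant

for gamma in (-0.9, -0.3, 0.0, 0.5, 0.99):
    diag, anom, const = bogoliubov_coefficients(gamma)
    root = math.sqrt(1.0 - gamma ** 2)
    print(f"gamma = {gamma:5.2f}: diag - root = {diag - root:.1e}, "
          f"anomalous = {anom:.1e}, const - (root - 1)/2 = {const - 0.5 * (root - 1):.1e}")

def mean_root_square_lattice(L=64):
    """Average of sqrt(1 - gamma_k^2) over an L x L grid of k points (z = 4)."""
    total = 0.0
    for nx in range(L):
        for ny in range(L):
            g = 0.5 * (math.cos(2 * math.pi * nx / L) + math.cos(2 * math.pi * ny / L))
            total += math.sqrt(max(0.0, 1.0 - g * g))  # guard against rounding
    return total / (L * L)

avg = mean_root_square_lattice()
# Delta E / N = (|J| S z / 2) * (avg - 1); with z = 4 this is 2 |J| S (avg - 1)
print("Delta E / (N |J| S) on the square lattice:", 2.0 * (avg - 1.0))
```

The anomalous coefficient vanishes, the diagonal one reduces to $\sqrt{1-\gamma_{\bf k}^2}$, and the constant term gives the zero-point shift, which is negative because $\sqrt{1-\gamma_{\bf k}^2} \le 1$ for every ${\bf k}$.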

For the $d$-dimensional cubic lattice (with $a=1$), near ${\bf k} = (0,0,\ldots)$ and ${\bf k}=(\pi,\pi,\ldots)$ the spin-wave spectrum looks like $\omega_{\bf k} \sim |J| S \sqrt{2 z}\, |{\bf k} - {\bf k}_{min}|$, so there are two zero-energy points, with the same spin-wave velocity near each. This prediction has been strikingly confirmed via neutron scattering on antiferromagnets.
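The linear form and the velocity $|J| S \sqrt{2z}$ can be checked against the full dispersion. This is a minimal sketch for the three-dimensional cubic lattice with $a=1$; the function names, the values $S=1/2$ and $|J|=1$, and the direction of approach are arbitrary choices made here.

```python
import math

def omega_afm(k, S=0.5, J=1.0):
    """Antiferromagnetic spin-wave energy |J| S z sqrt(1 - gamma_k^2), a = 1."""
    d = len(k)
    z = 2 * d
    gamma = sum(math.cos(ki) for ki in k) / d
    return abs(J) * S * z * math.sqrt(max(0.0, 1.0 - gamma * gamma))

def linear_form(k, k_min, S=0.5, J=1.0):
    """Long-wavelength form |J| S sqrt(2 z) |k - k_min|."""
    z = 2 * len(k)
    dist = math.sqrt(sum((ki - mi) ** 2 for ki, mi in zip(k, k_min)))
    return abs(J) * S * math.sqrt(2 * z) * dist

direction = (0.6, 0.0, 0.8)  # unit vector, chosen arbitrarily
for k_min in ((0.0, 0.0, 0.0), (math.pi, math.pi, math.pi)):
    for q in (0.3, 0.1, 0.01):
        k = tuple(m + q * n for m, n in zip(k_min, direction))
        print(f"k_min = {k_min[0]:.3f}*(1,1,1), |q| = {q}: "
              f"exact / linear = {omega_afm(k) / linear_form(k, k_min):.5f}")
```

Both zero-energy points give the same ratio at the same $|q|$, because $\gamma_{\bf k}$ only changes sign under ${\bf k} \to {\bf k} + (\pi,\pi,\pi)$ and $\omega_{\bf k}$ depends on $\gamma_{\bf k}^2$.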

A somewhat unsatisfactory way to explain the difference between $\omega \sim |k|$ and $\omega \sim k^2$ is to argue that the ferromagnet breaks time-reversal symmetry "more strongly" than the antiferromagnet, since on long length scales the antiferromagnet's mean magnetization is zero. If time-reversal symmetry were preserved, then a relation like $\omega \sim k^2$ would be impossible because the two sides of the equation transform differently under the time-reversal operator. The problem with this argument, of course, is that the antiferromagnet also breaks time-reversal, but at least the heuristic argument is good for remembering which dispersion relation goes with which magnet.

Note that for both the ferromagnet and the antiferromagnet the fundamental excitations carry integer spin (we showed this in an earlier lecture for the ferromagnet). An active area, which you can read more about in Auerbach and also in the textbook of Fradkin, is how some "spin liquids" can have deconfined spin-half excitations known as spinons.
