Discovering the Fourier Transform Through Quantum Mechanics

Introduction

The Fourier transform has a million applications across physics, engineering, pure math, and more. Given a suitable function $\psi(x)$, it enables us to decompose it as a sum of complex waves, $e^{ikx}$,

$$ \psi(x) = \frac{1}{\sqrt{2\pi}} \int \limits_{-\infty}^\infty\mathrm{d}k~e^{ikx} \hat{\psi}(k), $$

where the coefficients $\hat{\psi}(k)$ are similarly given by

$$ \hat{\psi}(k) = \frac{1}{\sqrt{2\pi}} \int \limits_{-\infty}^\infty\mathrm{d}x~e^{-ikx} \psi(x). $$

If these formulas are new to you, they probably look a bit formidable! In this lesson, I want to explain what they mean by exploring how they arise in the context of quantum mechanics—where $\psi(x)$ is the wavefunction of a quantum particle. Because, as we’ll discover, the Fourier transform plays a critically important role in this subject.

Of course, the Fourier transform was first written down long before the beginnings of quantum mechanics—by Joseph Fourier in the early 1800s. But even if it hadn’t been, the transform still would have inevitably been discovered once we attempted to understand the strange laws of the quantum world. And in this lesson, we’re going to see how the Fourier transform appears so naturally in quantum mechanics, and to use the physics to build up an intuition for what these mathematical expressions mean.

Now, I want you to be able to appreciate all this even if you’re not already terribly familiar with the abstract rules of quantum mechanics. And so for the purposes of this lesson, you’ll mainly need to know two things going in, which we’ll basically take as axioms:

A quantum particle is described by its wavefunction $\psi(x)$, which is a function that assigns a number to each point $x$ in space—a complex number, in general—and which tells us the probability of where we’ll find the particle when we go to measure its position. In particular, the probability of finding the particle at position $x$, within a little window of width $\mathrm{d}x$, is given by the square of the wavefunction, times the little width $\mathrm{d}x$: $$ \mathrm{Prob}(x) = |\psi(x)|^2\mathrm{d}x.\ $$ $$\vphantom{|}$$ The wavefunction will generally also depend on time, but that won’t be important for this lesson.
Given a wavefunction, the momentum is described by an operator $P$ that acts on the wavefunction by taking its derivative, times an overall factor: $$ P \psi(x)= \frac{\hbar}{i} \frac{\mathrm{d} }{\mathrm{d} x } \psi(x). $$ $\hbar$ here is Planck’s constant, which is the fundamental physical constant of quantum mechanics. And $i = \sqrt{-1}$ is the imaginary number. As we’ll see shortly, it’s there to ensure that the numbers we predict when we actually measure the momentum come out to be real—because again the wavefunction $\psi(x)$ itself is in general going to be complex.

We’ll better understand what these formulas mean as we go along. But for the moment, these are the two main facts about quantum mechanics that we need going into this lesson, and now we’re ready to begin to discover how all this is connected to the mathematics of the Fourier transform.

The Fourier Series

To understand the role of the Fourier transform in quantum mechanics, we’re actually going to start off by thinking about a particle living on a circle of radius $R$ instead of an infinite line. Living on a circle will make the math a little bit easier to begin with, and then later on we’ll see what happens when we go back to infinite space.

Here’s the difference between living on a circle versus an infinite line: if you start going for a walk, after you’ve traveled a distance of $2\pi R$ you look around and discover that you’re just back where you originally started!

As far as our quantum mechanics goes, now that any two points $x$ and $x + 2\pi R$ are identified, the key implication is that our wavefunction has to take the same value at each one:

$$ \psi(x) = \psi(x+2\pi R). $$

After all, $x$ and $x + 2\pi R$ label the same point in space, so $\psi$ had better be the same when we plug in either coordinate.

In other words, the wavefunction has to be periodic—with period $2\pi R$—in order for it to define a single-valued function on our circular space.

So, for example, $\psi(x) = x$ wouldn’t be allowed here, since when you take a walk over by $2\pi R$, the value of the function changes. But if we took something like $\psi(x) = \cos(\frac{x}{R})$ we’re golden, because now shifting $x \to x +2\pi R$ leaves the value of the function unchanged:

$$ \cos \left( \frac{x + 2\pi R}{R} \right) = \cos \left( \frac{x}{R} + 2\pi \right) = \cos \left( \frac{x}{R} \right). $$

Speaking of cosines, the fact that our wavefunction $\psi(x)$ is periodic means that under very general conditions we can expand it as a sum of cosines and sines using a Fourier series:

$$ \psi(x) = \sum_{n=0}^\infty A_n \cos \left( \frac{nx}{R} \right) + B_n \sin \left( \frac{nx}{R} \right) $$

This isn’t the Fourier transform yet—that’ll come soon. For now we’re just talking about the ordinary Fourier series that lets us expand most any periodic function as a sum of sines and cosines.

For instance, here’s a simple example of a wavefunction for a particle that’s localized in a little window:

Inside the window, $\psi(x)$ is just a constant, and so the particle has an equal chance of being found anywhere in that range. But outside, the wavefunction goes to zero, and so we’ll never find the particle there.

We can make this wavefunction periodic simply by demanding that it repeats itself over and over again with period $2\pi R$, and that way we can write down its Fourier series. After we add up just a few terms in the sum, it quickly begins to close in on the precise shape of the wavefunction:

This idea is really powerful, and it’s the foundation of everything we’re going to discuss in this lesson. Each term in the Fourier series is just a simple sinusoidal function—a wave that oscillates up and down forever. But when we add them together, the waves interfere with each other—constructively in some places and destructively in others—and the result quickly closes in on the wavefunction.

Here are the first five waves in the Fourier series, whose sum equals the approximation to the wavefunction drawn in the previous picture:

Actually, cosines and sines can be a little bit awkward to work with, and so for our purposes here it will be more convenient to write the Fourier series in terms of exponentials. We can do that thanks to Euler’s identity, which says that

$$ e^{i\theta} = \cos \theta + i \sin \theta, $$

and which therefore lets us go back and forth between sines and cosines and exponentials.

That lets us write the Fourier series much more conveniently as

$$ \psi(x) = \sum_{n = -\infty}^\infty \psi_n e^{inx/R}. $$

This is the same expansion, I’ve just rearranged it a bit using Euler’s identity, and now I’m writing the coefficients as $\psi_n$. We’re summing over every integer, $n = 0, \pm 1, \pm 2, \pm 3,$ and so on.

Let’s take some time to unpack what’s going on here. Each term in the sum is a wave, called a Fourier wave or Fourier mode, which I’ll write as

$$ e^{in x/R} = e^{ikx}, $$

with $k = n/R$. In particular, it’s a complex wave, because Euler’s identity says that

$$ e^{ikx} = \cos(kx) + i \sin(kx). $$

So the real part is the wave $\cos(kx)$, and the imaginary part is $\sin(kx)$—which is the same wave, it’s just shifted over from the cosine by $90^\circ$.

For each of these Fourier waves $e^{ikx}$ in the sum, this number $k$ is called the wavenumber of the wave—it’s analogous to frequency, except that here we’re talking about a wave that’s oscillating in space, not time. A big value of $k$ means a wave that’s oscillating very rapidly, and which therefore has a short wavelength. A small value of $k$ means a wave that’s oscillating very slowly, with a long wavelength.

The fact that the sum only includes the special values $k = n/R$ with $n$ an integer is critical here. We’ll come back to the significance of that in a moment, but first we need to appreciate the intimate connection between these Fourier waves and the quantum mechanical momentum operator that we talked about at the beginning.

Let’s see what happens when we apply the momentum operator, $P = \frac{\hbar}{i} \frac{\mathrm{d} }{\mathrm{d}x }$, to one of our Fourier modes, $e^{ik x}$:

$$ P e^{ikx}= \frac{\hbar}{i} \frac{\mathrm{d}}{\mathrm{d}x}e^{ikx}. $$

The derivative will bring down a factor of $ik$ from the exponent, and the $i$ will cancel against the $i$ in the denominator, leaving

$$ P e^{ikx} = \hbar k e^{ikx}. $$

Something very special has happened here. When we apply the momentum operator to the Fourier mode, we get back the same function $e^{ikx}$ multiplied by a constant, $\hbar k$. $e^{ikx}$ is called an eigenfunction of the momentum operator, and $\hbar k$ is called the corresponding eigenvalue. When you apply an operator to one of its eigenfunctions, by definition you get back the same function times the eigenvalue.

This is important, because in quantum mechanics the eigenvalues of an operator are the numbers that you can get back when you make a measurement of that quantity. And so when we measure the momentum of a particle whose wavefunction is a pure Fourier wave $e^{ikx}$, the value we’ll find is Planck’s constant $\hbar$ times the wavenumber $k$.

Notice that those values are real, by the way, as we’d of course expect since we’re talking about a physical measurement here. That’s why we divided by $i$ in the definition of the momentum operator—the $i$ in the denominator cancels against the $i$ that came down with the derivative of $e^{ikx}$.

So these Fourier waves are very special. They’re the eigenfunctions of the momentum operator, and therefore each one describes a state with definite momentum. But in general the total wavefunction won’t be described by any single Fourier wave, and therefore it won’t have definite momentum. A general wavefunction will be given by a superposition of many such waves with those special wavenumbers $k = n/R$. That’s what the Fourier series expressed,

$$ \psi(x) = \sum_{k = n/R} \psi_k e^{ikx}, $$

where the coefficients tell us how much each individual Fourier wave is contributing to the total wavefunction.

(I’ll go back and forth between writing the Fourier series in terms of $n$ and in terms of $k = n/R$. $\psi_n$ and $\psi_k$ refer to the same thing—the coefficient of the term $e^{ikx} = e^{inx/R}$ in the sum.)

When we go to measure the momentum of our particle, all those wavenumbers $k = \frac{n}{R}$ appearing in the Fourier series are fair game, and therefore we can get back any of those potential values $\hbar k = \hbar \frac{n}{R}$ for the momentum. The probability of each possibility is given by the square of its proportion of the total wavefunction,

$$ \mathrm{Prob}(p = \hbar k) \propto |\psi_k|^2. $$

(I’m writing $\propto$ instead of $=$ here just because we need to ensure that all the probabilities add up to a total of one. That means there’ll be an extra constant factor multiplying $|\psi_k|^2$, but that doesn’t matter right now.)

So that’s why the Fourier series is so closely connected to the momentum of our quantum particle. It tells us which momenta are contributing to the wavefunction, and therefore the probabilities of what we’ll find when we go to measure it.

I’ll explain how to actually compute the coefficients in just a minute, but first we skimmed over a really important point earlier: why are the wavenumbers—and therefore the values of the momentum—only allowed to take these special values $k = n/R$?

The reason comes back to the fact that we’ve put our particle on a circular space, where as we discussed the wavefunction has got to be periodic. Meanwhile, our Fourier waves are of course also periodic, but the periodicity of the waves has to be compatible with the periodicity of the circular space.

In other words, we need to be able to fit a whole number of wavelengths into the circumference of our circle. They’re a little like the standing waves you may have learned about in high school physics when you pin down the two ends of a rope (except that these waves aren’t going to sit still, in general, as time goes on.)

If a whole number of wavelengths didn’t fit into the length of the circle, then when you take a walk all the way around and come back to where you started, the value of the wave will have changed, and the function won’t make any sense on our circle.

In particular, when we shift $x \to x+2\pi R$, corresponding to walking all the way around the circle, $e^{ikx}$ transforms as

$$ e^{ikx} \to e^{ikx}e^{2\pi i kR}. $$

So for arbitrary values of $k$, that’s not going to be invariant. We only get a wave that’s well-defined on our circle when $e^{2\pi i kR} = 1$. And that’s where the condition that $k R$ has to be an integer, $n$, comes in, because $e^{2\pi i n}$ is indeed equal to one.

You can see that again from Euler’s identity,

$$ e^{2\pi i n} = \cos(2\pi n) + i \sin(2\pi n). $$

The angles we’ve got here are $0,2\pi,4\pi$ and so on, all of which have $\cos = 1$ and $\sin = 0$.

So that’s why the wavenumbers of our Fourier modes are restricted to be of this form, an integer divided by the radius of the circle. And correspondingly, that’s why the momentum of the particle is quantized as $\hbar n/R$.

Abstractly, what the Fourier series means is that the set of all the Fourier waves with these special values of $k$ forms a basis for the space of square-integrable functions on a circle. Where square-integrable means that

$$ \int\limits_{-\pi R}^{\pi R} \mathrm{d}x~ |\psi(x)|^2 < \infty. $$

And indeed, since the integral of $|\psi(x)|^2$ is the total probability for finding the particle somewhere—which we want to be equal to one—the wavefunction will be square-integrable.

To say that the Fourier waves $e^{ik x}$ form a basis for this space means that we can expand any wavefunction as a superposition of them, with some appropriate coefficients, and that’s exactly what the Fourier series expresses. But we haven’t actually discussed yet how we’re supposed to figure out those coefficients, and there’s a beautiful trick for getting them.

To see how, we’ll start with the Fourier series again,

$$ \psi(x) = \sum_{n = -\infty}^\infty \psi_n e^{inx/R}, $$

and now multiply each side by $e^{-im x/R}$ for some other integer $m$:

$$ e^{-i m x/R}\psi(x) = \sum_{n = -\infty}^\infty \psi_n e^{i(n-m)x/R}. $$

What we’re going to do is integrate both sides of this equation over the circumference of the circle, from $-\pi R$ to $+\pi R$:

$$ \int\limits_{-\pi R}^{\pi R} \mathrm{d}x~e^{-i m x/R}\psi(x) = \sum_{n = -\infty}^\infty \psi_n \int\limits_{-\pi R}^{\pi R} \mathrm{d}x~ e^{i(n-m)x/R}. $$

This looks a little complicated, but actually we get something very simple here. On the right-hand-side, we’re doing the integral of $e^{i(n-m)x/R}$ over the circumference of the circle. But like we just discussed, we specifically chose these functions to ensure that a whole number of oscillations fit into this interval.

Then when we do the integral of a periodic function like this over a whole number of periods, the positive parts where the wave is above the $x$-axis precisely cancel the negative parts where it’s below, and we get zero!

Or at least, we usually get zero. There’s one exception, and that’s when $n$ and $m$ happen to be the same integer. Because in that special case, $n-m$ in the exponent vanishes, and we’re left with the integral of $e^0 = 1$. And that comes out to the circumference, $2\pi R$:

$$ \int\limits_{-\pi R}^{\pi R} \mathrm{d}x~ e^{i(n-m)x/R}= \begin{cases} \displaystyle 2\pi R & n = m\\ \displaystyle 0 & n \neq m \end{cases} $$

So the complicated looking formula with the sum and the integral isn’t actually very complicated at all. Almost all the terms in the sum over $n$ are equal to zero. The only non-zero term is the special one where $n$ happens to coincide with this other integer $m$ that we chose. And so at the end of the day all we’re left with is

$$ \int\limits_{-\pi R}^{\pi R} \mathrm{d}x~e^{-i m x/R}\psi(x) = 2\pi R \psi_m. $$

And now we’ve got our Fourier coefficients! We just divide the $2\pi R$ over to the other side, and we find that all we need to do to compute each coefficient for a given wavefunction $\psi(x)$ is evaluate this integral

$$ \psi_n = \frac{1}{2\pi R}\int\limits_{-\pi R}^{\pi R} \mathrm{d}x~e^{-i n x/R}\psi(x), $$

where I’ve switched the label back to $n$ like we were using before.

In words, we take the wavefunction $\psi(x)$, multiply it by the complex conjugate Fourier wave $e^{-inx/R}$, and then integrate that over the circle. And finally we divide by the circumference, and that gives the Fourier coefficient. Or, in terms of $k$,

$$ \psi_k = \frac{1}{2\pi R}\int\limits_{-\pi R}^{\pi R} \mathrm{d}x~e^{-i k x}\psi(x), $$

where again, in a slight abuse of notation, $\psi_k$ and $\psi_{n}$ refer to the same thing: the coefficient of $e^{ikx} = e^{inx/R}$ in the series.

So for our example of a wavefunction from earlier, you can evaluate those integrals and you’ll get a distribution of Fourier coefficients like this, for the first handful of terms:

Each data point here is telling us how much the Fourier wave $e^{ikx}$ with that particular value of $k = n/R$ is contributing to the given wavefunction.

The Fourier Transform

Okay, that was already a lot of information. And now we’re ready to see how what we’ve learned about the quantum mechanics of the Fourier series extends to the full-fledged Fourier transform. But first let’s quickly summarize the main things we’ve learned so far.

First, a quantum mechanical particle is described by a wavefunction $\psi(x)$, whose square $|\psi(x)|^2$ tells us the probability density of finding the particle at position $x$ when we make a measurement,

$$ \mathrm{Prob}(x) = |\psi(x)|^2 \mathrm{d}x. $$

Second, for a particle that lives on a circle, we can expand the wavefunction as a sum over Fourier waves, $\psi(x) = \sum_k \psi_k e^{ikx}$, where $k$ is forced to take the discrete values $k = n/R$ for any integer $n$.
And third, each individual Fourier wave has definite momentum, $\hbar k$, and the probability of obtaining that value when we make a measurement is determined by that mode’s contribution to the Fourier series, squared:

$$ \mathrm{Prob}(p = \hbar k) \propto|\psi_k|^2. $$

Notice, though, that there’s a strange sort of asymmetry here between the position and momentum. The particle can be found at any position $x$ along the circular space, while the momentum can only take these discrete values, like $0$, $\hbar/R$, $2\hbar/R$, and so on.

The origin of the discrepancy comes back to making our space a circle. But now we’re prepared to investigate what happens when we go back to studying our particle on the infinite $x$-axis, and in doing so we’re going to see that the Fourier transform falls out.

We constructed our circle of radius $R$ in the first place by identifying each pair of points on the real line that differed by a multiple of $2\pi R$. But now we can go the other way if we like by sending $R$ out to infinity. In that sense, the problem of a particle on a line is just a special limit of a particle on a circle, and in fact we’ve really gotten most of the hard work out of the way already.

So let’s think about what happens in the $R \to \infty$ limit. First of all, we found that the allowed values for the momentum were

$$ p = \frac{\hbar n}{R}. $$

They came in this discrete lattice of points, one for each integer $n$, separated by $\hbar/R$. But notice that in the limit $R \to \infty$, that separation goes to zero. Then instead of a discrete lattice of possible momenta, we’ll find a continuous line. And therefore the particle in an infinite space can take any value for its momentum, just like it could for its position, eliminating the asymmetry between the two variables.

Now let’s see what happens to our Fourier series in the infinite radius limit. First I’m just going to relabel things a little bit. Let me rewrite the coefficients as

$$ \psi_k = \frac{1}{\sqrt{2\pi}R} \hat{\psi}(k), $$

where as before $k = n/R$ are the discrete values of the wavenumber. Putting in the $1/R$ will help us keep control over the $R \to \infty$ limit, and the factor of $\sqrt{2\pi}$ is just a matter of convention. Then the Fourier series is

$$ \psi(x) = \sum_{k = n/R}\frac{1}{\sqrt{2\pi }R}\hat\psi(k) e^{ikx}. $$

And as for the coefficients $\hat{\psi}(k) = \sqrt{2\pi}R\psi_k$, using our earlier result for $\psi_k$ we have

$$ \hat\psi(k) = \frac{1}{\sqrt{2\pi} }\int\limits_{-\pi R}^{\pi R} \mathrm{d}x~e^{-i k x}\psi(x). $$

These are the exact same formulas as before, all I’ve done is change the notation a little bit to make it clearer how we’re going to take the $R \to \infty$ limit, which we’re now ready to do.

Let’s start with the sum over $k$. Just like the momentum, our values of $k = n/R$ were originally spaced out on a discrete lattice, separated by $\Delta k = \frac{1}{R}$. Notice that we also have a $1/R$ in the Fourier series; let’s go ahead and replace that with this spacing $\Delta k$:

$$ \psi(x) = \sum_{k = n/R} \Delta k \cdot \frac{1}{\sqrt{2\pi }}\hat\psi(k) e^{ikx}. $$

Now what is this sum telling us to do? It contains this function of $k$: $\frac{1}{\sqrt{2\pi}} \hat{\psi}(k)e^{ikx}$. Here’s a cartoon of what it might look like:

It’s complex of course, so let’s say this picture is the real part, and you could draw a similar picture of whatever the imaginary part happens to look like.

Then what we’re told to do is take the value of this function at each of these special points $k = n/R$. Then we multiply that by the width $\Delta k$ of each interval. That gives us the area of each rectangle aligned at those values of $k$, as I've drawn above.

Finally, the sum instructs us to add all these areas up.

As we let $R$ get really big, though, the separation $\Delta k$ is going to zero, and the width of each rectangle is getting very skinny. Then as we add up all the areas of the skinny rectangles between the ever more finely spaced values of $k$, we obtain the area under the curve.

In other words, in the $R \to \infty$ limit, the sum over the discrete values $k = n/R$ turns into an integral over the continuous variable $k$:

$$ \sum_{k = n/R} \Delta k \overset{R \to \infty}{\longrightarrow} \int\limits_{-\infty}^\infty \mathrm{d}k. $$

And so the result of the Fourier series in the $R \to \infty$ limit is

$$ \psi(x) = \frac{1}{\sqrt{2\pi}} \int\limits_{-\infty}^\infty\mathrm{d}k~ e^{ikx} \hat{\psi}(k). $$

As for our other formula for the components $\hat{\psi}(k)$, that limit is even simpler. All that changes is that the bounds of the integral $\int_{-\pi R}^{\pi R} \mathrm{d}x$ now extend out to $\pm \infty$,

$$ \hat\psi(k) = \frac{1}{\sqrt{2\pi} }\int\limits_{-\infty}^{\infty} \mathrm{d}x~e^{-i k x}\psi(x). $$

And there we have it! This pair of formulas defines the Fourier transform of the wavefunction, together with the inverse Fourier transform. And we’ve found that it simply emerges from the Fourier series for the wavefunction on a circle, in the limit where the circle becomes infinitely large.

But the understanding we developed by studying the quantum mechanics of a particle on a circle gives us a good intuition for understanding the Fourier transform in an infinite space, which I think otherwise might look a little imposing the first time you see it.

Because even though our discrete Fourier series from earlier has turned into an integral, the underlying idea is exactly the same as before. Each Fourier wave $e^{ikx}$ represents a state of definite momentum, $\hbar k$. And what we’re doing is writing a general wavefunction $\psi(x)$ as a superposition of these basic waves:

$$ \psi(x) \sim \sum_k e^{ikx} \hat{\psi}(k). $$

The only difference is that we had a discrete sum for a particle on a circle, whereas now we get a continuous integral when we put the particle on a line. That again was because for the circle we had to be able to fit a whole number of wavelengths inside the circumference, and that meant that $k$ could only take the discrete values $n/R$. Whereas on an infinite line, there’s no constraint on the wavelength, and $k$ can take any value it wants—it’s a continuous parameter.

The coefficients $\psi_k$ or $\hat{\psi}(k)$ simply tell us how much of each individual wave we need to include in the sum in order to reproduce our given wavefunction. We wrote them as $\psi_k$ (or $\psi_n$) in the circle case because it was just a discrete list of numbers. But in the continuous case it’s a whole function of the continuous variable $k$, which is why I wrote it as $\hat{\psi}(k)$—and we throw the hat on there just so we don’t get it confused with the original wavefunction.

When it comes to the momentum, since a general wavefunction is going to be a superposition of many different Fourier waves, each with its own momentum $\hbar k$, when we go to measure the momentum of the particle, we can potentially find any of those individual values.

All we can predict ahead of time is the probability that we’ll find one value or another when we make a measurement. And as you might guess based on what we’ve discussed so far, the probability of finding $k$ in a little window of width $\mathrm{d}k$, is given by the proportion of that Fourier wave, squared, multiplied by that little width $\mathrm{d}k$:

$$ \mathrm{Prob}(p = \hbar k) = |\hat{\psi}(k)|^2\mathrm{d}k. $$

This may sound familiar. In fact, we’ve found a striking parallel with our discussion of the position probability function from earlier: the probability of finding the position of the particle in a little window of width $\mathrm{d}x$ around a point $x$ was given by

$$ \mathrm{Prob}(x) = |\psi(x)|^2\mathrm{d}x. $$

Therefore, whereas $|\psi(x)|^2$ tells us the probability density for finding the particle at position $x$, its Fourier transform squared is the probability density for finding the particle with momentum $\hbar k$.

We call $\psi(x)$ the position space wavefunction to emphasize this, and $\hat{\psi}(k)$ the momentum space wavefunction. They contain the same information about the system, since we can get either one from the other by applying the Fourier transform.

And so we could just as well write the fundamental rules of quantum mechanics in terms of the momentum space wavefunction—it’s just a different way of packaging the information about the state of the particle.

More abstractly, the quantum state is a vector called the state vector, denoted by $\ket{\psi}$. And similar to how you can represent an ordinary vector $\vec{v}$ in a Cartesian basis, $\vec{v} = v_x \hat{x} + v_y \hat{y}$, just as well as you can write it in a polar basis, $\vec{v} = v_r \hat{r} + v_\theta \hat{\theta}$, we can likewise represent the state vector $\ket{\psi}$ in the position basis or the momentum basis. $\psi(x)$ and $\hat{\psi}(k)$ are simply the components of this same vector in those two bases.

But unpacking all the details of that will have to wait for another lesson.

By the way, to further complete the parallel between position and momentum, it’s common practice in quantum physics to write the momentum space wavefunction as a function of $p$ rather than of $k$, and to rescale the definition of the Fourier transform by a factor of $\sqrt{\hbar}$ in the process:

$$ \tilde{ \psi}(p) = \frac{1}{\sqrt{2\pi \hbar}} \int\limits_{-\infty}^\infty \mathrm{d}x ~e^{-ipx/\hbar}\psi(x). $$

Aside from that extra factor of $\sqrt{\hbar}$, all we’ve done here is replace the variable $k$ with $p = \hbar k$:

$$ \tilde{\psi}(p) = \frac{1}{\sqrt{\hbar}} \hat{\psi}(k = p/\hbar). $$

The $\sqrt{\hbar}$ is included so that

$$ |\hat{\psi}(k)|^2 \mathrm{d}k = \left|\sqrt{\hbar} ~\tilde{\psi}(p) \right|^2 \frac{\mathrm{ d}p}{\hbar} = |\tilde{\psi}(p)|^2 \mathrm{d}p. $$

That way, the momentum probability function is $\mathrm{Prob}(p) = |\tilde{\psi}(p)|^2\mathrm{d}p$, just as the position probability function is $\mathrm{Prob}(x) = |\psi(x)|^2 \mathrm{d}x$.

On the other hand, it’s also common practice to simply work in units where $\hbar \to 1$, and then the distinction between $\hat{\psi}$ and $\tilde{\psi}$ disappears.

Anyway, I think we’d better take a look at a concrete example at this point to get comfortable working with the Fourier transform. And it’s also going to lead us to a sneak peak at the uncertainty principle.

An Example

Let’s go back to our earlier wavefunction that described a particle localized inside a region of space around the origin, let’s say from $-a$ to $a$:

$$ \psi(x)=\begin{cases}\displaystyle\frac{1}{\sqrt{2a}} & |x| < a\\ \displaystyle 0 &|x| > a\end{cases} $$

I’ve set the height to $1/\sqrt{2a}$ in order to ensure that the square of the wavefunction has got an area of one underneath it, since that’s the total probability of finding the particle anywhere.

With this wavefunction, the particle has an equal chance of being found at any point between $-a$ and $a$, and zero probability anywhere else.

Now let’s see what the momentum space wavefunction looks like by taking the Fourier transform:

$$ \hat{\psi}(k) = \frac{1}{\sqrt{2\pi}} \int\limits_{-\infty}^\infty \mathrm{d}x~e^{-ikx} \psi(x). $$

The integral’s actually not too bad. First of all, we don’t have integrate from $-\infty$ to $\infty$ because the wavefunction is zero everywhere except in the window from $-a$ to $a$. And within that range, the wavefunction is a constant, $1/\sqrt{2a}$, so we can pull that straight outside of the integral:

$$ \hat{\psi}(k) = \frac{1}{\sqrt{2\pi}} \frac{1}{\sqrt{2a}}\int\limits_{-a}^a \mathrm{d}x~e^{-ikx} . $$

All that’s left is the integral of this exponential:

$$ \int\limits_{-a}^a\mathrm{d}x~e^{-ikx} =\frac{1}{-ik}(e^{-ika} - e^{ika}). $$

Plugging in Euler’s identity $e^{ika} = \cos(ka) + i \sin(ka)$ and $e^{-ika} = \cos(ka) - i \sin(ka)$, we can simplify that as $\sin(ka)$ times $2/k$.

Then we’re left with

$$ \hat\psi(k) = \frac{1}{\sqrt{\pi a} }\frac{\sin (ka)}{k}. $$

That’s the Fourier transform of this particular wavefunction $\psi(x)$:

Earlier, when we considered the periodic version of this wavefunction on a circle, the Fourier coefficients were a discrete list of data points. But now that we’ve graduated to an infinite domain, we have to sum over all the continuous values of $k$ along this curve.

It’s oscillatory, because of the factor of $\sin(ka)$, but notice that the amplitude decays because of the factor of $1/k$. That means that when you go to measure the momentum of the particle, you’re most likely to find it in the window between the first zeros of $\hat{\psi}(k)$, since that’s where $|\hat{\psi}(k)|^2$ is going to be dominated. Those zeros occur at $k = \pm \frac{\pi}{a}$, since that’s where $\sin(ka)$ vanishes.

In particular, notice that whereas the position of the particle is guaranteed to be found in a window of radius $a$ around $x = 0$, the momentum is most likely to be found in a window of radius of order $\frac{1}{a}$ around $p = 0$.

That’s our first glimpse of the uncertainty principle. To understand it better, let’s consider the two extreme limits: when $a$ is really tiny and when it’s huge.

When $a$ is very small, the window in position space where the particle hangs out becomes very narrow—and correspondingly very tall, because remember the total area under the square of the wavefunction is fixed to one.

Then when we go to measure the position of the particle, in this limit we’re guaranteed to find it very close to $x = 0$.

But what about when we measure the momentum? When $a$ is small we can apply the small-angle approximation for $\sin(ka)$, which means that the sine of $ka$ is very nearly equal to $ka$ again. Then the $k$’s cancel, and we’re simply left with a constant for the momentum space wavefunction,

$$ \hat\psi(k) \approx \sqrt{\frac{a}{\pi } }. $$

A small constant, because we’re looking at the limit where $a$ itself is very small.

What’s happened is that by shrinking $a$ down so that the position space wavefunction becomes a very narrow spike, the momentum space wavefunction inversely gets stretched out and looks almost flat. It’ll eventually decay away to zero at a big value of $k$ of order $1/a$, but in between it’s essentially constant over a huge range of wavenumbers.

And that means that when we go to measure the momentum of the particle, we have an equal chance of getting any of those values of $k$, times $\hbar$, since the probability function is likewise a constant, independent of $k$.

In other words, by pinning down the location of the particle in space, we haven’t got a clue what value we’ll find when we measure the momentum: we could get any number $\hbar k$ with equal probability.

You can guess what happens in the opposite limit, when $a$ is very large, meaning that the particle has room to spread out across a big region in position space. Now it’s the momentum space wavefunction that develops a very tall spike at $k = 0$, and quickly decays away on either side:

This time, we can predict the momentum very precisely. But we have no idea what position we’ll find the particle at when we go to measure it.

This is a manifestation of the uncertainty principle. The better we can pin down the particle’s position in space, the less we can say about what its momentum will be, and vice-versa—if we know the momentum precisely, the position could be anywhere at all. This is a very general feature at the heart of quantum mechanics, and here we’ve seen how it emerges inevitably from the Fourier transform.