How Einstein Uncovered the Path a Particle Traces Through Spacetime
In this physics mini lesson, we're going to continue our discussion of the principle of least action, following up on the lessons about the action in Newtonian mechanics and in special relativity. This time, we'll talk about the action for a particle in general relativity, Einstein's theory of gravity. We'll write down the action for a particle traveling through spacetime, and see how the particle is forced to traverse a very special kind of curve called a geodesic.
The basic idea of Einstein's theory is that a massive object like a star warps the geometry of spacetime around it. Then, according to Einstein, something like a planet traveling nearby doesn't really experience a gravitational force at all; it just keeps moving along the straightest and shortest path that it can through this curved geometry. And that's what a geodesic is: the straightest and shortest possible path through a curved space.
In special relativity, we learned how, as a particle travels around, it traces out a path through spacetime called the worldline of the particle. And then we set the action to be proportional to the length of the worldline, $S = -mc\int ds$, where the line element $ds^2 = c^2dt^2 - dx^2 - dy^2 - dz^2$ is given by the Minkowski metric.
This Minkowski metric tells us about the geometry of flat spacetime. So if we had a particle soaring through empty outer space, far away from any stars or other objects, it would travel along a straight line through this flat spacetime. But a few years after publishing his special theory of relativity, Einstein came back for the one-two punch and generalized his geometric framework for the universe by explaining the geometric origins of gravity. It's called general relativity, and it's probably the most beautiful physical theory that humans have ever written down.
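Gravity is different from the other forces that we encounter. From Newton's law $F = ma$ and the weight $F = mg$ of an object near the surface of the Earth, the mass cancels out, and every object falls with the same acceleration $a = g$.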
Gravity is therefore universal; it affects all particles in the same way regardless of their mass. Einstein reasoned that we therefore shouldn't think of gravity as a force at all, but as a feature of the background spacetime on which particles move, and which subsequently affects all particles in the same way.
A particle in a gravitational field isn't being accelerated at all; it's just traveling along on its merry way. It's the spacetime around the particle that has changed.
This is what led Einstein to the idea that gravity could be attributed to the shape of spacetime. Like I mentioned at the top, the gist is that the presence of a massive object like a star warps the spacetime around it, deforming it from the flat Minkowski spacetime of special relativity into a curved spacetime. Then a particle (or planet) passing nearby still does its best to keep traveling along the straightest and shortest line that it can, but now it's tracing out a path in the curved geometry. These paths are the geodesics.
In particular, the action for a particle in general relativity is still going to be given by the same formula we wrote down before: the length of the worldline. Only now we need to replace the flat-space Minkowski metric with the curved metric of the warped spacetime.
I'm going to tell you about how all this works in a little more detail. This is a pretty advanced subject though—physics students usually take their first general relativity class at the end of college or the beginning of grad school. But the ideas are so beautiful that I think it's definitely worth exploring a bit even if you're more of a beginner. So if you are a beginner, don't sweat the details of the equations too much—and definitely don't be scared away by them. If you keep studying physics then they'll make sense in time. For now I hope you'll at least come away with an appreciation for a few of the big ideas of general relativity, and the transformative way that Einstein reshaped the way we look at the universe.
Since general relativity is, like the name implies, a generalization of special relativity, let's start by reviewing what we learned about the action for a particle in special relativity last time. And we'll also introduce some new notation that will make the generalization from special relativity to general relativity more straightforward.
When we went to compute the length of the worldline of a particle in special relativity, we were confronted with the fact that the Minkowski metric
where
The length of the worldline is maximized along a straight line through spacetime—that it's a maximum instead of a minimum is one of those peculiar features of the Minkowski metric. That's why in the twin paradox, the twin who stays home on Earth winds up older than the twin who flies around outer space in a rocket ship before coming home. The worldline for the twin sitting at home is a straight line, and so the most time has elapsed on their watch. The twin in the rocket ship followed a curvy worldline through spacetime. So even though they begin and end at the same event, less proper time has elapsed on the rocket ship twin's watch, and when they get home they're younger.
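To put some numbers on the twin paradox, here's a minimal sketch that adds up the proper time along two piecewise-straight worldlines. The specific trip (10 years of Earth time, a rocket moving at 0.8c out and back), the units with c = 1, and the helper name `proper_time` are all assumptions made up for illustration.

```python
import math

C = 1.0  # speed of light, in units of light-years per year (assumed units)

def proper_time(events):
    """Sum the proper time along a piecewise-straight worldline.

    `events` is a list of (t, x) points. Each straight segment contributes
    sqrt(dt^2 - dx^2 / c^2), which assumes the segment is timelike.
    """
    total = 0.0
    for (t0, x0), (t1, x1) in zip(events, events[1:]):
        dt, dx = t1 - t0, x1 - x0
        total += math.sqrt(dt**2 - (dx / C) ** 2)
    return total

# Stay-at-home twin: a straight worldline from t = 0 to t = 10 years.
home = [(0.0, 0.0), (10.0, 0.0)]
# Traveling twin: out at 0.8c for 5 years of Earth time, then back at 0.8c.
rocket = [(0.0, 0.0), (5.0, 4.0), (10.0, 0.0)]

tau_home = proper_time(home)
tau_rocket = proper_time(rocket)
print(tau_home, tau_rocket)  # 10.0 6.0
```

Both worldlines begin and end at the same events, but the straight one racks up 10 years of proper time while the bent one only gets 6.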
That led us to identify the action for a particle in special relativity with the length of the worldline, up to some factors: $$S = -mc\int ds.$$
The
Now we want to extend this to general relativity. In fact, we don't have to change our action formula at all. The particle is still going to follow the straightest and "shortest" path through spacetime that it can—where again "shortest" really means the maximum proper time. The difference is that the Minkowski metric that describes flat spacetime gets replaced with the curved metric of a spacetime that's been warped by the presence of something like a star.
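To describe a curved metric, it's convenient to introduce some new notation. Let's write the spacetime coordinates of the particle as $$x^\mu = (x^0, x^1, x^2, x^3) = (ct, x, y, z).$$
Note that those superscripts are labels, not exponents. Likewise, we can write the displacement vector as $$dx^\mu = (c\,dt, dx, dy, dz).$$
Next let's define a $4\times 4$ matrix $$\eta_{\mu\nu} = \mathrm{diag}(1, -1, -1, -1).$$
So in other words, $\eta_{00} = 1$, $\eta_{11} = \eta_{22} = \eta_{33} = -1$, and all the off-diagonal entries are zero.
Now we can write the Minkowski metric in a nice and compact form: $$ds^2 = \sum_{\mu=0}^{3}\sum_{\nu=0}^{3}\eta_{\mu\nu}\,dx^\mu dx^\nu.$$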
All this notation might seem like overkill, since the Minkowski metric isn't all that complicated to begin with. But it's going to be very convenient for the generalization to curved spacetime in a minute.
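If it helps to see the index notation as a computation, here's a small sketch of the double sum over the two indices. It assumes the sign convention where the time component of the metric is +1 and the spatial components are -1, with the zeroth coordinate equal to c times t; the names `ETA` and `interval_squared` are just for illustration.

```python
# Minkowski metric eta_{mu nu}, assuming signature (+, -, -, -) and x^0 = c t.
ETA = [
    [1, 0, 0, 0],
    [0, -1, 0, 0],
    [0, 0, -1, 0],
    [0, 0, 0, -1],
]

def interval_squared(dx, metric=ETA):
    """Return ds^2, the sum over mu and nu of metric[mu][nu] * dx[mu] * dx[nu]."""
    return sum(
        metric[mu][nu] * dx[mu] * dx[nu]
        for mu in range(4)
        for nu in range(4)
    )

# A displacement with c*dt = 5 and spatial part (3, 0, 0).
dx = [5, 3, 0, 0]
ds2 = interval_squared(dx)
print(ds2)  # 5^2 - 3^2 = 16
```

Writing the metric as a matrix means that swapping in a different geometry later only requires changing the entries of the matrix, not the formula.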
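With our new notation, we can write the length of the worldline like this: $$\int ds = \int\sqrt{\eta_{\mu\nu}\,dx^\mu dx^\nu}.$$
Notice that I didn't write the summation symbol $\sum$ here. That's the Einstein summation convention: a repeated index, like $\mu$ and $\nu$ above, is understood to be summed over.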
Now remember, we're evaluating this integral along the worldline that the particle traces out through spacetime. We can specify the worldline by giving its coordinates
Let's multiply and divide the integrand by
For example, if we pick
just like before. (I'm again dropping the
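This notation makes it really straightforward to go from the flat spacetime of special relativity to the curved spacetime of general relativity. We just replace the constant matrix $\eta_{\mu\nu}$ with a matrix $g_{\mu\nu}(x)$ whose entries can vary from point to point in spacetime: $$ds^2 = g_{\mu\nu}(x)\,dx^\mu dx^\nu.$$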
In general, this is going to be the metric of a curved space. Roughly, the reason is that the coefficients
So what led Einstein to think that gravity is related to the curvature of spacetime? Like I briefly mentioned in the introduction, the remarkable feature of gravity is that it's universal: it affects all particles in the same way, regardless of their mass. Galileo demonstrated this long ago for projectiles on Earth, supposedly by dropping balls of different masses from the top of the Leaning Tower of Pisa. They were all accelerated downward at the same rate and hit the ground at the same time, regardless of their mass.
So on Earth, we observe that the weight of an object is
and once again the mass
The fact that gravity acts on all particles in the same way made Einstein suspect that it shouldn't really be attributed to a force at all in the sense of Newton's $F = ma$.
The conceptual framework here is very similar to electromagnetism, which you may be more familiar with. Electric charges and currents create electric and magnetic fields according to Maxwell's equations, which then influence the motion of charged particles according to the Lorentz force law, $\mathbf{F} = q(\mathbf{E} + \mathbf{v}\times\mathbf{B})$.
The way that massive objects warp the shape of spacetime is described mathematically by what are called "Einstein's field equations." They're the analog of Maxwell's equations for electromagnetism. I'm not going to get into the details of those equations right now, but the point is that if somebody hands us some distribution of mass like a big star, then we can try to solve Einstein's equations to figure out the curved metric
The action is just like we wrote down before, only this time we need to compute the length of the worldline using the curved metric: $$S = -mc\int\sqrt{g_{\mu\nu}\,dx^\mu dx^\nu}.$$
Again, the conceptual idea is more important than the detailed equation here: up to some factors, the action is just equal to the length of the particle's worldline through spacetime, which has been warped by the presence of e.g. a star. Then to minimize the action, the particle will take the shortest path that it can through spacetime—or more precisely it takes the path of maximum proper time.
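Here's a rough numerical sketch of that idea: chop a worldline into short segments, measure each one with the metric at its midpoint, and add them up. Everything specific in it is an assumption for illustration, including the 1+1-dimensional setup, the units with c = 1, the prefactor of -mc (the usual textbook choice), and especially `toy_curved`, which is a made-up position-dependent metric and not a solution of Einstein's equations.

```python
import math

def worldline_length(path, metric):
    """Approximate the integral of sqrt(g_{mu nu} dx^mu dx^nu) along a path.

    `path` is a list of events (tuples of coordinates) and `metric(x)` returns
    the matrix g_{mu nu} at the event x. Each segment uses the metric at its
    midpoint and is assumed to be timelike (positive ds^2).
    """
    total = 0.0
    for a, b in zip(path, path[1:]):
        mid = [(p + q) / 2 for p, q in zip(a, b)]
        dx = [q - p for p, q in zip(a, b)]
        g = metric(mid)
        n = len(dx)
        ds2 = sum(g[m][k] * dx[m] * dx[k] for m in range(n) for k in range(n))
        total += math.sqrt(ds2)
    return total

def flat(x):
    """Minkowski metric in 1+1 dimensions, coordinates (c t, x)."""
    return [[1.0, 0.0], [0.0, -1.0]]

def toy_curved(x):
    """Hypothetical position-dependent metric, for illustration only."""
    return [[1.0 + 0.1 * x[1], 0.0], [0.0, -1.0]]

def action(path, metric, m=1.0, c=1.0):
    """Action as -m c times the worldline length (assumed prefactor)."""
    return -m * c * worldline_length(path, metric)

at_rest = [(0.0, 3.0), (10.0, 3.0)]  # a particle sitting still at x = 3
print(worldline_length(at_rest, flat))        # 10.0
print(worldline_length(at_rest, toy_curved))  # 10 * sqrt(1.3), about 11.4
```

The same path has a different length depending on which metric measures it, and that's the only place the geometry enters the action.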
Like we learned in the previous lessons, to apply the principle of least action we take a little variation of the trajectory
stand for the integrand of the action. Then when we make the little variation of
The first term comes from the change in the
Now we integrate to get the change in the action, and we integrate by parts on the first term to pull out the common factor of
Since this is supposed to vanish for any variation
We're looking for an equation for
That's looking a little bit better—still complicated, but a little better. It would be even nicer if the right-hand-side vanished. And in fact we can make it vanish, by remembering that
So with this choice,
This, at last, is the geodesic equation. Back in Minkowski spacetime, where the metric is constant, the second term vanishes. Then we're left with the equation of a straight line: $$\frac{d^2x^\mu}{d\tau^2} = 0,$$ where $\tau$ is the proper time along the worldline.
But in a curved space, the second term introduces a deformation of the straight line. A geodesic is as straight as you can get in a curved spacetime. By adding any wiggles to the curve, we would increase its length—or, rather, decrease the proper time—and therefore it wouldn't be an extremal path anymore.
There's one more manipulation we should make to put the equation of motion into the standard form that people usually write the geodesic equation. Notice that, in the second term, the combination of
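The combination that appears here is called the Christoffel symbol, $$\Gamma^\mu_{\alpha\beta} = \frac{1}{2}g^{\mu\nu}\left(\partial_\alpha g_{\nu\beta} + \partial_\beta g_{\nu\alpha} - \partial_\nu g_{\alpha\beta}\right),$$ where $g^{\mu\nu}$ is the inverse of the metric.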
It's a very important object in the mathematics of a curved geometry, but for our purposes here we can just think of it as some matrix
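Anyway, at long last we wind up with the standard form of the geodesic equation, $$\frac{d^2x^\mu}{d\tau^2} + \Gamma^\mu_{\alpha\beta}\frac{dx^\alpha}{d\tau}\frac{dx^\beta}{d\tau} = 0.$$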
That calculation got a little hairy, which is why I didn't include it in the video itself. If you're new to all this curved geometry business, don't worry too much about the details of these equations for right now. You can learn to unpack them all later on if you're interested in properly studying GR.
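If you'd like to see a geodesic equation actually get solved, here's a sketch for the easiest curved space around: the surface of a unit sphere, with polar angle theta and azimuthal angle phi. It plugs the sphere's well-known Christoffel symbols into the standard form of the geodesic equation and steps forward with a fourth-order Runge-Kutta integrator. The starting point, initial velocity, step size, and function names are arbitrary choices for illustration.

```python
import math

def geodesic_rhs(state):
    """Right-hand side of the geodesic equation on the unit sphere.

    state = (theta, phi, dtheta, dphi). The nonzero Christoffel symbols are
    Gamma^theta_{phi phi} = -sin(theta) cos(theta) and
    Gamma^phi_{theta phi} = Gamma^phi_{phi theta} = cos(theta) / sin(theta).
    """
    theta, phi, dtheta, dphi = state
    ddtheta = math.sin(theta) * math.cos(theta) * dphi**2
    ddphi = -2.0 * (math.cos(theta) / math.sin(theta)) * dtheta * dphi
    return (dtheta, dphi, ddtheta, ddphi)

def rk4_step(state, h):
    """Advance the state by one Runge-Kutta (RK4) step of size h."""
    def shifted(s, k, f):
        return tuple(si + f * ki for si, ki in zip(s, k))
    k1 = geodesic_rhs(state)
    k2 = geodesic_rhs(shifted(state, k1, h / 2))
    k3 = geodesic_rhs(shifted(state, k2, h / 2))
    k4 = geodesic_rhs(shifted(state, k3, h))
    return tuple(
        s + h / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )

def to_xyz(theta, phi):
    """Embed the point (theta, phi) of the unit sphere in 3D space."""
    return (
        math.sin(theta) * math.cos(phi),
        math.sin(theta) * math.sin(phi),
        math.cos(theta),
    )

# Start on the equator at (x, y, z) = (1, 0, 0), heading north-east.
state = (math.pi / 2, 0.0, -0.5, 1.0)
# Normal to the plane spanned by the initial position and velocity (0, 1, 0.5).
normal = (0.0, -0.5, 1.0)

max_dev = 0.0
for _ in range(5000):
    state = rk4_step(state, 0.001)
    r = to_xyz(state[0], state[1])
    max_dev = max(max_dev, abs(sum(n * x for n, x in zip(normal, r))))

print(max_dev)  # stays tiny: the curve never leaves the plane through the center
```

The check at the end measures how far the curve strays from the flat plane through the center of the sphere that contains the starting position and velocity. It doesn't stray, which means the solution is a great circle.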
The geodesic equation describes the motion of a free particle in the presence of some other much more massive objects that created the warped geometry. It's the generalization of Newton's second law for a free particle, $m\,d^2\mathbf{x}/dt^2 = 0$, to a curved spacetime.
The last thing I want to do is give you an intuitive idea of what geodesics are all about by describing what's probably the simplest example of a curved space that we can all picture: the surface of a sphere. Geodesics on a sphere aren't directly relevant to the geodesics in spacetime that we encounter in general relativity, but they'll at least give you something you can picture in your head to understand what a geodesic is.
So picture a sphere, and pick any two points on it. To find the geodesic between them, just draw an equator of the sphere (a great circle) that goes through the two endpoints. In other words, think of the sphere as an onion, and chop the onion in half so that your knife goes through both of the given points. Call one half the "northern" hemisphere and the other the "southern" hemisphere. The cut you made is along the "equator", and it defines a geodesic between the two points (two, actually, one going the short way around and the other the long way).
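As a quick sanity check that the great circle really is the short route, here's a sketch comparing it against the tempting alternative of traveling along a line of constant latitude. The two points (both at 60 degrees latitude, 90 degrees of longitude apart), the unit radius, and the function names are assumptions chosen for illustration.

```python
import math

def great_circle_distance(lat1, lon1, lat2, lon2, radius=1.0):
    """Length of the shorter great-circle arc between two points (radians)."""
    # Central angle from the spherical law of cosines, clamped against
    # rounding errors that could push the cosine slightly outside [-1, 1].
    cos_angle = (
        math.sin(lat1) * math.sin(lat2)
        + math.cos(lat1) * math.cos(lat2) * math.cos(lon2 - lon1)
    )
    return radius * math.acos(max(-1.0, min(1.0, cos_angle)))

def along_latitude_distance(lat, lon1, lon2, radius=1.0):
    """Length of the path that stays on a circle of constant latitude."""
    return radius * math.cos(lat) * abs(lon2 - lon1)

lat = math.radians(60.0)
lon_a, lon_b = 0.0, math.radians(90.0)

geodesic = great_circle_distance(lat, lon_a, lat, lon_b)
detour = along_latitude_distance(lat, lon_a, lon_b)
long_way = 2 * math.pi - geodesic  # the same great circle, the other way around

print(geodesic, detour, long_way)
```

The great-circle arc comes out shorter than the constant-latitude path, which is why long-haul flights between cities at similar latitudes curve toward the pole. The `long_way` value is the second geodesic mentioned above: it's still as straight as possible, just not the shortest.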
Okay, that was a very quick introduction to a bunch of very challenging, but also hopefully very interesting ideas. So let me quickly summarize the key things we learned about.
Spacetime is the stage on which physical processes play out, and Einstein's theory of relativity might better be called the theory of spacetime, because it tells us how to understand the structure of spacetime. It's a framework for doing physics, and we can build on top of it additional features like particles and forces and fields.
Free particles basically travel along the straightest and shortest paths through spacetime that they can, with the caveat that, in spacetime, "shortest" actually means maximizing the proper time, which is the time that's ticked off on a watch that's strapped to the particle.
In special relativity, we ignore the effect of gravity (or, at least, we assume that it's weak). Then spacetime is flat, and a free particle literally travels along a straight line.
General relativity builds gravity into the structure of spacetime by warping the metric into a curved geometry. The way that works is governed by Einstein's equations, which we didn't talk much about here. Then a free particle follows the next-best-thing to a straight line: what we called a geodesic.
The action for a free particle in either special relativity or general relativity is the same: it's simply equal to the length of the worldline that the particle traces out as it moves through spacetime, up to some constant factors. Then the principle of least action says that the particle indeed follows the shortest path that it can in getting from one point to another.