This maths in this Section is non-examinable and is included just to fill in the details for those who want it. The physical concepts about scalar fields in cosmology, which are examinable and will be discussed in class, are recapped in the summary section at the end.
When we make the transition from SR to GR, the Minkowski metric is promoted to a dynamical tensor field, . In this sense, GR is an example of a classical field theory. Working in field theoretic language is helpful for theoretical derivations -- it allows a powerful sort of meta-analysis of the properties of equations2121 21 See, for example, the astounding Noether’s theorem — one of the most profound results in all of physics — which we previously mentioned in the context of why energy conservation does not apply to photons in an expanding universe; https://en.wikipedia.org/wiki/Noether%27s_theorem.. But even more importantly, understanding classical field theory is an essential pre-requisite for a working understanding of the mechanism thought to be responsible for cosmic inflation.
So this Section is a crash course in classical field theory. For a more complete overview of the underlying theory try the courses MATH6202 or PHAS3424. We will not discuss quantum fields in this chapter, though we will touch on them in the second part of the course when the origin of primordial perturbations is discussed.
In Section 7 we recapped the Euler-Lagrange equations to see how they simplify calculations for geodesics. To go further, we need just a bit more formal development to see how the equations work for fields, as opposed to single particles.
We first replace the single coordinate by a set of spacetime-dependent fields , and the action becomes a functional of these fields. Such fields could be, for example, electric or magnetic fields, or gravitational fields, or scalar fields (like the Higgs field) and so on… in any case, labels individual fields. A functional is a function of an infinite number of variables – in this case, the values of a field at each point in spacetime.
The Lagrangian can now be expressed as an integral over the space of a Lagrangian density, , which is a function of the fields and their spacetime derivatives :
(217) |
Then the action becomes,
(218) |
We should note in passing that, if we’re to have a hope of building a coordinate-independent theory, we need to make sure that is independent of coordinate system. We will adopt this as a requirement.
The Euler-Lagrange equations, just like when we considered them for individual particles, come from requiring that be invariant under small variations – but this time around, variations of the field:
(219) |
The expression for variation in is the derivative of the variation of , just as we saw in Exercise 7. Since is assumed to be small, we can Taylor-expand the Lagrangian under this variation,
(220) | |||||
Correspondingly, the action goes to , with
(221) |
As before, we now argue that since this is to be true for any possible change , the term multiplying must be zero everywhere:
(223) |
Before moving on, let’s introduce a slightly more compact notation for this kind of argument. Namely, let be the functional derivative of a functional with respect to a function , defined to satisfy
(224) |
The actual function denoted by has to be extracted by the exact same series of manipulations that we’ve explored above, so there’s nothing actually new here. It’s just that, for example, saying “” will be a helpful short-hand for saying “ for all possible small changes to ”.
In addition to all the benefits of the Lagrangian formulation for particle mechanics that we discussed above, in field theory we also get a unique definition for the energy-momentum tensor. This is essential to have in hand as we move from familiar content for the universe (matter, radiation) on to more abstract content required for study of the early Universe and inflation.
A scalar field is simply an association of a single number with every point in spacetime. To turn this into a physical theory, we need to specify an equation of motion for the field – or, equivalently, a Lagrangian. We’ll go right ahead and start by writing down a Lagrangian:
(225) |
The equations of motion for this field are
(226) |
where the box operator (also known as the d’Alembert operator) is defined by .
What on earth does it mean? First, set . Then obeys a wave equation. Recalling that in this course, the waves will propagate at the speed of light. You can see this explicitly by substituting the trial wave solution .
So, localized features in the field travel along and start to look rather like a quantum particle (more in the exercise below).
Alternatively, imagine the field starts out completely homogeneous. Then, (and it remains zero for all time). All we have, at each point, is . This equation is also familiar; for example, if we make the potential we just have a ‘mass on a spring’ which oscillates back and forth forever. The only difference is that is an abstract number, not an actual position. This is a useful analogy: quite often when discussing inflation we will talk about things like “balls rolling down a hill”. The thing to remember is that in reality there is no ball, no rolling, and no hill! There is just an abstract mean field value represented by the ball’s height, an abstract rate of change of that value represented by the motion of the ball, and an abstract potential represented by the hill.
The result from the exercise above is not a coincidence – we’re seeing that relativistic particles can be understood as oscillations in quantum fields. We’ll touch on this later but the detail is beyond the scope of this course (see e.g. PHAS0073 instead). Note that in the limit – when the fluctuations in the field are on very long wavelengths – we get , i.e. we automatically revert to the homogeneous, coherent oscillator picture outlined just above the exercise. This sort of coherent behaviour of a scalar field (suitably modified for expanding spacetime) will be our main focus for cosmology.
Before looking at cosmological implications, we need to generalise the field theory framework into curved space. One might imagine that we can use the equivalence principle to promote the fixed metric of Minkowski space into a general metric :
(228) |
Sadly, it’s not true. The problem is that is dependent on our choice of coordinates. For example, suppose we transform coordinates such that . This isn’t permitted in special relativity, but in general relativity we have to allow it – remember the whole construction is about keeping physics invariant even in different coordinate systems. If is to have an actual physical meaning, it has to be invariant under this sort of transformation.
The integrand is correctly invariant under our trial transformation, because the component of the metric picks up a factor that cancels against a attached to each of the s. But the integral still changes because . This needs to be cancelled off, and it turns out the way to do it in general is to write:
(229) |
where is the determinant of . The point is, this determinant exactly cancels the effect of coordinate transformations on the measure, as we’ll now see…
Of course we’ve just shown it works for a simple, specific coordinate transformation but it’s not too much extra work to show that it actually works for any coordinate transformation (see Carroll for help with this if you want to give it a go)2222 22 It also turns out that measure theory gives you a more fundamental justification for why has to be the correct way to integrate in a curved space; see https://en.wikipedia.org/wiki/Volume_form..
Following the arguments given above for the specific case, it’s correct to generalise and say that actions in curved space look like this
(230) |
where is a scalar (i.e. invariant under coordinate transformations). Here, we don’t consider to be one of the fields because we have to treat it on a different footing (later). So in the scalar field example, there is just and no other fields to consider.
What do the Euler-Lagrange equations now look like? Well, there’s nothing special about this new expression, so all the old arguments go through and the correct equations of motion for the field are:
(231) |
We’ve derived this equation from a coordinate-invariant expression so it’s got to be coordinate-invariant itself. Knowing that, we can just throw caution to the wind and have a guess at what the above statement really means:
(232) |
This is a correct guess. The technical details of why it works out are unnecessary for our future purposes, but if you’re interested, the next exercise guides you through it.
Now we can apply the Euler-Lagrange equations of motion in curved space, equation (232), to the action (229) to show that
(234) |
where now
(235) |
It’s all just the same as in flat space, but with partial derivatives replaced by covariant derivatives – a direct consequence of the equivalence principle, of course. It really couldn’t have worked out any other way.
We have developed the machinary to work out how fields behave within a curved spacetime – but now we have to recognise the fact that the metric, , is itself a field (i.e. a function of spacetime position). What scalars can we make out of the metric to serve as a Lagrangian? We know that the metric can be set equal to the Minkowski metric () and its derivatives set to zero at any one point (remember Exercise 4). Consequently any non-trivial, coordinate-invariant scalar must involve at least second derivatives of the metric.
We have already encountered the Ricci scalar. It turns out to be the only independent scalar constructed from the metric, which is no higher than second order in its derivatives. The mathematician David Hilbert figured that this was the simplest possible choice for a Lagrangian, and proposed that the following action might be appropriate for general relativity:
(236) |
This turns out to be a correct guess (as we will sketch out below). In some ways it gives a very natural explanation for why Einstein’s equations look the way they do – because their action is as simple as it possibly could be.
How can we check that the Einstein-Hilbert action correctly describes GR? Since is a function of , the approach that we so carefully developed above – working with and just applying the “covariantised” Euler-Lagrange equations (232) – does not work. We have to work with the “raw” Lagrangian (231) and replace by (i.e. we treat each component of the metric tensor as a separate field when deriving the equations of motion).
This becomes a bit of a dreary exercise in algebra, so (although I certainly encourage you to go through the derivation once in your life; see Exercise 5) let’s for now just state the result:
(237) |
meaning that at stationary points of with respect to variations in we have:
(238) |
which has recovered Einstein’s equation in a vacuum ().
To generalise to Einstein’s equation coupled to matter, we just need to include the relevant matter terms in the action. Consider
(239) |
where is the matter action, and we have pre-normalized the gravitational action to get the right answer. (Note our normalization includes a minus sign in front of , which is not present in all textbooks – it depends on the metric signature.) The calculation for the revised action goes through just by linearity:
(240) |
from which one recovers the Einstein equation provided that the energy-momentum tensor can be written
(241) |
(Note again the different sign to some textbooks, again due to our choice of metric signature.)
Consider again the action (229) for a scalar field. Now vary this action with respect to , not :
(242) |
we obtain from (241) the energy-momentum tensor for a scalar field,
(243) |
Here, it is worth pointing out that you will find different sign conventions to (243) in the cosmology literature. This can be traced to the fact that most of the cosmology literature uses the opposite metric signature to ours: .
The above is all pretty technical, so let’s step back and see the practical consequences. So far we’ve used the Euler-Lagrange formalism to define what is meant by the energy-momentum tensor for a scalar field. Because it has been derived from a coherent, self-consistent framework, we can be assured that (for example) for our definition, making it properly compatible with Einstein’s theory.
The single most important result for our purposes is Eq. (243) – the correct energy-momentum tensor for a scalar field. Let’s simplify by writing it out in component form in the case that the field is homogeneous but time-varying (i.e. , but ):
(245) |
Compare this with Eq. (91), and we can read off the density and pressure of a scalar field in a homogeneous universe:
(246) | ||||
(247) |
The ‘kinetic energy’ term is named by analogy with the corresponding term if we had just a single particle with position . But let’s be clear: is not a position, it’s a field value. It’s important to realise that the existence of kinetic energy in the field does not imply the field configuration represents a moving particle. In that light, the terminology is actually quite unfortunate. But it’s the convention, so we’re stuck with it.
The resulting equation of state
(248) |
shows that a scalar field can lead to negative pressure () and accelerated expansion () if the potential energy dominates over the kinetic energy . In the next chapter, we will look at this phenomenology in detail.
As stated at the start of the chapter, the mathematical derivations are non-examinable. You may be asked to briefly explain the physics, as we will apply it to inflation in the next chapter. This can be summarised as follows:
A field is just an assignment of a number, vector, or other quantity to every point in spacetime; often by ‘field’ we also have in mind that there are some equations of motion that predict its evolution, i.e. the future values of the field can be predicted from the past values of the fields.
The equations governing classical evolution of fields can be derived from the Euler-Lagrange equations applied to suitable actions.
Deriving things that way, rather than just writing down an evolution equation, helps us analyse the classical behaviour of the field – and is also essential for quantising the field (Chapter 14).
In cosmology, we are usually most interested in scalar fields (i.e. fields which assign a single number to each point in spacetime), because they may be responsible for powering the accelerated expansion of inflation and/or dark energy.
Even in flat space, with the simplest possible non-trivial Lagrangian for a scalar field, the field can exhibit a range of behaviours from coherent oscillations to particle-like excitations. Within cosmology, it is normally coherent behaviour of the field, where the field takes one value throughout space, that is of most interest.
When describing how this field changes over time, we quite often refer to analogies like “a ball rolling down a hill”. But there is no physical ball or hill! The horizontal position of the ball represents the field value (the same throughout space) and the hill is an abstract potential which you can think of as the ball’s height.
The metric can also be thought of as a tensor field. The action for Einstein’s relativity is then remarkably simple, Eq. (236).
Using this action we can deduce the correct energy-momentum tensor for the fields within the Universe;
If the potential dominates over the kinetic term, we find a negative equation of state which we previously concluded (Section 2) will give rise to accelerated expansion.