We present a short account of our work to provide quantum electrodynamics (QED) with a product picture. We aim to complement the longer exposition in a recent paper in Foundations of Physics and to help to make that work more accessible. The product picture is a formulation of QED, equivalent to standard Coulomb gauge QED, in which the Hilbert space arises as (a certain physical subspace of) a product of a Hilbert space for the electromagnetic field and a Hilbert space for charged matter (i.e., the Dirac field) and the Hamiltonian arises as the sum of an electromagnetic Hamiltonian, a charged matter Hamiltonian, and an interaction term. (The Coulomb gauge formulation of QED is not a product picture because, in it, the longitudinal part of the electromagnetic field is made out of charged matter operators.) We also recall a “Contradictory Commutator Theorem” for QED, which exposes flaws in previous attempts at temporal gauge quantization of QED, and we explain how our product picture appears to offer a way to overcome those flaws. Additionally, we discuss the extent to which that theorem may be generalized to Yang–Mills fields. We also develop a product picture for nonrelativistic charged particles in interaction with the electromagnetic field and point out how this leads to a novel way of thinking about the theory of many nonrelativistic electrically charged particles with Coulomb interactions. In an afterword, we explain how the provision of a product picture for QED gives hope that one will be able likewise to have a product picture for (Yang Mills and for) quantum gravity—the latter being needed to make sense of the author's matter-gravity entanglement hypothesis. Also, we briefly discuss some similarities and differences between that hypothesis and its predictions and ideas of Roger Penrose related to a possible role of gravity in quantum state reduction and related to cosmological entropy.
I. INTRODUCTION: COULOMB GAUGE QED AND ITS AESTHETICALLY UNPLEASANT FEATURES
Let me begin by recalling the standard Coulomb gauge formulation for a charged Dirac field, ψ, in interaction with the quantum electromagnetic field. The Hamiltonian is
where
is the standard Coulomb action-at-a-distance electrostatic potential energy of the Dirac charge density
Here, e is the charge on the electron, A is the vector potential for the magnetic field satisfying the Coulomb gauge condition, , and is (minus) the transverse part (satisfying ) of the electric field.
This is to be supplemented with the commutation and anticommutation relations
while the commutators of with with , and of and with and vanish, and also the anticommutators of with and of with vanish.
While it leaves unaddressed many questions of mathematical rigor (and the related difficulties of renormalization), the Hamiltonian (1) together with the commutation and anticommutation relations (3) provides us with a suitable starting definition for what we know to be the correct theory of quantum electrodynamics (QED). For example, QED is introduced in this way in Steven Weinberg's textbook on quantum field theory.12
However, leaving aside questions of mathematical rigor, this Coulomb-gauge version of quantum electrodynamics suffers from a number of aesthetically unpleasant features. An obvious such unpleasant feature is that it makes implicit reference to a particular choice of Lorentz frame. (The way our product picture transforms under Lorentz transformations is an interesting question but we will not address it here.) However, leaving that aside too, we wish here to focus on the following:
is quartic in ψ and also nonlocal.
The commutation relations (3) also have a nonlocal piece in addition to the delta function on the right hand side. (One can see that such a term is necessary to be consistent with the transversality of and of the Coulomb gauge condition on A on the left hand side.)
Only the transverse part of the electric field participates in the dynamics. Instead of a longitudinal part, we have an action-at-a-distance force due to the potential . This is unsatisfactory because, for example, it suggests that we need to think of the energy stored in a capacitor as due to the work required to separate its charged plates or the work done against the electrostatic attraction of opposite charges to charge them up, whereas we would prefer (since Faraday!) to think it is stored in the (longitudinal) electric field between the plates!
- True, we can define the longitudinal electric field by , where(4)
which, by the way, is equivalent to imposing Gauss's law, , by definition! However, then, is made out of Dirac field operators and does not have an independent existence (as well as not participating in the dynamics).
- It would be nice if the Hilbert space, , could be written as the tensor product,(5)of a Hilbert space, , for the electromagnetic field and a Hilbert space, , for charged matter (i.e., for our Dirac field), and if also the QED Hamiltonian then arose in the form(6)
When such a statement is true of a quantum theory involving two systems in interaction, we say it has a product structure. However, the Coulomb-gauge formulation of QED does not have a “product structure” in this sense. It might appear to do so if were interpreted as the Hilbert space of the transverse part of the electromagnetic field. However, if we insist on all electromagnetic field operators (both transverse and longitudinal) acting on as operators of the form , where I denotes the identity on —which is what we really mean when we say that is “the Hilbert space for the electromagnetic field”—then this clearly does not hold since the longitudinal part of the electric field is made out of charged matter field operators.
To amplify on Point 5, what would be preferable (and as we shall discuss further in Sec. V not only for aesthetic reasons but also because it enables one to ask and answer new physical questions) would be if there were ways to quantize electrodynamics in which the Hilbert space took the product form (5) and in which the Hamiltonian took the product picture form (6), or more precisely, if it took the form
with no Coulomb potential term; if the electromagnetic field commutation relations took the form
with no nonlocal term; if was (minus) the full electric field operator (rather than just its transverse part); if Gauss's law, , held as a genuine operator equation (rather than by definition); and yet, if this new formulation was entirely equivalent to Coulomb gauge QED. Because after all, while we have complained that it is aesthetically unpleasant, we do know that it is physically correct!
I want next to show that (at a similarly mathematically non-rigorous level to that at which the Coulomb gauge theory is typically discussed) it is possible to have a formulation of QED, which we call the product picture, equivalent to the Coulomb gauge formulation, and with all these nice features. More precisely, the equivalence is between the Coulomb gauge formulation and our product picture theory restricted to a certain product picture physical subspace (PPPS) of a full (unphysical) product picture Hilbert space, which we call below the “augmented QED Hilbert space.”
As a preliminary to that, we conclude this introduction by recalling how the operators A and in Coulomb gauge QED may be represented on a Hilbert space. In terms of the usual (say, momentum-space) annihilation and creation operators, on the Fock space, over the one particle Hilbert space, , we may define
where is the transverse part, of and is the transverse part of .
It will be important to note here that we can think about the Hilbert space on which and act in two slightly different ways. Either we can think of it as the Fock space, , over the subspace, , consisting of transverse elements of or we can note that the full one particle Hilbert space can be written as a sum of its two orthogonal subspaces and consisting, respectively, of transverse and longitudinal vectors, and thus, the full Fock space, , is the tensor product, , and think of the subspace on which and act as consisting of the subspace , where is the vacuum vector in . With the latter point of view, we admit the possibility of longitudinal photons but keep them in their vacuum state.
With the latter viewpoint, we can think of the full representation space of QED (now incorporating the Dirac field too) as what we shall call the Coulomb gauge physical subspace (CGPS), , of the augmented QED Hilbert space, which is the name we shall give to .
II. A PRODUCT PICTURE FOR QED
To provide QED with a product picture, the first step is to define operators, and , on our augmented QED Hilbert space by
where is defined as in Eq. (9) and
This last definition may seem strange. Let us pause to notice that we could think of as the result of first defining an operator that we shall call by again taking its transverse part to be the same as the definition of in Eq. (9) and defining its longitudinal part, along similar lines, i.e., by
but then deleting the second term on the right hand side containing a creation operator and doubling the term on the right hand side containing an annihilation operator. Clearly, in consequence (and unlike and ), will fail to be (even formally) self-adjoint. However, it will still have the same commutation relations with all other operators (and in particular with ) that has and we shall soon see (see Theorem 1a below) that it will be (formally) self-adjoint on the appropriate “product picture physical subspace” of the augmented QED Hilbert space which we next define.
First, we define the (formally) unitary operator
where is given by Eq. (4), and then we define the product picture physical subspace (PPPS) to be the result of acting with U on the Coulomb gauge physical subspace, i.e., as . Let us notice here that all states in the PPPS (including the vacuum state, ) are entangled between charged matter and longitudinal photons!
We then have:
Theorem 1a. and [defined as in (7) with the above definitions for and ] map the PPPS to itself and are self-adjoint when restricted to it.
Theorem 1b. Also, the magnetic field operator, , maps the PPPS to itself. [Although the (as always, unphysical) vector potential, , does not.]
Theorem 2. on the PPPS is (unitarily) equivalent to on the CGPS.
Theorem 3. PPPS, , i.e., Gauss's law holds as an operator equation on the PPPS (in contrast to Coulomb gauge, where it held by definition).
We briefly sketch the proofs. For more details, see Ref. 8, Sec. 3.
Proofs of Theorems 3 and 1a. These follow easily after noting that
Proof of Theorem 1b. This follows immediately from the fact that is the same as acting on the transverse part of (which is the same thing as the Coulomb gauge A) and that, as one easily sees, this maps the PPPS to itself. (It is noteworthy that the longitudinal part of does not map the PPPS to itself but this is annihilated by .)
Proof of Theorem 2. First define
and notice that, restricted to the CGPS, .
Then, on the full augmented QED Hilbert space, if one calculates , one finds that it equals .
To do this calculation, it is helpful as a first step to notice that
where we define
Using these, one easily finds that is equal to
It is then not difficult to see (or see Ref. 8, Sec. 3) that this is equal to the of (7). (End of proofs.)
Let us also remark that one may think of as the appropriate counterpart to ψ in the product picture. In fact, we have the dictionary shown in Table I.
Coulomb gauge to product picture dictionary.
Quantity . | Coulomb gauge . | Product picture . |
---|---|---|
Hamiltonian | (or ) | |
Electric field | (or ) | |
Magnetic field | (or ) | |
Dirac field | ψ | |
Adjoint Dirac field | ||
Dirac electrical potential | ||
Vacuum state | (or ) | (entangled) |
Quantity . | Coulomb gauge . | Product picture . |
---|---|---|
Hamiltonian | (or ) | |
Electric field | (or ) | |
Magnetic field | (or ) | |
Dirac field | ψ | |
Adjoint Dirac field | ||
Dirac electrical potential | ||
Vacuum state | (or ) | (entangled) |
Finally, let us remark that there are two ways that we may obtain the unitary time evolution on the PPPS (which must both lead to the same answer). One way is to exponentiate (−i times) the self-adjoint restriction of to the PPPS on the PPPS. However, another way would be to first exponentiate (−i times) the non-self-adjoint on the augmented QED Hilbert space—obtaining a nonunitary operator—and then to restrict that nonunitary operator to the PPPS on which it will, of course, be unitary. In an approach to constructing QED in a mathematically fully meaningful way based on the product picture, the latter method may be technically advantageous, in part because it would enable us to separate the issue of defining the dynamics from the issue of assigning a mathematical meaning to the operator U of Eq. (10) in Sec. II, which defines the PPPS. This seems worthy of further exploration.
III. THE CONTRADICTORY COMMUTATOR THEOREM AND COMPARISON WITH TEMPORAL GAUGE QUANTIZATION
The Hamiltonian, (7), resembles the classical Hamiltonian appropriate to the temporal gauge condition (also known as the Weyl gauge condition). Several attempts have been made in the past to obtain a formulation of QED by “quantizing” that temporal gauge classical Hamiltonian—taking the commutation relations to be those of Eq. (8) but hitherto certain assumptions have been (sometimes tacitly) made that led to pathologies of one sort or another (see, for example, Ref. 3). For example, one could define in (8) as we have here but replace by the that we mentioned in Sec. II. However, one finds that any attempt to obtain a theory equivalent to Coulomb gauge QED with those choices would fail. The essence of what goes wrong can be summed up in what we shall call the Contradictory Commutator Theorem (see Ref. 8, Sec. 3.4 for more details).
The Contradictory Commutator (CC) Theorem. There can be no pair of 3-vector operators and on a Hilbert space such that
- and satisfy the canonical commutation relations(12)
and are each self-adjoint;
- For some vector, ()
for some operator-valued function of x, ρ.
ρ commutes with .
Proof. (a) Easily implies the equality
while (b), (c), and (d) imply that quantity on the left hand side of (13) is zero—a contradiction!
However, our product picture evades the conclusions of this seeming no-go theorem, as, of course, it must in view of Theorem 2. First let us notice that, in attempting to apply the CC Theorem to our product picture, there are two different interpretations we could make for the Hilbert space . We could identify it with the full augmented QED Hilbert space, but then the CC Theorem fails to apply because fails to be self-adjoint on . Alternatively, we could identify with the product picture physical subspace, and then will map to itself and be self-adjoint on it. However, will fail to map to itself! So, either way the Contradictory Commutator theorem fails (as it must) to rule out our product picture formulation of QED.
A. The question of whether and to what extent the Contradictory Commutator theorem generalizes to Yang–Mills
It is natural to ask to what extent the formulation and proof of the Contradictory Commutator Theorem generalize to the Yang–Mills theory [say, for a compact connected semisimple Lie group with Lie algebra generators, Tl, satisfying for totally antisymmetric structure constants flmn and ] and also to gravity in general relativity.
We content ourselves here with a discussion of the Yang–Mills theory (in flat spacetime). Adopting signature , we may take the classical Lagrangian density of pure Yang–Mills to be , where Fab is , where g is the Yang–Mills coupling constant and . Choosing temporal gauge where , the conjugate momentum, say πi, to Ai is , while the counterpart to the Gauss law constraint is
where the covariant derivative, , written out in full, is given by
For, say, a quark-like field (assumed, of course, to commute with the Yang–Mills fields) with values, , in the (finite dimensional) Hilbert space of (say) the fundamental representation of the Lie algebra, ρ is the operator on this Hilbert space given by the formula [cf. (2)]
The hypotheses of the CC theorem stated above for QED clearly have counterparts for Yang–Mills fields, which may be expressed with the same equations provided one makes two changes in their interpretation. First, we need to reinterpret and as Lie-algebra valued quantum (vector) fields where it is to be understood that (now identifying our Lie algebra with its adjoint representation) next to the δij, there is an unwritten identity operator (on the Lie algebra thought of as the representation vector space of the adjoint representation—see below for a more accurate statement) on the right hand side of the commutation relations in part (a).
Second, the equation of Part (c) needs, in light of (14), to be replaced by for a Lie-algebra valued ρ, which commutes with . (The equation is still assumed to hold for some in the Hilbert space.)
With this changed interpretation of its hypotheses to make them relevant to Yang–Mills fields, the question we want to ask, and will partly answer, is to what extent the statement and proof of the theorem do—or don't—go through as before. We will also comment on the relation between our work and the work in Refs. 1 and 3.
One big difference in the nonabelian case is that because of the nonlinearity of the Yang–Mills theory, one now has to deal with products of fields at the same point, notably, in the second term in (15). However, adopting the point of view of the first few pages of a paper1 of Dimock, one may proceed by formally manipulating expressions involving suitably smeared versions of our fields. To begin to explain this, let us momentarily backtrack and notice that we might have re-expressed the commutation relations (12) in the electromagnetic case in terms of the smeared fields, and , where u and are 3-vector valued say smooth compactly supported test functions, which are to be interpreted in terms of the formal equations,
whereupon the first commutator in Eq. (12) would have taken the form
where the symbol denotes the L2 inner product in the space of 3-vector fields.
Finally, let us remark that since the unsmeared equation (13) needed a δij on the right hand side due to the quantum vector fields being expressed as triples of quantum scalar fields in some coordinate system, the Hilbert space involved in that unsmeared version should have strictly been taken to be the result of taking the tensor product of the Hilbert space relevant to the smeared version with the (complexified) Hilbert space of triples of numbers with the inner product being the (complexified) dot product.
Returning to the Yang–Mills case, we proceed likewise but (following Ref. 1) now smearing with a test triple ui whose components take their values in the Lie algebra to obtain a smeared field , etc., where we think of the latter as “” (summed over i from 1 to 3) where the inner product now means the integral over of minus the Killing form of and ui (summed over i). [Similarly for .] Let us note, related to our above remark about the δij, that when we reinterpreted the hypotheses of our CC Theorem, written in their unsmeared version, to apply to Yang–Mills theories, then, strictly, we should have thought of the Hilbert space as the tensor product of the Hilbert space involved in the smeared version with the (again finite dimensional) Hilbert space consisting of the complexified Lie Algebra (thought of as a complex finite dimensional vector space) equipped with that inner product. The identity operator that we mentioned above should really have been regarded as the identity operator on that latter Hilbert space. (This is the more accurate statement about that identity operator that we promised to make above.)
Assuming that the standard Lie-Algebra identity applies also when some of X, Y, and Z are quantum operators, one easily sees formally (again as in Ref. 1 but adapted to our conventions) that the smeared quantity can be equated with [or alternatively with ]. Making use of this, one easily computes formally [now writing in place of , etc.] that [cf. Ref. 1, Eq. (8)]
This is clearly the Yang–Mills counterpart to the commutator (16). However, a crucial difference is that, unlike the right hand side of (16), the right hand side of (17) is no longer just a c-number, but there is also an additional term that involves the operator . So, when we take the expectation value of the above smeared commutator in a state vector, , in the appropriate Hilbert space, , the result will depend on , and we will no longer obtain an immediate contradiction as we did in the proof of the CC Theorem in the electromagnetic case. Instead, let us first shift our viewpoint slightly and notice that one way of restating the CC Theorem in that electromagnetic case is that, if (following Ref. 1) we define a physical state to be a state vector that satisfies , then the set of physical states is empty! If we now attempt to adapt the steps of that proof to the Yang–Mills case, by calculating the expectation value of the commutator in the left hand side of (17) in the two different ways [one way using property (a), and the other using properties (b), (c), and (d)], we will clearly find, now defining a physical state vector to be one satisfying , that if a state vector, , is physical, then we must have that
Before we consider the implications of this, let us first notice that, in the Yang–Mills case, there is also a non-trivial commutator between and [cf. Ref. 1, Eq. (9)],
from which, by a similar argument, we may conclude that, for a state vector, , to be physical, we must also have that, for all Lie-algebra valued smearing functions, w,
(Here, we have used the fact that, in a semisimple Lie Algebra, every element arises as the Lie bracket of two other elements.)
So, we have the striking result that, for a state vector to be physical, the expectation value of the field momentum , which (in components) is our quantized (, must vanish! On the contrary, returning to the implications of (18), we clearly, at least, have the immediate corollary that, for a state vector to be physical, the expectation value of must not vanish! More information would seem to be extractable from (18), but we will not attempt to do so here.
The aforementioned results certainly seem to suggest that, in the case of Yang–Mills, the set of physical state vectors (defined to be state vectors on which the Yang–Mills counterpart to Gauss's law in temporal gauge holds) is strangely restricted. However, this seems as far as one can easily go in directly generalizing our CC theorem proof for nonabelian Yang–Mills. Note though that, in the case of linearized (nonabelian) Yang–Mills, the right hand side of (17) clearly becomes a c-number, and so the proof goes through exactly as in the electromagnetic case. Thus, for linearized Yang–Mills, we may conclude that the set of physical states is empty!
Returning to full (nonlinear) Yang–Mills, in the later pages of Dimock's paper, defining J(h) to be the Gauss law operator smeared with a test function h, he finds that , and J form an infinite dimensional Lie Algebra. Formally, exponentiating that to obtain an infinite dimensional Lie group, he points out that the elements that formally correspond to may be understood as the generators of gauge transformations. The notion of “physical state” is then redefined to mean a state annihilated by all those elements. He then studies Hilbert space representations of this Lie Group and finds classes of representations, for which it turns out that the set of physical state vectors, defined in that way, is empty! However, using C* algebra techniques, he then finds a representation, which is a certain limit of one of those classes of representations in which it is not empty. Presumably, in the electromagnetic case, this coincides with the representation of the formal exponentiation of Eq. (12) that was found in Ref. 3, in which Gauss's law holds strongly but the fields and do not exist. (Rather only their formal exponentials exist.)
On the other hand, for the electromagnetic case, we found here and in Ref. 8 that, at least at our level of mathematical rigor, our product picture has the advantage of evading the conclusions of the CC Theorem and in it the fields and exist. Thus, it would seem to be of interest to find a suitable counterpart to our electromagnetic product picture both for (linearized and full) Yang–Mills theory and also (for reasons discussed further in Sec. V) at least for linearized quantum gravity and, as far as is possible, for full quantum gravity. It would also seem to be of interest to attempt to make these product pictures mathematically rigorous.
IV. A PRODUCT PICTURE FOR MAXWELL–SCHRÖDINGER THEORY AND THE NEW ALTERNATIVE UNDERSTANDING ENTAILED FOR THE NONRELATIVISTIC QUANTUM MECHANICS OF MANY CHARGED PARTICLES
There is a similar product picture in the quantum electrodynamics of many non-relativistic Schrödinger particles interacting with the EM field (for more details, see Ref. 8, Sec. 4). One has to treat the particles as finite radius balls with charge density ρ. The Hamiltonian
where
on the appropriate CGPS gets replaced by
on the PPPS, all of whose states are again entangled between our charged balls and longitudinal photons.
We also define by adding the same extra terms to that were added to in (11) and find that
and
etc.
In the usual approximation where we neglect terms that lead to radiative corrections [and suppress the in the Hilbert space], the Coulomb gauge formulation of this nonrelativistic version of QED gives us the familiar non-relativistic Schrödinger equation for systems of many charged particles. [With extended balls, it is natural to include the (finite) self-energies of the balls in and in E.] So, one might wonder how the product picture can work in the nonrelativistic limit since the Hamiltonian of Eq. (21) has no potential term! To understand this, let us look at the familiar example of the (spinless) hydrogen atom (for more details, see Ref. 8, Sec. 4.2). If Particle 1 is the proton, Particle 2 the electron, and the two-body wave function is say a product of one of the usual Coulomb gauge energy eigenfunctions of the relative coordinate with a wave packet in the center of mass coordinate of negligible energy, then in the usual Coulomb gauge description, we will have
In the product picture, this (on the CGPS) gets replaced by
(on the PPPS). We see that the binding energy is now understood to be the decrease in the energy of the longitudinal part of the electric field when the charged balls are closer to one another. The (entangled) quantum state, , schematically takes the form—when we think of as :
where is the coherent state of longitudinal photons, which corresponds to the classical electric field due to the presence of the proton at and the electron at .
For more information about exactly what is meant by coherent states of longitudinal photons, see Ref. 8 that gives an alternative account of the product picture formulation of QED, which takes the construction of such coherent states as its starting point. We also refer the reader to the paper8 for a discussion of a number of other topics including more about Gauss's law and the charged field commutator (see Sec. 3.3 there) and (in Sec. 4.3 there) about the reduced density operator of charged Schrödinger matter. We shall also have more to say about the latter topic in Sec. V.
V. AFTERWORD: THE RELEVANCE TO THE MATTER GRAVITY ENTANGLEMENT HYPOTHESIS AND COMMENTS ON THE RELATION WITH PENROSE'S IDEAS ABOUT COSMOLOGICAL ENTROPY AND STATE VECTOR REDUCTION
Aside from providing a new formulation of QED that is free from the aesthetically unpleasant features that we mentioned at the outset, and aside from offering the intriguing new way of thinking about the non-relativistic quantum mechanics of many charged particles that we discussed in Sec. IV, the product picture formulation of QED also permits us to ask some questions about QED that could not even be posed in the Coulomb gauge formulation. In particular, thanks to the fact that, in the product picture, we can write the Hilbert space, , as a tensor product of the form (5), it becomes meaningful to ask, for a (say pure) quantum state of QED—thought of now as the density operator, for —what is its reduced density operator on the Hilbert space (which we define to be the partial trace of σ over ). Similarly, we could ask what is the reduced density operator on the Hilbert space (which we define to be the partial trace of σ over ). Relatedly, we can ask, e.g., what is the entanglement entropy of between charged matter and the electromagnetic field. Here, we recall that because of the assumed purity of , the von Neumann entropy of and the von Neumann entropy of will be equal, and their equal value is deservedly named the charged matter-electromagnetic entanglement entropy of .
Aside from any possible interest in these questions in their own right, they are of interest as providing analogies to basic questions one asks on my matter gravity entanglement hypothesis (see Refs. 6 and 7 and references therein), which aims to help resolve a number of fundamental issues both about the foundations of quantum mechanics [in particular regarding the possibility of having an objective (observer-independent) notion of quantum state reduction] and also about the foundations of thermodynamics (in particular regarding the definition of entropy and the origin of the second law). This hypothesis was inspired partly by the discovery2 of the Hawking effect, which suggests that there may be new interconnections among quantum mechanics, gravity, and thermodynamics waiting to be discovered. It was also party inspired by Roger Penrose's writings (see, for example, Ref. 10) on the status of the notion of entropy in a cosmological context and the importance of understanding how the entropy of the universe evolves in time from the big bang to the late-time universe.
The idea of the matter-gravity entanglement hypothesis is that, to address such questions, a closed system must be understood in the context of a (possibly low-energy approximate) theory of quantum gravity, which is assumed, conservatively, to be formulated in terms of a total Hilbert space, say , which arises as a tensor product, , of a gravity Hilbert space and a matter Hilbert space . It is further assumed that the state of a closed system is always a pure state, described by a density operator of form for , and that (with a naïve notion of ‘time’ and from a Schrödinger picture point of view) this evolves unitarily in time. A fundamental process of decoherence (potentially offering an explanation of quantum state reduction) will then be expected to occur if we assume that the Hamiltonian generating our unitary time evolution (the quantum gravity Hamiltonian) arises in the following form [cf. Eq. (7)]:
and if we assume that the relevant density operator when we talk about the occurrence of decoherence is not itself (which will always be pure) but rather the reduced density operator . We then obtain a natural candidate for the physical entropy of a closed system by defining it to be the system's matter-gravity entanglement entropy (rather than the von Neumann entropy of which will always be zero). In other words we define it to be the von Neumann entropy of (or equally of ).
Indeed, suppose (now taking our closed system to be, say, the entire universe) we make the further assumption that the initial state was unentangled between matter and gravity (or perhaps just had a low degree of matter-gravity entanglement), then, for any of a wide range of interaction terms, , in (22), we would expect to get more and more mixed (thus providing us with a fundamental process of decoherence), and, concomitantly, we would expect the matter-gravity entanglement entropy (i.e., our model for the physical entropy of the universe) to increase monotonically with time, thus offering the possibility of an objective explanation for the second law.
Moreover, taking instead our closed system to be a model asymptotically flat universe containing a ball of matter that collapses to a black hole, and similarly assuming that the initial total state had a low degree of matter-gravity entanglement, we would again expect to get more and more mixed and the system's matter-gravity entanglement entropy to increase monotonically. Thus, we would reconcile “information loss” (which we interpret now as the reduced density operator getting more and more mixed) with an underlying unitary evolution of .
Thus, this very simple hypothesis already appears to give us a conceptual template capable of both explaining the second law and resolving the black hole information loss puzzle—which, in fact, is seen now just to be a special case of the second law.
However, the entire hypothesis is predicated on quantum gravity having (to use the terminology of our introduction) a product picture. And even granting quantum gravity (or at least an appropriate low-energy approximation of it which we might hope would be the suitable setting for discussing questions of decoherence and entropy) will turn out to be a conservative sort of (unitary) quantum theory, there might have seemed reasons to worry that this quantum theory would not have a product picture. After all, one might have objected that quantum gravity is, in a certain well-known sense, a gauge theory, and, more to the point, its Hamiltonian formulation involves four well-known constraints. Furthermore, one might then have argued by analogy that, just as the Gauss law constraint prevents quantum electrodynamics, in its Coulomb gauge formulation, from having a product picture, so one might expect that the four constraints of general relativity will prevent quantum gravity from having a product picture.
However, now that we have provided QED with an alternative formulation, which is a product picture, this worry is at least to some extent dispelled. It raises the hope that, with further work, we will similarly be able to provide quantum gravity with a product picture.
To end this afterword, I would like to make some tentative and sketchy remarks about how our matter gravity entanglement hypothesis might possibly be related to Roger Penrose's ideas (e.g., as discussed in Ref. 10) about cosmological entropy and especially to his related ideas (see, e.g., Ref. 11) about quantum state reduction. It seems to me to be still quite unclear at present whether the two lines of thought are fully in contradiction with one another or whether our hypothesis perhaps stands in relation to Penrose's vision in somewhat the same way that, e.g., Bohr's 1910 theory of the hydrogen atom related to the 1924 ideas of de Broglie (which led, a little later, to the Heisenberg–Schrödinger theory of quantum mechanics).
One point of contact is that both theories have addressed the issue of what happens to would-be macroscopic quantum superpositions involving different spacetime geometries, and, for concreteness, both theories have been illustrated by considering a Schrödinger Cat-like superposition of two states of a single uniform density massive ball centered on two distinct locations in an otherwise empty (asymptotically flat) space. Each ball state will have its own (Newtonian) gravitational field, and the question is whether and in what sense the total state might spontaneously decohere. According to Penrose's ideas,11 one expects such a macroscopic superposition to decay to a state centered on a single location on some time-scale, which depends on the size and mass of the ball and the separation of the centers of mass of the two ball states in the superposition. According to the matter gravity entanglement hypothesis, such a superposition could be in a static state, but the relevant density operator, when one asks questions about purity versus mixedness, will not be its (which will always be pure) but rather it will be the reduced density operator . One can compute this similarly to the way in which one would compute in our product picture for QED for a superposition involving two states of an electrically charged ball centered on two different locations in the non-relativistic version of QED described in Sec. IV. In fact, formulae for both in the QED analogy and for in weak-field Newtonian quantum gravity were obtained by the author in Ref. 4 (albeit I was able, only around 24 years later, to provide them, in the case of QED, with the more satisfactory theoretical underpinning described here and in Ref. 8 [where the formulae are reproduced in Secs. 1 and 4.3] and I should mention that work is in progress on providing such an underpinning [and the fixing of an overall constant] in the case of linearized gravity), and one finds that (in the case the displacement of the centers is much smaller than the ball radius) the degree of mixedness of / is governed by a certain decoherence length, which again depends on the size and mass of the ball and the separation of the centers of mass of the two ball states in the superposition.
As discussed in more detail in Ref. 4, the formula obtained there for the decoherence length is remarkably similar to the formula obtained (up to a presently unfixable multiplicative constant) by Penrose in Ref. 11 for his decay time-scale. (The only difference is that where there is a Laplacian in the latter formula, there is a square root of the Laplacian in the former.) On the other hand, there are important physical differences. One difference, as we have already seen, is that the decoherence is a time-dependent process in the Penrose theory, while it is ever present in the setting of Ref. 4. Another is that the sort of decoherence one obtains on the theory of Refs. 4 and 8 is, in the case of static or slowly changing ball configurations, a reversible sort of decoherence; if one were to (slowly) bring the ball centers back into coincidence, the decohered ball states would recohere. For this reason, in the Newtonian limit, the matter gravity entanglement hypothesis appears (see Ref. 5 for details) to predict the same result as standard quantum mechanics for the experiment9 proposed by Penrose and thus to be experimentally distinguishable from his theory. One expects instead that, on the matter-gravity entanglement hypothesis, irreversible decoherence would occur and lead to a noticeable loss of coherence in that experiment only if the oscillator in the experiment were to oscillate sufficiently rapidly to emit a graviton in a suitably short time interval—a much harder regime to access (for more discussion and for more discussion of the physical interpretation of , see again Refs. 5 and 7).
Another point of difference between the two sets of ideas is that, on the matter-gravity entanglement hypothesis, the entropy of the universe resides at all times in the entanglement of matter with gravity (which is assumed to start low and then predicted to increase), whereas, on Penrose's ideas, one apparently thinks in terms of an entropy, which is a sum of a matter entropy (which dominates at early times) and a gravity entropy (which dominates at late times and is larger than the matter entropy at early times). Here, there appears to be an antagonism between the two theories, which resembles (and partly subsumes) the antagonism between the understanding of black hole entropy on the matter-gravity entanglement hypothesis (according to which the total state is pure, while it is the reduced density operator of matter as well as the reduced density operator of gravity which are approximately Gibbs states), and the traditional (Hawking) understanding of black holes according to which the total state is a (mixed) Gibbs state. See the proposed resolution of the thermal atmosphere puzzle in Ref. 6.
ACKNOWLEDGMENTS
The author thanks the Leverhulme Foundation for the award of Leverhulme Fellowship RF&G/9/RFG/2002/0377 for the period October 2002 to June 2003 during which some of this work was done. The author also thanks Jonathan Dimock and Michael Kay for valuable comments.
AUTHOR DECLARATIONS
Conflict of Interest
The author has no conflicts to disclose.
DATA AVAILABILITY
Data sharing is not applicable to this article as no new data were created or analyzed in this study.