We investigate the canonical structure of the bosonic sector of the unique maximal supergravity theory in five dimensions that is manifestly invariant under the global action of . Starting from the Lagrangian formulation of the theory, we construct the Hamiltonian formulation and the full set of canonical constraints. We determine all gauge transformations and compute the algebra formed by the canonical constraints under the Poisson bracket. We re-derive the number of physical degrees of freedom and construct the extended Hamiltonian, describing the most general time evolution of the theory, where the full gauge freedom is manifest.
I. INTRODUCTION
In this work, we investigate the canonical structure of the bosonic sector of the unique ungauged maximal supergravity theory in five dimensions that is manifestly invariant under the global action of .
Toroidal compactification of 11 dimensional supergravity leads to symmetries described by the exceptional Lie groups in the lower dimensional theories in 11 − n dimensions. This fact was first discovered by Cremmer and Julia in 1979,1 and Cremmer went on to describe the five dimensional case with symmetry in detail in 1980.2 The occurrence of exceptional symmetries in maximal supergravity remains one of the most remarkable features of these theories and remains to be fully understood at the quantum level.3
We choose to analyze the invariant theory since it is one of the simplest examples of a theory with an exceptional symmetry. In five dimensions, no self-dual forms exist and the symmetries are symmetries of the Lagrangian itself, not just of the equations of motion. Furthermore, the group itself is easier to work with than the larger exceptional groups. Moreover, we choose to analyze the bosonic sector of the theory since this greatly reduces the complexity of the analysis while still keeping many interesting features intact.
exceptional field theory—first described in 20134—is an covariant extension of 11 dimensional supergravity on a 5 + 27 dimensional generalized space–time with external and internal coordinates (xμ, YM).4–7 The internal coordinates YM carry an index in the fundamental representation of . The so-called section condition of the theory is a covariant constraint on the coordinate derivatives ∂M of the internal geometry, which requires that only a subset of the 27 internal coordinates is physical.
Exceptional field theory has since developed into a very active field with too many applications and directions to mention here in detail. Some examples are the construction of consistent truncations and duality covariant solutions of supergravity theories (including also less than maximal supersymmetry), a unifying treatment for brane solutions and exotic branes, non-geometric backgrounds, duality covariant graviton amplitudes, and many more—we refer the reader to the following review and publications.8–10
The ungauged invariant supergravity theory in five dimensions is related to the exceptional field theory by taking the trivial solution to the section condition ∂M = 0, thus removing all traces of the exceptional generalized geometry.5 This procedure is equivalent to the compactification of 11 dimensional supergravity on a 6 dimensional torus.5
So far the canonical formulation of exceptional field theory has not been investigated, at least in part due to the great complexity of the exceptional generalized geometry, the complicated topological term and the many covariantization terms involved. However, a comprehensive canonical analysis of exceptional field theory should cast light on the exact role of the section condition—in particular on its relevance to the closure of the gauge algebra. Furthermore, it would explicitly demonstrate the role of the external diffeomorphisms in interconnecting the terms of the bosonic exceptional field theory Lagrangian to fix all the relative coefficients.5 Moreover, one may see the canonical formulation as the starting point for the canonical quantization procedure and a potential way to identify a higher dimensional Ashtekar connection.11–13 Likewise, one may be interested in the canonical formulation of exceptional field theory as a tool for finding the solution to the local initial value problem for the exceptional generalized geometry as well as for finding the correct definition of the Arnowitt–Deser–Misner (ADM) energy and momentum in this setting.
This work is intended as a preparation and first stepping stone toward the canonical analysis of the full exceptional field theory. We aim to make this analysis as explicit, self-contained, and complete as possible in order to provide a useful reference for future work. By analyzing the supergravity theory on its own, we expect to gain insight into and greatly facilitate the canonical analysis of the full exceptional field theory—nonetheless, the supergravity theory is a very interesting theory in its own right, and hence, we expect there to be applications of this work that are not related to exceptional field theory.
We use the canonical formalism for this analysis because it is an algorithmic framework that can be applied to virtually all gauge theories. The canonical analysis yields much of the relevant information about a theory, such as its symmetries, gauge transformations, gauge algebra, and physical degrees of freedom. Furthermore, it is the starting point for the canonical quantization of a theory, and in its extended form, the canonical formulation goes beyond the Lagrangian framework by making the full gauge freedom manifest.14,15
The outline of this work is as follows: In Sec. I A, we clarify the notation and conventions used in this work. In Sec. I B, we review the necessary principles of the Arnowitt–Deser–Misner formulation of general relativity. In Sec. I C, we describe the Lagrangian formulation of the invariant supergravity theory.
We begin the construction of the canonical theory in Sec. II. In Sec. II A, we compute the canonical momenta and perform some redefinitions. We then state the canonical Hamiltonian in Sec. II B. The primary constraints are discussed in Sec. II C. The complete set of canonical constraints is constructed in Sec. II D. The total Hamiltonian is stated in Sec. II E.
We begin the canonical analysis itself in Sec. III. We define the diffeomorphism weight in Sec. III A and compute all gauge transformations of all canonical coordinates in Sec. III B. The algebra of constraints is calculated in Sec. III C. The extended Hamiltonian is stated in Sec. III D. In Sec. III E, we calculate the number of physical degrees of freedom of the theory.
We conclude with a summary of the results in Sec. IV.
For a short review of the notation and basics of canonical analysis, see Appendix A. In Appendix B, we list Poisson bracket relations that are needed and useful in computations. In Appendix C, we discuss some other useful formulas needed in the computations. In Appendix D, we discuss the treatment of the scalar coset constraints for the example of the SL(n)/SO(n) coset.
A. Notation and conventions
We work in the 1 + 4 dimensional ADM decomposition (see Subsection I B) of the 5 dimensional space–time and use the signature (− + + + +). We assume to work with a smooth, metric, Lorentzian, and globally hyperbolic manifold. Indices from the middle of the Greek alphabet, e.g., μ, ν, ρ, σ, τ, …, denote five dimensional curved space–time indices. We use Greek indices from the beginning of the alphabet, e.g., α, β, …, to denote five dimensional flat indices. We use Latin letters from the middle of the alphabet, e.g., k, l, m, n, …, for four dimensional curved spatial indices and Latin letters from the beginning of the alphabet, e.g., a, b, c, d, e, …, for flat spatial indices. We exclude the letter t from this list, since we use it to denote the curved time index. The spatial flat indices are lowered and raised by the four dimensional Euclidean metric, and thus, their placement is irrelevant. For a simple reference of the types of space–time indices used, see Table I.
. | Flat . | Curved . |
---|---|---|
1 + 4 dimensional | α, β, γ, … | μ, ν, ρ, σ, τ, … |
4 dimensional | a, b, c, d, e, … | k, l, m, n, … |
. | Flat . | Curved . |
---|---|---|
1 + 4 dimensional | α, β, γ, … | μ, ν, ρ, σ, τ, … |
4 dimensional | a, b, c, d, e, … | k, l, m, n, … |
In addition to the letters mentioned above, we will reserve the indices 0 and t to denote the flat and curved time directions, meaning α = (0, a) and μ = (t, m).
Furthermore, we use capitalized Latin letters, e.g., K, L, M, N, …, to denote indices of the fundamental representation of the Lie group, and these indices run from 1, …, 27. These are taken from the middle of the Latin alphabet; however, in this work, there will be no other types of capitalized Latin indices.
The object ϵμνρστ is the totally antisymmetric Levi–Cività symbol and is independent of the metric and coordinates. We choose the convention ϵ01234 ≔ +1.
B. Arnowitt–Deser–Misner (ADM) formulation of general relativity
In this section, we state the main results concerning the ADM decomposition which we need for the canonical formulation of supergravity. For many more details on the topic, see Refs. 16–19.
If we assume that the manifold is globally hyperbolic, we can foliate the manifold into a set of space-like hypersurfaces and decompose the metric in the following way.20
We write the metric in the vielbein formalism (also known as the tetrad-, local frame-, or Cartan formalism) as follows: is the five dimensional frame field (or fünfbein), and ηαβ is the Minkowski metric with signature (− + + + +),20
The following equivalent identities define the inverse vielbein:
By introducing the lapse function N, the shift vector Na, and the spatial vierbein (four dimensional frame field) , we can parameterize the fünfbein (five dimensional frame field) in the following way:
Using Eq. (1.2), we find that the parameterization of the inverse fünfbein is given by the following equation:
The components of the metric that follow from these parameterizations are given in Eqs. (1.6)–(1.8). The four dimensional metric on the spatial hypersurfaces is given by gmn,
When decomposing the inverse metric, one should note that the components of the inverse metric are not the inverse of the components of the metric, in particular with respect to the spatial metric inverse [see Eq. (1.11)]. The inverse metric components are written out as follows:
The relation between the determinant of the five dimensional frame field E and the determinant of the spatial four dimensional frame field e is given by a factor of the lapse function,
The determinant of the metric G can be expressed in the following form:
Other formulas relating to the geometry, which are useful in computations, can be found in Appendix C.
When matter is coupled to the Einstein–Hilbert action, we arrive at the ADM decomposition by inserting the decomposition of the metric or vielbein fields and then separate the matter field components by their space–time index structure. For example, a one form Aμ splits into At and Am, a two form Bμν into Btn and Bmn, and a scalar field remains unchanged.
C. invariant five dimensional (super-)gravity
We are interested in the bosonic sector of the maximal supergravity in five dimensions that is manifestly invariant. The bosonic field content of this theory is given by or in the ADM decomposition by .2,21 The lapse function, the shift vector field, and the spatial vielbein were already introduced in Sec. I B.22
The fields are Abelian vector gauge fields with one Lorentz index and one fundamental index.
The fields MMN are Lorentz scalars and carry two fundamental indices that are symmetric. They are elements of the USp(8) coset, which is 42 dimensional; hence, not all components of MMN are independent.2 Due to the coset structure, MMN transforms covariantly under the global action of and is invariant under the local action of USp(8). The scalar fields can be interpreted as an metric, and if one considers also the fermionic sector of the theory, it is, in fact, necessary to also rewrite the scalar fields in terms of the 27 dimensional vielbein (see Ref. 2). The metric interpretation becomes much clearer when one considers the full exceptional field theory (see Refs. 5 and 6).
We treat the USp(8) coset constraints as being implicit—meaning that in this formalism, MMN is a priori treated as if it is a generic symmetric matrix with 378 components (M, N = 1, …, 27) until all Poisson brackets have been evaluated.
This allows us to sum over all components of MMN without distinguishing whether they are truly independent coset parameters or not and simplifies the notation and canonical analysis significantly. However, there are implicit coset constraints on this MMN that ultimately—after evaluating all Poisson brackets—guarantee that it is an E6(6)/USp(8) coset representative with 42 degrees of freedom. We want to treat the coset constraints on MMN implicitly because of their great complexity and because we do not need to use them explicitly in the context of this work—one should think of them as being added to the Lagrangian with appropriate Lagrange multipliers. If we were to do this explicitly, they would appear as canonical constraints and their consistency conditions would generate further canonical constraints on the canonical momenta ΠMN(M). The constraints would relate various components of MMN and their canonical momenta ΠMN(M) among each other—implying among other things that det(M) = 1. Because they are canonical constraints, we are not allowed to apply them before fully evaluating all Poisson brackets. Indeed, if we treated them explicitly, they would require the introduction of a Dirac bracket because the canonical coset constraints of the fields and the momenta will form a second class system of constraints. To clarify this implicit treatment of the coset constraints, we demonstrate in Appendix D the explicit and implicit formalism for the much simpler case of the coset SL(n)/SO(n)—which has as its only coset constraint det(M) = 1. The scope of the implicit formalism will be sufficient for all calculations done in this work, and we will find that it leads to the right gauge transformations and dynamics when comparing it to the Lagrangian formulation.
In the Lagrangian formalism, the theory is given by the action of Eq. (1.14) and the Lagrangian density can be written in the form of Eq. (1.15). The Lagrangian density was first described in Ref. 2. Another way of obtaining Eq. (1.15) is to apply the trivial (∂M = 0) solution of the section condition to the full exceptional field theory Lagrangian density, thus removing all terms related to the exceptional generalized geometry,5
The gravitational part of the theory is described by the Einstein–Hilbert term and minimal coupling to the other fields. (5)R = (5) is the Ricci scalar in five dimensions.
The kinetic term of the scalar fields is a non-linear sigma model of the USp(8) coset—with the coset constraints treated as explained above. The inverse scalar fields MMN are defined by .
The vector field kinetic term is described by a Maxwell theory type term with the additional contraction of the indices by the scalar fields. The Abelian field strength is given by . The Abelian gauge group is U(1)27 due to the fact that 27 is the dimension of the fundamental representation of , and we have 27 copies of the vector field. The Bianchi identity is not a canonical constraint, since it does not put any constraints on the canonical variables. The Bianchi identity simply follows from the commutativity of partial derivatives.
In addition, there is a metric independent topological term of the form A ∧ F ∧ F—note that this term is also second order in derivatives. The coefficient of the topological term (the sign is convention dependent) is needed for maximal supersymmetry in five dimensions.22 This value of κ also guarantees invariance upon reduction to three dimensions (cf. Ref. 23 for the reduction from 11 dimensions). The precise value is not relevant for this paper, and we will keep the coefficient general.
The symbols dLMN and dLMN are fully symmetric and carry three fundamental indices. They are the (up to factors) unique invariant symbols of the fundamental representation of , and we use the normalization given by .2,5 In this work, we only need the fact that they are fully symmetric. For more details and useful identities regarding these symbols, see Ref. 5.
II. CANONICAL FORMULATION: HAMILTONIAN AND CANONICAL CONSTRAINTS
In this section, we construct the Hamiltonian formulation of the theory, starting from the Lagrangian formulation described in Sec. I C. As a reminder or as a brief introduction, we summarize the main definitions and notations of the canonical formalism and canonical analysis in Appendix A.
A. Canonical momenta
We will employ the notation that a capital Π(X)—with appropriate indices—signifies the canonical momentum associated with the field X. In many cases, the specification of X can be skipped since it is usually clear from the index structure of Π alone which field is conjugate to it.
Calculating the canonical momenta [see Eq. (A2) for the definition] of the lapse function, shift vector, and the time component of the gauge field, we immediately find the following primary constraints of the form Π(X) = 0:
They vanish due to the fact that the Lagrangian density in Eq. (1.15) does not depend on the time derivative of their conjugate fields. We refer to this type of constraint as a shift type constraint since the gauge transformations generated by these constraints are shifts of the conjugate fields (as we will see in Sec. III B).
The remaining canonical momenta do not vanish since the Lagrangian does contain time derivatives of their conjugate fields. A dot above a function—such as in Eq. (2.6)—indicates a ∂t derivative operator,
The component of the coefficient of anholonomy in Eq. (2.4) follows from the ADM decomposition of the definition of the coefficient of anholonomy .19
It is useful to define the contractions Πab(e) and Π(e) of the vielbein with the vielbein momentum,
To compare with the metric formulation of canonical general relativity (see, e.g., Refs. 16, 17, and 24), one can use Eq. (2.9) to relate the canonical momenta of the spatial vielbein to the canonical momenta of the metric,
Starting from Eq. (2.5), we can redefine to simplify the Hamiltonian and hence the gauge transformations. We define as the canonical momentum subtracted by the topological term contribution. This way takes the form expected of the theory without topological term [see Eq. (2.11)],
One has to be careful with this redefinition, however, since it is not a canonical transformation. This can be seen from the fact that has a non-vanishing Poisson bracket with itself . If we explicitly compute this Poisson bracket, we find Eq. (2.12). The upper letter at the derivatives indicates the coordinate of differentiation. We refrain from using this equation in the following calculations since it is rather cumbersome—it is nonetheless a valid identity. A more manageable approach to the calculations is to proceed order by order in the coefficient of the topological term κ,
Nonetheless, the use of the momentum greatly simplifies the Hamiltonian, and it has nice transformation properties, as we will see in Sec. III B.
The scalar momentum ΠRS(M) from Eq. (2.6) is computed using the part of the variation of the Lagrangian given by Eq. (2.13). We furthermore assume Eq. (2.14) as the definition of the fundamental Poisson bracket relation of ΠRS(M) with the scalar fields. As explained in Sec. I C and Appendix D, we treat the scalar fields and their momenta as generic symmetric matrices until all the Poisson brackets have been fully evaluated, and hence, Eq. (2.14) is the fundamental Poisson bracket of a symmetric matrix. Equation (2.14) furthermore eliminates the need for any distinction between diagonal and off-diagonal elements of the scalar fields and momenta. We assume that sums run over the full index range,
B. Canonical Hamiltonian
Having found all the canonical momenta in Sec. II A, we can now calculate the Hamiltonian density associated with Eq. (1.15) by a Legendre transformation. To do so, we calculate the ADM decomposition of all terms of the Lagrangian density—as described in Sec. I B—and then perform the Legendre transformation with respect to all canonical momenta [see Eq. (A3)].
The canonical Hamiltonian density is given in Eq. (2.15). Here, we can factor out the Lagrange multipliers—meaning the fields whose momenta are primary constraints of shift type—thus already making the secondary constraints apparent,
We can see that there are just three terms that contain the redefined momentum and only one topological term—just like in the Lagrangian density. Thus, using instead of gives a much simpler Hamiltonian—as can be seen from reinserting the definition of .
The terms in the last line of the Hamiltonian stem from the Legendre transformation; since the Lagrangian does not depend on the time derivatives of the Lagrange multipliers, these terms stay as they are.
It is common to work on the surface of primary constraints, consequently, removing the last line from the Hamiltonian. However, we aim to keep the setting as general as possible and do not want to restrict the analysis to a subregion in phase space.
C. Primary constraints
Calculating the canonical momenta, we have already seen that some of them vanish to yield primary constraints of shift type (see Sec. II A). There are six more primary constraints, called the Lorentz constraints Lab, with Lab = L[ab]. Since the Lorentz symmetry is manifest in the vielbein formalism, there are constraints associated with this symmetry. The constraints Lab are not of shift type and are not obvious, but they do follow immediately from the momenta of the spatial vielbein [see Eq. (2.4)] and take the explicit form of Eq. (2.19).19 The complete list of all 38 primary constraints is as follows:
D. Secondary constraints
In this section, we follow the procedure for finding the complete set of constraints and guaranteeing their consistency, as outlined in Appendix A.
In order for the primary constraints to be consistent, we require that their time evolution is constant, i.e., that their time derivative vanishes.15 We are free to add the primary constraints—with arbitrary phase space functions C0, C1, C2, C3 as parameters—to the Hamiltonian, since we can arbitrarily extend the Hamiltonian away from the primary constraint surface in phase space. We arrive at a preliminary total Hamiltonian given by the following equation:
Using , we can now test the consistency of a primary constraint Φ using the following equation [cf. Eq. (A8)]:
It is important to note here that the primary constraints all Poisson-commute among each other—with the exception of the Lorentz constraints with themselves {Lab, Lcd} ≠ 0, since they form the Lorentz subalgebra, as we will see in Sec. III C.
First, we consider the consistency of the primary constraints of shift type—meaning vanishing canonical momenta. Since the primary constraints of shift type appear as Lagrange multipliers in the Hamiltonian and Poisson-commute with all other primary constraints, we find that their consistency requires that each shift type of primary constraint yields one secondary constraint. We call Eq. (2.22) the Hamilton constraint, Eq. (2.23) the diffeomorphism constraint and Eq. (2.24) the Gauss constraint . The Gauss constraint is named due to the similarity to the constraint in Maxwell theory and Gauss’s law. Note that we have chosen to include a vielbein in the definition of the diffeomorphism constraint in order for the constraint to have a curved index. We will see in Sec. III B that these names are indeed justified,
The secondary constraints restrict the canonical coordinates dynamically since we have made use of the equations of motion to find them. The secondary constraints are equivalent to the time components of the Lagrangian equations of motion. The Hamilton constraint is the time–time-component of the Einstein equation, the diffeomorphism constraints are the time-spatial-components of the Einstein equation. Similarly, the Gauss constraints are the time-components of the analog of the Maxwell equation.
It is worth noting that the Gauss constraints HGauss are independent of the metric and scalar degrees of freedom.
The gravitational part of the diffeomorphism constraint HDiff can be rewritten using Eq. (2.25) to reveal a term containing the Lorentz constraint with a field dependent coefficient given by the spin connection . The covariant derivative ∇n contains the Levi–Cività connection, and Dm contains the spin connection,
One can redefine the diffeomorphism constraint as in Eq. (2.26), which makes the new diffeomorphism constraint and the Lorentz constraint Poisson-commute. However, we will see that the constraint of Eq. (2.23) gives the nicer and expected gauge transformations [see Eq. (3.22)]. We will thus continue to work with the diffeomorphism constraint of Eq. (2.23). The redefinition would simply represent a different choice of basis of the constraint algebra,
The only primary constraints not of shift type are the Lorentz constraints. If we insert the Lorentz constraints in Eq. (2.21), we find that no new constraints are being generated. The consistency requirement does however restrict the coefficient of the Lorentz constraint term in the total Hamiltonian. The coefficient has to be for the constraints to be consistent. This is precisely the term found in Eq. (2.26). This means that the total Hamiltonian does not contain an intrinsic Lorentz constraint term. We will see in Sec. III D that there is a Lorentz constraint term in the extended Hamiltonian.
To verify the consistency of the secondary constraints, we also take Eq. (2.21) and now insert the secondary constraints as Φ. We find no tertiary constraints implying that the set of constraints we have found so far is complete and consistent. There is no constraint associated with the exceptional symmetry of the theory since it is a global symmetry of the theory and not a gauge symmetry.
It is beneficial to make use of the concept of integrated or smeared constraints in order to avoid expressions that contain derivatives of the delta distribution. We define the smeared constraints by contracting all indices of the constraints with a tensor of test functions with the same symmetries and then integrating this expression over the entire spatial hypersurface. We denote the integrated constraint by adjoining brackets with the name of the test function to the constraint.
If we take the diffeomorphism constraint as an example, the smeared constraint is defined by Eq. (2.27), where λn(x) is a vector of test functions on the spatial hypersurface,
Note that no symmetry factors are inserted. The smeared version of the Lorentz constraints is given by Eq. (2.28), where γab = γ[ab],
E. Total Hamiltonian
The total Hamiltonian is given by Eq. (2.29). To arrive at the total Hamiltonian, we start with the canonical Hamiltonian of Eq. (2.15) and add all primary constraints with coefficients that satisfy the consistency conditions derived in Sec. II D,
The coefficients and are arbitrary phase space functions. The coefficient of the Lorentz constraint term is restricted to this particular form containing the spin connection, as explained in Subsection II D.
The time evolution of a phase space function F, generated by the total Hamiltonian, via Eq. (2.30) is equivalent to the original Lagrangian time evolution.15 Note that the equality in Eq. (2.30) is weak—this means that after evaluating the Poisson bracket, the equation holds only on the phase space surface where the constraints are satisfied (see Appendix A),
In Sec. III D, we construct the extended Hamiltonian and find that the exact coefficients in the total Hamiltonian are irrelevant to the most general time evolution since they are all replaced by arbitrary coefficient functions in the extended Hamiltonian.
In practice, it is often easiest to determine the time evolution by first computing all the gauge transformations. We compute all gauge transformations of all fields and momenta in Sec. III B.
III. CANONICAL ANALYSIS: GAUGE TRANSFORMATIONS AND GAUGE ALGEBRA
In this section, we analyze the constraints we found in Sec. II. We calculate the gauge transformations generated by the constraints, compute the algebra that is formed by the constraints under the Poisson bracket, determine the number of physical degrees of freedom, and discuss the extended Hamiltonian.
A. Diffeomorphism weight and the Lie derivative
We briefly define the diffeomorphism weight in this section, as the concept is required for Secs. III B–III E.
The Lie derivative of a tensor T with parameter ξ is denoted by . The diffeomorphism weight Λ(T) is defined as the coefficient of the weight term in the Lie derivative. For example, if T is a vector, we write Eq. (3.1) for the components of the Lie derivative,
Table II lists the diffeomorphism weights of all the relevant fields and momenta. The vielbein determinant is a tensor density, and since the Lagrangian includes a vielbein determinant, the canonical momenta have diffeomorphism weight one too.
B. Gauge transformations
In this section, we compute the explicit form of all infinitesimal gauge transformations. To do so, we calculate the Poisson brackets of all the constraints Φ[λ] with all the canonical coordinates X via δX = {X, Φ[λ]} [see Eq. (A12)]. We use the notion of smeared constraints in Secs. III B–III E, as explained in Sec. II D. We only state the non-vanishing gauge transformations.
To compute Poisson brackets of the canonical coordinates, we define the fundamental Poisson brackets as follows:
Due to the implicit treatment of the coset constraints—as explained in Sec. I C and Appendix D—Eq. (3.7) is the fundamental Poisson bracket relation for a generic symmetric scalar matrix, and thus, there is no coset projector term.
Note that the redefinition of the momentum of from Eq. (2.10) does not affect the fundamental Poisson bracket, since the term that is subtracted in the redefinition only depends on the field .
In Appendix B, we list many other useful Poisson bracket identities that are needed in the computation of the gauge transformations.
To keep the expressions as simple as possible, we omit the notation of the coordinate dependence in the following. It is understood that the gauge transformation only depends on the coordinate of the field that the constraint acts upon.
We begin with the primary constraints of shift type. The only gauge transformations that one can generate using these constraints are shift transformations on the fields canonically conjugate to the vanishing momenta,
The Lorentz constraints generate Lorentz transformations on the spatial vielbein and its canonical momentum. The vielbein determinant is Lorentz invariant—as are all quantities that can be expressed solely through the metric tensor. The transformations take the form of a rotation of the flat index by the smearing tensor,
The Hamilton constraint generates time evolution, which we can interpret as a gauge transformation.15 The time evolution generated by just the Hamilton constraint—unlike the total Hamiltonian—does not capture any other gauge freedom. However, without the Hamilton constraint, time evolution is not possible at all. The time evolution of the fields generated by the Hamilton constraint is essentially given by the canonically conjugate momenta. The time evolution of the canonical momenta is more complicated and captures most of the dynamics. Since the vielbein—or equivalently the spatial metric—contracts all terms in the Lagrangian—with the exception of the topological term—its conjugate canonical momentum has a particularly complicated time evolution [see Eq. (3.14)]. In the third line of Eq. (3.14), we see the spatial Einstein equation in the vielbein form. It is useful to first compute the Poisson brackets of Eq. (B2) and the following equations.
The time evolution of can be compared to the time evolution of the canonical momentum of the metric in pure general relativity (see Refs. 16 and 17) using Eq. (2.9),
The Gauss constraint generates Abelian U(1)27 gauge transformations on the one form gauge field . The conjugate momentum is invariant under these transformations—like it is in the free theory—which is a nice property of the redefinition from Eq. (2.10). Since the constraint is independent of the metric and scalar fields, they do not transform,
The diffeomorphism constraint generates diffeomorphisms on the spatial hypersurface via the Lie derivative (including the appropriate weight terms). If we were to use the redefined diffeomorphism constraint from Eq. (2.26), we would see additional terms in the transformation of the vielbein and its conjugate momentum,
Due to the parameterization of the Lagrangian, the transformation of the one form gauge field is a Lie derivative only up to a U(1)23 gauge transformation generated by the Gauss constraint HGauss. The notation is taken to mean . In the case of its conjugate canonical momentum , the transformation is a Lie derivative up to the Gauss constraint HGauss and thus weakly equal to the Lie derivative.
The Schouten identity from Appendix C has to be used repeatedly to compute the transformation of the one form field and its momentum.
C. Algebra of constraints
The algebra that is spanned by the canonical constraints under the Poisson bracket is equivalent to the algebra of gauge transformations. We can interpret a Poisson bracket of two canonical constraints as the commutator of two infinitesimal gauge transformations.
In general, it is easiest to explicitly write out the simpler looking constraint and then make use of the gauge transformations from Sec. III B to compute the algebra. For relations that involve the diffeomorphism constraint, it is easiest to make use of the fact that all fields—except the one from gauge field and its conjugate momentum—transform as Lie derivatives under the diffeomorphism constraint.
The primary constraints of shift type Poisson-commute with all the other constraints and are therefore not listed below. The full constraint algebra can be written as follows:
From Eq. (3.28), we see that two different orderings of time evolutions (as generated by the Hamilton constraint) can only differ by a diffeomorphism and a Lorentz transformation. This means that the time evolution with the Hamilton constraint is unique up to these gauge transformations.
Due to the use of the vielbein formalism and the choice of diffeomorphism constraint (see the discussion in Sec. II D), Eq. (3.28) contains a Lorentz constraint term whose smearing function depends on the spin connection. Nonetheless, Lorentz invariance is preserved since both the spin connection and the Lorentz constraint itself transform under a Lorentz transformation to cancel out the transformation of the diffeomorphism constraint term. From Eq. (2.26), one can moreover see that this additional term disappears when using the redefined constraint. However, the redefinition has many other consequences, which are not nice, including the introduction of additional terms in the algebra and in the gauge transformations as we have already mentioned.
In Eq. (3.29), we see that the Poisson bracket of a diffeomorphism and time evolution is the time evolution with a smearing function given by the Lie derivative of the parameter. There is furthermore a Gauss constraint term, which is due to the transformation property of the one form gauge field and its conjugate momentum under the diffeomorphism constraint.
Equations (3.28) and (3.29) moreover show that the algebra is actually an open- or pseudo-algebra, since the smearing functions on the right-hand side depend on canonical coordinates—see Refs. 15 and 21 for more information. Nonetheless, the term algebra is commonly used in such cases.
Note that even in pure general relativity, the smearing function of the diffeomorphism constraint in Eq. (3.28) contains the inverse metric. The constraint algebra of pure general relativity has been discussed in Refs. 24, 25, and 26. In addition, the canonical formulation and constraint algebra of 11 dimensional supergravity have been discussed in Refs. 27 and 28.
Equations (3.31), (3.33), and (3.34) show that diffeomorphisms, U(1)27 gauge transformations, and Lorentz transformations each form a proper subalgebra of their own. Equation (3.33) thus also prove that the gauge field is indeed Abelian.
Equation (3.36) tells us that the Poisson bracket of the diffeomorphism constraint and the Lorentz constraint is given by a Lorentz transformation with the Lie derivative of the parameter. Using the redefinition of Eq. (2.26), one can make this term vanish, however, with the same unintended consequences as mentioned before.
We now know the full algebra of constraints and see that it does indeed close under the Poisson bracket. This fact implies that all canonical constraints Poisson-commute weakly—i.e., up to constraint terms the Poisson brackets of any two constraints vanish—or equivalently that all constraints are first class. This is no longer true, once one considers the full theory including fermions.
D. Extended Hamiltonian
The extended Hamiltonian is constructed from the total Hamiltonian [Eq. (2.29)] by adding all first class constraints with arbitrary coefficients. Since all the constraints are first class, the extended Hamiltonian is the linear sum over all constraints with arbitrary coefficient functions [see Eq. (3.38)]. The difference between the total and the extended Hamiltonian is in this case that the parameters of the secondary constraints and the Lorentz constraints become completely arbitrary, hence allowing for more general gauge transformations than the Lagrangian time evolution,
The most general time evolution of a phase space function F is then given by Eq. (3.39). Since we already know all the gauge transformations, we can use this to efficiently compute the time evolution of any function using Eq. (3.38) and the linearity of the Poisson bracket,
The time evolution described by the extended Hamiltonian and the canonical Hamiltonian is equivalent for observables, since they are by definition gauge invariant.
E. Counting the degrees of freedom
With the full set of constraints as well as the constraint algebra known, we can re-derive the number of physical degrees of freedom of the theory. In Table III, we list all the field degrees of freedom and the primary and secondary constraints.
Fields . | No. . | Primary constraints . | No. . | Secondary constraints . | No. . |
---|---|---|---|---|---|
N | 1 | Π(N) | 1 | Hamilton constraint | 1 |
Na | 4 | Π(Na) | 4 | Diffeomorphism constraints | 4 |
ema | 16 | Lorentz constraints | 6 | … | 0 |
M(MN) | 42 | … | 0 | … | 0 |
27 | 27 | Gauss constraints | 27 | ||
108 | … | 0 | … | 0 | |
Total | 198 | Total | 38 | Total | 32 |
Fields . | No. . | Primary constraints . | No. . | Secondary constraints . | No. . |
---|---|---|---|---|---|
N | 1 | Π(N) | 1 | Hamilton constraint | 1 |
Na | 4 | Π(Na) | 4 | Diffeomorphism constraints | 4 |
ema | 16 | Lorentz constraints | 6 | … | 0 |
M(MN) | 42 | … | 0 | … | 0 |
27 | 27 | Gauss constraints | 27 | ||
108 | … | 0 | … | 0 | |
Total | 198 | Total | 38 | Total | 32 |
We count a total of 198 field variables—or 396 = 2 · 198 canonical coordinates in phase space. To arrive at this result, we have applied the implicit coset constraints to the scalar fields—as outlined in Appendix D—to reduce the number of scalar fields to the 42 independent variables, we need to describe the /USp(8) coset. We also count a total of 70 = 38 + 32 canonical constraints, all of which are first class constraints and hence have to be counted twice.15 Thus, the theory has 396 − 2 · 70 = 256 physical dimensions in phase space or equivalently 128 physical (bosonic) degrees of freedom. This is in agreement with the well-known result that—at each point in space—maximal supergravity has 128 bosonic degrees of freedom.2,21,29,30
IV. SUMMARY AND OUTLOOK
Starting from the Lagrangian formulation of the bosonic sector of the invariant supergravity theory in five dimensions, we have constructed the Hamiltonian formulation of that theory. We then constructed the complete set of 70 canonical constraints. We calculated the gauge transformations generated by the canonical constraints and found that they generate time evolution, diffeomorphisms, Lorentz transformations, U(1)27 gauge transformations, and shift transformations. As is to be expected, the symmetry is not generated by canonical constraints since it is a global symmetry. We found that the algebra of gauge transformations closes and that all constraints of the bosonic theory are first class. Hence, the extended Hamiltonian—describing the most general time evolution of the theory, where the full gauge freedom is manifest—was constructed by summing over the complete set of constraints with fully arbitrary coefficient functions. We finally confirmed the well-known result that the number of physical degrees of freedom of the theory is 128.
This work is intended to lay the foundation for the canonical analysis of the full exceptional field theory. Due to the complexity of the exceptional generalized geometry, the complicated topological term—so far also lacking an explicit non-integral expression—and the many covariantization terms involved, the complete treatment of the exceptional generalized geometry is beyond the scope of this preparatory work. As a result of this preparation, we can now focus on the more difficult tasks of understanding the internal exceptional geometry and working out its canonical structure. As was explained in the Introduction, there are many open questions one can ask about canonical exceptional field theory and the exceptional generalized geometry. Perhaps one of the most intriguing questions concerns the exact role of the section condition—notably its relevance to the closure of the gauge algebra. To cast light on this and other topics, we plan to proceed with the canonical analysis of the full exceptional field theory in a future publication.
ACKNOWLEDGMENTS
This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme (“Exceptional Quantum Gravity,” Grant Agreement No. 740209).
L.T.K. was supported by the International Max Planck Research School for Mathematical and Physical Aspects of Gravitation, Cosmology and Quantum Field Theory.
L.T.K. is particularly grateful to Axel Kleinschmidt for his support, help, and many useful discussions. Furthermore, L.T.K. would like to thank Matteo Broccoli, Franz Ciceri, Jan Gerken, Emanuel Malek, Hermann Nicolai, and Stefan Theisen for comments and useful discussions.
DATA AVAILABILITY
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
APPENDIX A: BASICS OF CANONICAL ANALYSIS
In this section, we briefly review the basics of how to analyze a theory using the canonical formalism. Canonical analysis is a synonym for Hamiltonian analysis. A full introduction to this subject is far beyond the scope of this work, and we omit many details in this section. For an in-depth treatment, see Refs. 14 and 15. This section is mainly based on the first chapters in Ref. 15.
The starting point of the canonical analysis is most often the Lagrangian formulation of a theory. Starting from an action functional S, given by the d-dimensional space–time integral over some Lagrangian density , which, in turn, depends only on some fields qn(x) and their time derivatives [see Eq. (A1)]. The space–time coordinates are xμ with indices μ = 0, …, d − 1. Note that in this section, the Latin indices are used to label fields and constraints of the theory.
The canonical formalism aims to treat the fields and their conjugate canonical momenta on an equal footing. We introduce the canonical momenta pn as defined by Eq. (A2). In the case of a field theory, the derivative is to be understood as a functional derivative. We define the Hamiltonian density as the Legendre transformation of the Lagrangian density with respect to the time derivatives of the fields [see Eq. (A3)]. One can then rewrite the action functional as in Eq. (A4),
Due to the Legendre transformation, we have now found a function that depends on two sets of variables, which we call the canonical variablesqn(x) and the canonical momentapn(x). We call the tuple (q, p) the canonical coordinates or phase space coordinates and the space that they describe phase space.
We can think of gauge theories as theories where, at any given time, the dynamical content of the theory is relative to an arbitrary reference frame.15 Hence, the general solution of a gauge theory necessarily contains arbitrary functions of time, since it is always permitted to transform the reference frame. This arbitrariness leads to the canonical coordinates not being completely independent, and therefore, it is equivalent to the existence of constraints on the phase space. In a gauge theory, we will always find constraints {Φm(q, p) = 0, m = 1, …, M} that only depend on the canonical coordinates and not on their time derivatives.
The first type of constraint follows directly from the definition of the canonical momenta [Eq. (A2)], which is why we call them primary constraints. The simplest primary constraint is given by a vanishing canonical momentum Φ(q, p) = pn = 0; however, more complicated types are also common. Since we have not made use of the time evolution, the primary constraints do not restrict the kinematics and are true identities.
A consequence of the existence of primary constraints is that the Hamiltonian becomes non-unique in phase space since we are free to add primary constraints with arbitrary coefficients um(q, p) to the Hamiltonian. We call this function the total Hamiltonian [see Eq. (A5)]. The phase space hypersurface described by {Φm = 0, ∀m} is called the primary constraint surface, and on this surface, the Hamiltonian is still uniquely defined,
For two phase space functions F, G, we define the Poisson bracket by the following equation:
Using the Poisson bracket, we can write the time evolution of a phase space function F that follows from the total Hamiltonian of Eq. (A5) as Eq. (A7). At this point, this is the most general time evolution we can write down
Since we need the primary constraints to hold for all times—in order for the formalism to be consistent—we arrive at Eq. (A8). If Eq. (A8) yields a relation that is independent of the arbitrary parameters um and independent of the primary constraints, then we call it a secondary constraint. Since we need to ensure that it is also conserved in time, we iterate this procedure for all secondary constraints and potentially end up with more constraints. The consistency requirements that stem from secondary constraints are sometimes also referred to as tertiary, quaternary, etc., constraints. For reasonable physical theories, the termination of this procedure is guaranteed,
The only difference between primary and secondary constraints is that secondary constraints do restrict the kinematics, since we have made use of the time evolution in order to find them. Note that this means that we are not allowed to add the secondary constraints to the Hamiltonian in the manner that we have done with the primary constraints in Eq. (A5). Since further distinction between primary and secondary constraint is not needed, we denote the complete set of constraints by {Φj, j = 1, …, J}.
If we apply the time evolution equation (A7) to the full set of constraints, we will not find any new constraints. However, the set of time evolution equations can be seen as a set of differential equations for the—a priori—arbitrary phase space functions um(q, p). Solving this system, we can write the general solution using the homogeneous solution and a particular solution Um as in Eq. (A9). The coefficients of the homogeneous solution va are then truly arbitrary,
We write the ≈ sign to indicate an equality that holds on the constraint surface, e.g., , we call this relation weak equality. The time evolution generated by the total Hamiltonian via is equivalent to the Lagrangian time evolution.
An important property of the constraints is whether they are first class [see Eq. (A10)] or second class constraints [see Eq. (A11)]. Second class constraints require further treatment, since they do not occur in the theory being analyzed in this work and we again refer the reader to Ref. 15. Note that the definition of the class relies on the weak equality, meaning that first class constraints Poisson-commute with all other constraints up to terms proportional to constraints. This also implies that if all the constraints are first class, the Poisson bracket algebra of the constraints closes automatically,
First class constraints can be interpreted as generators of gauge transformations. Mathematically, this is only guaranteed for first class primary constraints; however, for reasonable physical theories, this is also true for first class secondary constraints. This is also known as the Dirac conjecture. Counterexamples exist and are known.15 Gauge transformations δF on a phase space function F are generated as in Eq. (A12), where ,
We can add all first class constraints to the total Hamiltonian to describe the time evolution with the full gauge freedom accounted for. We call this the extended Hamiltonian [see Eq. (A13)], where {γa} is the set of all first class constraints,
The extended Hamiltonian generates time evolution via Eq. (A14). This can be seen as an extension of the Lagrangian framework since the full gauge freedom is now manifest in the time evolution,
This only concerns gauge variant quantities, since for gauge invariant quantities (observables), all the Hamiltonian time evolutions are equivalent .
Furthermore, it is a general feature of generally covariant theories that the Hamiltonian vanishes weakly —implying that the Hamiltonian consists of a linear combination of constraints. In particular, general relativity and, thus, also all supergravity theories are generally covariant theories. In this form, the time evolution is interpreted as a yet another gauge transformation.
APPENDIX B: POISSON BRACKET RELATIONS
In this appendix, we list some Poisson bracket relations that are intermediate results or otherwise useful for computations,
APPENDIX C: OTHER USEFUL FORMULAS
In this appendix, we discuss some general formulas that are needed for the computations of the main sections.
From the definition of the (vielbein) determinant and the fact that its variation can be expressed as , we find the following identity:
Furthermore, we can exploit the fact that we can over-antisymmetrize a tensor to make it vanish—e.g., we take an object with D + 1 indices in D dimensions and antisymmetrize them to get zero—this is sometimes called the Schouten identity. One can think of this identity as the fact that there cannot be D + 1 linearly independent vectors in D dimensions. Equation (C2) states the identity in four dimensions, where ϵ is the Levi–Cività symbol in four dimensions and vk are vector components,
If we expand this expression, we arrive at the following equation:
APPENDIX D: TREATMENT OF COSET CONSTRAINT EXAMPLE SL(n)/SO(n)
As explained in Sec. I C, we want to treat the USp(8) coset constraints on MMN as being implicit. To explain this formalism, we consider the much simpler coset SL(n)/SO(n) as an example to demonstrate how one would explicitly or implicitly treat the coset constraints.
We start with the sigma model Lagrangian of a generic symmetric matrix MMN with M, N = 1, …, n and just the time derivative for simplicity. The inverse matrix is defined via . We then explicitly add the SL(n)/SO(n) coset constraint c ≔ det(M) − 1 with the Lagrange parameter ϕ to the Lagrangian arriving at Eq. (D1). This is the starting point for the explicit treatment of the coset constraints,
We can now calculate the canonical momenta and find the primary constraint (D3),
The canonical Hamiltonian of this theory is given by Eq. (D4), where ΠMN(M) ≔ −ΠRS(M) MRMMSN. The total Hamiltonian is given by ,
As explained in Appendix A, we now need to check the consistency of the primary constraint (D3) using Eq. (A7) and find that there is one secondary constraint given by Eq. (D5) as intended,
Considering Eq. (A7) now for the consistency of the secondary constraint, we find that there is one tertiary constraint given by Eq. (D6). In this calculation, we have used Eq. (D7), which follows directly from the fundamental Poisson bracket of a symmetric matrix with its canonical momentum (D8)—which is in form identical to the fundamental Poisson bracket in Eq. (3.7),
The tertiary constraint p—induced by the consistency of the coset constraint det(M) = 1—can be interpreted as a coset constraint on the momenta implying the tracelessness of the sl(n) algebra element MRN ΠNS(M).
Applying Eq. (A7) to the tertiary constraint, we do not find any new constraints and the constraint system is consistent.
We immediately find that the primary constraint Π(ϕ) is a first class constraint. The secondary and tertiary constraints however form a second class system of constraints since the right-hand side of Eq. (D11) necessarily contains a constant term and cannot be rewritten solely in terms of the secondary constraint—the factor n here comes from ,
The presence of these two second class constraints tells us that we should define a Dirac bracket {g, f}DB as defined in Ref. 15 given by Eq. (D12), where f, g are two functions on phase space. We find by construction that all constraints commute in the Dirac bracket,
One may have noted that Eq. (D7)—which is a general result for any symmetric matrix—seems incompatible with the secondary constraint det(M) = 1, which defines the SL(n)/SO(n) coset. This seeming inconsistency is resolved by the fact that we are not allowed to apply the canonical constraints before evaluating all Poisson brackets fully.
For the Dirac bracket, this is no longer true, however,15 and we are free to apply the canonical constraints inside the Dirac bracket yielding the following more expected equation:
We can furthermore compute the number of degrees of freedom. We have the canonical variables MMN, ΠKL(M), ϕ, Π(ϕ), which have components—a priori the variables MMN, ΠKL(M) are still generic symmetric matrices. We have one first class primary, one second class secondary, and one second class tertiary constraints, which means we should count them as 2 · 1 + 1 + 1.15 This leaves us with n2 + n − 2 physical degrees of freedom in phase space or physical degrees of freedom in field space.
This is the full explicit treatment of the coset constraints in the SL(n)/SO(n) case.
The implicit treatment of the coset constraints uses the Lagrangian without adding the coset constraints explicitly . We can nonetheless apply them after all Poisson brackets have been computed, thus considering them implicitly. The price one pays for the implicit treatment of the constraints is that we are not able to apply canonical constraints inside the brackets. In return, we are able to skip the canonical analysis of the coset constraints—which are much more complicated for the USp(8) coset—if we are not interested in applying them inside brackets. We can compute the canonical degrees of freedom by looking at the dimension of the coset, in this case dim(SL(n)/SO(n)). We thus arrive at the same result as in the explicit treatment.
For an alternative canonical treatment of coset space sigma models using the vielbein formalism, see Ref. 31.
REFERENCES
We use the vielbein formalism because it makes the full Lorentz symmetry manifest and this makes it possible to include the symmetry in the canonical analysis. Furthermore, this formalism is necessary when coupling gravity to fermions. We will not be dealing with fermions in this work, but the vielbein formalism makes it easier to extend the work to fermions.
One can consider other values of κ if one is not interested in the maximally supersymmetric theory. In Ref. 23, the ungauged minimal supergravity theory with the value is considered (without the scalar field terms and with a single gauge field) and yields a G2 symmetry upon compactification to three dimensions.