We provide a visual and intuitive introduction to effectively calculating in 2-groups along with explicit examples coming from non-Abelian 1- and 2-form gauge theory. In particular, we utilize string diagrams, tools similar to tensor networks, to compute the parallel transport along a surface using approximations on a lattice. We prove a convergence theorem for the surface transport in the continuum limit. Locality is used to define infinitesimal parallel transport, and two-dimensional algebra is used to derive finite versions along arbitrary surfaces with sufficient orientation data. The correct surface ordering is dictated by two-dimensional algebra and leads to an interesting diagrammatic picture for gauge fields interacting with particles and strings on a lattice. The surface ordering is inherently complicated, but we prove a simplification theorem confirming earlier results of Schreiber and Waldorf. Assuming little background, we present a simple way to understand some abstract concepts of higher category theory. In doing so, we review all the necessary categorical concepts from the tensor network point of view as well as many aspects of higher gauge theory.

We use string diagrams to express many concepts in gauge theory in the broader context of two-dimensional algebra. By two-dimensional algebra, we mean the manipulation of algebraic quantities along surfaces. Such manipulations are dictated by 2-category theory and we include a thorough and visual introduction to 2-categories based on string diagrams. Such string diagrams, including their close relatives known as tensor networks, have been found to provide exceptionally clear interpretations in areas such as open quantum systems,1 foundations of quantum mechanics,2 entanglement entropy,3 and braiding statistics in topological condensed matter theory,4 to name a few.

We postulate simple rules for associating algebraic data to surfaces with boundary and use the rules of two-dimensional algebra to derive non-Abelian surface transport from infinitesimal pieces arising from a triangulation/cubulation of the surface. One of the novelties in this work is an analytic proof for the convergence of surface transport together with a more direct derivation of the iterated surface integral than what appears in Ref. 5, for instance. To be as self-contained as possible, we include discussions on gauge transformations, orientation data on surfaces, and a two-dimensional calculation of a Wilson cube deriving the curvature 3-form. We also review ordinary transport for particles to make the transition from one-dimensional algebra to two-dimensional algebra less mysterious.

Ordinary algebra, matrix multiplication, group theory, etc., are special cases of one-dimensional algebra in the sense that they can all be described by ordinary category theory. For example, a group is a type of category that consists of only a single object. Thanks to the advent of higher category theory, beginning with the work of Bénabou on 2-categories,6 it has been possible to conceive of a general framework for manipulating algebraic quantities in higher dimensions. In particular, monoidal categories and the string diagrams associated with them7 can be viewed as 2-categories with a single object. The special case of this where all algebraic quantities have inverses are known as 2-groups, with a simple review given in Ref. 8 and a more thorough investigation in Ref. 9. We do not expect that the reader is knowledgeable of these definitions and we only assume that the reader knows about Lie groups (even a heuristic knowledge will suffice since our formulas will be expressed for matrix groups).

While there already exist several articles,8,10,11,12 introducing the “conceptual” basic ideas of higher gauge theory and parallel transport for strings in terms of category theory and even a book by Schreiber describing the mathematical framework of higher-form gauge theories,13 there are few articles that provide explicit and computationally effective methods for calculating such parallel transport.14 Although Girelli and Pfeiffer explain many ideas, most results useful for computations are infinitesimal and it is not clear how to build local quantities from the infinitesimal ones.11 Baez and Schreiber15 focus on similar aspects as we do in this article, but our presentation is significantly simplified since we assume certain results on path spaces without further discussion, such as relationships between differential forms on a manifold and smooth functions on its path space, and therefore do not deal with the delicate analytical issues on such path spaces. Our goal is to provide tools and visualizations to perform more intuitive calculations involving mainly calculus and matrix algebra.

In 1973, Kalb and Ramond first introduced the idea of coupling classical Abelian gauge fields to strings.16 Actions for interacting charged strings were written down together with equations of motions for both the fields and the strings themselves. Furthermore, some of the quantization of the theory was discussed. The next big step took place in 1985 with the work of Teitelboim (aka Bunster) and Henneaux, who introduced higher form Abelian gauge fields that could couple to higher-dimensional manifolds.17,18 In Ref. 17, Teitelboim studied the generalization of parallel transport for higher dimensional surfaces and concluded that non-Abelian p-form gauge fields for p ≥ 2 cannot be coupled to p-dimensional manifolds in order to construct parallel transport. The conclusion was that the only possibilities for string interactions involved Abelian gauge fields. As a result, it seemed that only a few tried to get around this in the early 1980s. For example, the non-Abelian Stokes theorem came from analyzing these issues in the context of Yang-Mills theories and confinement19 (see also, for instance, Sec. 5.3 of Ref. 20). Although such calculations led people to believe defining non-Abelian surface parallel transport is possible, the expressions were not invariant under reparametrizations and they did not seem well-controlled under gauge transformations. Without a different perspective, interest in it seemed to fade.

The crux of the argument of Teitelboim is related to the fact that higher homotopy groups are Abelian. This is sometimes also known as the Eckmann-Hilton argument.8 However, Whitehead in 1949 realized that higher “relative” homotopy groups can be described by “non-Abelian” groups.21 In fact, it was Whitehead who introduced the concept of a crossed module to describe homotopy 2-types. This work was in the area of algebraic topology and the connection between crossed modules and higher groups was not made until much later. A review of this is given in Ref. 8. Eventually, non-Abelian generalizations of parallel transport for surfaces were made by Girelli and Pfeiffer using category theory and ideas from homotopy theory stressing that one should also associate differential form data with lower-dimensional submanifolds.11 Before this, most of the work on non-Abelian forms associated with higher-dimensional objects did not discuss parallel transport but developed the combinatorial and cocycle data building on the foundational work of Breen and Messing.22,10,23 This cocycle perspective eventually led to the field of non-Abelian differential cohomology.13,24,25 The idea of decorating lower-dimensional manifolds is consistent with the explicit locality exhibited in the extended functorial field theory approach to axiomatizing quantum field theories.26,27,28,29 Recently, in a series of four papers, Schreiber and Waldorf axiomatized parallel transport along curves and surfaces,30,5,31,12 building on an earlier work of Caetano and Picken.32 

We have already indicated one of the motivations of pursuing an understanding of parallel transport along surfaces, namely, in the context of string theory. Strings can be charged under non-Abelian groups and interact via non-Abelian differential forms. Just as parallel transport can be used to described non-perturbative effects in ordinary gauge theories for particles, parallel transport along higher-dimensional surfaces might be used to describe non-perturbative effects in string theory and M-theory. Yet another use of parallel transport is in the context of lattice gauge theory where it is used to construct actions whose continuum limit approaches Yang-Mills type actions.33 

Higher form symmetries have also been of recent interest in high energy physics and condensed matter in the exploration of surface operators and charges for higher-dimensional excitations.34 However, the forms in the latter are strictly Abelian and the proper mathematical framework for describing them is provided by “Abelian” gerbes (also known as higher bundles)35,36 and Deligne cohomology. Higher non-Abelian forms appear in many other contexts in physics, such as in a stack of D-branes in string theory,37 in the ABJM model,38 and in the quantum field theory on the M5-brane.39 In fact, the authors of Ref. 38 show how higher gauge theories provide a unified framework for describing certain M-brane models and how the 3-algebras of Ref. 40 can be described in this framework. Further work, including an explicit action for modeling M5-branes, was provided recently in Ref. 41.

Although a description of the non-Abelian forms themselves is described by higher differential cohomology,13 parallel transport seems to require additional flatness conditions on these forms.8,15,11,10,13,42 For example, in the special case of surfaces, this condition is known as the vanishing of the fake curvature. Some argue that this condition should be dropped and the existence of parallel transport is not as important for such theories.43 However, our perspective is to take this condition seriously and work out some of its consequences. Indeed, since higher-dimensional objects can be charged in many physical models besides just string theory, parallel transport might be used to study non-perturbative or effective aspects of these theories, an important tool to understand quantization (see the discussion at the end of Ref. 44). Because it is not yet known how to avoid these flatness conditions, further investigation is necessary, with some recent progress by Waldorf.25,42

Therefore, because of the subject’s infancy, it is a good idea to devote some time to “calculating” surface transport explicitly to better understand how branes of different dimensions can be charged under various gauge groups. Here, we focus on the case of two-dimensional surfaces such as strings or D1-branes. However, we make no explicit reference to any known physical models. For these, the reader is referred to the other studies in the literature, for example, Ref. 41 and the references therein.

In Sec. II, we describe how categorical ideas can be used to express a mix of algebraic and geometric concepts. In Sec. II A, we review “string diagrams” for ordinary categories and how group theory arises as a special case of ordinary category theory. In Sec. II B, we define 2-categories and other relevant structures, providing a two-dimensional visualization of the algebraic quantities in terms of string diagrams. In Sec. II C, we specialize to the case where the algebraic data are invertible. We restrict attention to strict 2-groups, which is sufficient for many interesting applications.34,45,38,41,46

In Sec. III, we describe how gauge theory for 0-dimensional objects (particles) and 1-dimensional objects (strings) can be expressed conveniently in the language of two-dimensional algebra. In detail, in Sec. III A, we review how classical gauge theory for particles is described categorically. We include a review of the formula for parallel transport describing it in terms of one-dimensional algebra as an iterated integral obtained from a lattice discretization and a limiting procedure. In Sec. III B, we include several crucial calculations for gauge theory for 1-dimensional objects (strings) expressing everything in terms of two-dimensional algebra. In particular, we derive the local infinitesimal data of a higher gauge theory. To the best of our knowledge, these ideas seem to have first been analyzed in Refs. 22, 11, and 15, though our inspiration for this viewpoint came from Ref. 47. Furthermore, we use the rules of two-dimensional algebra to derive an “explicit” formula for the discretized and continuous limit versions of the local parallel transport of non-Abelian gauge fields along a surface. Although such a formula appears in the literature,15,5 we provide a more intuitive derivation as well as a useful expression for lattice computations. We provide a picture for the correct surface ordering needed to describe parallel transport along surfaces with non-Abelian gauge fields in Proposition 3.5 and the discussion surrounding this new result. We then proceed to prove that the surface ordering can be dramatically simplified in Theorem 3.8. In Remark 3.9, we show that our resulting formula agrees with the one given by Schreiber and Waldorf which was obtained through different means.5 In Sec. III C, we study the gauge covariance of the earlier expressions and derive the infinitesimal counterparts in terms of differential forms. In Sec. III D, we discuss the subtle issue of orientations of surfaces and how our formalism incorporates them. In Sec. III E, we again use two-dimensional algebra to calculate a Wilson cube on a lattice and from it obtain the 3-form curvature. We then study how it changes under gauge transformations showing consistency with the results of Girelli and Pfeiffer.11 

Finally, in Sec. IV, we discuss some indication as to how these ideas might be used in physical situations and indicate several open questions.

We will present categories in terms of what are known as “string diagrams” since we find that they are simpler to manipulate and compute with when working with 2-categories. Therefore, we will define categories, functors, and natural transformations in terms of string diagrams. Afterward, we will make a simplification and discuss special examples of categories known as groups.

Definition 2.1.

A “category,” denoted by C, consists of

  • a collection of 1-d domains (also known as objects)

(labeled for now by some color),
  • (ii)

    between any two 1-d domains, a collection (which could be empty) of 0-d defects (also known as morphisms)

(labeled by lower-case Roman letters),
  • (iii)

    an “in series” composition rule

whenever 1-d domains match, and
  • (iv)

    between every 1-d domain and itself, a specified 0-d defect

called the identity.

These data must satisfy the conditions that

  • the composition rule is associative and

  • the identity 0-d defect is a left and right identity for the composition rule.

Remark 2.2.
Technically, 0-d defects have a direction/orientation. In this paper, the convention is that we read the expressions from right to left. Hence, g is thought of as “beginning” at A and “ending” at R or transitioning from A to R. In many cases, as in the theory of groups, we will always be able to go back by an inverse operation. However, in general, g will merely be a transformation from A to R. If at any point confusion may arise as to the direction, we will signify with an arrow close to the 0-d defect
and we may occasionally use the notation
(2.1)
instead and call the 1-d domains objects and the 0-d defects morphisms. For the reader familiar with categories, we are visualizing morphisms in terms of their Poincaré duals. The motivation for using the terminology of domains and defects comes from physics (see Remark 2.9 for more details).

Example 2.3.

Let G be a group. From G, one can construct a category, denoted by BG, consisting of only a single domain (say, red) and the collection of 0-d defects from that domain to itself consists of all the elements of G. The composition is group multiplication. The identity at the single domain is the identity of the group.

The previous example of a category is the one in which all 0-d defects are invertible.

Definition 2.4.

Let C and D be two categories. A “functor” F:CD is an assignment sending 1-d domains in C to 1-d domains in D and 0-d defects in C to 0-d defects in D satisfying

  • the source-target matching condition

  • (b)

    preservation of the identity

  • (c)

    and preservation of the composition in series

This last condition can be expressed by saying that the following triangle of defects commutes
meaning that going left along the top two parts of the triangle and composing in series is the same as going left along the bottom.

There are several ways to think about what functors do. One, a functor can be viewed as a “construction” in the sense that one begins with data and from them constructs new data in a consistent way. Another perspective is that functors are invariants and give a way of associating information that only depends on the isomorphism class of 1-d defects. Another perspective that we will find useful in this article is to think of a functor as attaching algebraic data to geometric data. We will explore this last idea in Sec. III A and generalize it in Sec. III B. Yet another perspective is to view categories more algebraically and think of a functor as a generalization of a group homomorphism since the third condition in Definition 2.4 resembles this concept. We will explore this last perspective in in the following example:

Example 2.5.

Let G and H be two groups, and let BG and BH be their associated one-object categories as discussed in Example 2.3. Then, functors F:BGBH are in one-to-one correspondence with group homomorphisms f : GH.

Definition 2.6.
Let C and D be two categories and F,G:CD be two functors. A “natural transformation” σ: FG is an assignment sending 1-d domains of C to 0-d defects of D in such a way so that
and to every 0-d defect
the condition
must hold.
The last condition in the definition of a natural transformation can be thought of as saying both ways of composing in the following “square”
are equal (the arrows have been drawn to be clear about the order in which one should multiply), i.e., as an algebraic equation without pictures
(2.2)
Natural transformations can be composed though we will not need this now and will instead discuss this in greater generality for 2-categories later.

Example 2.7.
Let G be a group and BG be its associated category. Let VectK be the category of vector spaces over a field K. Namely, the 1-d domains are vector spaces and the 0-d defects are K-linear operators between vector spaces. Let us analyze what a functor ρ:BGVectK is. To the single 1-d domain of BG,ρ assigns to it some vector space, V. To every group element gG, i.e., to every 0-d defect of BG,ρ assigns an invertible operator ρ(g): VV. This assignment satisfies ρ(e) = idV and ρ(gh) = ρ(g)ρ(h). Thus, the functor ρ encodes the data of a representation of G. Now, let ρ and ρ′ be two representations, where the vector space associated with ρ′ is denoted by V′. A natural transformation σ: ρρ′ consists of a single linear operator σ: VV′, satisfying the condition
(2.3)
for all gG. In other words, a natural transformation encodes the datum of an intertwiner of representations of G. Intertwiners are used to relate two different representations. For instance, the Fourier transform is a unitary intertwiner between the position and momentum representations of the Heisenberg algebra in quantum mechanics. As another class of examples, all tensor operators in quantum mechanics are intertwiners.48 

2-categories provide one realization of manipulating algebraic data in two dimensions.

Definition 2.8.

A “2-category,” denoted by C, consists of

  • a collection of 2-d domains (also known as objects)

(labeled for now by some color),
  • (ii)

    between any two 2-d domains, a collection (which could be empty) of 1-d defects (also known as 1-morphisms or domain walls)

(labeled by lower-case Roman letters),
  • (iii)

    between any two 1-d defects that are themselves between the same two 2-d domains, a collection (which could be empty) of 0-d defects (also known as 2-morphisms or excitations)

(labeled by lower case Greek letters),
  • (iv)

    an “in parallel” composition (also known as horizontal composition) rule for 1-d defects

  • (v)

    an “in series” composition (aka vertical composition) rule for 0-d defects

  • (vi)

    an “in parallel” composition (aka horizontal composition) rule for 0-d defects

and
  • (vii)

    every 2-d domain R has both an identity 1-d defect and an identity 0-d defect

respectively, and every 1-d defect has an identity 2-d defect
These data must satisfy the following conditions:
  • All composition rules are associative (this will be implicit in drawing the diagrams as we have).

  • The identities obey rules exhibiting them as identities for the compositions.

  • The composition in series and in parallel must satisfy the “interchange law”

meaning that the final diagram is unambiguous, i.e.,
(2.4)

These laws guarantee the well-definedness of concatenating defects in all allowed combinations.

Remark 2.9.
Technically, both 1-d defect and 0-d defects have directions. Our convention in this paper is that 1-d defects are read from right to left and 0-d defects are read from top to bottom. The above depiction of 2-categories is related to the usual presentation of 2-categories via
Occasionally, it will be convenient to move diagrams around and draw them sideways or in other directions for visual purposes. In these cases, we will label the directionality when it might be unclear. We prefer the “string diagram” approach as opposed to the “globular” approach because string diagrams are used in more areas of physics such as in condensed matter49 and open quantum systems.1 The terminology of domains, domain walls, defects, and excitations comes from physics.49 
Using this definition, we can make sense of combinations of defects such as
interpreting it as the composition in parallel of the top two 1-d defects along the common 2-d domain (drawn in green)
In fact, a 0-d defect can have any valence with respect to 1-d defects
but it is important to keep in mind which 1-d defects are incoming and outgoing. Our convention is that all incoming 1-d defects come from above the 0-d defect and all outgoing 1-d defects go towards the bottom of the page. Occasionally, we will go against this convention, and we will rely on the context to be clear, or to be cautious, we may include arrows to indicate the direction. For example, this last 4-valence diagram might be drawn as
Furthermore, we can define composition in parallel between a 1-d defect and a 0-d defect as in
by viewing the 1-d defect with an identity 0-d defect and then use the already defined composition of 0-d defects in parallel
A similar idea can be used if the right side was just a 1-d defect. Using these rules, we can make sense of diagrams such as
by extending the left “dangling” 1-d defect to the bottom and the right “dangling” 1-d defect to the top as follows:
Then, we can compose in parallel to obtain
and finally compose in series
One must be cautious in such an expression. It does not make sense to compose σ with idg alone in series because k is an outgoing 1-d defect from σ. Therefore, the expression σidiidgτ must be calculated by first composing in parallel and then one can compose the results in series as we have done. It may be less ambiguous to write this expression as (σidi)(idgτ).More details can be found in Joyal and Street’s seminal paper on the invariance of string diagrams under continuous deformations50 or in many introductory accounts of string diagrams in 2-categories. Examples of 2-categories related to groups will be given in Sec. II C.

Example 2.10.
Let Hilb be the category of Hilbert spaces, i.e., 1-d domains are Hilbert spaces and 0-d defects are bounded linear operators. Let HilbIsom be the subcategory whose 1-d domains are Hilbert spaces and whose 0-d defects are isometries. Finally, let HilbIsomproj be the 2-category whose 2-d domains are Hilbert spaces, 1-d defects are isometries, and 0-d defects are elements of U(1). More precisely, given two Hilbert spaces H and H and two isometries L,K:HH, a 0-d defect from L to K is an element λU(1) such that K = λL. The composition in series is given by the product of elements in U(1)
and the in parallel composition is also defined by the product of elements in U(1)
The products LL′ and KK′ are given by the composition of linear operators. The reader should check that this is indeed a 2-category.

Example 2.11.
A common 2-category that appears in tensor networks in quantum information theory is Hilb.1 In this 2-category, there is only a single object (2-d domain). The 1-d defects are Hilbert spaces and 0-d defects are bounded linear transformations. The parallel composition of Hilbert spaces and bounded linear transformations is the tensor product. The series composition of linear transformations is the functional composition of these operators. It is a basic property of the tensor product and functional composition that if HfKgJ and HfKgJ are given, then
(2.5)
This equality is precisely the interchange law for the compositions in 2-categories, but writing the composition in two dimensions, namely, vertically and horizontally, makes it more clear that these expressions are equal. Note that the identity Hilbert space for the parallel composition, the tensor product, is the Hilbert space of complex numbers C. Technically, this is not an identity on the nose, nor is the tensor product strictly associative, but one can safely ignore this issue due to MacLane’s coherence theorem on monoidal categories.51 

Remark 2.12.

Kitaev and Kong provide more examples of 2-categories in their discussion of domains, defects, and excitations in the context of condensed matter.49 In their language, we are viewing excitations as generalized defects.

Definition 2.13.
Let C and D be two 2-categories. A “(normalized) weak functor” F:CD is an assignment sending d-dimensional domains/defects of C to d-dimensional domains/defects of D together with an assignment cF that associates with every pair of parallel composable 1-d defects f and g in C, an invertible 0-d defect in D interpolating from F(f)F(g) to F(fg) as in
These assignments must satisfy the following conditions:
  • The assignment F is such that all sources and targets are respected, i.e.,

  • (b)

    All identities are preserved (this is the “normalized” condition).

  • (c)

    For any 1-d defect f

the equalities
i.e.,
(2.6)
must hold.
  • (d)

    To every triple of parallel composable 1-d defects

the equality
i.e.,
(2.7)
must hold.

If cf,gF is the identity for all f and g in C, then F is said to be a “strict functor.”

Remark 2.14.
For each pair of composable 1-d defects f and g, the 0-d defect cf,gF can be thought of as filling in the triangle from the comments following Definition 2.4 by enlarging the 1-d domains to 2-d domains and enlarging the 0-d defects to 1-d defects. Condition (d) resembles associativity. In fact, it is an example of a cocycle condition and will be discussed more in the following example (in particular, this definition allows one to define higher cocycles for non-Abelian groups). Condition (d) can also be re-written as
which illustrates a connection to Pachner moves for triangulations of surfaces. However, this latter presentation requires arrows to keep track of incoming vs outgoing directions.

Examples of weak functors abound. For example, projective representations are described by weak functors that are not strict functors as will be explained in the following example. Weak functors can also be used to define the local cocycle data of higher bundles.24 Since we will be working locally for simplicity, we will make little use of weak functors, but have included their discussion here for completeness and so that the standard definitions of higher bundles may be less mysterious.14,12,24 Strict functors will be used as a means of “defining” parallel transport along surfaces in gauge theory in Sec. III B. Natural transformations will be used to “define” gauge transformations of such functors and their infinitesimal counterparts will be “derived” from these definitions.

Example 2.15.
Let G be a group and BG be its associated category (see Example 2.3). Every category, such as BG, can be given the structure of a 2-category by adding only identity 0-d defects. Namely, there is only a single 2-d domain, the 1-d defects are elements of G, and the 0-d defects are all identities. This 2-category will also be denoted by BG. Let HilbIsomproj be the 2-category introduced in Example 2.10. A weak normalized functor ρ:BGHilbIsomproj encodes the data of a Hilbert space H, a function ρ:GU(H), and a function cρ: G × GU(1) in such a way so that to every pair of elements g, hG
i.e.,
(2.8)
and also
(2.9)
This provides the datum of a (normalized) projective unitary representation of G on a Hilbert space H (ignoring any continuity conditions).52,53 Notice that c automatically satisfies the condition
(2.10)
for all g, h, kG by multiple applications of (2.8). Hence, any (normalized) projective unitary representation automatically satisfies (2.10) and therefore defines a weak normalized functor ρ:BGHilbIsomproj since (2.10) is condition (2.7) in this case.

Definition 2.16.

Let F,G:CD be two weak functors between two 2-categories. A “natural transformation” σ: FG is an assignment sending k-d domains/defects of C to (k − 1)-d defects of D for k = 1, 2 satisfying the following conditions:

  • The assignment is such that

and
(the diagram on the right can be thought of as filling in the square from the comments following Definition 2.6—rotate the square by 45° counterclockwise to see this more clearly).
  • (b)

    To every pair of parallel composable 1-d defects

the equality
i.e.,
(2.11)
must hold.
  • (c)

    To every identity 1-d defect idR, the equality

(2.12)
must hold.
  • (d)

    To every 0-d defect

the equality
i.e.,
(2.13)
must hold.

Such string diagram pictures facilitate certain kinds of computations54 (for instance, compare the definition of natural transformation in Fig. 10 of Ref. 54). Natural transformations between functors can be thought of as symmetries. For example, just as natural transformations of functors between ordinary categories can be used to describe intertwiners for ordinary representations, natural transformations of functors between 2-categories can be used to describe intertwiners of projective representations.

Example 2.17.
Using the notation of Example 2.15, let ρ,π:BGHilbIsomproj be two projective unitary representations on H and K with cocycles cρ and cπ, respectively. A natural transformation σ: ρπ provides an isometry σKH:HK and a function σ: GU(1), whose value on g is denoted by σg and fits into
which, in particular, says
(2.14)
This provides the data of an intertwiner of projective unitary representations.53 Notice that σ automatically satisfies the condition
(2.15)
for all g, hG. To see this, first notice that
(2.16)
Applying σKH*, the adjoint of σKH, on the left of these equations and canceling the ρ(g)ρ(h) from the right reduces to condition (2.15) because σKH is an isometry (so that σKH*σKH=idH). Hence, an intertwiner of projective unitary representations automatically satisfies (2.15) and therefore defines a natural transformation ρπ since (2.15) is condition (2.11) in this case [Eq. (2.13) is vacuously true].

It will be important to compose natural transformations. This will correspond to iterating gauge transformations successively.

Definition 2.18.
Let E,F,G:CD be two weak functors between two 2-categories and let σ: FG and λ: EF be two natural transformations. The vertical composition of σ with λ, written as (read from top to bottom)
(2.17)
is a natural transformation EG defined by the assignment
on 2-d domains and
on 1-d domains.

Technically, one should check this indeed defines a natural transformation. This is a good exercise in two-dimensional algebra. There are actually similar symmetries between natural transformations, called “modifications,” which we define for completeness.

Definition 2.19.

Let F,G:CD be two weak functors between two 2-categories and σ, ρ: FG be two natural transformations. A “modification” m: σρ assigns to every 2-d domain of C a 0-d defect in D such that the following conditions hold:

  • The assignment is such that

  • (b)

    To every 1-d defect

the equality
i.e.,
(2.18)
must hold.

A convenient class of 2-categories are those for which there is only a single 2-d domain and all defects are invertible under all compositions. Such a 2-category is called a “2-group.” 2-groups therefore only have labels on 1-d and 0-d defects. They can be described more concretely in terms of more familiar objects, namely, ordinary groups.

Definition 2.20.
A “crossed module” is a quadruple G(H,G,τ,α) consisting of two groups, G and H, and two group homomorphisms, τ: HG and α: G → Aut(H), satisfying the two conditions
(2.19)
and
(2.20)
for all gG and h, h′ ∈ H. Here, Aut(H) is the automorphism group of H, i.e., invertible group homomorphisms from H to itself. If the groups G and H are Lie groups and the maps τ and α are smooth [technically, α smooth means the associated map G × HH sending (g, h) to αg(h) is smooth], then (H, G, τ, α) is called a “Lie crossed module.”

Examples of crossed modules abound.

Example 2.21.

Let G be any group, HG, τ ≔ idG, and let α be conjugation.

Example 2.22.

Let H be any group, G ≔ Aut(H), let τ(h) be the automorphism defined by τ(h)(h′) ≔ hhh−1 for all h, h′ ∈ H, and set α ≔ idAut(H).

Example 2.23.

Let N be a normal subgroup of G. Set HN, τ the inclusion, and α the conjugation.

Example 2.24.

Let G be a Lie group, τ: HG a covering space, and α conjugation by a lift. For instance, exp{2πi}:RS1 and the quotient map SU(n) → SU(n)/Z(n) give examples. Here, SU(n) is the set of n × n special unitary matrices and Z(n) is its center, i.e., elements of the form e2πik/nidn with kZ.

Example 2.25.

Let G be the trivial group consisting of a single element, H any Abelian group, τ the constant group homomorphism, and α the unique map sending the identity of G to idH.

Remark 2.26.

It is “not” possible for H to be a non-Abelian group if G is trivial. In fact, for an arbitrary crossed module, (H, G, τ, α), ker(τ) is always a central subgroup of H.

We now use crossed modules to construct examples of 2-categories, specifically 2-groups.

Example 2.27.
Let G(H,G,τ,α) be a crossed module. From G, one can construct a 2-category, denoted by BG, consisting only of a single 2-d domain, the 1-d defects are labeled by elements of G and the 0-d defects are labeled by elements of H. However, such labels must be of the form
Composition of 1-d defects in parallel is the group multiplication in G just as in BG (see Example 2.3). Composition of 0-d defects in series is defined by
Composition of 0-d defects in parallel is defined by
Notice that the outgoing 1-d defect is consistent with our definitions because
(2.21)
due to (2.20).
The identities are given as follows. The identity 1-d defect associated with the single 2-d domain is the 1-d defect labeled by e, the identity of G. The identity 0-d defect associated with a 1-d defect labeled by g is labeled by e, the identity of H. We abuse notation slightly and label both identities of G and H by e. It follows from these two definitions that the identity 0-d defect associated with the single 2-d domain is labeled by the identity on both the 1-d and 0-d defects. These three identities are depicted visually as
respectively.
The inverse of the 1-d defect labeled by g for the parallel composition of 1-d defects is just the 1-d defect labeled by g−1. Inverses for 0-d defects are depicted for series composition by
and parallel composition by
and similarly on the left. Notice that 0-d defects have two inverses, one for each of the two compositions.

This last class of examples of 2-groups from crossed modules will be used throughout this paper. In fact, all 2-groups arise in this way.

Theorem 2.28.
For every 2-group, let G be the set of 1-d defects and let H be the set of 0-d defects of the form
(i.e., 0-d defects whose source 1-d defect is e). Define τ: H → G by τ(h) ≔ g from 0-d defects of the above form. Set αg(h) to be the resulting 0-d defect obtained from the composition
The product in G is obtained from the composition of 1-d defects in parallel and the product in H is obtained from the composition of 0-d defects in series. With this structure, (H, G, τ, α) is a crossed module. Furthermore, this correspondence between crossed modules and 2-groups extends to an equivalence of 2-categories.8 

We now provide some examples of 2-groups along with weak functors between them to illustrate their meaning.

Example 2.29.
Let G be a group and H be a Hilbert space. Let U(H) denote the unitary operators of H. Let G be the crossed module ({1}, G, !, !), where the ! stand for the trivial map and trivial action, respectively. Let U(H) be the crossed module (U(1),U(H),τ,α) with τ(eiθ)eiθidH and α be the trivial action. By definition, a weak functor ρ:BGBU(H) consists of a function ρ:GU(H) and a function cρ: G × GU(1) of the form sending (g, h) to
which, in particular, says
(2.22)
satisfying
(2.23)
for all gG, and
(2.24)
for all g, h, kG. This is the definition of a (normalized) projective representation of G on H and is really a special case of Example 2.15, where the Hilbert space is fixed from the start. The crossed module U(H) introduced here is actually the automorphism crossed module (in analogy with the automorphism group) of the Hilbert space H viewed as a 2-d domain in the 2-category HilbIsomproj.

The following fact will be used in distinguishing two types of gauge transformations. It allows one to decompose an arbitrary gauge transformation into a composition of these two types.

Proposition 2.30.
LetCbe a category viewed as a 2-category so that its 1-d domains become 2-d domains, its 0-d defects become 1-d defects, and its 0-d defects are all identity 0-d defects. LetG(H,G,τ,α)be a crossed module with associated 2-groupBGandF,F:CBGbe two strict functors (so that cFand cF′are identities). A natural transformation σ: F ⇒ F′ consists of a function from 2-d domains ofCto G, denoted by g
and a function from 1-d defects ofCto H, denoted by h
which says that
(2.25)
satisfying the axioms in the definition of a natural transformation. Thus, σ can be written as the pair (g, h). Furthermore, there exists a strict functorF:CBGsuch that the natural transformation σ decomposes into a vertical composition (recall Definition 2.18) of the natural transformations (g, e): F ⇒ F″ and (e, h): F″ ⇒ F, i.e.,
(2.26)
Namely, for any 1-d defectzγy,

Proof.
Define F:CBG by sending a 1-d defect zγy of C to
and sending a 0-d defect
of C to
Using these definitions, one should check F″ is indeed a strict functor, both (g, e): FF″ and (e, h): F″ ⇒ F′ are natural transformations, and σ is the vertical composition of (g, e) followed by (e, h).

In classical electromagnetism, or gauge theory, in general, the equations of motion dictate the dynamics. In particular, the field strength, not the gauge potential, appears in the equations of motion. The vector potential becomes relevant when formulating the equations of motion as a variational principle which is itself a reference point towards quantization.55,13 The exponentiated action and parallel transports of gauge theory are realized precisely in this intermediate stage of local prequantum field theory which lies between classical field theory and quantum field theory. We will focus on special 1-d and 2-d field theories, i.e., particle mechanics and string theory. The particle case is provided as a review as well as to set the notation. We will use the 2-dimensional algebra of Secs. II B and II C to explicitly compute parallel transport and its change under gauge transformations. The novelty here, compared with the results of Ref. 5, for instance, is the explicit calculations on a cubic lattice and a direct derivation of the formula for the parallel transport including convergence results. Although our main results are Propositions 3.5 and Theorem 3.8, the diagrammatic picture developed for how these gauge fields interact with combinations of edges and plaquettes in a lattice should be fruitful for future applications.

The solution to the initial value problem (IVP)

(3.1)

at time T with A(t) being a time-dependent n × n matrix is

(3.2)

where T stands for time-ordering with earlier times appearing to the right, namely,

(3.3)

where f: {1, , k} → {1, , k} is any bijection such that

(3.4)

Remark 3.1.

The choice of sign convention (3.1) is to be consistent with Refs. 14, 56, and 30. Be warned, however, as this sign will lead to different conventions for other related forms such as the curvature 2-form, the connection 2-form, and gauge transformation relations. Certain authors use other conventions.57,58 Yet another convention is to include an imaginary factor.47 

The IVP (3.1) shows up in several contexts such as (a) solving Schrödinger’s equation with A(t) = iH(t) for a time-dependent Hamiltonian and ψ being a vector in the space on which H acts and (b) calculating the parallel transport along a curve in gauge theory, where A is the local vector potential, a matrix-valued (or Lie algebra-valued) differential form on a smooth manifold M. This integral goes under many names: Dyson series, Picard iteration, path/time-ordered exponential, Berry phase, etc.

As an approximation, the solution to this differential equation can be obtained by breaking up a curve into infinitesimal paths
and associating the group elements
(3.5)
with these infinitesimal paths and multiplying those group elements in the order dictated by the path. In this notation, we have used local coordinates {xμ} and the Einstein summation convention for these local coordinates. The i subscript on μ is meant to distinguish the summations at different times {ti}.dxμidtti stands for evaluating the derivative of the path at the time ti. Furthermore, Δti should be thought of as the length of the infinitesimal interval from ti to ti+1, namely, Δti = ti+1ti, and will be used later as an approximation for calculating integrals. For simplicity, we may take it to be Δti=1n if our parametrization is defined on [0, 1] and if there are n subintervals. Furthermore, by locality, the group elements should be of the form (3.5) to lowest order in approximation. Preserving the order dictated by the path, the result of multiplying all these elements is
(3.6)
Expanding out to lowest order (since the paths are infinitesimal) gives
(3.7)
where 1 denotes the identity matrix. Reorganizing terms results in
(3.8)
which is exactly the path-ordered integral appearing in (3.2) after taking the n limit in which the Δti are replaced by dti. There are several things to check to confirm this claim. First, to see that the limit as n of the partial products coming from (3.7) converges, we use the fact that this product converges if and only if the sequence of partial sums
(3.9)
converges as n (cf. Sec. 8.10 in Ref. 59). Technically, one should be a bit more careful since the matrices change as a function of n. Our “if and only if” claim would be correct if we replace n with an arbitrary partition and take limits over partitions because any two partitions have a common refinement. Another proof of convergence can be done using Picard’s method.60 In (3.9), the norm can be taken to be the operator norm for matrices in any representation. If one defines the real-valued function
(3.10)
on the domain of the path, then the convergence of this sum is equivalent to the existence of the Riemann integral of the function A over [0, 1] (one could have made these definitions for any partition of the interval to relate it more precisely to the Riemann integral61). Since the Lie algebra-valued differential form A is smooth and since the path is smooth, A is smooth and therefore integrable so that
(3.11)
Second, one should note that the sums in (3.8) are automatically ordered so that they become integrals over simplices in the n limit. This follows from the equality
(3.12)
giving an additional 1k! for the volume of the k-simplex. For example, the double sum term above with the i > j ≥ 1 becomes the double-integral term over the 2-simplex. The lowest order terms resemble integrals while the latter terms do not [for example, see the last term in (3.8)]. However, as n, the latter terms get “pushed out” to infinity and (3.2) is what remains. More precise derivations can be found in Refs. 56 and 62. We picture the group element (3.8) as all the number of ways in which A interacts with the particle preserving the order of the path
(3.13)
Thus, given a path γ: [0, 1] → M, we denote the parallel transport group element in (3.8), after taking the n limit, by triv(γ). The notation triv(γ) is used because we will always work in a local trivialization of a bundle with connection. This choice is also made to be consistent with an earlier work,14 as well as Ref. 30. Three key properties of the parallel transport are that (a) it is reparametrization invariant, (b) if one has two paths connected at their endpoints as in
then
(3.14)
and finally (c) it is a smooth function from paths in M to the Lie group G. This resembles the definition of a functor. To state the relationship between parallel transport and functors more precisely, we note that triv(γ) is invariant under more than just reparametrizations of γ. It is also invariant under thin homotopy. Hence, an appropriate domain on which triv is defined is a (smooth) category P1M known as the “thin path groupoid” of M. A “groupoid” is a category all of whose 0-d defects are invertible. Briefly, the thin path groupoid P1M consists of points of M and certain equivalence classes of paths of M. In terms of 1-d domains and 0-d defects, we actually use the Poincaré dual so that points in M correspond to 1-d domains (which are now better thought of as objects) and paths in M correspond to 0-d defects (which are now better thought of as morphisms). More details on the thin path groupoid can be found in Refs. 14 and 30 with a proof of thin homotopy invariance as well as smoothness of triv in the latter reference. Fortunately, we will not need such technical details for our calculations. All we should keep in mind is that triv:P1MBG associates group elements with paths
smoothly and the path ordered integral arises from smoothness, breaking up the path into infinitesimal pieces, and using the generalized group homomorphism property. Namely, associated with such a path γ and a decomposition of that path into a concatenation of paths
(3.15)
let
(3.16)
Then, the parallel transport is approximated by the product
given in (3.6) and the time ordering is automatic. This is essentially what we mean by one-dimensional algebra: one-dimensional algebra is the theory of categories and functors.
The symmetries associated with the parallel transport are given by functions MG. More precisely, let triv,triv:P1MBG be two parallel transport functors defined by vector potentials A and A′, respectively. A “finite gauge transformation” from A to A′ is a smooth function g: MG satisfying
(3.17)
In order for this equation to make sense, we must view G as a matrix group, which we will always assume to facilitate computations and simplify formulas. However, this assumption is not necessary for any statements made here. Condition (3.17) for a gauge transformation is equivalent (see Ref. 30) to the condition that for any path γ from y to z,
(3.18)
which in turn is equivalent to the statement that g: MG defines a smooth natural transformation from triv to triv′ (see Definition 2.6). A sketch of this equivalence can be seen by discretizing a path tx(t) into n pieces and using expression (3.7) for the approximation of the parallel transport. Applying a gauge transformation to each piece gives
(3.19)
where the product is in the specified order as in (3.7). Taylor expanding out the latter group element gives
(3.20)
to first order in Δt. Plugging this into (3.19) gives
(3.21)
where we have dropped the term
(3.22)
since it is second order in Δti. Finally, since
(3.23)
it is reasonable to identify the corresponding terms giving
(3.24)
which reproduces (3.17). This latter perspective of functors and natural transformations will be used in the sequel to “define” parallel transport along two-dimensional surfaces (with some data on orientations). This was first made precise in Ref. 30 though the formulation in terms of functors had been expressed earlier.15 

Remark 3.2.

Most of the calculations in this paper will follow this sort of logic. Although similar techniques were used in Refs. 11 and 15, we were largely motivated by the analysis in Ref. 47 and hope that our treatment will be accessible to a larger audience. More rigorous results can be found in Refs. 30, 5, 31, and 12.

Understanding higher form non-Abelian gauge fields has been a long-standing problem in physics, particularly in string theory and M-theory (see, for instance, the end of Refs. 63). Some progress is being made to answer some of these problems with the use of higher gauge theory (see Ref. 41 and the references therein). Although we do not aim to solve these problems, we hope to indicate the important role played by category theory in understanding certain aspects of these theories. We will show how 2-categories and the laws set up in Secs. II B and II C naturally lead to the notion of parallel transport along surfaces. This will also illustrate how explicit calculations can be done in 2-groups. Parallel transport will obey an important gluing condition analogous to the gluing condition for paths. Gauge transformations will be studied in Sec. III C. Furthermore, we will produce an explicit formula analogous to the Dyson series expansion for paths. Although an integral formula is known in the literature,5 the derivation there is not entirely direct nor is it obvious how the formulas are derived from, say, a cubic lattice approximation. A sketch is included in Ref. 15 in Sec. 2.3.2, but further analysis was done in path space, which we feel is more difficult. Indeed, the goal of that work was to relate gerbes with connection on manifolds to connections on their corresponding path spaces. Furthermore, although experts are aware of how bigons are related to more general surfaces, we explicitly perform our calculations on “reasonable” surfaces, namely, squares, for clearer visualization. Our method is more in line with the types of calculations done in lattice gauge theory.20 

We feel it is important to express surface transport in a more computationally explicit manner using a lattice and “derive,” from the ground up, a visualization of the surface-ordered integral sketched in Fig. 15 in Ref. 14. This is done in Proposition 3.5, Theorem 3.8, and the surrounding text. Just as the G-valued parallel transport along paths in a manifold M is described by a functor triv:P1MBG, crossed-module G-valued parallel transport along surfaces should be described by a functor from some 2-category associated with paths and surfaces in M to the 2-group BG. Ideally, such a 2-category should be a version of the (extended) 2-dimensional cobordism 2-category over the manifold M to mimic the ideas of functorial field theories. However, this has not yet been achieved in this form for general 2-groups. In fact, a version of this has only recently been achieved for the 1-dimensional case by Berwick-Evans and Pavlov.64 An earlier work on Abelian gerbes indicates this should be the case in general though this has not been fully worked out.65 Part of the reason is due to the fact that the representation theory for higher groups is a rather young subject.66 

Fortunately, a related solution exists if one works with a 2-category of paths and homotopies. This 2-category is denoted by P2M. It is more natural to describe this category in terms of the Poincaré dual of string diagrams. Namely, objects of P2M are points of M, 1-morphisms of P2M are thin homotopy classes of paths in M, and 2-morphisms are thin-homotopy classes of “bigons” in M. A bigon is essentially a homotopy between two paths whose endpoints agree.

Definition 3.3.
Let γ and δ be two paths from x to y parametrized by t ∈ [0, 1] such that there exists an ϵ > 0 with γ(t) = δ(t) for all t ∈ [0, ϵ] ∪ [1 − ϵ, 1]. A bigon from γ to δ is a map Σ: [0, 1] × [0, 1] → M such that there exists an ϵ > 0 with
(3.25)
It is helpful to visualize such a bigon as

Definition 3.4.

Two bigons Σ and Γ from paths γ to δ are “thinly homotopic” if there exists a smooth map of a 3-dimensional cube into M whose top face is Σ, whose bottom face is Γ, and similarly for the other face for the paths γ and δ along with their endpoints (all of these assume some constancy in a small neighborhood of each face). Furthermore, and most importantly, this map cannot sweep out any volume in M, i.e., its rank is strictly less than 3.

More details can be found in Refs. 14 and 5 though again such technicalities will be avoided here. Thus, a strict smooth functor triv:P2MBG is a smooth assignment
that, by the conventions of 2-groups in Sec. II C, says
(3.26)
Furthermore, this assignment satisfies a homomorphism property in the following sense. Bigons can be glued together in series and in parallel by a choice of parametrization. By the thin homotopy assumption, the value of the bigons is independent of such parametrizations. It might seem undesirable to restrict ourselves to surfaces of this form. However, this is no serious matter because every compact surface can be expressed in this manner under suitable identifications living on sets of measure zero. For example, a surface of genus two with three boundary components with orientations shown (the orientation of the surface itself is clockwise) is depicted on the right as a bigon beginning at the path γ (in blue) and ending at the path δ (in yellow) both of which are loops beginning at the same basepoint which is the top left corner of the octagon on the left
The identifications on the outer boundary of the octagon are standard ways of representing a genus two surface.67 Furthermore, one can always triangulate or cubulate such a surface. If one chooses triangulations, then one merely needs to know the parallel transport on triangles
and if one cubulates a surface, then one needs to know it for squares
Thus, in order to find an explicit formula for the parallel transport along surfaces with non-trivial topology, it suffices to calculate the parallel transport along, say, a square. Squares are also sometimes more convenient to use for continuum limiting procedures as opposed to triangles.68 Functoriality for gluing squares together implies
and using the rules of two-dimensional algebra, this composition is
(3.27)
Similarly, for gluing along a different edge
the composition of the 0-d defects is
(3.28)
These two ways of composing squares will form the basis for later computations. If one also wishes to attach a square in a somewhat arbitrary way such as
then this attachment must be oriented in such a way that (a) the boundary orientation agrees with the orientation of the first surface and (b) the two surface orientations combine to form a consistent orientation when glued together. So, for example,
is an allowed gluing orientation (more on orientations and their physical meaning is discussed in Sec. III D). In this case, if we label all the vertices, edges, and squares, then the parallel transport along the glued surface is
which reads
(3.29)
on the resulting 0-d defect. This result will play a crucial role in Remark 3.9.
Using all of these results, we can take an arbitrary worldsheet (with orientations giving it the structure of a bigon), break it up into infinitesimal squares
and approximate the parallel transport along an infinitesimal square
where
(3.30)
and
(3.31)
denote the parallel transport along infinitesimal paths and
(3.32)
denotes the parallel transport along infinitesimal squares. Here,
(3.33)
and for an n × n square grid, these can be set as Δsi=1n=Δtj. Our convention is to include all combinatorial factors into our Einstein summation convention since these are cumbersome to carry. With respect to the usual Einstein summation convention, such an expression in exponential (3.32) would have a 12. This is due to the fact that B is a 2-form. For a k-form, the factor we are leaving out is 1k!. Note that in order for the association before Eq. (3.30) to be consistent with our 2-group conventions, it must be true that
(3.34)
or equivalently
(3.35)
at least to lowest non-trivial order. The term on the right-hand-side of (3.35) is precisely the parallel transport along the infinitesimal square (for the purpose of this calculation, we have dropped the Δsi and Δtj from the notation to avoid clutter—this should cause no confusion because these quantities are always coupled with their corresponding derivatives s and t, respectively)
(3.36)
to lowest order, which is a standard result, reproduced here to illustrate the methods that will be employed in more involved calculations later. Here,
(3.37)
is the curvature of A. Meanwhile, the left-hand-side of (3.35) is
(3.38)
to lowest order. Here, τ̲:hg is the derivative of the map τ: HG at the identity, i.e., on the Lie algebras [see  Appendix A for more on the infinitesimal version of (H, G, τ, α)]. This therefore forces the condition
(3.39)
which is known in the literature as the “vanishing of the fake curvature.”8 Finally, we can expand out these exponentials of differential forms and multiply all terms together analogously to what was done for a path. An arbitrary worldsheet is described by a map (via some reparametrization if necessary) from [0, 1] × [0, 1] to some target manifold and is naturally a bigon with the orientation induced by having (s, t) a right-handed coordinate system. Breaking up such a bigon into infinitesimal squares (one can also use arbitrary partitions—see  Appendix B) allows one to associate the above exponentials on the Poincaré dual of the cubulation of the worldsheet
In the above figure, a cubulation of the domain [0, 1] × [0, 1] is shown on the right together with its Poincaré dual. This rotated (s, t) coordinate system was chosen to agree with our earlier convention on two-dimensional algebra [cf. (3.27) and (3.28)]. To be a bit more clear, since a bigon is a map [0, 1] × [0, 1] → M with conditions described by (3.25), we can visualize it as
We prefer to use these rectangular coordinates to more easily express our results for cubic lattices. We will consider a 5 × 5 grid for concreteness ( Appendix B contains an analysis for more general partitions). The goal is to associate with each square in this grid the 2-group elements (3.30)–(3.32), etc., and then to multiply all of these elements together using the rules for 2-group multiplication set up in Sec. II C. In order to do this, we use the rules set up earlier on how to read such diagrams and this requires us to extend the 1-d defects of the Poincaré dual to the top and bottom of the page using identity 0-d defects drawn on the (s, t) domain of the worldsheet. The identities are drawn in yellow to illustrate where they are and not because the 2-d domain is different—there is only a single 2-d domain in a 2-group—see Sec. II C. One could have also added identities in many other consistent ways. The end results would all be the same (to lowest order) due to the interchange law (2.4)
To more easily relate this picture to the earlier ones for 2-groups, it helps us to draw horizontal lines to distinguish the order of multiplication
and then to tilt the angles of the identities (only the top half is drawn)
which now makes it easy to see we can first compose each row in parallel and then compose the results in the remaining column in series. We explicitly label (some of) the 1-d and 0-d defects
and then multiply each row in parallel. The first row starting at the top looks like
The result on the 1-d defects is just the usual group multiplication product, while the result on the 0-d defects is
(3.40)
The 0-d defects of the next several rows are all given by the following equations:
(3.41)
(3.42)
(3.43)
(3.44)
(3.45)
(3.46)
(3.47)
and finally
(3.48)
The result of composing all of these in series gives
(3.49)
which, when expressed in terms of usual group multiplication in H, becomes
(3.50)
We can visualize this mess more easily by expanding out each bij to lowest order (since we already know that the a’s give the one-dimensional parallel transport, we do not have to expand them out) and examining the terms with zero Bij’s, terms with one Bij, terms that involve the product of two Bij’s of different indices, and so on. For example, expanding out just the first two terms on the right-hand side of (3.50) gives (a prime on the second ν1 index has been adjoined to remain consistent with the Einstein summation convention)
(3.51)
Expanding all of these products out and separating the terms order by order (the order is now determined by the area elements) results in a single zeroth order term given by just the identity and 25 first order terms with a single B (some of these terms are written underneath the pictures to more clearly illustrate our convention)
These pictures express the fact that we calculate the ordinary parallel transport along a specified path between the point (s, t) = (s6, t6) and another point (si+1, tj+1) (represented by a blue line) and conjugate each B field at (si, tj) (represented by a blue square) by that parallel transport using α. Then, we sum over all points at which B has been specified. There are k=124k=24(25)2=252=600 second order terms, i.e., terms with two B’s,
In this long expression, there are 24 terms in the first 3 rows of pictures, 23 in the two rows after that, up until we get 2 + 1 shown in the last row. This is consistent with the counting 252. Now, we should do this sum for all products of B’s ranging from terms with 0 B’s to terms with 25 B’s. For example, a term with 4 B’s might look like
but a term such as
does “not” appear in expression (3.50) due to automatic ordering. To be clear, this ordering is given as follows:
where the earlier terms begin at 1 and appear from right to left when expressed algebraically using group multiplication. The total number of all terms in such an expansion is enormous and is given by
(3.52)
or more generally
(3.53)
if we have an n × n grid. There are many things to check to make sense of the n limit. First, we need to argue why the product from (3.50) generalized for an n × n grid converges as n.

Proposition 3.5.

Let trivnbe the generalization of the expression given in (3.40) for an n × n grid decomposition. Then, the sequencetrivnnNconverges as n → ∞.

Proof.

To prove this, one can introduce trivP for any partition P of the unit square. Then, one needs to show that this quantity has a well-defined limit over all partitions ordered by refinement. This argument has been deferred to  Appendix B.

Secondly, as a result of expanding out these products into sums of different orders, we should be sure that the sum of all such terms coming from (3.50) for any given order converges as the spacing goes to zero, i.e., as n. The terms with k B’s have an additional factor of 1n2k associated with the area elements on which they are approximated (we are ignoring the factors coming from the aijs and aijt terms because we can see from these pictures that in the n limit these terms describe the parallel transport along a path as was discussed in Sec. III A—cf.  Appendix B). The ratio of the number of “all” such terms for kn22 to this factor is
(3.54)
where denotes the floor function. Note that the product term satisfies
(3.55)
because it is a product of numbers strictly less than or equal to 1 for all i. Hence,
(3.56)
For kn22, the ratio n2k/n2k decays even more strongly because n2k is symmetric at n22 and hence n2k begins to decrease for larger values of k, while the 1n2k factor remains and increases as k gets larger.

Proposition 3.6.
For eachkNand n ≥ k, let trivn,kdenote the kth order terms obtained from expanding out trivnto lowest order (see Proposition 3.5). First, for eachnN,there exists a positive real number Mn> 0 such that
(3.57)
for all k ∈ {0, 1, , n2}. Second, for eachkN,there exists anNNand a positive real number M > 0 such that
(3.58)
for all n ≥ N. Finally,
(3.59)

Proof.

This argument has been deferred to  Appendix B.

This result is analogous to the bound obtained for ordinary parallel transport.56 Explicitly computing the kth order sum as n is intractable due to the complicated ordering of terms present [see the ordering on a 5 × 5 grid before Eq. (3.52)]. Fortunately, we can simplify the expression trivn by rearranging and reorganizing all of these terms. For example, consider terms with two B’s. There are terms with two B’s at different “vertical heights” such as
and terms with B’s at the same height such as
As explained in (3.52), due to the automatic ordering, there do not exist terms with the order flipped in the above two images. Therefore, the total number of terms with two B’s at the same height is (n = 5 in our picture)
(3.60)
where the second equality comes from a neat fact about Pascal’s triangle
(3.61)
The ratio of terms with two B’s at the same height to the total number of terms with two B’s is
(3.62)
Note that the limit of this quantity as n is
(3.63)
Hence, terms that involve a product of two B’s that also appear at the same height become negligible as compared to those that do not in the n limit. One might wonder if this is true for any product of B’s. Clearly, this is false when we have a product of k B’s and k > 2n − 1 since “every” configuration has at least one row in which B occurs at least twice. However, it is true for k sufficiently smaller than n. This leads us to an interesting combinatorial problem in its own right.
The number of configurations of k blocks in an n × n grid tilted 45° such that no two blocks appear at the same height is
(3.64)
where
(3.65)
denotes the number of available blocks at a given height i. The ratio of this number to the total number of configurations of k blocks (not necessarily restricted to lie on different heights) is
(3.66)

Lemma 3.7.
For any ϵ > 0 andKN,there exists an integer N > K such that
(3.67)
for all n ≥ N and k ≤ K, i.e.,
(3.68)
for allkN.
Although the graph in Fig. 1 illustrating this should be convincing, it is not a substitute for a proof. The Proof of Lemma 3.7 is quite involved and is given in  Appendix C. Instead, we offer a rough estimate analysis via averaging. The average value of ln is
(3.69)
Hence, to a good approximation for large n and small k,
(3.70)
where the second line comes from the fact that there are 2n1k terms in the summation. Hence, to a good approximation,
(3.71)
Since k is fixed, the right-hand-side tends to 1 as n.

FIG. 1.

A plot of Rn,k for various values of n and k indicating limnRn,k = 1.

FIG. 1.

A plot of Rn,k for various values of n and k indicating limnRn,k = 1.

Close modal

Theorem 3.8.
Let trivnbe the generalization of the expression given in (3.40) for an n × n grid decomposition. Lettrivnredbe the same expression but with all terms in which B occurs at least twice at the same height removed. For any ϵ > 0, there exists anNNsuch that
(3.72)
for all n ≥ N.

Proof.
Let M be the maximum value of the norms of all quantities of the form αa56sa46sa36sa35ta25s(B14). The difference trivntrivnred only consists of contributions from terms in which there exist at least two B’s that occur at the same height. Fix ϵ > 0. To begin, let K be large enough so that
(3.73)
which is possible since the series for the exponential converges. Furthermore, by Lemma 3.7, for any ϵ > 0, there exists an N large enough so that
(3.74)
Using these two results, the value of the norm of the difference trivntrivnred is bounded by
(3.75)
Thus, heuristically, as n, the number of terms for which at least two B’s are at the same height is a set of measure zero with respect to all possibilities and hence we can ignore them in the calculation of the surface ordered parallel transport after taking the n limit. This gives the following picture for the surface-iterated integral. Let γs,t be the (thin) path
The limit of the expression (3.49) as n is therefore given by a sum of iterated integrals
with path-ordering “only” in the vertical direction. In more detail, the surface-ordered integral is depicted schematically as an infinite sum of terms expressed by placing B at the starting points of the drawn paths and conjugating it by parallel transport along the path connecting to it using A and α. Then, we use an ordinary integral over the horizontal direction to get a 1-form (similar to what is done in Refs. 15 and 5). Finally, we use the usual path-ordered integral in the vertical direction. More explicitly, by changing coordinates to
(3.76)
one can express γs,t in terms of u and v. We write this path as γu,v. Using this, the surface parallel transport is given by
(3.77)
where B(u, v) stands for
(3.78)
The sum in (3.77) is absolutely convergent by Propositions 3.5 and 3.6 and Theorem 3.8. In other words,
(3.79)

Remark 3.9.
Although our formula for the surface ordered product for parallel transport has a similar form to the one given by Schreiber and Waldorf in their Eq. (2.27) of Ref. 5, they do not look equal. Here, we provide an argument that shows our two formulas are equal. This fact will follow from the defining properties of a crossed module. In terms of bigons, the following figure depicts the differences between these two conventions of defining the surface ordered product:
for a bigon Σ: γδ between paths γ, δ: xy. The left bigon depicts our automatic ordering that was derived directly from 2-group multiplication with the red quantity appearing to the right of the blue quantity, i.e.,
(3.80)
The right bigon depicts the ordering convention chosen by Ref. 5. This can be seen by noting that in their formula, some inverses and minus signs appear, which we have avoided. These amount to computing the parallel transport on the leftover form, which they call AΣ, along the “reverse” direction. Also notice that the path they use to act on the B field via the α action from the crossed module actually goes around quite differently than ours. For them, the order is swapped and the expression is written as
(3.81)
Term-by-term, these expressions are in fact “different.” However, when they are combined and all terms are taken to account in computing the full parallel transport, the resulting 2-group elements describing the parallel transport are the “same” so that our formulas agree. To see this, we will compare what happens in both of our conventions if we add on an infinitesimal square to an already computed surface transport [cf. (3.29)]. Fix (s, t) ∈ [0, 1] × [0, 1] and consider a bigon whose parallel transport has been computed up to s and an additional bigon is to be added at this point
(3.82)
Set
(3.83)
Our convention is that the resulting parallel transport of adding this extra piece is given by αgP(hΔ)h, whereas Schreiber and Waldorf’s convention (due to the inverses appearing in their formulas) would give hαggSW1(hΔ). To better compare these, notice that τ(h)g = gPg1gSW, which is equivalent to ggSW1=τ(h1)gPg1. Hence,
(3.84)
by the properties of crossed module multiplication. Now, by continuity of triv, these actions, and the group multiplications and since limϵ0g1 = e, the identity element in the group G, we have
(3.85)
so that our formulas for the parallel transport along surfaces agree.

In Sec. III A, we described gauge transformations as natural transformations of parallel transport functors for paths. In this section, we will use this as the “definition” of a gauge transformation and derive the corresponding formulas for differential forms. As before, let G(H,G,τ,α) be a crossed module, BG be its associated 2-group, and M be a smooth manifold. A “(first order) gauge transformation” from a parallel transport functor triv:P2MBG to another triv:P2(M)BG is a (smooth) natural transformation triv ⇒ triv′. By Definition 2.16 and Proposition 2.30, such a natural transformation consists of a pair of (smooth) functions g: MG and h: P1MH satisfying the conditions described in that proposition. Namely, to every thin path zγy,

(3.86)

to every pair of composable thin paths zγyδx,

(3.87)

to every point xM,

(3.88)

and finally to any worldsheet,

viewed as a bigon from γδ to ζξ, the equality

holds [this follows from condition (d) in Definition 2.16]. Reading this diagram is a bit tricky without the arrows (recall Remark 2.9). More explicitly, this equality says

i.e.,

(3.89)

or equivalently by our earlier condition (3.87)

(3.90)

By Proposition 2.30, such a natural transformation can be decomposed as

(3.91)

A gauge transformation of the type (g, e) is typically called a (first order) “thin gauge transformation” and one of the type (e, h) is called a (first order) “fat gauge transformation.”69 Thus, Proposition 2.30 implies that an arbitrary gauge transformation of the first kind can be decomposed into a thin and fat gauge transformation. Using this, we can calculate infinitesimal versions of the functions g: MG and h: P1MH for small paths, i.e., for a point xM and a tangent vector at x. This was already done for g: MG at the end of Sec. III A with result (3.17). For h: P1MH, let tx(t) parametrize an infinitesimal path γ. Then, to lowest order in Δt,

(3.92)

for some 1-form φΩ1(M;h). Thus, plugging these expressions into (3.86), a fat gauge transformation from (A, B) to (A′, B′) infinitesimally gives

(3.93)

because g has been set to be the identity. Expanding out to lowest order in Δt gives

(3.94)

giving the relationship

(3.95)

for a fat gauge transformation. We already calculated what happens for a thin gauge transformation in Sec. III A. Using Proposition 2.30, combining (3.17) with this gives

(3.96)

for an arbitrary gauge transformation. The B field under an arbitrary gauge transformation changes according to (3.89). By substituting the necessary forms, this expression on the left-hand-side of (3.89) becomes (to avoid clutter, we have not explicitly written Δs and Δt)

(3.97)

where it is understood that all terms now are evaluated at (s, t), to lowest order. Meanwhile, the right-hand-side of (3.89) is

(3.98)

again to lowest order. Equating (3.97) with (3.98) gives

(3.99)

in components or

(3.100)

as an equation in terms of differential forms. We write such a gauge transformation as

(3.101)

This and (3.96) agrees with Proposition 2.10 of Ref. 5. We can express this purely in terms of A, B, g, and φ as

(3.102)

This will be useful later.

Now suppose (g, h), (g′, h′): triv ⇒ triv′ are two first order gauge transformations. A “second order gauge transformation” a: (g, h) ⇛ (g′, h′) is a modification from (g, h) to (g′, h′). By Definition 2.19, this consists of a function a: MH fitting into

which, in particular, says

(3.103)

satisfying the condition that to any path yγx,

(3.104)

Expanding out this expression on infinitesimal paths gives

(3.105)

which to lowest order says

(3.106)

Note that if hH, the function αh: GH is defined by Ggαg(h) so that αh̲:gh is the derivative. This result gives the condition (after multiplying by a−1 on the right)

(3.107)

on components and

(3.108)

as h-valued differential forms. This and (3.103) exactly agree with Proposition 2.11 of Ref. 5. Physically, a second order gauge transformation is a gauge transformation for the 1-form φ, which itself is a type of field even though it appears as a (fat) gauge transformation for the 1-form and 2-form gauge potentials.

It is well-known that given a path yγx, the parallel transport along the reversed oriented path γ−1 is the inverse

(3.109)

where triv:P1MBG is the (local) parallel transport functor. This can be viewed as a consequence of thin homotopy invariance and functoriality of parallel transport. Namely, although the paths γγ−1 and γ−1γ are “not” constant paths (the notation γ−1 is therefore a bit abusive), they are thinly homotopic to constant paths and hence give the same value on triv. Thus,

(3.110)

verifying (3.109). In this section, we will explore analogous results for reversing different kinds of orientations on bigons. We therefore include arrows for clarity

Technically, there is one more possibility given by Σ̲:δ1γ1. However, this possibility is a combination of the above two, namely, Σ̲=Σ1¯. The meaning of these different possibilities can be explained physically as follows. A string may be given the additional datum of an orientation. Furthermore, as it moves in time, it has an additional directionality. These two directional orientations are precisely encoded in the definition of a bigon/2-morphism. These different orientation reversals are given by time reversal for Σ¯ and spatial orientation reversal for Σ−1. The case Σ̲ corresponds to both reversals (the order of reversal does not matter since the operations commute). Note that the different orientations on a bigon can be expressed as an orientation of edges on the boundary and an orientation of the surface. The above bigons correspond to the following surfaces with associated orientations

respectively.

A necessary and sufficient condition for such orientations on surfaces and edges to give rise to a bigon is the following. Given a map of a polygon Σ into M, the boundary consists of the edges of the polygon. The union of the oriented edges consistent with the orientation of the polygon must be connected. Similarly, the union of the orientated edges with negative orientation with respect to the induced one from the polygon must also be connected. Then, the source of the bigon is the union of the consistent edges and the target is the union of the oppositely oriented edges. An example together with a nonexample are

respectively (blue, with arrows written using >, corresponds to an orientation agreeing with the induced one from the surface, while yellow, with arrows written using ≫, disagrees with that orientation).

Going back to the three bigons and their orientations at the beginning of this section, we notice that several of these bigons can be composed with one another. For instance,

and

after applying thin homotopies. Therefore, these bigons provide inverses in series and in parallel, respectively, of Σ. This, together with functoriality of triv and the inverses discussed in Example 2.27, implies

(3.111)

and therefore describes how parallel transport along surfaces changes under reversals in surface orientations and boundary orientations, respectively. For completeness, for the last possible orientation Σ̲:δ1γ1, we have

(3.112)

In the following, we go through some additional calculations in higher gauge theory. Just as the curvature F of a 1-form connection A can be obtained by calculating the parallel transport along an infinitesimal loop, the 2-curvature of a 2-form connection (A, B) can be obtained by calculating the surface transport along an infinitesimal sphere, which on a cubic lattice corresponds to a cube. We will perform this calculation explicitly and study some properties of the resulting 3-form curvature. Similar analysis was done on a tetrahedron in the context of categorical higher gauge theory in Ref. 11. Another version was explored for cubes in Ref. 70 though the latter did not formulate their results using the notion of a 2-group.

Let (r, s, t) ↦ x(r, s, t) be an infinitesimal cube and consider the following domain for that cube along with the infinitesimal path that goes first along the r direction, then in the s direction, and finally in the t direction. Our convention is that (r, s, t) is a right-handed coordinate frame, i.e., drdsdt is the volume form

Such a cube can be expressed as a bigon by the following composition of plaquette bigons that begin and end at the same path starting at the top left and moving clockwise:

The corresponding 2-group elements are given as follows: We begin with the first surface

where we use the shorthand notation

(3.113)

as well as

(3.114)

and similarly for the other terms. We also write ϵ instead of Δr, Δs, or Δt and use the derivatives to remind ourselves of the direction. We have also assumed for simplicity that our coordinates are centered at the origin and the lattice spacing is ϵ in each direction. Working out this diagram infinitesimally on the 0-d defect gives

(3.115)

to lowest order. As usual, rather than writing out the Δr, Δs, Δt, we use the number and type of derivatives appearing to keep track of the order. The other terms are given as follows:

(3.116)
(3.117)
(3.118)
(3.119)
(3.120)

The composition of all of these elements is given by the following diagram (with the light shaded blue squares depicting the faces on the cube):

The result of multiplying these out using two-dimensional algebra gives

(3.121)

This is yet another manifestation of two-dimensional algebra. The result of multiplying all these terms is given as follows, order by order. The zeroth order term is 1. There are no first order terms. The second order terms are given by

(3.122)

by antisymmetry of Bμν in the μ and ν indices. Thus, the only nonzero terms are the zeroth and third order terms (up to third order). One type of the third order terms is given by

(3.123)

and vanishes again by antisymmetry of Bμν and commutativity of partial derivatives. The final result of expanding out (3.121) is therefore

(3.124)

In analogy to the curvature 2-form associated with a 1-form potential A obtained by calculating the holonomy along an infinitesimal square, we define this third order term to be the “3-form curvature” associated with the pair (A, B) and denote it by H (although this notation conflicts with our crossed module notation, it should cause no confusion in this section based on its usage). In terms of components, it is given by

(3.125)

and using differential form notation

(3.126)

This definition and result agrees with (3.28) of Ref. 11 and Lemma A.11 in Ref. 5. As was also pointed out in Ref. 11,

(3.127)

by the Bianchi identity. Since kerτ̲ is a central Lie subalgebra of h, this means that H is a 3-form with values in an Abelian Lie algebra (see Remark 2.26). Under a first order gauge transformation (A,B)(g,φ)(A,B) as in (3.101), the 3-form curvature changes to

(3.128)

where the underlined terms cancel and the other terms with underbraces will be calculated and compared momentarily. At this point, it is useful to simplify some of these terms by applying τ̲ and calculating the results in terms of commutators, etc. For example,

(3.129)

We can safely equate the terms inside the τ̲. This is a helpful trick and we will use it to calculate all other terms. For instance, one can show using this trick that

(3.130)

Although this is a trick and not completely rigorous, it works for all of the calculations we will do. These formulas can all be found in Appendix A of Ref. 25 and can be derived more carefully. Since α̲ is a derivation,

(3.131)

which cancels with the term [φ,α̲gAg1dgg1(φ)] in (3.128). Furthermore, note that

(3.132)

for any g-valued 1-form X and for any h-valued 1-form Y. Although τ̲ has been applied to derive these equalities, the expressions inside τ̲ are still equal. This equality implies

(3.133)

One of the more cumbersome set of terms in (3.128) is

(3.134)

Combining this with the result preceding it gives just a single term α̲gFg1(φ). Putting all of this together, we obtain

(3.135)

Finally, by the properties of crossed modules and by the vanishing of the fake curvature,

(3.136)

so that the above formula reduces further to

(3.137)

In particular, H is invariant under fat gauge transformations. This result agrees with what was discovered in Ref. 11.

We have illustrated that 2-category theory can be implemented and used in such a way as to calculate parallel transport along two-dimensional surfaces, such as worldsheets of strings. We did this explicitly for gauge groups that are not necessarily Abelian via an approximation technique that can be implemented numerically. We have done this using string diagram techniques to facilitate 2-categorical techniques and bring higher category theory to a wider audience. Although Girelli and Pfeiffer have calculated infinitesimal gauge transformations and curvature forms via similar techniques11 and Schreiber and Waldorf provided a formula for the parallel transport along a surface,5 our infinitesimal methods give a much more explicit and direct construction of the iterated surface integral from elementary building blocks. This fills in some of the arguments sketched by Baez and Schreiber in Ref. 15, particularly in Sec. 2.3.2 (see also Sec. 11.4.1 of Schreiber’s thesis71). Schreiber and Waldorf’s integral in Ref. 5 was obtained from consistency conditions, and then they proved that it satisfies the necessary functorial properties expected of surface holonomy. In relation to other studies, such surface-ordered integrals have been used recently in constructing a Hochschild complex for surface transport.72 The novelty of our result is that we derived the formula for surface parallel transport from scratch using a discretization of our surface. To the best of our knowledge, this is the first appearance of such an explicit construction together with analytical results on convergence (Proposition 3.5) and a simplification providing a manageable surface-ordered integral by reducing the surface ordering to a single direction as opposed to two (this result is embodied in Theorem 3.8). By implementing string diagrams, we have also provided a more friendly visualization. Furthermore, we have avoided using path spaces explicitly and have simplified many arguments.

We hope that we have illustrated how two-dimensional algebra can be used for explicit calculations. If developed further, these ideas might be used to explain physical phenomena that utilize algebraic manipulations in more than one dimension more naturally. Such higher-dimensional algebra appears in many situations. For example, elements and molecules combine in a variety of ways forming complicated compounds, amino acids, and proteins. These are objects that use three dimensions to configure themselves. Therefore, a natural and faithful representation of them would involve a sort of 3-dimensional algebra. Another example occurs in painting. Given a painting, it is much simpler for us to “read” a 2-dimensional painting than to view all the pixels making it up in a straight 1-dimensional list. Both perspectives contain the same information-theoretical data, but the 2-d form is naturally and immediately recognized. As another even more speculative example, it is known that the entropy of a black hole is proportional to the surface area of its horizon. This may lead one to believe that the microstates of the theory can be expressed as living on a lower-dimensional world. This in turn then suggests the possibility that a two-dimensional algebra might be useful in describing some of the properties of these microstates. Although these ideas are entirely speculative, our point is that one can imagine that the one-dimensional algebra we have been using for so long is only the tip of an iceberg of algebraic structures. Higher category theory opens us to these other possibilities.

There are still many open questions in this relatively young field. One is how to construct useful actions in physics that model phenomena with non-Abelian higher form gauge fields and also the interactions with matter fields. Some recent progress in this direction has been made by Sämann and others—see Ref. 41 and the references therein. Work on the pure gauge field side was initiated in the work of Pfeiffer using a 2-categorical approach.10 To proceed, it seems that a better suited representation theory for 2-categories will be useful.66 Furthermore, characteristics for 2-groups73,74 and traces54,75 may need to be studied further to better understand what gauge invariant combinations are possible. Although the number of higher gauge theory examples are increasing76,77,39,69,38,14,41 some work is still required to solidify the role of higher gauge theory in lattice gauge theory and other areas of physics. Other lattice gauge theory approaches existed earlier78,79,80 with a renewed interest in Refs. 70 and 81, but it is not clear to us how these approaches to higher lattice gauge theory are related to the rest of the literature mentioned above.

In the realm of string theory and M-theory, beginning with early studies of Witten, Myers, and others,82,37 ideas for the construction of non-Abelian gauge theories on a stack of D-branes83 and their low energy effective actions were discussed. These effective actions are swarmed with higher form non-Abelian gauge fields, but the precise mathematical formulation is still lacking. Nevertheless, it is likely that non-Abelian differential cohomology13 is relevant based on the recent work on M5-branes in which it plays an essential role.39 Some arguments used to describe such effective actions are not always entirely straightforward and involve consistency conditions (such as T-duality37 and scattering amplitude calculations84) rather than direct derivations. It is therefore possible that a more thorough investigation may involve understanding non-perturbative effects, one of which is dictated by transport. On the other hand, due to the non-commutative nature of the normal coordinates to these branes,85 this may involve a modification of such transport to the setting of non-commutative geometry. These and many other ideas have also been briefly discussed in Ref. 71, and several such open questions can be found there.

We express our sincere thanks to Urs Schreiber and Radboud University in Nijmegen, Holland, who hosted us for several productive days in the summer of 2012 during which a preliminary version of some ideas here were prepared and presented there. We also thank Urs for many helpful comments and suggestions. We would like to thank Stefan Andronache, Sebastian Franco, Cheyne Miller, V. P. Nair, Xing Su, Steven Vayl, Scott O. Wilson, and Zhibai Zhang for discussions, ideas, interest, and insight. Most of this work was done when the author was at the CUNY Graduate Center under the NSF Graduate Research Fellowship Grant No. 40017-01-04 and during a Capelloni Dissertation Fellowship. This work is an updated version of a part of the author’s Ph.D. thesis.88 

C

category/2-category

G

group

BG

one-object groupoid

σ: FG

natural transformation from functor F to G

G(H,G,τ,α)

crossed module

BG

one-object 2-groupoid

A

1-form potential

M

smooth manifold

1

identity matrix

triv(γ)

local transport along a path γ

P1M

path groupoid of M

g

(thin) gauge transformation

P2M

path 2-groupoid of M

triv(Σ)

local transport along a bigon Σ

B

2-form potential

F

2-form curvature of A

φ

fat gauge transformation

H

3-form curvature of (A, B)

(h,g,τ̲,α̲)

differential crossed module

Here, we briefly review the infinitesimal version (h,g,τ̲,α̲) of a Lie crossed module (H, G, τ, α) including the many relations that these maps satisfy. We also make some comments on differential forms with values in g and h. This information can also be found in many articles on the subject of higher gauge theory such as Refs. 15 and 11, especially Waldorf’s concise one page formula sheet in Appendix A of Ref. 25. Martins and Miković also have an exceptionally clear and thorough exposition in Sec. 2.1 of Ref. 69.

τ̲:hg is the derivative of τ: HG at the identity and is a Lie algebra homomorphism since τ is a Lie group homomorphism. Notice that α can be equivalently described as a function α: G × HH that is a group homomorphism in each component separately. As a result, for any fixed gG, αg: HH is a Lie group homomorphism and hence has a derivative at the identity denoted by αg̲:hh. This map, besides being a Lie algebra homomorphism, satisfies the additional property that

(A1)

for all Yh and gG. Similarly, although α: G × HH is not a group homomorphism, it is smooth and its derivative α̲:g×hh is a well-defined linear map. It is a derivation once the g coordinate is fixed, i.e.,

(A2)

for all Xg and Y,Zh.α̲ also satisfies

(A3)

for all X,Xg and Yh. Finally,

(A4)

and

(A5)

for all Xg and Y,Zh.

Once combined with differential forms, the maps α̲ and τ̲ are extended in the appropriate way (see Ref. 56, Part II, Chap. 3, in the section on the Bianchi identity for details on differential forms with values in Lie algebras). For instance, α̲ is a graded derivation in its second coordinate. To clarify the notation used throughout, consider differential forms AΩ1(M;g),FΩ2(M;g),φΩ1(M;h),BΩ2(M;h). When we write expressions such as α̲A(φ) or α̲A(B), we mean the following. First, let {ta}a{1,,dimg} be a basis for g and {sb}b{1,,dimh} be a basis for h. Then

(A6)

where a summation over repeated indices is assumed and where Aa, φbΩ1(M) and Fa, BbΩ2(M) for all indices. Then by definition,

(A7)

and similarly for any other forms. Because we use Lie algebra valued forms, the bracket is graded. For instance,

(A8)

but

(A9)

since φ is a 1-form and B is a 2-form. The last two equalities follow if we think of our Lie algebras as coming from matrix Lie algebras, which we often do. The general formula is

(A10)

where |ω| and |η| are the degrees of the forms ωΩ|ω|(M;h) and ηΩ|η|(M;h). Other properties are derived as needed in calculations in the article.

This appendix serves to prove the convergence of the surface-ordered product (3.50) as n and to also prove upper bounds on the kth order terms when expanded out. For this, we will first relax our conditions and work with arbitrary partitions of the unit square. We will follow the conventions of Munkres and use the results there without further Ref. 86. The surface-ordered product is well-defined for each partition P and will be denoted by trivP. We will also use the notation γs,t to denote the path defined after Theorem 3.8. It will be helpful to define the function

(B1)

Proof of Proposition 3.5.
By smoothness of A, B, and parallel transport along paths, B is a continuous function. First note that for any point (si, tj) ∈ [0, 1] on the intersection points of the grid formed by some partition P,
(B2)
where γsi,tjQ is the path obtained from expanding out the 2-group multiplication with respect to the partition Q. The ordering on partitions is given by refinement, and the above limit is taken over all refinements of P. This limit is valid due to the smoothness of all expressions. Visually, this limit is also reasonable. For example, consider the following set of refinements and how γsi,tjQ looks like with respect to these partitions
Taking the limit over partitions
shows that the paths converge to γs,t and the value of the alpha action along the path at B depends only on the point (s, t). Hence, the partial products of terms of the form
(B3)
which define trivP to lowest order, converge if and only if the sum of terms of the form
(B4)
converges (cf. Sec. 8.10 in Ref. 59). However, the limit of the sum of these terms over all partitions is exactly the definition of the Riemann integral of the (continuous) function B over the unit cube
(B5)
Since the integral exists, the sum converges and hence the product converges.

Proof of Proposition 3.6.
For the first claim, set Mn to be the maximum value of the norm of the expressions of the form αa56sa46sa36sa35ta25s(B14). Then
(B6)
for all k ∈ {0, 1, …, n2}. For the second claim, we note that limnMn converges to a finite value due to the arguments preceding the Proof of Proposition 3.5 in this appendix. This is because the sequence of such terms converges, upon refinement, to a term of the form αtriv(γs,t)̲B(s,t). Set
(B7)
which is finite and well-defined by compactness and smoothness of all inputs. For each ϵ > 0, let ÑϵN be large enough so that |MnM̃|<ϵ for all nÑϵ. Choose any ϵ > 0 and set MM̃+ϵ and NmaxÑϵ,2k+1. From this, it follows that
(B8)
for all k ∈ {0, 1, …, n2} for all nN. The final claim follows from the triangle inequality and the fact that ϵ can be taken to be arbitrarily small.

This appendix serves to give a rigorous proof of Lemma 3.7. For the proof of this lemma, it is useful to rewrite Sn,k as

(C1)

where it is understood that any sum operation on the left acts on everything to the right. Before working out this summation to obtain a more explicit formula, for each nZ+, define the function

(C2)

Explicitly, this simplifies to Ref. 87 

(C3)

where δqp is the Kronecker delta function and Br is the Bernoulli number defined, for instance, by the coefficients in the power series expansion (thought of as a formal power series in the variable x)

(C4)

The first few of these Bernoulli numbers are

(C5)

while the first few ϕn are

(C6)

Examining ϕn(p) a little more, one immediately notices the crucial result

(C7)

Now, Sn,k can be written as a polynomial in the ϕn’s

(C8)

where

(C9)

and so on (a more explicit formula will be given momentarily). For example, one obtains the following expressions for small values of k:

(C10)

Looking back at the expressions for Sn,k, one sees that there is a recursion relation for Sn,k. Setting Sn,0 ≔ 1, this recursion relation reads

(C11)

This recursion relation can be used to express Sn,k purely in terms of the ϕn’s and is given by

(C12)

where it is understood that the sum terminates earlier if any of the j’s are larger than 1. For example, if there are s of them, then

(C13)

Therefore, fixing k, one obtains

(C14)

To see this, first notice that jr = 1 for all r ∈ {1, , s} when s = k in which case the denominator and numerator in (C14) are equal and the limit is 1. However, when s < k, by the formula for ϕn(p) in (C3) and the asymptotics of this given in (C7),

(C15)

Hence,

(C16)

Finally going back to Rn,k and using this fact gives

(C17)
1.
C. J.
Wood
,
J. D.
Biamonte
, and
D. G.
Cory
, “
Tensor networks and graphical calculus for open quantum systems
,”
Quantum Inf. Comput.
15
(
9-10
),
759
811
(
2015
); e-print arXiv:1111.6950 [quant-ph].
2.
S.
Abramsky
and
B.
Coecke
, “
A categorical semantics of quantum protocols
,” in
Proceedings of the 19th IEEE Conference on Logic in Computer Science (LiCS’04)
(
IEEE Computer Science Press
,
2004
); e-print arXiv:0402130 [quant-ph].
3.
R.
Orús
, “
A practical introduction to tensor networks: Matrix product states and projected entangled pair states
,”
Ann. Phys.
349
,
117
158
(
2014
).
4.
N.
Bonesteel
, “
Topological quantum computation
,” in
Lecture series given at the Workshop Mathematics of Topological Phases of matter on June 1, 2017
,
Simons Center for Geometry and Physics
,
2017
, Video available at http://scgp.stonybrook.edu/video_portal/video.php?id=3227.
5.
U.
Schreiber
and
K.
Waldorf
, “
Smooth functors vs. differential forms
,”
Homol., Homotopy Appl.
13
,
143
203
(
2011
); e-print arXiv:0802.0663 [math.DG].
6.
J.
Bénabou
, “
Introduction to bicategories
,” in
Reports of the Midwest Category Seminar
, Lecture Notes in Mathematics (
Springer
,
Berlin
,
1967
), Vol. 47, pp.
1
77
.
7.
A.
Joyal
,
R.
Street
, and
D.
Verity
, “
Traced monoidal categories
,”
Math. Proc. Cambridge Philos. Soc.
119
(
3
),
447
468
(
1996
).
8.
J. C.
Baez
and
J.
Huerta
, “
An invitation to higher gauge theory
,”
Gen. Relativ. Gravitation
43
,
2335
2392
(
2011
); e-print arXiv:1003.4485 [hep-th].
9.
J. C.
Baez
and
A.
Lauda
, “
Higher dimensional algebra V: 2-groups
,”
Theory Appl. Categories
12
,
423
491
(
2004
); e-print arXiv:0307200 [math.QA].
10.
H.
Pfeiffer
, “
Higher gauge theory and a non-Abelian generalization of 2-form electrodynamics
,”
Ann. Phys.
308
,
447
477
(
2003
); e-print arXiv:0304074 [hep-th].
11.
F.
Girelli
and
H.
Pfeiffer
, “
Higher gauge theory—Differential versus integral formulation
,”
J. Math. Phys.
45
,
3949
3971
(
2004
); e-print arXiv:0309173 [hep-th].
12.
U.
Schreiber
and
K.
Waldorf
, “
Connections on non-Abelian gerbes and their holonomy
,”
Theory Appl. Categories
28
,
476
540
(
2013
); e-print arXiv:0808.1923 [math.DG].
13.
U.
Schreiber
, “
Differential cohomology in a cohesive topos
,” e-print arXiv:1310.7930 [math-ph] (
2013
), updated at https://ncatlab.org/schreiber/show/differential+cohomology+in+a+cohesive+topos.
14.
A. J.
Parzygnat
, “
Gauge invariant surface holonomy and monopoles
,”
Theory Appl. Categories
30
(
42
),
1319
1428
(
2015
); e-print arXiv:1410.6938 [math-ph].
15.
J. C.
Baez
and
U.
Schreiber
, “
Higher gauge theory: 2-connections on 2-bundles
,” preprint arXiv:0412325 [hep-th] (
2004
), version available at http://math.ucr.edu/home/baez/2conn.pdf.
16.
M.
Kalb
and
P.
Ramond
, “
Classical direct interstring actions
,”
Phys. Rev. D
9
,
2273
2284
(
1974
).
17.
C.
Teitelboim
, “
Gauge invariance for extended objects
,”
Phys. Lett. B
167
(
1
),
63
68
(
1986
).
18.
M.
Henneaux
and
C.
Teitelboim
, “
p-form electrodynamics
,”
Found. Phys.
16
,
583
617
(
1986
).
19.
I. Y.
Aref’eva
, “
Non-Abelian Stokes formula
,”
Theor. Math. Phys.
43
,
353
356
(
1980
).
20.
Y.
Makeenko
,
Methods of Contemporary Gauge Theory
(
Cambridge University Press
,
2002
).
21.
J. H. C.
Whitehead
, “
Combinatorial homotopy. II
,”
Bull. Am. Math. Soc.
55
,
453
496
(
1949
).
22.
R.
Attal
, “
Combinatorics of non-Abelian gerbes with connection and curvature
,”
Ann. Fond. Broglie
29
,
609
634
(
2004
); e-print arXiv:0203056 [math-ph].
23.
L.
Breen
and
W.
Messing
, “
Differential geometry of gerbes
,”
Adv. Math.
198
,
732
846
(
2005
); e-print arXiv:0106083 [math.AG].
24.
C.
Wockel
, “
Principal 2-bundles and their gauge 2-groups
,”
Forum Math.
23
(
3
),
565
610
(
2011
); e-print arXiv:0803.3692 [math.DG].
25.
K.
Waldorf
, “
A global perspective to connections on principal 2-bundles
,”
Forum Math.
30
,
809
843
(
2017
); e-print arXiv:1608.00401 [math.DG].
26.
G.
Segal
, “
The definition of conformal field theory
,” in
Topology, Geometry and Quantum Field Theory, London Mathematical Society Lecture Note Series
, edited by
U.
Tilmann
(
Cambridge University Press
,
Cambridge
,
2004
), Vol. 308, pp.
421
577
.
27.
M.
Atiyah
, “
New invariants of 3- and 4-dimensional manifolds
,”
Proc. Symp. Pure Math.
48
,
285
299
(
1988
).
28.
J. C.
Baez
and
J.
Dolan
, “
Higher-dimensional algebra and topological quantum field theory
,”
J. Math. Phys.
36
,
6073
6105
(
1995
); e-print arXiv:9503002 [math.QA].
29.
J.
Lurie
, “
On the classification of topological field theories
,”
Curr. Dev. Math.
2008
,
129
280
; e-print arXiv:0905.0465 [math.CT].
30.
U.
Schreiber
and
K.
Waldorf
, “
Parallel transport and functors
,”
Homotopy Relat. Struct.
4
,
187
244
(
2009
); e-print arXiv:0705.0452 [math.DG].
31.
U.
Schreiber
and
K.
Waldorf
, “
Local theory for 2-functors on path 2-groupoids
,”
J. Homotopy Relat. Struct.
12
(
3
),
617
658
(
2017
); e-print arXiv:1303.4663 [math.GN].
32.
A.
Caetano
and
R. F.
Picken
, “
An axiomatic definition of holonomy
,”
Int. J. Math.
05
,
835
848
(
1994
).
33.
K. G.
Wilson
, “
Confinement of quarks
,”
Phys. Rev. D
10
(
8
),
2445
2458
(
1974
).
34.
D.
Gaiotto
,
A.
Kapustin
,
N.
Seiberg
, and
B.
Willett
, “
Generalized global symmetries
,”
J. High Energy Phys.
2015
,
172
; e-print arXiv:1412.5148 [hep-th].
35.
M.
Mackaay
and
R.
Picken
, “
Holonomy and parallel transport for Abelian gerbes
,”
Adv. Math.
170
,
287
339
(
2002
); e-print arXiv:0007053 [math.DG].
36.
T.
Tradler
,
S. O.
Wilson
, and
M.
Zeinalian
, “
Equivariant holonomy for bundles and Abelian gerbes
,”
Commun. Math. Phys.
315
,
39
108
(
2012
); e-print arXiv:1106.1668 [math.DG].
37.
R. C.
Myers
, “
Dielectric-branes
,”
J. High Energy Phys.
1999
(
12
),
022
; e-print arXiv:9910053 [hep-th].
38.
S.
Palmer
and
C.
Sämann
, “
M-brane models from non-Abelian gerbes
,”
J. High Energy Phys.
2012
,
010
; e-print arXiv:1203.5757 [hep-th].
39.
D.
Fiorenza
,
H.
Sati
, and
U.
Schreiber
, “
Multiple M5-branes, string 2-connections, and 7d nonabelian Chern-Simons theory
,”
Adv. Theor. Math. Phys.
18
,
229
321
(
2014
); e-print arXiv:1201.5277 [hep-th].
40.
J.
Bagger
and
N.
Lambert
, “
Modeling multiple M2’s
,”
Phys. Rev. D
75
,
045020
(
2007
); e-print arXiv:0611108 [hep-th].
41.
C.
Sämann
and
L.
Schmidt
, “
Towards an M5-brane model I: A 6d superconformal field theory
,”
J. Math. Phys.
59
(
4
),
043502
(
2018
); e-print arXiv:1712.06623 [hep-th].
42.
K.
Waldorf
, “
Parallel transport in principal 2-bundles
,”
Higher Structures
2
(
1
),
57
115
(
2018
).
43.
C.-S.
Chu
, “
A theory of non-Abelian tensor gauge field with non-Abelian gauge symmetry G x G
,”
Nucl. Phys.
B866
,
43
57
(
2013
); e-print arXiv:1108.5131 [hep-th].
44.
U.
Schreiber
, “
Higher pre-quantized geometry
,” in
Lecture Given at the Workshop New Spaces for Mathematics and Physics Formal and Philosophical Reflections on October 1, 2015
(
Institute Henri Poincaré
,
2015
).
45.
S.
Gukov
and
A.
Kapustin
, “
Topological quantum field theory, nonlocal operators, and gapped phases of gauge theories
,” preprint arXiv:1307.4793 [hep-th] (
2013
).
46.
E.
Sharpe
, “
Notes on generalized global symmetries in QFT
,”
Fortschr. Phys.
63
(
11-12
),
659
682
(
2015
); e-print arXiv:1508.04770 [hep-th].
47.
H.-M.
Chan
and
S. T.
Tsou
,
Some Elementary Gauge Theory Concepts
(
World Scientific
,
1993
).
48.
B. C.
Hall
,
Quantum Theory for Mathematicians
, Graduate Texts in Mathematics (
Springer
,
New York
,
2013
), Vol. 267.
49.
A.
Kitaev
and
L.
Kong
, “
Models for gapped boundaries and domain walls
,”
Commun. Math. Phys.
313
(
2
),
351
373
(
2012
); e-print arXiv:1404.2634 [hep-th].
50.
A.
Joyal
and
R.
Street
, “
The geometry of tensor calculus. I
,”
Adv. Math.
88
(
1
),
55
112
(
1991
).
51.
S.
Mac Lane
, “
Natural associativity and commutativity
,”
Rice Univ. Stud.
49
(
4
),
28
46
(
1963
).
52.
S.
Weinberg
,
The Quantum Theory of Fields
(
Cambridge University Press
,
1995
), Vol. 1.
53.
J.
Tappe
, “
Irreducible projective representations of finite groups
,”
Manuscr. Math.
22
,
33
45
(
1977
).
54.
K.
Ponto
and
M.
Shulman
, “
Shadows and traces in bicategories
,”
J. Homotopy Relat. Struct.
8
,
151
200
(
2013
); e-print arXiv:0910.1306 [math.CT].
55.
U.
Schreiber
, “
Higher prequantum geometry
,”preprint arXiv:1601.05956 [math-ph] (
2016
).
56.
J.
Baez
and
J. P.
Muniain
,
Gauge Fields, Knots and Gravity
, Series on Knots and Everything (
World Scientific
,
1994
), Vol. 4.
57.
D.
Husemöller
,
Fibre Bundles
, Graduate Texts in Mathematics, 3 ed. (
Springer-Verlag
,
1994
), Vol. 20.
58.
J. W.
Milnor
and
J. D.
Stasheff
,
Characteristic Classes
, Annals of Mathematics Studies (
Princeton University Press
,
1974
), Vol. 76.
59.
J. H. M.
Wedderburn
,
Lectures on Matrices
(
Dover Publications, Inc.
,
New York
,
1964
).
60.
E.
Nelson
,
Topics in Dynamics. I: Flows
, Mathematical Notes (
Princeton University Press
,
Princeton, NJ
;
University of Tokyo Press
,
Tokyo
,
1969
).
61.
S.
Abbott
,
Understanding Analysis
, Undergraduate Texts in Mathematics, 2 ed. (
Springer
,
2015
).
62.
V.
Bryant
,
Metric Spaces
(
Cambridge University Press
,
1985
).
63.
E.
Witten
, “
Conformal field theory in four and six dimensions
,” in
Topology, Geometry and Quantum Field Theory
, London Mathematical Society Lecture Note Series, edited by
U.
Tilmann
(
Cambridge University Press
,
Cambridge
,
2004
), Vol. 308, pp.
405
419
; e-print arXiv:0712.0157 [math.RT].
64.
D.
Berwick-Evans
and
D.
Pavlov
, “
Smooth one-dimensional topological field theories are vector bundles with connection
,” preprint arXiv:1501.00967 [math.AT] (
2015
).
65.
R.
Picken
, “
TQFT’s and gerbes
,”
Algebr. Geom. Topol.
4
,
243
272
(
2004
); e-print arXiv:0302065 [math.DG].
66.
J. C.
Baez
,
A.
Baratin
,