Author Notes
A variational quantum eigensolver (VQE) solves the ground state problem of a given Hamiltonian by finding the parameters of a quantum circuit Ansatz that minimizes the Hamiltonian expectation value. Among possible quantum circuit Ansätze, the Hamiltonian variational Ansatz (HVA) is widely studied for quantum many-body problems as the Ansatz with sufficiently large depth is theoretically guaranteed to express the ground state. However, since the HVA shares the same symmetry with the Hamiltonian, it is not necessarily good at finding symmetry-broken ground states that prevail in nature. In this paper, we systematically explore the limitations of the HVA for solving symmetry-broken systems and propose an alternative quantum circuit Ansatz with symmetry-breaking layers. With extensive numerical simulations, we show that the proposed Ansatz finds the ground state in depth significantly shorter than the bare HVA when the target Hamiltonian has symmetry-broken ground states.
I. INTRODUCTION
Experimental progress in controlling quantum systems has allowed for the first quantum computational advantage claim1 in recent years. This achievement initiated the race between noisy intermediate-scale quantum2 (NISQ) computers and the state-of-the-art classical algorithms.3 Now, a NISQ device with hundreds of qubits is available,4 and those devices are expected to solve a practical computational problem beyond the reach of classical computers. Among possible applications, a variational quantum eigensolver (VQE)5,6 that solves the ground state problem of quantum many-body Hamiltonian has gained much attention recently (see Ref. 7 for a recent review).
The VQE combines a parameterized quantum circuit and a classical optimization algorithm: A quantum circuit evaluates the expectation value of the Hamiltonian and its derivatives for the output quantum state, whereas a classical optimizer finds better parameters that minimize the energy. As solving the ground state problem of quantum Hamiltonians is difficult for classical computers, one may easily get a possible advantage of VQEs. Still, it is less understood which Ansatz and classical optimizer should be used to solve a given Hamiltonian efficiently.
One of the most widely studied Ansätze for solving many-body spin Hamiltonians is the Hamiltonian variational Ansatz (HVA).8,9 Inspired by a short-depth quantum algorithm for solving combinatory optimization problems,10 the HVA is constructed using rotating gates whose generators are the terms of the Hamiltonian, which resembles the Suzuki–Trotter decomposition.11 Even though the HVA solves many different Hamiltonians reliably,12–14 however, it is not necessarily good at solving problems with a symmetry-broken nature, which prevails in many-body systems. Indeed, there are local Hamiltonians whose ground states cannot be generated by this Ansatz in a constant depth, albeit such a circuit exists, as the circuit obeys the same symmetry as the Hamiltonian.15
In this paper, we devise a symmetry-breaking Ansatz and explore its power for solving many-body Hamiltonians. We construct our Ansatz by adding symmetry-breaking layers to the HVA. The power of those symmetry-breaking Ansätze is numerically tested using the transverse-field Ising (TFI) and the transverse-field cluster (TFC) models, which have the and symmetries, respectively. Our results show that the symmetry-breaking Ansatz can find the ground state in a constant depth when the Hamiltonian has symmetry-broken ground states, whereas the bare HVA requires a linear depth for solving the same problem. We also show that, by adding a symmetry penalizing term to the loss function, one can choose a particular symmetry-broken state among degenerate ground states.
II. PRELIMINARIES
A. Hamiltonian variational Ansatz
From the above construction, it is clear that the HVA describes the ground state when D is sufficiently large.
However, it is still unclear whether the depth provided by the HVA is optimal (up to a constant factor) for describing the ground state. In this paper, we show that the answer is negative using symmetry-broken systems.
B. Phases related to symmetries
Quantum many-body systems often break a symmetry characterized by a group. When it happens, a system has more than two ground states connected by a group generator. For example, if the symmetry is broken, we can find two ground states |GS0⟩ and |GS1⟩ such that |GS0⟩ = G|GS1⟩, where is a generator of this symmetry (or a representation of it, more generally), which satisfies 16,17 (see also Ref. 18 for a general introduction).
Conventionally, the symmetry-breaking phenomena are understood using Landau’s theory.19 In this theory, symmetry-broken ground states are determined by an order parameter defined by an expectation value of a local observable O. For the symmetry-broken ground states, one has an order parameter O0 = ⟨GS0|O|GS0⟩ and the other has O1 = ⟨GS1|O|GS1⟩ = −O0. Contrarily, symmetry-preserving ground states have ⟨GS|O|GS⟩ = 0. Thus, one can diagnose whether the system breaks the symmetry by computing the order parameter.
A phase transition occurs when the ground state breaks (or restores) a new symmetry as the control parameter of the Hamiltonian changes. For example, the TFI given in Eq. (2) has the strength of the external magnetic field h as a parameter. The Hamiltonian has the symmetry under the spin-flip; it does not change under the transformation Zi ↔ −Zi. However, the symmetry of the ground state is broken when |h| < 1, i.e., it has two symmetry-broken ground states when |h| < 1 in the thermodynamic limit. In Fig. 1, we plot the order parameter ⟨Zi⟩ as a function of h20 for one of the ground states. One finds that ⟨Zi⟩ ≠ 0 for h < 1, whereas ⟨Zi⟩ = 0 for all h > 1. This implies that a phase transition occurs at h = 1, and the symmetry is broken when h < 1.
The order parameter ⟨Zi⟩ as a function of h computed for the ground state of the transverse-field Ising model given by Eq. (2). When h < 1, there are two ground states, and the order parameter is computed for one of the ground states. Due to the translational invariance, ⟨Zi⟩ does not depend on i.
The order parameter ⟨Zi⟩ as a function of h computed for the ground state of the transverse-field Ising model given by Eq. (2). When h < 1, there are two ground states, and the order parameter is computed for one of the ground states. Due to the translational invariance, ⟨Zi⟩ does not depend on i.
In general, true symmetry breaking is only observed in the thermodynamic limit, i.e., a system often has a symmetry-preserving unique ground state for any finite N even when it has degenerated ground states in the thermodynamic limit. Thus, we need a diagnostic that can tell whether a system breaks symmetry using the ground states for finite-size systems. Correlation functions are widely used for that purpose. For example, in the 1D system, consider local observables {Oi} where each Oi supports site i. Then, the correlation function of the ground state is given by C(i, j) = ⟨OiOj⟩ − ⟨Oi⟩⟨Oj⟩, where ⟨⋅⟩ is the expectation value for the ground state from a system of size N. If C(i, j) converges to a constant as |j − i| → ∞ (we also need to increase N accordingly), the system has symmetry broken ground states in the thermodynamic limit.18
With the advent of topological physics,21–23 however, systems with degenerate ground states related by a symmetry that an order parameter or correlation functions cannot diagnose are discovered. Such an example includes the AKLT model,24 which has been extensively studied over decades. The model has two ground states in the open boundary condition, but the correlation function decays exponentially, and none of the local observable can distinguish the two states. This type of phase is named symmetry-protected topological (SPT) phase.25
To incorporate both the conventional (symmetry-broken) and the SPT phases under a single theory, a modern definition of quantum phases has been introduced.26,27 This theory is technically involved, but in essence, it tells that two Hamiltonians (H0 and H1) are in the same phase if there exists a local and gapped parameterized Hamiltonian H(τ) (0 ≤ τ ≤ 1) such that H(0) = H0, H(1) = H1, and H(τ) has the same symmetry for all 0 ≤ τ ≤ 1 (i.e., there exists an operator P such that [P, H(τ)] = 0). Here, we say H(τ) is gapped when the difference between the lowest and the first-excited eigenvalues of H(τ) is bounded below by a constant regardless of the system size N. This definition generalizes Landau’s theory since the gap closes in the thermodynamic limit during symmetry-breaking phase transitions.
Since the adiabatic evolution of a gapped Hamiltonian is described by a constant-depth quantum circuit,28 the above definition is directly translated into a language of quantum circuits. Let us assume that Hamiltonians H0 and H1 have ground states |ψ0⟩ and |ψ1⟩, respectively. Then, |ψ0⟩ and |ψ1⟩ are in the same SPT phase if there exists a unitary operator U such that (1) U can be expressed by a local and constant depth quantum circuit, (2) |ψ1⟩ = U|ψ0⟩, and (3) [U, P] = 0, where P is the symmetry generator.
If we drop the symmetry restriction from the definition of the SPT phases, we arrive at the definition of topological phases.29 Namely, two ground states are in different topological phases if we cannot find any local constant-depth circuit U that connects the two states. It is also known that true topological phases do not exist in the one-dimensional spin systems, i.e., one can always find a constant-depth circuit connecting between ground states of any gapped 1D local Hamiltonians.26,27,30,31 In the following section, we will use such a distinction between the SPT and topological phases to design an efficient quantum circuit Ansatz.
III. SYMMETRY BREAKING ANSATZ
The first model we study is the TFI model for N qubits defined in Eq. (2), where we impose the periodic boundary condition ZN+1 = Z1. This model has two distinct (ferromagnetic and paramagnetic) phases depending on the strength of h protected by the spin-flip symmetry P = ∏iXi.32 The critical point of this model h = 1 is well known.20 This implies that if there is a circuit that commutes with P and connects two ground states in different phases (i.e., ground states for h > 1 and h < 1), the circuit depth must be larger than a constant.26 On the other hand, a finite-depth circuit that connects two different ground states exists if we do not restrict such symmetry to a circuit since the system is gapped unless h = 1.26,31
Symmetry-preserving/breaking Ansätze with the same D [Eqs. (6) and (9)] have different numbers of total gates. However, the number of entangling gates of both circuits is the same. When implemented on NISQ devices, the latter dominates the quality of the output states since the expected fidelity of entangling gates is much lower than single-qubit gates (see, e.g., Refs. 34 and 35). Thus, in the following discussion, we mainly compare the results from these Ansätze with the same D.
To observe the effect of symmetry-breaking layers, we simulated the VQE for the TFI using the bare HVA and our symmetry-breaking Ansatz given in Eq. (9). We optimize parameters of the Ansatz using the quantum natural gradient:36–38 For each epoch t, we update parameters as , where η is the learning rate, is the quantum Fisher matrix, and λt is a (step dependent) regularization constant. We choose this optimizer as it works more reliably in solving the ground problem for both classical neural networks39 and VQEs.13 For the quantum Fisher matrix, we use the centered one 37 mostly (unless otherwise stated), but the uncentered one 36 is also considered when it improves the performance. The difference between the two can be understood using the notion of the projected Hilbert space.38 For the hyperparameters, we typically use η = 0.01 and λt = max(100.0 × 0.9t, 10−3).
We present the optimized normalized energies for different N and D in Fig. 2 when h = 0.5. We have used the Fisher matrix with the centering term , and initial values of parameters {ϑj, κj ϕj} are sampled from the normal distribution with σ = 0.001 besides D = 3, 5 at Fig. 2(b). When D = 3 (all N) and 5 (for N ≥ 12), this initialization does not give any better energy than the Ansatz without symmetry-breaking layers. We instead found that initializing parameters for the symmetry breaking layers {ϕj} with samples from finds better optima in these cases. However, this initialization does not change the results for D = 4 and performs even worse when D ≥ 6 (see the Appendix for detailed comparisons).
Converged normalized energy as a function of system size N and the number of blocks D (a) without and (b) with symmetry breaking layers for the transverse field Ising (TFI) model where EGS is the true ground state energy. We have run 12 independent VQE runs and taken the best-optimized energy for each N and D. Initial parameters are sampled randomly from the normal distribution besides D = 3, 5 at (b) where we have added 2π/D to the parameters of the symmetry breaking layers (see the main text for details). Note that the total number of layers L is 2D for the bare HVA but 3D for our symmetry-breaking Ansatz.
Converged normalized energy as a function of system size N and the number of blocks D (a) without and (b) with symmetry breaking layers for the transverse field Ising (TFI) model where EGS is the true ground state energy. We have run 12 independent VQE runs and taken the best-optimized energy for each N and D. Initial parameters are sampled randomly from the normal distribution besides D = 3, 5 at (b) where we have added 2π/D to the parameters of the symmetry breaking layers (see the main text for details). Note that the total number of layers L is 2D for the bare HVA but 3D for our symmetry-breaking Ansatz.
Figure 2(a) shows that the ground state is only found for D ≥ N/2 when symmetry-breaking layers are absent, which is consistent with Refs. 12 and 13. On the other hand, results with symmetry-breaking layers [Fig. 2(b)] clearly demonstrate that converged energies are significantly improved for 5 ≤ D < N/2. Most importantly, converged normalized energies are for all N when D ≥ 9, which implies that our Ansatz finds a constant-depth circuit for solving the ground state. In addition, the results show that there is a finite-size effect up to N = 18 where the accurate ground state is only obtained when D ≥ N/2.
However, the converged energies and the true ground state show a large discrepancy when D = 4. In fact, the results for D = 4 are even worse than those of D = 3, which signals that the optimizer gets stuck in local minima. This type of convergence problem is already observed in Ref. 13. To obtain a better convergence for D = 4, we employ a transfer learning technique. Instead of starting from a randomly initialized circuit, we insert a block in the middle of the converged circuit from D = 3 and perturb all parameters by adding small numbers sampled from (where we typically use σ′ = 0.01). We then optimize the full circuit using the quantum natural gradient.
We show converged energies from the transfer learning in Fig. 3. Figure 3(a) illustrates that the transfer learning succeeds in finding a better optimum for D = 4. However, the result for (N, D) = (10, 5) is worse than that of the random initialization, which implies that the transfer learning does not necessarily find the global optima. We additionally plot a learning curve for (N, D) = (18, 4) in Fig. 3(b) where the initial energy is much higher due to an added perturbation, but it eventually finds a quantum state with lower energy.
(a) Converged normalized energies for system size N and circuit depth D (a) from the transfer learning. Results for D = 3 are from the random initialization (which are shown in Fig. 2). For D = 4, we have trained a circuit after adding a randomly initialized block in the middle of the converged circuit from D = 3 (for the same N). Likewise, we obtained results for D = 5 using the converged circuits from D = 4. (b) A learning curve from the transfer learning when (N, D) = (18, 4). The dashed line indicates the converged energy from (N, D) = (18, 3).
(a) Converged normalized energies for system size N and circuit depth D (a) from the transfer learning. Results for D = 3 are from the random initialization (which are shown in Fig. 2). For D = 4, we have trained a circuit after adding a randomly initialized block in the middle of the converged circuit from D = 3 (for the same N). Likewise, we obtained results for D = 5 using the converged circuits from D = 4. (b) A learning curve from the transfer learning when (N, D) = (18, 4). The dashed line indicates the converged energy from (N, D) = (18, 3).
Converged normalized energies for system size N and the number of blocks D (a) without and (b) with symmetry breaking layers for the transverse field cluster (TFC) model.
Converged normalized energies for system size N and the number of blocks D (a) without and (b) with symmetry breaking layers for the transverse field cluster (TFC) model.
In summary, our Ansatz does not find a constant-depth circuit for solving the ground state of the TFC based on the results up to N = 20. We attribute this to the finite-size effect and expect that a much larger system size N (beyond our computational capacity) is required to take advantage of symmetry-breaking Ansatz.
IV. PREPARING A SYMMETRY BROKEN GROUND STATE
The main idea is minimizing the expectation value of O = Hcluster + α1P1 + α2P2 for suitable α1 and α2, instead of the Hamiltonian itself. We expect the obtained state to be an eigenstate with the corresponding eigenvalue ±1 of Pi when sign(αi) = ∓1.
For example, we study the VQE for preparing the ground state with P1 = P2 = −1 using (N, D) = (14, 8). We have tested VQEs with the varying learning rates η ∈ [0.01, 0.025, 0.05, 0.1] and α1 = α2 = α ∈ [0.1, 0.3, 0.5, 1.0, 2.0, 4.0] and found that the desired state can be prepared by the best for η = 0.025, α = 2.0. In particular, all VQE runs have converged to ⟨P1⟩ = ⟨P2⟩ = −1 (accuracy within 10−8) with these values of η and α. The resulting learning curve is shown in Fig. 5(a). However, the converged energy for D = 8 is significantly worse than the results from other models we have studied above (we still note that this is comparable to results from the Adam optimizer, which gives normalized energies ).14 We thus further study whether increasing D helps the convergence in Fig. 5(b). The results show that the converged energies are getting more accurate as we increase D. Precisely, 47 instances out of 48 independent VQEs runs have converged to when D = 20. This contrasts the barren plateaus narrative42 that more expressive quantum circuits are prone to suffer the optimization problem.43 We believe that the behavior of our circuits is related to the known fact in classical machine learning that over-parameterized networks converge better,44 but we leave a detailed study for future work.
(a) Learning curve of the VQE for preparing the ground state with ⟨P1⟩ = ⟨P2⟩ = −1.0. A circuit with (N, D) = (14, 8) is used. The inset shows the learning curves for ⟨P1⟩ and ⟨P2⟩. (b) Converged normalized energies for difference circuit depths D. For each D, we show converged energies from 48 independent VQE runs.
(a) Learning curve of the VQE for preparing the ground state with ⟨P1⟩ = ⟨P2⟩ = −1.0. A circuit with (N, D) = (14, 8) is used. The inset shows the learning curves for ⟨P1⟩ and ⟨P2⟩. (b) Converged normalized energies for difference circuit depths D. For each D, we show converged energies from 48 independent VQE runs.
V. CONCLUSION AND OUTLOOK
We studied the limitation of the bare HVA for solving a system with symmetry-broken ground states and proposed an Ansatz with symmetry-breaking layers to overcome this problem. Such a symmetry-breaking Ansatz could enhance the overall performance of the VQE. Especially for the transverse-field Ising model, we observed that our symmetry-breaking Ansatz finds a constant-depth circuit for the ground state, whereas the bare HVA requires the depth to be linear in the system size. We further proposed a technique to choose a specific symmetry broken ground state among possible ground states. This was possible by adding a penalizing term to the cost function.
We expect that one can observe similar behavior for continuous symmetry, e.g., U(1) or SU(2). However, given that those symmetries are not broken in 1D,45 we should use 2D models to observe the effect of symmetry-breaking layers. Symmetry-broken 2D spin models are also physically interesting as they are related to the lattice gauge theory.46 However, we leave the detailed study of 2D models for future work, as numerical simulation of such models is more demanding.
Our symmetry-breaking Ansatz can be implemented on real quantum hardware. In this case, one should decompose a gate into the set of gates supported by a target quantum device. For example, a gate used for the transverse-field cluster model is not naturally supported by most quantum computing architectures.47–52 Instead, it can be decomposed into, e.g., , which may increase overall circuit depths. Still, as our symmetry-breaking Ansatz can provide a scaling advantage, there is a certain number of qubits that our circuit can solve the problem in a shorter depth than the bare HVA even after such a decomposition.
Incoherent noise can be a significant problem when implemented on noisy quantum hardware. While a weak incoherent noise may help to find a symmetry-broken ground state when the target state is not entangled,53 we generally expect that the noise destroys the quantumness of the circuit, and the circuit outputs a decohered state that is far from a true ground state.54 As the circuit depth is the main factor limited by this type of noise, our symmetry-breaking Ansatz may again have an advantage over the bare HVA when the noise is considered.
ACKNOWLEDGMENTS
The author thanks David Wierichs for helpful discussions. This project was funded by the Deutsche Forschungsgemeinschaft under Germany’s Excellence Strategy—Cluster of Excellence Matter and Light for Quantum Computing (ML4Q) EXC 2004/1-390534769 and within the CRC network TR 183 (Project Grant No. 277101999) as part of project B01. Numerical simulations were performed on the JUWELS cluster at the Forschungszentrum Juelich.
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
Chae-Yeun Park: Resources (equal); Software (equal); Supervision (equal); Writing – original draft (lead).
DATA AVAILABILITY
The source code for the current paper can be found in the Github repository.55
APPENDIX: OPTIMIZATION OF THE ANSATZ WITH SYMMETRY BREAKING LAYERS
When the Ansatz contains symmetry-breaking layers, an optimization algorithm easily gets stuck in local minima. We here compare several different setups for the transverse field Ising model we have studied in Sec. III. We show results from the transverse Ising model with different initial values and using the centered and uncentered Fisher matrix in Fig. 6. We can see that the result with {ϕj} initialized from and using the centered Fisher matrix is the most natural. However, when D = 3, 5, the results with {ψj} initialized from show better convergence.
Converged normalized energies for the TFI with different optimization setups. We have used the centered and uncentered Fisher matrix for the upper and lower rows, respectively. The left and right columns show results from different initial values for the symmetry-breaking layers. For each N and D, we have taken the best-optimized energy from 12 independent VQE runs.
Converged normalized energies for the TFI with different optimization setups. We have used the centered and uncentered Fisher matrix for the upper and lower rows, respectively. The left and right columns show results from different initial values for the symmetry-breaking layers. For each N and D, we have taken the best-optimized energy from 12 independent VQE runs.
We have also found that using the uncentered Fisher matrix improves convergence for D = 4 where the centered Fisher matrix failed to find an appropriate optimum. Our results suggest that the learning landscape of the VQEs with symmetry-breaking layers is rugged, especially when the parameters are not sufficient to describe the ground state accurately.