The reconstruction of the state of a multipartite quantum mechanical system represents a fundamental task in quantum information science. At its most basic, it concerns a state of a bipartite quantum system whose subsystems are subjected to local operations. We compare two different methods for obtaining the original state from the state resulting from the action of these operations. The first method involves quantum operations called Petz recovery maps, acting locally on the two subsystems. The second method is called matrix (or state) reconstruction and involves local, linear maps that are not necessarily completely positive. Moreover, we compare the quantities on which the maps employed in the two methods depend. We show that any state that admits Petz recovery also admits state reconstruction. However, the latter is successful for a strictly larger set of states. We also compare these methods in the context of a finite spin chain. Here, the state of a finite spin chain is reconstructed from the reduced states of a few neighbouring spins. In this setting, state reconstruction is the same as the matrix product operator reconstruction proposed by Baumgratz et al. [Phys. Rev. Lett. 111, 020401 (2013)]. Finally, we generalize both these methods so that they employ long-range measurements instead of relying solely on short-range correlations embodied in such local reduced states. Long-range measurements enable the reconstruction of states which cannot be reconstructed from measurements of local few-body observables alone and hereby we improve existing methods for quantum state tomography of quantum many-body systems.
I. INTRODUCTION
Consider a bipartite quantum state ρXY that is transformed to a state under the action of local quantum operations and . These local operations could either correspond to (i) undesirable noise (resulting from unavoidable interactions of the quantum system XY with its environment) or they could correspond to (ii) local measurements made by an experimenter doing quantum state tomography. We are interested in determining the conditions under which the state can be transformed back to the original state ρXY with maps that act locally on X′ and Y′. In the case (i), these would be the conditions under which the effect of the noise can be reversed, whereas in the case (ii), these would be the conditions under which reconstruction of the original state from the outcome of the experimenter’s chosen measurements is possible.
The question whether can be transformed back to ρXY can be answered with different methods. If the transformation is to be achieved with quantum operations, an answer is provided by the Petz recovery map2,3 under a condition on the mutual information of the two states. If general linear (not necessarily completely positive) maps are allowed in the transformation, one can use a matrix reconstruction method. This matrix reconstruction method is related to MPO (i.e., matrix product operator) reconstruction and the so-called pseudoskeleton (or CUR) matrix decompositions.1,4,5 In either case, the construction of the maps that transform into ρXY does not require complete information on ρXY if suitable maps and are used. In this case, the transformation can be used for efficient quantum state tomography of ρXY with less measurements than necessary for standard quantum state tomography.
A fundamental quantity in quantum information theory is the quantum relative entropy D(ρ∥σ) between a state ρ and a positive semi-definite operator σ (see Sec. II C for its definition). It acts as a parent quantity for other entropic quantities arising in quantum information theory, e.g., von Neumann entropy, conditional entropy, and mutual information. When ρ and σ are both states, D(ρ∥σ) also has an operational interpretation as a measure of distinguishability between the two states.6,7 One of its most important properties is its monotonicity under the joint action of a quantum operation (say, ). This is also called the data processing inequality (DPI) and is given by
The condition under which the above inequality is saturated was obtained by Petz8 and has found important applications in quantum information theory. Petz proved that equality in the DPI holds if and only if there exists a recovery map, given by a quantum operation which reverses the action of on both ρ and σ, i.e., and . Petz also obtained an explicit form of such a recovery map, which is often called the Petz recovery map. Petz’s condition on the equality in the DPI immediately yields a necessary and sufficient condition under which the conditional mutual information I(A:C|B) of a tripartite state ρABC is zero,3 which in turn is the condition under which strong subadditivity (SSA) of the von Neumann entropy (arguably the most powerful entropic inequality in quantum information theory) is saturated. Petz’s result, when applied to the problem studied in this paper, implies that the original state ρXY can be recovered from the transformed state if and only if the mutual information I(X:Y)ρ of ρXY is equal to the mutual information of .3,9 Moreover, a valid recovery map is a tensor product of maps acting locally on X′ and Y′, each having the structure of a Petz recovery map. A detailed discussion of Petz’s result and of the quantities on which the Petz recovery maps depend is given in Sec. II C.
The data processing inequality of the relative entropy implies a DPI for the mutual information,
The mutual information quantifies the amount of correlations that exist between the two subsystems of a bipartite quantum state. Another measure of such correlations is the operator Schmidt rank10,11 of the state, which we denote as OSR(X:Y)ρ for a bipartite state ρXY [see Eq. (8) for its definition].
In the following, we discuss the main results of this paper. We show that the operator Schmidt rank also satisfies a DPI
where is the state obtained from ρXY via the local quantum operations and , as discussed above. The DPI for the operator Schmidt rank is directly implied by the fact that the matrix rank satisfies rk(MN) ≤ rk(M)rk(N) for any two matrices M and N (see Corollary 11 for details). We show that can be transformed into ρXY with local maps if and only if the DPI of the operator Schmidt rank is saturated. Our proof does not guarantee that the maps that transform into ρXY are completely positive but it also does not require that ρXY and are positive semidefinite or that and are completely positive. The proof proceeds by transforming the reconstruction problem into a reconstruction problem for a general, rectangular matrix. Here, we provide an extension of the known pseudoskeleton decomposition,4,5 which is also known as CUR decomposition and which can reconstruct a low-rank matrix from few of its rows and columns. Our method reconstructs a matrix M from the matrix products LM and MR if the rank of M equals the rank of LMR; L, M, and R are the general rectangular matrices.
We explore the relation between Petz recovery and state/MPO reconstruction for the case of 2, 4, and n parties. State/MPO reconstruction, when compared to Petz recovery, is shown to be possible for a strictly larger set of states but requires more information.
The state of an n-partite quantum system, such as n spins in a linear chain, can be represented as a matrix product operator (MPO) with MPO bond dimensions given by the operator Schmidt ranks OSR(1, …, k: k + 1, …, n) (between the sites 1, …, k and k + 1, …, n;12,13). If the operator Schmidt ranks are all bounded by a constant D, the MPO representation is given in terms of ∼nD2 complex numbers, which is much less than the number of entries of the density matrix of the n-partite quantum system. Baumgratz et al.1 presented a condition under which an MPO representation of the state of an n-partite quantum system can be reconstructed from the reduced states of few neighbouring systems (MPO reconstruction). We will demonstrate that their work implies, for the case where the local operations and are partial traces, that can be transformed into ρXY if the two states have equal operator Schmidt rank.
The ability to reconstruct the state of an n-partite quantum system from reduced states of l < n systems, as provided, e.g., by MPO reconstruction, is advantageous for quantum state tomography of many-body systems. Standard quantum state tomography requires the expectation values of a number of observables which grow exponentially with n. If the full state can be reconstructed from l-body reduced states, then the number of observables grows exponentially with l but only linearly with the number of reduced states. MPO reconstruction uses the reduced states of blocks of l neighbouring sites on a linear chain. As the number of such blocks increases linearly with n, MPO reconstruction enables quantum state tomography with a number of observables which increases only linearly with n.
We call a method for quantum state tomography efficient if it requires only polynomially many (in n) sufficiently simple observables (more details on permitted observables are given in Sec. VI A, Remark 18). Here, we assume that exact expectation values are available. For a given method to be useful in practice, it is however necessary that the quantum state can be estimated up to a fixed estimation error using approximate expectation values from measurements on at most polynomially many (in n) copies of the state. In this paper, we discuss only the number of necessary observables but not the number of necessary copies of the state. Numerical simulations indicate that, e.g., MPO reconstruction and similar methods are efficient also in the number of necessary copies.1,14–16
There are multipartite quantum states (e.g., states of a spin chain) that admit an efficient matrix product state (MPS) or MPO representation but which cannot be reconstructed from reduced states of a few of its parties (e.g., a few neighbouring sites of the spin chain). The n-qubit GHZ state is an example of such a state (Sec. V A). However, it has been shown that the GHZ state can be reconstructed from a number of observables linear in n, provided global observables (i.e., those which act on the whole system) are allowed.14,15 The necessary observables are given by simple tensor products15 or simple tensor products and unitary control of few neighbouring sites.14 We generalize MPO reconstruction and a similar technique based on the Petz recovery map17 to use a certain class of long-range measurements which includes those just mentioned as special cases (Sec. VI). We represent a long-range measurement as a sequence of local quantum operations followed by the measurement of a local observable. However, a tensor product of single-party observables, whose expectation value can be obtained by a simple, sequential measurement of the single-party observables, already constitutes an allowed long-range measurement.
The example of the GHZ state shows that long-range measurements enable the recovery or reconstruction of a larger set of states than those obtained by local few-body observables. Our reconstruction and recovery methods provide a representation of the reconstructed state in terms of a sequence of local linear maps which is equivalent to an MPO representation. For methods based on the Petz recovery map, the local linear maps are quantum operations, and because of this, a PMPS (locally purified MPS) representation can be obtained (Ref. 18, Appendix A 5). A PMPS representation is advantageous because it can be computationally demanding to determine whether a given MPO representation represents a positive semidefinite operator,18 whereas a PMPS representation always represents a positive semidefinite operator. Our work on the reconstruction of spin chain states is partially based on similar ideas developed in the context of tensor train (TT) representations19 and there is also related work on Tucker and hierarchical Tucker representations.5,20,21
The remainder of the paper is structured as follows: Sec. II introduces notation, definitions, MPS/MPO representations, and known results on the Petz recovery map. Section III shows how a low-rank matrix reconstruction technique enables bipartite state reconstruction, i.e., a transformation of into ρXY. We also prove that approximate matrix reconstruction is possible if a low-rank matrix is perturbed by a small high-rank component (Sec. III B). We apply the Petz recovery map to the bipartite setting in Sec. IV and investigate the relation between Petz recovery and state reconstruction in Sec. V. Any state which admits Petz recovery is found to also admit state reconstruction. In Sec. VI, we discuss reconstruction of spin chain states with Petz recovery maps and state reconstruction. If reconstruction is performed with local reduced states (Sec. VI A), a known application of the Petz recovery map17 and the known MPO reconstruction technique1 are obtained. In Sec. VI B, we reconstruct spin chain states from recursively defined long-range measurements. We show that successful recovery of a given spin chain state implies successful reconstruction both for local reduced states and for long-range measurements. The set of states which can be reconstructed with long-range measurements is seen to be strictly larger than the set of states which can be reconstructed with measurements on local reduced states. Long-range measurements were used in earlier work on the reconstruction of pure states14 and we show that our methods can recover or reconstruct these states if the same long-range measurements are used.
II. PRELIMINARIES
A. Notation and basic definitions
In this paper, all Hilbert spaces are finite-dimensional. We use capital letters A, B, C, … to denote quantum systems with Hilbert spaces , , , ..., and set . For notational simplicity, we often use A to denote both the system and its associated Hilbert space, when there is no cause for confusion. If n systems are involved, we denote their Hilbert spaces by , ..., and tensor products of the latter by .
We denote the set of linear maps from A to B by and the set of linear operators on A by . If tensor products are involved, we use the notation . The trace of a linear operator (or a square matrix ) is denoted by Tr(F). A quantum state (or density matrix) of a system A is a positive semi-definite operator with unit trace. Let denote the set of quantum states in . In the case of a pure quantum state , , we refer to both and |ψ⟩ as the pure state. For any state , its von Neumann entropy is defined as S(A)ρ = −Tr(ρA log (ρA)). In this paper, all logarithms are taken to base 2.
For any , let F* denote its Hermitian adjoint, supp(F) denote its support, rk(F) denote its rank, ∥F∥ denote its operator norm (largest singular value), and σmin(F) denote its smallest non-zero singular value. A linear operator is an observable if it is Hermitian. The Hilbert-Schmidt inner product on is denoted by
The vector space becomes a Hilbert space when equipped with this inner product.
The notation denotes the set of linear maps from to . This includes the set of quantum operations (or superoperators) from A to B which are given by linear, completely positive, trace-preserving (CPTP) linear maps . We use the shorthand notation to indicate such a quantum operation. Given any linear map , its Hermitian adjoint (with respect to the Hilbert-Schmidt inner product) is denoted by , i.e., for all , .
Since we are dealing with finite-dimensional Hilbert spaces, all linear operators and maps are represented by matrices. Given a matrix (or a linear map M), let M* denote its conjugate transpose matrix, denotes its (element-wise) complex conjugate matrix, and M+ denotes its Moore–Penrose pseudoinverse. The following four properties of the pseudoinverse also define it uniquely:22,23
Given a real number t ≥ 0, we define where Mt is obtained from M by replacing its singular values that are smaller than or equal to t by zero.
For a system A, we choose an operator basis which is orthonormal in the Hilbert–Schmidt inner product
Given a basis element , we denote its dual element (in the Hilbert-Schmidt inner product) by ,
The identity map, id, on can then be expressed as
This is nothing but the resolution of the identity operator for the Hilbert space .
Consider a linear map . Since the vector spaces and have the same finite dimension, , it is possible to define a bijective linear map between the two spaces. To do so, we define the components of and in terms of the operator bases from above
Given a linear operator , we define a linear map by
We denote the matrix representation of in the operator basis chosen above by Mρ. The maps and ρ ↦ Mρ defined by Eq. (7) are of course bijective. Note that ρ can be represented by a matrix of size dXdY × dXdY, while can be represented by the matrix . The transpose map is defined in the same operator basis, i.e., .
Given a linear operator , its operator Schmidt rank is given by
The operator Schmidt rank is equal to
which can be shown as follows: The matrix representation Mρ of can be written as
Since the components of Mρ and ρ are related by , we have
where Gik and Hkj are the components of the matrices G and H. This shows that the operator Schmidt rank cannot exceed s = rk(Mρ). Now suppose that the operator Schmidt rank was less than that, i.e., r = OSR(X:Y)ρ < s = rk(Mρ). Then, a decomposition of ρ as in Eq. (8) implies that
B. MPS, MPO, and PMPS representations
In this section, we introduce frequently used efficient representations of pure and mixed quantum states on n systems. We call a representation efficient if it describes a state with a number of parameters (i.e., complex numbers) which increases at most polynomially with n. The number of parameters of a particular representation of a state is accordingly given by the total number of entries of all involved vectors, matrices, and tensors. For example, a pure state of n quantum systems of dimension d has dn parameters and is not an efficient representation. To discuss whether a given representation is efficient or not, we use the following notation: For a function f(n), we write or if there is a polynomial g(n) such that f(n) ≤ g(n). We write if there are constants c1 and c2 such that f(n) ≤ c1 exp (c2n) holds for all n.
First, we introduce the matrix product state (MPS) representation (see, e.g., Ref. 13), which is also known as tensor train (TT) representation.24 Consider n quantum systems of dimensions d1, …, dn respectively and let be an orthonormal basis of the kth system. An MPS representation of a pure state on n systems is given by
where D0 = Dn = 1, and ik ∈ {1, …, dk}. The condition D0 = Dn = 1 ensures that G1(i1) and Gn(in) are row and column vectors, while the Gk(ik) for k between 1 and n can be matrices. The matrix sizes Dk are called the bond dimensions of the representation. The maximal local dimension and the maximal bond dimension are indicated by d = maxkdk and D = maxkDk. For and , the total number of parameters of the MPS representation is and the representation is efficient. The bond dimension Dk of any MPS representation of |ψ⟩ is larger than or equal to the Schmidt rank of |ψ⟩ for the bipartition 1, …, k|k + 1, …, n and a representation with all bond dimensions equal to the corresponding Schmidt ranks can always be determined (see, for example, Ref. 13). We discuss the analogous property of the matrix product operator (MPO) representation in more detail.
A matrix product operator (MPO) representation25,26 of a mixed state on n systems is given by
where D0 = Dn = 1, and ik, jk ∈ {1, …, dk}. Alternatively, an MPO representation may be given in terms of operator bases ,
where D0 = Dn = 1, , and . If the operator basis is used, Eq. (15) turns into Eq. (14). As before, we denote the maximal local and bond dimensions by d = maxkdk and D = maxkDk. The number of parameters of an MPO representation is at most nd2D2 and it is an efficient representation if and . The operator Schmidt ranks of ρ provide lower bounds to the bond dimensions of any MPO representation of ρ,12
This becomes clear if we rewrite Eq. (14) as follows:
where and . The sum runs over bk ∈ {1, …, Dk}. It can also be shown that a representation with equality in Eq. (16) always exists (see, for example, Ref. 13). If the linear operator ρ represented by an MPO is a quantum state, it is desirable to ensure that ρ is positive semi-definite. However, deciding whether a given MPO represents a positive semi-definite operator is an NP-hard problem in the number of parameters of the representation,18 i.e., a numerical solution in polynomial (in n) time may not be obtained. As an alternative, one can use a PMPS (locally purified MPS) representation of the mixed state. A PMPS representation represents a positive semidefinite linear operator by definition. PMPS representations are also called evidently positive representations and they are introduced in Subsection 5 of the Appendix.
Suppose that a quantum state was prepared via quantum operations , i.e.,
where . Clearly, this is an efficient representation of the quantum state ρ as it is described by at most nd6 parameters. It is known that such a representation can be efficiently, i.e., with at most poly(n) computational time, converted into an MPO representation or a PMPS representation.17,18 Subsection 5 of the Appendix provides the details of the conversion and of the PMPS (locally purified MPS) representation. The state recovery and reconstruction techniques presented in Sec. VI provide a representation of the reconstructed state which is similar to Eq. (18). Lemma 38 in the Appendix provides PMPS and MPO representations of the recovered state for techniques based on the Petz recovery map and an MPO representation of the reconstructed state for state reconstruction results.
The structure of MPO and PMPS representations was used in Ref. 27 in the analysis of translationally invariant (TI) states of infinite spin chains. MPOs and PMPSs correspond to finitely correlated states (FCS) and C*-FCS, respectively, where the infinite chain has been replaced by a finite chain and where the requirement of translational invariance has been dropped (Definitions 2.2 and 2.4 in Ref. 27). In classical probability theory, a similar structure, called a hidden Markov model (HMM), has been used already in 1957 (see Ref. 28 and references therein). For more information about other similar structures, refer to Ref. 18.
C. The Petz recovery map
The (quantum) relative entropy, for two quantum states , was defined by Umegaki29 as
if supp ρ ⊆ supp σ, and is set equal to +∞ otherwise. For a bipartite quantum state , the mutual information between the subsystems A and B is defined in terms of the von Neumann entropies of ρAB and its reduced states ρA = TrBρAB and ρB = TrAρAB,
It can also be expressed in terms of the relative entropy as follows:
The quantum conditional mutual information (QCMI) of a tripartite quantum state is given by
and is expressed in terms of the von Neumann entropy as follows:
As mentioned in the Introduction, a fundamental property of the quantum relative entropy is its monotonicity under quantum operations. This is given by the data processing inequality (DPI): for quantum states and a quantum operation acting on ,
For the choice ρ = ρABC, σ = ρAB ⊗ ρC, and , the DPI (24) implies that the QCMI of a tripartite state ρABC is always non-negative. Using the definition (23) of the QCMI, we further infer that
which is the well-known strong subadditivity (SSA) property of the von Neumann entropy.
Equality in the DPI (24) was first discussed by Petz.8 A necessary and sufficient condition for equality in the DPI (24) was derived by Petz2 and Hayden et al.3 and is stated in the following theorem.
Further, Hayden et al. derived the following necessary and sufficient condition on the structure of tripartite states satisfying equality in the SSA (25) (see Theorem 6 in Ref. 3).
III. RECONSTRUCTION OF BIPARTITE STATES SUBJECTED TO LOCAL OPERATIONS
Let be a bipartite state and let be a state obtained from ρXY by the action of local operations
where and denote quantum operations (or more generally, linear maps). We are interested in the conditions under which the original state ρXY can be reconstructed from with local maps, i.e., with reconstruction maps and .
Our reconstruction scheme is particularly useful for states ρ with low operator Schmidt rank because then a reconstruction of ρ can be achieved with fewer measurements than required for standard quantum state tomography, as discussed in Remarks 18 and 29 (see also Sec. V A). The operator Schmidt rank of ρ is equal to the rank of the matrix Mρ [Eqs. (7) and (9); Mρ has size ]. Hence, in Sec. III A, we first consider the more general problem of reconstruction of low-rank matrices (which are not necessarily states). Section III B discusses the stability of our matrix reconstruction technique and Sec. III C shows how it can be used to reconstruct a quantum state.
A. Reconstruction of low-rank matrices
Suppose that we want to obtain a matrix but we only know the entries of the matrix products LM and MR where L and R are r × m and n × s complex matrices. We refer to LM, MR, and LMR as the marginals of the matrix M. Proposition 3 states that M can indeed be obtained from LM and MR if the condition rk(LMR) = rk(M) holds. This rank condition implies r, s ≥ rk(M). If the rank of M is much smaller than its maximal value, min{m, n}, this provides a way to obtain M from LM and MR which, taken together, have much fewer entries than M. If the matrices L and R are restricted to submatrices of permutation matrices, the matrix products LM and MR comprise selected rows and columns of M. In this case, Proposition 3 provides a reconstruction of a low-rank matrix M from few rows and columns (cf. Ref. 4).
Furthermore, rk(LM) = rk(M) implies rk(LMR) = rk(MR).
rk(LM) = rk(M) implies rk(LMR) = rk(MR): The equality rk(LM) = rk(M) implies M = M(LM)+ LM [use the “⇒” direction of Eq. (31) for R = ]. As a consequence, MR = M(LM)+ LMR and rk(MR) ≤ rk(LMR) hold. The converse inequality rk(LMR) ≤ rk(MR) always holds and we arrive at rk(LMR) = rk(MR).
“⇐” of Eq. (31): Assume that M = MR X LM holds for some matrix X. The equality M = MR X LM implies rk(M) ≤ rk(MR) and rk(M) ≤ rk(LM). The converse inequalities rk(MR) ≤ rk(M) and rk(ML) ≤ rk(M) always hold. As a consequence, we have rk(LM) = rk(M) and rk(MR) = rk(M). Above, we saw that the former equality implies rk(LMR) = rk(MR) which, together with the latter equality rk(MR) = rk(M), proves the theorem.
A violation of the rank condition rk(LMR) = rk(M) does not in general imply that there is no method to obtain M from LM and MR. As a trivial example, consider L = and R = 0. Then, the rank condition is violated for all M ≠ 0, but M is obtained trivially from LM = M.□
(Related work). Proposition 3 states that M can be obtained from LM and MR if rk(LMR) = rk(M) holds. Special cases of Proposition 3 have appeared before in several places. If r = s = rk(M) and L and R select exactly r = rk(M) rows and columns of M, the decomposition M = MR(LMR)−1LM is known as skeleton decomposition of M.4 Decompositions of the form M = MR X LM where L and R select rows and columns of M are known as pseudoskeleton/CUR decomposition of M and it has been recognized that the truncated Moore–Penrose pseudoinverse may provide a good approximation if r = s < rk(M) and suitable rows, columns, and threshold τ are chosen;4 we come back to the case of approximately low rank in Sec. III B. The case r = s = rk(M), X = (LMR)+ is contained in the results on tensor decompositions by Caiafa and Cichocki.21 This matrix decomposition with X = (LMR)+, restricted L and R but general r, s ≥ rk(M) forms the basis of MPO reconstruction1 which is discussed in Sec. VI.
B. Stability of the reconstruction under perturbation
Suppose that we have a matrix S which satisfies the rank condition
for given matrices L and R. We want to reconstruct the perturbed matrix
and ϵ = ∥E∥/∥S∥ quantifies the magnitude of the perturbation E relative to the unperturbed matrix S. In Theorem 6, we provide a reconstruction and show that it is close to M if the magnitude ϵ of the perturbation E is small enough. A bound on the distance in operator norm between the reconstruction and M is provided by
and Theorem 6 provides a bound on .
Recall that given a matrix M, we define and Mτ is given by M with singular values smaller or equal to τ replaced by zero.
For the interpretation of the theorem, it is convenient to use the case with ∥L∥ = ∥S∥ = ∥R∥ = 1 and η = 1. Theorem 6 shows that the reconstruction reconstructs the low-rank component S of M = S + E up to a small error if the smallest singular value γ of the low-rank component LSR is much larger than the norm ϵ of the noise component. In addition, the threshold τ must be chosen larger than the noise norm ϵ but smaller than γ − ϵ. Subsection 1 of the Appendix discusses examples which show that the bound from Theorem 6 is optimal up to constants and that the reconstruction error can diverge as ϵ approaches zero if small singular values in LMR are not truncated.
Choosing a suitable threshold τ is equivalent to estimating the rank of the low rank contribution S. If the rank and support of S are known, the measurements L and R can be chosen such that LSR becomes invertible. For this special case, an upper bound on the reconstruction error has been given by Caiafa and Cichocki.21 Their bound also includes constants that depend on LSR and may diverge as γ approaches zero. In Subsection 3 of the Appendix, we generalize their approach to our more general setting and obtain a bound that is similar to Theorem 6.
The following lemma was used in the proof of Theorem 6:
C. Reconstruction of bipartite states
Let be a bipartite quantum state and let be a state obtained from it by the action of local operations
where and denote quantum operations. We are interested in the conditions under which the original state ρ can be reconstructed from τ with local quantum operations, i.e., with and . This question can be answered with the matrix decomposition from Sec. III A without using the positivity properties of ρ, , and . The result is provided by the following theorem:
The operator τ is given by , therefore always holds (Corollary 11). Let Eq. (41) hold. Again by Corollary 11, the converse inequality also holds. As a consequence, the two operator Schmidt ranks must be equal.
The remainder of the section provides the ingredients used in the preceding proof. It also provides a data processing inequality (DPI) for the operator Schmidt rank which is used below.
In the Introduction, we saw that the operator Schmidt rank is given by where is a linear map. As corollary from Lemma 10, we obtain the monotonicity of the operator Schmidt rank under local maps, i.e., a data processing inequality.
Use the property [Eq. (9)], the identity (Lemma 10) and the rank inequality rk(AB) ≤ min{rk(A), rk(B)} for arbitrary matrices or linear maps A and B.
IV. PETZ RECOVERY OF BIPARTITE STATES SUBJECTED TO LOCAL QUANTUM OPERATIONS
In Sec. III, we considered a linear operator subjected to local linear maps and ,
In Sec. III C, we presented a condition under which ρ can be reconstructed from τ via local linear maps. Here, we discuss the same question for a bipartite quantum state and quantum operations and . The answer is obtained by restricting Theorem 1 to the bipartite setting, i.e., by inserting ρ = ρXY, σ = ρX ⊗ ρY, and .9
In Sec. V, we explore the relation between bipartite state reconstruction (Theorem 8) and bipartite Petz recovery (Corollary 12).
V. COMPARISON OF PETZ RECOVERY AND STATE RECONSTRUCTION
In this section, we compare Petz recovery with state reconstruction for a bipartite quantum state subject to local quantum operations and ,
The reconstruction is to be achieved via local linear maps
State reconstruction and the Petz recovery map both provide maps and under the assumption of different conditions on ρ and τ (Theorem 8 and Corollary 12). There is the following evident relation between state reconstruction and Petz recovery.
The premise of Eq. (57) implies that the CPTP maps from Eq. (58) exist (Corollary 12). These CPTP maps are linear maps that satisfy Eq. (59), which in turn implies that the conclusion of Eq. (57) holds (Theorem 8).
A counterexample for the converse implication of Eq. (57) will be provided in Sec. V A.
Suppose that the conclusion of Eq. (57) holds while its premise does not hold. If both linear maps and were CPTP, the equality would be implied by Eq. (59) and would follow [since the converse inequality always holds because of ]. This would contradict our assumption; i.e., at least one of and is not CPTP. For example, the reconstruction maps for the W state on four qubits (Sec. V A) are non-positive.□
Theorem 13 implies that any state which admits Petz recovery also admits state reconstruction. Table I compares the quantities on which the state reconstruction maps and the Petz recovery maps depend: Both methods require knowledge of the quantum operations and . The marginal states ρX and ρY are sufficient for computing the Petz recovery maps. To compute the state reconstruction maps, the states and are needed. The marginals ρX, ρY, , and (which are also needed for the recovery maps) can be inferred from the states and . However, in addition, these states contain correlations between the systems X and Y′ and between X′ and Y, respectively. This means that state reconstruction requires more input data for a reconstruction of ρ than state recovery. On the other hand, Theorem 13 and the examples in Subsection V A show that state reconstruction is successful for a strictly larger set of states than Petz recovery.
A. Comparison for four-partite systems
After the comparison of state reconstruction and Petz recovery for bipartite quantum systems, we apply this result to the more specific case of quantum systems which comprise four subsystems. Specifically, we consider four systems A, B, C, and D. We insert the partial trace TrA: AB → B for and the partial trace TrD: CD → C for in Theorem 13. Accordingly, reconstruction of from its reduced state ρBC = TrAD(ρ) is achieved with maps and as
A straightforward application of Theorem 13 provides the implication
Since the W state on four qubits satisfies the conclusion of the last equation but not its premise, it constitutes a counterexample for the converse implication. In addition, it provides a counterexample for the converse implication of Eq. (57) in Theorem 13. On n qubits, the W state is given by
and the operator Schmidt rank and mutual information values of the four-qubit W state WABCD are provided in Table III on p. 16.
Above, we presented one possible application of bipartite Petz recovery (Corollary 12) to a quadripartite system. It turns out that Petz recovery can be applied to a quadripartite system in three different ways. The first row of Table II corresponds to the application of state reconstruction and Petz recovery to a quadripartite system as presented above. Rows two and three of Table II present two different ways to apply Petz recovery to a quadripartite system. In total, we have one possible application of state reconstruction and three possible applications of Petz recovery and for each application, there is a condition for successful reconstruction/recovery. These conditions read as follows:
The implication (C2) ⇒ (C1) follows from Theorem 13 with the substitutions given in the first row of Table II.
Equation (C2) ⇒ Eq. (C3): The inequality I(B : C)ρ ≤ I(B : CD)ρ ≤ I(AB : CD)ρ always holds, therefore I(B : C)ρ = I(AB : CD)ρ implies I(B : CD)ρ = I(AB : CD)ρ. The latter can be written with the conditional mutual information as I(A : CD|B) = 0 [Eq. (22)]. The CMI in turn is also equal to I(A : CD|B) = I(A : BCD) − I(A : B), which shows the desired equality I(A : B) = I(A : BCD).
Equation (C3) ⇒ Eq. (C4): The inequality I(A : B)ρ ≤ I(A : BC)ρ ≤ I(A : BCD)ρ always holds, therefore I(A : B)ρ = I(A : BCD)ρ implies I(A : BC)ρ = I(A : BCD)ρ.
The relations between Eqs. (C1)–(C4) from Lemma 15 are illustrated in Fig. 2. The figure also shows which of the conditions are satisfied by the example states from Table III. For example, the W state WABCD on four qubits does not satisfy (C2)–(C4). We can understand that WABCD cannot satisfy (C4) by considering the following known result: If (C4) holds, then Theorem 2 tells us that the reduced state ρAD must be a separable state. However, the reduced state TrBC(WABCD) has a non-positive semidefinite partial transpose and therefore is inseparable, i.e., entangled:32,33 The entanglement in the reduced state on AD mandates that Eq. (C4) is not satisfied.
Figure 2 illustrates that reconstruction and the different applications of Petz recovery work for different subsets of all quadripartite states but one should not forget that they also require different reduced states of ρ in order to recover . Table IV shows the necessary reduced states for each case. In all four cases, the full state can be reconstructed from marginal states on only two or three of the systems. Each scheme enables quantum state tomography with incomplete information (i.e., the necessary marginals) if the corresponding condition is assumed to hold. Each scheme also relies on the fact that correlations as measured by the operator Schmidt rank or the mutual information are less than maximal; this restriction is imposed by the conditions (C1)–(C4).
. | κ(B:C) . | κ(AB:CD) . | I(B:C) . | I(AB:CD) . | I(A:B) . | I(A:BC) . | I(A:BCD) . | C1 . | C2 . | C3 . | C4 . |
---|---|---|---|---|---|---|---|---|---|---|---|
cGHZABCD | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ✓ | ✓ | ✓ | ✓ |
ρA ⊗ WBCD | 2 | 2 | ≈0.92 | ≈1.84 | 0 | 0 | 0 | ✓ | – | ✓ | ✓ |
WABC ⊗ ρD | 2 | 2 | ≈0.92 | ≈1.84 | ≈0.92 | ≈1.84 | ≈1.84 | ✓ | – | – | ✓ |
WABCD | 2 | 2 | ≈0.62 | 2 | ≈0.62 | 1 | ≈1.62 | ✓ | – | – | – |
ρA ⊗ GHZBCD | 1 | 2 | 1 | 2 | 0 | 0 | 0 | – | – | ✓ | ✓ |
GHZABC ⊗ ρD | 1 | 2 | 1 | 2 | 1 | 2 | 2 | – | – | – | ✓ |
GHZABCD | 1 | 2 | 1 | 2 | 1 | 1 | 2 | – | – | – | – |
. | κ(B:C) . | κ(AB:CD) . | I(B:C) . | I(AB:CD) . | I(A:B) . | I(A:BC) . | I(A:BCD) . | C1 . | C2 . | C3 . | C4 . |
---|---|---|---|---|---|---|---|---|---|---|---|
cGHZABCD | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ✓ | ✓ | ✓ | ✓ |
ρA ⊗ WBCD | 2 | 2 | ≈0.92 | ≈1.84 | 0 | 0 | 0 | ✓ | – | ✓ | ✓ |
WABC ⊗ ρD | 2 | 2 | ≈0.92 | ≈1.84 | ≈0.92 | ≈1.84 | ≈1.84 | ✓ | – | – | ✓ |
WABCD | 2 | 2 | ≈0.62 | 2 | ≈0.62 | 1 | ≈1.62 | ✓ | – | – | – |
ρA ⊗ GHZBCD | 1 | 2 | 1 | 2 | 0 | 0 | 0 | – | – | ✓ | ✓ |
GHZABC ⊗ ρD | 1 | 2 | 1 | 2 | 1 | 2 | 2 | – | – | – | ✓ |
GHZABCD | 1 | 2 | 1 | 2 | 1 | 1 | 2 | – | – | – | – |
The exact values of the numerical constants are , , , and .
Method . | Cond. . | Input . | Domain/Range . | Depends on . |
---|---|---|---|---|
Reconstruction | (C1) | ρBC | B → AB, C → CD | ρABC, ρBCD |
Recovery | (C2) | ρBC | B → AB, C → CD | ρAB, ρCD |
Recovery | (C3) | ρBCD | B → AB | ρAB |
Recovery | (C4) | ρBCD | BC → ABC | ρABC |
VI. EFFICIENT RECONSTRUCTION OF STATES ON SPIN CHAINS VIA RECURSIVELY DEFINED MEASUREMENTS
Under suitable conditions, the state of a linear spin chain with n spins can be reconstructed from marginal states of few neighbouring spins with the Petz recovery map17 or with state reconstruction.1 In Sec. VI A, we explore the relation between Petz recovery and state reconstruction in that setting. In Sec. VI B, we generalize both techniques to use long-range measurements instead of or in addition to short-ranged correlations found in marginal states of few neighbouring spins. This allows for the efficient recovery/reconstruction of a larger set of states, as is explained in the following.
a. Motivation for long-range measurements. Consider the following quantum states on n qubits:
All states from the set have the same reduced state cGHZk on k < n qubits. No recovery or reconstruction method which receives only local reduced states as input can distinguish between the states from the set Sn and this is also the reason why no method could recover or reconstruct the four-qubit state |GHZ0,4⟩ = |GHZ4⟩ in Sec. V A. Note that the pure states |GHZα,n⟩ can be represented as an MPS with bond dimension two (because they are the superposition of two pure product states) and that all states from the set Sn can be represented as an MPO with bond dimension at most four (because they are the sum of at most four tensor product operators).13
We call an MPS representation efficient if its bond dimension is at most and a we call a tomography scheme efficient if expectation values of at most poly(n) simple observables are needed; a possible definition of a simple observable is provided in Remark 18. Standard quantum state tomography is not efficient because it requires ∼exp(n) expectation values. By contrast, it has been shown that any pure state which admits an efficient MPS representation can be determined efficiently from observables with a simple structure.36 The tomography scheme from Ref. 14 is efficient for the states |GHZα,n⟩ but recovery/reconstruction methods based on local reduced states must fail for these states. In Sec. VI B, we extend both Petz recovery and state reconstruction in a way which allows the long-range measurements from Ref. 14 to be used and thus the states |GHZα,n⟩ to be reconstructed successfully. What is more, we show that there are mixed states that cannot be reconstructed from local reduced states but can be reconstructed from long-range measurements (Remark 31). This shows that Petz recovery and state reconstruction with long-range measurements can reconstruct more states than prior techniques (recovery/reconstruction from local reduced states and the tomography scheme from Ref. 14). Furthermore, state reconstruction can reconstruct any MPO of bond dimension D from ∼nD2 expectation values of global tensor product observables, as has been shown in related prior work.19 We build upon that to show that successful, efficient Petz recovery with long-range measurements implies that efficient state reconstruction with long-range measurements is also possible (Theorem 32).
b. Prior work: MPO reconstruction.1 Many physically interesting quantum states can be represented efficiently, i.e., with poly(n) parameters, via an MPO representation.1 However, standard quantum state tomography requires ∼exp(n) different expectation values in ρ to reconstruct ρ, even if ρ admits such an efficient MPO representation. As an improvement over that, it has been shown37 that almost all states with an MPO representation of bond dimension D can be reconstructed from their reduced states on ∼log(D) neighbouring spins if a suitable reconstruction scheme is used.1 We refer to this reconstruction scheme as MPO reconstruction and we rederive it in Theorem 17 as a consequence of our result on bipartite state reconstruction (Theorem 8).
c. Prior work: Cross approximation of tensor trains.19 Our generalization of state reconstruction to long-range measurements in Theorem 27 can be used to construct an MPO representation of the quantum state (Remark 29). An MPO representation of a quantum state is exactly the same as a tensor train representation of ρ if the operator ρ is regarded as a vector from the tensor product vector space . Reference 19 provides a means to reconstruct a tensor of low tensor train rank (i.e., an MPO of low bond dimension) from few entries. This procedure is called tensor train cross approximation. When applied to quantum states, tensor train cross approximation allows for the reconstruction of a quantum state from the expectation values of few tensor product observables. Theorem 27 is more general because it admits more general measurements; e.g., it also permits the measurements introduced in Ref. 14 (cf. Remarks 23 and 33).
d. Prior work: Markov entropy decomposition.17 The strong subadditivity (SSA) property of the von Neumann entropy of a tripartite state ρ ≡ ρABC [cf. (25) of Sec. II C] can be expressed in terms of the conditional entropy S(A|B)ρ = S(AB)ρ − S(B)ρ,
If we choose arbitrary subsets , the entropy S(ρ) = S(1, …, n)ρ can be rewritten and upper-bounded as follows:
In the second step, we applied Eq. (67) n − 2 times. The sets are called Markov shields and the upper bound SM(ρ) is called the Markov entropy.17 In the following, we consider the particular choice . In that case, the conditional entropies depend only on the reduced state ρk,k+1. As a consequence, the Markov entropy SM(ρ) is an upper bound on S(ρ) which depends only on the nearest-neighbour reduced states ρk,k+1 (k ∈ {1, …, n − 1}). For a nearest-neighbour Hamiltonian , the energy is determined by the same reduced states ρk,k+1. Therefore, lower bounds to the free energy F = E − TS of a thermal state at temperature T can be found with a variational algorithm which only uses the reduced states ρk,k+1.17
Equation (69) was obtained by applying Eq. (67) n − 2 times for A = {k + 1}, B = {k} and C = {1, …, k − 1} (k ∈ {2, …, n − 1}). These inequalities are equivalent to the following inequalities [because Eq. (67) is equivalent to I(B : C) ≤ I(AB : C)]:
If equality holds in Eq. (70) or, equivalently, in Eq. (69), the global state ρ can be obtained from the reduced states ρk,k+1 (k ∈ {1, …, n − 1}) via Petz recovery maps (see the supplementary material and main text of Ref. 17). We state this known result in Theorem 16 and show that these conditions imply that MPO reconstruction (as stated in Theorem 17) is possible (Theorem 19).
A. Reconstruction of states from local marginal states
The state of the spin chain is where is the tensor product of the single-spin Hilbert spaces. For each k ∈ {1, …, n}, we partition the spins on the chain into two parts
The marginal states , for k ∈ {2, …, n}, can be defined recursively via
Each partial trace Trk+1 is a local CPTP map. If the partial trace Trk+1 does not decrease the mutual information between {1, …, k − 1} and {k, k + 1} for all k ∈ {2, …, n − 1}, then the n-spin state ρ can be recovered from marginal states of two neighbouring spins.17
For k ∈ {2, …, n − 1}, apply Corollary 12 with X = X′ = Xk−1, , , , and .
In a similar way, if the partial traces do not decrease certain operator Schmidt ranks, their actions can be reverted with state reconstruction.1,19
For k ∈ {3, …, n − 1}, apply Theorem 8 with X = Xk−1, X′ = {k − 1}, , , , . Recall that implies where and (Lemma 10). Therefore, the reconstruction map is given by with .
The result from Theorem 17 has been obtained previously in Ref. 1 under the name reconstruction of quantum states or MPO reconstruction. For a discussion of further related work,19 see Remark 34.
(Efficient recovery/reconstruction). We call a recovery or reconstruction method to obtain efficient if it satisfies the following conditions. The method provides an efficient representation of ρ (cf. Sec. II B). This representation of ρ can be constructed from suitable input data in at most poly(n) computational time. As a consequence, the size of the input data may be at most poly(n) [i.e., at most poly(n) complex numbers]. The necessary input data may be obtained from at most poly(n) different tensor product expectation values, i.e., expectation values of the form where , , , and Y′ is an ancilla system of dimension .38 The quantum operation is constructed from at most poly(n) quantum operations whose input and output dimension is at most poly(n). This severely restricts the available measurements because the number of two-qubit gates required to implement an arbitrary n-qubit unitary is exponential in n.39
Standard quantum state tomography is not efficient because it fails to satisfy any of these criteria. For example, in quantum state tomography, ∼exp(n) expectation values are required in order to determine ρ.
Clearly, Theorems 16 and 17 satisfy all of these criteria because efficient representations are provided and the necessary input data consist only of two- and three-spin marginals of ρ. Lemma 38 also provides efficient MPO and PMPS representations for Theorem 16 and an efficient MPO representation for Theorem 17.□
Note that the operator Schmidt rank condition (75) is different from the mutual information condition (73) in that it contains {k − 1} instead of Xk−1 on the very left. If the partial trace which maps Xk−1 onto {k − 1} was left out, the state would be needed to construct . Construction of would need and the reconstruction would be neither efficient nor useful. Despite this difference, we show that the premise of state recovery (Theorem 16) implies the premise of state reconstruction (Theorem 17).
I(B : C) = I(AB : CD) holds if and only if I(A : B) = I(A : BC) and I(AB : C) = I(AB : CD).
“⇐”: Let I(A : B) = I(A : BC) and I(AB : C) = I(AB : CD) hold. The former equality implies Eq. (81) and this shows that I(B : C) = I(AB : C) = I(AB : CD) holds.
B. Long-ranged measurements
In this subsection, we generalize recovery and reconstruction to use certain long-range measurements as input and show that successful recovery implies that successful reconstruction is also possible.
1. Recovery from long-ranged measurements
Recovery and reconstruction of a spin chain state from few-body marginals required that correlations (as measured by the mutual information or the operator Schmidt rank) do not decrease under the following partial traces (Fig. 3):
In order to incorporate long-range measurements, we introduce ancillary systems (k ∈ {0, …, n}, ), quantum operations and define via (Fig. 3)
The relation between long-range measurements and the maps is explained Remark 22. If the mutual information does not decrease when is applied, then Theorem 24 provides a reconstruction of ρ from the states (details are specified in the theorem). Before we state the theorem, we explain that measurements on σk correspond to recursively defined long-range measurements on ρ and we observe that suitable ancilla systems and operations can be determined for any pure MPS.
For k ∈ {2, …, n}, apply Corollary 12 with X = X′ = Xk−1, , , , and .
Theorem 24 provides Theorem 16 by restricting to the special case (k ∈ {1, …, n − 1}), (k ∈ {2, …, n − 1}), and using Eq. (88) only for k ∈ {2, …, n − 1}.□
Denote by the maximal dimension of any ancillary system. If , the recovery scheme from Theorem 24 is efficient (it satisfies all conditions from Remark 18). Lemma 38 provides efficient PMPS and MPO representations of ρ.□
2. Reconstruction from long-ranged measurements
State reconstruction can be generalized similarly but it requires that additional ancillary systems and linear maps are introduced (Fig. 4):
For k ∈ {2, …, n}, apply Theorem 8 with X = Xk−1, , , , , and . The equality from the theorem becomes . Recall that Eq. (90) implies where and (Lemma 10). Therefore, the reconstruction map is given by with .
Theorem 27 provides Theorem 17 by restricting to the special case , , , and using Eq. (93) only for k ∈ {3, …, n − 1}.□
Denote by and the maximal dimensions. If and , the reconstruction scheme from Theorem 27 is efficient (it satisfies all conditions from Remark 18). Lemma 38 provides an efficient MPO representation of ρ.
Efficient reconstruction implies that a given state can be reconstructed from a number of expectation values which grow polynomially instead of exponentially with n. This improvement can only be achieved if the to-be-reconstructed state is not a completely general quantum state of n systems. In the following, we show that the condition for efficient reconstruction in particular implies that the operator Schmidt ranks of the state are restricted to growing polynomially (instead of exponentially) with n.
For , the maximal value of the operator Schmidt rank is and it is assumed, e.g., for maximally entangled pure states. Suppose that ρ can be reconstructed efficiently. The equality [Eq. (93)] implies (Corollary 11). The rank of is, in turn, upper bounded by , i.e., the operator Schmidt rank of ρ is at most . In conclusion, any state that admits an efficient reconstruction with Theorem 27 has a small operator Schmidt rank in the sense that it does not grow exponentially but only polynomially with the number of spins n.
3. Recovery vs. reconstruction for long-ranged measurements
In this section, we show that the conditions for state recovery (Theorem 24) imply that state reconstruction (Theorem 27) is also possible. The premise of Theorem 24 implies the premise of Theorem 27 for . However, Theorem 27 does not provide a useful reconstruction with because the necessary input σn for the construction of would be σn = ρ. In Theorem 19, we used the symmetry of the conditional mutual information to work around this but this is no longer possible because was introduced. Note that Eq. (87) implies the same equality for operator Schmidt ranks and that Eq. (88) provides MPO representations of the ρk (Lemma 38). It is well known that maps suitable for Theorem 27 can be obtained directly from the matrices of the MPO representation after the matrices have been transformed into a suitable orthogonal (mixed-canonical) form (Refs. 13 and 24; see also Remark 35 in the Appendix). The maps obtained in this way are given by partial isometries on the vector space of linear operators. Such a map is not guaranteed to be completely positive or trace preserving, i.e., it does not represent a quantum operation and it may not allow for an efficient implementation in a given quantum experiment. An alternative construction has been put forward in Ref. 19: Here, maps are provided whose matrix representation is given by a submatrix of a permutation matrix in a product basis of .42 We use this result to prove that efficient recovery implies efficient reconstruction in Theorem 32. Remark 33 discusses advantages and disadvantages of the two different choices for mentioned in this paragraph.
Lemma 38 provides an MPO representation of the states ρk from Eq. (88). It is well known that maps can be chosen recursively such that holds if an MPO representation of ρk−1 is given.13,24 As is implied by (87) (Theorem 13), it is clear that Eq. (97) holds as well. It was also recognized that the maps can be chosen such that their matrix representation is a submatrix of a permutation matrix.19 We provide a self-contained description of the corresponding procedure in Lemma 36.
The singular values of equal those of if the maps are suitable partial isometries on the vector space of linear operators (cf. Remark 35). For reconstruction stability (Theorem 6), this is the optimal case (if the maps are predefined). If the maps are restricted to submatrices of permutation matrices, the singular values of are smaller than or equal to those of (because has a unit operator norm). If the smallest non-zero singular value decreases, then stability of the reconstruction is reduced (Theorem 6; cf. Refs. 19 and 43). In the worst case, the smallest non-zero singular value decreases by a factor exponential in n because of the recursive construction of the .43 However, empirical results show that this worst-case behaviour is usually not observed in practice.1,19,43,44
If the maps are not predefined, the singular values of equal those of if the maps and are suitable partial isometries on the vector space of linear operators (Remark 35). In this case, Theorem 32 allows reconstruction of an arbitrary MPO (or matrix product state/tensor train) with optimal reconstruction stability. However, it remains an open question whether this can be fully exploited, e.g., in the reconstruction of quantum states as the necessary measurements may not allow for an efficient implementation if the maps and are general partial isometries on the vector space of linear operators.
The situation is different if the state ρ is a pure matrix product state. Here, partial isometries that act on the Hilbert spaces themselves can be obtained (Ref. 14, cf. Remarks 23 and 35). These partial isometries can be implemented via unitary control of the quantum system and they have the property that they preserve the singular values of . This also shows that the tomography scheme for pure matrix product states based on local unitary operations and proposed in Ref. 14 provides maps and for state recovery and reconstruction with optimal stability.□
(Related work). Note that nowhere in the proof of Theorems 8 and 27 did we use the fact that ρ is a linear operator on . The theorems apply equally well to arbitrary vectors on tensor product vector spaces . The components of ⟨i1 … in|ψ⟩ of |ψ⟩ in a product basis define a tensor (i.e., an array with n indices).
A result similar to Theorem 27 has been obtained before in the context of tensor train representations.19,43,44 Their result is formulated for a tensor with n indices, i.e., replace by , by , by , etc. They restrict to . In this case, the pseudoinverse in the reconstruction maps (defined in Theorem 27) is just the regular inverse (cf. Remark 5). They also restrict and to submatrices of permutation matrices in a fixed product basis. In addition, they provide an algorithm which attempts to determine suitable maps and incrementally and efficiently. Similar work has been carried out for the Tucker and hierarchical Tucker tensor representations5,20,21 and the relation between this work and the matrix reconstruction from Sec. III A will be explored elsewhere.45□
ACKNOWLEDGMENTS
We acknowledge discussions with Oliver Marty. Work in Ulm was supported by an Alexander von Humboldt Professorship, the ERC Synergy grant BioQ, the EU projects QUCHIP and EQUAM, and the US Army Research Office Grant No. W91-1NF-14-1-0133. Work in Hannover was supported by the DFG through SFB 1227 (DQ-mat) and the RTG 1991, the ERC grants QFTCMPS and SIQS, and the cluster of excellence EXC201 Quantum Engineering and Space-Time Research.
APPENDIX:ADDITIONAL RESULTS AND BACKGROUND INFORMATION
1. Optimality of the stability bound
The following examples show that the bound from Theorem 6 is optimal up to constants and that the reconstruction error can diverge as ϵ approaches zero if small singular values in LMR are not truncated.
Theorem 6 provides an upper bound on the reconstruction error of a reconstructible matrix S (the signal) which is perturbed by some error matrix E. The following example shows that the upper bound from the theorem is optimal up to constants:
The eigenvalues of S are so approaches unity as Δ → 0. For simplicity, we might assume that using η ≈ 1 is sufficient for the discussion of this example but we keep η > 1. We have ∥E∥/η = ϵ. Suppose that we choose ϵ and τ such that . Set c = τ + 2ϵ, i.e., , and set . Then < 1 and . In addition, we obtain
The eigenvalues of LSR are ±Δ. This provides us γ = Δ/η = τ + 2ϵ. Therefore, the condition ϵ ≤ τ < γ − ϵ is automatically satisfied and, as a consequence, 2ϵ < γ holds as well. LER can change the eigenvalues of LSR at most by ϵ (cf. proof of Lemma 7), so no truncation occurs. In this case, the reconstruction error has exactly the scaling from the theorem
Note that the conditions from above imply and that γ = τ + 2ϵ ≥ 3ϵ. The latter implies −ϵ ≥ −γ/3 and 3(γ − ϵ) ≥ 2γ ≥ γ. Combining the relations provides the bound used above.
One may ask whether thresholds τ outside the interval permitted by the theorem reconstruct M successfully. In this example, a threshold that is large enough to produce a different reconstruction will replace at least one of the two singular values of the reconstruction by zero. As the two singular values of S are equal, the reconstruction error will be at least ∥S∥ in this case, i.e., larger thresholds do not provide a successful reconstruction in the sense that the error in operator norm is significantly smaller than ∥S∥. In this example, neither smaller nor larger thresholds (than the ones permitted by Theorem 6) provide an improved reconstruction: Smaller thresholds do not change the reconstruction at all because thresholding does not reduce the rank of LMR in this example. However, the following example shows that thresholding is in general necessary to obtain an error that satisfies the bound from Theorem 6. We keep L and R from above and choose
We have η = ∥S∥ = 1, γ = 1 and the eigenvalues of E/ϵ are 1 and −1 + 2ϵ2 such that ∥E∥ = ϵ; we choose such that choosing a τ from ϵ ≤ τ < 1 − ϵ is permitted. The eigenvalues of LMR are 1 and ϵ3. We obtain (using )
Without truncating small singular values, the error diverges as ϵ → 0, i.e., it does not satisfy the bound from Theorem 6. Here, the effect of E is completely erased by truncation:
2. The stability bound for matrices with non-unit operator norm
In this section, we provide an argument that extends the proof of Theorem 6 from matrices S with unit operator norm to matrices S with arbitrary operator norm. Suppose that the matrix M is the sum of a signal S and a noise contribution E, M = S + E. The signal satisfies rk(S) = rk(LSR), but we only know the strength ∥E∥ of the noise. Suppose that for ∥S∥ = ∥L∥ = ∥R∥ = 1, we obtain some error bound of the form
We can obtain an error bound for M′ = S′ + E′ where S′, L′, and R′ have arbitrary norms as follows: Set M = M′/∥S′∥, S = S/∥S′∥, E = E′/∥S′∥, L = L′/∥ L′∥, R = R′/∥R′∥. With these definitions, we have
where τ′ = ∥L∥∥R∥∥S∥τ. Therefore, the bound from the last but one equation implies
In proofs, we assume ∥S∥ = ∥L∥ = ∥R∥ = 1 and we use ϵ = ∥E∥.
3. Alternative proof of the stability bound
In this section, we obtain a bound similar to the one from Theorem 6 using the ansatz by Caiafa and Cichocki.21
As above, we use M = S + E, ∥S∥ = ∥L∥ = ∥R∥ = 1 and rk(S) = rk(LSR).
Note that rk(S) = rk(LSR) implies rk(S) = rk(LS) = rk(SR) = rk(LSR). We use the matrix reconstruction Proposition 3 several times, sometimes with L or R replaced by the identity matrix. The proposition, e.g., provides S = S(LS)+ LS. Using that identity, we obtain the following two equalities:
In the same way, we obtain
We will also use
We decompose into three parts
We insert Eq. (A6) for M at the beginning of the expression and Eq. (A8) for M on the end of the expression. In the following equations, spaces separate factors which come from different equations. In part (A) below, we insert Eq. (A9):
The expression in Eq. (A11a) is equal to S. We use the relation and obtain the following bound:
This bound has been given by Caiafa and Cichocki21 for the case that L and R have exactly r = rk(S) rows and columns (such that the matrix LSR is invertible). They proceed by defining constants a, b, and c which are independent of the threshold τ and of noise strength ϵ = ∥E∥ and obtain a bound of the form aτ + bϵ + cϵ2/τ.
We continue by analyzing how all terms in the last equation depend on L, R, and S. This will provide a bound similar to that of Theorem 6.
Because LSR, LS, and SR have all rank r = rk(S), the relation σmin(LSR) = σr(LSR) holds for these three matrices. We obtain
where the first inequality is provided by Ref. 46 (Theorem 3.3.16, p. 178). This provides
and the same bound applies to ∥R(SR)+S∥. Note that γ ≤ ∥S∥ = 1.Using 1 ≤ 1/γ, we obtain
and
The inequality holds for arbitrary values of γ, τ, and ϵ.
Now, we assume ϵ ≤ τ < γ − ϵ and use bounds from Lemma 7. This provides
Without the assumption ϵ ≤ τ < γ − ϵ, we obtain
4. Known results on matrix product representations
This section reviews known results on matrix product state/tensor train representations used in Sec. VI B 3. It also provides full formal details for the results which were used.
Given a tensor , a matrix product representation of the tensor is given by
where D0 = Dn = 1, , , ik ∈ {1, …, dk}, and m ∈ {1, …, n − 1}. For simplicity, may be used. The Gk are called the cores of the representation while the matrices Gk(ik) give the representation its name. The left and right unfoldings of the cores are given by
and they have the same entries as Gk, e.g., . This notation is partially inspired by Ref. 48. The left and right interface matrices are given by
where G≤0 = 1 and G>n = 1. The unfolding tk is the d1, …, dk × dk+1, …, dn matrix with the same entries as t and it can be written as
It is well known that a singular value decomposition of the unfolding tk can be obtained efficiently:13,24
The following lemma provides an efficient, incremental construction of matrices U≤k and V>k such that the matrix has the same rank as tm. More general matrices are permitted than in Eq. (A26) and the rank is preserved [Eq. (A30c)] but the singular values of can differ from those of tm. The proof of Lemmata 36 and 37 has been sketched in Ref. 19. In the premise of the following Lemma, it is possible to choose Uk and Vk as submatrices of permutation matrices (the case considered in Ref. 19), but the actual proof is independent of this choice.
In the last Lemma, it was possible to choose Uj as submatrices of permutation matrices. The following lemma shows that this implies that the U≤j are submatrices of permutation matrices as well and that the position of the non-zero entries of U≤j can be computed efficiently.
5. Sequence of local quantum operations as PMPS representation
This section introduces the locally purified matrix product state (PMPS) representation and discusses the known fact that a sequentially prepared mixed quantum state can be represented as a PMPS or as an MPO (Lemma 38). The PMPS representation12,26 provides an alternative to the MPO representation for positive semidefinite operators such as mixed quantum states. The purification is given in terms of n ancilla systems of dimensions with bases . A PMPS representation of ρ is given by
Given the tensors Gk of a PMPS representation, the tensors of an MPO representation are given by
where the overline denotes the complex conjugate. Equation (A45) shows that given a PMPS representation of bond dimension D, we can directly construct an MPO representation with bond dimension D2. However, an MPO representation with bond dimensions smaller than D2 can exist. It has been shown that there is a family of quantum states on n systems which can be represented as an MPO with bond dimension independent of n but the bond dimension of any PMPS representation of those states increases with n.12 This is an advantage of the MPO representation, but on the other hand, deciding whether a given MPO representation represents a positive semidefinite operator is an NP-hard problem, i.e., a solution in polynomial (in n) time is unlikely.18 The PMPS representation has the advantage that it always represents a positive semidefinite operator by definition. The relative merits of the MPO and PMPS representations of a mixed quantum state depend on the application.
Suppose that a quantum state was prepared via quantum operations , i.e.,
Clearly, this is an efficient representation of the quantum state ρ as it is described by at most nd6 parameters. It is known that such a representation can be efficiently—i.e., with at most poly(n) computational time—converted into an MPO representation or a PMPS representation.17,18 The following lemma provides the technical details of the conversion.
For the MPO representation, evaluate the operator basis elements of ρ from Eq. (A47) and compare with the operator basis elements of the representation [Eq. (15)]. For the PMPS representation, evaluate the matrix entries of ρ from Eq. (A47) inserting Eq. (A49) and compare with the matrix entries of the representation [Eq. (A44)].
6. General linear maps as measurements
Consider a quantum state and an arbitrary linear map . Above, we work with linear maps which are not necessarily CPTP and therefore do not represent a physical operation on the quantum state. Such a map is of relevance only if can be obtained from the outcomes of physical measurements on ρ. In this section, we show how this can be achieved, allowing the reconstruction scheme from Sec. III C to be used for quantum state tomography.
First, we construct a set of observables () whose expectation values Tr(Giρ) in the state ρ can be used to compute . Second, we construct a positive-operator-valued measure (POVM) with elements such that the outcome probabilities Tr(Eiρ) in the state ρ can also be used to compute .
We denote the components of in an operator basis of X by si,
The key tool is the following property of the map :
Since Hi may not be Hermitian, we use its Hermitian and skew-Hermitian components
Using the observables Gi, the components si can be expressed as follows:
In other words, the expectation values of the Gi provide the real and imaginary parts of si,
If these expectation values can be measured, we already obtain a way to obtain from physical measurements on ρ even if is not CPTP. Furthermore, we construct a POVM whose measurement on ρ also allows to determine . We choose coefficients and c > 0 such that the following operators become positive semidefinite:
We define POVM elements by
Clearly, the expectation values of the Gi are related to the POVM probabilities by
The coefficients si of can be obtained from these expectation values using Eq. (A55). As a consequence, the POVM probabilities of the given POVM allow us to determine even if is not CPTP.
REFERENCES
The reduced states and do not depend on the sign. The values of the operator Schmidt rank are .
This holds true if ρ is to be reconstructed from marginal states of ρ which include at most ρABC and ρBCD.
This is shown by the tomography scheme based on unitary operations introduced in Ref. 14. We discuss it in more detail in Remark 23.
Assume that all spins have the same dimension d = dk, k ∈ {1, …, n}. Lemma 1 in the supplementary material of Ref. 1 states that ρ can be reconstructed from reduced states on l + r + 1 neighbouring spins if a certain condition is satisfied. This condition can be satisfied only if both D2 ≤ d2l and D2 ≤ d2r hold. However, if these two inequalities are satisfied, the given conditions almost always hold for MPO matrices with random entries.
We introduce the ancilla system to capture the precise definition of the measurements in Sec. VI B.
The observables may be given, e.g., by the elements of a positive-operator-valued measure.
The swap gate is given by SWAPkl(⋅) = Skl ⋅ Skl, where |i⟩ and |j⟩ are orthonormal bases of and .
We provide a formal description of the corresponding part of their work in Lemmata 36 and 37.