We derive a measurement-independent asymptotic continuity bound on the observational entropy for general positive operator valued measures measurements, making essential use of its property of bounded concavity. The same insight is used to obtain continuity bounds for other entropic quantities, including the measured relative entropy distance to a convex set of states under a general set of measurements. As a special case, we define and study conditional observational entropy, which is an observational entropy in one (measured) subsystem conditioned on the quantum state in another (unmeasured) subsystem. We also study continuity of relative entropy with respect to a jointly applied channel, finding that observational entropy is uniformly continuous as a function of the measurement. But we show by means of an example that this continuity under measurements cannot have the form of a concrete asymptotic bound.

Observational entropy has recently emerged (in fact, re-emerged, cf. Refs. 1–4) in studies of non-equilibrium statistical mechanics as a useful unifying framework to describe coarse-grained entropy in classical and quantum systems,5–7 with applications across thermodynamics and quantum information theory.8–27 

In the quantum case, for any measurement described by a positive operator valued measures (POVM) M=(Mi)iI, (Mi ≥ 0, iMi=1), and quantum state described by a density matrix ρ, observational entropy
(1)
describes an entropy of the state ρ “coarse-grained” by measurement M. The pi describe a probability distribution over “macrostates” (i.e. measurement outcomes), and the Vi describe “volumes” of macrostates analogous to the phase space volumes defining Boltzmann entropy.
The formula (1) defining observational entropy can be recast in relative entropy forms more suggestive of its information-theoretic content. In particular, with ρ defined on a Hilbert space H of finite dimension d,
(2)
with D denoting either quantum or classical relative entropy depending on its arguments. Here M is implemented by quantum-classical channel
(3)
called the measuring channel of M, and pi = tr(Miρ), qi=tr(Mi1/d) are the observed probability distributions induced by M on ρ and on the maximally mixed state, respectively.

It is evident from the definition that observational entropy is a continuous function of the state ρ. For practical purposes, however, it is most useful to realize this continuity statement in terms of concrete continuity bounds, as has also been done for many other entropic quantities.28–32 

One naive version of such a bound follows quickly from other known bounds. Expanding the definition (1) one can separate observational entropy into a sum of observed Shannon entropy H(p) = −∑ipi log pi over measurement outcomes, and a mean Boltzmann entropy,
(4)
Bounding the Shannon and Boltzmann terms separately one can obtain, given two states ρ, σ such that 12ρσ1ϵ,
(5)
Here |M| is the number of outcomes of the POVM M (cardinality of the index set I), the function h(x) = −x log x − (1 − x)log(1 − x) is the binary Shannon entropy, and ‖log Vi = maxi| log Vi|. The first two terms in the bottom line arise from the classical Zhang–Audenaert continuity bound on Shannon entropy,32–34 while the latter term provides a loose bound on the Boltzmann term. Both terms are derived using elementary properties relating the quantum trace norm to probabilities.34 

Though apparently useful insofar as it implies |ΔS| → 0 as ϵ → 0 for any particular M, the bound (5) fails to be universal, depending on the measurement M both through the number of outcomes |M| and through the volume terms Vi. Moreover, it is unacceptably loose, as evidenced by the following pathology. Given any POVM M, one can define another one M′, with twice as many outcomes, by splitting each POVM element in half and counting it twice (Mi → two copies Mi/2). Letting MM′ the observational entropies SM(ρ) and SM(σ) are both invariant, but by repeating this transformation the bound (5) can be made arbitrarily loose (noting that after Vi ≤ 1 each iteration separately loosens both terms). This pathology highlights the need for an improved and universal bound.

One expects the shortcomings above to be common to any bound attempting to treat the Shannon and Boltzmann terms separately. On the other hand, one might hope that the presence of normalizing volume terms in the definition (1) may actually help improve the bound compared to a Shannon entropy rather than [as occurs in (5)] making it looser. We will show this is indeed the case, improving the bound by treating the terms together.

In particular, the purpose of this note is to demonstrate that, for any measurement M and any states ρ, σ such that 12ρσ1ϵ, observational entropy obeys a continuity bound of the form
(6)
This bound is universal in that it is independent of M, and depends only on the Hilbert space dimension d and the trace distance between the states.
The function g(ϵ) arising in the bound is defined by g(x) = −x log x + (1 + x)log(1 + x) for x > 0, and g(0) ≡ 0. It derives here from the binary Shannon entropy h(x) = −x log x − (1 − x)log(1 − x), where it arises as
(7)
Elsewhere it arises as an entropy in bosonic systems.35 This g(x) is particularly convenient, as throughout x ∈ [0, ) it is continuous, monotonic increasing, non-negative, and concave.

The main bound (6) appears in Theorem 6. In order to establish it, we first will observe that observational entropy has the same bounded concavity property typical of other entropic quantities, that is, it is concave but not too concave. We then establish a general continuity bound based only on bounded concavity, obtaining the main result as a corollary. The method of proof follows closely the Proof of Lemma 2 of Ref. 32, which obtained a closely related tight bound on quantum conditional entropy, and which in turn is closely related to methods used in Refs. 36 and 37, this method sometimes being known as the Alicki–Fannes–Winter (AFW) trick.32,36 The general bound Proposition 5 given here is closely related to Lemma 7 of Ref. 32, and provides a slightly different generalization thereof. Continuity bounds of this form are often referred to as asymptotic continuity due to the bound per qudit scaling as ϵ log d in the n-copy regime.38,39

In addition to the main observational entropy result, we also discuss continuity results for other entropic quantities that may be of general interest. In Proposition 8 we consider relative entropy distances of the form
(8)
to a convex set of states χ under channels M,N. In Theorem 11 we then consider measured relative entropies under restricted measurements40 
(9)
where DM(ρσ) is the classical relative entropy observed under measurement M. And in Proposition 12 we apply these results to the case of an observational conditional entropy SMA(A|B)ρ motivated by relation to entropic uncertainty principles. All of these bounds are derived by appealing to a general bound in the form
(10)
shown in Proposition 5 for Z(ρ) any function obeying bounded concavity/convexity. As a final step, in Corollary 15 we show that SM(ρ) and DM(ρσ) are continuous as functions of the measurement M (in an appropriate topology on POVMs). This result is a special case of Theorem 14, where we show uniform continuity of
(11)
as a function of channel Φ. In Example 13 we show limitations preventing simple asymptotic bounds from being obtained for the measurement continuity.

In the following we continue to consider a finite dimensional Hilbert space H of dimension d unless otherwise specified. Quantum states ρ are positive semidefinite Hermitian operators normalized to unit trace. Measurements are described by POVMs (positive operator valued measures), defined as collections of positive semidefinite Hermitian operators summing to the identity, assumed here to have a finite number of outcomes. The von Neumann entropy is denoted by S(ρ) = −tr(ρ log ρ), and Shannon entropy by H(p) = −∑ipi log pi. The binary Shannon entropy function is denoted by h(x), and g(x) is as above. Observational entropy SM(ρ) is defined by (1).

An elementary property of Shannon entropy is bounded concavity41 (in the case of classical Shannon entropy this property can be derived directly from a simple chain rule computation, but it can also be inferred as a subcase of the more commonly emphasized quantum generalization), which we state here as a lemma for later use.

Lemma 1.
Let λk be a probability distribution and pk a probability distribution for each k. The Shannon entropy H is concave but not too concave,
(12)
Note that (pk)i=pi|k is a conditional probability distribution and kλkpki=kλkpi|k.

Bounded concavity of observational entropy follows from the bounded concavity of Shannon entropy.

Lemma 2
[observational entropy (OE) Bounded Concavity]. Let λk be a probability distribution and ρk a state for each k. The observational entropy SM is concave but not too concave,
(13)

Proof.
Let ρ = ∑kλkρk and pi = tr(Miρ). Then pi = ∑kλk pi|k, where pi|k = tr(Miρk). Observe that only the difference in Shannon entropy contributes to the difference in OE, since the mean Boltzmann entropy is equal for the two distributions. That is, we have [abusing the notation H(pi) ↔ H(p) for simplicity where unambiguous]
(14)
(15)
(16)
(17)
But Lemma 1 precisely says that 0 ≤ ΔHH(λ), the desired result.□

In general bounded concavity for real functions on convex sets may be defined as follows.

Definition 3
(General Bounded Concavity/Convexity). A function Z:SR defined on a convex set S is bounded concave if, for any finite convex combination ρ = ∑kλkρk,
(18)
A function Z is bounded convex ifZ is bounded concave.

In the next section we derive a continuity bound for functions of quantum states that applies to any function with a bounded concavity or convexity property of this form, with observational entropy continuity as a corollary.

The continuity bound proved below is derived by appealing to bounded concavity. Similar methods have been used to derive continuity results for other entropic quantities in the literature, such as in Refs. 32, 3639, and 4246. The following decomposition is essential to performing the main trick of the proof, sometimes referred to as the Alicki-Fannes-Winter trick (cf. Refs. 32 and 36). Whereas the previous section only involved convex structure, the derivation of the following decomposition also involves topology via the trace norm. The resulting continuity bound depends on both the convex and topological structures of the space of quantum states.

Lemma 4
(ωΔ Decomposition). Let ρ, σ be quantum states, and define ϵ=12ρσ1. There exists a state ω and states Δ± such that
(19)

Proof.
Let ρ, σ be states such that ‖ρσ1 = 2ϵ, and let
(20)
The positive and negative parts (ρσ)± are defined in the usual way,34 by diagonalizing the Hermitian matrix ρσ, then separating into the sum of a non-negative diagonal and a non-positive diagonal matrix. That is, ρσ = UDU = U(D+ + D)U = ϵΔ+ϵΔ. By definition Δ± ≥ 0 since they are Hermitian with only non-negative eigenvalues. Since ρσ is traceless, we have tr(Δ+) = tr(Δ) = ‖Δ±1, with the last equality by positive semidefiniteness. Further, ‖ρσ1 = ‖D+ + D1 = ‖D+1 + ‖D1 = ‖ϵΔ+1 + ‖ϵΔ1 = 2ϵ‖Δ±1 = 2ϵ using unitary invariance of the trace norm and the fact that D± are diagonal. Thus tr(Δ±) = 1, and Δ± are states.
Now define
(21)
This ω is a convex combination of states, so tr(ω) = 1 and ω is a state. The above relations taken together with ω can be viewed as expressing a linear decomposition
(22)
(23)
From (21) with (22) it is straightforward to verify that
(24)
which completes the proof.□

Next appears a general continuity bound for any function Z(ρ) obeying bounded concavity on quantum states. Since (25) is invariant under Z → −Z this implies the same bound for functions obeying bounded convexity.

Proposition 5.

Let Z(ρ) be any function satisfying bounded concavity [cf. (18)] on quantum states. We have the following continuity bound.

Let ρ, σ be states such that 12ρσ1ϵ. Then
(25)
where g(ϵ) is as in (7) and κ = sup(μ,ν)|Z(μ) − Z(ν)| is the supremum absolute difference in Z between any two states.

Proof.
Suppose ϵ=12ρσ1, and suppose there exist a state ω and states Δ± such that
(26)
The existence of such states was demonstrated in Lemma 4. From (26) we have two different convex decompositions ω = λ ω1 + (1 − λ) ω2, both with the same coefficients λ = 1/(1 + ϵ). By the assumption (18) of bounded concavity,
(27)
holds for both of these convex decompositions. The trick is simply to use ω1, ω2 = ρ, Δ for one side of the inequality and ω1, ω2 = σ, Δ+ for the other. This amounts to
(28)
Rearranging yields
(29)
Finally we take the magnitude and apply triangle inequality,
(30)
(31)
The second inequality follows since a magnitude difference in Z between two states is by definition less than or equal to κ. Finally with λ = 1/(1 + ϵ) and h(λ) = h(1 − λ) we obtain
(32)
One can confirm the bound is monotonic in ϵ. Therefore if it holds for ϵ=12ρσ1, as was just shown, then it also holds for ϵ′ ≥ ϵ. This completes the proof.□

In the following sections this general bound is evaluated for the observational entropy and extended to a form that includes several related quantities.

The general statement Proposition 5 implies a continuity bound on observational entropy.

Theorem 6
(OE Continuity). Let ρ, σ be quantum states such that 12ρσ1ϵ. Then observational entropy obeys the continuity bound
(33)
where g(ϵ) is as in (7) and d is the Hilbert space dimension.

Proof.

By Lemma 2, SM obeys bounded concavity, so the bound follows from Proposition 5. An elementary property of observational entropy is that 0 ≤ SM(ρ) ≤ log d.10 Thus we have κ = sup(μ,ν)|SM(μ) − SM(ν)| = supμSM(μ) = log d.□

The main result, a measurement-independent continuity bound on observational entropy, has therefore been established.

As an aside we note that bounds on both the Shannon and von Neumann entropies can be derived from the observational entropy continuity, as shown in the following corollary. The bounds thus obtained are looser than the known optimized forms.30 

Corollary 7.
Let ρ, σ be states such that 12ρσ1ϵ. Then von Neumann entropy S obeys
(34)
Let pi, qi be N-outcome probability distributions such that 12pq1ϵ. Then Shannon entropy H obeys
(35)
In this form these are both special cases of Theorem 6, however, tighter bounds can be derived by other methods.

Proof.

We derive these as corollaries of Theorem 6. For the quantum case suppose (wlg) that S(ρ) ≥ S(σ). Let Mσ measure in the σ eigenbasis. Then |SMσ(ρ)SMσ(σ)|=SMσ(ρ)S(σ)S(ρ)S(σ)=|S(ρ)S(σ)|. For the classical case, let ρ = ∑ipi|i⟩⟨i| and σ = ∑iqi|i⟩⟨i| in N-dimensional Hilbert space, and M0 measure in the |i⟩⟨i| basis. (At least N dimensions are needed to embed p, q classically.) Since the states are diagonal we have ρσ1=pq12ϵ. Then measuring M0 obtains pi, qi and we have |SM0(ρ)SM0(σ)|=|H(pi)H(qi)|.□

The general Proposition 5 likewise provides a continuity bound on measured relative entropies associated with a particular measurement, as well as on the quantum relative entropy.

Proposition 8.
Let χ be a convex set of states (or more generally, of positive semidefinite Hermitian operators). Let M and N be arbitrary quantum channels. Then
(36)
is bounded convex. Thus Z(ρ) obeys the bound (25) of Proposition 5. This ensures continuity whenever the maximum variation κ is finite.

Proof.
Let ρ = ∑kλkρk. First note that for any fixed σχ we have
(37)
since D(M(ρ)N(σ))=S(M(ρ))trM(ρ)logN(σ) and applying linearity of the channels. Thus
(38)
by bounded concavity of von Neumann entropy. Note that D(M(ρ)N(σ))= if and only if kλkD(M(ρk)N(σ))= because the von Neumann terms are finite in finite dimension, and that (38) can be rearranged so the infinite case = = is well defined.
Now we introduce the infimum over σχ. It is immediate that the upper bound on convexity is retained, since
(39)
using that the sum of infima is no greater than the infimum of the sum and (38).
The upper convexity bound just derived holds for the infimum of any family of bounded convex functions. In the particular case considered here, the lower convexity bound also holds, using convexity of the set χ and joint convexity of the relative entropy. In particular, since Z(ρ) is an infimum, it follows that for any ρ and any δ > 0, there exists a state γχ such that D(M(ρ)N(γ))Z(ρ)+δ. Thus fix δ > 0 and let D(M(ρk)N(γk))Z(ρk)+δ, with ν = ∑kλkγk. Then
(40)
Since this holds for all δ > 0, it follows that ∑kλk Z(ρk) ≥ Z(ρ). The first inequality expresses the initial infimum assumption, the second is joint convexity of relative entropy and linearity of the channels, and the last that νχ using convexity of the set.□

Remark 9.

If N(χ) contains an element of full rank the maximum variation κ is necessarily finite. If M=N and χ is a set of states containing the maximally mixed state, then κ ≤ log d.

Proof.

If N(χ) contains an element N(σ) of full rank then all |Z(ρ)|logd+logN(σ) and therefore κ is finite. Meanwhile if M=N and χσ is a set of states, then D(ρσ)D(M(ρ)M(σ))0, so if also (1/d)χ then κ ≤ supρS(ρ) ≤ log d.□

In particular, Proposition 8 in the case M=N=ΦM [cf. (3)] demonstrates a bound on measured relative entropy distance to a convex set of states, given the single measurement M. A natural question is whether this extends to include measured relative entropies associated with an allowed class of measurements M,40,42
(41)
In full generality we take χ a convex set of states and M a set of measurements, and consider the quantity
(42)
Due to the supremum in DM this Z(ρ) fails to obey bounded convexity, and the bound (25) cannot be applied directly. Nonetheless, we demonstrate below that a closely related continuity result can be obtained. This provides a more general and systematic extension to related results previously appearing in Ref. 42, and also subsumes the entropic bounds already considered above.

A crucial step is the following minimax lemma.

Lemma 10.
Let χ be a compact convex set of states. Let M be a set of quantum channels that is closed under flagged finite convex combination, that is, such that if MkM, and λk is a finite probability distribution, then also kλkMk|kk|M. Then
(43)

Operationally motivated classes M of transformations often have the required property automatically, such as LOCC, SEP, PPT channels in composite systems.

Proof.
This lemma has been established previously in Ref. 47 on the grounds of a general minimax theorem of Ref. 48. For completeness we include its demonstration here. We argue based on the minimax theorem in the form of Sion.49 Namely, consider the function ipiD(Mi(ρ)Mi(σ)) of two arguments, one a state σχ, the other a finite-support probability distribution over channels from M. The first set (χ) is convex and compact by assumption, the second (denoted PM) is convex by construction but otherwise an arbitrary space. For simplicity assume that our objective function f(σ, p) is finite; it is then continuous, and it is convex in the first and linear—in particular concave—in the second argument. Hence, by Ref. 49 we get
Note that in general we need the sum over i weighted by pi’s, otherwise the final supremum is not large enough. But if the set of channels is closed under taking flagged convex combinations, i.e. with Mi it also contains MipiMi|ii|, then ipiD(Mi(ρ)Mi(σ))=D(M(ρ)M(σ)), hence
which concludes the proof in the case that f is always finite. To cover the case of values +, one can modify the argument, removing the infinities from the objective function; or use the minimax theorem from Ref. 48 as done in Ref. 47.□

For classes M of measurements closed under disjoint convex combination, and compact convex sets χ of states, we have—with the help of the minimax lemma—the following continuity theorem for measured relative entropies Z(ρ)=infσχDM(ρσ).

We remark that many commonly invoked measurement classes (such as LOCC, SEP, PPT measurements in composite systems) automatically fulfill the required closure property. Furthermore, given some primitive set of measurements M0 not already doing so, from an operational perspective it is perfectly feasible to complete this to an M that is closed in this way, given the capability to choose which measurements to perform based on classical randomness.

Theorem 11.
Let χ be a compact convex set of states. Let M be a convex set of measurements that is also closed under disjoint convex combination (that is, convex combinations taken on the disjoint union of the outcome sets). Let ΦM as in (3) and define
(44)
Then for any two states ρ, ρsuch that 12ρρ1ϵ,
(45)
Here
(46)
and g(ϵ) is as in (7).

Proof.
Let ΦM denote the set of measuring channels associated with class M, defined as in (3). Clearly Z(ρ)=infσχsupMMD(ΦM(ρ)ΦM(σ))=infσχsupΦΦMD(ΦM(ρ)ΦM(σ)). Now the measuring channel is convex linear in M on a fixed outcome set, ΦkλkMk=kλkΦMk. To perform disjoint convex combination, which we denote ⊕kλkMk, one first extends each outcome set to the disjoint union of outcome sets, padding with zeroes, then convexly combines on the fixed set. Suppose ΦMkΦM. It can be seen that Φ=kλkΦMk|kk|=ΦkλkMk, which is also in ΦM since kλkMkM. Therefore the assumptions of the Lemma 10 are fulfilled, and it holds
(47)
This provides the key step to relate Z(ρ) to a form where bounded convexity can be exploited.
Proceeding, first using the minimax equality, and next that a difference of suprema is no less than the supremum difference, one obtains
(48)
(49)
(50)
But by Proposition 8 this ZM(ρ) is bounded convex and obeys (25), hence further
(51)
(52)
establishing the result.□

This provides asymptotic continuity of the entropies of restricted measurement first considered by Piani.40 Similar results were also obtained recently in independent work, in the context of local classes of measurements, and will appear elsewhere.50 

The main limitation of the Theorem 11 is the requirement that χ be convex. One may also be interested in measured relative entropy distances to non-convex sets of states such as coherent or classically correlated states. In this regard the theorem can be compared to results of Ref. 42, which obtained a looser bound but allows χ to be any set star-shaped around the maximally mixed state. Obtaining a bound for star-shaped χ with an optimal dimensional factor remains a useful future direction.

The above results are applied in this section to derive continuity bounds on an observational conditional entropy in bipartite systems, defined by
(53)
a quantity motivated by applications to entropic uncertainty principles with memory.51 This conditional form describes quantum uncertainty in system B having made a measurement MA in system A.

We have the following bound. Note that the dimensional factor of log d arising from maximum variation is reduced by a factor of two from the corresponding bound on quantum conditional entropy S(A|B)ρ=D(ρ1AρB).32 

Proposition 12.
Let ρ, σ be quantum states such that 12ρσ1ϵ. Then
(54)
where g(ϵ) is as in (7) and dA is the Hilbert space dimension in system A.

Proof.
We derive the bound as a consequence of Theorem 8. To establish the link we note that the observational conditional entropy SMA(A|B)ρ defined by (53) admits the variational formula
(55)
where the infimum is over all states ωB in system B. (We will return to the justification of this variational formula after concluding the rest of the proof.) In light of this equality, the difference SMA(A|B)ρSMA(A|B)σ is precisely equal to a difference between infimized relative entropies of the form in Theorem 8, in particular with M=N=ϕMAIB and χ={1AdAωB|ωB a  state  on  B}. Thus the Theorem 8 demonstrates the bound (25) holds, and it only remains to evaluate the maximum variation κ.

To determine κ it suffices to obtain the bound 0SMA(A|B)ρlogdA for all ρ. From non-negativity of the relative entropy, the upper bound SMA(A|B)ρlogdA follows immediately. To establish the lower bound we will make use of the following observations:

  • With ΦM a measurement channel as in (3), (ΦMI)ρ = ∑j|j⟩⟨j| ⊗ pj ρj yields a classical-quantum state, with pj = tr(MjρA) the induced probability distribution over M outcomes, and ρj=trA((Mj1)ρ)/pj some conditional states.

  • For any two such classical-quantum states, D(ρρ)=D(pjpj)+jpjD(ρjρj).

  • With pj = tr(MjρA) and qj = tr(Mj)/dA, one has SM(ρA) = log dAD(pjqj) the marginal observational entropy in system A.

  • With pj, ρj as in (i), one has ∑jpjρj = ρB since jMj=1A.

  • SMA(A|B)ρ is concave in ρ.We thus have
    (56)
    (57)
    (58)
    The first equality is by definition, the second uses (i) and (ii), and the third uses (iii) and (iv). Finally, we note that for pure states |ψAB⟩ we have that S(ρA) = S(ρB) and therefore that SMA(ρA)S(ρB). Thus for pure states we have SMA(A|B)|ψψ|0. But by concavity this non-negativity extends to convex combinations and so in general it holds that SMA(A|B)ρ0.

To complete the proof, note that since 0SMA(A|B)ρlogdA, the same bounds hold for the infimized relative entropy in (55). Thus the maximum variation κ in Theorem 8 is given by κ = supμ,ν|Z(μ) − Z(ν)| ≤ supμZ(μ) ≤ log dA.

It remains to justify the variational formula (55). Applying the same methods used to obtain (58) earlier, (55) can be seen to become SMA(ρA)+jpjS(ρj)infωBtr(ρBlogωB). But it follows immediately from D(ρBωB) ≥ 0 that tr(−ρB log ωB) ≥tr(−ρB log ρB) for any ωB, establishing that ωB = ρB saturates the infimum.□

The preceding sections have demonstrated concrete uniform continuity bounds on observational entropy SM(ρ) (and related quantities) under variations in the state ρ. An equally important question is that of continuity under changes in the measurement M. With a suitable norm in the space of measurements (namely the diamond norm between measuring channels), can similar bounds be obtained on |SM(ρ) − SM(ρ)| for a fixed state?

We will show that although the observational entropy SM(ρ) is indeed a continuous function of M, no simple concrete bounds of the type obtained earlier can hold. A strong limitation is exhibited by the following example.

Example 13.
Let M=(Π,1Π) be a binary measurement, and M̃=(1Π,Π) its permutation. Consider a convex combination of this binary M with its permutation,
(59)
This has POVM elements Mλ=(12λ)Π+2λ(1/2),(12λ)(1Π)+2λ(1/2). It follows that also
(60)
where 12 denotes the trivial binary measurement with elements (1/2,1/2). This shows that mixing M with λ of its permutation is equivalent to mixing M with 2λ of pure noise.

Observe that measuring channels are convex linear in the measurement, namely Φ(1−λ)M+λN = (1 − λM + λΦN. Thus the diamond norm distance between channels implementing Mλ and M is bounded by 12ΦMλΦM=12λ(ΦM̃ΦM)λ due to the universal bounds on trace distances between states.34 This provides a strong sense in which Mλ and M are close together for continuity purposes, and ensures in particular that the probability distributions (outcome statistics) induced by Mλ, M on any state differ by no more than λ in the total variational distance (half of the 1 norm).

Continuing, suppose ρ = |ψ⟩⟨ψ| is pure and M=(|ψψ|,1|ψψ|) measures ρ perfectly. The observational entropy of M and its permutation are equal in general, and in the present case one finds SM(ρ)=SM̃(ρ)=0. On the other hand, for the convex combination Mλ one has probabilities and volumes
(61)
leading to an observational entropy
(62)
This can be viewed as Boltzmann and Shannon terms by expanding the logs.
Now the entropy SMλ(ρ) may be used to analyze both the bounded concavity, and directly the continuity, of observational entropy viewed as a function of measurement. First consider the bounded concavity. In the present case we have an entropy concavity difference
(63)
since Mλ=(1λ)M+λM̃ with the M,M̃ each giving zero entropy. And second consider directly the continuity. In the present case we have measurements such that 12ΦMλΦMλ and whose entropy difference is
(64)
since the latter term gives zero. Therefore general bounds on continuity and concavity can be no better than in this particular case.

We make the following observations {with SλSMλ(ρ) and d ≥ 2 and λ ∈ [0, 1/2]}:

  • Any λ > 0 implies Sλ > 0. For fixed d, as λ → 0 also Sλ → 0, consistent with continuity.

  • S1/2 = log d, so (63) shows that the naive concavity bound H(λ) that held for states [cf. (18)] can be violated by a maximum amount.

  • For fixed λ > 0, one can achieve Sλ/log d arbitrarily close to 1 by taking d sufficiently large. This shows that there is no continuous f(λ) such that f(0) = 0 and Sλf(λ) log d.

The final observation implies a strong limitation on asymptotic continuity bounds for measurements: there cannot be any continuous function f(λ) with f(0) = 0 such that
(65)
for all 12ΦNΦMλ. This form includes all bounds of the type considered earlier for continuity of various entropies with respect to states.

Moreover, one might have hoped to approach the measurement continuity problem similarly to the method used for states, by attempting some form of an AFW trick (modified for channels) based on bounded concavity. However, the example shows that not only does the naive concavity bound by H(λ) fail, but also (likewise with the continuity above) that no concavity bound of the form f(λ) log d is possible.

Due to these limitations, at present we settle for the following more abstract statement of continuity over measurements. We give here a general statement containing observational entropy as a special case.

Theorem 14.

Let ρ, σ be states such that D(ρσ) < . Then F(Φ)=DΦ(ρ)Φ(σ) is a uniformly continuous function of the quantum channel Φ.

Proof.

We are in finite dimension, so the quantum channels form a finite-dimensional convex set and all norms induce the same topology. For concreteness we use the diamond norm. To show continuity we have to show that for any Φ and any δ > 0, there exists an ϵ > 0 such that ΦΦ<ϵ implies |F(Φ) − F(Φ′)| < δ.

To begin, for any channel Φ define Φs = (1 − s)Φ + , which mixes Φ with noise, letting τ(X)=tr(X)1/d prepare maximally mixed state. Observe that F(Φ) is convex in Φ and is finite and non-negative, and that F(τ) = 0. It follows that F(Φ) ≥ (1 − s)F(Φ) ≥ Fs). Therefore for any Φ one obtains |F(Φ) − Fs)| ≤ sF(Φ).

Additionally, for any Φ, the states Φs(ρ) and Φs(σ) are positive definite. The set P of positive definite matrices is an open set in the 2-norm matrix topology (the usual Euclidean topology on matrix elements). Over the domain μ,νP, the function D(μν) is continuous, by the elementary reasoning that it is a composition of matrix multiplication, linear combination, trace, and log, which are all continuous functions on the positive set P. Thus for any δs > 0 there exists an ϵs > 0 such that if both Φs(ρ)Φs(ρ)2<ϵs and Φs(σ)Φs(σ)2<ϵs, then |F(Φs)F(Φs)|<δs. This ϵs must be chose sufficiently small so that both ϵs-balls lie entirely in P.

Meanwhile, for channels Φ, Φ′ it holds ΦsΦs=(1s)(ΦΦ)ΦΦ. And therefore for any state ν we have Φs(ν)Φs(ν)2Φs(ν)Φs(ν)1ΦsΦsΦΦ.

Now we combine the ingredients. For any 1 > s > 0, δs > 0 let ϵs be as above, and suppose ΦΦ<ϵs. Observe that
(66)
(67)
(68)
(69)
The first inequality is triangle inequality. The second follows from the observations listed above. The last is relative entropy monotonicity.

Finally, choose a δ > 0. Since we assumed D(ρσ) < , we can choose s > 0 sufficiently small so that 2sD(ρσ) < δ. Next choose any δs > 0 sufficiently small so that also 2sD(ρσ) + δs < δ. There exists an ϵs > 0 as above so that ΦΦ<ϵs implies |F(Φs)F(Φs)|<δs and therefore also that |F(Φ) − F(Φ′)| < δ. This completes the proof of continuity. The finite dimensional quantum channels form a compact set, thus continuity of F implies it is also uniformly continuous. One can also argue directly (but with more technical difficulty) for uniform continuity by fixing s = δ/4D(ρσ) and using uniform continuity of log on the domain [s/d, ).□

Observational entropy is determined by D(Φ(ρ)‖Φ(σ)) in the case σ=1/d and Φ a measuring channel. Since D(ρ1/d)< always, this implies that (in the topology induced by the diamond norm on ΦM) observational entropy SM(ρ) is a continuous function of the measurement. In this form the continuity is useful only for measurements on the same outcome set.

The continuity under measurements can be extended to measurements on arbitrary outcome sets by defining an appropriate topology. Such a topology can be induced by a simulation distance defined (for POVMs M, N) by
(70)
where the infimum defining the “one-way” simulation distance γ(M,N) is over stochastic maps Λ from the M outcome set to the N outcome set, so that ΛM and N are measurements with the same set of outcomes. The function γ(M, N) can be shown to obey the axioms of a (pseudo-)metric, and therefore induces a topology on the space of POVMs. This distance captures how well two POVMs can approximate one another using classical postprocessing by stochastic channels—if both can emulate each other perfectly by postprocessing alone, they are effectively the same POVM. In terms of this topology we have continuity of measured relative entropy DM with respect to M, and therefore also of SM the observational entropy.

Corollary 15.

Let ρ, σ be states such that D(ρσ) < . Then F(M) = DM(ρσ) is a continuous function of the measurement M in the simulation distance topology. This also implies continuity of F0(M) = SM(ρ) as a particular case. The continuity is uniform (at least for d < ).

Proof.

Fix δ > 0 and let ϵ = ϵ(δ) as supplied by Theorem 14. Choose any c with ϵ > c > 0, and let ϵ′ = (ϵc)/4 > 0. Suppose γ(M, N) < ϵ′. From the definition of γ, there exist Λ, Λ′ such that ‖ΦΛM − ΦN < ϵ and ‖ΦΛ′N − ΦM < ϵ. Thus |DΛMDN| < δ and |DΛ′NDM| < δ. We then have DMDΛM > DNδ and also DM < DΛ′N + δDN + δ, which follow using relative entropy monotonicity, and thus |DMDN| < δ. This completes the proof.

Put more simply, we stated that for small γ(M, N) we have DNDΛ′NDMDΛMDN, which implies DMDN, where ≥ are by monotonicity and ≈ are by the earlier continuity.□

The main Theorem 6 described a continuity bound (33) on observational entropy that is universal, in the sense of being independent of the POVM M defining the coarse-graining, and with a dimensional factor log d of the Hilbert space dimension.

This bound improves upon the naive bound (5) insofar as it scales only with log d independently of the number of measurement outcomes |M|. This reflects the intuitive status of observational entropy as a classical entropy in quantum systems, compared, for example to the Shannon entropy of an observable. The improved universality does come with a small trade-off, as when M is a projective measurement in a complete orthonormal basis, the naive bound (5) is slightly stronger. We note that (33) can only be relevant when ρ, σ have overlapping support. In case the supports are orthogonal, the trace distance ϵ = 1 is maximized, and (33) is weaker than the trivial |ΔSM| ≤ log d. This demonstrates that (33) cannot be tight in general—for fixed ϵ, there is not generally a choice of ρ, σ, M for which the inequality is saturated.

Meanwhile, in the case of continuity with respect to measurements, it was seen that although observational entropy is continuous, explicit bounds on the convergence are difficult (if not impossible) to obtain. Other notable limitations on the present results were the necessity of a convex reference set for measured relative entropy continuity (as discussed earlier), and the restriction throughout the paper to finite dimensional spaces. Extending our results to the infinite dimensional case encounters difficulties associated with the possibility of infinite Boltzmann and von Neumann entropies, and doing so carefully will be a useful continuation.

One can also consider a classical version of the observational entropy development for density distributions ρ, σ on phase space Γ. The trace is an integral on Γ, the norm the L1 norm, and the measurement M a partition of unity. The dimensional factor d (better labelled V in the classical case) is the total volume of phase space in an appropriately chosen measure. The resulting bound (33) holds equally well in the classical case so long as one can restrict consideration to a finite region of phase space.

As with the quantum case, the classical (33) can be nontrivial (stronger than |ΔSM| ≤ log V) only when ρ, σ have support on an overlapping region of phase space. In this sense, the closeness of entropy ensured by the bound arises only from a probability to be in exactly the same state. Further one may observe that the bound will be time independent for isolated systems, whose dynamics are norm preserving. This can be considered in terms of the second law of thermodynamics for coarse-grained entropies. Suppose ρ is concentrated at some particular “second-law-violating” point in phase space, one which evolves to a lower entropy state. Any σ sufficiently close to ρ in the norm distance must also be second-law-violating, which at first glance seems to violate the principle that such states are rare. Fortunately, by the above considerations, σ may be concentrated in any phase space region disjoint from ρ, even arbitrarily nearby, without any restriction on its entropic dynamics. In this way the bound is consistent with the principle that nearby phase space points of a chaotic system may have drastically different dynamics.

The authors thank Ludovico Lami for comments regarding the measured relative entropy continuity and the minimax lemma, and Niklas Galke, Philipp Strasberg, and Giulio Gasbarri for helpful discussions.

The authors acknowledge support by MICIIN with funding from European Union NextGenerationEU (Grant No. PRTR-C17.I1) and by Generalitat de Catalunya. AW is furthermore supported by the European Commission QuantERA grant ExTRaQT (Spanish MICINN Project No. PCI2022-132965), by the Spanish MINECO (Project No. PID2019-107609GB-I00) with the support of FEDER funds, the Generalitat de Catalunya (Project No. 2017-SGR-1127), by the Alexander von Humboldt Foundation, as well as the Institute for Advanced Study of the Technical University Munich.

The authors have no conflicts to disclose.

Joseph Schindler: Conceptualization (equal); Formal analysis (equal); Investigation (equal); Writing – original draft (equal); Writing – review & editing (equal). Andreas Winter: Conceptualization (equal); Formal analysis (equal); Investigation (equal); Writing – review & editing (equal).

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

1.
J.
von Neumann
, “
Proof of the ergodic theorem and the H-theorem in quantum mechanics
,”
Eur. Phys. J. H
35
,
201
(
2010
), Translation of: Beweis des Ergodensatzes und des H-Theorems in der neuen Mechanik; arXiv:1003.2133 [physics.hist-ph].
2.
N. G.
Van Kampen
, “
Quantum statistics of irreversible processes
,”
Physica
20
,
603
(
1954
).
3.
J.
von Neumann
,
Mathematical Foundations of Quantum Mechanics
(
Princeton University Press
,
1955
).
4.
A.
Wehrl
, “
General properties of entropy
,”
Rev. Mod. Phys.
50
,
221
(
1978
).
5.
D.
Šafránek
,
J. M.
Deutsch
, and
A.
Aguirre
, “
Quantum coarse-grained entropy and thermodynamics
,”
Phys. Rev. A
99
,
010101
(
2019
); arXiv:1707.09722 [quant-ph].
6.
D.
Šafránek
,
A.
Aguirre
,
J.
Schindler
, and
J. M.
Deutsch
, “
A brief introduction to observational entropy
,”
Found. Phys.
51
,
101
(
2021
); arXiv:2008.04409 [quant-ph].
7.
P.
Strasberg
and
A.
Winter
, “
First and second law of quantum thermodynamics: A consistent derivation based on a microscopic definition of entropy
,”
PRX Quantum
2
,
030202
(
2021
); arXiv:2002.08817 [quant-ph].
8.
D.
Šafránek
,
J. M.
Deutsch
, and
A.
Aguirre
, “
Quantum coarse-grained entropy and thermalization in closed systems
,”
Phys. Rev. A
99
,
012103
(
2019
); arXiv:1803.00665 [quant-ph].
9.
D.
Šafránek
,
A.
Aguirre
, and
J. M.
Deutsch
, “
Classical dynamical coarse-grained entropy and comparison with the quantum version
,”
Phys. Rev. E
102
,
032106
(
2020
); arXiv:1905.03841 [cond-mat.stat-mech].
10.
D.
Šafránek
and
J.
Thingna
, “
Quantifying information extraction using generalized quantum measurements
,”
Phys. Rev. A
108
, 032413 (2023).
11.
A.
Wehrl
, “
On the relation between classical and quantum-mechanical entropy
,”
Rep. Math. Phys.
16
,
353
(
1979
).
12.
A.
Riera-Campeny
,
A.
Sanpera
, and
P.
Strasberg
, “
Quantum systems correlated with a finite bath: Nonequilibrium dynamics and thermodynamics
,”
PRX Quantum
2
,
010340
(
2021
); arXiv:2008.02184 [quant-ph].
13.
P.
Strasberg
,
M. G.
Díaz
, and
A.
Riera-Campeny
, “
Clausius inequality for finite baths reveals universal efficiency improvements
,”
Phys. Rev. E
104
,
L022103
(
2021
); arXiv:2012.03262 [quant-ph].
14.
F.
Buscemi
,
J.
Schindler
, and
D.
Šafránek
, “
Observational entropy, coarse quantum states, and Petz recovery: Information-theoretic properties and bounds
,”
New J. Phys.
25
,
053002
(
2022
); arXiv:2209.03803 [quant-ph].
15.
L.
Amadei
,
H.
Liu
, and
A.
Perez
, “
Unitarity and information in quantum gravity: A simple example
,”
Front. Astron. Space Sci.
8
,
46
(
2021
); arXiv:1912.09750 [gr-qc].
16.
X.
Zhou
and
Z.-J.
Zheng
, “
Relations between the quantum correlation entropy and quantum discord for X-states in multipartite systems
,”
Eur. Phys. J. Plus
137
,
625
(
2022
).
17.
X.
Zhou
and
Z.-J.
Zheng
, “
Relations between the observational entropy and Rényi information measures
,”
Quantum Inf. Process.
21
,
228
(
2022
).
18.
R.
Hamazaki
, “
Speed limits for macroscopic transitions
,”
PRX Quantum
3
,
020319
(
2022
); arXiv:2110.09716 [cond-mat.stat-mech].
19.
D.
Faiez
,
D.
Šafránek
,
J. M.
Deutsch
, and
A.
Aguirre
, “
Typical and extreme entropies of long-lived isolated quantum systems
,”
Phys. Rev. A
101
,
052101
(
2020
); arXiv:1908.07083 [quant-ph].
20.
R.
Modak
and
S.
Aravinda
, “
Observational entropic study of Anderson localization
,”
Phys. Rev. A
106
,
062217
(
2022
); arXiv:2209.10273 [quant-ph].
21.
A.
Stokes
, “
Nonconjugate quantum subsystems
,”
Phys. Rev. E
106
,
034111
(
2022
); arXiv:2106.11017 [quant-ph].
22.
P.
Strasberg
,
Quantum Stochastic Thermodynamics: Foundations and Selected Applications
(
Oxford University Press
,
2022
).
23.
P.
Strasberg
,
A.
Winter
,
J.
Gemmer
, and
J.
Wang
, “
Classicality, Markovianity, and local detailed balance from pure-state dynamics
,”
Phys. Rev. A
108
,
012225
(
2022
); arXiv:2209.07977 [quant-ph].
24.
D.
Šafránek
,
D.
Rosa
, and
F.
Binder
, “
Work extraction from unknown quantum sources
,”
Phys. Rev. Lett.
130
,
210401
(
2022
); arXiv:2209.11076 [quant-ph].
25.
S.
PG
,
R.
Modak
, and
S.
Aravinda
, “
Witnessing quantum chaos using observational entropy
,”
Phys. Rev. E
107
,
064204
(
2022
); arXiv:2212.01585 [quant-ph].
26.
X.
Zhou
, “
Dynamical behavior of quantum correlation entropy under the noisy quantum channel for multiqubit systems
,”
Int. J. Theor. Phys.
62
,
23
(
2023
).
27.
D.
Šafránek
and
D.
Rosa
, “
Expectation values from any quantum measurement
,”
Phys. Rev. A
108
,
022208
(
2023
); arXiv:2301.10428 [quant-ph].
28.
M.
Fannes
, “
A continuity property of the entropy density for spin lattice systems
,”
Commun. Math. Phys.
31
,
291
(
1973
).
29.
M.
Grabowski
and
P.
Staszewski
, “
On continuity properties of the entropy of an observable
,”
Rep. Math. Phys.
11
,
233
(
1977
).
30.
K. M. R.
Audenaert
, “
A sharp continuity estimate for the von Neumann entropy
,”
J. Phys. A: Math. Theor.
40
,
8127
(
2007
); arXiv:quant-ph/0610146 [quant-ph].
31.
K. M. R.
Audenaert
and
J.
Eisert
, “
Continuity bounds on the quantum relative entropy
,”
J. Math. Phys.
46
,
102104
(
2005
); arXiv:quant-ph/0503218 [quant-ph].
32.
A.
Winter
, “
Tight uniform continuity bounds for quantum entropies: Conditional entropy, relative entropy distance and energy constraints
,”
Commun. Math. Phys.
347
,
291
(
2016
); arXiv:1507.07775 [quant-ph].
33.
Z.
Zhang
, “
Estimating mutual information via Kolmogorov distance
,”
IEEE Trans. Inf. Theory
53
,
3280
(
2007
).
34.
M. M.
Wilde
,
Quantum Information Theory
(
Cambridge University Press
,
2013
); arXiv:1106.1445v8.
35.
A. S.
Holevo
and
R. F.
Werner
, “
Evaluating capacities of Bosonic Gaussian channels
,”
Phys. Rev. A
63
,
032312
(
2001
); arXiv:quant-ph/9912067 [quant-ph].
36.
R.
Alicki
and
M.
Fannes
, “
Continuity of quantum conditional information
,”
J. Phys. A: Math. Gen.
37
,
L55
(
2004
); arXiv:quant-ph/0312081 [quant-ph].
37.
M.
Mosonyi
and
F.
Hiai
, “
On the quantum Rényi relative entropies and related capacity formulas
,”
IEEE Trans. Inf. Theory
57
,
2474
(
2011
); arXiv:0912.1286 [quant-ph].
38.
M. J.
Donald
and
M.
Horodecki
, “
Continuity of relative entropy of entanglement
,”
Phys. Lett. A
264
,
257
(
1999
); arXiv:quant-ph/9910002 [quant-ph].
39.
B.
Synak-Radtke
and
M.
Horodecki
, “
On asymptotic continuity of functions of quantum states
,”
J. Phys. A: Math. Gen.
39
,
L423
(
2006
); arXiv:quant-ph/0507126 [quant-ph].
40.
M.
Piani
, “
Relative entropy of entanglement and restricted measurements
,”
Phys. Rev. Lett.
103
,
160504
(
2009
); arXiv:0904.2705 [quant-ph].
41.
M. A.
Nielsen
and
I. L.
Chuang
,
Quantum Computation and Quantum Information: 10th Anniversary Edition
(
Cambridge University Press
,
2010
).
42.
K.
Li
and
A.
Winter
, “
Relative entropy and squashed entanglement
,”
Commun. Math. Phys.
326
,
63
(
2014
); arXiv:1210.3181 [quant-ph].
43.
M. E.
Shirokov
, “
Advanced Alicki–Fannes–Winter method for energy-constrained quantum systems and its use
,”
Quantum Inf. Process.
19
,
164
(
2020
); arXiv:1907.02458 [quant-ph].
44.
A.
Bluhm
,
Á.
Capel
,
P.
Gondolf
, and
A.
Pérez-Hernández
, “
Continuity of quantum entropic quantities via almost convexity
,”
IEEE Trans. Inf. Theory
69
,
5869
(
2022
); arXiv:2208.00922 [quant-ph].
45.
A.
Bluhm
,
Á.
Capel
,
P.
Gondolf
, and
A.
Pérez-Hernández
, “
General continuity bounds for quantum relative entropies
,” arXiv:2305.10140 [quant-ph] (
2023
).
46.
M. E.
Shirokov
, “
Quantifying continuity of characteristics of composite quantum systems
,”
Phys. Scr.
98
,
042002
(
2023
); arXiv:2201.11477 [quant-ph].
47.
F. G. S. L.
Brandão
,
A. W.
Harrow
,
J. R.
Lee
, and
Y.
Peres
, “
Adversarial hypothesis testing and a quantum Stein’s lemma for restricted measurements
,”
IEEE Trans. Inf. Theory
66
,
5037
(
2020
); arXiv:1308.6702.
48.
B.
Farkas
and
S. G.
Révész
, “
Potential theoretic approach to rendezvous numbers
,”
Monatsh. Math.
148
,
309
(
2006
); arXiv:math/0503423 [math.CA].
49.
M.
Sion
, “
On general minimax theorems
,”
Pac. J. Math.
8
,
171
(
1958
).
50.
L.
Lami
, private communication (
2022
).
51.
P. J.
Coles
,
M.
Berta
,
M.
Tomamichel
, and
S.
Wehner
, “
Entropic uncertainty relations and their applications
,”
Rev. Mod. Phys.
89
,
015002
(
2017
); arXiv:1511.04857 [quant-ph].
Published open access through an agreement with Universitat Autònoma de Barcelona