We introduce a class of channels called heralded channels, which generalize the quantum erasure channel composed with an arbitrary other quantum channel. We show that monogamy of squashed entanglement limits the superadditivity of Holevo capacity of heralded channels in the regime of rare success (frequent erasure). We prove that in the limit of near-zero success probability, the classical capacity of the heralded channel converges to its Holevo information, which is equal to success probability times the single-letter Holevo information of the channel with which erasure is composed. We also show how entanglement monogamy applies to non-classicality in quantum games, and we consider how faithful monogamous entanglement measures may bound other entanglement-dependent quantities in many-party scenarios.

Entanglement powers many of the core non-classical capabilities in the quantum world.1 Unlike classical correlations, entanglement is monogamous. A finite-dimensional quantum system can hold only a finite amount of entanglement, preventing it from entangling strongly with many others simultaneously. This property has some benefits, including natural privacy of quantum correlations, provided one can verify their quantum nature. By contrast, it also seems a severe limitation. If we require entanglement between subsets of disjoint subsystems but do not know in advance which will be relevant, monogamy limits the potential advantage from entanglement.

Our main example comes from the theory of communication over quantum channels.2 A challenge and opportunity lies in the phenomenon of superadditivity, in which the capacity (maximum possible rate) to transmit information over a collection of quantum channels may exceed the sum of their individual capacities.3–6 For classical channels, the Shannon capacity is additive, allowing a single-letter entropy expression—a formula defined for one use of the channel. By contrast, so far there is no such single-letter expression for the classical capacity of a quantum channel, and an open question as to the general tractability of capacity calculations.7 By the HSW (Holevo, Schumacher, and Westmoreland) theorem,8,9 the classical capacity C(Φ) of a quantum channel Φ is given by regularization of Holevo information χ(Φ), which maximizes the entropy expression for m copies of the channel at once, taking m to infinity.

In many laboratory settings, we expect that not every run of a protocol will succeed. This is a common paradigm in linear optics: photon sources,10 protocols,11,12 and gates13 all have a probability or amplitude of failing, in such a way that we might only discover after a measurement, or at the end of the experiment. Many of these situations can be described by quantum erasure channels, which have some probability of outputting a flagged error state. Sometimes the probabilistic failure may be composed with successful but imperfect transmission. Keeping with the experimental and photonics literature, we call these heralded channels14–18 (Fig. 1). These are a special case of flagged quantum channels, as discussed in Refs. 5, 19, and 20, in which the classical output is alerted to the outcome of some non-deterministic channel selection process. A flagged channel randomly selects among a fixed ensemble of quantum channels to apply to its input states and reports which was applied along with the output state. Some cases of these channels are mentioned in Refs. 21, 22, and 4. Flagged channels differ from the compound channels discussed in Refs. 23–25, which consistently apply an unknown channel taken from a known set. Flagged channels draw randomly from a set of known channels at each use, revealing the result of this random drawing at the output. A heralded channel is a special case of flagged channel, which chooses between outcomes interpreted as success or failure.

FIG. 1.

Diagram of outcomes of a flagged channel that sometimes applies Φ and other times applies Ψ. We do not know for each turn which will be applied at preparation time, only at the output.

FIG. 1.

Diagram of outcomes of a flagged channel that sometimes applies Φ and other times applies Ψ. We do not know for each turn which will be applied at preparation time, only at the output.

Close modal

In these situations, the potential contribution of superadditivity is limited. Superadditivity relies on entangling input states to send over multiple uses of the channel. Entanglement as a quantum phenomenon is monogamous.26–28 Thus unpredictably assigning nearly additive channels to many of the inputs dilutes the available entanglement for superadditivity of the desired ones. For example, even if one could efficiently prepare a large number of entangled photons at the input to a set of optical quantum channels, losing many of these photons means the entanglement after transmission is small. In this situation, an entangled scheme has limited or no advantage over a probably simpler non-entangled input, because entanglement monogamy limits the average entanglement between potentially superadditive outputs.

We directly apply the monogamy of squashed entanglement and its faithfulness with the trace distance.29 Our results suggest one circumstance in which we can avoid the full difficulty of regularization. Surprisingly, post-selecting on success may not produce an exactly equivalent situation to having a channel without the failure probability, as it limits the phenomenon of superadditivity. We also show that with limited entangled blocksize, the number of inputs that are not separable in a quantum code, a wide variety of quantities become nearly additive for heralded channels.

In addition to the experimental motivations for understanding heralding, we derive a new technique of fundamental interest in estimating quantum channel capacity and other entropy expressions. Instead of relying on the details of the channel, we show that entanglement monogamy limits the contribution of superadditivity when considering a small subset of outputs randomly chosen from a larger set. Going beyond the case of capacity and based on the faithfulness of squashed entanglement,30 any entanglement-dependent quantity that is faithful in the trace distance should also be faithful in the squashed entanglement, admitting a bound from entanglement monogamy. We briefly consider the consequences in a general form of quantum game (Sec. VII) to illustrate the principle.

The faithfulness of squashed entanglement with trace distance implies a very general bound on a wide variety of quantities. For example, all entanglement measures that are convex, faithful, and continuous in trace distance are weakly monogamous. One may derive this by using the original entanglement measure’s implied lower bound on the trace distance to a separable state, the implied lower bound on squashed entanglement, and finally the monogamy of squashed entanglement. As claimed in Ref. 29, “squashed entanglement is about the distance from highly extendible states.” These extendible states are bipartite, for which it is possible to consider one of the parties to be replicated a large number of times in the environment. It is not possible to entangle a large network of low-dimensional, identical parties, in such a way that the majority of small, arbitrary subnetworks are individually entangled. We expect these ideas to go beyond the applications in this paper to many scenarios, including multipartite systems.

Let Φ be a quantum channel. That is, a completely positive trace preserving map which sends density operators (positive and trace one) of one Hilbert space HA to another HB. The HSW theorem8,9 proves that the classical capacity of a quantum channel C(Φ) is given by regularization of its Holevo information χ(Φ) as follows:

where S(σ) = −tr(σ log σ) is the von Neumann entropy and I(X; B) = S(X) + S(B) − S(XB) is the mutual information. The supremum runs over all classical-quantum inputs

and σ = Φ(ρ), σx = Φ(ρx) are the outputs.

We define for λ ∈ [0, 1] the generalized quantum erasure channel with 1 − λ erasure probability,

(1)

where Y is the heralding/flagging classical output and σ is some fixed state. Using monogamy and faithfulness of squashed entanglement, we have the following estimate of its classical capacity:

Theorem 1.1.
For any quantum channel Φ with output dimension at most d,
(2)

Here the asymptotic expression is taken for small λ, noting the significant dimension-dependence. Our theorem is most effective at describing small channels that erase often. We emphasize that as λ → 0, the Holevo capacity approaches an additive regime regardless of Φ. As discussed in Sec. VI, the existence of this asymptotic regime does not follow directly from convexity or continuity of the Holevo capacity. The technical derivation of this result appears in Sec. IV, in which we also consider the heralded channels with fixed success number, Zkn(Φ), which erase nk random locations out of the n inputs (k being the success number). The main consequence is that there exists an approximately additive regime in the limit of low success probability or heavy post-selection. In particular, we emphasize the following corollary.

Corollary 1.2.
For any channel Φ,

This notion is motivated by a common experimental scenario: we perform an experiment many times with some success rate λ, but we only wish to consider the instances in which the experiment was run successfully. For example, we may post-select on having the expected number of photons at the end of an optics experiment, discarding a much larger number of trials in which probabilistic photon sources did not produce exactly one photon each. This is equivalent to projecting the total outcome onto the heralding signal |0⟩ ⟨0|Y subspace. Even though we discard the failures, they still matter for superadditive quantities such as classical capacity. In the limit as λ → 0, χ(Φ) takes on the operational meaning of post-selected classical capacity. We suspect that this will be true for other superadditive quantities as well.

Going beyond capacity, we construct a form of quantum game in which Alice prepares a joint state via free communication with many “B” players simultaneously, though the dimension of her system is fixed and does not depend on the number of B players. After this initial state is prepared, the referee randomly selects one B player, and Alice plays a bipartite no-communication game with that B player. When there are many B players, monogamy of entanglement limits the extent to which the average value of these games with a quantum pre-shared state may exceed that with a classical pre-shared state. See Sec. VII B for the details of this setup.

We organize this work as follows: Sec. II introduces some notation and reviews channel capacity, squashed entanglement, and continuity of entropies. In Sec. III, we explain our main technique, which we call “heralded averaging.” Section IV applies this technique with monogamy of squashed entanglement to a variety of Holevo information quantities for heralded channels. Section V shows Theorem 1.1 and relates heralded switch channels to erasure channels. Section VI compares our result to additivity bounds derived by convexity and continuity of Holevo information. Section VII discusses other applications of our principles to other capacities and quantum games. We end with discussion and conclusions in Sec. VIII.

Let {Φ1, …, Φn} and {Ψ1, …, Ψn} be two classes of quantum channels. For k < n, we define the flagged switch channel as follows:

(3)

The flagged switch channel is a convex combination of tensor products of “Φ” and “Ψ” channels, randomly selecting k positions at which to apply the corresponding “Φ” channel, and applying “Ψ” channels at all others. Here R specifies a set of positions., and ΦR denotes the tensor product Φi1...Φik on subsytems Ai1...Aik, where {i1,...,ik}R. In other words, R specifies the set of outputs containing “Φ” channels. Rc is its complement. RRY is an extra classical output revealing R. We assume that the flagging process produces such a classical signal Y, such as by detection of a secondary photon at the output of a source10 or the outcome of an attempted quantum experiment. That is, by the time it reaches the output, the flagging signal has already been converted into a classical register Y of value R, the states of which we denote |RR|. Note that we do not switch the positions of input channels, only choosing between alternatives at each position. If all Ψj are erasure operations which have a fixed output Ψj(ρ) = Θ(ρ) = σ for some state σ, we call

the heralded channel with a fixed success number. We consider the fixed-output positions to represent failure. To shorten our notation, in the following, we will write

where the classes of channel Φ = {Φ1, …, Φn} and Ψ = {Ψ1, …, Ψn} are clear. Some cases of these channels are mentioned in Refs. 21, 22, and 4. We also consider the quantum erasure channel combined with another channel, described in Sec. V, to be a heralded channel. This is actually more common in physical scenarios: many processes randomly fail in a detectable way.

Our notion of the flagged channel differs from that of the compound channels discussed in Refs. 23–25, as they consider an unknown channel taken from a known set that is applied consistently, while we consider a new channel that draws randomly from a set of known channels at each use, but which reveals the result of this random drawing at the output.

In addition to the classical capacity, we will also discuss the potential Holevo capacity introduced in Ref. 31. The potential Holevo capacity is the maximum Holevo information gain of a channel when assisted with another channel,

It is clear that χ(pot)(Φ) ≥ C(Φ) ≥ χ(Φ), and we say that the channel’s Holevo information is additive if C(Φ) = χ(Φ) and strongly additive if χ(pot)(Φ) = χ(Φ). In both cases, its classical capacity C(Φ) is fully characterized by the single-letter Holevo information χ(Φ). We refer to the book by Wilde2 for more information about quantum channel capacity and other basics in quantum Shannon theory.

The squashed entanglement of a bipartite state ρAB, defined in Ref. 30, is

where the infimum runs over all extensions ρABC of ρAB. We will use the following properties of the squashed entanglement in this paper:

  • Convexity: Let ρAB=xpxρxAB be a convex combination of states {ρx}. Then

(4)
  • (ii)

    Monogamy: Let ρAB1Bk be a (k + 1)-partite state. Then

(5)
  • (iii)

    1-norm faithfulness:

(6)

where ∥·∥1 is the trace class norm.The convexity and monogamy of squashed entanglement, as well as Property (iv), are basic properties.30 The faithfulness of Schatten norms are proved by Brandão et al.32 and Li and Winter.29 The readers are also referred to the survey paper33 for more information about measures of quantum entanglement.

Given a bipartite state ρAB, its conditional entropy (conditional on B) is given by S(A|B)ρ = S(AB)ρS(B)ρ. The continuity of conditional entropy is characterized by the Alicki-Fannes inequality, as introduced in Ref. 34 and later refined by Winter in Ref. 35. For two bipartite states ρ and σ and δ ∈ [0, 1],

(7)

where h(p) = −p log p − (1 − p) log(1 − p) is the binary entropy function and |A| is the dimension of A. Note that for 0 ≤ ϵ ≤ 1,

We may also use the following (weaker but simpler) variant of (7):

(8)

For simplicity, we will rewrite this with δ = 2ϵ, obtaining

(9)

For a channel Φ: AB and a bipartite state ρB0A, we introduce the following notation of squashed entanglement:

and similarly for conditional entropy and mutual information

Lemma 3.1.
Let Zk(Φ) be a heralded channel with a fixed success number. For any state ρB0A1An,
where L = ⌊n/k⌋, the largest integer less than or equal to n/k.

Proof.
Let Sn be the symmetric group of the integer set [n] = {1, …, n}. The heralded channel Zk(Φ) can be rewritten as
where R can be any k-subset of [n] and the summation runs over all permutations σ. Then we find L disjoint subset R1, R2, …, RL with each |Rl| = k and have
Exchanging the summation, we have
Here the first inequality follows from convexity and the fact |σ(Rl)σ(Rl)|Y is a classical signal. The second equality is because Θ’s are trivial channels. The last inequality is the data processing inequality of squashed entanglement (squashed entanglement is not increasing under local operation). Note that for any permutation σ, σ(R1), …, σ(RL) are disjoint positions because R1, R2, …, RL are. By monogamy of squashed entanglement, we know, for any σ,
which completes the proof.
The above argument can be adapted to tensor products of heralded channels. Let Φi={Φi,1,Φi,2,,Φi,ni},1im be m classes of quantum channels such that each class consists of ni quantum channels. Let A=i=1m(j=1niAi,j) be the total input system and B=i=1m(j=1niBi,j) be the quantum part of the output system. Then the tensor product of heralded channels
is from A to BY, where the heralding signal R now is an ensemble of the heralding signals Ri ⊂ [ni] for each Zki(Φi).

Lemma 3.2.
LetZki(Φi),1imbe a family of heralded channels. For any state ρB0A, whereA=i=1m(A1,iAni,i), we have
whereL=minini/ki.

Proof.
Let Sn1,,nm=Sn1××Snm be the direct product of the permutation groups Sn1,,Snm. Denote σ=(σ1,,σm)Sn1,,nm where for each i such that 1 ≤ im, σiSni. The tensor product of heralded channels can be rewritten as
where σ(R)=σ1(R1)σm(Rm) is the ensemble of shifted positions and the summation runs over all permutations σSn1××Snm. Given L = minini/ki⌋, in each index set [ni], we can choose mutually disjoint subset R1,i, R2,i, …, RL,i with |Rl,i| = k for all l. Write R(l) = Rl,1,Rl,2, …, Rl,m. Following the same argument in Lemma 3.1, we have
Note that for any permutation σSn1,,nm, σ(R(1)), …, σ(R(L)) are disjoint index subsets because R(1), R(2), …, R(L) are. Thus by entanglement monogamy, for any σ,
which completes the proof.

Remark 3.3.

We may replace the above squashed entanglement by any information measure obeying convexity (4) and monogamy (5). In the future, it is possible that other faithful, monogamous entanglement measures may be discovered, yielding superior bounds via the same techniques as in this paper.

Theorem 3.4.
Leti=1mZki(Φi):ABbe the tensor of a family of heralded channels. Let B0be an extra system with dimension |B0|. Supposeλ̲=1/minini/kiis small enough such thatδ=3.1|B0|λ̲log|B0|42. Then for any state ρB0A, there exists a state ηB0B that is separable between B0and B such that
and
where h(δ) = −δ log δ − (1 − δ) log(1 − δ).

Proof.
By Lemma 3.2,
Then we apply the faithfulness of squashed entanglement (6) to show the existence of a separable η such that
The Alicki-Fannes inequality in Eq. (7) completes the theorem.

Remark 3.5.

Any function that depends on entanglement and is faithful in the 1-norm can be bounded by the method of Theorem 3.4. This generality comes at the cost of strong dimension-dependence. While in Sec. IV we use the Holevo superadditivity to illustrate applications of this result, it is really a demonstration that the monogamy of a faithful entanglement measure carries through to all entanglement-dependent quantities. The ultimate implication is that in the asymptotic limit of rare success and fixed dimensions, all quantities that depend on entanglement between outputs should approach their separable values.

We first state the technical version of our theorems regarding Holevo information in Sec. IV A and leave the proofs of the main Theorem 4.1 and its corollaries to Sec. IV B.

Our main technical theorem of this section considers tensor products of heralded channels or flagged switch channels, showing that heralding removes the superadditivity from a channel by diluting the available entanglement. It is the building block we use to prove the simpler corollaries and Theorem 1.1. In the following, we always assume that Φi={Φi,1,,Φi,ni},Ψi={Ψi,1,,Ψi,ni} are pairs of families of channels such that for each (i, j), Φi,j, Ψi,j: Ai,jBi,j share the same input and output system.

Theorem 4.1.
LetZki(Φi;Ψi),1imbe a family of flagged switch channels and Φ0: A0B0be an arbitrary channel. Suppose thatλ̲=1/minini/kiis small enough such thatδ=3.10|B0|λ̲log|B0|42. Then
where h(ϵ) = −ϵ log ϵ − (1 − ϵ) log(1 − ϵ), and

The above theorem bounds the additivity violation between heralded channels and an arbitrary extra channel Φ0. The form is analogous to an approximate strong additivity for heralded channels with small λ̲, but it depends on the dimension of output system B0. So it is not truly a strong additivity, as the bound becomes trivial when Φ0 is a channel with a sufficiently large output space. Based off of Theorem 4.1, we prove the following approximate additivity in the regime of small λ̲.

Corollary 4.2.
Let the output system Bi,jto each Φi,jbe of dimension at most d, and letλ̲=1/minini/kibe small enough such thatδ=3.1dλ̲logd42. Then,
with af(d, δ) being defined as in Theorem 4.1. As a consequence, for a single flagged switch channel Zk(Φ; Ψ),

Unlike Theorem 4.1, this corollary depends only on the dimension of the constituent channels Φi,j. Therefore, it is an approximate additivity when λ̲ is small enough.

Finally, we derive a bound analogous to strong additivity entirely in terms of single-letter expressions.

Corollary 4.3.
Let the output system to each Φi,jbe of dimension at most d, Φ0have the output dimension d0, andλ̲=1/minini/kibe small enough such thatδ=3.1dλ̲logd42andδ0=3.1d0λ̲logd042. Then,
where af(d, δ) and af(d0, δ0) are defined as in Theorem 4.1.

This corollary combines Theorem 4.1 and Corollary 4.2 to bound all superadditivity present, allowing us to write the Holevo information of a product of heralded channels enhanced by an arbitrary Φ0 as a sum of single-letter expressions and correction terms. We say that this bound is merely analogous to strong additivity (rather than an actual strong additivity) due to the dimension-dependence on the channel Φ0, which prevents us from bounding the full potential capacity of a heralded channel.

We start with a lemma showing that Holevo information is not increasing when conditioning on a separable extra system.

Lemma 4.4.
Let Φ0: A0B0be a quantum channel. Let B be any quantum system andρxA0Bbe a family of separable bipartite states. Then for any η = xpxηx, where ηx = id ⊗ Φ0(ρx) and {px} probability distribution, we have

Proof.
By separability, for each x, we may write
as a convex combination of product states. Define a classical to quantum channel Φcq: XB, where X is an extra classical system, by
From this, we define the classical-quantum states
and observe that ηx=Φcq(ηx) for each x. Applying the data processing inequality for conditional entropy,
This has the form of the Holevo information for the state x,jpxpx,j|x,jx,j|ηx,j. Therefore, it is less than the Holevo information of Φ0.

The next lemma replaces each Ψ channel by its potential capacity in an upper bound, which reduces the discussion of Theorem 4.1 to heralded channels.

Lemma 4.5.
LetZki(Φi;Ψi),1imbe a family of flagged switch channels and let Φ0be an arbitrary channel Φ0: A0B0. Then

Proof.
Let B=B0(i=1mj=1niBj,i) be the full quantum output system. For a classical quantum input state ρXA=xpx|xx|ρxA, the output state is
Note that the marginal distributions (reduced density) on the two classical system XY are independent. Thus we have
where
is the outcome of the heralding signal R. In each I(X: B)ω(R), we could then replace Rc systems by the potential capacities of Ψ channels without decreasing the expression. That is,
where
is the corresponding output of heralded channels. The result follows from summing over all R with uniform probabilities.

Lemma 4.6.
Let Φ0 ⊗ Φ1: A0AB0B be a tensor product of channels. Let X be a classical system, with states xX. Let ρxbe the input state to the channel for each x and ρ = xpxρx. Let ωx = Φ0 ⊗ Φ1(ρx) and ω = xpxωx. Assumeas a state η = xpxηxthat is separable between B0and B such that for each x,
Then

Proof.
For a classical-quantum state ωXB0B=xpx|xx|ωxB0B,
(10)
The second half of this expression is upper bounded by the Holevo information of Φ1(ρXA) if
For the first half, we use the factorization property,
where “id” represents the identity channel. Hence by assumption, ηxXA0A such that ∀ xX,
and by the convexity of the 1-norm,
The main subtlety of this part of the proof is that we need ηxB0 to be an output state of Φ0, but the B part can be arbitrary. This is because we use this state to estimate only the S(B0|B) terms and not the S(B) terms. What we have just shown is that we can prepare a separable input state to the channel Φ0idA that is close in the 1-norm sense to the output of idA0Φ1. By contractivity of the trace distance under quantum channels,
By the Alicki-Fannes inequality (7), for all x,
and
We now use the triangle inequality and Lemma 4.4,
Returning to Eq. (10), we obtain
The final inequality in the lemma follows from Eq. (8).

Proof of Theorem 4.1.
First, by Lemma 4.5, it is sufficient to estimate χ(Φ0i=1mZki(Φi)). By Theorem 3.4, for each ρx, there exists state ηxA0AY separable between A0 and AY such that
Let δ=3.1|B0|λ̲log|B0|4. Apply Lemma 4.6.

Proof of Corollary 4.2.
As in the proof of theorem 4.1, Lemma 4.5 implies we need only estimate the Holevo information of i=1mZki(Φi). We start by the averaging on the jth position being a position in the set R (of non-erased outputs) for a heralded channel Zk1, …, Φn). Since each R contains k positions, up to a re-ordering based on the classical signal,
(11)
Here in Eq. (11), we rearrange the heralding sum by separating the jth position in the index set R. Knowing this position in advance at the input does not lower the achievable rate, so we may estimate the channel capacity from above by moving this position to the front. Note that for the coefficients, knk=nn1k1. Denote δ=3.1|B0|λ̲log|B0|4. We use the convexity of Holevo information (Ref. 36, Proposition 4.3) and Theorem 4.1,
We may repeat this procedure to separate out all Φ positions in each Zki(Φi). As we replace each Φi,j by its Holevo information plus the correction term, we are reducing n and k by the same amount, so λ̲=1/minki/ni does not increase. Thus 2δ log d + (2 + δ)h(δ/(2 + δ)) is a uniform bound for the correction term at all steps. Therefore,
Regularizing the expression for Zk(Φ)m yields the bound for classical capacity.

We define a generalized erasure channel as an erasure channel composed with some channel Φ in its successful trial,

where λ ∈ [0, 1] and σ is some fixed state. We call the successes those cases in which Φ was applied with a classical signal of |0⟩ ⟨0|. Zλ(Φ) can be viewed as a heralded channel in terms of success probability, rather than imposing a definite number of successes. Indeed, tensor products of the erasure channel can be expressed as a probabilistic sum over heralded channels,

This is easy to see by noting that each erasure channel is a binary heralding with probability λ, so the overall distribution of successes is binomial. In the following, we denote

as the heralded failure channel with n identical Φ’s and fixed success number k.

Theorem 5.1.
For any n and λ,
In particular, the classical capacity of the erasure channel Zλ(Φ) can be rewritten by

We prepare our proof with a lemma.

Lemma 5.2.
Let k1k2. For any classical-quantum input ρ,

Proof.
By the zero contribution from erased outputs and classical conditionality of the mutual information,
Therefore, we have
The inequality follows from Lemma 4.4. The last step is because each k1-subset P has been counted nk1k2k1 times as a subset of some k2-set R. The inequality follows similarly.

Proof of Theorem 5.1.
Let A be the input system of Φ. Then the two channels Zλ(Φ)n and Zλnn(Φ) have the same input An. Denote ρXAn by a classical-quantum state. By the triangle inequality and Lemma 5.2,
Recall that the variance of binomial distribution is (1 − λ). We obtain by the Hölder inequality that

Corollary 5.3.
Let d be the dimension of the output system for Φ andδ=3.1dλlogd42. Then

Corollary 5.3 follows from Theorem 5.1 and Corollary 4.2. It is the technical version of Theorem 1.1.

It is known in Ref. 36, Proposition 4.3, that the Holevo information and its potential analog are convex functions of quantum channels. For a d-dimensional generalized erasure, we know the convexity bound as follows:

where the last inequality is the trivial bound χpot(Φ) ≤ log d. Suppose Φ has output dimension d. Our monogamy bounds in Corollary 5.3 are nontrivial when

(12)

If χ(Φ) < log d (the channel is not perfect), there is always a region of small λ such that our monogamy bound is better.

It is possible to obtain stronger bounds than this convexity by using knowledge of the channel. Leditzky et al. in Ref. 37 estimated the classical capacity of a channel Φ by the smallest diamond norm distance to a weakly additive channel.

Theorem 6.1
(Corollary 11.7 of Ref. 37). Let Ψ: AB be a quantum channel with weakly additivity Holevo capacity, i.e., χn) = (Ψ) for all n. Assume that Φ: AB is a quantum channel such that
(13)
where ϵ ∈ [0, 1] and ∥·∥is the diamond norm. Then
(14)
where g(ϵ) = (1 + ϵ) log(1 + ϵ) − ϵ log ϵ.
The above theorem requires an estimate on the diamond norm distance between a channel Φ and the class of weakly additive channels. In Ref. 37, this is estimated by considering the distance to the weakly additive classes of more specific channels such as Hadamard channels and entanglement breaking channels. Denote
as the infimum of the diamond norm distance of Φ to an entanglement breaking channel Φ′. Assume that Φ0 is an entanglement breaking channel attaining ϵEB(Φ). For entanglement breaking Ψ, we know by the heralded (direct sum) structure that
and Zλ0, Ψ) is also entanglement-breaking. Then Theorem 6.1 tells us
(15)
For the erasure channel in which Ψ = Θ is the complete erasure channel, these bounds reduce to
(16)
For asymptotically small λ, the term g(λ2ϵEB(Φ))O(ϵEB(Φ)λlogλ) ultimately dominates and causes the bound to become worse than linear in λ, whenever Φ is not entanglement-breaking and ϵEB(Φ) > 0. However, for λ not too small, this bound may often be a substantial refinement over the simpler convexity bound λ log d. It is likely that the super-additivity is no greater than λ(C(Φ) − χ(Φ)) since the enhancement to additivity from the mixture of trivial channels and Φ instances is smaller than that available by having Φ channels at predefined locations. The heralded channel Zkn(Φ) is not entanglement breaking unless k = 0 or Φ is entanglement-breaking.

Our correction term in Corollary 5.3 decays asymptotically as O(λ54logλ), going beyond (16) and convexity bound when λ ≪ 1/|B|. This implies that the super-additivity decays faster than the linear term in the asymptotic limit of low success probability. As a consequence, we obtain Corollary 1.2 that when the success rate λ → 0, the classical capacity of the heralded channel Zλ(Φ) averaged by λ converges to the Holevo information of Φ. This cannot be obtained by the bound (16) from the diamond norm.

While we would like to immediately extend our results to quantum and private capacities, these quantities have a more complicated interaction with heralding and erasure. The quantum erasure channel, for instance, has zero quantum capacity if the success probability is less than 1/2,38 subsuming the values for which our main near-additivity results would apply. It is possible that the quantum capacity has a different monogamy phenomenon from Holevo information, but that is beyond the scope of this paper. We might instead consider the quantum and private capacities with feedback, but these are much more complicated to estimate with entropy techniques.39 See Refs. 2 and 31 for more information on different capacities.

In practice, the realities of quantum hardware often limit the size of entangled blocks. When the success probability of a heralding process is small compared with the entangled blocksize of inputs, most successes will be the only success in their respective block. In this case, we show that superadditivity is limited by a simpler argument. For example, recent proposals for trapped ion computing schemes mention heralded photonic Bell state analysis as a method for connecting trapped ion arrays, a necessity due to the number-limiting interactions of trapped ions in the same array.40 The array size restricts the entangled input blocksize for inter-array communication, so lossy optical links may exhibit superadditivity-limiting effects even with 2-way classical assistance. The generation of large-scale entangled states between many individually controllable qubits is at the time of this writing an ongoing challenge in experimental quantum information. Realistic calculations of attainable quantum and private rates with today’s hardware probably should assume small entangled blocksizes.

Proposition 7.1.
Let Φ1, …, Φnbe a family of quantum channels, and let λ ≪ 1/n. Let F(1)be a function mapping densities to positive real numbers and define its output on a quantum channel by F(1)(Φ) = maxρ{F(1)(Φ(ρ))}. Assume that F(1)is superadditive on quantum channels, additive for separable input states, convex in the input state and channel, and admit a well-defined expression of the form F(pot)(Φ) = maxΨ{F(1)(Φ ⊗ Ψ) − F(1)(Ψ)} such that F(pot)(Θ) = 0 if Θ is a trivial channel for which the output provides no information about the input. Then

Proof.
If only one success has occurred at position i, then
(17)
This follows from the fact that the trivial channels provide no information about the input and, being uncorrelated with it, are therefore not entangled either. Since F is additive on separable states and 0 on these trivial channels, they contribute nothing. For some tensor product ΦRΘRc, define Ψ1⋯Ψn such that ΦRΘRc=Ψ1Ψn,
(18)
where the last inequality is obtained by iterating and discarding the F(pot)i) terms that are zero anyway. Using Eqs. (18) and (17), and the convexity of F(1) in the channel,
In the last inequality, we used, for small λ,

Remark 7.2.

The intuition behind Proposition 7.1 is that when the erasure probability of an erasure channel is sufficiently high compared to the maximum entangled blocksize of the encoder, it is unlikely that more than one non-trivial channel will appear in any block. This result is not based on entanglement monogamy.

Remark 7.3.

Proposition 7.1 applies to the block-limited coherent information Q(m), but when λ ≪ 1, the unassisted quantum capacity vanishes anyway due to antidegradability of the erasure channel.38 Proposition 7.1 applies to the Holevo information χ as well as the classical capacity with limited entanglement assistance, as discussed in Ref. 41. We conjecture that it will apply to the quantum capacity with classical feedback and 2-way classical communication, as discussed in Ref. 39, but we cannot yet confirm the necessary properties to apply our results, as we do not have a one-shot entropy expression for these quantities.

Going beyond the case of communication, the idea that entanglement monogamy implies bounds on entanglement-based super-additivity also applies to non-local games. Vidick et al. used monogamy in tripartite systems to show that entangled multi-prover games have values close to their classical counterparts, making their values similarly difficult to approximate.42 Several previous authors have noted the monogamy of quantum correlations43,44 and the connection between symmetric extendibility and hidden variable explanations.45 Here we show how the monogamy of squashed entanglement and k-extendibility yield a very general bound on multiplayer quantum games.

A two-player game G = (A, B, X, Y, π, v) is played between a referee and two isolated players, Alice and Bob, who communicate only with the referee and not between themselves. The referee chooses a question pair (x, y) according to some probability distribution π on the question alphabets X × Y, sending x to Alice and y to Bob. The two players respond with answers a and b, respectively, from answer sets A and B. They win the game if v(x, y, a, b) = 1 for the verification function v: X × Y × A × B → {0, 1} and lose otherwise. The classical value of the game

is the maximum winning probability when Alice and Bob are allowed to use optimal deterministic strategies aax(ω) = bby(ω) = 1 based on some classical correlation (common randomness) P(ω). The entangled or quantum value of the game

allows Alice and Bob to answer the question by performing positive-operator valued measures (POVMs) aExa=1,bFyb=1 with some auxiliary bipartite entangled state ρ. It is clear that for all games, val(G) ≤ val*(G). A bound on the value with classical states, such as Bell’s inequality, can be evidence of quantum non-locality (see Ref. 46 for more information on non-local games).

We consider the application of entanglement monogamy in the following scenario. Suppose the player Alice has a single system A, which can share classical correlation or quantum entanglement with a large number of “Bob” players B1, …, Bn simultaneously. The referee randomly (with equal probability) selects a “Bob” player Bi and plays the corresponding bipartite game Gi = (A, Bi, Xi, Yi, πi, vi) with Alice and Bi. This is analogous to the heralding process because although the two players A and Bi know the game Gi after Bi is selected by the referee, the strategy and auxiliary resource, classical or quantum, has to be prepared between Alice and B1, …, Bn before the game is played. In this situation, the non-classicality in entangled games will be bounded by entanglement monogamy if the size of Alice’s system is fixed because the average amount of entanglement Alice’s system can have with each Bi is limited by the number of “Bob” players.

For {Gi}1in, we define the average entangled value when Alice having an at most d-dimensional quantum system as follows:

where for each x and y, Exa,F1,yb,,Fn,yb are POVMs on A, B1, …, Bn, respectively, and ρAB1Bn is a multipartite state with |A| being at most d. Also, the average classical value is given by

because the classical correlation used for different Gi can be combined.

Theorem 7.4.
Let G1, …, Gnbe a family of bipartite games that Alice plays respectively with the players B1, …, Bn. Then

Proof.
Let G = (A, B, X, Y, π, v) be a bipartite game. For fixed axillary systems A,B and POVMs Exa,Fyb, the value function can be viewed as a positive linear functional lG on the trace-class S1(AB),
Note that lG is of norm at most 1. Then for a separable σ and an arbitrary ρ,
Now suppose that the axillary quantum system A of Alice is of dimension at most d. We know by the monogamy of entanglement (5) that for any ρAB1Bn,
It follows from the faithfulness of squashed entanglement that there exists a state σABi separable on A and Bi such that
Thus,
and then the average entangled value obeys
By Hölder’s inequality, we have
Therefore,

We can see that with large n and fixed d, the classicality violation decays to 0. In keeping with the results of Ref. 47, however, increasing the dimension of Alice and Bobs’ systems simultaneously with the number of measurements can compensate for a decline in entanglement. For this reason, we do not expect a bound that is independent of the dimension of Alice’s system.

This simple application in quantum games may give some intuition behind the phenomenon in Theorem 7.1, and possibly those in Sec. IV A. The existence of a monogamous, faithful entanglement measure (like squashed entanglement) necessarily constrains entanglement-dependent quantities to have a sort of monogamy of their own. This technique can be applied to any situation in which one seeks entanglement-dependent effects in the presence of a heralding-like process.

It follows intuitively that the monogamy of a faithful entanglement measure would also imply monogamy for entanglement-dependent quantities. Recent advances in the theory of squashed entanglement29 have combined these properties in a sufficiently quantitative way as to directly derive bounds on entropy expressions. As a potential consequence of entangled coding, we expect superadditivity of channel capacity to be monogamous. If many entangled systems enhance the capacity of a single channel, monogamy would intuitively imply that the enhancement due to each is correspondingly limited. While this alone might be an interesting anecdote and of potential interest to routines designed to optimize quantum input states, the heralded channel provides a direct operational application of the monogamy of superadditivity. When an arbitrary channel is randomly distributed amongst a larger number of copies of a strongly additive channel, entanglement in the input state goes to waste, and the combined channel loses superadditivity.

One application is to situations encountered in experimental physics, such as photonics, where the non-trivial channel created by decoherence combines with the erasure channel arising from photon loss. Due to the difficulty of creating and maintaining large-scale entanglement in transmitted photons, and the likelihood that entangling inputs with eventually destroyed copies sacrifice some capacity, many real-world quantum information systems will probably transmit in the nearly additive realm of Theorem 7.1 and not gain much by attempting to exploit superadditivity.

Our bound on the superadditivity of Holevo information is probably not tight, mostly due to the dimension constant and fourth-root power. The weakness of this bound in high dimension may stem from the generality of the continuity estimates, which do not exploit any detailed model of the Holevo capacity or the specific channel under study. The monogamy of squashed entanglement is itself fairly strong. The proof of faithfulness discussed in Ref. 29 first converts entanglement into symmetric extendibility and then to 1-norm distance. Finally, we convert this back to an entropic quantity. Each of these conversions comes with a penalty in constants or polynomial order. Future work may consider a path that either directly uses extendibility of the heralded output or examines whether extendibility directly limits superadditivity. It should be possible to improve these bounds for some cases if one is more specific about the channels composed with erasure.

Finally, we note that this technique is not limited to capacity or entropy expressions. Any quantity that depends on entanglement and is faithful with the trace norm may show a monogamy-like effect due to the comparison with squashed entanglement. For example, all entanglement measures that are faithful and comparable to the trace distance are at least monogamous up to a 4th-root dependence. This broader implication of entanglement monogamy bounding entanglement-dependent quantities may also be significant.

We thank Mark Wilde and Debbie Leung for helpful conversations. We thank Paul Kwiat and the Kwiat lab at UIUC for conversations and experiences that gave us a sense of the importance of heralded channels in practical applications. We thank the anonymous referees for the careful reading and constructive suggestions. This material is based upon work supported by NSF Graduate Research Fellowship Program DGE-1144245, and partially supported by NSF-DMS 1501103.

1.
F. G. S. L.
Brandão
and
G.
Gour
, “
Reversible framework for quantum resource theories
,”
Phys. Rev. Lett.
115
(
7
),
070503
(
2015
).
2.
M. M.
Wilde
,
Quantum Information Theory
(
Cambridge University Press
,
2013
).
3.
M. B.
Hastings
, “
Superadditivity of communication capacity using entangled inputs
,”
Nat. Phys.
5
(
4
),
255
257
(
2009
).
4.
G.
Smith
and
J.
Yard
, “
Quantum communication with zero-capacity channels
,”
Science
321
(
5897
),
1812
1815
(
2008
).
5.
T.
Cubitt
,
D.
Elkouss
,
W.
Matthews
,
M.
Ozols
,
D.
Pèrez-Garcìa
, and
S.
Strelchuk
, “
Unbounded number of channel uses may be required to detect quantum capacity
,”
Nat. Commun.
6
,
7793
(
2015
).
6.
K.
Li
,
A.
Winter
,
Z.
XuBo
, and
G.
GuangCan
, “
Private capacity of quantum channels is not additive
,”
Phys. Rev. Lett.
103
(
12
),
120501
(
2009
).
7.
P. W.
Shor
, “
Quantum information theory: The bits don’t add up
,”
Nat. Phys.
5
(
4
),
247
248
(
2009
).
8.
A. S.
Holevo
, “
The capacity of the quantum channel with general signal states
,”
IEEE Trans. Inf. Theory
44
(
1
),
269
273
(
1998
).
9.
B.
Schumacher
and
M. D.
Westmoreland
, “
Sending classical information via noisy quantum channels
,”
Phys. Rev. A
56
(
1
),
131
138
(
1997
).
10.
M.
Tomamichel
,
C. C. W.
Lim
,
N.
Gisin
, and
R.
Renner
, “
Tight finite-key analysis for quantum cryptography
,”
Nat. Commun.
3
,
634
(
2012
).
11.
N.
Pisenti
,
C. P. E.
Gaebler
, and
T. W.
Lynn
, “
Distinguishability of hyperentangled Bell states by linear evolution and local projective measurement
,”
Phys. Rev. A
84
(
2
),
022340
(
2011
).
12.
J.
Calsamiglia
and
N.
Lütkenhaus
, “
Maximum efficiency of a linear-optical Bell-state analyzer
,”
Appl. Phys. B
72
(
1
),
67
71
(
2014
).
13.
E.
Knill
,
R.
Laflamme
, and
G. J.
Milburn
, “
A scheme for efficient quantum computation with linear optics
,”
Nature
409
(
6816
),
46
52
(
2001
).
14.
T. E.
Northup
and
R.
Blatt
, “
Quantum information transfer using photons
,”
Nat. Photonics
8
(
5
),
356
363
(
2014
).
15.
J.
Hofmann
,
M.
Krug
,
N.
Ortegel
,
L.
Gérard
,
M.
Weber
,
W.
Rosenfeld
, and
H.
Weinfurter
, “
Heralded entanglement between widely separated atoms
,”
Science
337
(
6090
),
72
75
(
2012
).
16.
H.
Bernien
,
B.
Hensen
,
W.
Pfaff
,
G.
Koolstra
,
M. S.
Blok
,
L.
Robledo
,
T. H.
Taminiau
,
M.
Markham
,
D. J.
Twitchen
,
L.
Childress
, and
R.
Hanson
, “
Heralded entanglement between solid-state qubits separated by three metres
,”
Nature
497
(
7447
),
86
90
(
2013
).
17.
I.
Usmani
,
C.
Clausen
,
F.
Bussières
,
N.
Sangouard
,
M.
Afzelius
, and
N.
Gisin
, “
Heralded quantum entanglement between two crystals
,”
Nat. Photonics
6
(
4
),
234
237
(
2012
).
18.
P.
Maunz
, “
Heralded quantum gate between remote quantum memories
,”
Phys. Rev. Lett.
102
(
25
),
250502
(
2009
).
19.
D.
Elkouss
, “
Nonconvexity of private capacity and classical environment-assisted capacity of a quantum channel
,”
Phys. Rev. A
94
(
4
),
040301
(
2016
).
20.
C. A.
Fuchs
, “
Nonorthogonal quantum states maximize classical information capacity
,”
Phys. Rev. Lett.
79
(
6
),
1162
1165
(
1997
).
21.
I.
Mashhad
, “
Symmetry, asymmetry and quantum information
,” Ph.D. thesis,
University of Waterloo
,
2012
.
22.
F. G. S. L.
Brandão
,
J.
Oppenheim
, and
S.
Strelchuk
, “
When does noise increase the quantum capacity?
,”
Phys. Rev. Lett.
108
(
4
),
040501
(
2012
).
23.
I.
Bjelakovic
,
H.
Boche
, and
J.
Noetzel
, “
On quantum capacity of compound channels
,”
Phys. Rev. A
78
(
4
),
042331
(
2008
); e-print arXiv:0808.1007.
24.
I.
Bjelakovic
and
H.
Boche
, “
Classical capacities of compound and averaged quantum channels
,”
IEEE Trans Inf. Theory.
55
(
7
),
3360
3374
(
2009
).
25.
I.
Bjelakovic
,
H.
Boche
,
G.
Janßen
, and
J.
Nötzel
, “
Arbitrarily varying and compound classical-quantum channels and a note on quantum zero-error capacities
,” in
Information Theory, Combinatorics, and Search Theory
(Springer,
2013
), pp.
247
283
.
26.
V.
Coffman
,
J.
Kundu
, and
W.
K Wootters
, “
Distributed entanglement
,”
Phys. Rev. A
61
(
5
),
052306
(
2000
).
27.
M.
Koashi
and
A.
Winter
, “
Monogamy of quantum entanglement and other correlations
,”
Phys. Rev. A
69
(
2
),
022309
(
2004
).
28.
T. J.
Osborne
and
F.
Verstraete
, “
General monogamy inequality for bipartite qubit entanglement
,”
Phys. Rev. Lett.
96
(
22
),
220503
(
2006
).
29.
K.
Li
and
A.
Winter
, “
Squashed entanglement, k-extendibility, quantum Markov chains, and recovery maps
,”
Found. Phys.
1
15
(
2018
).
30.
M.
Christandl
and
A.
Winter
, “
Squashed entanglement: An additive entanglement measure
,”
J. Math. Phys.
45
(
3
),
829
840
(
2004
).
31.
A.
Winter
and
D.
Yang
, “
Potential capacities of quantum channels
,”
IEEE Trans. Inf. Theory
62
(
3
),
1415
1424
(
2016
).
32.
F. G. S. L.
Brandão
,
M.
Christandl
, and
J.
Yard
, “
Faithful squashed entanglement
,”
Commun. Math. Phys.
306
(
3
),
805
830
(
2011
).
33.
R.
Horodecki
,
P.
Horodecki
,
M.
Horodecki
, and
K.
Horodecki
, “
Quantum entanglement
,”
Rev. Mod. Phys.
81
(
2
),
865
942
(
2009
).
34.
R.
Alicki
and
M.
Fannes
, “
Continuity of quantum conditional information
,”
J. Phys. A: Math. Gen.
37
(
5
),
L55
(
2004
).
35.
A.
Winter
, “
Tight uniform continuity bounds for quantum entropies: Conditional entropy, relative entropy distance and energy constraints
,”
Commun. Math. Phys.
347
(
1
),
291
313
(
2016
).
36.
L.
Gao
,
M.
Junge
, and
N.
LaRacuente
, “
Capacity estimates for TRO channels
,” e-print arXiv:1609.08594 [quant-ph] (
2016
).
37.
F.
Leditzky
,
E.
Kaur
,
N.
Datta
, and
M. M.
Wilde
, “
Approaches for approximate additivity of the Holevo information of quantum channels
,” e-print arXiv:1709.01111 [quant-ph] (
2017
).
38.
C. H.
Bennett
, “
Capacities of quantum erasure channels
,”
Phys. Rev. Lett.
78
(
16
),
3217
3220
(
1997
).
39.
D.
Leung
,
J.
Lim
, and
P.
Shor
, “
Capacity of quantum erasure channel assisted by backwards classical communication
,”
Phys. Rev. Lett.
103
(
24
),
240505
(
2009
).
40.
K. R.
Brown
,
J.
Kim
, and
C.
Monroe
, “
Co-designing a scalable quantum computer with trapped atomic ions
,”
npj Quantum Information.
16034
(
2016
).
41.
M.
Junge
and
C.
Palazuelos
, “
Channel capacities via p-summing norms
,”
Adv. Math.
272
,
350
398
(
2015
).
42.
J.
Kempe
,
H.
Kobayashi
,
K.
Matsumoto
,
B.
Toner
, and
T.
Vidick
, “
Entangled games are hard to approximate
,”
SIAM J. Comput.
40
(
3
),
848
877
(
2011
).
43.
B.
Toner
, “
Monogamy of non-local quantum correlations
,”
Proc. R. Soc. London, Ser. A
465
(
2101
),
59
69
(
2009
).
44.
M. P.
Seevinck
, “
Monogamy of correlations versus monogamy of entanglement
,”
Quantum Inf. Process.
9
(
2
),
273
294
(
2010
).
45.
M.
Barbara Terhal
,
C.
Andrew Doherty
, and
D.
Schwab
, “
Symmetric extensions of quantum states and local hidden variable theories
,”
Phys. Rev. Lett.
90
(
15
),
157903
(
2003
).
46.
C.
Palazuelos
and
T.
Vidick
, “
Survey on nonlocal games and operator space theory
,”
J. Math. Phys.
57
(
1
),
015220
(
2016
).
47.
M.
Junge
and
C.
Palazuelos
, “
Large violation of Bell inequalities with low entanglement
,”
Commun. Math. Phys.
306
(
3
),
695
(
2011
).