Phase estimation is used in many quantum algorithms, particularly in order to estimate energy eigenvalues for quantum systems. When using a single qubit as the probe (used to control the unitary we wish to estimate the eigenvalue of), it is not possible to measure the phase with a minimum mean-square error. In standard methods, there would be a logarithmic (in error) number of control qubits needed in order to achieve this minimum error. Here, we show how to perform this measurement using only two control qubits, thereby reducing the qubit requirements of the quantum algorithm. To achieve this task, we prepare the optimal control state one qubit at a time, at the same time as applying the controlled unitaries and inverse quantum Fourier transform. As each control qubit is measured, it is reset to | 0 then entangled with the other control qubit, so only two control qubits are needed.

Quantum phase estimation was originally applied in quantum algorithms for the task of period finding, as in Shor's algorithm.1 Later, quantum phase estimation was applied to the task of estimating eigenvalues for Hamiltonians in quantum chemistry.2 The appropriate way to perform quantum phase estimation is different between these two applications, due to the different costs of the operations that phase estimation is being performed on. In particular, for Shor's algorithm, one can perform arbitrary powers at low cost (logarithmic in the power). In contrast, for estimating eigenvalues of Hamiltonians, the cost of simulating the time evolution is (at least) proportional to the time, so the phase estimation procedure should attempt to minimize the total evolution time. At the same time, the mean-square error (MSE) in the estimate should be minimized.

A key part of quantum algorithms for phase estimation is the inverse quantum Fourier transform. This operation can be decomposed into a “semiclassical” form,3 where one performs measurements on the control qubits in sequence, with rotations controlled according to the results of previous measurements. In the form of phase estimation as in Shor's algorithm, the control qubits would be in an equal superposition state, which is just a tensor product of | + states on the individual qubits. In that scenario, only one control qubit need be used at a time, because it can be prepared in the | + state, used as a control, then rotated and measured before the next qubit is used.

This procedure with the control qubits in | + states gives a probability distribution for the error approximately a sinc squared, which has a significant probability for large errors. That is still suitable for Shor's algorithm, because it is possible to take large powers of the operators with relatively small cost, which suppresses the phase measurement error. On the other hand, for quantum chemistry where there is a cost of Hamiltonian simulation proportional to time, the large error of the sinc-squared is a problem. Then, it is more appropriate to use qubits in an entangled state,4 which was originally derived in an optical context in 1996.5 

In 2000, we6 analyzed the problem of how to perform measurements on these states in a Mach–Zehnder interferometer.7 The same year, Jon Dowling introduced NOON states in the context of lithography8 and then, in 2002, showed how NOON states may be used in interferometry for phase measurement.9,10 A drawback to using NOON states is that they are highly sensitive to loss. In 2010, one of us (D.W.B.) visited Jon Dowling's group to work on the problem of how to generate states that are more resistant to loss and effectively perform measurements with them. This resulted in the publication (separately from Jon of Ref. 11), followed by our first joint publication.12 We continued collaborating with Jon for many years on phase measurement,13 as well as state preparation,14 and Boson-sampling inspired cryptography.15 

In separate work, we showed how to combine results from multiple NOON states in order to provide highly accurate phase measurements suitable for quantum algorithms.16 Phase measurement via NOON states is analogous to taking a | + state and performing a controlled UN on a target system in quantum computing. The photons in the arms of the interferometer are analogous to the control qubit in quantum computing, with the phase shift from UN instead arising from an optical phase shift between the arms of the interferometer. The NOON state gives very high frequency variation of the probability distribution for the phase, rather than a probability distribution with a single peak. In 2007, we showed how to combine the results from NOON states with different values of N in order to provide a phase measurement analogous to the procedure giving a sinc-squared distribution in quantum algorithms.16 (It was experimentally demonstrated with multiple passes through the phase shift rather than NOON states.)

A further advance in Ref. 16 was to show how to use an adaptive procedure, still with individual | + states, in order to give the “Heisenberg limited” phase estimate. That is, rather than the mean-square error scaling as it does for the sinc-squared, it scales as it does for the optimal (entangled) control state. This procedure still only uses a single control qubit at a time, so is suitable for using in quantum algorithms where the number of qubits available is strongly limited; this is why it was used, for example, in Ref. 17. On the other hand, although it gives the optimal scaling, the constant factor is not optimal, and improved performance is provided by using the optimal entangled state.

In this paper, we show how to achieve the best of both worlds. That is, we show how to provide the optimal phase estimate (with the correct constant factor), while only increasing the number of control qubits by one. It is, therefore, suitable for quantum algorithms with a small number of qubits, while enabling the minimum complexity for a given required accuracy.

In Sec. II, we discuss the optimal state for phase estimation and how its usage can be combined with the semiclassical quantum Fourier transform. Then, in Sec. III, we introduce a orthogonal basis of states for subsets of qubits and prove a recursive form. Finally, in Sec. IV, we show how the recursive form can be translated into a sequence of two-qubit unitaries to create the optimal state.

The optimal states for phase estimation from Ref. 5 are of the form,
(1)
where N is the total photon number in two modes and n is the photon number in one of the modes, as for example in a Mach–Zehnder interferometer. It is also possible to consider the single-mode case, where N is a maximum photon number and | n is a Fock state.
In either case, a physical phase shift of ϕ results in a state of the form
(2)
The ideal “canonical” phase measurement is then a positive operator-valued measure (POVM) with elements18 
(3)
where
(4)
Here, we are using ϕ ˇ for the result of the measurement, as distinct from the actual phase ϕ. Such a canonical measurement typically cannot be implemented using standard linear optical elements, though it can be approximated with adaptive measurements.7 
It is easily seen that the error distribution after the measurement is then
(5)
In contrast, if one were to use the state with an equal distribution over basis states, then the error probability distribution would be of the following form, that is close to a sinc squared,
(6)
The error distributions for these two states are shown in Fig. 1. The central peak for the equal superposition state is a little narrower, but it has large tails in the distribution, whereas the probabilities of large errors for the optimal state are strongly suppressed.
The optimal state (1) is optimal for minimizing a slightly different measure of error than usual. The Holevo variance for a distribution can be taken as19,
(7)
This measure has the advantages that it is naturally modulo 2 π, as is suitable for phase, and approaches infinity for a flat distribution (with no phase information). Moreover, it approaches the usual variance for suitably narrowly peaked distributions. As the variance of the error distribution does not penalize biased estimates, it is often useful to quantify the quality of the measurement using the mean-square error. The equivalent for the Holevo variance is
(8)
This measure is analogous to the mean-square error. One could also take the measure, as in Ref. 5,
(9)
and the optimization problem is equivalent. The optimal state (1) gives a minimum Holevo variance of
(10)
It is also possible to consider minimization of the mean-square error, but there is not a simple analytic solution.20 
In the case of phase measurements in quantum computing, ϕ would instead be obtained from a unitary operator U with eigenvalue e i ϕ. If the target system is in the corresponding eigenstate of U, denoted | ϕ , and if state | n is used to control application of Un, then the ϕ-dependent state from Eq. (2) is again obtained. In practice, the integer n is represented in binary in ancilla qubits. Then, the most-significant bit, n1, is used to control U, the next most significant bit, n2, is used to control U2, and so forth. In general,
(11)
so | n j is used to control U 2 j 1. This procedure is depicted in Fig. 2.
Here, we have taken m to be the number of bits. In practice, it is convenient to take N + 1 to be a power of 2, so N = 2 m 1. In order to estimate the phase, one wishes to perform the canonical measurement on the ancilla qubits. To explain this measurement, it is convenient to consider the POVM with N + 1 states | ϕ ˇ j with ϕ ˇ j = 2 π j / ( N + 1 ) for j = 0 to N. Then, the states | ϕ ˇ are mutually orthogonal. Such a projective measurement can then be obtained if one can perform the unitary operation
(12)
That is, it maps the state | ϕ ˇ j to a computational basis state | j , so a measurement in the computational basis gives the result for the phase. This operation is the inverse of the usual quantum Fourier transform, which would map from | j to | ϕ ˇ j .

If one aims to obtain the original POVM, one can randomly (with equal probability) select δ ϕ [ 0 , 2 π / ( N + 1 ) ], choose the states with ϕ ˇ j = 2 π j / ( N + 1 ) + δ ϕ, and then perform a measurement in the basis | ϕ ˇ j with this randomly chosen offset. The complete measurement, including the random choice of δ ϕ, is then equivalent to the POVM with the set of outcomes over a continuous range of ϕ ˇ. This approach can be used in order to give a measurement that is covariant (has an error distribution independent of the system phase ϕ). In practice, it is not usually needed, so we will not consider it further in this paper.

In order to obtain the estimate for the phase, one should, therefore, perform the inverse quantum Fourier transform on the control qubits. The inverse quantum Fourier transform can be performed in a semiclassical way, by performing measurements on successive qubits followed by controlled rotations.3 The usual terminology is the “semiclassical Fourier transform,” though this is the inverse transform. An example with three qubits is given in Fig. 3. The bottom (least significant qubit) is measured first. The result is used to control a phase rotation on the middle qubit. Then, the middle qubit is measured, and the results of both measurements are used to control phase rotations on the top qubit. The net result is the same as performing the inverse quantum Fourier transform and measuring in the computational basis.

A further advantage of this procedure is that the fact that the controlled U operations are also performed in sequence means that the sequences can be matched. That is, we have the combined procedure as shown in Fig. 4. In the case where control registers are prepared in an equal superposition state, the qubits are unentangled. This means that preparation of each successive qubit can be delayed until it is needed, as shown in Fig. 5.

What this means is that only one control qubit need be used at once. The preparation of the next control qubit can be delayed until after measurement of the previous one and that qubit can be reset and reused. That is useful in quantum algorithms with a limited number of qubits available and is also useful in quantum phase estimation. In that case, one can replace the control qubits with NOON states with photon numbers that are powers of 2. Then, these NOON states can be measured in sequence to give a canonical measurement of phase, even though a canonical measurement of phase would not be possible on a single two-mode state. In Ref. 16, we demonstrated this approach, using multiple passes through a phase shift rather than NOON states.

The drawback now is that, even though it is possible to perform the canonical measurement, a suboptimal state is being used. We would like to be able to perform measurements achieving the minimum Holevo phase variance. In Ref. 16, we showed that, by using multiple NOON states of each number, it is possible to obtain the desired scaling with total photon number, even though there is a different constant factor so the true minimum error is not achieved.

Up to this point, this section has been revision of prior work. What is new here is that we show how to prepare the optimal state for phase measurement in a sequential way, so the number of qubits that need be used at once is minimized. We will show how the optimal state can be prepared using a sequence of two-qubit operations, as in Fig. 6.

When the optimal state is prepared in this way, its preparation may be delayed until the qubits are needed, as shown in Fig. 7. This is illustrated with three control qubits, where introduction of the third qubit can be delayed until the first qubit is measured (where we are counting the control qubits from the bottom to the top). That is, the first qubit can be measured, reset to | 0 , then used as what is shown as the third (top) qubit. This means that even though there are three control qubits drawn, only two control qubits are used. In general, with more qubits in the optimal state, introduction of each additional qubit can be delayed until after measurement of the qubit two places down, so only two control qubits are actually used.

The reason why it is possible to prepare the optimal state in this way is that it is a superposition of two unentangled states. The sine is a combination of a positive and negative complex exponential as
(13)
When N = 2 m 1, we can write this as
(14)
where M = N + 2 = 2 m + 1. In the last line, we have used
(15)
and then
(16)
In order to write the optimal state in a more compact way, we define the following:
(17)
Then, the optimal state in Eq. (14) can be written as
(18)
That is, it is an equally weighted superposition of two states, which are each unentangled between all qubits. What this means is that any bipartite split of the state will have Schmidt number 2, so the entanglement across the bipartite split can be represented on a single qubit on one side. We use that principle in the state preparation. At any stage, after performing the two-qubit operation between qubit j and j + 1, there will be the correct bipartite entanglement in the split between qubits up to j and qubits from j + 1 to m. However, at that stage qubits from 1 to j − 1 have not been initialized yet, so the entanglement across the bipartite split (for qubits 1 to j) is represented just on qubit j.
In this section, we show how to create the optimal state from Eq. (18) recursively. We introduce the partial tensor product states
(19)
Because | ψ opt is a linear combination of | ϕ [ m ] ± , the state of qubits 1 to is be represented as a linear combination of | ϕ [ ] ± . In order to describe the operations needed to prepare the state | ψ opt , we need to describe the state of qubits 1 to in terms of orthogonal states, which we will denote by | Φ [ ] ± . These orthogonal (but not normalized) states are
(20)
It is possible to prove that these states are orthogonal as in the following Lemma.
Lemma 1. The states | Φ [ ] + and | Φ [ ] defined in Eq. (20) are orthogonal:
(21)
Proof. From Eq. (17), we have
(22)
which is real. In turn, that implies ϕ [ ] | ϕ [ ] + is real and equal to ϕ [ ] + | ϕ [ ] . Moreover, because | Φ j ± are normalized, so are | Φ [ ] ± . Therefore, we obtain
(23)
Here, the first two terms cancel because they are both 1 (due to normalization), and the second two terms cancel because they are equal (because they are real). □
Next, we wish to show that there is a simple recurrence relation for the states | Φ [ ] ± , in their normalized form
(24)
We will use the recurrence relation to derive the sequence of two-qubit operations to prepare the initial state. The result is as follows.
Lemma 2. The states | Φ ̃ [ ] + and | Φ ̃ [ ] defined in Eq. (20) and Eq. (24), have recurrence relation
(25)
where
(26)
(27)
(28)
Proof. To prove this, we start by noting that
(29)
Therefore, we can see that
(30)
which implies
(31)
Similarly, we find
(32)
which implies
(33)
This gives us recurrence relations for | Φ [ ] ± , which can be written
(34)
Let us define the normalization
(35)
In terms of this, the recurrence relation for the normalized states is
(36)
The normalization can be determined using
(37)
which gives
(38)
That gives us the ratios of norms
(39)
Hence, Eq. (37) is the form of the recurrence relation required. □
In Sec. III, we showed that it is possible to construct an orthonormal basis for the state on qubits 1 to as | Φ ̃ [ ] ± , which satisfy the recursive relation
(40)
for constants μ ( s , ± ). Moreover, it is obvious from the definitions that | Φ ̃ [ m ] + = | ψ opt . Therefore, the optimum state may be constructed in a recursive way from | Φ ̃ [ m 1 ] ± on qubits 1 to m − 1, which can be constructed from states | Φ ̃ [ m 2 ] ± on qubits 1 to m − 2, and so forth.
In order to prepare the optimal state, we can apply a stepwise procedure where the principle is to use a single qubit to flag which of | Φ [ ] ± is to be prepared on the remaining qubits 1 to . We start from qubits m and m − 1, and work back to qubits 1 and 2. It is convenient to describe the operations as acting on qubits initialized as | + . Then, we initially perform an operation on qubits m and m − 1 that maps
(41)
The principle of this operation is that it corresponds to the recursion
(42)
At this stage, we only have two qubits, so the states | Φ ̃ [ m 1 ] ± on qubits 1 to m − 1 need to be represented by | ± on a single qubit.
It is trivial to see that unitary U m 1 exists; it can explicitly be performed by rotating qubit m as
(43)
We can alternatively describe the operation as having the matrix form in the | ± basis
(44)
where * indicates entries where the value is unimportant.
We then perform U m 2 on qubits m − 1 and m − 2, down to U1 on qubits 1 and 2. The unitary U needs to map
(45)
This operation corresponds to the recursion given in Eq. (40), and the states | ± on qubit are being used to represent | Φ ̃ [ ] ± on qubits 1 to . This operation has the matrix entries
(46)
This operation may be achieved in the following way. Define the single-qubit rotations V ( 0 , + ) and V ( 1 ) to act as
(47)
Then, U corresponds to the controlled operation
(48)
This method could be used for U m 1, though the method described above is simpler.

After performing this sequence of unitaries, we then need to map | ± to | Φ ̃ [ 1 ] ± on qubit 1. This is a simple single-qubit unitary operation, which can be combined with U1 to give the correct final state with a sequence of two-qubit unitary operations. Thus, our recursive expression for the states gives us a sequence of two-qubit unitaries to create the optimal state.

To be more specific about what operation is needed,
(49)
so that
(50)
(51)
That gives the normalized states
(52)
Therefore, the operation needed is an i phase shift on | .

We have shown how to create the optimal state for phase estimation, in the sense of minimizing Holevo variance, using a sequence of two-qubit operations. When combining this sequential process with the semiclassical quantum Fourier transform, we can entangle new qubits after measuring control qubits in such a way that only two control qubits are needed. This means that the qubit that is measured can be reset and used as the new qubit to be entangled, meaning that only two control qubits are used.

In quantum algorithms where phase estimation is needed with a small number of logical qubits, this approach is ideal. Previously the method used was either many entangled qubits, increasing the size of the quantum computer needed, or a single control qubit, which significantly increases the error. In our method, the number of control qubits is only increased by 1, while giving the minimal error. Here, the quantity being exactly minimized is the Holevo variance, which is very close to the mean-square error (MSE) for sharply peaked distributions. If one were interested in minimizing MSE, then these states give the same leading order term for MSE as the minimum MSE,20 so these states are still suitable.

Our method of preparing the state, although it has been derived for the specific case of the optimal state for minimizing Holevo variance, could also be applied to other states that are a superposition of two unentangled states. The crucial feature is that the Schmidt number is 2 for any bipartite split across the qubits. One could also consider states with larger Schmidt number and use a larger number of qubits as controls. That could potentially be used for states that are optimal for minimizing other measures of error. For example, one could consider methods of approximating Kaiser windows, or the digital prolate spheroidal sequence, as is suitable for optimizing confidence intervals.21 

Another interesting question is whether this procedure could be demonstrated with photons. A scheme with optimal phase states for N = 2 using two photons was demonstrated in Ref. 22. With the preparation scheme we have outlined, it would potentially be possible to demonstrate these states with larger N, though it would require entangling operations that might require nonlinear optical elements.

D.W.B. worked on this project under a sponsored research agreement with Google Quantum AI. D.W.B. is also supported by Australian Research Council Discovery Project Nos. DP190102633, DP210101367, and DP220101602.

The authors have no conflicts to disclose.

Peyman Najafi: Formal analysis (equal); Investigation (equal); Writing – original draft (lead). Pedro Costa: Formal analysis (supporting); Investigation (supporting); Writing – review & editing (supporting). Dominic Berry: Conceptualization (lead); Formal analysis (equal); Funding acquisition (lead); Investigation (equal); Methodology (lead); Project administration (lead); Supervision (lead); Writing – original draft (supporting); Writing – review & editing (lead).

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

1.
P.
Shor
, in
Proceedings 35th Annual Symposium on Foundations of Computer Science, Santa Fe, New Mexico
(IEEE,
1994
), pp.
124
134
.
2.
A.
Aspuru-Guzik
,
A. D.
Dutoi
,
P. J.
Love
, and
M.
Head-Gordon
,
Science
309
,
1704
(
2005
).
3.
R. B.
Griffiths
and
C.-S.
Niu
,
Phys. Rev. Lett.
76
,
3228
(
1996
).
4.
R.
Babbush
,
C.
Gidney
,
D. W.
Berry
,
N.
Wiebe
,
J.
McClean
,
A.
Paler
,
A.
Fowler
, and
H.
Neven
,
Phys. Rev. X
8
,
041015
(
2018
).
5.
A.
Luis
and
J.
Peřina
,
Phys. Rev. A
54
,
4564
(
1996
).
6.
When referring to other publications, the word “we” is used to mean D.W.B. and the other coauthors on those other publications.
7.
D. W.
Berry
and
H. M.
Wiseman
,
Phys. Rev. Lett.
85
,
5098
(
2000
).
8.
A. N.
Boto
,
P.
Kok
,
D. S.
Abrams
,
S. L.
Braunstein
,
C. P.
Williams
, and
J. P.
Dowling
,
Phys. Rev. Lett.
85
,
2733
(
2000
).
9.
P.
Kok
,
H.
Lee
, and
J. P.
Dowling
,
Phys. Rev. A
65
,
052104
(
2002
).
10.
H.
Lee
,
P.
Kok
, and
J. P.
Dowling
,
J. Mod. Opt.
49
,
2325
(
2002
).
11.
H. T.
Dinani
and
D. W.
Berry
,
Phys. Rev. A
90
,
023856
(
2014
).
12.
H. T.
Dinani
,
M. K.
Gupta
,
J. P.
Dowling
, and
D. W.
Berry
,
Phys. Rev. A
93
,
063804
(
2016
).
13.
Z.
Huang
,
K. R.
Motes
,
P. M.
Anisimov
,
J. P.
Dowling
, and
D. W.
Berry
,
Phys. Rev. A
95
,
053837
(
2017
).
14.
K. R.
Motes
,
R. L.
Mann
,
J. P.
Olson
,
N. M.
Studer
,
E. A.
Bergeron
,
A.
Gilchrist
,
J. P.
Dowling
,
D. W.
Berry
, and
P. P.
Rohde
,
Phys. Rev. A
94
,
012344
(
2016
).
15.
Z.
Huang
,
P. P.
Rohde
,
D. W.
Berry
,
P.
Kok
,
J. P.
Dowling
, and
C.
Lupo
,
Quantum
5
,
447
(
2021
).
16.
B. L.
Higgins
,
D. W.
Berry
,
S. D.
Bartlett
,
H. M.
Wiseman
, and
G. J.
Pryde
,
Nature
450
,
393
(
2007
).
17.
I. D.
Kivlichan
,
C.
Gidney
,
D. W.
Berry
,
N.
Wiebe
,
J.
McClean
,
W.
Sun
,
Z.
Jiang
,
N.
Rubin
,
A.
Fowler
et al
Quantum
4
,
296
(
2020
).
18.
B. C.
Sanders
,
G. J.
Milburn
, and
Z.
Zhang
,
J. Mod. Optics
44
,
1309
(
1997
).
19.
A. S.
Holevo
,
Quantum Probability and Applications to the Quantum Theory of Irreversible Processes
, edited by
L.
Accardi
,
A.
Frigerio
, and
V.
Gorini
(
Springer
,
Berlin/Heidelberg
,
1984
), pp.
153
172
.
20.
D. W.
Berry
,
M. J. W.
Hall
,
M.
Zwierz
, and
H. M.
Wiseman
,
Phys. Rev. A
86
,
053813
(
2012
).
21.
D. W.
Berry
,
Y.
Su
,
C.
Gyurik
,
R.
King
,
J.
Basso
,
A. D. T.
Barba
,
A.
Rajput
,
N.
Wiebe
,
V.
Dunjko
et al, arXiv:2209.13581 (
2022
).
22.
S.
Daryanoosh
,
S.
Slussarenko
,
D. W.
Berry
,
H. M.
Wiseman
, and
G. J.
Pryde
,
Nat. Commun.
9
,
4606
(
2018
).