Phased array transducers (PATs) are used in many applications, from airborne ultrasonic tactile displays to acoustic levitation. Acoustic holograms play a significant role in determining the performance of these applications. Many PATs and optimizers have been developed; however, only the following have been demonstrated in the literature: “phase” and “phase and amplitude” control of transducers and “phase” and “amplitude” only control at target points. Thus, most of the combinations of transducer state and target acoustic field conditions are yet to be explored. Here, we explore such combinations using Diff-PAT, one of the latest acoustic hologram optimizers. Diff-PAT is based on automatic differentiation and stochastic gradient descent. This optimizer achieves higher accuracy than conventional optimizers. We formulated multiple loss functions and wave propagators to enable each combination of the operation mode and quantitatively assessed the performance of each combination. The developed optimizers will offer new opportunities in the field and could allow further simplifications in PAT applications.

Acoustic holograms are ubiquitous in ultrasonic applications in medicine, biology, physics, and chemistry. Consequently, the accuracy and precision of acoustic holograms are becoming a significant point of interest. In mid-air ultrasonics, the phased array transducer (PAT) is the most popular method to render acoustic holograms; PATs have been designed by multiple authors.1–3 They have been applied in many applications, such as in airborne ultrasonic tactile displays (AUTDs),4 in acoustic levitation,5 and for display purposes.6,7 Acoustic hologram optimizers for PATs have also been investigated. Long et al. first developed an eigensolver and Tikhonov regularization based method (ES) in 2014,8 and Marzo and Drinkwater developed an iterative back propagation (IBP) method in 2018.5 Plasencia et al. developed the Gerchberg–Saxton (GS-PAT) and the “corrected version of the eigensolver and Tikhonov regularization based method” (CES) in 2020.9 The GS-PAT and IBP are modified versions of the Gerchberg–Saxton (GS) algorithm. Moreover, Sakiyama et al. developed the Levenberg–Marquardt (LM) algorithm based method in 2020,10 and Suzuki et al. demonstrated a Greedy Algorithm (GA) with brute-force search method in 2021.11 Most recently, Fushimi et al. developed a new acoustic hologram optimization method based on the stochastic gradient descent algorithm and automatic differentiation called Diff-PAT.12 

While many optimization methods and PATs have been developed, these optimizers can only optimize for phase or amplitude at the target point. In addition, only “phase” and “phase and amplitude” control methods have been explored for the side of transducers. The currently explored and unexplored combination of transducers and target point control methods in PAT systems is summarized in Fig. 1; much of the combinations are yet to be explored. Thus, in this paper, we developed a unifying methodological framework for sound-field computation that allows all prior approaches in PATs to be modeled (e.g., IBP and GS-PAT) while also exploring other approaches that, even if, in principle, applicable,13–18 had never been adapted to PATs. Achieving acoustic holograms with “amplitude” only control of transducers may contribute to further simplification of PATs. In addition, more advanced access to pressure amplitude and phase at the target point could offer a new opportunity in ultrasonic applications such as acoustic manipulation,5,19 acoustic streaming,20 and ultrasonic tactile display.4,8

FIG. 1.

Explored and unexplored combinations in PATs. The classification of the target point control was set to observe whether the optimizer can control the given variable to an arbitrarily set target.

FIG. 1.

Explored and unexplored combinations in PATs. The classification of the target point control was set to observe whether the optimizer can control the given variable to an arbitrarily set target.

Close modal

Here, Diff-PAT was selected as the basis of the generalized acoustic hologram optimizer because Diff-PAT was shown to be more effective than a conventional algorithm based on eigensolver or GS methods. Furthermore, it is simple to implement different modes of wave propagation and loss functions in Diff-PAT. By exploring the unchartered opportunities in the field, this paper also aims to demonstrate the flexibility of the newly developed optimizer. The basic components for the gradient-descent based hologram optimizer are similar to the original Diff-PAT. The acoustic field at the target point (pa) is calculated using linear superposition of pressure contribution from each transducer,

(1)

where T is the total number of transducers, d(xc, xt) is the Euclidean distance between xc and xt, and Pref is the reference pressure amplitude. Here, a single sided PAT with 16 × 16 (T = 256) transducers is assumed; D(θ)=2J1(krsinθ)krsinθ is the far-field directivity function for the piston source based on the angle θ, which is evaluated as the angle between the transducer normal and target point xc; r = 5 mm is the transducer radius, and J1 is the Bessel function of the first kind of order 1; k=2πfc0 is the wavenumber given the acoustic frequency, f = 40 kHz, and the speed of sound c0; Pref and c0 are assumed to be 1 Pa at 1 m and 346 ms−1, respectively; pa is a complex number; the amplitude Aac=pa, and the phase ϕac=Arg(pa) at the target point are calculated accordingly. The optimization function is defined as

(2)

The contents and size of the optimization variable τ depend on the transducer drive mode (phase, amplitude, and P&A). For phase only control, τ = ϕt with size T and At = 1 were defined for all transducers. For amplitude only control, τ = At with size T and ϕt = 0 for all transducers. For P&A control, τ = [Ψ] with ΨC and size T were defined. Since passing on the complex number to the optimizer was considered unfavorable, the input values were separated into real and imaginary numbers; τpa=[R(Ψ)I(Ψ)] with size 2T in the actual implementation. The amplitude and complex angle of Ψ were used to determine At and ϕt. For both amplitude only and P&A control, the amplitude of the transducer is limited to values between 0 and 1. In order to enforce this, the amplitude of the transducer was normalized as At=τmax(τ).

Different mode of control at the target point is achieved by customizing the loss function for each purpose. In general, the loss function is defined as

(3)

This method is similar to methods proposed by Li et al.,17 who first demonstrated P&A control at the target plane using a single phase plate. Eq. (3) determines the loss function by evaluating the difference of real and imaginary parts separately. Similar to the optimization variable τ, the target variable Ω depends on the mode at the target point. Here, we assume G number of target points. For phase only control at the target point, Ω = ϕg with size G is set, while Aa = Ag = 1. For amplitude only control at the target point, Ω = Ag with size G is set, while ϕa = ϕg = 0. For P&A control, Ω = [Ag, ϕg] with size 2C. The methods proposed above are examples of the many different implementation approaches. In all cases considered in this article, the optimizers are tasked to optimize the variables until the iteration number reached n = 1000. These loss functions as defined above were programmed in Python 3.7.10, and the derivative of the loss function with respect to the optimization variables τ was automatically differentiated using JAX21 (ver. 0.2.17). The Adam optimizer22 was used to update the current guess of τn: τn=τn1αm̂n(v̂n+ε), where m̂n=mt1β1n and v̂n=vn1β2n are the bias-corrected first and second raw moment estimates, respectively. mn=β1mn1+(1β1)δLδτ, and vn=β2vn1+(1β2)δLδτ2. The hyperparameters α, β1, β2, and ɛ are 0.1, 0.9, 0.999, and 1 × 10−8, respectively. These values are arbitrary; further improvements may be achieved by tuning these parameters. The initial guess of τ was generated randomly. The sets of codes necessary to fully reproduce all results and graphs in this paper are made available as shown in the Data Availability statement.23 

The generalized acoustic hologram optimizer can combine any propagation modes and loss functions and allows advanced acoustic field control. Figure 2 shows an example field when P&A control is specified for both transducer and target point sides. This can generate a phase surfer19 using PATs and effectively transport particles at the surface of the water using the phase gradient.5 This offers a new opportunity in particle manipulation or acoustic streaming.

FIG. 2.

Demonstration of phase surfer using P&A control at transducer and target point sides. Here, 50 target points were specified with a radius of 40 mm. The target pressure amplitude was 50 Pa, and the target phase was set such that it goes around the circle from 0 to 2π.

FIG. 2.

Demonstration of phase surfer using P&A control at transducer and target point sides. Here, 50 target points were specified with a radius of 40 mm. The target pressure amplitude was 50 Pa, and the target phase was set such that it goes around the circle from 0 to 2π.

Close modal

To further assess the effectiveness of the optimizer in each combination, we assigned a set of target positions, amplitudes, and phases. Here, two cases were studied with a varying number of focal points, N = 2 and 4. For each case, 1000 sets of target points were randomly generated such that they lie within region of interest (ROI) (x = [−0.05, 0.05], y = [−0.05, 0.05], and z = [0.05, 0.15] m). The pressure amplitudes were assigned such that the minimum pressure amplitude for each point is 10 Pa and such that the sum of each target pressure adds up to the maximum acoustic pressure amplitude of the PAT (identified by averaging the acoustic pressure amplitude of a single focus at the vertex of ROI; in this case, it is 950 Pa). The target phases were assigned such that they lie between [0, 2π]. The performance of the optimizer was measured using two indexes: target pressure accuracy (Ra=AgAac) and phase error (Rp=cosϕgcosϕac). The result of performance assessments is shown in Figs. 3(a) and 3(b). As it can be confusing to have phase control for a target point and transducer; a combination of letters and numbers will be used to specify each combination. For example, [A], [B], and [C] specify “phase” only, “amplitude” only, and “P&A” control of transducers, respectively; [i], [ii], and [iii] specify “phase” only, “amplitude” only, and “P&A” control at target points, respectively. Furthermore, A:i is a combination of phase only transducer control with phase only control at the target point, and C:iii denotes a combination of P&A for both the transducer and target point.

FIG. 3.

Box-and-whisker plot comparing the performance of optimizer in each combination using target pressure accuracy (Ra) and phase error (Rp). The red line shows the median, the blue box shows Q1 and Q3, and the whiskers show 1.5 times the interquartile range. Black circles in the plot show outliers, beyond the whisker range. 1000 sets of target points were randomly assigned; (a) shows the results when two control points (N = 2) are specified; (b) shows the results for N = 4.

FIG. 3.

Box-and-whisker plot comparing the performance of optimizer in each combination using target pressure accuracy (Ra) and phase error (Rp). The red line shows the median, the blue box shows Q1 and Q3, and the whiskers show 1.5 times the interquartile range. Black circles in the plot show outliers, beyond the whisker range. 1000 sets of target points were randomly assigned; (a) shows the results when two control points (N = 2) are specified; (b) shows the results for N = 4.

Close modal

It is evident that [i] can be achieved well in all transducer driving modes and control number points, as shown in Figs. 3(a) and 3(b). The average phase error (Rp) for A:i, B:i, and C:i is 9.8 × 10−4, 1.61 × 10−5, and 9.7 × 10−6, respectively, when N = 2 and 1.26 × 10−5, 6.73 × 10−6, and 8.08 × 10−5, respectively, when N = 4. [B] achieves good phase error in comparison with [A] and [C].

However, when [ii] is assigned, the performance of [B] drops significantly in comparison to that of [A] and [C]. The transducer amplitude only method is considerably more difficult than phase-only control since the acoustic field can only be defined via “the relative position of the focal point and the transducer position” and “the amplitude of each transducer.” The average target pressure accuracies (Ra) for A:ii, B:ii, and C:ii are 1.00, 0.603, and 1.00, respectively, when N = 2 and 1.00, 0.859, and 1.00, respectively, when N = 4. The lack of performance by [B] is attributed to the fact that on average, 45.4% and 44.0% of the transducers in PATs were off (off state is defined when transducer amplitude <1%) when N = 2 and N = 4, respectively (see the supplementary material for an example of transducer amplitude distribution). Thus, the achievable target pressure amplitudes by PATs drop significantly and cannot achieve high pressure amplitudes when assigned. The improved performance of B:ii when N = 4 (in comparison to B:ii in N = 2) is attributed to the pressure assignment method requiring each target point to “share” the maximum pressure amplitude (thus reducing the assigned target amplitude average). Since some transducers are already at 100% capacity (see the supplementary material), uniform scaling of the transducers to improve the performance is not possible.

Similar trends are observed in [iii] when both phase and amplitude are specified at the target point. The average target pressure accuracy (Ra) for A:iii, B:iii, and C:iii is 1.00, 0.501, and 1.00 when N = 2 and 1.00, 0.665, and 1.00, respectively, when N = 4. The average phase error (Rp) is 8.48 × 10−4, 0.0907, and 9.03 × 10−4, respectively, when N = 2 and 9.66 × 10−5, 0.0957, and 1.35 × 10−4, respectively, when N = 4.

The performance difference between A:ii and C:ii or A:iii and C:iii is not statistically significant (i.e., equivalent performance) in matching the target specifications (see the supplementary material for details on statistical analysis). Hence, changing from [A] to [C] does not significantly change the performance, and the benefit of adding amplitude control in transducer side is not apparent solely from Fig. 3.

On the other hand, a close inspection of the convergence graph demonstrates that in some circumstances, the loss value of [A] is lower than that of [C] (see the supplementary material). Since the solution set of [A] is a subset of [C], [C] should be better than [A]. We believe the increased number of optimization variables and complexity of optimizing both amplitude and phase as optimization variables decrease the performance of the [C] optimizer. However, as demonstrated in Fig. 3, the performance between [A] and [C] is not significant. In addition, the current performance of [C] is sufficiently better than that of traditional optimizers in the field.9,12 Further exploration to improve [C] optimizer may not be necessary.

Figures 4(a) and 4(b) are examples of acoustic field as optimized by A:ii and C:ii; they show that the output field is significantly different between (a) A:ii and (b) C:ii. The ambient noise surrounding the focal point is qualitatively reduced in Fig. 4(b) despite using the same loss function (i.e., the loss function only evaluates the acoustic amplitude at the target point). To assess this ambient noise more quantitatively, we calculated the sum of the pressure amplitude in the xy plane (square area with a side length of 0.03 m and focal point at the center) for A:ii and C:ii (excluding the circular region with a radius of 2.5λ around the focal point, see Fig. 4) using 1000 sets of optimized field data from Fig. 3 (N = 2). The results are as shown in Fig. 5, and we quantitatively confirm the reduction in ambient noise in C:ii (this trends are continued to be seen in N = 4, 8, 16, and 32 as shown in the supplementary material). The reason behind this reduction in ambient noise is unclear, and further study is required to understand the difference in results. The reduction in ambient noise around the focal point is suitable for applications such as airborne ultrasonic tactile displays (AUTDs),4,8 and Diff-PAT with [C] could be better suited for AUTDs than [A].

FIG. 4.

Acoustic pressure field optimized by (a) A:ii and (b) C:ii. The residual ambient pressure amplitude surrounding focal points in (a) is more visible than in (b). The black cross shows the position of the focal point. The white dashed box and circle in (a) are examples of how the ambient noise was analyzed in Fig. 5.

FIG. 4.

Acoustic pressure field optimized by (a) A:ii and (b) C:ii. The residual ambient pressure amplitude surrounding focal points in (a) is more visible than in (b). The black cross shows the position of the focal point. The white dashed box and circle in (a) are examples of how the ambient noise was analyzed in Fig. 5.

Close modal
FIG. 5.

Histogram showing the sum of ambient noises surrounding the focal point using 1000 samples from Fig. 2 (N = 2) determined by summing the pressure amplitude around a square box in the xy plane with a side length of 3 cm while excluding the 2.5λ radial region from the focal point. Results for N = 4, 8, 16, and 32 are made available in the supplementary material.

FIG. 5.

Histogram showing the sum of ambient noises surrounding the focal point using 1000 samples from Fig. 2 (N = 2) determined by summing the pressure amplitude around a square box in the xy plane with a side length of 3 cm while excluding the 2.5λ radial region from the focal point. Results for N = 4, 8, 16, and 32 are made available in the supplementary material.

Close modal

In conclusion, we demonstrated a generalized acoustic hologram optimizer that can control phase, amplitude, and phase and amplitude at the transducer side and target point. The optimizer was based on Diff-PAT, and various loss functions and optimization variables were developed to achieve the target combination. The performance of the optimizer with each combination was quantitatively assessed; we discussed the potential and limitations. We showed that [A] and [C] perform equally when assessed at the target point. However, around the focal point, [C] suppresses the ambient noise and makes it more suitable for use in AUTDs than [A]. PAT has already been demonstrated to be a versatile and useful tool for ultrasonic applications. A more advanced control of PATs and the insights we obtained in this study will aid further development of applications in the field.

See the supplementary material for the transducer amplitude distribution, statistical analysis, convergence graph, and broader assessment of noise reduction in P&A.

This work was funded by the Strategic Research Platform toward Digital Nature powered by Pixie Dust Technologies, Inc. This work was partially supported by JSPS KAKENHI, Grant No. 21K14103. We would like to thank Editage (www.editage.com) for English language editing.

The authors have no conflicts to disclose.

The data that support the findings of this study are available within this article and its supplementary material. The program codes and the underlying dataset to fully recreate the figures in the manuscripts are made openly available23 in Zenodo at 10.5281/zenodo.5701914.

1.
A.
Marzo
,
A.
Barnes
, and
B. W.
Drinkwater
, “
TinyLev: A multi-emitter single-axis acoustic levitator
,”
Rev. Sci. Instrum.
88
,
085105
(
2017
).
2.
R.
Morales
,
I.
Ezcurdia
,
J.
Irisarri
,
M. A. B.
Andrade
, and
A.
Marzo
, “
Generating airborne ultrasonic amplitude patterns using an open hardware phased array
,”
Appl. Sci.
11
,
2981
(
2021
).
3.
S.
Suzuki
,
S.
Inoue
,
M.
Fujiwara
,
Y.
Makino
, and
H.
Shinoda
, “
AUTD3: Scalable airborne ultrasound tactile display
,”
IEEE Trans. Haptics
(published online,
2021
).
4.
T.
Hoshi
,
M.
Takahashi
,
T.
Iwamoto
, and
H.
Shinoda
, “
Noncontact tactile display based on radiation pressure of airborne ultrasound
,”
IEEE Trans. Haptics
3
,
155
165
(
2010
).
5.
A.
Marzo
and
B. W.
Drinkwater
, “
Holographic acoustic tweezers
,”
Proc. Natl. Acad. Sci. U. S. A.
116
,
84
89
(
2018
).
6.
Y.
Ochiai
,
T.
Hoshi
, and
J.
Rekimoto
, “
Pixie dust: Graphics generated by levitated and animated objects in computational acoustic-potential field
,”
ACM Trans. Graphics
33
,
85
(
2014
).
7.
T.
Fushimi
,
A.
Marzo
,
B. W.
Drinkwater
, and
T. L.
Hill
, “
Acoustophoretic volumetric displays using a fast-moving levitated particle
,”
Appl. Phys. Lett.
115
,
064101
(
2019
).
8.
B.
Long
,
S. A.
Seah
,
T.
Carter
, and
S.
Subramanian
, “
Rendering volumetric haptic shapes in mid-air using ultrasound
,”
ACM Trans. Graphics
33
,
1
10
(
2014
).
9.
D. M.
Plasencia
,
R.
Hirayama
,
R.
Montano-Murillo
, and
S.
Subramanian
, “
GS-PAT: High-speed multi-point sound-fields for phased arrays of transducers
,”
ACM Trans. Graphics
39
,
138
(
2020
).
10.
E.
Sakiyama
,
A.
Matsubayashi
,
D.
Matsumoto
,
M.
Fujiwara
,
Y.
Makino
, and
H.
Shinoda
, “
Midair tactile reproduction of real objects
,” in
Haptics: Science, Technology, Applications
, edited by
I.
Nisky
,
J.
Hartcher-O’Brien
,
M.
Wiertlewski
, and
J.
Smeets
(
Springer International Publishing
,
Cham
,
2020
), pp.
425
433
.
11.
S.
Suzuki
,
M.
Fujiwara
,
Y.
Makino
, and
H.
Shinoda
, “
Radiation pressure field reconstruction for ultrasound midair haptics by Greedy algorithm with brute-force search
,”
IEEE Trans. Haptics
(published online,
2021
).
12.
T.
Fushimi
,
K.
Yamamoto
, and
Y.
Ochiai
, “
Acoustic hologram optimisation using automatic differentiation
,”
Sci. Rep.
11
,
12678
(
2021
); arXiv:2012.02431.
13.
R.
Piestun
,
B.
Spektor
, and
J.
Shamir
, “
On-axis binary-amplitude computer generated holograms
,”
Opt. Commun.
136
,
85
92
(
1997
).
14.
J.
Liu
,
W.
Hsieh
,
T.
Poon
, and
P. W. M.
Tsang
, “
Complex Fresnel hologram display using a single SLM
,”
Appl. Opt.
50
,
H128
(
2011
).
15.
J.
Liu
,
M.
Wu
, and
P. W. M.
Tsang
, “
3D display by binary computer-generated holograms with localized random down-sampling and adaptive intensity accumulation
,”
Opt. Express
28
,
24526
(
2020
).
16.
M. D.
Brown
, “
Phase and amplitude modulation with acoustic holograms
,”
Appl. Phys. Lett.
115
,
053701
(
2019
).
17.
J.
Li
,
Z.
Lv
,
Z.
Hou
, and
Y.
Pei
, “
Comparison of balanced direct search and iterative angular spectrum approaches for designing acoustic holography structure
,”
Appl. Acoust.
175
,
107848
(
2020
).
18.
Z.
Ma
,
K.
Melde
,
A. G.
Athanassiadis
,
M.
Schau
,
H.
Richter
,
T.
Qiu
, and
P.
Fischer
, “
Spatial ultrasound modulation by digitally controlling microbubble arrays
,”
Nat. Commun.
11
,
4537
(
2020
).
19.
K.
Melde
,
A. G.
Mark
,
T.
Qiu
, and
P.
Fischer
, “
Holograms for acoustics
,”
Nature
537
,
518
522
(
2016
).
20.
K.
Hasegawa
,
L.
Qiu
,
A.
Noda
,
S.
Inoue
, and
H.
Shinoda
, “
Electronically steerable ultrasound-driven long narrow air stream
,”
Appl. Phys. Lett.
111
,
064104
(
2017
).
21.
J.
Bradbury
,
R.
Frostig
,
P.
Hawkins
,
M. J.
Johnson
,
C.
Leary
,
D.
Maclaurin
, and
S.
Wanderman-Milne
, JAX: Composable transformations of Python+NumPy programs,
2018
.
22.
D. P.
Kingma
and
J.
Ba
, “
Adam: A method for stochastic optimization
,” in
3rd International Conference on Learning Representations, ICLR 2015–Conference Track Proceedings
(
2015
), pp.
1
15
; arXiv:1412.6980.
23.
T.
Fushimi
,
K.
Yamamoto
, and
Y.
Ochiai
(
2021
). “
Applied Diff-PAT
,” Zenodo. .

Supplementary Material