Distortion product otoacoustic emissions (DPOAEs) have been accurately measured with an intensimetric technique, involving simultaneous measure of pressure and velocity in the ear canal, which allows one to correctly calibrate both the input stimuli and the otoacoustic emission (OAE) level. Suitable combinations of standard intensimetric quantities, active intensity and power density [Stanzial, Shiffrer, and Sacchi, J. Acoust. Soc. Am. 131, 269–280 (2012)], are used to equalize the stimuli transmitted to the middle ear, and to estimate the DPOAE level emitted by the eardrum. The DPOAE intensimetric spectra are consistent with those recorded with a high-quality conventional otoacoustic probe with state-of-the-art calibration of both stimulus and OAE response [Charaziak and Shera, J. Acoust. Soc. Am. 141, 515–525 (2017)], demonstrating the applicability of the intensimetric method to OAE measurements.
1. Introduction
Otoacoustic emissions1 (OAEs) are acoustic signals measurable in the ear canal, generated by the cochlear active nonlinear amplification mechanism, potentially suitable for accurate objective diagnostics of cochlear function. One of the main uncertainties still limiting the OAE diagnostic power is related to the formation of standing waves in the closed ear canal cavity. OAEs are typically measured inserting a probe into the ear canal. The probe, housing miniaturized loudspeakers and microphones, acoustically seals the ear canal cavity, terminated at the opposite end by the eardrum. Longitudinal resonant modes develop in this approximately cylindrical cavity, with frequencies critically dependent on the individual ear canal length and on the probe insertion depth. As a consequence, (1) the stimulus pressure measured at the probe position is significantly different from that fed to the eardrum, and (2) the OAE pressure measured at the probe position is different from that emitted by the eardrum. The largest differences occur near the resonance frequencies of the cavity, which depend on the probe insertion depth. Both problems, which enhance the intrinsic variability of OAE level estimates, decreasing the OAE diagnostic power, have been recently addressed by Charaziak and Shera,2 who used Thévenin calibration of the sound sources both to correct the stimulus calibration and to convert the measured OAE level to that actually emitted by the eardrum.
In this study, we perform the same task in a model-independent way, using a pressure-velocity detector and suitable intensimetric physical quantities.3 Although acoustic waves consist of pressure and velocity oscillating fields, p and v, which propagate at the speed of sound, only their pressure component is typically measured in clinical applications. Recently, miniaturized intensimetric probes have become available, which simultaneously measure at the same place both the pressure and the velocity field.3 Measuring also the velocity component of the acoustic wave field does not provide redundant information; on the contrary, simultaneous pressure and velocity measurements allow one to estimate the acoustic impedances of the system constituted by the probe and the acoustic environment.3 A few studies have already used p-v measurements in the ear canal to estimate pressure at the eardrum,4,5 mainly for headphone equalization purposes. More directly, suitable combinations of intensimetric quantities may be used to solve in a model-independent fashion the problem of obtaining calibrated OAE measurements independent of the probe insertion depth, and, more generally, of the resonant properties of the individual ear canal.
2. Theory
2.1 Intensimetric physical quantities
The active intensity3 of the acoustic wave field is defined, in the frequency domain, as
with ω = 2πf. It can be measured at any point inside the ear canal (e.g., at the probe position) and measures the net flow of the acoustic energy across the measurement region. Although velocity is a vector quantity, we consider here its longitudinal component only. It can be demonstrated that this quantity, measured at the stimulus frequencies f1 and f2, also represents the distortion product otoacoustic emission (DPOAE) stimulus intensity IS transmitted to the middle ear. Therefore, equalizing the active intensity of the stimuli across frequencies means equalizing the acoustic intensity transmitted to the middle ear, almost equivalently to equalizing the forward pressure.6
The power density of the acoustic wave field is defined instead as
where ρ is the air density and c the speed of sound.
It may be easily demonstrated, using the acoustic potential formalism,3 that the DPOAE intensity emitted by the eardrum may be evaluated at the DP frequency, to a good approximation, as the arithmetic mean of the active intensity and the power density, measured anywhere inside the ear canal (e.g., at the probe position):
which may also be defined as the DPOAE emitted intensity. More accurate expressions of the intensity emitted by the eardrum may be computed, keeping into account multiple internal reflections in the closed ear canal cavity:
where k is the wave number, L is the length of the closed ear canal cavity, R1, R2 are the reflection coefficients at the eardrum and at the probe, and the last term is negligible in most ears. The length L can be estimated from the resonant pattern of the ear canal response to the stimuli. The values of R1 and R2 may be computed, in principle, from the same intensimetric measurements, which have been proposed indeed for measuring the impedance/reflectivity of the middle ear,3 but, for the present OAE application, the estimate of R2 is affected by a large uncertainty, because it needs a signal coming from the eardrum. One may use the DPOAE signal itself, but its signal-to-noise-ratio (SNR) is obviously much lower than that of stimuli. Therefore, in the analysis of experimental data, the simpler approximation of Eq. (3) will be used.
3. Methods
3.1 Intensimetric DPOAE acquisition
A Microflown p-v detector PU-Match Probe (Microflown Technologies, Arnhem, the Netherlands) and a couple of Etymotic ER-2 loudspeakers (Etymotic Research, Inc., Elk Grove Village, IL), were housed in a teflon cylindrical cavity, terminated by a copper tube of 8 mm diameter, which was inserted a few mm into the ear canal of the subject. Deeper insertion depths could be obtained with a different kind of cavity, but the system was not optimized in this sense, also because one of the advantages of the proposed method is its insensitivity to the insertion depth. Two slow linear chirp stimuli were separately fed to the two loudspeakers (to avoid internal generation of distortion), with instantaneous frequencies in a constant ratio r = f2(t)/f1(t) = 1.22, which generated a DPOAE response chirp at frequency fDP(t) = 2f1(t)–f2(t). The chirp sweep rates were suitably set in order to get dfDP/dt = 800 Hz/s. Complex spectra of the two stimuli and of the DPOAE response were separately obtained for the pressure and velocity detectors by applying Fourier analysis to 50% overlapping time frames of duration 50 ms, yielding a frequency resolution of 20 Hz. In the measurement frequency range, the cochlear round-trip delay was much shorter than the time frame duration, so the instantaneous DPOAE frequency coincided with fDP(t) within a Fourier Transform frequency bin. From the pressure and velocity spectra, active intensity and power density were computed in the frequency domain.
The two stimulus levels transmitted to the middle ear were equalized across frequency by equalizing their active intensity spectra, i.e., the voltage chirps fed to the loudspeakers were multiplied by suitable envelope functions to get equalized spectra transmitted through the middle ear. Equation (3) was used to convert the intensimetric quantities measured at the probe position to the DPOAE power emitted by the eardrum.
3.2 Conventional DPOAE acquisition
An Etymotic ER-10X probe, consisting of two loudspeakers and a low-noise microphone, was used for the conventional OAE measurements, in which only the pressure of the acoustic wave field is measured, both for the stimuli and for the DPOAE response. In this case, the equalization of the stimulus transmitted to the middle ear was accomplished by equalizing the forward component of the pressure wave, according to a well-established procedure.6 This procedure requires preliminary Thévenin calibration of the sound source in a set of tubes of different length and known impedance, which allows one to compute the forward component of the pressure by evaluating the load impedance and the reflection coefficients in the same conditions of the OAE measurements (probe inserted in the ear canal). The same signal generation and acquisition technique of the intensimetric measurements was used, but, in this case, forward pressure was equalized across frequencies for the two stimulus chirps, whereas the DPOAE spectrum emitted by the eardrum was obtained from that measured by the probe microphone at the ear canal entrance by applying the emitted pressure level (EPL) correction proposed by Charaziak and Shera.2
4. Results
Once expressed in dB units, the two OAE quantities yielded by the two experiments, IE and EPL,2 may be directly compared, if equivalent reference quantities are used for the conversion to dB units (p0 = 20μPa for EPL and p02/ρc for IE). The two measurements performed on the same ear are shown in Fig. 1. Unfortunately, the noise floor of our intensimetric setup is quite high, due to the noise level of its velocity and pressure sensors, and to its size, which makes it difficult to insert it into the ear canal with the same depth and acoustical insulation of a state-of-the-art OAE probe. Despite the much lower SNR, the intensimetric technique yields a DPOAE spectrum that is in rather good agreement with that obtained with the conventional technique, reproducing also some of the characteristic oscillating features known as the DPOAE fine-structure, although additional spectral fluctuations appear, due to the high noise floor.
Comparison between the DPOAE spectra (estimated as the power level emitted by the eardrum with equalized stimuli transmitted to the middle ear) obtained with the intensimetric p-v technique (a) and with the Thévenin-based EPL method (b) proposed by Charaziak and Shera (Ref. 2).
Comparison between the DPOAE spectra (estimated as the power level emitted by the eardrum with equalized stimuli transmitted to the middle ear) obtained with the intensimetric p-v technique (a) and with the Thévenin-based EPL method (b) proposed by Charaziak and Shera (Ref. 2).
It is well-known7 that the DPOAE spectral fine-structure is due to interference between two components with different phase-frequency relation, a (typically dominant) nonlinear distortion component, generated by a wave-fixed mechanism, with almost constant phase (as a consequence of the cochlear scaling symmetry), and a reflection component, generated by a place-fixed mechanism, with rapidly rotating phase. Time-frequency filtering8 allows one to separate the two DPOAE components in the time-frequency domain. Even if intensimetric quantities neglect the phase information, DPOAE amplitude spectra possess the information necessary to separate the components in the time or (better) in the time-frequency domain, because phase and amplitude fluctuations are intrinsically related to each other in causal systems. Figure 2 compares the time-frequency representations of the two spectra of Fig. 1, showing a strong zero-delay component, associated with the DPOAE nonlinear distortion component, and a fainter long-delay component, associated with coherent reflection.7 Even the time-frequency position of the most intense spots of the faint delayed components approximately correspond, although in the intensimetric case (right) their intensity is close to the noise floor. Because of the lack of phase information, in the intensimetric case the distribution is specular symmetric with respect to the frequency axis. The higher background luminosity is again a consequence of the higher noise level of the intensimetric probe.
Time frequency representation, computed with the continuous wavelet transform, of the DPOAE complex spectrum (left) obtained with the conventional technique, and of the DPOAE emitted intensity spectrum (right) obtained with the intensimetric p-v technique.
Time frequency representation, computed with the continuous wavelet transform, of the DPOAE complex spectrum (left) obtained with the conventional technique, and of the DPOAE emitted intensity spectrum (right) obtained with the intensimetric p-v technique.
The overall agreement is rather satisfactory, considering that the two methods differ as regards (1) the type of probe detector, (2) the physical quantities used to calibrate the stimuli, (3) those used to extract the DPOAE signal emitted by the eardrum, and (4) the probe insertion depth in the ear canal. Moreover, the higher order correction factors of Eq. (4) have been neglected in this analysis.
A severe limitation of the intensimetric setup of this study is clearly related to the relatively low SNR of both pressure (particularly if compared with a state-of-the-art low-noise OAE microphone, such as that used in this study) and velocity measurements performed with currently available intensimetric commercial probes, which are typically used in industrial contexts, and are therefore not optimized yet for the faint OAE signal levels. The encouraging results of this study should motivate further technological development of low-noise miniaturized intensimetric probes. Lower-noise probes would be indeed necessary for effectively using the proposed method in OAE clinical applications, which, in hearing-impaired subjects, require detection of very weak signals. The proposed technique has been tested first on DPOAEs, because of their simple acquisition technique and high SNR, but it can be straightforwardly applied to SFOAE and transient-evoked OAE (TEOAE) acquisition. In the SFOAE acquisition using the suppressor technique, the two input signals to be calibrated at the eardrum would be the probe and suppressor active intensity, whereas the residual emitted intensity would represent the SFOAE response. In the TEOAE case, equalization of the stimulus active intensity could be performed in the frequency domain as for SFOAEs and DPOAEs, and the actual calibrated transient stimulus (click, tone burst, or fast chirp) voltage to be fed to the loudspeaker would be obtained by FT. The TEOAE response would be recorded as two separate p(t) and v(t) signals, from which the TEOAE intensity emitted by the eardrum would be computed in the frequency domain.
5. Conclusions
Intensimetric techniques are used for the first time to measure high-resolution DPOAE spectra, with model-independent calibration of the transmitted power to the middle ear and reconstruction of the DPOAE power emitted by the eardrum. Despite the high noise floor of the intensimetric method, the good agreement with the results obtained with state-of-the-art conventional techniques, involving pressure measurements only, and elaborate Thévenin calibration methods, suggests that the proposed methodology could be fruitfully applied to the objective diagnostics of the cochlear function, once lower-noise and smaller-size intensimetric probes become available.