To scrutinize the binaural contribution to speech-in-noise reception, four groups of elderly participants with or without audiometric asymmetry <2 kHz and with or without near-normal binaural intelligibility level difference (BILD) completed tests of monaural and binaural phase sensitivity as well as cognitive function. Groups did not differ in age, overall degree of hearing loss, or cognitive function. Analyses revealed an influence of BILD status but not audiometric asymmetry on monaural phase sensitivity, strong correlations between monaural and binaural detection thresholds, and monaural and binaural but not cognitive BILD contributions. Furthermore, the N0Sπ threshold at 500 Hz predicted BILD performance effectively.

Although hearing-impaired listeners can differ substantially in terms of their speech-in-noise abilities, the responsible factors are yet to be fully understood (e.g., Dillon, 2012). In principle, monaural, binaural, and cognitive factors can all play a role. More recently, sensitivity to phase (or temporal fine structure) information has emerged as a promising predictor of speech-in-noise abilities (e.g., Strelcyk and Dau, 2009; Santurette and Dau, 2012). Phase information is encoded monaurally and then transmitted to the auditory brainstem where the two ear signals are combined. Good monaural coding fidelity is required for an accurate representation of binaural phase information, and some studies indicate that this underlies the ability to hear out speech against spatially separated noise or speech maskers (e.g., Strelcyk and Dau, 2009; Neher et al., 2012). The binaural contribution to speech reception in noise, which depends on a processing mechanism below ∼1.5 kHz, can be assessed using the binaural intelligibility level difference (BILD) measure (e.g., Kollmeier et al., 1990). Interaural audiometric differences have been related to impaired binaural hearing abilities (Jerger et al., 1984), suggesting that ear asymmetries in monaural coding fidelity affect this mechanism. Relations between cognitive abilities and speech reception in spatially complex situations have also been reported (e.g., Neher et al., 2012).

A better understanding of the factors involved in binaural speech-in-noise reception could promote individually tailored diagnostics and treatments (e.g., with hearing devices). However, this requires good experimental control over the factors of interest. In an effort to accomplish this, the current study recruited four groups of elderly participants with or without near-normal BILD and with or without audiometric asymmetry <2 kHz that were matched in terms of age and overall degree of hearing loss. Using measures of monaural and binaural phase sensitivity as well as cognitive function, these groups were then characterized further to shed more light on the underlying processes.

The current study was approved by the ethics committee of the University of Oldenburg. All participants provided written informed consent and received financial compensation.

Initially, 77 sensorineurally hearing-impaired participants aged 61–85 with a large spread in audiometric asymmetry <2 kHz were recruited. All of them reported normal or corrected-to-normal vision, and all but six of them were bilateral hearing aid users. Pure-tone average hearing loss as calculated across 0.5, 1, 2, and 4 kHz as well as left and right ears (PTA4) ranged from 34 to 69 dB hearing level (HL) (mean: 53 dB HL). Low-frequency (LF) pure-tone average hearing loss as calculated across 0.125, 0.25, 0.5, 0.75, 1, and 1.5 kHz as well as left and right ears (PTALF) ranged from 19 to 69 dB HL (mean: 42 dB HL). The absolute difference across left and right ears (ΔLR) in PTALF (PTALFΔLR) ranged from 0 to 41 dB (mean: 12 dB). (Results would be very similar if PTALF was calculated across 0.125, 0.25, 0.5, and 1 kHz.) Following the BILD measurements, 40 of these participants were tested further.

Testing was carried out in a well-lit soundproof booth. A computer screen was used for displaying the user interfaces and visual stimuli. Audio playback was via an Auritec (Hamburg, Germany) Earbox Highpower soundcard and a pair of Sennheiser (Wennebostel, Germany) HDA200 headphones. The BILD and psychoacoustic stimuli were spectrally shaped according to the “National Acoustic Laboratories-Revised Profound” prescription rule (National Acoustic Laboratories, Sydney, Australia; Dillon, 2012) to ensure audibility similar to that provided by many clinical hearing aid fittings.

To quantify BILD performance, 50%-correct speech reception thresholds (SRTs) were measured. Using free-field head-related impulse responses (Gardner and Martin, 1994), the speech, S, was simulated to come from 0° and the noise, N, from 90° (S0N90) or 270° (S0N270) azimuth. Stimuli were presented either binaurally or monaurally to the ear opposite N. The Oldenburg sentence material (Wagener et al., 1999) was used for S and stationary speech-shaped noise for N. N was calibrated to a nominal sound pressure level (SPL) of 65 dB in the 0° direction. The level of S was varied adaptively (nominal starting level: 68 dB SPL). Initially, three training runs were carried out, followed by one test run per condition in randomized order. Each time, a test list consisting of 20 five-word sentences was used. Following Kollmeier et al. (1990), the BILD was obtained by taking the difference between the binaural and monaural SRTs per spatial configuration (S0N90 or S0N270), yielding the change in signal-to-noise ratio (SNR) due to binaural interaction. For the analyses, the two resultant BILD estimates were averaged. Typically, normal-hearing listeners achieve BILDs of ∼4 dB (Kollmeier et al., 1990; Santurette and Dau, 2012).

Sensitivity to phase information in the presence of noise was assessed using monaural random frequency modulation detection (RFMD) and binaural masking level difference (BMLD) measurements. A 3-interval 3-alternative forced-choice paradigm coupled with a 1-up 2-down procedure was used. Intervals were 500 ms long, included 25-ms raised-cosine ramps, and were separated by 333 ms of silence. On each trial, one randomly chosen interval contained the target stimulus and the other two intervals the reference stimulus. Following a training run, two measurements (test, retest) were performed per condition. A measurement was terminated after ten reversals and the threshold estimated by taking the geometric mean of the last six reversal points.

The RFMD measurements mimicked those of Kortlang et al. (2016). They were performed at test frequencies, fc, of 0.5 and 1 kHz. As maskers, 8-equivalent-rectangular-bandwidths-wide (Glasberg and Moore, 1990) Gaussian noises centered at fc were used. The tones and noises were presented at 62 and 65 dB SPL (nominal), respectively. Random amplitude modulation (AM) with a root-mean-square (RMS) depth of −12 dB was applied to each tone. The reference stimuli contained a tone without frequency modulation (FM) but with AM. The target stimuli contained a tone with AM and FM. In the adaptive procedure, the RMS frequency excursion from fc was varied (starting value: 30% of fc). Multiplicative step sizes of 2, 1.5, and 1.25 were used, and the step size was decreased after an upper reversal.

The BMLD measurements were also performed at 0.5 and 1 kHz with noise signals essentially identical to those used for the RFMD measurements. The noise, N, and tone, S, were either presented diotically (N0S0) or N was presented diotically and S with an interaural phase shift of 180° (N0Sπ). The reference stimuli contained only N and the target stimuli both S and N. In the adaptive procedure, the level of S was varied. The starting SNR was 0 dB. Additive step sizes of 6, 3, and 1 dB were used, and the step size was decreased after an upper reversal. The BMLD was obtained by taking the difference between the N0S0 and N0Sπ thresholds.

To check for any top–down influences on BILD performance, two visual cognitive measures were included: A reading span test (Carroll et al., 2015) and a “distractibility” test (Zimmermann and Fimm, 2012). The reading span test measures the ability (in %-correct) to recall a series of first and final sentence words. The (nonverbal) distractibility test measures the change in response time (in milliseconds) to target stimuli due to preceding distractors. Both measures are described in detail in Neher (2014).

The BILD data ranged from −0.4 to 5.2 dB (mean: 2.6 dB). They were only weakly correlated with age (Spearman's correlation coefficient, ρ = −0.30, p < 0.009), PTA4 (ρ = −0.24, p < 0.036), and PTALF (ρ < −0.42, p < 0.001). For further testing, four subgroups were defined. In doing so, the aim was to (1) achieve orthogonal variation in BILD and PTALFΔLR, (2) control for age and PTA4, and (3) obtain a reasonably large sample size of N = 4 × 10. This approach maximized the experimental contrast within and between BILD and PTALFΔLR for the available sample, under the constraint that the subgroups were not allowed to differ in age or PTA4. This resulted in two subgroups with “near-normal” mean BILDs of ∼3.7 dB and two subgroups with “abnormal” mean BILDs of 1.5 dB (see Table 1 and Fig. 1). Furthermore, two subgroups had a “symmetrical” mean PTALFΔLR of 3 dB and the other two an “asymmetrical” mean PTALFΔLR of ∼24 dB.

Table 1.

Means (and ranges) for the age, PTA4, PTALFΔLR, and BILD data of each subgroup.

SubgroupNAge (yr)PTA4 (dB HL)PTALFΔLR (dB)BILD (dB)
PTALF symmetrical 10 74 52 3.8 
BILD near-normal  (63, 80) (46, 59) (1, 6) (2.9, 5.2) 
PTALF symmetrical 10 75 52 1.5 
BILD abnormal  (70, 80) (45, 61) (0, 6) (0.2, 2.3) 
PTALF asymmetrical 10 70 49 23 3.5 
BILD near-normal  (62, 75) (35, 57) (15, 35) (2.6, 4.7) 
PTALF asymmetrical 10 75 56 24 1.5 
BILD abnormal  (67, 80) (49, 58) (17, 39) (−0.4, 2.5) 
SubgroupNAge (yr)PTA4 (dB HL)PTALFΔLR (dB)BILD (dB)
PTALF symmetrical 10 74 52 3.8 
BILD near-normal  (63, 80) (46, 59) (1, 6) (2.9, 5.2) 
PTALF symmetrical 10 75 52 1.5 
BILD abnormal  (70, 80) (45, 61) (0, 6) (0.2, 2.3) 
PTALF asymmetrical 10 70 49 23 3.5 
BILD near-normal  (62, 75) (35, 57) (15, 35) (2.6, 4.7) 
PTALF asymmetrical 10 75 56 24 1.5 
BILD abnormal  (67, 80) (49, 58) (17, 39) (−0.4, 2.5) 
Fig. 1.

Boxplots of the RFMD, N0S0, N0Sπ, and BMLD data of the four participant subgroups averaged across 0.5 and 1 kHz (*p < 0.05, **p < 0.01, ***p < 0.001). Also shown are the BILD and PTALFΔLR data. Differences among subgroups with different BILD or PTALFΔLR status were all significant at p < 0.0001.

Fig. 1.

Boxplots of the RFMD, N0S0, N0Sπ, and BMLD data of the four participant subgroups averaged across 0.5 and 1 kHz (*p < 0.05, **p < 0.01, ***p < 0.001). Also shown are the BILD and PTALFΔLR data. Differences among subgroups with different BILD or PTALFΔLR status were all significant at p < 0.0001.

Close modal

Out of 640 measured thresholds, three RFMD thresholds and one N0S0 threshold from four different listeners were discarded due to large tracking excursions around the threshold estimates. Reliability of the remaining data was very good, as indicated by strong test-retest correlations (all ρ > 0.88, all p < 0.00001). For all further analyses, either the geometric mean of the test–retest measurements or the single remaining threshold was used. Furthermore, the RFMD thresholds were averaged across ears (interaural asymmetry in RFMD was also considered but not found to be predictive). Analyses revealed an influence of BILD status on many of the psychoacoustic measurements, with the near-normal subgroups outperforming the abnormal subgroups. In contrast, an influence of PTALFΔLR status was only evident in the BMLD data. Results were similar across the two test frequencies. The differential effects of BILD and PTALFΔLR status are evident from Fig. 1, which shows the RFMD, N0S0, N0Sπ, and BMLD data of the four subgroups averaged across 0.5 and 1 kHz together with statistical results.

Regarding the cognitive data, the subgroups had mean recall performances of 31% to 44% correct on the reading span task and mean response time changes of 4 to 46 ms on the distractibility task. Subgroup status did not affect the mean scores (both χ2(3) < 2.5, both p > 0.4).

To examine potential relations among age, hearing threshold levels (HTLs), and the psychoacoustic and cognitive measures, a correlation analysis was performed. Following Bonferroni (N = 44) correction, no correlations with age or the cognitive measures were found (all ρ < |0.27|, all p > 0.05). Table 2 shows the correlations among the psychoacoustic measures and HTLs. It is noteworthy that the RFMD thresholds were only moderately correlated with HTLs, strongly correlated with the N0S0 and N0Sπ thresholds, and not correlated with the BMLDs. Correlations between BILD performance and the various predictors were also examined. Following Bonferroni (N = 13) correction, no correlations with the cognitive measures were found (both ρ < |0.25|, both p > 0.05). The strongest correlations emerged with N0Sπ, BMLD, and RFMD at 500 Hz (Table 3).

Table 2.

Spearman's correlation coefficients for the psychoacoustic and HTL data (top cell entries: 0.5 kHz; bottom cell entries: 1 kHz). *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001, *****p < 0.00001 after Bonferroni correction.

HTLRFMDN0S0N0SπBMLD
RFMD 0.52*    
0.53*     
N0S0 0.49 0.68****   
0.53* 0.76*****    
N0Sπ 0.69**** 0.65*** 0.61**  
0.45 0.58** 0.67****   
BMLD −0.58** −0.39 −0.14 −0.84***** 
−0.28 −0.24 −0.15 −0.80*****  
HTLRFMDN0S0N0SπBMLD
RFMD 0.52*    
0.53*     
N0S0 0.49 0.68****   
0.53* 0.76*****    
N0Sπ 0.69**** 0.65*** 0.61**  
0.45 0.58** 0.67****   
BMLD −0.58** −0.39 −0.14 −0.84***** 
−0.28 −0.24 −0.15 −0.80*****  
Table 3.

Spearman's correlation coefficients for the BILD, age, PTA4, PTALF, and psychoacoustic data (top cell entries: 0.5 kHz; bottom cell entries: 1 kHz). *p < 0.05, **p < 0.01, ***p < 0.001, *****p < 0.00001 after Bonferroni correction.

AgePTA4PTALFRFMDN0S0N0SπBMLD
BILD −0.38 −0.29 −0.45* −0.53** −0.46* −0.72***** 0.59*** 
    −0.25 −0.28 −0.59*** 0.52** 
AgePTA4PTALFRFMDN0S0N0SπBMLD
BILD −0.38 −0.29 −0.45* −0.53** −0.46* −0.72***** 0.59*** 
    −0.25 −0.28 −0.59*** 0.52** 

To test for independent BILD contributions of these predictors, linear regression analyses were carried out. For reasons of statistical rigor, maximally four predictors were used, and age and PTALF were always controlled for. The most predictive model included N0Sπ at 500 Hz (R2 = 52%; p < 0.0001), age (R2 = 8%; p = 0.010), and PTALF (R2 = 0%; p > 0.6). The second most predictive model included BMLD at 500 Hz (R2 = 39%; p = 0.005), age (R2 = 10%; p = 0.010), RFMD at 500 Hz (R2 = 7%; p = 0.034), and PTALF (R2 = 0%; p > 0.8). Analyses of the residuals revealed that both models satisfied the requirements for linearity and normality.

Consistent with earlier findings (Kollmeier et al., 1990; Santurette and Dau, 2012), inter-individual BILD differences were large (>5 dB). Standard audiological measures were ineffective predictors of these differences, as reflected by the relatively weak correlations with age, PTA4, and PTALF (Sec. 3.1) and the fact that it was possible to define four subgroups with marked BILD differences that were independent of the effects of PTALFΔLR, age, and PTA4 (Table 1). For auditory profiling purposes, a measure of binaural processing abilities in noise thus appears to be informative. Because the N0Sπ threshold at 500 Hz could predict the BILD effectively (Sec. 3.3), it constitutes a suitable (and time-efficient) candidate for this.

The moderate correlations between the RFMD and HTL data (Table 2) suggest that the RFMD measure captures additional supra-threshold abilities. The fact that there was an influence of BILD status but not audiometric asymmetry on the RFMD thresholds (Fig. 1) also supports this. The strong correlations between the RFMD and binaural detection thresholds (Table 2) are consistent with the view that monaural phase sensitivity facilitates binaural processing abilities (e.g., Strelcyk and Dau, 2009). Interestingly, however, the RFMD thresholds and BMLDs were not correlated (Table 2) and contributed separately to the BILD prediction (Sec. 3.3), suggesting that both monaural and binaural factors play a role for binaural squelch abilities. In contrast, cognitive factors were unrelated to BILD performance, consistent with previous findings (Santurette and Dau, 2012).

Together, these results provide handles for characterizing, modeling, and compensating individual speech-in-noise deficits, and research is underway to address these issues.

The author thanks the Hörzentrum Oldenburg GmbH for recruiting the participants and performing the measurements, and Steffen Kortlang for providing the RFMD code. This research was funded by the DFG Cluster of Excellence EXC 1077/1 “Hearing4all” and by Phonak AG, Switzerland. Parts of it were presented at the 2016 International Hearing Aid Research Conference, Lake Tahoe, CA, August 10–14, 2016.

1.
Carroll
,
R.
,
Meis
,
M.
,
Schulte
,
M.
,
Vormann
,
M.
,
Kiessling
,
J.
, and
Meister
,
H.
(
2015
). “
Development of a German reading span test with dual task design for application in cognitive hearing research
,”
Int. J. Audiol.
54
,
136
141
.
2.
Dillon
,
H.
(
2012
).
Hearing Aids
(
Boomerang Press
,
Sydney, Australia
).
3.
Gardner
,
B.
, and
Martin
,
K.
(
1994
). “
HRTF measurements of a KEMAR dummy-head microphone
,” Massachusetts Institute of Technology Media Lab Perceptual Computing, Cambridge, MA.
4.
Glasberg
,
B. R.
, and
Moore
,
B. C. J.
(
1990
). “
Derivation of auditory filter shapes from notched-noise data
,”
Hear. Res.
47
,
103
138
.
5.
Jerger
,
J.
,
Brown
,
D.
, and
Smith
,
S.
(
1984
). “
Effect of peripheral hearing loss on the masking level difference
,”
Arch. Otolaryngol.
110
,
290
296
.
6.
Kollmeier
,
B.
,
Müller
,
C.
,
Kinkel
,
M.
, and
Eysholdt
,
U.
(
1990
). “
Binaural hearing in impaired listeners: Speech intelligibility under spatial conditions
,”
Audiologische Akustik (Audiol. Acoust.)
29
,
76
84
.
7.
Kortlang
,
S.
,
Mauermann
,
M.
, and
Ewert
,
S. D.
(
2016
). “
Suprathreshold auditory processing deficits in noise: Effects of hearing loss and age
,”
Hear. Res.
331
,
27
40
.
8.
Neher
,
T.
(
2014
). “
Relating hearing loss and executive functions to hearing aid users' preference for, and speech recognition with, different combinations of binaural noise reduction and microphone directionality
,”
Front. Neurosci.
8
,
391
.
9.
Neher
,
T.
,
Lunner
,
T.
,
Hopkins
,
K.
, and
Moore
,
B. C. J.
(
2012
). “
Binaural temporal fine structure sensitivity, cognitive function, and spatial speech recognition of hearing-impaired listeners (L)
,”
J. Acoust. Soc. Am.
131
,
2561
2564
.
10.
Santurette
,
S.
, and
Dau
,
T.
(
2012
). “
Relating binaural pitch perception to the individual listener's auditory profile
,”
J. Acoust. Soc. Am.
131
,
2968
2986
.
11.
Strelcyk
,
O.
, and
Dau
,
T.
(
2009
). “
Relations between frequency selectivity, temporal fine-structure processing, and speech reception in impaired hearing
,”
J. Acoust. Soc. Am.
125
,
3328
3345
.
12.
Wagener
,
K.
,
Brand
,
T.
, and
Kollmeier
,
B.
(
1999
). “
Development and evaluation of a sentence test for the German language. I-III: Design, optimization and evaluation of the Oldenburg sentence test
,”
Zeitschrift für Audiologie (Audiol. Acoust.)
38
,
4
15
, 44–56, 86–95.
13.
Zimmermann
,
P.
, and
Fimm
,
B.
(
2012
).
Testbatterie zur Aufmerksamkeitsprüfung—Version Mobilität (Test Battery for the Assessment of Attentional Skills—Mobility Version
) (
Psytest
,
Herzogenrath, Germany
).