The ability to recognize speech involves sensory, perceptual, and cognitive processes. For much of the history of speech perception research, investigators have focused on the first and third of these, asking how much and what kinds of sensory information are used by normal and impaired listeners, as well as how effective amounts of that information are altered by “top-down” cognitive processes. This experiment focused on perceptual processes, asking what accounts for how the sensory information in the speech signal gets organized. Two types of speech signals processed to remove properties that could be considered traditional acoustic cues (amplitude envelopes and sine wave replicas) were presented to 100 listeners in five groups: native English-speaking (L1) adults, 7-, 5-, and 3-year-olds, and native Mandarin-speaking adults who were excellent second-language (L2) users of English. The L2 adults performed more poorly than L1 adults with both kinds of signals. Children performed more poorly than L1 adults but showed disproportionately better performance for the sine waves than for the amplitude envelopes compared to both groups of adults. Sentence context had similar effects across groups, so variability in recognition was attributed to differences in perceptual organization of the sensory information, presumed to arise from native language experience.

1.
Abramson
,
A. S.
, and
Lisker
,
L.
(
1967
). “
Discriminability along the voicing continuum: Cross-language tests
,” in
Proceedings of the Sixth International Congress of Phonetic Sciences
, Prague, pp.
569
573
.
2.
Beddor
,
P. S.
, and
Strange
,
W.
(
1982
). “
Cross-language study of perception of the oral-nasal distinction
,”
J. Acoust. Soc. Am.
71
,
1551
1561
.
3.
Benkí
,
J. R.
(
2003
). “
Quantitative evaluation of lexical status, word frequency, and neighborhood density as context effects in spoken word recognition
,”
J. Acoust. Soc. Am.
113
,
1689
1705
.
4.
Best
,
C. T.
,
Studdert-Kennedy
,
M.
,
Manuel
,
S.
, and
Rubin-Spitz
,
J.
(
1989
). “
Discovering phonetic coherence in acoustic patterns
,”
Percept. Psychophys.
45
,
237
250
.
5.
Binder
,
J. R.
,
Liebenthal
,
E.
,
Possing
,
E. T.
,
Medler
,
D. A.
, and
Ward
,
B. D.
(
2004
). “
Neural correlates of sensory and decision processes in auditory object identification
,”
Nat. Neurosci.
7
,
295
301
.
6.
Boothroyd
,
A.
(
1968
). “
Statistical theory of the speech discrimination score
,”
J. Acoust. Soc. Am.
43
,
362
367
.
7.
Boothroyd
,
A.
(
1985
). “
Evaluation of speech production of the hearing impaired: Some benefits of forced-choice testing
,”
J. Speech Hear. Res.
28
,
185
196
.
8.
Boothroyd
,
A.
, and
Nittrouer
,
S.
(
1988
). “
Mathematical treatment of context effects in phoneme and word recognition
,”
J. Acoust. Soc. Am.
84
,
101
114
.
9.
Boysson-Bardies
,
B.
,
de Sagart
,
L.
,
Halle
,
P.
, and
Durand
,
C.
(
1986
). “
Acoustic investigations of cross-linguistic variability in babbling
,” in
Precursors of Early Speech
, edited by
B.
Lindblom
and
R.
Zetterstrom
, (
Stockton
,
New York
), pp.
113
126
.
10.
Bradlow
,
A. R.
, and
Alexander
,
J. A.
(
2007
). “
Semantic and phonetic enhancements for speech-in-noise recognition by native and non-native listeners
,”
J. Acoust. Soc. Am.
121
,
2339
2349
.
11.
Bregman
,
A. S.
(
1990
).
Auditory Scene Analysis
(
MIT
,
Cambridge, MA
).
12.
Broadbent
,
D. E.
(
1967
). “
Word-frequency effect and response bias
,”
Psychol. Rev.
74
,
1
15
.
13.
Crowther
,
C. S.
, and
Mann
,
V.
(
1994
). “
Use of vocalic cues to consonant voicing and native language background: The influence of experimental design
,”
Percept. Psychophys.
55
,
513
525
.
14.
Dau
,
T.
,
Ewert
,
S.
, and
Oxenham
,
A. J.
(
2009
). “
Auditory stream formation affects comodulation masking release retroactively
,”
J. Acoust. Soc. Am.
125
,
2182
2188
.
15.
Dorman
,
M. F.
,
Lindholm
,
J. M.
, and
Hannley
,
M. T.
(
1985
). “
Influence of the first formant on the recognition of voiced stop consonants by hearing-impaired listeners
,”
J. Speech Hear. Res.
28
,
377
380
.
16.
Duffy
,
J. R.
, and
Giolas
,
T. G.
(
1974
). “
Sentence intelligibility as a function of key word selection
,”
J. Speech Hear. Res.
17
,
631
637
.
17.
Eisenberg
,
L. S.
,
Shannon
,
R. V.
,
Schaefer Martinez
,
A.
,
Wygonski
,
J.
, and
Boothroyd
,
A.
(
2000
). “
Speech recognition with reduced spectral cues as a function of age
,”
J. Acoust. Soc. Am.
107
,
2704
2710
.
18.
Elliott
,
L. L.
(
1979
). “
Performance of children aged 9 to 17 years on a test of speech intelligibility in noise using sentence material with controlled word predictability
,”
J. Acoust. Soc. Am.
66
,
651
653
.
19.
Erber
,
N. P.
(
1971
). “
Evaluation of special hearing aids for deaf children
,”
J. Speech Hear Disord.
36
,
527
537
.
20.
Flege
,
J. E.
, and
Port
,
R.
(
1981
). “
Cross-language phonetic interference: Arabic to English
,”
Lang Speech
24
,
125
146
.
21.
Flege
,
J. E.
,
Schmidt
,
A. M.
, and
Wharton
,
G.
(
1996
). “
Age of learning affects rate-dependent processing of stops in a second language
,”
Phonetica
53
,
143
161
.
22.
Fu
,
Q.
,
Zeng
,
F. G.
,
Shannon
,
R. V.
, and
Soli
,
S. D.
(
1998
). “
Importance of tonal envelope cues in Chinese speech recognition
,”
J. Acoust. Soc. Am.
104
,
505
510
.
23.
Giolas
,
T. G.
,
Cooker
,
H. S.
, and
Duffy
,
J. R.
(
1970
). “
The predictability of words in sentences
,”
J. Aud Res.
10
,
328
334
.
24.
Goldman
,
R.
, and
Fristoe
,
M.
(
2000
).
Goldman Fristoe 2: Test of Articulation
(
American Guidance Service, Inc.
,
Circle Pines, MN
).
25.
Gottfried
,
T. L.
, and
Beddor
,
P. S.
(
1988
). “
Perception of temporal and spectral information in French vowels
,”
Lang Speech
31
,
57
75
.
26.
Griffiths
,
T. D.
, and
Warren
,
J. D.
(
2004
). “
What is an auditory object?
,”
Nat. Rev. Neurosci.
5
,
887
892
.
27.
Howes
,
D.
(
1957
). “
On the relationship between the intelligibility and frequency of occurrence of English words
,”
J. Acoust. Soc. Am.
29
,
296
307
.
28.
Johnson
,
J. S.
, and
Newport
,
E. L.
(
1989
). “
Critical period effects in second language learning: The influence of maturational state on the acquisition of English as a second language
,”
Appl. Cognit. Psychol.
21
,
60
99
.
29.
Kalikow
,
D. N.
,
Stevens
,
K. N.
, and
Elliott
,
L. L.
(
1977
). “
Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability
,”
J. Acoust. Soc. Am.
61
,
1337
1351
.
30.
Leek
,
M. R.
,
Dorman
,
M. F.
, and
Summerfield
,
Q.
(
1987
). “
Minimum spectral contrast for vowel identification by normal-hearing and hearing-impaired listeners
,”
J. Acoust. Soc. Am.
81
,
148
154
.
31.
Liberman
,
A. M.
,
Cooper
,
F. S.
,
Harris
,
K. S.
, and
MacNeilage
,
P. F.
(
1962
). “
A motor theory of speech perception
,” in
Proceedings of the Speech Communication Seminar
, Stockholm, pp.
1
12
.
32.
Lotto
,
A. J.
,
Hickok
,
G. S.
, and
Holt
,
L. L.
(
2009
). “
Reflections on mirror neurons and speech perception
,”
Trends Cogn. Sci.
13
,
110
114
.
33.
MacKain
,
K. S.
,
Best
,
C. T.
, and
Strange
,
W.
(
1981
). “
Categorical perception of English /r/ and /l/ by Japanese bilinguals
,”
Appl. Psycholinguist.
2
,
369
390
.
34.
Mayo
,
C.
,
Scobbie
,
J. M.
,
Hewlett
,
N.
, and
Waters
,
D.
(
2003
). “
The influence of phonemic awareness development on acoustic cue weighting strategies in children’s speech perception
,”
J. Speech Lang. Hear. Res.
46
,
1184
1196
.
35.
Mayo
,
L. H.
,
Florentine
,
M.
, and
Buus
,
S.
(
1997
). “
Age of second-language acquisition and perception of speech in noise
,”
J. Speech Lang. Hear. Res.
40
,
686
693
.
36.
Miller
,
G. A.
,
Heise
,
G. A.
, and
Lichten
,
W.
(
1951
). “
The intelligibility of speech as a function of the context of the test materials
,”
J. Exp. Psychol.
41
,
329
335
.
37.
Nilsson
,
M.
,
Soli
,
S. D.
, and
Gelnett
,
D. J.
(
1996
).
Development and Norming of a Hearing in Noise Test for Children
(
House Ear Institute
,
Los Angeles, CA
).
38.
Nilsson
,
M.
,
Soli
,
S. D.
, and
Sullivan
,
J. A.
(
1994
). “
Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise
,”
J. Acoust. Soc. Am.
95
,
1085
1099
.
39.
Nittrouer
,
S.
(
1992
). “
Age-related differences in perceptual effects of formant transitions within syllables and across syllable boundaries
,”
J. Phonetics
20
,
351
382
.
40.
Nittrouer
,
S.
, and
Boothroyd
,
A.
(
1990
). “
Context effects in phoneme and word recognition by young children and older adults
,”
J. Acoust. Soc. Am.
87
,
2705
2715
.
41.
Nittrouer
,
S.
, and
Chapman
,
C.
(
2009
). “
The effects of bilateral electric and bimodal electric-acoustic stimulation on language development
,”
Trends Amplif.
13
,
190
205
.
42.
Nittrouer
,
S.
, and
Lowenstein
,
J. H.
(
2009
). “
Does harmonicity explain children’s cue weighting of fricative-vowel syllables?
,”
J. Acoust. Soc. Am.
125
,
1679
1692
.
43.
Nittrouer
,
S.
,
Lowenstein
,
J. H.
, and
Packer
,
R.
(
2009
). “
Children discover the spectral skeletons in their native language before the amplitude envelopes
,”
J. Exp. Psychol. Hum. Percept. Perform.
35
,
1245
1253
.
44.
Nittrouer
,
S.
, and
Miller
,
M. E.
(
1997a
). “
Predicting developmental shifts in perceptual weighting schemes
,”
J. Acoust. Soc. Am.
101
,
2253
2266
.
45.
Nittrouer
,
S.
, and
Miller
,
M. E.
(
1997b
). “
Developmental weighting shifts for noise components of fricative-vowel syllables
,”
J. Acoust. Soc. Am.
102
,
572
580
.
46.
Nittrouer
,
S.
, and
Studdert-Kennedy
,
M.
(
1987
). “
The role of coarticulatory effects in the perception of fricatives by children and adults
,”
J. Speech Hear. Res.
30
,
319
329
.
47.
Remez
,
R. E.
,
Rubin
,
P. E.
,
Berns
,
S. M.
,
Pardo
,
J. S.
, and
Lang
,
J. M.
(
1994
). “
On the perceptual organization of speech
,”
Psychol. Rev.
101
,
129
156
.
48.
Remez
,
R. E.
,
Rubin
,
P. E.
,
Pisoni
,
D. B.
, and
Carrell
,
T. D.
(
1981
). “
Speech perception without traditional speech cues
,”
Science
212
,
947
949
.
49.
Repp
,
B. H.
(
1982
). “
Phonetic trading relations and context effects: New experimental evidence for a speech mode of perception
,”
Psychol. Bull.
92
,
81
110
.
50.
Revoile
,
S. G.
,
Holden-Pitt
,
L.
, and
Pickett
,
J. M.
(
1985
). “
Perceptual cues to the voiced-voiceless distinction of final fricatives for listeners with impaired or with normal hearing
,”
J. Acoust. Soc. Am.
77
,
1263
1265
.
51.
Rosenzweig
,
M. R.
, and
Postman
,
L.
(
1957
). “
Intelligibility as a function of frequency of usage
,”
J. Exp. Psychol.
54
,
412
422
.
52.
Shannon
,
R. V.
,
Zeng
,
F. G.
,
Kamath
,
V.
,
Wygonski
,
J.
, and
Ekelid
,
M.
(
1995
). “
Speech recognition with primarily temporal cues
,”
Science
270
,
303
304
.
53.
Stelmachowicz
,
P. G.
,
Lewis
,
D. E.
,
Kelly
,
W. J.
, and
Jesteadt
,
W.
(
1990
). “
Speech perception in low-pass filtered noise for normal and hearing-impaired listeners
,”
J. Speech Hear. Res.
33
,
290
297
.
54.
Surprenant
,
A. M.
, and
Watson
,
C. S.
(
2001
). “
Individual differences in the processing of speech and nonspeech sounds by normal-hearing listeners
,”
J. Acoust. Soc. Am.
110
,
2085
2095
.
55.
Sussman
,
J. E.
(
1993
). “
Auditory processing in children’s speech perception: Results of selective adaptation and discrimination tasks
,”
J. Speech Hear. Res.
36
,
380
395
.
56.
Uchanski
,
R. M.
,
Davidson
,
L. S.
,
Quadrizius
,
S.
,
Reeder
,
R.
,
Cadieux
,
J.
,
Kettel
,
J.
, and
Chole
,
R. A.
(
2009
). “
Two ears and two (or more?) devices: A pediatric case study of bilateral profound hearing loss
,”
Trends Amplif.
13
,
107
123
.
57.
Van Engen
,
K. J.
, and
Bradlow
,
A. R.
(
2007
). “
Sentence recognition in native- and foreign-language multi-talker background noise
,”
J. Acoust. Soc. Am.
121
,
519
526
.
58.
Watson
,
J. M. M.
(
1997
). “
Sibilant-vowel coarticulation in the perception of speech by children with phonological disorder
,” Ph.D. thesis,
Queen Margaret College
, Edinburgh.
59.
Zampini
,
M. L.
,
Clarke
,
C.
, and
Green
,
K. P.
(
2000
). “
Language experience and the perception of stop consonant voicing in Spanish: The case of late English-Spanish bilinguals
,” in
Spanish Applied Linguistics at the Turn of the Millennium
, edited by
R. P.
Leow
and
C.
Sanz
, (
Cascadilla
,
Somerville, MA
), pp.
194
209
.
You do not currently have access to this content.