Changes in magnitude and variability of duration, fundamental frequency, formant frequencies, and spectral envelope of children’s speech are investigated as a function of age and gender using data obtained from 436 children, ages 5 to 17 years, and 56 adults. The results confirm that the reduction in magnitude and within-subject variability of both temporal and spectral acoustic parameters with age is a major trend associated with speech development in normal children. Between ages 9 and 12, both magnitude and variability of segmental durations decrease significantly and rapidly, converging to adult levels around age 12. Within-subject fundamental frequency and formant-frequency variability, however, may reach adult range about 2 or 3 years later. Differentiation of male and female fundamental frequency and formant frequency patterns begins at around age 11, becoming fully established around age 15. During that time period, changes in vowel formant frequencies of male speakers is approximately linear with age, while such a linear trend is less obvious for female speakers. These results support the hypothesis of uniform axial growth of the vocal tract for male speakers. The study also shows evidence for an apparent overshoot in acoustic parameter values, somewhere between ages 13 and 15, before converging to the canonical levels for adults. For instance, teenagers around age 14 differ from adults in that, on average, they show shorter segmental durations and exhibit less within-subject variability in durations, fundamental frequency, and spectral envelope measures.

1.
Crystal
,
T. H.
, and
House
,
A. S.
(
1988
). “
A note on the variability of timing control
,”
J. Speech Hear. Res.
31
,
497
502
.
2.
Davis
,
S.
, and
Mermelstein
,
P.
(
1980
). “
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
,”
IEEE Trans. Acoust., Speech, Signal Process.
28
(
4
),
357
366
.
3.
Eguchi
,
S.
, and
Hirsh
,
I. J.
(
1969
). “
Development of speech sounds in children
,”
Acta Oto-Laryngol. Suppl.
257
,
1
51
.
4.
Fant, G. (1975). “Non-uniform vowel normalization,” STL-QPSR 2-3/1975, 1–19.
5.
Goldstein, U. G. (1980). “An articulatory model for the vocal tracts of growing children,” Ph.D. thesis (MIT, Cambridge, MA).
6.
Hillenbrand
,
J.
,
Getty
,
L. A.
,
Clark
,
M. J.
, and
Wheeler
,
K.
(
1995
). “
Acoustic characteristics of American English vowels
,”
J. Acoust. Soc. Am.
97
,
3099
3111
.
7.
Hollien
,
H.
,
Green
,
R.
, and
Massey
,
K.
(
1994
). “
Longitudinal research on adolescent voice change in males
,”
J. Acoust. Soc. Am.
96
,
2646
2654
.
8.
Kent
,
R. D.
(
1976
). “
Anatomical and neuromuscular maturation of the speech mechanism: Evidence from acoustic study
,”
J. Speech Hear. Res.
19
,
421
445
.
9.
Kent
,
R. D.
, and
Forner
,
L. L.
(
1980
). “
Speech segment durations in sentence recitations by children and adults
,”
Journal of Phonetics
8
,
157
168
.
10.
Klatt
,
D. H.
(
1974
). “
The duration of /s/ in English words
,”
J. Speech Hear. Res.
17
,
51
63
.
11.
Ljolje, A., and Riley, M. D. (1991). “Automatic segmentation and labeling of speech,” Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (Toronto, Canada), pp. 473–476.
12.
Miller, J. D., Lee, S., Uchanski, R. M., Heidbreder, A. H., Richman, B. B., and Tadlock, J. (1996). “Creation of two children’s speech databases,” Proceedings of the ICASSP (Atlanta, GA), pp. 849–852.
13.
Palethorpe
,
S.
,
Wales
,
R.
,
Clark
,
J. E.
, and
Senserrick
,
T.
(
1996
). “
Vowel classification in children
,”
J. Acoust. Soc. Am.
100
,
3843
3851
.
14.
Peterson
,
G. E.
, and
Barney
,
H. L.
(
1952
). “
Control methods used in a study of the vowels
,”
J. Acoust. Soc. Am.
24
,
175
184
.
15.
Potamianos, A., Narayanan, S., and Lee, S. (1997). “Automatic speech recognition for children,” Proceedings of European Conference on Speech, Communication and Technology (Rhodes, Greece), pp. 2371–2734.
16.
Rabiner, L. R., and Juang, B-H. (1993). Fundamentals of Speech Recognition (Prentice-Hall, Englewood Cliffs, NJ).
17.
Secrest, B. G., and Doddington, G. R. (1983). “An integrated pitch tracking algorithm for speech systems,” Proceedings of the ICASSP (Boston, MA), pp. 1352–1355.
18.
Sharkey
,
S. G.
, and
Folkins
,
J. H.
(
1985
). “
Variability of lip and jaw movements in children and adult: Implications for the development of speech motor control
,”
J. Speech Hear. Res.
28
,
8
15
.
19.
Smith
,
B.
(
1978
). “
Temporal aspects of English speech production: A developmental perspective
,”
Journal of Phonetics
6
,
37
67
.
20.
Smith
,
B. L.
(
1992
). “
Relationships between duration and temporal variability in children’s speech
,”
J. Acoust. Soc. Am.
91
,
2165
2174
.
21.
Smith
,
B. L.
, and
Kenney
,
M. K.
(
1994
). “
Variability control in speech production tasks performed by adults and children
,”
J. Acoust. Soc. Am.
96
,
699
705
.
22.
Smith
,
B.
,
Kenney
,
M. K.
, and
Hussain
,
S.
(
1995
). “
A longitudinal investigation of duration and temporal variability in children’s speech production
,”
J. Acoust. Soc. Am.
99
,
2344
2349
.
23.
Smith
,
B. L.
, and
McLeane-Muse
,
A.
(
1986
). “
Articulatory movement characteristics of labial consonant productions by children and adults
,”
J. Acoust. Soc. Am.
80
,
1321
1328
.
24.
Stathopoulos
,
E. T.
(
1995
). “
Variability revisited: an acoustic, aerodynamic and respiratory kinematic comparison of children and adults during speech
,”
Journal of Phonetics
23
,
67
80
.
25.
Whalen
,
D.
, and
Levitt
,
A.
(
1995
). “
The universality of intrinsic F0 of vowels
,”
Journal of Phonetics
23
,
349
366
.
26.
Yang, C. S., and Kasuya, H. (1994). “Accurate measurement of vocal tract shapes from magnetic resonance images of child, female, and male subjects,” Proceedings of the International Conference on Speech Language Processing (Yokohama, Japan), pp. 623–626.
27.
Yang
,
C.-S.
, and
Kasuya
,
H.
(
1995
). “
Uniform and non-uniform normalization of vocal tracts measured by MRI across male, female and child subjects
,”
IEICE Trans. Inf. and Syst.
E78-D
, No 6,
732
737
.
This content is only available via PDF.
You do not currently have access to this content.