In a scheme for the mechanical recognition of speakers, it is desirable to use acoustic parameters that are closely related to voice characteristics that distinguish speakers. This paper describes an investigation of an efficient approach to selecting such parameters, which are movitated by known relations between the voice signal and vocal‐tract shapes and gestures. Rather than general measurements over the extent of an utterance, only significant features of selected segments are used. A simulation of a speaker recognition system was performed by manually locating speech events within utterances and using parameters measured at these locations to classify the speakers. Useful parameters were found in fundamental frequency, features of vowel and nasal consonant spectra, estimation of glottal source spectrum slope, word duration, and voice onset time. These parameters were tested in speaker recognition paradigms using simple linear classification procedures. When only 17 such parameters were used, no errors were made in speaker identification from a set of 21 adult male speakers. Under the same conditions, speaker verification errors of the order of 2% were also obtained.
Skip Nav Destination
Article navigation
June 1972
June 01 1972
Efficient Acoustic Parameters for Speaker Recognition Available to Purchase
Jared J. Wolf
Jared J. Wolf
Bolt Beranek and Newman, Incorporated, Cambridge, Massachusetts 02138
Search for other works by this author on:
Jared J. Wolf
Bolt Beranek and Newman, Incorporated, Cambridge, Massachusetts 02138
J. Acoust. Soc. Am. 51, 2044–2056 (1972)
Article history
Received:
March 30 1970
Citation
Jared J. Wolf; Efficient Acoustic Parameters for Speaker Recognition. J. Acoust. Soc. Am. 1 June 1972; 51 (6B): 2044–2056. https://doi.org/10.1121/1.1913065
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Drawer-like tunable ventilated sound barrier
Yong Ge, Yi-jun Guan, et al.
Related Content
Relational spectral features for place of articulation in nasal consonants
J. Acoust. Soc. Am. (January 1990)
Use of vowel duration information in a large vocabulary word recognizer
J. Acoust. Soc. Am. (August 1989)
The Perception of Speech and Its Relation to Telephony
J. Acoust. Soc. Am. (March 1950)