Existing pitch tracking algorithms are not precise nor reliable enough to be useful in feature‐based recognition systems. It is possible, however, to analyze the errors produced by a particular algorithm and then reduce these errors using post‐processing techniques. Error patterns were analyzed for a time‐domain pitch tracker [W. H. Tucker and R. H. T. Bates, IEEE Trans. ASSP‐26, 597–604 (1978)] and a post‐processing algorithm, based on artificial intelligence techniques, was written in order to eliminate errors. Performance was compared for the original and modified versions of the pitch tracker for a number of speakers using both isolated words and sentences. All types of errors were reduced by the post‐processing algorithm. Voiced‐voiceless decisions were performed with less than 1% error for 2080 letters produced by males and females. The fundamental frequency microstructure was tracked sufficiently well to be used in extracting phonetic features in a feature‐based recognition system.
Skip Nav Destination
Article navigation
November 1982
August 12 2005
A powerful post‐processing algorithm for time domain pitch trackers
Philippe Specker
Philippe Specker
Department of Computer Science, Carnegie‐Mellon University, Schenley Park, Pittsburgh, PA 15213
Search for other works by this author on:
Philippe Specker
Department of Computer Science, Carnegie‐Mellon University, Schenley Park, Pittsburgh, PA 15213
J. Acoust. Soc. Am. 72, S32 (1982)
Citation
Philippe Specker; A powerful post‐processing algorithm for time domain pitch trackers. J. Acoust. Soc. Am. 1 November 1982; 72 (S1): S32. https://doi.org/10.1121/1.2019827
Download citation file:
104
Views
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
Related Content
Alignment classification method to facilitate automatic acoustic‐phonetic statistics collection
J. Acoust. Soc. Am. (August 2005)
Unifying dynamic programming methods
J. Acoust. Soc. Am. (August 2005)
Decisions about features
J. Acoust. Soc. Am. (August 2005)
Very large vocabulary recognition (VLVR): using prosodic and spectral filters
J. Acoust. Soc. Am. (August 2005)
A generalized frequency tracker
J. Acoust. Soc. Am. (August 2005)