A sawtooth waveform inspired pitch estimator (SWIPE) has been developed for speech and music. SWIPE estimates the pitch as the fundamental frequency of the sawtooth waveform whose spectrum best matches the spectrum of the input signal. The comparison of the spectra is done by computing a normalized inner product between the spectrum of the signal and a modified cosine. The size of the analysis window is chosen appropriately to make the width of the main lobes of the spectrum match the width of the positive lobes of the cosine. , a variation of SWIPE, utilizes only the first and prime harmonics of the signal, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms. The authors’ tests indicate that SWIPE and performed better on two spoken speech and one disordered voice database and one musical instrument database consisting of single notes performed at a variety of pitches.
Skip Nav Destination
Article navigation
September 2008
September 01 2008
A sawtooth waveform inspired pitch estimator for speech and music
Arturo Camacho;
Arturo Camacho
Computational NeuroEngineering Laboratory,
University of Florida
, Gainesville, Florida 32611
Search for other works by this author on:
John G. Harris
John G. Harris
Computational NeuroEngineering Laboratory,
University of Florida
, Gainesville, Florida 32611
Search for other works by this author on:
J. Acoust. Soc. Am. 124, 1638–1652 (2008)
Article history
Received:
December 05 2007
Accepted:
June 02 2008
Citation
Arturo Camacho, John G. Harris; A sawtooth waveform inspired pitch estimator for speech and music. J. Acoust. Soc. Am. 1 September 2008; 124 (3): 1638–1652. https://doi.org/10.1121/1.2951592
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
Vowel signatures in emotional interjections and nonlinguistic vocalizations expressing pain, disgust, and joy across languages
Maïa Ponsonnet, Christophe Coupé, et al.
The alveolar trill is perceived as jagged/rough by speakers of different languages
Aleksandra Ćwiek, Rémi Anselme, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
Computing pitch of speech and music using a sawtooth waveform inspired pitch estimator
J Acoust Soc Am (November 2007)
Extending the sawtooth wave form inspired pitch estimator with an auditory‐based preprocessing stage.
J Acoust Soc Am (April 2009)
A frame selective dynamic programming approach for noise robust pitch estimation
J. Acoust. Soc. Am. (April 2018)
Robust fundamental frequency estimation in sustained vowels: Detailed algorithmic comparisons and information fusion with adaptive Kalman filtering
J. Acoust. Soc. Am. (May 2014)
Analysis and measurement of the modulation transfer function of harmonic shear wave induced phase encoding imaging
J. Acoust. Soc. Am. (May 2014)