An algorithm is presented for the estimation of the fundamental frequency of speech or musical sounds. It is based on the well-known autocorrelation method with a number of modifications that combine to prevent errors. The algorithm has several desirable features. Error rates are about three times lower than the best competing methods, as evaluated over a database of speech recorded together with a laryngograph signal. There is no upper limit on the frequency search range, so the algorithm is suited for high-pitched voices and music. The algorithm is relatively simple and may be implemented efficiently and with low latency, and it involves few parameters that must be tuned. It is based on a signal model (periodic signal) that may be extended in several ways to handle various forms of aperiodicity that occur in particular applications. Finally, interesting parallels may be drawn with models of auditory processing.
Skip Nav Destination
Article navigation
April 2002
April 03 2002
YIN, a fundamental frequency estimator for speech and music
Alain de Cheveigné;
Alain de Cheveigné
Ircam-CNRS, 1 place Igor Stravinsky, 75004 Paris, France
Search for other works by this author on:
Hideki Kawahara
Hideki Kawahara
Wakayama University
Search for other works by this author on:
J. Acoust. Soc. Am. 111, 1917–1930 (2002)
Article history
Received:
June 07 2001
Accepted:
January 09 2002
Citation
Alain de Cheveigné, Hideki Kawahara; YIN, a fundamental frequency estimator for speech and music. J. Acoust. Soc. Am. 1 April 2002; 111 (4): 1917–1930. https://doi.org/10.1121/1.1458024
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Pay-Per-View Access
$40.00
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Recommendations on bioacoustical metrics relevant for regulating exposure to anthropogenic underwater sound
Klaus Lucke, Alexander O. MacGillivray, et al.
Related Content
Acoustic and laryngographic measures of the laryngeal reflexes of linguistic prominence and vocal effort in German
J. Acoust. Soc. Am. (February 2010)
A microcomputer‐based system for high‐speed processing of speech materials using pitch‐synchronous LPC analysis and synthesis
J Acoust Soc Am (August 2005)
Running autocorrelation method of F0 estimation
J Acoust Soc Am (May 2001)
Formant estimation with linear prediction
J Acoust Soc Am (August 2005)
A sawtooth waveform inspired pitch estimator for speech and music
J. Acoust. Soc. Am. (September 2008)