The principles of the existing pitch estimation techniques are often different and complementary in nature. In this work, a frame selective dynamic programming (FSDP) method is proposed which exploits the complementary characteristics of two existing methods, namely, sub-harmonic to harmonic ratio (SHR) and sawtooth-wave inspired pitch estimator (SWIPE). Using variants of SHR and SWIPE, the proposed FSDP method classifies all the voiced frames into two classes—the first class consists of the frames where a confidence score maximization criterion is used for pitch estimation, while for the second class, a dynamic programming (DP) based approach is proposed. Experiments are performed on speech signals separately from KEELE, CSLU, and PaulBaghsaw corpora under clean and additive white Gaussian noise at 20, 10, 5, and 0 dB SNR conditions using four baseline schemes including SHR, SWIPE, and two DP based techniques. The pitch estimation performance of FSDP, when averaged over all SNRs, is found to be better than those of the baseline schemes suggesting the benefit of applying smoothness constraint using DP in selected frames in the proposed FSDP scheme. The VuV classification error from FSDP is also found to be lower than that from all four baseline schemes in almost all SNR conditions on three corpora.
Skip Nav Destination
,
,
Article navigation
April 2018
April 20 2018
A frame selective dynamic programming approach for noise robust pitch estimation Available to Purchase
Chiranjeevi Yarra;
Chiranjeevi Yarra
a)
1
Department of Electrical Engineering, Indian Institute of Science (IISc)
, Bangalore, 560012, India
Search for other works by this author on:
Om D. Deshmukh;
Om D. Deshmukh
2
Xerox Research Center India
, Bangalore, 560103, India
Search for other works by this author on:
Prasanta Kumar Ghosh
Prasanta Kumar Ghosh
1
Department of Electrical Engineering, Indian Institute of Science (IISc)
, Bangalore, 560012, India
Search for other works by this author on:
Chiranjeevi Yarra
1,a)
Om D. Deshmukh
2
Prasanta Kumar Ghosh
1
1
Department of Electrical Engineering, Indian Institute of Science (IISc)
, Bangalore, 560012, India
2
Xerox Research Center India
, Bangalore, 560103, India
a)
Electronic mail: [email protected]
J. Acoust. Soc. Am. 143, 2289–2300 (2018)
Article history
Received:
November 09 2017
Accepted:
March 27 2018
Citation
Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh; A frame selective dynamic programming approach for noise robust pitch estimation. J. Acoust. Soc. Am. 1 April 2018; 143 (4): 2289–2300. https://doi.org/10.1121/1.5031129
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
218
Views
Citing articles via
Climatic and economic fluctuations revealed by decadal ocean soundscapes
Vanessa M. ZoBell, Natalie Posdaljian, et al.
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
The contribution of speech rate, rhythm, and intonation to perceived non-nativeness in a speaker's native language
Ulrich Reubold, Robert Mayr, et al.
Related Content
ChildAugment: Data augmentation methods for zero-resource children's speaker verification
J. Acoust. Soc. Am. (March 2024)
A sawtooth waveform inspired pitch estimator for speech and music
J. Acoust. Soc. Am. (September 2008)
Performance analysis of various fundamental frequency estimation algorithms in the context of pathological speech
J. Acoust. Soc. Am. (November 2022)
Impact of phase estimation on single-channel speech separation based on time-frequency masking
J. Acoust. Soc. Am. (June 2017)
Extending the sawtooth wave form inspired pitch estimator with an auditory‐based preprocessing stage.
J. Acoust. Soc. Am. (April 2009)