The fundamental frequency (fo) is pivotal for quantifying vocal-fold characteristics. However, the accuracy of fo estimation in hoarse voices is notably low, and no definitive algorithm for fo estimation has been previously established. In this study, we introduce an algorithm named, “Spectral-based fo Estimator Emphasized by Domination and Sequence (SFEEDS),” which enhances the spectrum method and conducted comparative analyses with conventional estimation methods. We analyzed 454 voice samples and used conventional methods and SFEEDS to calculate fo. The ground truth of fo was determined as the lowest frequency within the most dominant harmonic complex observed on the spectrogram. Subsequently, we assessed the concordance between each fo-estimation method and the fo ground truth. We also examined the variations in the accuracy of these methods when analyzing speech with hoarseness. Regardless of hoarseness, the fo-estimation accuracy was significantly greater by SFEEDS than by conventional methods. Moreover, whereas the conventional methods impaired fo-estimation accuracy in samples with roughness, the SFEEDS algorithm was robust and significantly reduced subharmonic errors. The SFEEDS fo-estimation algorithm accurately estimated the fo of both normal and hoarse voices.
Skip Nav Destination
,
,
,
,
,
,
,
,
,
,
,
Article navigation
December 2024
December 24 2024
Robust fundamental frequency-detection algorithm unaffected by the presence of hoarseness in human voice
Itsuki Kitayama;
Itsuki Kitayama
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Kiyohito Hosokawa
;
Kiyohito Hosokawa
a)
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
2
Department of Otorhinolaryngology, Osaka International Medical & Science Center
, Osaka 543-0035, Japan
Search for other works by this author on:
Shinobu Iwaki;
Shinobu Iwaki
3
Department of Rehabilitation, Kobe University Hospital
, Hyogo 650-0017, Japan
Search for other works by this author on:
Misao Yoshida;
Misao Yoshida
4
Department of Rehabilitation, Sakai Heisei Hospital
, Osaka 599-8236, Japan
Search for other works by this author on:
Akira Miyauchi;
Akira Miyauchi
5
Department of Surgery, Kuma Hospital
, Hyogo 650-0011, Japan
Search for other works by this author on:
Toshihiro Kishikawa;
Toshihiro Kishikawa
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Hidenori Tanaka;
Hidenori Tanaka
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Takeshi Tsuda;
Takeshi Tsuda
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Takashi Sato;
Takashi Sato
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Yukinori Takenaka;
Yukinori Takenaka
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Makoto Ogawa;
Makoto Ogawa
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Hidenori Inohara
Hidenori Inohara
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
Search for other works by this author on:
Itsuki Kitayama
1
Kiyohito Hosokawa
1,2,a)
Shinobu Iwaki
3
Misao Yoshida
4
Akira Miyauchi
5
Toshihiro Kishikawa
1
Hidenori Tanaka
1
Takeshi Tsuda
1
Takashi Sato
1
Yukinori Takenaka
1
Makoto Ogawa
1
Hidenori Inohara
1
1
Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine
, Osaka 565-0871, Japan
2
Department of Otorhinolaryngology, Osaka International Medical & Science Center
, Osaka 543-0035, Japan
3
Department of Rehabilitation, Kobe University Hospital
, Hyogo 650-0017, Japan
4
Department of Rehabilitation, Sakai Heisei Hospital
, Osaka 599-8236, Japan
5
Department of Surgery, Kuma Hospital
, Hyogo 650-0011, Japan
a)
Email: [email protected]
J. Acoust. Soc. Am. 156, 4217–4228 (2024)
Article history
Received:
June 22 2024
Accepted:
November 25 2024
Citation
Itsuki Kitayama, Kiyohito Hosokawa, Shinobu Iwaki, Misao Yoshida, Akira Miyauchi, Toshihiro Kishikawa, Hidenori Tanaka, Takeshi Tsuda, Takashi Sato, Yukinori Takenaka, Makoto Ogawa, Hidenori Inohara; Robust fundamental frequency-detection algorithm unaffected by the presence of hoarseness in human voice. J. Acoust. Soc. Am. 1 December 2024; 156 (6): 4217–4228. https://doi.org/10.1121/10.0034624
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
111
Views
Citing articles via
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
Related Content
Effect of combined source ( F ) and filter (formant) variation on red deer hind responses to male roars
J. Acoust. Soc. Am. (May 2008)
The impact of brief restriction to articulation on children's subsequent speech production
J. Acoust. Soc. Am. (February 2018)
Cortical sensitivity to periodicity of speech sounds
J. Acoust. Soc. Am. (April 2008)
Effects of vowel coproduction on the timecourse of tone recognition
J. Acoust. Soc. Am. (April 2020)
Nonlinear frequency compression: Influence of start frequency and input bandwidth on consonant and vowel recognition
J. Acoust. Soc. Am. (February 2016)