This paper presents a spectral warping method for unscrambling helium speech by converting the vocal tract shape estimated from PARCOR analysis. Speech produced in a high‐pressure, helium‐oxygen atmosphere is distorted in characteristic ways. One peculiarity is that the formant frequencies and the fundamental frequency shift at different rates. The formant frequency shift has two components: one is due to the change in sound velocity and produces a linear shift, and the other is due to the high pressure and produces a nonlinear shift. To convert helium speech to normal speech, the authors have therefore developed an analysis‐synthesis system that supports various types of conversion processes for spectral warping. The conversion process presented in this paper consists of two operations. The first increases the number of sections in an acoustic tube model of the vocal tract to compensate for the linear shift, and the second modifies the vocal tract area function to compensate for the nonlinear shift. Experiments that examine the feasibility of this approach and compare it with other approaches, such as impulse response stretching and power spectrum conversion, are discussed. [Work supported by Grant‐in‐Aid for Sci. Res. 146089.]
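The first operation can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything here is an assumption: the function names, the sign convention relating PARCOR coefficients to area ratios, the use of linear interpolation to resample the area function, and the velocity ratio of 1.5 are all hypothetical choices for illustration, not the authors' procedure. The idea is that, at a fixed sampling rate, a tube model with more sections represents a longer effective tract, so its resonances scale down roughly linearly.

```python
def parcor_to_areas(k, first_area=1.0):
    """Convert PARCOR coefficients to tube areas (assumed sign convention)."""
    areas = [first_area]
    for ki in k:
        # Assumed relation: A[i+1] / A[i] = (1 - k[i]) / (1 + k[i])
        areas.append(areas[-1] * (1.0 - ki) / (1.0 + ki))
    return areas


def areas_to_parcor(areas):
    """Inverse of parcor_to_areas under the same assumed convention."""
    return [(a0 - a1) / (a0 + a1) for a0, a1 in zip(areas, areas[1:])]


def resample_areas(areas, n_sections):
    """Linearly interpolate the area function onto n_sections sections."""
    n_old = len(areas) - 1
    out = []
    for j in range(n_sections + 1):
        x = j * n_old / n_sections      # position on the original grid
        i = min(int(x), n_old - 1)      # left neighbour index
        t = x - i                       # interpolation weight
        out.append((1.0 - t) * areas[i] + t * areas[i + 1])
    return out


# Hypothetical PARCOR coefficients for one analysis frame.
k_helium = [0.3, -0.2, 0.5, 0.1]
velocity_ratio = 1.5  # hypothetical helium-to-air sound velocity ratio

areas = parcor_to_areas(k_helium)
n_new = round(len(k_helium) * velocity_ratio)
areas_warped = resample_areas(areas, n_new)
k_warped = areas_to_parcor(areas_warped)  # coefficients for resynthesis
```

The second operation, modifying the area function for the nonlinear shift, is not sketched because the abstract does not describe how the modification is defined.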
November 1978
August 11 2005
Conversion of the vocal tract shape for spectral warping by a PARCOR analysis‐synthesis system
Hisayoshi Suzuki
Gen Ooyama
Satoshi Wakayama
Electronics Dept., Faculty of Engineering, Shizuoka University, Hamamatsu, 432 Japan
J. Acoust. Soc. Am. 64, S160 (1978)
Citation
Hisayoshi Suzuki, Gen Ooyama, Satoshi Wakayama; Conversion of the vocal tract shape for spectral warping by a PARCOR analysis‐synthesis system. J. Acoust. Soc. Am. 1 November 1978; 64 (S1): S160. https://doi.org/10.1121/1.2003952