According to the model of speech production, the characteristic parameter of speech can be divided into two parts: excitation and vocal tract parameters. Atal proposed the multipulse excitation model that can produce high‐quality sythesized speech. This research shows that the intensity, duration, and pitch mode of single syllable Chinese produced by multipulse excitation may be changed when the adaptive method is utilized to process its multipulse sequences and vocal tract parameter. There are about 10 000 Chinese words in common use, but the pronunciation of many words is the same, so that only about 1300 syllables are independent. The Chinese language is a tone language. Each Chinese word is of four pitch modes, and the vocal tract parameter for the four modes of one word is almost the same. Therefore, there are 400 independent vocal tract parameters and 1300 multipulse sequences in Chinese. Based on the above strategy, a new method of sythesizing Chinese by rules has been proposed. The intelligibility and naturalness of the synthesized speech are satisfactory.
Skip Nav Destination
Article navigation
November 1988
Article Contents
August 13 2005
Synthesis of Chinese by rules based on a multipulse excitation model Free
Li Changli;
Li Changli
Institute of Acoustics, Academia Sinica, Beijing, People's Republic of China
Search for other works by this author on:
Me Fuyuan
Me Fuyuan
Institute of Acoustics, Academia Sinica, Beijing, People's Republic of China
Search for other works by this author on:
Li Changli
Institute of Acoustics, Academia Sinica, Beijing, People's Republic of China
Me Fuyuan
Institute of Acoustics, Academia Sinica, Beijing, People's Republic of China
J. Acoust. Soc. Am. 84, S23 (1988)
Citation
Li Changli, Me Fuyuan; Synthesis of Chinese by rules based on a multipulse excitation model. J. Acoust. Soc. Am. 1 November 1988; 84 (S1): S23. https://doi.org/10.1121/1.2026229
Download citation file:
30
Views
Citing articles via
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
Effects of network selection and acoustic environment on bounding-box object detection of delphinid whistles using a deep learning tool
Peter C. Sugarman, Elizabeth L. Ferguson, et al.
Introduction to the special issue on: Advances in soundscape: Emerging trends and challenges in research and practice
Francesco Aletta, Bhan Lam, et al.
Related Content
Statistical modeling of dynamic spectral patterns for a speech synthesizer
J. Acoust. Soc. Am. (August 2005)
A speech synthesis system by rule in Japanese
J. Acoust. Soc. Am. (August 2005)
Excitation problem in speech synthesis
J. Acoust. Soc. Am. (August 2005)
Changing pitch and duration in LPC synthesized speech using multipulse excitation
J. Acoust. Soc. Am. (August 2005)
Decoding the speech code—Applications of temporal decomposition
J. Acoust. Soc. Am. (August 2005)