The dynamics of airflow during speech production may often result in some small or large degree of turbulence. In this paper, the geometry of speech turbulence as reflected in the fragmentation of the time signal is quantified by using fractal models. An efficient algorithm for estimating the short-time fractal dimension of speech signals based on multiscale morphological filtering is described, and its potential for speech segmentation and phonetic classification discussed. Also reported are experimental results on using the short-time fractal dimension of speech signals at multiple scales as additional features in an automatic speech-recognition system using hidden Markov models, which provide a modest improvement in speech-recognition performance.
Skip Nav Destination
March 01 1999
Fractal dimensions of speech sounds: Computation and application to automatic speech recognition
Petros Maragos, Alexandros Potamianos; Fractal dimensions of speech sounds: Computation and application to automatic speech recognition. J. Acoust. Soc. Am. 1 March 1999; 105 (3): 1925–1932. https://doi.org/10.1121/1.426738
Download citation file: