This paper introduces a combinational feature extraction approach to improve speech recognition systems. The main idea is to simultaneously benefit from some features obtained from Poincaré section applied to speech reconstructed phase space (RPS) and typical Mel frequency cepstral coefficients (MFCCs) which have a proved role in speech recognition field. With an appropriate dimension, the reconstructed phase space of speech signal is assured to be topologically equivalent to the dynamics of the speech production system, and could therefore include information that may be absent in linear analysis approaches. Moreover, complicated systems such as speech production system can present cyclic and oscillatory patterns and Poincaré sections could be used as an effective tool in analysis of such trajectories. In this research, a statistical modeling approach based on Gaussian mixture models (GMMs) is applied to Poincaré sections of speech RPS. A final pruned feature set is obtained by applying an efficient feature selection approach to the combination of the parameters of the GMM model and MFCC-based features. A hidden Markov model-based speech recognition system and TIMIT speech database are used to evaluate the performance of the proposed feature set by conducting isolated and continuous speech recognition experiments. By the proposed feature set, 5.7% absolute isolated phoneme recognition improvement is obtained against only MFCC-based features.
Skip Nav Destination
,
,
Article navigation
September 2010
Research Article|
August 11 2010
Statistical modeling of speech Poincaré sections in combination of frequency analysis to improve speech recognition performance Available to Purchase
Ayyoob Jafari;
Ayyoob Jafari
a)
1Department of Biomedical Engineering,
Amirkabir University of Technology
, Tehran, Iran
2Department of Biomedical Engineering,
Islamic Azad University
, Qazvin Branch, Iran
Search for other works by this author on:
Farshad Almasganj;
Farshad Almasganj
b)
1Department of Biomedical Engineering,
Amirkabir University of Technology
, Tehran, Iran
Search for other works by this author on:
Maryam Nabi Bidhendi
Maryam Nabi Bidhendi
c)
1Department of Biomedical Engineering,
Amirkabir University of Technology
, Tehran, Iran
Search for other works by this author on:
Ayyoob Jafari
1,2,a)
Farshad Almasganj
1,b)
Maryam Nabi Bidhendi
1,c)
1Department of Biomedical Engineering,
Amirkabir University of Technology
, Tehran, Iran
2Department of Biomedical Engineering,
Islamic Azad University
, Qazvin Branch, Iran
a)
Electronic addresses: [email protected] and [email protected]. Telephone: +989125103094. FAX: +982166943093.
b)
Electronic mail: [email protected]. Telephone: +989123048783. FAX: +982164542372.
c)
Electronic mail: [email protected].
Chaos 20, 033106 (2010)
Article history
Received:
November 25 2009
Accepted:
June 23 2010
Citation
Ayyoob Jafari, Farshad Almasganj, Maryam Nabi Bidhendi; Statistical modeling of speech Poincaré sections in combination of frequency analysis to improve speech recognition performance. Chaos 1 September 2010; 20 (3): 033106. https://doi.org/10.1063/1.3463722
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
Reservoir computing with the minimum description length principle
Antony Mizzi, Michael Small, et al.
Recent achievements in nonlinear dynamics, synchronization, and networks
Dibakar Ghosh, Norbert Marwan, et al.
Data-driven nonlinear model reduction to spectral submanifolds via oblique projection
Leonardo Bettini, Bálint Kaszás, et al.
Related Content
Evaluation of formant-like features on an automatic vowel classification task
J. Acoust. Soc. Am. (September 2004)
Poincaré recurrence statistics as an indicator of chaos synchronization
Chaos (April 2014)
Plykin-type attractor in nonautonomous coupled oscillators
Chaos (February 2009)
Highlighting interlanguage phoneme differences based on similarity matrices and convolutional neural network
J. Acoust. Soc. Am. (January 2021)
Boundary transformation representation of attractor shape deformation
Chaos (August 2021)