In this paper we present a model called the Modified Phase-Opponency (MPO) model for single-channel speech enhancement when the speech is corrupted by additive noise. The MPO model is based on the auditory PO model, proposed for detection of tones in noise. The PO model includes a physiologically realistic mechanism for processing the information in neural discharge times and exploits the frequency-dependent phase properties of the tuned filters in the auditory periphery by using a cross-auditory-nerve-fiber coincidence detection for extracting temporal cues. The MPO model alters the components of the PO model such that the basic functionality of the PO model is maintained but the properties of the model can be analyzed and modified independently. The MPO-based speech enhancement scheme does not need to estimate the noise characteristics nor does it assume that the noise satisfies any statistical model. The MPO technique leads to the lowest value of the LPC-based objective measures and the highest value of the perceptual evaluation of speech quality measure compared to other methods when the speech signals are corrupted by fluctuating noise. Combining the MPO speech enhancement technique with our aperiodicity, periodicity, and pitch detector further improves its performance.
Skip Nav Destination
,
,
Article navigation
June 2007
June 01 2007
Speech enhancement using the modified phase-opponency model Available to Purchase
Om D. Deshmukh;
Om D. Deshmukh
a)
Department of Electrical and Computer Engineering and Institute for Systems Research,
University of Maryland
, College Park, Maryland 20742
Search for other works by this author on:
Carol Y. Espy-Wilson;
Carol Y. Espy-Wilson
Department of Electrical and Computer Engineering and Institute for Systems Research,
University of Maryland
, College Park, Maryland 20742
Search for other works by this author on:
Laurel H. Carney
Laurel H. Carney
Department of Biomedical and Chemical Engineering and Institute for Sensory Research,
Syracuse University
, Syracuse, New York 13244
Search for other works by this author on:
Om D. Deshmukh
a)
Carol Y. Espy-Wilson
Laurel H. Carney
Department of Electrical and Computer Engineering and Institute for Systems Research,
University of Maryland
, College Park, Maryland 20742a)
Electronic mail: [email protected]
J. Acoust. Soc. Am. 121, 3886–3898 (2007)
Article history
Received:
February 01 2006
Accepted:
February 13 2007
Citation
Om D. Deshmukh, Carol Y. Espy-Wilson, Laurel H. Carney; Speech enhancement using the modified phase-opponency model. J. Acoust. Soc. Am. 1 June 2007; 121 (6): 3886–3898. https://doi.org/10.1121/1.2714913
Download citation file:
Pay-Per-View Access
$40.00
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Citing articles via
Focality of sound source placement by higher (ninth) order ambisonics and perceptual effects of spectral reproduction errors
Nima Zargarnezhad, Bruno Mesquita, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Variation in global and intonational pitch settings among black and white speakers of Southern American English
Aini Li, Ruaridh Purse, et al.
Related Content
A noise‐reduction strategy for speech based on phase‐opponency detectors
J. Acoust. Soc. Am. (April 2005)
Speech enhancement based on modified phase‐opponency detectors
J. Acoust. Soc. Am. (September 2005)
A machine for neural computation of acoustical patterns with application to real time speech recognition
AIP Conf. Proc. (August 1986)
Speech enhancement for noise‐robust speech recognition.
J. Acoust. Soc. Am. (October 2008)
Pure state consciousness and its local reduction to neuronal space
AIP Conf. Proc. (January 2013)