Bone and tissue conducted speech has been used in noisy environments to provide a relatively high signal-to-noise ratio signal. However, the limited bandwidth of bone and tissue conducted speech degrades the quality of the speech signal. Moreover in very noisy conditions, bandwidth extension of the bone and tissue conducted speech becomes problematic. In this paper, speech generated from bone and tissue conduction captured using an in-ear microphone is enhanced using adaptive filtering and a non-linear bandwidth extension method. Objective and subjective tests are used to evaluate the performance of the proposed techniques. Both evaluations show a statistically significant quality enhancement of the noisy in-ear microphone speech with ρ<0.0001 after denoising and ρ<0.01 after bandwidth extension.

1.
Bauer
,
P.
, and
Fingscheidt
,
T.
(
2008
). “
An hmm-based artificial bandwidth extension evaluated by cross-language training and test
,” in
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
,
Las Vegas, NV
, pp.
4589
4592
.
2.
Berger
,
E. H.
(
2003
).
The Noise Manual
(
American Industrial Hygiene Assoc.
,
Falls Church, VA
),
796
p.
3.
Bou Serhal
,
R. E.
,
Falk
,
T. H.
, and
Voix
,
J.
(
2013
). “
Integration of a distance sensitive wireless communication protocol to hearing protectors equipped with in-ear microphones
,”
Proc. Meet. Acoust.
19
,
040013
.
4.
Bouserhal
,
R. E.
,
Falk
,
T. H.
, and
Voix
,
J.
(
2015
). “
On the potential for artificial bandwidth extension of bone and tissue conducted speech: A mutual information study
,” in
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
,
Brisbane, Australia
, pp.
5108
5112
.
5.
Brummund
,
M. K.
,
Sgard
,
F.
,
Petit
,
Y.
, and
Laville
,
F.
(
2014
). “
Three-dimensional finite element modeling of the human external ear: Simulation study of the bone conduction occlusion effecta)
,”
J. Acoust. Soc. Am.
135
(
3
),
1433
1444
.
6.
Casali
,
J. G.
, and
Berger
,
E. H.
(
1996
). “
Technology advancements in hearing protection circa 1995: Active noise reduction, frequency/amplitude-sensitivity, and uniform attenuation
,”
Am. Indust. Hygiene Assoc.
57
(
2
),
175
185
.
7.
Davis
,
G. M.
(
2002
).
Noise Reduction in Speech Applications
, Vol.
7
(
CRC Press
,
Boca Raton, FL
).
8.
Dekens
,
T.
, and
Verhelst
,
W.
(
2013
). “
Body conducted speech enhancement by equalization and signal fusion
,”
IEEE Trans. Audio, Speech, Lang. Process.
21
(
12
),
2481
2492
.
9.
Falk
,
T. H.
,
Sejdic
,
E.
,
Chau
,
T.
, and
Chan
,
W.-Y.
(
2010
).
Spectro-Temporal Analysis of Auscultatory Sounds (In-Tech Publishing, Croatia)
,
Chap.
5, pp.
93
104
.
10.
Gan
,
W. S.
,
Mitra
,
S.
, and
Kuo
,
S. M.
(
2005
). “
Adaptive feedback active noise control headset: Implementation, evaluation and its extensions
,”
IEEE Trans. Consumer Electron.
51
(
3
),
975
982
.
11.
Iser
,
B.
, and
Schmidt
,
G.
(
2008
). “
Bandwidth extension of telephony speech
,” in
Speech and Audio Processing in Adverse Environments
(
Springer
,
Heidelberg, Germany
), pp.
135
184
.
12.
ITU-R, Rec.
(
2001
). BS 1534-1,
Method for the Subjective Assessment of Intermediate Sound Quality (MUSHRA)
(
International Telecommunication Union Geneva
,
Switzerland
).
13.
ITU-T, Rec.
(
2011
). P
863
,
Perceptual Objective Listening Quality Assessment (POLQA)
(
International Telecommunication Union
,
Geneva, Switzerland
).
14.
Jax
,
P.
, and
Vary
,
P.
(
2003
). “
On artificial bandwidth extension of telephone speech
,”
Signal Process.
83
(
8
),
1707
1719
.
15.
Kondo
,
K.
,
Fujita
,
T.
, and
Nakagawa
,
K.
(
2006
). “
On equalization of bone conducted speech for improved speech quality
,” in
IEEE International Symposium on Signal Processing and Information Technology
,
Vancouver, British Columbia, Canada
, pp.
426
431
.
16.
Kornagel
,
U.
(
2006
). “
Techniques for artificial bandwidth extension of telephone speech
,”
Signal Process.
86
(
6
),
1296
1306
.
17.
Lezzoum
,
N.
,
Gagnon
,
G.
, and
Voix
,
J.
(
2016
). “
Echo threshold between passive and electro-acoustic transmission paths in digital hearing protection devices
,”
Int. J. Industr. Ergon.
53
,
372
379
.
18.
Li
,
M.
,
Cohen
,
I.
, and
Mousazadeh
,
S.
(
2014
). “
Multisensory speech enhancement in noisy environments using bone-conducted and air-conducted microphones
,” in
IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)
,
Chengdu, China
, pp.
1
5
.
19.
Liu
,
Z.
,
Zhang
,
Z.
,
Acero
,
A.
,
Droppo
,
J.
, and
Huang
,
X.
(
2004
). “
Direct filtering for air-and bone-conductive microphones
,” in
IEEE 6th Workshop on Multimedia Signal Processing
,
Siena, Italy
, pp.
363
366
.
20.
Manolakis
,
D. G.
,
Ingle
,
V. K.
, and
Kogon
,
S. M.
(
2005
).
Statistical and Adaptive Signal Processing: Spectral Estimation, Signal Modeling, Adaptive Filtering, and Array Processing
, Vol.
46
(
Artech House
,
Norwood, MA
),
816
pp.
21.
Martinek
,
R.
, and
Zidek
,
J.
(
2010
). “
Use of adaptive filtering for noise reduction in communications systems
,” in
International Conference on Applied Electronics (AE)
(
Pilsen
,
Czech Republic
), pp.
1
6
.
22.
McBride
,
M.
,
Tran
,
P.
,
Letowski
,
T.
, and
Patrick
,
R.
(
2011
). “
The effect of bone conduction microphone locations on speech intelligibility and sound quality
,”
Appl. Ergonom.
42
(
3
),
495
502
.
23.
Murphy
,
W.
,
Davis
,
R.
,
Byrne
,
D.
, and
Franks
,
J.
(
2005
). “
Advanced hearing protector study: Conducted at General Motors metal fabricating division
,” Flint Metal Center, Flint, Michigan.
24.
Nadon
,
V.
,
Bockstael
,
A.
,
Botteldooren
,
D.
,
Lina
,
J.-M.
, and
Voix
,
J.
(
2015
). “
Individual monitoring of hearing status: Development and validation of advanced techniques to measure otoacoustic emissions in suboptimal test conditions
,”
Appl. Acoust.
89
,
78
87
.
25.
OSHA.
(
1983
). Occupational Noise Exposure: Hearing Conservation Amendment, Final Rule. Occupational Safety and Health Administration, 29CFR191095 Fed Regist
48
(
46
), pp.
9738
9797
.
26.
Paliwal
,
K.
, and
Kleijn
,
W.
(
1995
). “
Quantization of LPC parameters
,” in
Speech Coding and Synthesis
(
Elsevier Science B.V.
,
Amsterdam
) pp.
433
466
.
27.
Rahman
,
M. S.
, and
Shimamura
,
T.
(
2011
). “
Intelligibility enhancement of bone conducted speech by an analysis-synthesis method
,” in
IEEE 54th International Midwest Symposium on Circuits and Systems (MWSCAS)
,
Seoul, Korea (South)
, pp.
1
4
.
28.
Rothauser
,
E.
,
Chapman
,
W.
,
Guttman
,
N.
,
Nordby
,
K.
,
Silbiger
,
H.
,
Urbanek
,
G.
, and
Weinstock
,
M.
(
1969
). “
IEEE recommended practice for speech quality measurements
,”
IEEE Trans. Audio Electroacoust.
17
(
3
),
225
246
.
29.
Sayed
,
A. H.
(
2003
).
Fundamentals of Adaptive Filtering
(
John Wiley & Sons
,
Vancouver, British Columbia, Canada
),
1118
pp.
30.
Seltzer
,
M. L.
,
Acero
,
A.
, and
Droppo
,
J.
(
2005
). “
Robust bandwidth extension of noise-corrupted narrowband speech
,” in
INTERSPEECH
,
Lisbon, Portugal
, pp.
1509
1512
.
31.
Shin
,
H. S.
,
Kang
,
H.-G.
, and
Fingscheidt
,
T.
(
2012
). “
Survey of speech enhancement supported by a bone conduction microphone
,” in
Proceedings of Speech Communication; 10. ITG Symposium
,
VDE, Berlin, Germany
, pp.
1
4
.
32.
Thang
,
T. V.
,
Kimura
,
K.
,
Unoki
,
M.
, and
Akagi
,
M.
(
2006
). “
A study on restoration of bone-conducted speech with mtf-based and lp-based models
,”
J. Signal Process.
10
(
6
),
407
417
.
33.
Turan
,
M. T.
, and
Erzin
,
E.
(
2013
). “
Enhancement of throat microphone recordings by learning phone-dependent mappings of speech spectra
,” in
2013 IEEE International Conference on Acoustics, Speech and Signal Processing
,
Vancouver, British Columbia, Canada
, pp.
7049
7053
.
34.
Valin
,
J.-M.
, and
Lefebvre
,
R.
(
2000
). “
Bandwidth extension of narrowband speech for low bit-rate wideband coding
,” in
IEEE Proceedings of the Workshop on Speech Coding
,
Delavan, WI
, pp.
130
132
.
35.
Varga
,
A.
, and
Steeneken
,
H. J.
(
1993
). “
Assessment for automatic speech recognition: Ii. noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems
,”
Speech Commun.
12
(
3
),
247
251
.
36.
Voix
,
J.
, and
Laville
,
F.
(
2009
). “
The objective measurement of individual earplug field performance
,”
J. Acoust. Soc. Am.
125
(
6
),
3722
3732
.
37.
Vu
,
T. tat
,
Unoki
,
M.
, and
Akagi
,
M.
(
2008
). “
An LP-based blind model for restoring bone-conducted speech
,” in
Second International Conference on Communications and Electronics (ICCE)
,
Hanoi, Vietnam
, pp.
212
217
.
38.
Zheng
,
Y.
,
Liu
,
Z.
,
Zhang
,
Z.
,
Sinclair
,
M.
,
Droppo
,
J.
,
Deng
,
L.
,
Acero
,
A.
, and
Huang
,
X.
(
2003
). “
Air-and bone-conductive integrated microphones for robust speech detection and enhancement
,” in
IEEE Workshop on Automatic Speech Recognition and Understanding. ASRU'03
,
Saint Thomas, U.S. Virgin Islands
, pp.
249
254
.
You do not currently have access to this content.