Acquisition of dynamic articulatory data is of major importance for studying speech production. A single technique is often not enough to cover the whole vocal tract at a sufficient sampling rate. Ultrasound (US) imaging has been proposed as a good acquisition technique for the tongue surface because it offers good temporal sampling, does not alter speech production, is cheap, and is widely available. However, it cannot be used alone, and this paper describes a multimodal acquisition system that uses electromagnetography sensors to locate the US probe. The paper focuses in particular on the calibration of the US modality, which is the key point of the system. This approach enables US data to be merged with other data. The use of the system is illustrated by an experiment that measures the minimal tongue-to-palate distance in order to evaluate and design Magnetic Resonance Imaging protocols well suited to the acquisition of three-dimensional images of the vocal tract. Compared with the manual registration of acquisition modalities that is often used in the acquisition of articulatory data, the approach presented relies on automatic techniques that are well founded from geometrical and mathematical points of view.
February 04 2016
Multimodal acquisition of articulatory data: Geometrical and temporal registration
Michaël Aron
Institut Supérieur de l'Electronique et du Numérique, Brest, France
Marie-Odile Berger
Institut de Recherche en Informatique et en Automatique, Centre National de la Recherche Scientifique, Université de Lorraine, Laboratoire Lorrain de Recherche en Informatique et ses Applications, Vandœuvre-lès-Nancy, France
Erwan Kerrien
Institut de Recherche en Informatique et en Automatique, Centre National de la Recherche Scientifique, Université de Lorraine, Laboratoire Lorrain de Recherche en Informatique et ses Applications, Vandœuvre-lès-Nancy, France
Brigitte Wrobel-Dautcourt
Institut de Recherche en Informatique et en Automatique, Centre National de la Recherche Scientifique, Université de Lorraine, Laboratoire Lorrain de Recherche en Informatique et ses Applications, Vandœuvre-lès-Nancy, France
Blaise Potard
Institut de Recherche en Informatique et en Automatique, Centre National de la Recherche Scientifique, Université de Lorraine, Laboratoire Lorrain de Recherche en Informatique et ses Applications, Vandœuvre-lès-Nancy, France
Yves Laprie a)
Institut de Recherche en Informatique et en Automatique, Centre National de la Recherche Scientifique, Université de Lorraine, Laboratoire Lorrain de Recherche en Informatique et ses Applications, Vandœuvre-lès-Nancy, France
a) Electronic mail: Yves.Laprie@loria.fr
J. Acoust. Soc. Am. 139, 636–648 (2016)
Article history
Received: May 04 2015
Accepted: January 08 2016
Citation
Michaël Aron, Marie-Odile Berger, Erwan Kerrien, Brigitte Wrobel-Dautcourt, Blaise Potard, Yves Laprie; Multimodal acquisition of articulatory data: Geometrical and temporal registration. J. Acoust. Soc. Am. 1 February 2016; 139 (2): 636–648. https://doi.org/10.1121/1.4940666