The recent increasing availability of comprehensive real-time MRI data of the vocal tract and concomitant progress in air-tissue boundary segmentation present novel opportunities for articulatory modeling. PCA-based articulatory models represent vocal tract configurations as weighted linear combinations of articulatory components that characterize vocal tract shaping patterns. Historically, most such models have been developed using data from a single speaker, and their direct application to data from multiple speakers may provide components that are not comparable across speakers. A technique that can address this issue is PARAFAC, which has been applied in the past to analyze data corresponding to tongue contour X-ray tracings of 10 English vowels from 5 speakers. PARAFAC introduces an additional level of weighting of the articulatory components that is constant for, and therefore characteristic of, each speaker. We revisited PARAFAC and ran a successful pilot on real-time MRI air tissue-boundaries of the entire vocal tract from 4 speakers, each uttering two repetitions of a set of 4 Shibboleth sentences, yielding 289 phones per speaker. Application on a much larger and diverse real-time MRI dataset already collected by our team will provide crucial progress toward true cross-speaker articulatory modeling. [Work supported by NIH and NSF.]
Skip Nav Destination
Article navigation
October 2019
Meeting abstract. No PDF available.
October 01 2019
Toward cross-speaker articulatory modeling
Asterios Toutios;
Asterios Toutios
Univ. of Southern California, 3740 McClintock Ave., EEB 400, Los Angeles, CA 90089, [email protected]
Search for other works by this author on:
Shrikanth S. Narayanan
Shrikanth S. Narayanan
Univ. of Southern California, Los Angeles, CA
Search for other works by this author on:
J. Acoust. Soc. Am. 146, 3085 (2019)
Citation
Asterios Toutios, Reed Blaylock, Louis Goldstein, Shrikanth S. Narayanan; Toward cross-speaker articulatory modeling. J. Acoust. Soc. Am. 1 October 2019; 146 (4_Supplement): 3085. https://doi.org/10.1121/1.5137718
Download citation file:
93
Views
Citing articles via
All we know about anechoic chambers
Michael Vorländer
Day-to-day loudness assessments of indoor soundscapes: Exploring the impact of loudness indicators, person, and situation
Siegbert Versümer, Jochen Steffens, et al.
A survey of sound source localization with deep learning methods
Pierre-Amaury Grumiaux, Srđan Kitić, et al.
Related Content
How an aglossic speaker produces an alveolar-like percept without a functional tongue tip
J. Acoust. Soc. Am. (June 2020)
Two cross‐linguistic factors underlying tongue shapes for vowels
J Acoust Soc Am (June 1996)
Characterization of inter-speaker articulatory variability: A two-level multi-speaker modelling approach based on MRI data
J. Acoust. Soc. Am. (April 2019)
Reducing speaker variation in Germanic: Improved results with PARAFAC
J Acoust Soc Am (August 2005)
On the lingual organization of the German vowel system
J Acoust Soc Am (August 1999)