Second-language learners often experience major difficulties in producing non-native speech sounds. This paper introduces a training method that uses a real-time analysis of the acoustic properties of vowels produced by non-native speakers to provide them with immediate, trial-by-trial visual feedback about their articulation alongside that of the same vowels produced by native speakers. The Mahalanobis acoustic distance between non-native productions and target native acoustic spaces was used to assess L2 production accuracy. The experiment shows that 1 h of training per vowel improves the production of four non-native Danish vowels: the learners' productions were closer to the corresponding Danish target vowels after training. The production performance of a control group remained unchanged. Comparisons of pre- and post-training vowel discrimination performance in the experimental group showed improvements in perception. Correlational analyses of training-related changes in production and perception revealed no relationship. These results suggest, first, that this training method is effective in improving non-native vowel production. Second, training purely on production improves perception. Finally, it appears that improvements in production and perception do not systematically progress at equal rates within individuals.

1.
Akahane-Yamada
,
R.
,
McDermott
,
E.
,
Adachi
,
T.
,
Kawahara
,
H.
, and
Pruitt
,
J.-S.
(
1998
). “
Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores
,”
Proc. Interspeech
5
,
1
4
.
2.
Alario
,
F. X.
,
Goslin
,
J.
,
Michel
,
V.
, and
Laganaro
,
M.
(
2010
). “
The functional origin of the foreign accent: Evidence from the syllable-frequency effect in bilingual speakers
,”
Psychol. Sci.
21
(
1
),
15
20
.
3.
Aliaga-García
,
C.
, and
Mora
,
J. C.
(
2009
). “
Assessing the effects of phonetic training on L2 sound perception and production
,” in
Recent Research in Second Language Phonetics/Phonology: Perception and Production
, edited by
M. A.
Watkins
,
A. S.
Rauber
, and
B. O.
Baptista
(
Cambridge Scholars Publishing
,
Newcastle upon Tyne, UK
), pp.
2
31
.
4.
Barr
,
D. J.
,
Levy
,
R.
,
Scheepers
,
C.
, and
Tily
,
H. J.
(
2013
). “
Random effects structure for confirmatory hypothesis testing: Keep it maximal
,”
J. Memory Lang.
68
(
3
),
255
278
.
5.
Basbøl
,
H.
(
2005
).
The Phonology of Danish
(
Oxford University Press
,
New York
), pp.
86
92
.
6.
Best
,
C. T.
(
1995
). “
A direct cross-realist view of cross-language speech perception
,” in
Speech Perception and Linguistic Experience: Theoretical and Methodological Issues
, edited by
W.
Strange
(
York Press
,
Baltimore, MD
), pp.
171
204
).
7.
Boersma
,
P.
, and
Weenink
,
D.
(
2010
). “
Praat: Doing phonetics by computer
,” [Computer program]. Version 5.2, http://www.praat.org/ (Last viewed October 19, 2010).
8.
Bradlow
,
A. R.
,
Pisoni
,
D. B.
,
Akahane-Yamada
,
R.
, and
Tohkura
,
Y.
(
1997
). “
Training Japanese listeners to identify English /r/ and /l/: IV. Some effects of perceptual learning on speech production
,”
J. Acoust. Soc. Am.
101
(
4
),
2299
2310
.
9.
Carey
,
M.
(
2004
). “
CALL visual feedback for pronunciation of vowels: Kay Sona-Match
,”
CALICO J.
21
(
3
),
571
601
.
10.
Catford
,
J. C.
, and
Pisoni
,
D. B.
(
1970
). “
Auditory vs. articulatory training in exotic sounds
,”
Modern Lang. J.
54
(
7
),
477
481
.
11.
Darcy
,
I.
,
Dekydtspotter
,
L.
,
Sprouse
,
R. A.
,
Glover
,
J.
,
Kaden
,
C.
,
McGuire
,
M.
, and
Scott
,
J. H.
(
2012
). “
Direct mapping of acoustics to phonology: On the lexical encoding of front rounded vowels in L1 English-L2 French acquisition
,”
Second Lang. Res.
28
(
1
),
5
40
.
12.
Davis
,
M. H.
,
Di Betta
,
A. M.
,
Macdonald
,
M. J.
, and
Gaskell
,
M. G.
(
2009
). “
Learning and consolidation of novel spoken words
,”
J. Cogn. Neurosci.
21
(
4
),
803
820
.
13.
Delvaux
,
V.
,
Huet
,
K.
,
Piccaluga
,
M.
, and
Harmegnies
,
B.
(
2013
). “
Production training in Second Language Acquisition: A comparison between objective measures and subjective judgments
,”
Proc. Interspeech
14
,
2375
2379
.
14.
Dowd
,
A.
,
Smith
,
J.
, and
Wolfe
,
J.
(
1998
). “
Learning to pronounce vowel sounds in a foreign language using acoustic measurements of the vocal tract as feedback in real time
,”
Lang. Speech
41
(
1
),
1
20
.
15.
Escudero
,
P.
, and
Boersma
,
P.
(
2004
). “
Bridging the gap between L2 speech perception research and phonological theory
,”
Stud. Second Lang. Acquisit.
26
(
04
),
551
585
.
16.
Flege
,
J. E.
(
1995
). “
Second language speech learning theory, findings, and problems
,” in
Speech Perception and Linguistic Experience: Issues in Cross-Language Research
, edited by
W.
Strange
(
York Press
,
Timonium, MD
), pp.
233
277
.
17.
Flege
,
J. E.
(
2002
). “
Interactions between the native and second-language phonetic systems
,” in
An Integrated View of Language Development: Papers in Honor of Henning Wode
, edited by
P.
Burmeister
,
T.
Piske
, and
A.
Rohde
(
Wissenschaftlicher Verlag Trier
,
Trier
), pp.
217
243
.
18.
Flege
,
J. E.
,
MacKay
,
I. R.
, and
Meador
,
D.
(
1999
). “
Native Italian speakers' perception and production of English vowels
,”
J. Acoust. Soc. Am.
106
(
5
),
2973
2987
.
19.
Georgeton
,
L.
,
Paillereau
,
N.
,
Landron
,
S.
,
Gao
,
J.
, and
Kamiyama
,
T.
(
2012
). “
Analyse formantique des voyelles orales du français en contexte isolé: à la recherche d'une référence pour les apprenants de FLE (Formant analysis of the French oral vowels in isolated context: In a quest of a reference for French learners)
,” in
Proceedings of JEP-TALN-RECITAL
, pp.
145
152
.
55.
Golestani
,
N.
, and
Pallier
,
C.
(
2007
). “
Anatomical correlates of foreign speech sound production
,”
Cerebral Cortex
17
(
4
),
929
934
.
20.
Grønnum
,
N.
(
1997
). “
Danish vowels: The psychological reality of a morphophonemic representation
,”
Proc. Journée dáEtudes Linguistiques [A day of Linguistic Studies]
pp.
91
97
.
21.
Guenther
,
F. H.
(
1994
). “
A neural network model of speech acquisition and motor equivalent speech production
,”
Biol. Cybern
72
(
1
),
43
53
.
22.
Hu
,
X.
,
Ackermann
,
H.
,
Martin
,
J. A.
,
Erb
,
M.
,
Winkler
,
S.
, and
Reiterer
,
S. M.
(
2013
). “
Language aptitude for pronunciation in advanced second language (L2) learners: Behavioural predictors and neural substrates
,”
Brain Lang.
127
(
3
),
366
376
.
23.
Ingram
,
J. C.
, and
Park
,
S.-G.
(
1997
). “
Cross-language vowel perception and production by Japanese and Korean learners of English
,”
J. Phonetics
25
(
3
),
343
370
.
24.
Kartushina
,
N.
, and
Frauenfelder
,
U. H.
(
2014
). “
On the effects of L2 perception and of individual differences in L1 production on L2 pronunciation
,”
Frontiers Psychol.
5
(
1246
),
1
17
.
25.
Lametti
,
D. R.
,
Rochet-Capellan
,
A.
,
Neufeld
,
E.
,
Shiller
,
D. M.
, and
Ostry
,
D. J.
(
2014
). “
Plasticity in the human speech motor system drives changes in speech perception
,”
J. Neurosci.
34
(
31
),
10339
10346
.
26.
Leather
,
J.
(
1996
). “
Interrelation of perceptual and productive learning in the initial acquisition of second-language tone
,” in
Second-Language Speech: Structure and Process
, edited by
A.
James
and
J.
Leather
(
Mouton de Gruyter
,
Berlin
), pp.
75
101
.
27.
Liberman
,
A. M.
, and
Mattingly
,
I. G.
(
1985
). “
The motor theory of speech perception revised
,”
Cognition
21
(
1
),
1
36
.
28.
Lively
,
S. E.
,
Logan
,
J. S.
, and
Pisoni
,
D. B.
(
1993
). “
Training Japanese listeners to identify English/r/and/l/. II: The role of phonetic environment and talker variability in learning new perceptual categories
,”
J. Acoust. Soc. Am.
94
(
3 Pt 1
),
1242
1255
.
29.
Loizou
,
P.
(
1998
). “
COLEA: A MATLAB software tool for speech analysis
,” Dallas, TX. Available at http://ecs.utdallas.edu/loizou/speech/colea.htm (Last viewed October 4, 2011).
30.
Lopez-Soto
,
T.
, and
Kewley-Port
,
D.
(
2009
). “
Relation of perception training to production of codas in English as a Second Language
,”
J. Acoust. Soc. Am.
125
,
2756
.
31.
Massaro
,
D. W.
,
Bigler
,
S.
,
Chen
,
T. H.
,
Perlman
,
M.
, and
Ouni
,
S.
(
2008
). “
Pronunciation training: The role of eye and ear
,”
Proc. Interspeech
9
,
2623
2626
.
32.
Ménard
,
L.
,
Schwartz
,
J.-L.
,
Boë
,
L.-J.
,
Kandel
,
S.
, and
Vallée
,
N.
(
2002
). “
Auditory normalization of French vowels synthesized by an articulatory model simulating growth from birth to adulthood
,”
J. Acoust. Soc. Am.
111
(
4
),
1892
1905
.
33.
Öster
,
A.-M.
(
1997
). “
Auditory and visual feedback in spoken L2 teaching
,” in Reports from the Department of Phonetics, Umeå University, PHONUM 4,
145
148
.
34.
Pallier
,
C.
,
Colomé
,
A.
, and
Sebastián-Gallés
,
N.
(
2001
). “
The influence of native-language phonology on lexical access: Exemplar-based versus abstract lexical entries
,”
Psychol. Sci.
12
(
6
),
445
449
.
35.
Peperkamp
,
S.
, and
Bouchon
,
C.
(
2011
). “
The relation between perception and production in L2 phonological processing
,”
Proc. Interspeech
12
,
161
164
.
36.
Perrachione
,
T. K.
,
Lee
,
J.
,
Ha
,
L. Y. Y.
, and
Wong
,
P. C. M.
(
2011
). “
Learning a novel phonological contrast depends on interactions between individual differences and training paradigm design
,”
J. Acoust. Soc. Am.
130
(
1
),
461
472
.
37.
Pillot-Loiseau
,
C.
,
Antolík Kocjančič
,
T.
, and
Kamiyama
,
T.
(
2013
). “
Contribution of ultrasound visualisation to improving the production of the French /y/-/u/ contrast by four Japanese learners
,” in
Proceedings of the PPLC13: Phonetics, Phonology, Languages in Contact. Contact Varieties, Multilingualism, Second Language Learning
, pp.
86
89
.
38.
Piske
,
T.
,
MacKay
,
I. R.
, and
Flege
,
J. E.
(
2001
). “
Factors affecting degree of foreign accent in an L2: A review
,”
J. Phon.
29
(
2
),
191
215
.
39.
Saito
,
K.
, and
Lyster
,
R.
(
2012
). “
Effects of form-focused instruction and corrective feedback on L2 pronunciation development of /ɹ/ by Japanese Learners of English
,”
Lang. Learn.
62
(
2
),
595
633
.
40.
Sheldon
,
A.
, and
Strange
,
W.
(
1982
). “
The acquisition of /r/ and /l/ by Japanese learners of English: Evidence that speech production can precede speech perception
,”
Appl. Psycholinguist.
3
(
3
),
243
261
.
41.
Simmonds
,
A. J.
,
Wise
,
R. J. S.
,
Dhanjal
,
N. S.
, and
Leech
,
R.
(
2011
). “
A comparison of sensory-motor activity during speech in first and second languages
,”
J. Neurophys.
106
(
1
),
470
478
.
42.
Steinlen
,
A. K.
(
2005
).
The Influence of Consonants on Native and Non-native Vowel Production: A Cross-Linguistic Study
(
Gunter Narr Verlag
,
Tübingen
), pp.
71
117
.
43.
Wik
,
P.
(
2004
). “
Designing a virtual language tutor
,” in
Proceedings of the XVIIth Swedish Phonetics Conference
, Fonetik, pp.
136
139
.
44.
Wilson
,
S. M.
, and
Gick
,
B.
(
2006
). “
Ultrasound technology and second language acquisition research
,”
Proc. Generative Approach. Second Lang. Acquisit.
8
,
148
152
.
45.
Wong
,
J. W. S.
(
2013
). “
The effects of perceptual and or productive training on the perception and production of English vowels /I/ and /i:/ by Cantonese ESL learners
,”
Proc. Interspeech
14
,
2113
2117
.
You do not currently have access to this content.