Automatic labeling methods have been developed which allow speech recognition systems to be trained and tested on very large data bases (up to 140 000 word tokens). But are these automatic labeling methods accurate enough to collect statistics of direct value to speech science? Despite the success of automatic methods in effectively training speech recognition systems, the alignment routines still make a small percentage of gross phone alignment errors—in sharp contrast to human spectrogram or waveform experts. Automatic training procedures seem, nonetheless, to be tolerant of this small percentage of misaligned sounds provided they are given a sufficiently large number of correctly aligned instances of each sound. In this paper, a new method is proposed to directly address the following classification problem: is a particular putative phone alignment correct or is it an alignment error? Applying this classification method to a large data base previously labeled by a conventional automatic routine, acoustic‐phonetic statistics for each sound may be obtained for all instances of that sound which are classified as correctly labeled.

This content is only available via PDF.