This study investigates the fusion of multiple formant-trajectory-based and fundamental-frequency-trajectory-based (f0-trajectory-based) forensic-voice-comparison systems. Each system was based on tokens of a single phoneme: Chinese /ei1/, /ai2/, or /iau1/ (numbers indicate tones). Human-supervised formant-trajectory and f0-trajectory measurements were made on tokens from a database of recordings of 60 female speakers of Chinese. Discrete cosine transforms (DCTs) were fitted to the trajectories, and the DCT coefficients were used to calculate likelihood ratios via the multivariate kernel density (MVKD) formula. The individual-phoneme systems were fused with each other and with a baseline mel-frequency cepstral-coefficient (MFCC) Gaussian-mixture-model universal-background-model (GMM-UBM) system, which made use of the entire speech-active portion of the recordings. Tests were conducted using high-quality recordings as nominal suspect samples and mobile-to-landline transmitted recordings as nominal offender samples. Fusion of the individual-phoneme systems with the baseline system via logistic regression did not lead to any substantial improvement in validity, and reliability deteriorated.
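Two of the steps named in the abstract, reducing a trajectory to DCT coefficients and fusing per-system scores by logistic regression, can be illustrated with a minimal sketch. It is not the authors' implementation. The function names (`dct_coefficients`, `train_fusion`, `fuse`), the equally spaced sampling, the number of coefficients, the gradient-descent settings, and all numbers in the usage lines are assumptions made for illustration. The MVKD likelihood-ratio calculation is not shown.

```python
import math


def dct_coefficients(trajectory, n_coeffs):
    """Return the first n_coeffs orthonormal DCT-II coefficients.

    Assumes equally spaced samples. In that case the cosine basis is
    orthogonal, so a least-squares fit of the first n_coeffs cosine
    terms reduces to the projection computed here.
    """
    n = len(trajectory)
    coeffs = []
    for k in range(n_coeffs):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(
            x * math.cos(math.pi * k * (2 * t + 1) / (2 * n))
            for t, x in enumerate(trajectory)
        )
        coeffs.append(scale * total)
    return coeffs


def _sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def train_fusion(scores, labels, steps=2000, rate=0.05):
    """Fit an offset and one weight per system by logistic regression.

    scores: one vector per comparison, holding one log LR per system.
    labels: 1 for same-speaker comparisons, 0 for different-speaker ones.
    The two classes get equal total weight (an assumption of this sketch)
    so that the fused output can be read as a log likelihood ratio.
    """
    n_sys = len(scores[0])
    n_same = sum(labels)
    n_diff = len(labels) - n_same
    params = [0.0] * (n_sys + 1)  # params[0] is the offset
    for _ in range(steps):
        grad = [0.0] * (n_sys + 1)
        for vec, y in zip(scores, labels):
            p = _sigmoid(fuse(params, vec))
            class_weight = 0.5 / n_same if y == 1 else 0.5 / n_diff
            err = class_weight * (p - y)
            grad[0] += err
            for i, v in enumerate(vec):
                grad[i + 1] += err * v
        params = [p_i - rate * g for p_i, g in zip(params, grad)]
    return params


def fuse(params, score_vector):
    """Fused score: offset plus weighted sum of the per-system scores."""
    return params[0] + sum(w * v for w, v in zip(params[1:], score_vector))


# Usage with made-up numbers (not data from the study).
f0_track = [210.0, 214.0, 219.0, 223.0, 224.0, 221.0]  # hypothetical f0 in Hz
features = dct_coefficients(f0_track, n_coeffs=3)

train_scores = [[2.0, 1.0], [1.0, 0.5], [-0.5, 1.5],
                [-2.0, -1.0], [-1.0, 0.5], [0.5, -1.5]]
train_labels = [1, 1, 1, 0, 0, 0]
fusion_params = train_fusion(train_scores, train_labels)
fused_score = fuse(fusion_params, [1.5, 0.8])
```

In this sketch the fused score is a linear combination of the individual systems' scores. Logistic regression only chooses the offset and weights, so a system that adds little information should receive a weight near zero.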
2 June 2013
ICA 2013 Montreal
2–7 June 2013
Montreal, Canada
Speech Communication: Session 1pSCc: Distinguishing Between Science and Pseudoscience in Forensic Acoustics II
May 17 2013
Fusion of multiple formant-trajectory- and fundamental-frequency-based forensic-voice-comparison systems: Chinese /ei1/, /ai2/, and /iau1/
Cuiling Zhang
Department of Forensic Science & Technology, China Criminal Police University, 83 Tawan Street, Shenyang, Liaoning 110854, China
Ewald Enzinger
Forensic Voice Comparison Laboratory, School of Electrical Engineering and Telecommunications, University of New South Wales, UNSW Sydney, New South Wales 2052 Australia
Proc. Mtgs. Acoust. 19, 060044 (2013)
Article history
Received: January 20 2013
Accepted: January 21 2013
Citation
Cuiling Zhang, Ewald Enzinger; Fusion of multiple formant-trajectory- and fundamental-frequency-based forensic-voice-comparison systems: Chinese /ei1/, /ai2/, and /iau1/. Proc. Mtgs. Acoust. 2 June 2013; 19 (1): 060044. https://doi.org/10.1121/1.4798793