Forensic speaker recognition in Chinese: A multivariate likelihood ratio discrimination on /i/ and /y/

Cuiling Zhang*, Geoffrey Stewart Morrison, Philip Rose

*Corresponding author for this work

Research output: Chapter in Book/Published conference outputConference publication

Abstract

A likelihood-ratio-based forensic speaker discrimination was conducted using the mean formant frequencies of Standard Chinese /i/ and /y/ tokens produced by 64 male speakers. The speech data were relatively forensically realistic in that they were relatively extemporaneous, were recorded over the telephone, and were from three non-contemporaneous recording sessions. A multivariate-kernel-density formula was used to calculate cross-validated likelihood ratios comparing all possible same-speaker and different-speaker combinations across sessions. Results were comparable with those previously obtained with laboratory speech in other languages. In general, greater strength of evidence was obtained for recording sessions separated by one week than for recording sessions separated by one month.

Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association
Subtitle of host publicationINTERSPEECH 2008
Pages1937-1940
Number of pages4
Publication statusPublished - 2008

Publication series

NameProceedings of Interspeech
ISSN (Print)1990 9772

Keywords

  • Chinese
  • Forensic speaker recognition
  • Likelihood ratio

Fingerprint

Dive into the research topics of 'Forensic speaker recognition in Chinese: A multivariate likelihood ratio discrimination on /i/ and /y/'. Together they form a unique fingerprint.

Cite this