雑音に頑健な話者照合のための基本周波数情報の利用

Translated title of the contribution: Use of F_0 information for noise-robust speaker verification

浅見 太一, 岩野 公司, 古井 貞熙, Koji IWANO

Research output: Contribution to journal › Misc

Abstract

This paper proposes a noise-robust speaker verification method that uses prosodic information. The prosodic features are logF_0 and ΔlogF_0, which are combined with segmental features such as cepstral parameters. F_0 is extracted by a noise-robust method that applies the Hough transform to time-cepstrum images. The segmental and prosodic features are modeled jointly by multi-stream HMMs. Speaker verification experiments were conducted on Japanese four-connected-digit utterances contaminated by white noise at various SNRs. The proposed method reduced the equal error rate in all SNR conditions. The largest reduction was observed at 10 dB SNR, where the error rate was reduced by 39.9% compared with the baseline method using only segmental features.
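As a rough illustration of how two feature streams can be combined in a multi-stream model, the sketch below forms a weighted sum of per-frame log-likelihoods from a segmental stream and a prosodic stream. The abstract does not state the weighting scheme, so the function name, the constraint that the two weights sum to one, and all numeric values are assumptions made for illustration only, not the authors' implementation.

```python
def combine_stream_log_likelihoods(seg_loglik, pros_loglik, prosodic_weight):
    """Weighted sum of per-frame log-likelihoods from two streams.

    Assumption: the stream weights sum to 1, a common convention for
    multi-stream HMMs; the abstract does not specify this.
    """
    if not 0.0 <= prosodic_weight <= 1.0:
        raise ValueError("prosodic_weight must lie in [0, 1]")
    if len(seg_loglik) != len(pros_loglik):
        raise ValueError("streams must have the same number of frames")
    segmental_weight = 1.0 - prosodic_weight
    return [
        segmental_weight * s + prosodic_weight * p
        for s, p in zip(seg_loglik, pros_loglik)
    ]


# Hypothetical per-frame log-likelihoods (not taken from the paper).
seg = [-4.0, -3.5, -5.0]    # segmental (cepstral) stream
pros = [-1.0, -2.0, -1.5]   # prosodic (logF_0, delta logF_0) stream

combined = combine_stream_log_likelihoods(seg, pros, prosodic_weight=0.2)
utterance_score = sum(combined)  # utterance-level log-likelihood
```

Setting `prosodic_weight` to 0 recovers a segmental-only score, which corresponds in spirit to the baseline condition described in the abstract.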
Original language: Japanese
Pages (from-to): 1-6
Journal: IEICE technical report. Speech
Volume: 104
Issue number: 87
State: Published - 21 May 2004