An Evaluation of DTW, AA and ARVM for Fixed-Text Speaker Identification
Atanas Ouzounov
Institute of Information Technologies, 1113 Sofia,
Abstract: Three different methodologies for automatic speaker identification have been evaluated in the paper, namely the well known Dynamic Time Warping (DTW), the Auto-Regressive Vector Models (ARVM) and an Algebraic Approach (AA). The aim of our study is to examine the effectiveness of these approaches in the fixed-text speaker identification task with short phrases in Bulgarian language collected over noisy telephone channels. Furthermore, two well-known speech features, namely the Linear Predictive Coding derived Cepstrum (LPCC) and the Mel-Frequency Cepstral
Coefficients (MFCC) were evaluated. As experimental results shown
the joint work of the ARVM and the MFCC outperforms the all others approaches used in this study.
Keywords: speaker identification, mel-frequency cepstrum, linear predictive coding cepstrum, algebraic approach.