Cybernetics and Information Technologies, Vol. 3, No 1, 2003

Abstract: Three different methodologies for automatic speaker identification have been evaluated in the paper, namely the well known Dynamic Time Warping (DTW), the Auto-Regressive Vector Models (ARVM) and an Algebraic Approach (AA). The aim of our study is to examine the effectiveness of these approaches in the fixed-text speaker identification task with short phrases in Bulgarian language collected over noisy telephone channels. Furthermore, two well-known speech features, namely the Linear Predictive Coding derived Cepstrum (LPCC) and the Mel-Frequency Cepstral Coefficients (MFCC) were evaluated. As experimental results shown the joint work of the ARVM and the MFCC outperforms the all others approaches used in this study.

Keywords: speaker identification, mel-frequency cepstrum, linear predictive coding cepstrum, algebraic approach.