INSTITUTE OF INFORMATION TECHNOLOGIES - BAS

Cybernetics and Information Technologies
Volume 3, No 1. Sofia, 2003, Bulgarian Academy of Sciences


An Evaluation of DTW, AA and ARVM for Fixed-Text Speaker Identification

Atanas Ouzounov
Institute of Information Technologies, 1113 Sofia,

Abstract: Three different methodologies for automatic speaker identification have been evaluated in the paper, namely the well known Dynamic Time Warping (DTW), the Auto-Regressive Vector Models (ARVM) and an Algebraic Approach (AA). The aim of our study is to examine the effectiveness of these approaches in the fixed-text speaker identification task with short phrases in Bulgarian language collected over noisy telephone channels. Furthermore, two well-known speech features, namely the Linear Predictive Coding derived Cepstrum (LPCC) and the Mel-Frequency Cepstral Coefficients (MFCC) were evaluated. As experimental results shown the joint work of the ARVM and the MFCC outperforms the all others approaches used in this study.

Keywords: speaker identification, mel-frequency cepstrum, linear predictive coding cepstrum, algebraic approach.