INSTITUTE OF INFORMATION TECHNOLOGIES - BAS
Cybernetics and Information Technologies
Volume 6, No 3. Sofia, 2006, Bulgarian Academy of Sciences
Robust Features and Neural Network for Noisy Speech Detection
Atanas Ouzounov
Institute of Information Technologies, 1113 Sofia
E-mail: atanas@iinf.bas.bg
Abstract:
In this paper, the effectiveness of three features in speech detection tasks is experimentally studied. The first feature is obtained by processing of the spectral autocorrelation function, while the second one is based on the multi-band spectral entropy. The well-known mel-cepstrum is utilized as a third feature. A multi-layer perceptron based speech detector is developed and speech detection tasks with noisy data are carried out for each feature. The performance analysis of the speech detection results is done using the ROC curves and measures. The experimental results revealed that the feature obtained by processing of the spectral autocorrelation function is more suitable for noisy speech detection than the other two features.
Keywords:
Speech Detection, Spectral Entropy, Neural Network, ROC Curve