Cybernetics and Information Technologies, Vol. 3, No 2, 2003

Abstract: The paper describes the BG-SRDat (BulGarian language Speaker Recognition DATabase) - a corpus in Bulgarian language, recorded over noisy analog telephone channels and intended for speaker recognition. The BG-SRDat comprises two separated speech corpora, called Speech Data 1 (SD1) and Speech Data 2 (SD2), respectively. The SD1 is a reading text from a newspaper and its average length is about 40 seconds. The SD2 is a short phrase with length of about 2 seconds. The SD1 and the SD2 are uttered in various sessions by different number of speakers (male) - 26 and 13, respectively. To achieve more realistic real-world conditions the speech data is collected by different types of telephone calls (internal-routing, local and long-distance) and acoustical environments (noisy offices, halls and streets). The BG-SRDat purpose is to help the researchers to evaluate various speaker recognition techniques for noisy telephone speech in Bulgarian language and to select the more promising one.