INSTITUTE OF INFORMATION TECHNOLOGIES - BAS

Cybernetics and Information Technologies
Volume 3, No 2. Sofia, 2003, Bulgarian Academy of Sciences


BG-SRDat: A Corpus in Bulgarian Language for Speaker Recognition over Telephone Channels

Atanas Ouzounov

Institute of Information Technologies, 1113 Sofia E-mail: atanas@iinf.bas.bg


Abstract: The paper describes the BG-SRDat (BulGarian language Speaker Recognition DATabase) - a corpus in Bulgarian language, recorded over noisy analog telephone channels and intended for speaker recognition. The BG-SRDat comprises two separated speech corpora, called Speech Data 1 (SD1) and Speech Data 2 (SD2), respectively. The SD1 is a reading text from a newspaper and its average length is about 40 seconds. The SD2 is a short phrase with length of about 2 seconds. The SD1 and the SD2 are uttered in various sessions by different number of speakers (male) - 26 and 13, respectively. To achieve more realistic real-world conditions the speech data is collected by different types of telephone calls (internal-routing, local and long-distance) and acoustical environments (noisy offices, halls and streets). The BG-SRDat purpose is to help the researchers to evaluate various speaker recognition techniques for noisy telephone speech in Bulgarian language and to select the more promising one.

Keywords: Speech corpora, speech databases, speaker recognition.