BG-SRDat: A Corpus in Bulgarian Language
for Speaker Recognition over Telephone Channels
Atanas Ouzounov
Institute of Information Technologies, 1113 Sofia
E-mail: atanas@iinf.bas.bg
Abstract: The paper describes the BG-SRDat (BulGarian language Speaker
Recognition DATabase) - a corpus in Bulgarian language, recorded over noisy
analog telephone channels and intended for speaker recognition. The BG-SRDat
comprises two separated speech corpora, called Speech Data 1 (SD1) and Speech
Data 2 (SD2), respectively. The SD1 is a reading text from a newspaper and its
average length is about 40 seconds. The SD2 is a short phrase with length of
about 2 seconds. The SD1 and the SD2 are uttered in various sessions by different
number of speakers (male) - 26 and 13, respectively. To achieve more realistic
real-world conditions the speech data is collected by different types of telephone
calls (internal-routing, local and long-distance) and acoustical environments
(noisy offices, halls and streets). The BG-SRDat purpose is to help the researchers
to evaluate various speaker recognition techniques for noisy telephone speech in
Bulgarian language and to select the more promising one.
Keywords: Speech corpora, speech databases, speaker recognition.