Uncovering Bias in ASR Systems

Marcio Fuckner; Sophie Horsman; Pascal Wiggers; Iskaj Janssen

product

Uncovering Bias in ASR Systems

Evaluating Wav2vec2 and Whisper for Dutch speakers

Beschrijving

It is crucial that ASR systems can handle the wide range of variations in speech of speakers from different demographic groups, with different speaking styles, and of speakers with (dis)abilities. A potential quality-of-service harm arises when ASR systems do not perform equally well for everyone. ASR systems may exhibit bias against certain types of speech, such as non-native accents, different age groups and gender. In this study, we evaluate two widely-used neural network-based architectures: Wav2vec2 and Whisper on potential biases for Dutch speakers. We used the Dutch speech corpus JASMIN as a test set containing read and conversational speech in a human-machine interaction setting. The results reveal a significant bias against non-natives, children and elderly and some regional dialects. The ASR systems generally perform slightly better for women than for men.

Trefwoorden

speech recognition

dutch

Bias

Publicatiedatum

Type

Multifile

DOI

Niet bekend

Uncovering Bias in ASR Systems

Evaluating Wav2vec2 and Whisper for Dutch speakers

Beschrijving

SpeD_Bias_ASR_Paper.pdf

Gebruiksrecht

Toegangsrecht

SpeD_Bias_ASR_Paper.pdf

Gebruiksrecht

Toegangsrecht

URL 1

Gebruiksrecht

Toegangsrecht

Trefwoorden

Embed code

Publicatiedatum

Type

DOI

Marcio Fuckner

Marcio Fuckner

Sophie Horsman

Pascal Wiggers

Hogeschool van Amsterdam

Navigeer naar