Science Cast

BUT CHiME-7 system description

Martin Karafia´tOctober 20, 2023 10:03am

Views (311)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

BUT CHiME-7 system description

arXivPDFOctober 18, 2023 12:00am

Authors

Martin Karafiát, Karel Veselý, Igor Szöke, Ladislav Mošner, Karel Beneš, Marcin Witkowski, Germán Barchi, Leonardo Pepino

Abstract

This paper describes the joint effort of Brno University of Technology (BUT), AGH University of Krakow and University of Buenos Aires on the development of Automatic Speech Recognition systems for the CHiME-7 Challenge. We train and evaluate various end-to-end models with several toolkits. We heavily relied on Guided Source Separation (GSS) to convert multi-channel audio to single channel. The ASR is leveraging speech representations from models pre-trained by self-supervised learning, and we do a fusion of several ASR systems. In addition, we modified external data from the LibriSpeech corpus to become a close domain and added it to the training. Our efforts were focused on the far-field acoustic robustness sub-track of Task 1 - Distant Automatic Speech Recognition (DASR), our systems use oracle segmentation.

TwitterandLinkedIn

0 comments

Add comment

BUT CHiME-7 system description

BUT CHiME-7 system description

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments