This technical report details our submission system to the CHiME-7 DASR
...
The Transducer is one of the mainstream frameworks for streaming speech
reco...
Speech pre-training has shown great success in learning useful and gener...
Self-supervised learning (SSL) models have achieved considerable improve...
Multilingual end-to-end models have shown great improvement over monolin...
In this work, we present a novel method, named AV2vec, for learning
audi...