Discrete audio representation, aka audio tokenization, has seen renewed
...
Large language models (LLMs) have shown great promise for capturing
cont...
Multilingual Automatic Speech Recognition (ASR) models are capable of
tr...
This work presents a novel methodology for calculating the phonetic
simi...
We propose an algorithm to extract noise-robust acoustic features from n...
End-to-end (E2E) systems are fast replacing the conventional systems in ...
Language identification (LID) has relevance in many speech processing
ap...
Code-switching refers to the usage of two languages within a sentence or...