We introduce a multilingual speaker change detection model (USM-SCD) tha...
This paper introduces the research effort of an undergraduate research t...
Image segmentation serves as a critical tool across a range of applicati...
In real-world traffic, there are various uncertainties and complexities ...
Distributed online learning is gaining increased traction due to its uni...
We introduce AudioPaLM, a large language model for speech understanding ...
We introduce the Universal Speech Model (USM), a single large model that...
Privacy protection and nonconvexity are two challenging problems in dece...
We propose a new dynamic average consensus algorithm that is robust to i...
We propose a novel method to accelerate training and inference process o...
We address differential privacy for fully distributed aggregative games ...
We study in this paper privacy protection in fully distributed Nash equi...
The distributed computation of a Nash equilibrium in aggregative games i...
By enabling multiple agents to cooperatively solve a global optimization...
Decentralized stochastic optimization is the basic building block of mod...
Decentralized optimization is gaining increased traction due to its wide...
Average consensus plays a key role in distributed networks, with applica...
We summarize the results of a host of efforts using giant automatic spee...
Attention-based models have been gaining popularity recently for their s...
Transformer-based models have achieved state-of-the-art performance on s...
In this paper, we summarize the application of transformer and its strea...
This paper proposes an efficient memory transformer Emformer for low lat...
In this work, we first show that on the widely used LibriSpeech benchmar...
Transformers, originally proposed for natural language processing (NLP) ...
Transformer-based acoustic modeling has achieved great success for both...
Although n-gram language models (LMs) have been outperformed by the stat...
We explore options to use Transformer networks in neural transducer for ...
Supervised ASR models have reached unprecedented levels of accuracy, tha...
Deep acoustic models typically receive features in the first layer of th...
We propose and evaluate transformer-based acoustic models (AMs) for hybr...
Decentralized heading control is crucial for robotic network operations ...
Average consensus underpins key functionalities of distributed systems r...
End-to-end modeling (E2E) of automatic speech recognition (ASR) blends a...
A spoken language understanding system is traditionally designed as a pipe...