Engineering knowledge-based (or expert) systems require extensive manual...
Hidden-unit BERT (HuBERT) is a widely-used self-supervised learning (SSL...
Transducer and Attention based Encoder-Decoder (AED) are two widely used...
It has been known that direct speech-to-speech translation (S2ST) models...
ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitat...
Direct speech-to-speech translation (S2ST), in which all components can ...
Reversible data hiding in encrypted domain(RDH-ED) can not only protect ...
Direct speech-to-speech translation (S2ST) is among the most challenging...
In a sentence, certain words are critical for its semantic. Among them, ...
The amount of labeled data to train models for speech tasks is limited f...
We describe a method to jointly pre-train speech and text in an
encoder-...
We present the first direct simultaneous speech-to-speech translation
(S...
In a speech-to-speech translation (S2ST) pipeline, the text-to-speech (T...
In this paper, we describe our end-to-end multilingual speech translatio...
Pretraining and multitask learning are widely used to improve the speech...
We present a direct speech-to-speech translation (S2ST) model that trans...
Multi-head attention has each of the attention heads collect salient
inf...
We propose an effective approach to utilize pretrained speech and text m...
Attention-based sequence-to-sequence modeling provides a powerful and el...
We introduce fairseq S2T, a fairseq extension for speech-to-text (S2T)
m...
One of the main challenges for end-to-end speech translation is data
sca...
Translational distance-based knowledge graph embedding has shown progres...
Machine reading comprehension(MRC) has attracted significant amounts of
...
Recent years have seen great success in the use of neural seq2seq models...
Multi-hop reading comprehension (RC) across documents poses new challeng...
The I4U consortium was established to facilitate a joint entry to NIST
s...
Speech emotion recognition (SER) has attracted great attention in recent...
This paper aims to improve the widely used deep speaker embedding x-vect...
Knowledge graph embedding has been an active research topic for knowledg...