We propose a neural audio generative model, MDCTNet, operating in the
pe...
It is crucial to choose the appropriate scale in order to build an effec...
In this technical report, we introduce Effidit (Efficient and Intelligen...
People perceive the world with multiple senses (e.g., through hearing so...
Chinese BERT models achieve remarkable progress in dealing with grammati...
We present a Chinese BERT model dubbed MarkBERT that uses word informati...
This paper presents a Pathways approach to handle many tasks at once. Ou...
Whole word masking (WWM), which masks all subwords corresponding to a wo...
The standard BERT adopts subword-based tokenization, which may break a w...
Music performance synthesis aims to synthesize a musical score into a na...
We provide a speech coding scheme employing a generative model based on
...
Here we present a novel approach to conditioning the SampleRNN generativ...