Task-agnostic knowledge distillation attempts to address the problem of
...
Recent progress in diffusion models has revolutionized the popular techn...
Estimated time of arrival (ETA) prediction, also known as travel time
es...
Pre-trained language models have achieved state-of-the-art results in va...
Pre-trained models have achieved state-of-the-art results in various Nat...
Pretrained language models (PLMs) such as BERT adopt a training paradigm...
This paper describes the system designed by ERNIE Team which achieved th...
Code switching is a linguistic phenomenon that may occur within a
multil...
We present a novel language representation model enhanced by knowledge c...