Structure pruning is an effective method to compress and accelerate neur...
Masked language modeling (MLM) has been widely used for pre-training eff...
In this paper, we propose Dynamic Self-Attention (DSA), a new self-atten...
Open Directory Project (ODP) has been successfully utilized in text clas...
Recently, implicit representation models, such as embedding or deep lear...