research ∙ 03/30/2023
oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes
In this paper, we introduce the range of oBERTa language models, an easy...
          
research ∙ 05/25/2022
     
             
  
  
     