research
∙
06/08/2023
Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
Existing sentence textual similarity benchmark datasets only use a singl...
research
∙
05/03/2023
Can Large Language Models Be an Alternative to Human Evaluations?
Human evaluation is indispensable and inevitable for assessing the quali...
research
∙
10/06/2022
How Far Are We from Real Synonym Substitution Attacks?
In this paper, we explore the following question: how far are we from re...
research
∙
04/10/2022
Re-Examining Human Annotations for Interpretable NLP
Explanation methods in Interpretable NLP often explain the model's decis...
research
∙
04/09/2022
Understanding, Detecting, and Separating Out-of-Distribution Samples and Adversarial Samples in Text Classification
In this paper, we study the differences and commonalities between statis...
research
∙
09/08/2021
On the Transferability of Pre-trained Language Models: A Study from Artificial Datasets
Pre-training language models (LMs) on large-scale unlabeled text data ma...
research
∙
12/22/2020