research ∙ 07/10/2023
Measuring Lexical Diversity in Texts: The Twofold Length Problem
The impact of text length on the estimation of lexical diversity has cap...
research ∙ 06/22/2022
Comparing Formulaic Language in Human and Machine Translation: Insight from a Parliamentary Corpus
A recent study has shown that, compared to human translations, neural ma...
research ∙ 05/23/2022
Please, Don't Forget the Difference and the Confidence Interval when Seeking for the State-of-the-Art Status
This paper argues for the widest possible use of bootstrap confidence in...
research ∙ 03/10/2022
SATLab at SemEval-2022 Task 4: Trying to Detect Patronizing and Condescending Language with only Character and Word N-grams
A logistic regression model only fed with character and word n-grams is ...
research ∙ 02/05/2022
A simple language-agnostic yet very strong baseline system for hate speech and offensive content identification
For automatically identifying hate speech and offensive content in tweet...
research ∙ 07/08/2021
Using CollGram to Compare Formulaic Language in Human and Neural Machine Translation
A comparison of formulaic sequences in human and neural machine translat...
research ∙ 05/20/2021
LAST at SemEval-2021 Task 1: Improving Multi-Word Complexity Prediction Using Bigram Association Measures
This paper describes the system developed by the Laboratoire d'analyse s...
research ∙ 04/29/2021
Using Fisher's Exact Test to Evaluate Association Measures for N-grams
To determine whether some often-used lexical association measures assign...
research ∙ 04/27/2021