Traditional recommender systems leverage users' item preference history ...
Training data attribution (TDA) methods offer to trace a model's predict...
Pretrained large language models (LLMs) are able to solve a wide variety...
Text-based safety classifiers are widely used for content moderation and...
This paper explores a novel application of textual semantic similarity t...
We consider how a particular kind of graph corresponds to multiplicativ...
Each year, expert-level performance is attained in increasingly complex
...
Natural interaction with recommendation and personalized search systems ...
Scaling language models with more data, compute and parameters has drive...
User posts whose perceived toxicity depends on the conversational contex...
This paper introduces a simple and effective form of data augmentation f...
Platforms that support online commentary, from social networks to news s...
We present a new dataset of approximately 44,000 comments labeled by
crow...
Moderation is crucial to promoting healthy on-line discussions. Although...
We introduce the Constructive Comments Corpus (C3), comprised of 12,000
...
Unintended bias in Machine Learning can manifest as systemic differences...
This report examines the Pinned AUC metric introduced and highlights som...
We present a corpus that encompasses the complete history of conversatio...
One of the main challenges online social systems face is the prevalence ...
The damage personal attacks cause to online discourse motivates many
pla...