Causal inference is one of the hallmarks of human intelligence. While th...
In this paper, we conduct a thorough investigation into the reasoning
ca...
Current large language models can perform reasonably well on complex tas...
Many NLP classification tasks, such as sexism/racism detection or toxici...
In NLP, models are usually evaluated by reporting single-number performa...
Language Models pretrained on large textual data have been shown to enco...
Hate speech detection is complex; it relies on commonsense reasoning,
kn...
Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that
i...
Obtaining meaningful quality scores for machine translation systems thro...
In this paper, we tackle the Arabic Fine-Grained Hate Speech Detection s...
Large language models, which are often trained for hundreds of thousands...
Recently, there has been a surge of interest in the NLP community on the...
To date, efforts in the code-switching literature have focused for the m...
Schizophrenia is one of the most disabling mental health conditions to l...
Mixture of Experts layers (MoEs) enable efficient scaling of language mo...
Large-scale autoregressive language models such as GPT-3 are few-shot
le...
Commonsense knowledge can be leveraged for identifying causal relations ...
Do language models have beliefs about the world? Dennett (1995) famously...
Community Question Answering (CQA) fora such as Stack Overflow and Yahoo...
Modern sentence encoders are used to generate dense vector representatio...
Is bias amplified when neural machine translation (NMT) models are optim...
The scarcity of parallel data is a major obstacle for training high-qual...
Community Question Answering (CQA) forums such as Stack Overflow and Yah...
In this work, we test the performance of two bidirectional transformer-b...
Neural sequence models can generate highly fluent sentences but recent
s...
In many languages like Arabic, diacritics are used to specify pronunciat...
Neural abstractive summarization models are prone to generate content
in...
We describe a method for developing broad-coverage semantic dependency
p...
The increased focus on misinformation has spurred development of data an...
Intent classification (IC) and slot filling (SF) are core components in ...
Summarizing data samples by quantitative measures has a long history, wi...
Diacritic restoration has gained importance with the growing need for
ma...
Lexical ambiguity, a challenging phenomenon in all natural languages, is...
The blurry line between nefarious fake news and protected-speech satire ...
We present an overview of the second shared task on language identificat...
We present our effort to create a large Multi-Layered representational
r...
Data annotation is an important and necessary task for all NLP applicati...
We address the problem of Part of Speech tagging (POS) in the context of...
Natural Language Understanding (NLU) is a core component of dialog syste...
Vector averaging remains one of the most popular sentence embedding meth...
In the third shared task of the Computational Approaches to Linguistic
C...
Linguistic Code Switching (CS) is a phenomenon that occurs when multilin...
In this paper we present an emotion classifier model submitted to the
Se...
We develop and investigate several cross-lingual alignment approaches fo...
We describe a transfer method based on annotation projection to develop ...
Cross-lingual word vectors are typically obtained by fitting an orthogon...
Cross-lingual word vectors are typically obtained by fitting an orthogon...
This paper describes the ARIEL-CMU submissions to the Low Resource Human...
Schizophrenia is one of the most disabling and difficult to treat of all...
We evaluated various compositional models, from bag-of-words representat...