Mona Diab

research

∙ 06/09/2023

Can Large Language Models Infer Causation from Correlation?

Causal inference is one of the hallmarks of human intelligence. While th...

0 Zhijing Jin, et al. ∙

research

∙ 05/19/2023

OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models

In this paper, we conduct a thorough investigation into the reasoning ca...

0 Badr AlKhamissi, et al. ∙

research

∙ 12/16/2022

ALERT: Adapting Language Models to Reasoning Tasks

Current large language models can perform reasonably well on complex tas...

0 Ping Yu, et al. ∙

research

∙ 10/14/2022

Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values

Many NLP classification tasks, such as sexism/racism detection or toxici...

2 Yejin Bang, et al. ∙

research

∙ 10/04/2022

Text Characterization Toolkit

In NLP, models are usually evaluated by reporting single-number performa...

0 Daniel Simig, et al. ∙

research

∙ 09/30/2022

Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification

Language Models pretrained on large textual data have been shown to enco...

0 Muhammad ElNokrashy, et al. ∙

research

∙ 05/25/2022

ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

Hate speech detection is complex; it relies on commonsense reasoning, kn...

0 Badr AlKhamissi, et al. ∙

research

∙ 05/25/2022

GisPy: A Tool for Measuring Gist Inference Score in Text

Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that i...

0 Pedram Hosseini, et al. ∙

research

∙ 05/17/2022

Consistent Human Evaluation of Machine Translation across Language Pairs

Obtaining meaningful quality scores for machine translation systems thro...

2 Daniel Licht, et al. ∙

research

∙ 05/16/2022

Meta AI at Arabic Hate Speech 2022: MultiTask Learning with Self-Correction for Hate Speech Classification

In this paper, we tackle the Arabic Fine-Grained Hate Speech Detection s...

0 Badr AlKhamissi, et al. ∙

research

∙ 05/02/2022

OPT: Open Pre-trained Transformer Language Models

Large language models, which are often trained for hundreds of thousands...

8 Susan Zhang, et al. ∙

research

∙ 04/12/2022

A Review on Language Models as Knowledge Bases

Recently, there has been a surge of interest in the NLP community on the...

7 Badr AlKhamissi, et al. ∙

research

∙ 02/19/2022

CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

To date, efforts in the code-switching literature have focused for the m...

2 Shuguang Chen, et al. ∙

research

∙ 01/25/2022

A Quantitative and Qualitative Analysis of Schizophrenia Language

Schizophrenia is one of the most disabling mental health conditions to l...

2 Amal Alqahtani, et al. ∙

research

∙ 12/20/2021

Efficient Large Scale Language Modeling with Mixtures of Experts

Mixture of Experts layers (MoEs) enable efficient scaling of language mo...

10 Mikel Artetxe, et al. ∙

research

∙ 12/20/2021

Few-shot Learning with Multilingual Language Models

Large-scale autoregressive language models such as GPT-3 are few-shot le...

8 Xi Victoria Lin, et al. ∙

research

∙ 12/16/2021

Commonsense Knowledge-Augmented Pretrained Language Models for Causal Reasoning Classification

Commonsense knowledge can be leveraged for identifying causal relations ...

0 Pedram Hosseini, et al. ∙

research

∙ 11/26/2021

Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

Do language models have beliefs about the world? Dennett (1995) famously...

5 Peter Hase, et al. ∙

research

∙ 11/11/2021

AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization

Community Question Answering (CQA) fora such as Stack Overflow and Yahoo...

0 Alexander R. Fabbri, et al. ∙

research

∙ 06/02/2021

Discrete Cosine Transform as Universal Sentence Encoder

Modern sentence encoders are used to generate dense vector representatio...

0 Nada Almarwani, et al. ∙

research

∙ 06/01/2021

Gender Bias Amplification During Speed-Quality Optimization in Neural Machine Translation

Is bias amplified when neural machine translation (NMT) models are optim...

5 Adithya Renduchintala, et al. ∙

research

∙ 05/31/2021

Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data

The scarcity of parallel data is a major obstacle for training high-qual...

7 Wei-Jen Ko, et al. ∙

research

∙ 04/17/2021

Multi-Perspective Abstractive Answer Summarization

Community Question Answering (CQA) forums such as Stack Overflow and Yah...

0 Alexander R. Fabbri, et al. ∙

research

∙ 03/25/2021

Predicting Directionality in Causal Relations in Text

In this work, we test the performance of two bidirectional transformer-b...

0 Pedram Hosseini, et al. ∙

research

∙ 11/05/2020

Detecting Hallucinated Content in Conditional Neural Sequence Generation

Neural sequence models can generate highly fluent sentences but recent s...

2 Chunting Zhou, et al. ∙

research

∙ 06/07/2020

A Multitask Learning Approach for Diacritic Restoration

In many languages like Arabic, diacritics are used to specify pronunciat...

0 Sawsan Alqahtani, et al. ∙

research

∙ 05/07/2020

FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization

Neural abstractive summarization models are prone to generate content in...

0 Esin Durmus, et al. ∙

research

∙ 04/30/2020

Mutlitask Learning for Cross-Lingual Transfer of Semantic Dependencies

We describe a method for developing broad-coverage semantic dependency p...

0 Maryam Aminian, et al. ∙

research

∙ 04/27/2020

DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking

The increased focus on misinformation has spurred development of data an...

0 Christopher Hidey, et al. ∙

research

∙ 04/22/2020

Learning to Classify Intents and Slot Labels Given a Handful of Examples

Intent classification (IC) and slot filling (SF) are core components in ...

0 Jason Krone, et al. ∙

research

∙ 03/19/2020

Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text Collections

Summarizing data samples by quantitative measures has a long history, wi...

0 Yi-An Lai, et al. ∙

research

∙ 12/14/2019

Efficient Convolutional Neural Networks for Diacritic Restoration

Diacritic restoration has gained importance with the growing need for ma...

0 Sawsan Alqahtani, et al. ∙

research

∙ 12/10/2019

Homograph Disambiguation Through Selective Diacritic Restoration

Lexical ambiguity, a challenging phenomenon in all natural languages, is...

0 Sawsan Alqahtani, et al. ∙

research

∙ 10/02/2019

Identifying Nuances in Fake News vs. Satire: Using Semantic and Linguistic Cues

The blurry line between nefarious fake news and protected-speech satire ...

0 Or Levi, et al. ∙

research

∙ 09/28/2019

Overview for the Second Shared Task on Language Identification in Code-Switched Data

We present an overview of the second shared task on language identificat...

0 Giovanni Molina, et al. ∙

research

∙ 09/28/2019

Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data

We present our effort to create a large Multi-Layered representational r...

0 Mona Diab, et al. ∙

research

∙ 09/28/2019

WASA: A Web Application for Sequence Annotation

Data annotation is an important and necessary task for all NLP applicati...

0 Fahad AlGhamdi, et al. ∙

research

∙ 09/28/2019

Part of speech tagging for code switched data

We address the problem of Part of Speech tagging (POS) in the context of...

0 Fahad AlGhamdi, et al. ∙

research

∙ 09/18/2019

CASA-NLU: Context-Aware Self-Attentive Natural Language Understanding for Task-Oriented Chatbots

Natural Language Understanding (NLU) is a core component of dialog syste...

0 Arshit Gupta, et al. ∙

research

∙ 09/06/2019

Efficient Sentence Embedding using Discrete Cosine Transform

Vector averaging remains one of the most popular sentence embedding meth...

0 Nada Almarwani, et al. ∙

research

∙ 06/10/2019

Named Entity Recognition on Code-Switched Data: Overview of the CALCS 2018 Shared Task

In the third shared task of the Computational Approaches to Linguistic C...

0 Gustavo Aguilar, et al. ∙

research

∙ 05/31/2019

Leveraging Pretrained Word Embeddings for Part-of-Speech Tagging of Code Switching Data

Linguistic Code Switching (CS) is a phenomenon that occurs when multilin...

0 Fahad AlGhamdi, et al. ∙

research

∙ 05/23/2019

GWU NLP Lab at SemEval-2019 Task 3: EmoContext: Effective Contextual Information in Models for Emotion Detection in Sentence-level in a Multigenre Corpus

In this paper we present an emotion classifier model submitted to the Se...

0 Shabnam Tafreshi, et al. ∙

research

∙ 04/11/2019

Scalable Cross-Lingual Transfer of Neural Sentence Embeddings

We develop and investigate several cross-lingual alignment approaches fo...

0 Hanan Aldarmaki, et al. ∙

research

∙ 04/05/2019

Cross-Lingual Transfer of Semantic Roles: From Raw Text to Semantic Roles

We describe a transfer method based on annotation projection to develop ...

0 Maryam Aminian, et al. ∙

research

∙ 03/08/2019

Context-Aware Cross-Lingual Mapping

Cross-lingual word vectors are typically obtained by fitting an orthogon...

0 Hanan Aldarmaki, et al. ∙

research

∙ 03/08/2019

Context-Aware Crosslingual Mapping

Cross-lingual word vectors are typically obtained by fitting an orthogon...

0 Hanan Aldarmaki, et al. ∙

research

∙ 02/24/2019

The ARIEL-CMU Systems for LoReHLT18

This paper describes the ARIEL-CMU submissions to the Low Resource Human...

0 Aditi Chaudhary, et al. ∙

research

∙ 10/22/2018

Predictive Linguistic Features of Schizophrenia

Schizophrenia is one of the most disabling and difficult to treat of all...

0 Efsun Sarioglu Kayi, et al. ∙

research

∙ 06/12/2018

Evaluation of Unsupervised Compositional Representations

We evaluated various compositional models, from bag-of-words representat...

0 Hanan Aldarmaki, et al. ∙

Mona Diab

Featured Co-authors

Sign in with Google

Consider DeepAI Pro