In Africa, and the world at large, there is an increasing focus on devel...
In this work, we develop and release Llama 2, a collection of pretrained...
Machine Translation (MT) has been widely used for cross-lingual classifi...
Existing metrics for evaluating the quality of automatically generated q...
Driven by the goal of eradicating language barriers on a global scale, m...
Generating factual, long-form text such as Wikipedia articles raises thr...
Users and organizations are generating ever-increasing amounts of privat...
Multi-task learning with an unbalanced data distribution skews model lea...
Recent work in multilingual machine translation (MMT) has focused on the...
We describe Facebook's multilingual model submission to the WMT2021 shar...
One of the biggest challenges hindering progress in low-resource and mul...
Attention mechanisms have shown promising results in sequence modeling t...
Pretrained multilingual models are able to perform cross-lingual transfe...
Semantic parsing using sequence-to-sequence models allows parsing of dee...
The two main research threads in computer-based music generation are: th...
While research on explaining predictions of open-domain QA systems (ODQA...
This paper describes Facebook AI's submission to WMT20 shared news trans...
Fact checking at scale is difficult – while the number of active fact ch...
Generating text from structured data is challenging because it requires ...
Existing work in translation demonstrated the potential of massively mul...
We introduce k-nearest-neighbor machine translation (kNN-MT), which pred...
Challenging problems such as open-domain question answering, fact checki...
Recent work demonstrates the potential of multilingual pretraining of cr...
We present our view of what is necessary to build an engaging open-domai...
Machine learning models are trained to find patterns in data. NLP models...
Progress in Sentence Simplification has been hindered by the lack of sup...
Various machine learning tasks can benefit from access to external infor...
We tackle the problem of producing compact models, maximizing their accu...
Transformers are feedforward networks that can process input tokens in p...
Procedurally generating cohesive and interesting game environments is ch...
Models often easily learn biases present in the training data, and their...
Query-based open-domain NLP tasks require information synthesis from lon...
Overparameterized transformer networks have obtained state of the art re...
We introduce the first large-scale corpus for long-form question answeri...
We propose a method to learn unsupervised sentence representations in a ...
fairseq is an open-source sequence modeling toolkit that allows research...
We introduce a large scale crowdsourced text adventure game as a researc...
Writers generally rely on plans or sketches to write long stories, but m...
Self-attention is a useful mechanism to build generative models for lang...
In open-domain dialogue intelligent agents should exhibit the use of kno...
We explore story generation: creative systems that can build coherent an...
Current models for document summarization ignore user preferences such a...
Latent Dirichlet Allocation (LDA) models trained without stopword remova...
The pre-dominant approach to language modeling to date is based on recur...