Do NLP Models Know Numbers? Probing Numeracy in Embeddings

09/17/2019
by Eric Wallace, et al.

The ability to understand and work with numbers (numeracy) is critical for many complex reasoning tasks. Currently, most NLP models treat numbers in text the same way as other tokens: they embed them as distributed vectors. Is this enough to capture numeracy? We begin by investigating the numerical reasoning capabilities of a state-of-the-art question answering model on the DROP dataset. We find this model excels on questions that require numerical reasoning, i.e., it already captures numeracy. To understand how this capability emerges, we probe token embedding methods (e.g., BERT, GloVe) on synthetic list maximum, number decoding, and addition tasks. A surprising degree of numeracy is naturally present in standard embeddings. For example, GloVe and word2vec accurately encode magnitude for numbers up to 1,000. Furthermore, character-level embeddings are even more precise (ELMo captures numeracy best among all pre-trained methods), whereas BERT, which uses sub-word units, is less exact.
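To make the three synthetic probing tasks concrete, here is a minimal sketch of how their input/target pairs might be generated. It is an illustration under stated assumptions, not the authors' code: the function names, the list length of 5, the integer range [0, 1000), and the number of examples are all hypothetical choices. In an actual probe, each integer would be replaced by its embedding vector (e.g., from GloVe or BERT) before being passed to a small probing model.

```python
import random


def make_list_maximum(rng, low, high, list_len=5):
    """One list-maximum example: distinct integers and the index of the largest.

    list_len=5 is an assumption made for this sketch.
    """
    numbers = rng.sample(range(low, high), list_len)
    return numbers, numbers.index(max(numbers))


def make_decoding(rng, low, high):
    """One number-decoding example: a number token and its value as a regression target."""
    n = rng.randrange(low, high)
    return n, float(n)


def make_addition(rng, low, high):
    """One addition example: a pair of numbers and their sum as a regression target."""
    a = rng.randrange(low, high)
    b = rng.randrange(low, high)
    return (a, b), float(a + b)


def build_probe_data(n_examples=3, low=0, high=1000, seed=0):
    """Build a small dataset for each task; the range [low, high) is an assumption."""
    rng = random.Random(seed)
    return {
        "list_maximum": [make_list_maximum(rng, low, high) for _ in range(n_examples)],
        "decoding": [make_decoding(rng, low, high) for _ in range(n_examples)],
        "addition": [make_addition(rng, low, high) for _ in range(n_examples)],
    }


if __name__ == "__main__":
    for task, examples in build_probe_data().items():
        print(task, examples)
```

Keeping the targets as plain numbers (an index for list maximum, a float for decoding and addition) reflects the idea that the probe is judged on whether magnitude can be recovered from the embedding alone.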


