Leveraging Subword Embeddings for Multinational Address Parsing

06/29/2020
by Marouane Yassine, et al.

Address parsing consists of identifying the segments that make up an address, such as a street name or a postal code. Because of its importance for tasks like record linkage, address parsing has been approached with many techniques. Neural network methods have defined a new state of the art for address parsing. While this approach has yielded notable results, previous work has only applied neural networks to parsing addresses from a single source country. We propose an approach in which we employ subword embeddings and a Recurrent Neural Network architecture to build a single model capable of learning to parse addresses from multiple countries at the same time, while taking into account the differences in languages and address formatting systems. We achieved accuracies around 99% on the countries used for training, with no pre-processing or post-processing needed. In addition, we explore the possibility of transferring the address parsing knowledge attained by training on some countries' addresses to others with no further training. This setting is also called zero-shot transfer learning. We achieve good results for 80% of the countries (34 out of 41), with almost 50% near state-of-the-art performance.
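
The abstract does not say which subword scheme is used, so the following is only a minimal sketch, assuming fastText-style character n-grams. The function names, the n-gram range, and the hash-seeded vectors (a stand-in for learned embeddings) are illustrative assumptions, not the authors' implementation. It shows why subword units can help across languages: a token never seen as a whole word still gets a vector built from pieces it shares with known tokens.

```python
import hashlib
import random


def char_ngrams(token, n_min=3, n_max=4):
    """Return the character n-grams of a token wrapped in boundary markers."""
    wrapped = f"<{token}>"
    return [
        wrapped[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(wrapped) - n + 1)
    ]


def ngram_vector(ngram, dim=8):
    """Hypothetical stand-in for a learned n-gram embedding.

    A real model would look this vector up in a trained table; here it is a
    deterministic pseudo-random vector seeded by a hash of the n-gram.
    """
    digest = hashlib.md5(ngram.encode("utf-8")).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def token_vector(token, dim=8):
    """Embed a token as the average of its n-gram vectors."""
    grams = char_ngrams(token)
    if not grams:  # only the empty token yields no n-grams
        return [0.0] * dim
    vectors = [ngram_vector(g, dim) for g in grams]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


# Two street-type tokens share n-grams, so their vectors are built
# partly from the same pieces.
shared = set(char_ngrams("strasse")) & set(char_ngrams("hauptstrasse"))
```

In a tagger of the kind the abstract describes, vectors like these would be fed token by token into a recurrent network that predicts one address-segment label per token.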

