Résumé Parsing as Hierarchical Sequence Labeling: An Empirical Study

09/13/2023
by Federico Retyk, et al.

Extracting information from résumés is typically formulated as a two-stage problem: the document is first segmented into sections, and each section is then processed individually to extract the target entities. Instead, we cast the whole problem as sequence labeling at two levels – lines and tokens – and study model architectures for solving both tasks simultaneously. We build high-quality résumé parsing corpora in English, French, Chinese, Spanish, German, Portuguese, and Swedish. Based on these corpora, we present experimental results that demonstrate the effectiveness of the proposed models for the information extraction task, outperforming approaches introduced in previous work. We conduct an ablation study of the proposed architectures. We also analyze both model performance and resource efficiency, and describe the trade-offs for model deployment in the context of a production environment.
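
To make the two-level formulation concrete, here is a minimal Python sketch of how a résumé might be represented when every line carries a section label and every token carries an entity label. It is an illustration only, not the authors' data format or model. The class `LabeledLine`, the function `extract_entities`, the BIO tagging scheme, and all label names (`CONTACT`, `WORK`, `NAME`, `TITLE`, `ORG`) are assumptions introduced for the example. It also assumes, for simplicity, that an entity never spans more than one line.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class LabeledLine:
    """One résumé line with labels at both levels (hypothetical schema)."""
    tokens: List[str]
    token_labels: List[str]  # one BIO tag per token, e.g. "B-NAME", "I-NAME", "O"
    line_label: str          # one section tag per line, e.g. "CONTACT", "WORK"


def extract_entities(lines: List[LabeledLine]) -> List[Tuple[str, str, str]]:
    """Decode BIO spans line by line.

    Returns (section, entity_type, text) triples. Spans are closed at the
    end of each line, which is a simplifying assumption of this sketch.
    """
    entities: List[Tuple[str, str, str]] = []
    for line in lines:
        if len(line.tokens) != len(line.token_labels):
            raise ValueError("each token needs exactly one label")
        current_type = None
        current_tokens: List[str] = []
        for token, label in zip(line.tokens, line.token_labels):
            if label.startswith("B-"):
                # A new span begins; flush any span that was still open.
                if current_type is not None:
                    entities.append(
                        (line.line_label, current_type, " ".join(current_tokens))
                    )
                current_type, current_tokens = label[2:], [token]
            elif label.startswith("I-") and current_type == label[2:]:
                # Continuation of the open span.
                current_tokens.append(token)
            else:
                # "O" or an inconsistent "I-" tag closes the open span.
                if current_type is not None:
                    entities.append(
                        (line.line_label, current_type, " ".join(current_tokens))
                    )
                current_type, current_tokens = None, []
        if current_type is not None:
            entities.append(
                (line.line_label, current_type, " ".join(current_tokens))
            )
    return entities


# Toy input with made-up labels, standing in for the output of a joint model.
resume = [
    LabeledLine(["Jane", "Doe"], ["B-NAME", "I-NAME"], "CONTACT"),
    LabeledLine(["Experience"], ["O"], "WORK"),
    LabeledLine(
        ["Data", "Scientist", "at", "Acme"],
        ["B-TITLE", "I-TITLE", "O", "B-ORG"],
        "WORK",
    ),
]

for section, entity_type, text in extract_entities(resume):
    print(section, entity_type, text)
```

Because each entity is returned together with the section label of its line, a downstream consumer can tell a job title in a work-experience section apart from the same phrase elsewhere, without running a separate segmentation stage first.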

