tmVar 3.0: an improved variant concept recognition and normalization tool

04/07/2022
by   Chih-Hsuan Wei, et al.
0

Previous studies have shown that automated text-mining tools are becoming increasingly important for successfully unlocking variant information in scientific literature at large scale. Despite multiple attempts in the past, existing tools are still of limited recognition scope and precision. We propose tmVar 3.0: an improved variant recognition and normalization tool. Compared to its predecessors, tmVar 3.0 is able to recognize a wide spectrum of variant related entities (e.g., allele and copy number variants), and to group different variant mentions belonging to the same concept in an article for improved accuracy. Moreover, tmVar3 provides additional variant normalization options such as allele-specific identifiers from the ClinGen Allele Registry. tmVar3 exhibits a state-of-the-art performance with over 90 F-measure in variant recognition and normalization, when evaluated on three independent benchmarking datasets. tmVar3 is freely available for download. We have also processed the entire PubMed and PMC with tmVar3 and released its annotations on our FTP. Availability: ftp://ftp.ncbi.nlm.nih.gov/pub/lu/tmVar3

READ FULL TEXT
research
09/29/2020

Realistic Image Normalization for Multi-Domain Segmentation

Image normalization is a building block in medical image analysis. Conve...
research
04/03/2019

A Large-Scale Comparison of Historical Text Normalization Systems

There is no consensus on the state-of-the-art approach to historical tex...
research
12/12/2019

Local Context Normalization: Revisiting Local Normalization

Normalization layers have been shown to improve convergence in deep neur...
research
11/11/2019

A hybrid text normalization system using multi-head self-attention for mandarin

In this paper, we propose a hybrid text normalization system using multi...
research
09/18/2019

Most General Variant Unifiers

Equational unification of two terms consists of finding a substitution t...
research
03/04/2023

Decompose, Adjust, Compose: Effective Normalization by Playing with Frequency for Domain Generalization

Domain generalization (DG) is a principal task to evaluate the robustnes...
research
09/22/2020

Variant-based Equational Unification under Constructor Symbols

Equational unification of two terms consists of finding a substitution t...

Please sign up or login with your details

Forgot password? Click here to reset