Antibody Watch: Text Mining Antibody Specificity from the Literature

08/05/2020
by Chun-Nan Hsu, et al.

Motivation: Antibodies are widely used reagents for testing protein expression. However, an antibody that does not bind specifically to the target protein its provider designed it for can lead to unreliable research results. Many approaches have been proposed to address the problem of antibody specificity, but they may not scale to the millions of antibodies available to researchers. In this study, we investigate the feasibility of automatically generating a report that alerts users to problematic antibodies by extracting statements about antibody specificity reported in the literature.

Results: Our goal is to construct an "Antibody Watch" knowledge base containing supporting statements about problematic antibodies. We developed a deep neural network system and tested its performance on a corpus of more than two thousand articles that reported uses of antibodies. We divided the problem into two tasks. Given an input article, the first task is to identify snippets about antibody specificity and classify whether each snippet reports that an antibody exhibits nonspecificity and is therefore problematic. The second task is to link each of these snippets to one or more antibodies mentioned in the snippet. The experimental evaluation shows that our system can accurately perform both the classification and linking tasks, with weighted F-scores over 0.925 and 0.923, respectively, and 0.914 overall when the two are combined to complete the joint task. We leveraged Research Resource Identifiers (RRID) to precisely identify the antibodies linked to the extracted specificity snippets. The results show that it is feasible to construct a reliable knowledge base about problematic antibodies by text mining.
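The two tasks described in the abstract (classify a snippet, then link it to antibodies) can be illustrated with a minimal Python sketch. This is not the authors' system: the abstract describes a deep neural network, whereas `classify_snippet` below is a hypothetical keyword stand-in, and the linking step simply assumes that the RRID appears inside the snippet in the usual `RRID:AB_<digits>` form used for antibodies. The function names, the two-label scheme, and the example sentences with their identifiers are all made up for illustration.

```python
import re
from collections import defaultdict

# Antibody RRIDs are written as "RRID:AB_<digits>".
RRID_PATTERN = re.compile(r"RRID:\s*(AB_\d+)")

# Hypothetical cue phrases; the paper uses a trained neural classifier instead.
NONSPECIFIC_CUES = ("nonspecific", "non-specific", "cross-react")


def classify_snippet(text: str) -> str:
    """Task 1 (toy stand-in): label a snippet as 'nonspecific' or 'other'."""
    lowered = text.lower()
    if any(cue in lowered for cue in NONSPECIFIC_CUES):
        return "nonspecific"
    return "other"


def link_antibodies(text: str) -> list:
    """Task 2 (toy stand-in): return antibody RRIDs mentioned in the snippet."""
    return RRID_PATTERN.findall(text)


def build_antibody_watch(snippets: list) -> dict:
    """Joint task: map each RRID to the snippets reporting nonspecificity."""
    watch = defaultdict(list)
    for text in snippets:
        if classify_snippet(text) != "nonspecific":
            continue
        for rrid in link_antibodies(text):
            watch[rrid].append(text)
    return dict(watch)


# Made-up example sentences and identifiers, for illustration only.
example_snippets = [
    "The antibody (RRID:AB_123456) produced nonspecific bands in knockout tissue.",
    "Anti-GFP (RRID:AB_654321) was validated by western blot.",
]
watch = build_antibody_watch(example_snippets)
for rrid, statements in watch.items():
    print(rrid, len(statements))
```

Keying the knowledge base by RRID rather than by antibody name reflects the abstract's point that RRIDs identify antibodies precisely; everything else in the sketch is a simplification.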
