Data-Driven Information Extraction and Enrichment of Molecular Profiling Data for Cancer Cell Lines

07/03/2023
by   Ellery Smith, et al.
0

With the proliferation of research means and computational methodologies, published biomedical literature is growing exponentially in numbers and volume. As a consequence, in the fields of biological, medical and clinical research, domain experts have to sift through massive amounts of scientific text to find relevant information. However, this process is extremely tedious and slow to be performed by humans. Hence, novel computational information extraction and correlation mechanisms are required to boost meaningful knowledge extraction. In this work, we present the design, implementation and application of a novel data extraction and exploration system. This system extracts deep semantic relations between textual entities from scientific literature to enrich existing structured clinical data in the domain of cancer cell lines. We introduce a new public data exploration portal, which enables automatic linking of genomic copy number variants plots with ranked, related entities such as affected genes. Each relation is accompanied by literature-derived evidences, allowing for deep, yet rapid, literature search, using existing structured data as a springboard. Our system is publicly available on the web at https://cancercelllines.org

READ FULL TEXT
research
02/26/2019

A framework for information extraction from tables in biomedical literature

The scientific literature is growing exponentially, and professionals ar...
research
09/14/2021

A system for information extraction from scientific texts in Russian

In this paper, we present a system for information extraction from scien...
research
04/16/2020

Smaller p-values in genomics studies using distilled historical information

Medical research institutions have generated massive amounts of biologic...
research
11/19/2020

Entity Recognition and Relation Extraction from Scientific and Technical Texts in Russian

This paper is devoted to the study of methods for information extraction...
research
09/04/2023

Into the Single Cell Multiverse: an End-to-End Dataset for Procedural Knowledge Extraction in Biomedical Texts

Many of the most commonly explored natural language processing (NLP) inf...
research
02/09/2023

A Biomedical Knowledge Graph for Biomarker Discovery in Cancer

Structured and unstructured data and facts about drugs, genes, protein, ...
research
07/15/2023

Automated Knowledge Modeling for Cancer Clinical Practice Guidelines

Clinical Practice Guidelines (CPGs) for cancer diseases evolve rapidly d...

Please sign up or login with your details

Forgot password? Click here to reset