Generalising sequence models for epigenome predictions with tissue and assay embeddings

08/22/2023
by   Jacob Deasy, et al.
0

Sequence modelling approaches for epigenetic profile prediction have recently expanded in terms of sequence length, model size, and profile diversity. However, current models cannot infer on many experimentally feasible tissue and assay pairs due to poor usage of contextual information, limiting in silico understanding of regulatory genomics. We demonstrate that strong correlation can be achieved across a large range of experimental conditions by integrating tissue and assay embeddings into a Contextualised Genomic Network (CGN). In contrast to previous approaches, we enhance long-range sequence embeddings with contextual information in the input space, rather than expanding the output space. We exhibit the efficacy of our approach across a broad set of epigenetic profiles and provide the first insights into the effect of genetic variants on epigenetic sequence model training. Our general approach to context integration exceeds state of the art in multiple settings while employing a more rigorous validation procedure.

READ FULL TEXT

page 4

page 11

page 12

page 13

page 14

page 15

research
08/11/2023

Designing a User Contextual Profile Ontology: A Focus on the Vehicle Sales Domain

In the digital age, it is crucial to understand and tailor experiences f...
research
12/09/2018

Joint Vertebrae Identification and Localization in Spinal CT Images by Combining Short- and Long-Range Contextual Information

Automatic vertebrae identification and localization from arbitrary CT im...
research
09/17/2020

More Embeddings, Better Sequence Labelers?

Recent work proposes a family of contextual embeddings that significantl...
research
03/30/2021

Locally-Contextual Nonlinear CRFs for Sequence Labeling

Linear chain conditional random fields (CRFs) combined with contextual w...
research
11/14/2014

Predictive Encoding of Contextual Relationships for Perceptual Inference, Interpolation and Prediction

We propose a new neurally-inspired model that can learn to encode the gl...
research
11/03/2022

Contextual information integration for stance detection via cross-attention

Stance detection deals with the identification of an author's stance tow...
research
05/22/2023

Friendly Neighbors: Contextualized Sequence-to-Sequence Link Prediction

We propose KGT5-context, a simple sequence-to-sequence model for link pr...

Please sign up or login with your details

Forgot password? Click here to reset