Towards the Latent Transcriptome

10/08/2018
by   Assya Trofimov, et al.
0

In this work we propose a method to compute continuous embeddings for kmers from raw RNA-seq data, in a reference-free fashion. We report that our model captures information of both DNA sequence similarity as well as DNA sequence abundance in the embedding latent space. We confirm the quality of these vectors by comparing them to known gene sub-structures and report that the latent space recovers exon information from raw RNA-Seq data from acute myeloid leukemia patients. Furthermore we show that this latent space allows the detection of genomic abnormalities such as translocations as well as patient-specific mutations, making this representation space both useful for visualization as well as analysis.

READ FULL TEXT
research
12/06/2018

Traversing Latent Space using Decision Ferns

The practice of transforming raw data to a feature space so that inferen...
research
08/26/2022

Comparing multiple latent space embeddings using topological analysis

The latent space model is one of the well-known methods for statistical ...
research
10/17/2019

Mapper Based Classifier

Topological data analysis aims to extract topological quantities from da...
research
09/16/2019

Unaligned Sequence Similarity Search Using Deep Learning

Gene annotation has traditionally required direct comparison of DNA sequ...
research
07/22/2022

TRUST-LAPSE: An Explainable Actionable Mistrust Scoring Framework for Model Monitoring

Continuous monitoring of trained ML models to determine when their predi...
research
09/26/2019

Mathematical Reasoning in Latent Space

We design and conduct a simple experiment to study whether neural networ...
research
09/01/2019

Latent Space Representations of Hypergraphs

The increasing prevalence of relational data describing interactions amo...

Please sign up or login with your details

Forgot password? Click here to reset