Learning Disentangled Representations of Texts with Application to Biomedical Abstracts

04/19/2018
by Sarthak Jain, et al.

We propose a method for learning disentangled sets of vector representations of texts that capture distinct aspects. We argue that such representations afford model transfer and interpretability. To induce disentangled embeddings, we propose an adversarial objective based on the (dis)similarity between triplets of documents w.r.t. specific aspects. Our motivating application concerns embedding abstracts describing clinical trials in a manner that disentangles the populations, interventions, and outcomes in a given trial. We show that the induced representations indeed encode these targeted clinically salient aspects and that they can be effectively used to perform aspect-specific retrieval. We demonstrate that the approach generalizes beyond this motivating example via experiments on two multi-aspect review corpora.
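To make the triplet idea concrete, here is a minimal sketch of a generic triplet margin loss over aspect-specific embeddings. It is not the authors' exact objective: the abstract does not give the formula, and the adversarial component is omitted. The function names, the use of cosine similarity, and the margin value are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_hinge_loss(anchor, similar, dissimilar, margin=1.0):
    """Hypothetical aspect-specific triplet loss (illustrative only).

    All three arguments are embeddings of documents for ONE aspect
    (e.g. population). The loss is zero once the anchor is closer to
    the similar document than to the dissimilar one by at least `margin`.
    """
    return max(0.0, margin - cosine(anchor, similar) + cosine(anchor, dissimilar))


# Toy 2-d embeddings (made up for this example, not from the paper).
satisfied = triplet_hinge_loss([1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
violated = triplet_hinge_loss([1.0, 0.0], [0.0, 1.0], [1.0, 0.0])
print(satisfied, violated)
```

In a disentangled setting one would presumably keep a separate encoder per aspect and apply a loss of this kind to each aspect's embeddings; that arrangement is likewise an assumption here, not a detail stated in the abstract.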

research
07/15/2023

AspectCSE: Sentence Embeddings for Aspect-based Semantic Textual Similarity using Contrastive Learning and Structured Knowledge

Generic sentence embeddings provide a coarse-grained approximation of se...
research
03/07/2018

Inferencing Based on Unsupervised Learning of Disentangled Representations

Combining Generative Adversarial Networks (GANs) with encoders that lear...
research
08/18/2020

Linear Disentangled Representations and Unsupervised Action Estimation

Disentangled representation learning has seen a surge in interest over r...
research
06/24/2019

Gauge theory and twins paradox of disentangled representations

Achieving disentangled representations of information is one of the key ...
research
04/14/2021

Disentangling Representations of Text by Masking Transformers

Representations from large pretrained models such as BERT encode a range...
research
10/06/2020

Are "Undocumented Workers" the Same as "Illegal Aliens"? Disentangling Denotation and Connotation in Vector Spaces

In politics, neologisms are frequently invented for partisan objectives....
research
05/18/2021

Multi-Aspect Temporal Network Embedding: A Mixture of Hawkes Process View

Recent years have witnessed the tremendous research interests in network...
