Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification

06/17/2020
by Anton Alekseev, et al.

Deep learning architectures based on self-attention have recently matched and surpassed state-of-the-art results in unsupervised aspect extraction and topic modeling. While models such as neural attention-based aspect extraction (ABAE) have been successfully applied to user-generated texts, the aspects they learn are less coherent when the models are applied to traditional data sources such as news articles and newsgroup documents. In this work, we introduce a simple approach based on sentence filtering that improves the topical aspects learned from newsgroup-based content without modifying the basic mechanism of ABAE. We train a probabilistic classifier to distinguish between out-of-domain texts (outer dataset) and in-domain texts (target dataset). Then, during data preparation, we filter out sentences that have a low probability of being in-domain and train the neural model on the remaining sentences. The positive effect of sentence filtering on topic coherence is demonstrated by comparison with aspect extraction models trained on unfiltered texts.
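The filtering step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which probabilistic classifier or which probability cutoff is used, so everything concrete here is an assumption made for illustration: a small multinomial Naive Bayes model with Laplace smoothing, a hypothetical threshold of 0.5, toy sentences, and the hypothetical names `NaiveBayesDomainClassifier`, `prob_in_domain`, and `filter_in_domain`.

```python
import math
from collections import Counter


def tokenize(text):
    # Simplistic whitespace tokenizer (an assumption, not the paper's preprocessing).
    return text.lower().split()


class NaiveBayesDomainClassifier:
    """Toy multinomial Naive Bayes: label 1 = in-domain, label 0 = out-of-domain."""

    def fit(self, sentences, labels):
        self.counts = {0: Counter(), 1: Counter()}
        self.n_docs = Counter(labels)
        for sentence, label in zip(sentences, labels):
            self.counts[label].update(tokenize(sentence))
        self.vocab = set(self.counts[0]) | set(self.counts[1])
        self.totals = {y: sum(c.values()) for y, c in self.counts.items()}
        return self

    def prob_in_domain(self, sentence):
        n = sum(self.n_docs.values())
        log_scores = {}
        for y in (0, 1):
            score = math.log(self.n_docs[y] / n)  # class prior
            for tok in tokenize(sentence):
                if tok not in self.vocab:
                    continue  # ignore tokens never seen in training
                # Laplace-smoothed token likelihood
                score += math.log(
                    (self.counts[y][tok] + 1) / (self.totals[y] + len(self.vocab))
                )
            log_scores[y] = score
        # Normalize in log space to get P(in-domain | sentence)
        m = max(log_scores.values())
        exp = {y: math.exp(v - m) for y, v in log_scores.items()}
        return exp[1] / (exp[0] + exp[1])


def filter_in_domain(sentences, clf, threshold=0.5):
    # Keep only sentences whose in-domain probability reaches the threshold.
    return [s for s in sentences if clf.prob_in_domain(s) >= threshold]


# Toy data: "outer" (out-of-domain) vs. "target" (in-domain) sentences.
outer = ["the movie was great fun", "i loved the food at this restaurant"]
target = ["the gpu driver crashes on boot", "install the kernel patch for the driver"]

clf = NaiveBayesDomainClassifier().fit(
    outer + target, [0] * len(outer) + [1] * len(target)
)

candidates = ["the driver crashes after the kernel patch", "i loved the movie"]
kept = filter_in_domain(candidates, clf, threshold=0.5)
```

In this sketch, `kept` would then serve as the training corpus for the aspect extraction model, which itself is left unchanged. In practice the threshold would need tuning, since a stricter cutoff discards more sentences and shrinks the training set.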
