Enhancing Latent Space Clustering in Multi-filter Seq2Seq Model: A Reinforcement Learning Approach

09/25/2021
by   Zhaokun Xue, et al.
0

In sequence-to-sequence language processing tasks, sentences with heterogeneous semantics or grammatical structures may increase the difficulty of convergence while training the network. To resolve this problem, we introduce a model that concentrates the each of the heterogeneous features in the input-output sequences. Build upon the encoder-decoder architecture, we design a latent-enhanced multi-filter seq2seq model (LMS2S) that analyzes the latent space representations using a clustering algorithm. The representations are generated from an encoder and a latent space enhancer. A cluster classifier is applied to group the representations into clusters. A soft actor-critic reinforcement learning algorithm is applied to the cluster classifier to enhance the clustering quality by maximizing the Silhouette score. Then, multiple filters are trained by the features only from their corresponding clusters, the heterogeneity of the training data can be resolved accordingly. Our experiments on semantic parsing and machine translation demonstrate the positive correlation between the clustering quality and the model's performance, as well as show the enhancement our model has made with respect to the ordinary encoder-decoder model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2021

Representation Learning in Sequence to Sequence Tasks: Multi-filter Gaussian Mixture Autoencoder

Heterogeneity of sentences exists in sequence to sequence tasks such as ...
research
02/22/2022

Learning Cluster Patterns for Abstractive Summarization

Nowadays, pre-trained sequence-to-sequence models such as BERTSUM and BA...
research
04/01/2021

Learning Deep Latent Subspaces for Image Denoising

Heterogeneity exists in most camera images. This heterogeneity manifests...
research
10/22/2021

Clustering of Bank Customers using LSTM-based encoder-decoder and Dynamic Time Warping

Clustering is an unsupervised data mining technique that can be employed...
research
08/22/2021

Self-Supervised Delineation of Geological Structures using Orthogonal Latent Space Projection

We developed two machine learning frameworks that could assist in automa...
research
05/11/2021

Variational Auto Encoder Gradient Clustering

Clustering using deep neural network models have been extensively studie...
research
11/15/2021

Recursive Self-Improvement for Camera Image and Signal Processing Pipeline

Current camera image and signal processing pipelines (ISPs), including d...

Please sign up or login with your details

Forgot password? Click here to reset