Attentive VQ-VAE

09/20/2023
by   Mariano Rivera, et al.
0

We present a novel approach to enhance the capabilities of VQVAE models through the integration of an Attentive Residual Encoder (AREN) and a Residual Pixel Attention layer. The objective of our research is to improve the performance of VQVAE while maintaining practical parameter levels. The AREN encoder is designed to operate effectively at multiple levels, accommodating diverse architectural complexities. The key innovation is the integration of an inter-pixel auto-attention mechanism into the AREN encoder. This approach allows us to efficiently capture and utilize contextual information across latent vectors. Additionally, our models uses additional encoding levels to further enhance the model's representational power. Our attention layer employs a minimal parameter approach, ensuring that latent vectors are modified only when pertinent information from other pixels is available. Experimental results demonstrate that our proposed modifications lead to significant improvements in data representation and generation, making VQVAEs even more suitable for a wide range of applications.

READ FULL TEXT
research
10/10/2016

Modelling Sentence Pairs with Tree-structured Attentive Encoder

We describe an attentive encoder that combines tree-structured recursive...
research
02/27/2020

CATA++: A Collaborative Dual Attentive Autoencoder Method for Recommending Scientific Articles

Recommender systems today have become an essential component of any comm...
research
05/27/2019

SAIN: Self-Attentive Integration Network for Recommendation

With the growing importance of personalized recommendation, numerous rec...
research
11/18/2019

FFA-Net: Feature Fusion Attention Network for Single Image Dehazing

In this paper, we propose an end-to-end feature fusion at-tention networ...
research
07/13/2021

Visual Parser: Representing Part-whole Hierarchies with Transformers

Human vision is able to capture the part-whole hierarchical information ...
research
12/17/2019

Jointly Trained Image and Video Generation using Residual Vectors

In this work, we propose a modeling technique for jointly training image...
research
07/03/2019

Image Super-Resolution Using Attention Based DenseNet with Residual Deconvolution

Image super-resolution is a challenging task and has attracted increasin...

Please sign up or login with your details

Forgot password? Click here to reset