Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation

04/24/2023
by   Yan Zhou, et al.
0

The main idea of multimodal recommendation is the rational utilization of the item's multimodal information to improve the recommendation performance. Previous works directly integrate item multimodal features with item ID embeddings, ignoring the inherent semantic relations contained in the multimodal features. In this paper, we propose a novel and effective aTtention-guided Multi-step FUsion Network for multimodal recommendation, named TMFUN. Specifically, our model first constructs modality feature graph and item feature graph to model the latent item-item semantic structures. Then, we use the attention module to identify inherent connections between user-item interaction data and multimodal data, evaluate the impact of multimodal data on different interactions, and achieve early-step fusion of item features. Furthermore, our model optimizes item representation through the attention-guided multi-step fusion strategy and contrastive learning to improve recommendation performance. The extensive experiments on three real-world datasets show that our model has superior performance compared to the state-of-the-art models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/28/2023

Enhancing Dyadic Relations with Homogeneous Graphs for Multimodal Recommendation

User interaction data in recommender systems is a form of dyadic relatio...
research
02/13/2019

Interest-Related Item Similarity Model Based on Multimodal Data for Top-N Recommendation

Nowadays, the recommendation systems are applied in the fields of e-comm...
research
05/12/2023

Knowledge Soft Integration for Multimodal Recommendation

One of the main challenges in modern recommendation systems is how to ef...
research
08/28/2019

Attention-based Fusion for Outfit Recommendation

This paper describes an attention-based fusion method for outfit recomme...
research
04/08/2021

Multimodal Fusion Refiner Networks

Tasks that rely on multi-modal information typically include a fusion mo...
research
11/13/2022

A Tale of Two Graphs: Freezing and Denoising Graph Structures for Multimodal Recommendation

Multimodal recommender systems utilizing multimodal features (e.g. image...
research
02/21/2022

GIFT: Graph-guIded Feature Transfer for Cold-Start Video Click-Through Rate Prediction

Short video has witnessed rapid growth in China and shows a promising ma...

Please sign up or login with your details

Forgot password? Click here to reset