Personalized Showcases: Generating Multi-Modal Explanations for Recommendations

06/30/2022
by An Yan, et al.

Existing explanation models generate only text for recommendations, yet still struggle to produce diverse content. In this paper, to further enrich explanations, we propose a new task named personalized showcases, in which we provide both textual and visual information to explain our recommendations. Specifically, we first select a personalized image set that is most relevant to a user's interest in a recommended item. Natural language explanations are then generated conditioned on the selected images. For this new task, we collect a large-scale dataset from Google Local (i.e., maps) and construct a high-quality subset for generating multi-modal explanations. We propose a personalized multi-modal framework that generates diverse and visually aligned explanations via contrastive learning. Experiments show that our framework benefits from different modalities as inputs and produces more diverse and expressive explanations than previous methods across a variety of evaluation metrics.
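The two ideas named in the abstract, selecting images relevant to a user and aligning text with images through contrastive learning, can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify the selection rule or the loss, so the example assumes top-k selection by cosine similarity and a standard InfoNCE-style objective. The function names (`select_images`, `info_nce`), the toy embeddings, and the temperature value are all hypothetical.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_images(user_vec: Vector, image_vecs: Sequence[Vector], k: int) -> List[int]:
    """Assumed selection rule: indices of the k images most similar to the user."""
    ranked = sorted(
        range(len(image_vecs)),
        key=lambda i: cosine(user_vec, image_vecs[i]),
        reverse=True,
    )
    return ranked[:k]


def info_nce(text_vecs: Sequence[Vector], image_vecs: Sequence[Vector],
             temperature: float = 0.1) -> float:
    """Mean InfoNCE loss; text_vecs[i] and image_vecs[i] form the positive pair.

    Every other image in the batch acts as a negative for text i.
    """
    total = 0.0
    for i, text in enumerate(text_vecs):
        logits = [cosine(text, img) / temperature for img in image_vecs]
        # Numerically stable log-sum-exp over all candidates.
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(l - peak) for l in logits))
        total += log_denominator - logits[i]
    return total / len(text_vecs)


# Toy embeddings (hypothetical): one user vector and three candidate images.
user = [1.0, 0.0]
images = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
chosen = select_images(user, images, k=2)

# Matched text/image pairs should score a lower loss than mismatched ones.
texts = [[1.0, 0.0], [0.0, 1.0]]
aligned_loss = info_nce(texts, [[1.0, 0.0], [0.0, 1.0]])
shuffled_loss = info_nce(texts, [[0.0, 1.0], [1.0, 0.0]])
```

In this toy setup the loss is small when each text embedding points at its own image and large when the pairs are swapped, which is the behavior a contrastive objective is meant to encourage. A real system would use learned encoders and would likely also account for diversity within the selected image set.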


research
09/28/2022

UCEpic: Unifying Aspect Planning and Lexical Constraints for Explainable Recommendation

Personalized natural language generation for explainable recommendations...
research
05/22/2020

T-RECS: a Transformer-based Recommender Generating Textual Explanations and Integrating Unsupervised Language-based Critiquing

Supporting recommendations with personalized and relevant explanations i...
research
10/07/2021

Explanation as a process: user-centric construction of multi-level and multi-modal explanations

In the last years, XAI research has mainly been concerned with developin...
research
02/11/2021

Personalized Embedding-based e-Commerce Recommendations at eBay

Recommender systems are an essential component of e-commerce marketplace...
research
07/07/2020

PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest

Latent user representations are widely adopted in the tech industry for ...
research
09/29/2020

Where is the Model Looking At?–Concentrate and Explain the Network Attention

Image classification models have achieved satisfactory performance on ma...
research
03/11/2022

REX: Reasoning-aware and Grounded Explanation

Effectiveness and interpretability are two essential properties for trus...
