Multi-BERT for Embeddings for Recommendation System

08/24/2023
by Shashidhar Reddy Javaji, et al.

In this paper, we propose a novel approach for generating document embeddings that combines Sentence-BERT (SBERT) and RoBERTa, two state-of-the-art natural language processing models. The approach treats sentences as tokens and generates an embedding for each one, which lets the model capture both intra-sentence and inter-sentence relations within a document. We evaluate it on a book recommendation task using the Goodreads dataset, comparing the document embeddings generated by our MULTI-BERT model with those generated by SBERT alone, with precision as the evaluation metric. In these experiments, our model consistently outperformed SBERT in the quality of the generated embeddings, and it captured more nuanced semantic relations within documents, leading to more accurate recommendations. Overall, the results suggest that the approach is a promising direction for improving the performance of recommendation systems.
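The abstract names precision as the evaluation metric but does not say how recommendations are ranked, what cutoff is used, or how relevance is defined. The sketch below is one common reading, not the authors' procedure: it assumes books are ranked by cosine similarity between document embeddings and scored with precision at a cutoff k. The function names, the toy three-dimensional vectors, and the relevance set are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def recommend(query_id, embeddings, k):
    """Return the k book ids most similar to query_id, excluding the query itself."""
    query = embeddings[query_id]
    scored = [
        (cosine(query, vec), book_id)
        for book_id, vec in embeddings.items()
        if book_id != query_id
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [book_id for _, book_id in scored[:k]]


def precision_at_k(recommended, relevant):
    """Fraction of the recommended ids that appear in the relevant set."""
    if not recommended:
        return 0.0
    hits = sum(1 for book_id in recommended if book_id in relevant)
    return hits / len(recommended)


# Toy document embeddings (hypothetical; real ones would come from the model).
embeddings = {
    "book_a": [1.0, 0.0, 0.0],
    "book_b": [0.9, 0.1, 0.0],
    "book_c": [0.0, 1.0, 0.0],
    "book_d": [0.8, 0.0, 0.2],
}
# Assumed ground truth: books considered relevant to a reader of book_a.
relevant_to_a = {"book_b", "book_c"}

top2 = recommend("book_a", embeddings, k=2)
score = precision_at_k(top2, relevant_to_a)
print(top2, score)
```

Under this reading, comparing two embedding models means running the same ranking and scoring over each model's vectors and comparing the resulting precision values.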

research
11/01/2020

Vec2Sent: Probing Sentence Embeddings with Natural Language Generation

We introspect black-box sentence embeddings by conditionally generating ...
research
06/30/2020

Segmentation Approach for Coreference Resolution Task

In coreference resolution, it is important to consider all members of a ...
research
02/07/2021

Spoiler Alert: Using Natural Language Processing to Detect Spoilers in Book Reviews

This paper presents an NLP (Natural Language Processing) approach to det...
research
04/28/2023

Are the Best Multilingual Document Embeddings simply Based on Sentence Embeddings?

Dense vector representations for textual data are crucial in modern NLP....
research
11/15/2017

BoostJet: Towards Combining Statistical Aggregates with Neural Embeddings for Recommendations

Recommenders have become widely popular in recent years because of their...
research
06/17/2020

Quick Lists: Enriched Playlist Embeddings for Future Playlist Recommendation

Recommending playlists to users in the context of a digital music servic...
research
01/25/2021

A Hybrid Approach to Measure Semantic Relatedness in Biomedical Concepts

Objective: This work aimed to demonstrate the effectiveness of a hybrid ...
