Content Based Document Recommender using Deep Learning

10/23/2017
by   Nishant Nikhil, et al.
0

With the recent advancements in information technology there has been a huge surge in amount of data available. But information retrieval technology has not been able to keep up with this pace of information generation resulting in over spending of time for retrieving relevant information. Even though systems exist for assisting users to search a database along with filtering and recommending relevant information, but recommendation system which uses content of documents for recommendation still have a long way to mature. Here we present a Deep Learning based supervised approach to recommend similar documents based on the similarity of content. We combine the C-DSSM model with Word2Vec distributed representations of words to create a novel model to classify a document pair as relevant/irrelavant by assigning a score to it. Using our model retrieval of documents can be done in O(1) time and the memory complexity is O(n), where n is number of documents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2022

Enhanced vectors for top-k document retrieval in Question Answering

Modern day applications, especially information retrieval webapps that i...
research
12/06/2019

Document Network Embedding: Coping for Missing Content and Missing Links

Searching through networks of documents is an important task. A promisin...
research
12/09/2021

From Scattered Sources to Comprehensive Technology Landscape: A Recommendation-based Retrieval Approach

Mapping the technology landscape is crucial for market actors to take in...
research
10/28/2021

An AI-based Approach for Tracing Content Requirements in Financial Documents

The completeness (in terms of content) of financial documents is a funda...
research
11/24/2018

Novelty and Coverage in context-based information filtering

We present a collection of algorithms to filter a stream of documents in...
research
06/23/2020

On the Programmatic Generation of Reproducible Documents

Reproducible document standards, like R Markdown, facilitate the program...
research
06/07/2018

Content-Based Quality Estimation for Automatic Subject Indexing of Short Texts under Precision and Recall Constraints

Semantic annotations have to satisfy quality constraints to be useful fo...

Please sign up or login with your details

Forgot password? Click here to reset