Efficiently Discovering Hammock Paths from Induced Similarity Networks

02/17/2010
by M. Shahriar Hossain, et al.

Similarity networks are important abstractions in many information management applications such as recommender systems, corpora analysis, and medical informatics. For instance, by inducing similarity networks between movies rated similarly by users, between documents containing common terms, or between clinical trials involving the same themes, we can aim to find the global structure of connectivities underlying the data and use the network as a basis for making connections between seemingly disparate entities. In the above applications, composing similarities between objects of interest finds uses in serendipitous recommendation, in storytelling, and in clinical diagnosis, respectively. We present an algorithmic framework for traversing similarity paths using the notion of 'hammock' paths, which generalize traditional paths. Our framework is exploratory in nature: given starting and ending objects of interest, it explores candidate objects for path following and uses heuristics to admissibly estimate the potential for paths to lead to a desired destination. We present three diverse applications: exploring movie similarities in the Netflix dataset, exploring abstract similarities across the PubMed corpus, and exploring description similarities in a database of clinical trials. Experimental results demonstrate the potential of our approach for unstructured knowledge discovery in similarity networks.
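
To make the idea of inducing a similarity network and following a path through it concrete, here is a minimal Python sketch. It is not the authors' method: the abstract does not define hammock paths or the heuristics used, so this example assumes a plain path over objects that share at least `min_shared` features, and it substitutes ordinary breadth-first search for the paper's heuristic exploration. The function names, the `min_shared` parameter, and the toy documents are all hypothetical.

```python
from collections import deque
from itertools import combinations


def induce_similarity_network(features, min_shared=1):
    """Link two objects when they share at least min_shared features.

    features maps an object name to a set of features (e.g. terms).
    The threshold rule is an assumption made for illustration only.
    """
    graph = {name: set() for name in features}
    for a, b in combinations(features, 2):
        if len(features[a] & features[b]) >= min_shared:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def find_path(graph, start, goal):
    """Breadth-first search; returns a shortest path as a list, or None."""
    if start not in graph or goal not in graph:
        return None
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk the parent links back to the start.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbor in sorted(graph[node]):
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return None


# Toy documents described by the terms they contain (made-up data).
docs = {
    "d1": {"gene", "protein", "binding"},
    "d2": {"protein", "binding", "receptor"},
    "d3": {"receptor", "signal", "binding"},
    "d4": {"signal", "pathway", "receptor"},
}

network = induce_similarity_network(docs, min_shared=2)
path = find_path(network, "d1", "d4")
print(path)
```

In this toy data, `d1` and `d4` share no terms, yet a chain of pairwise-similar documents connects them. Raising `min_shared` demands stronger overlap between consecutive objects and can disconnect the network entirely.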


research
01/02/2018

A Semantic-Rich Similarity Measure in Heterogeneous Information Networks

Measuring the similarities between objects in information networks has f...
research
05/27/2014

A Topic Model Approach to Multi-Modal Similarity

Calculating similarities between objects defined by many heterogeneous d...
research
07/27/2022

The Leaf Clinical Trials Corpus: a new resource for query generation from clinical trial eligibility criteria

Identifying cohorts of patients based on eligibility criteria such as me...
research
10/14/2022

Shadfa 0.1: The Iranian Movie Knowledge Graph and Graph-Embedding-Based Recommender System

Movies are a great source of entertainment. However, the problem arises ...
research
02/08/2019

Covariance and Correlation Kernels on a Graph in the Generalized Bag-of-Paths Formalism

This work derives closed-form expressions computing the expectation of c...
research
12/25/2017

Recurrent Meta-Structure for Robust Similarity Measure in Heterogeneous Information Networks

Similarity measure as a fundamental task in heterogeneous information ne...
research
02/21/2016

Interactive Storytelling over Document Collections

Storytelling algorithms aim to 'connect the dots' between disparate docu...
