Just an Update on PMING Distance for Web-based Semantic Similarity in Artificial Intelligence and Data Mining

01/09/2017
by   Valentina Franzoni, et al.
0

One of the main problems that emerges in the classic approach to semantics is the difficulty in acquisition and maintenance of ontologies and semantic annotations. On the other hand, the Internet explosion and the massive diffusion of mobile smart devices lead to the creation of a worldwide system, which information is daily checked and fueled by the contribution of millions of users who interacts in a collaborative way. Search engines, continually exploring the Web, are a natural source of information on which to base a modern approach to semantic annotation. A promising idea is that it is possible to generalize the semantic similarity, under the assumption that semantically similar terms behave similarly, and define collaborative proximity measures based on the indexing information returned by search engines. The PMING Distance is a proximity measure used in data mining and information retrieval, which collaborative information express the degree of relationship between two terms, using only the number of documents returned as result for a query on a search engine. In this work, the PMINIG Distance is updated, providing a novel formal algebraic definition, which corrects previous works. The novel point of view underlines the features of the PMING to be a locally normalized linear combination of the Pointwise Mutual Information and Normalized Google Distance. The analyzed measure dynamically reflects the collaborative change made on the web resources.

READ FULL TEXT

page 1

page 2

page 3

research
01/19/2017

Semantic Evolutionary Concept Distances for Effective Information Retrieval in Query Expansion

In this work several semantic approaches to concept-based query expansio...
research
12/17/2016

Web-based Semantic Similarity for Emotion Recognition in Web Objects

In this project we propose a new approach for emotion recognition using ...
research
04/11/2010

Probabilistic Semantic Web Mining Using Artificial Neural Analysis

Most of the web user's requirements are search or navigation time and ge...
research
02/20/2015

Web Similarity

Normalized web distance (NWD) is a similarity or normalized semantic dis...
research
10/21/2020

Effective Data Scraping Strategies and Resources for Digital Marketers

Data scraping is not a new practice. It pre-dates the internet and exist...
research
03/23/2021

HSEarch: semantic search system for workplace accident reports

Semantic search engines, which integrate the output of text mining (TM) ...
research
12/05/2013

ABC-SG: A New Artificial Bee Colony Algorithm-Based Distance of Sequential Data Using Sigma Grams

The problem of similarity search is one of the main problems in computer...

Please sign up or login with your details

Forgot password? Click here to reset