D2KE: From Distance to Kernel and Embedding

02/14/2018
by   Lingfei Wu, et al.
0

For many machine learning problem settings, particularly with structured inputs such as sequences or sets of objects, a distance measure between inputs can be specified more naturally than a feature representation. However, most standard machine models are designed for inputs with a vector feature representation. In this work, we consider the estimation of a function f:X→ based solely on a dissimilarity measure d:X×X→ between inputs. In particular, we propose a general framework to derive a family of positive definite kernels from a given dissimilarity measure, which subsumes the widely-used representative-set method as a special case, and relates to the well-known distance substitution kernel in a limiting case. We show that functions in the corresponding Reproducing Kernel Hilbert Space (RKHS) are Lipschitz-continuous w.r.t. the given distance metric. We provide a tractable algorithm to estimate a function from this RKHS, and show that it enjoys better generalizability than Nearest-Neighbor estimates. Our approach draws from the literature of Random Features, but instead of deriving feature maps from an existing kernel, we construct novel kernels from a random feature map, that we specify given the distance measure. We conduct classification experiments with such disparate domains as strings, time series, and sets of vectors, where our proposed framework compares favorably to existing distance-based learning methods such as k-nearest-neighbors, distance-substitution kernels, pseudo-Euclidean embedding, and the representative-set method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/18/2016

Kernel Distribution Embeddings: Universal Kernels, Characteristic Kernels and Kernel Metrics on Distributions

Kernel mean embeddings have recently attracted the attention of the mach...
research
10/27/2016

On Bochner's and Polya's Characterizations of Positive-Definite Kernels and the Respective Random Feature Maps

Positive-definite kernel functions are fundamental elements of kernel me...
research
04/09/2012

On Power-law Kernels, corresponding Reproducing Kernel Hilbert Space and Applications

The role of kernels is central to machine learning. Motivated by the imp...
research
10/06/2009

Functional learning through kernels

This paper reviews the functional aspects of statistical learning theory...
research
11/09/2018

Relative Error RKHS Embeddings for Gaussian Kernels

We show how to obliviously embed into the reproducing kernel Hilbert spa...
research
05/04/2020

Lecture notes: Efficient approximation of kernel functions

These lecture notes endeavour to collect in one place the mathematical b...
research
06/26/2023

Tanimoto Random Features for Scalable Molecular Machine Learning

The Tanimoto coefficient is commonly used to measure the similarity betw...

Please sign up or login with your details

Forgot password? Click here to reset