The Weight Function in the Subtree Kernel is Decisive

04/10/2019
by   Romain Azaïs, et al.
0

Tree data are ubiquitous because they model a large variety of situations, e.g., the architecture of plants, the secondary structure of RNA, or the hierarchy of XML files. Nevertheless, the analysis of these non-Euclidean data is difficul per se. In this paper, we focus on the subtree kernel that is a convolution kernel for tree data introduced by Vishwanathan and Smola in the early 2000's. More precisely, we investigate the influence of the weight function from a theoretical perspective and in real data applications. We establish on a 2-classes stochastic model that the performance of the subtree kernel is improved when the weight of leaves vanishes, which motivates the definition of a new weight function, learned from the data and not fixed by the user as usually done. To this end, we define a unified framework for computing the subtree kernel from ordered or unordered trees, that is particularly suitable for tuning parameters. We show through two real data classification problems the great efficiency of our approach, in particular with respect to the ones considered in the literature, which also states the high importance of the weight function. Finally, a visualization tool of the significant features is derived.

READ FULL TEXT
research
09/27/2020

A Weighted Quiver Kernel using Functor Homology

In this paper, we propose a new homological method to study weighted dir...
research
09/26/2022

Renewable Composite Quantile Method and Algorithm for Nonparametric Models with Streaming Data

We are interested in renewable estimations and algorithms for nonparamet...
research
09/05/2018

Gene Shaving using influence function of a kernel method

Identifying significant subsets of the genes, gene shaving is an essenti...
research
05/08/2018

Several Tunable GMM Kernels

While tree methods have been popular in practice, researchers and practi...
research
03/25/2020

A Unified Framework for Multiclass and Multilabel Support Vector Machines

We propose a novel integrated formulation for multiclass and multilabel ...
research
02/06/2023

Moving Least Squares Approximation using Variably Scaled Discontinuous Weight Function

Functions with discontinuities appear in many applications such as image...

Please sign up or login with your details

Forgot password? Click here to reset