Generalising Recursive Neural Models by Tensor Decomposition

06/17/2020
by   Daniele Castellana, et al.
0

Most machine learning models for structured data encode the structural knowledge of a node by leveraging simple aggregation functions (in neural models, typically a weighted sum) of the information in the node's neighbourhood. Nevertheless, the choice of simple context aggregation functions, such as the sum, can be widely sub-optimal. In this work we introduce a general approach to model aggregation of structural context leveraging a tensor-based formulation. We show how the exponential growth in the size of the parameter space can be controlled through an approximation based on the Tucker tensor decomposition. This approximation allows limiting the parameters space size, decoupling it from its strict relation with the size of the hidden encoding space. By this means, we can effectively regulate the trade-off between expressivity of the encoding, controlled by the hidden size, computational complexity and model generalisation, influenced by parameterisation. Finally, we introduce a new Tensorial Tree-LSTM derived as an instance of our framework and we use it to experimentally assess our working hypotheses on tree classification scenarios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/18/2020

Tensor Decompositions in Recursive NeuralNetworks for Tree-Structured Data

The paper introduces two new aggregation functions to encode structural ...
research
11/02/2020

Learning from Non-Binary Constituency Trees via Tensor Decomposition

Processing sentence constituency trees in binarised form is a common and...
research
05/31/2019

Bayesian Tensor Factorisation for Bottom-up Hidden Tree Markov Models

Bottom-Up Hidden Tree Markov Model is a highly expressive model for tree...
research
06/30/2020

Approximation with Tensor Networks. Part II: Approximation Rates for Smoothness Classes

We study the approximation by tensor networks (TNs) of functions from cl...
research
12/02/2021

Approximation by tree tensor networks in high dimensions: Sobolev and compositional functions

This paper is concerned with convergence estimates for fully discrete tr...
research
08/11/2022

Interaction Decompositions for Tensor Network Regression

It is well known that tensor network regression models operate on an exp...
research
12/15/2020

Learning Aggregation Functions

Learning on sets is increasingly gaining attention in the machine learni...

Please sign up or login with your details

Forgot password? Click here to reset