On Negative Transfer and Structure of Latent Functions in Multi-output Gaussian Processes

04/06/2020
by Moyan Li, et al.

The multi-output Gaussian process (MGP) is based on the assumption that outputs share commonalities. If this assumption does not hold, however, negative transfer will lead to decreased performance relative to learning outputs independently or in subsets. In this article, we first define negative transfer in the context of an MGP and then derive necessary conditions for an MGP model to avoid negative transfer. Specifically, under the convolution construction, we show that avoiding negative transfer depends mainly on having a sufficient number of latent functions Q, regardless of the flexibility of the kernel or the inference procedure used. However, a slight increase in Q leads to a large increase in the number of parameters to be estimated. To this end, we propose two latent structures that scale to arbitrarily large datasets, can avoid negative transfer, and allow any kernel or sparse approximation to be used within them. These structures also allow regularization, which can provide consistent and automatic selection of related outputs.
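
The growth in parameters with Q can be made concrete with a rough count. The sketch below assumes a fully connected convolution construction, in which each of the N outputs is linked to each of the Q latent functions through its own smoothing kernel. The function name `count_kernel_params` and the default of two hyperparameters per kernel are illustrative assumptions, not taken from the paper. Noise variances and any hyperparameters of the latent functions themselves are ignored.

```python
def count_kernel_params(n_outputs: int, n_latent: int, params_per_kernel: int = 2) -> int:
    """Count smoothing-kernel hyperparameters in a fully connected structure.

    Assumption (not from the paper): every output has one kernel per latent
    function, and every kernel has `params_per_kernel` hyperparameters.
    """
    if n_outputs < 1 or n_latent < 1 or params_per_kernel < 1:
        raise ValueError("all counts must be positive integers")
    return n_outputs * n_latent * params_per_kernel


if __name__ == "__main__":
    n_outputs = 10
    for q in (1, 2, 3, 4):
        # Each extra latent function adds n_outputs * params_per_kernel parameters.
        print(f"Q={q}: {count_kernel_params(n_outputs, q)} kernel hyperparameters")
```

Under these assumptions, each additional latent function adds one kernel per output, so the increment scales with the number of outputs N. That is the cost the abstract refers to when it says a slight increase in Q leads to a large increase in the number of parameters.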
