Learning from Conditional Distributions via Dual Embeddings

07/15/2016
by   Bo Dai, et al.
0

Many machine learning tasks, such as learning with invariance and policy evaluation in reinforcement learning, can be characterized as problems of learning from conditional distributions. In such problems, each sample x itself is associated with a conditional distribution p(z|x) represented by samples {z_i}_i=1^M, and the goal is to learn a function f that links these conditional distributions to target values y. These learning problems become very challenging when we only have limited samples or in the extreme case only one sample from each conditional distribution. Commonly used approaches either assume that z is independent of x, or require an overwhelmingly large samples from each conditional distribution. To address these challenges, we propose a novel approach which employs a new min-max reformulation of the learning from conditional distribution problem. With such new reformulation, we only need to deal with the joint distribution p(z,x). We also design an efficient learning algorithm, Embedding-SGD, and establish theoretical sample complexity for such problems. Finally, our numerical experiments on both synthetic and real-world datasets show that the proposed approach can significantly improve over the existing algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2018

Anaconda: A Non-Adaptive Conditional Sampling Algorithm for Distribution Testing

We investigate distribution testing with access to non-adaptive conditio...
research
12/19/2021

Wasserstein Generative Learning of Conditional Distribution

Conditional distribution is a fundamental quantity for describing the re...
research
10/19/2021

A Deep Generative Approach to Conditional Sampling

We propose a deep generative approach to sampling from a conditional dis...
research
02/22/2019

Learning about an exponential amount of conditional distributions

We introduce the Neural Conditioner (NC), a self-supervised machine able...
research
09/30/2017

Decontamination of Mutual Contamination Models

Many machine learning problems can be characterized by mutual contaminat...
research
05/28/2020

Learning How To Learn Within An LSM-based Key-Value Store

We introduce BOURBON, a log-structured merge (LSM) tree that utilizes ma...
research
06/25/2018

Mimic and Classify : A meta-algorithm for Conditional Independence Testing

Given independent samples generated from the joint distribution p(x,y,z)...

Please sign up or login with your details

Forgot password? Click here to reset