Domain Divergences: a Survey and Empirical Analysis

10/23/2020
by   Abhinav Ramesh Kashyap, et al.
0

Domain divergence plays a significant role in estimating the performance of a model when applied to new domains. While there is significant literature on divergence measures, choosing an appropriate divergence measures remains difficult for researchers. We address this shortcoming by both surveying the literature and through an empirical study. We contribute a taxonomy of divergence measures consisting of three groups – Information-theoretic, Geometric, and Higher-order measures – and identify the relationships between them. We then ground the use of divergence measures in three different application groups – 1) Data Selection, 2) Learning Representation, and 3) Decisions in the Wild. From this, we identify that Information-theoretic measures are prevalent for 1) and 3), and higher-order measures are common for 2). To further help researchers, we validate these uses empirically through a correlation analysis of performance drops. We consider the current contextual word representations (CWR) to contrast with the older word distribution based representations for this analysis. We find that traditional measures over word distributions still serve as strong baselines, while higher-order measures with CWR are effective.

READ FULL TEXT

page 8

page 15

research
12/12/2018

Divergence measures estimation and its asymptotic normality theory : Discrete case

In this paper we provide the asymptotic theory of the general phi-diverg...
research
09/14/2023

Generalized Decomposition of Multivariate Information

Since its introduction, the partial information decomposition (PID) has ...
research
07/27/2022

Informational properties of the family of cubic rank transmuted distributions

Recently, cubic rank transmuted (CRT) distribution was introduced and st...
research
04/22/2020

An information-theoretic approach to the analysis of location and co-location patterns

We propose a statistical framework to quantify location and co-location ...
research
07/10/2011

Information-Theoretic Measures for Objective Evaluation of Classifications

This work presents a systematic study of objective evaluations of abstai...
research
10/05/2018

Corrections to "Wyner's Common Information under Rényi Divergence Measures"

In this correspondence, we correct an erroneous argument in the proof of...
research
10/28/2015

Canonical Divergence Analysis

We aim to analyze the relation between two random vectors that may poten...

Please sign up or login with your details

Forgot password? Click here to reset