Wasserstein Distance Regularized Sequence Representation for Text Matching in Asymmetrical Domains

10/15/2020
by   Weijie Yu, et al.
0

One approach to matching texts from asymmetrical domains is projecting the input sequences into a common semantic space as feature vectors upon which the matching function can be readily defined and learned. In real-world matching practices, it is often observed that with the training goes on, the feature vectors projected from different domains tend to be indistinguishable. The phenomenon, however, is often overlooked in existing matching models. As a result, the feature vectors are constructed without any regularization, which inevitably increases the difficulty of learning the downstream matching functions. In this paper, we propose a novel match method tailored for text matching in asymmetrical domains, called WD-Match. In WD-Match, a Wasserstein distance-based regularizer is defined to regularize the features vectors projected from different domains. As a result, the method enforces the feature projection function to generate vectors such that those correspond to different domains cannot be easily discriminated. The training process of WD-Match amounts to a game that minimizes the matching loss regularized by the Wasserstein distance. WD-Match can be used to improve different text matching methods, by using the method as its underlying matching model. Four popular text matching methods have been exploited in the paper. Experimental results based on four publicly available benchmarks showed that WD-Match consistently outperformed the underlying methods and the baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2019

Multi-marginal Wasserstein GAN

Multiple marginal matching problem aims at learning mappings to match a ...
research
04/15/2016

Match-SRNN: Modeling the Recursive Matching Structure with Spatial RNN

Semantic matching, which aims to determine the matching degree between t...
research
02/13/2020

Regularizing activations in neural networks via distribution matching with the Wasserstein metric

Regularization and normalization have become indispensable components in...
research
03/01/2020

Joint Wasserstein Distribution Matching

Joint distribution matching (JDM) problem, which aims to learn bidirecti...
research
12/29/2014

A simple coding for cross-domain matching with dimension reduction via spectral graph embedding

Data vectors are obtained from multiple domains. They are feature vector...
research
11/06/2016

A Compare-Aggregate Model for Matching Text Sequences

Many NLP tasks including machine comprehension, answer selection and tex...
research
08/15/2012

Evaluating Ontology Matching Systems on Large, Multilingual and Real-world Test Cases

In the field of ontology matching, the most systematic evaluation of mat...

Please sign up or login with your details

Forgot password? Click here to reset