Transductive Learning with String Kernels for Cross-Domain Text Classification

11/02/2018
by Radu Tudor Ionescu, et al.

Many text classification tasks suffer from a lack of labeled data in the target domain. Although a classifier for the target domain can be trained on labeled text from a related source domain, its accuracy is usually lower in this cross-domain setting than when training and test data come from the same domain. Recently, string kernels have obtained state-of-the-art results in various text classification tasks, such as native language identification and automatic essay scoring. Moreover, classifiers based on string kernels have been found to be robust to the distribution gap between different domains. In this paper, we formally describe an algorithm composed of two simple yet effective transductive learning approaches that further improve the results of string kernels in cross-domain settings. By adapting string kernels to the test set without using the ground-truth test labels, we report significantly better accuracy rates in cross-domain English polarity classification.
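The abstract does not spell out either of the two transductive approaches, so the following is only a minimal sketch of the general idea of adapting a string kernel to the test set without labels. It assumes a plain character n-gram spectrum kernel as a stand-in for the string kernels used in the paper, and it assumes that "adapting" means re-describing each document by its similarities to all training and test documents. The function names and the choice of n = 3 are hypothetical and are not taken from the paper.

```python
from collections import Counter
import math


def ngram_counts(text, n=3):
    """Count every character n-gram of length n in text."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def spectrum_kernel(a, b, n=3):
    """Spectrum string kernel: dot product of character n-gram count vectors."""
    counts_a, counts_b = ngram_counts(a, n), ngram_counts(b, n)
    return sum(count * counts_b[gram] for gram, count in counts_a.items())


def normalized_kernel_matrix(docs, n=3):
    """Pairwise kernel matrix scaled so that non-empty documents have a unit diagonal."""
    raw = [[spectrum_kernel(a, b, n) for b in docs] for a in docs]
    size = len(docs)
    return [
        [
            raw[i][j] / math.sqrt(raw[i][i] * raw[j][j])
            if raw[i][i] and raw[j][j] else 0.0
            for j in range(size)
        ]
        for i in range(size)
    ]


def transductive_kernel(train_docs, test_docs, n=3):
    """Assumed adaptation step: compare documents through their similarity
    profiles over the union of training and test documents.
    Only the test texts are used, never the test labels."""
    k = normalized_kernel_matrix(train_docs + test_docs, n)
    size = len(k)
    return [
        [sum(k[i][t] * k[j][t] for t in range(size)) for j in range(size)]
        for i in range(size)
    ]
```

The resulting matrix covers the training documents first and the test documents after them, so its training-by-training block could be passed to any kernel classifier that accepts a precomputed kernel, and its test-by-training block could be used for prediction.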

research
08/25/2018

Improving the results of string kernels in sentiment analysis and Arabic dialect identification by adapting them to your test set

Recently, string kernels have obtained state-of-the-art results in vario...
research
04/18/2023

A Two-Stage Framework with Self-Supervised Distillation For Cross-Domain Text Classification

Cross-domain text classification aims to adapt models to a target domain...
research
04/21/2018

Automated essay scoring with string kernels and word embeddings

In this work, we present an approach based on combining string kernels a...
research
09/16/2018

Cross-Domain Labeled LDA for Cross-Domain Text Classification

Cross-domain text classification aims at building a classifier for a tar...
research
04/15/2022

Learning to Adapt Domain Shifts of Moral Values via Instance Weighting

Classifying moral values in user-generated text from social media is cri...
research
02/15/2021

Generation for adaption: a Gan-based approach for 3D Domain Adaption in Point Cloud

Recent deep networks have achieved good performance on a variety of 3d p...
research
06/20/2022

Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data

Domain adaptive text classification is a challenging problem for the lar...
