A Neighbourhood Framework for Resource-Lean Content Flagging

03/31/2021
by Sheikh Muhammad Sarwar, et al.

We propose a novel interpretable framework for cross-lingual content flagging, which significantly outperforms prior work in terms of both predictive performance and average inference time. The framework is based on a nearest-neighbour architecture and is interpretable by design. Moreover, it can easily adapt to new instances without being retrained from scratch. Unlike prior work, (i) we encode not only the texts but also the labels in the neighbourhood space (which yields better accuracy), and (ii) we use a bi-encoder instead of a cross-encoder (which saves computation time). Our evaluation results on ten different datasets for abusive language detection in eight languages show sizable improvements over the state of the art, as well as a speed-up at inference time.
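To make the nearest-neighbour idea concrete, here is a minimal sketch and not the authors' implementation. It assumes a toy bag-of-words `embed` function in place of a learned multilingual sentence encoder, and it uses a plain similarity-weighted vote in place of the paper's label-aware neighbourhood encoding. The names `embed`, `cosine`, and `NeighbourFlagger` are hypothetical. The sketch illustrates three properties named in the abstract: indexed texts are encoded once and independently of the query (the bi-encoder setting), the retrieved neighbours serve as an explanation of the prediction, and new labelled instances can be added without retraining.

```python
import math
from collections import Counter, defaultdict


def embed(text):
    """Toy stand-in for a sentence encoder (assumption): L2-normalised token counts."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {tok: c / norm for tok, c in counts.items()} if norm else {}


def cosine(u, v):
    """Cosine similarity of two already-normalised sparse vectors."""
    if len(u) > len(v):
        u, v = v, u
    return sum(w * v.get(tok, 0.0) for tok, w in u.items())


class NeighbourFlagger:
    """Hypothetical k-nearest-neighbour flagger over pre-computed embeddings."""

    def __init__(self, k=3):
        self.k = k
        self.index = []  # list of (embedding, label, text)

    def add(self, text, label):
        # Each text is encoded once, independently of any future query.
        self.index.append((embed(text), label, text))

    def predict(self, text):
        if not self.index:
            raise ValueError("the neighbour index is empty")
        query = embed(text)  # only the query is encoded at inference time
        scored = sorted(
            ((cosine(query, emb), label, src) for emb, label, src in self.index),
            key=lambda item: item[0],
            reverse=True,
        )[: self.k]
        # Similarity-weighted vote: a simplification, not the paper's label encoding.
        votes = defaultdict(float)
        for sim, label, _ in scored:
            votes[label] += sim
        return max(votes, key=votes.get), scored


flagger = NeighbourFlagger(k=2)
flagger.add("you are a total idiot", "abusive")
flagger.add("what an idiot you are", "abusive")
flagger.add("have a nice day", "neutral")
flagger.add("thanks for the nice help", "neutral")

label, neighbours = flagger.predict("you idiot")
print(label)
for sim, neighbour_label, source in neighbours:
    print(f"{sim:.2f}  {neighbour_label}  {source}")

# Adapting to a new instance only requires appending it to the index.
flagger.add("you absolute muppet", "abusive")
new_label, _ = flagger.predict("absolute muppet")
print(new_label)
```

A cross-encoder would instead score every (query, neighbour) pair jointly at inference time, so its cost grows with the number of candidates encoded per query. In the sketch above, the stored embeddings are reused across queries, which is the kind of computational saving the abstract attributes to the bi-encoder.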
