Environment-biased Feature Ranking for Novelty Detection Robustness

09/21/2023
by Stefan Smeu, et al.

We tackle the problem of robust novelty detection, where we aim to detect novelties in terms of semantic content while being invariant to changes in other, irrelevant factors. Specifically, we operate in a setup with multiple environments, where we determine the set of features that are associated more with the environments than with the content relevant to the task. We propose a method that starts from a pretrained embedding and a multi-environment setup and ranks the features by how strongly they focus on the environment. First, we compute a per-feature score based on the variance of the feature distribution between environments. Next, we show that by dropping the highest-scoring features, we remove spurious correlations and improve the overall performance by up to 6 [...] synthetic benchmark that we introduce for this task.
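The ranking-and-dropping idea described in the abstract above can be illustrated with a minimal sketch. The abstract does not specify how the between-environment variance is measured, so the score used here (variance of the per-environment feature means, divided by the overall feature variance) is an assumption made for illustration, and the function names `rank_features_by_env_bias` and `drop_top_features` are hypothetical, not taken from the paper.

```python
import numpy as np


def rank_features_by_env_bias(embeddings, env_ids):
    """Score each feature by how much its mean shifts across environments.

    Assumption (not from the paper): the score is the variance of the
    per-environment means, normalized by the overall feature variance.
    """
    embeddings = np.asarray(embeddings, dtype=float)
    env_ids = np.asarray(env_ids)
    envs = np.unique(env_ids)

    # Per-environment mean of every feature: shape (n_envs, n_features).
    env_means = np.stack(
        [embeddings[env_ids == e].mean(axis=0) for e in envs]
    )

    # Normalize so that features on different scales are comparable.
    overall_var = embeddings.var(axis=0) + 1e-12
    scores = env_means.var(axis=0) / overall_var

    # Highest score first: most environment-biased features lead the ranking.
    ranking = np.argsort(-scores)
    return scores, ranking


def drop_top_features(embeddings, ranking, n_drop):
    """Remove the n_drop most environment-biased features."""
    keep = np.sort(ranking[n_drop:])
    return np.asarray(embeddings)[:, keep], keep


# Toy data: two environments, four features; only feature 0 depends on the
# environment, so it should be ranked first and dropped.
rng = np.random.default_rng(0)
env_ids = np.repeat([0, 1], 200)
X = rng.normal(size=(400, 4))
X[:, 0] += 5.0 * env_ids

scores, ranking = rank_features_by_env_bias(X, env_ids)
X_kept, kept = drop_top_features(X, ranking, n_drop=1)
```

In this toy setup the reduced embedding `X_kept` would then be passed to whatever novelty detector is in use; the number of features to drop (`n_drop`) is a free parameter of the sketch and would need to be tuned.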

research
02/16/2022

Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective

Natural language understanding (NLU) models tend to rely on spurious cor...
research
06/02/2000

Novelty Detection on a Mobile Robot Using Habituation

In this paper a novelty filter is introduced which allows a robot operat...
research
07/26/2018

Novelty Detection Meets Collider Physics

Novelty detection is the machine learning task to recognize data, which ...
research
02/28/2023

Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments

Learning to detect, characterize and accommodate novelties is a challeng...
research
06/23/2022

NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds

In order for artificial agents to perform useful tasks in changing envir...
research
08/12/2020

Null-sampling for Interpretable and Fair Representations

We propose to learn invariant representations, in the data domain, to ac...
research
02/19/2020

Identifying Invariant Factors Across Multiple Environments with KL Regression

Many datasets are collected from multiple environments (e.g. different l...
