Accuracy on In-Domain Samples Matters When Building Out-of-Domain detectors: A Reply to Marek et al. (2021)

05/24/2022
by   Yinhe Zheng, et al.

We have noticed that Marek et al. (2021) attempt to re-implement the method of our paper, Zheng et al. (2020a), in their work "OodGAN: Generative Adversarial Network for Out-of-Domain Data Generation". Our paper proposes a model that generates pseudo out-of-domain (OOD) samples resembling in-domain (IND) input utterances. These pseudo OOD samples can be used to improve OOD detection performance by optimizing an entropy regularization term when training the IND classifier. Marek et al. (2021) report a large gap between their re-implemented results and ours on the CLINC150 dataset (Larson et al., 2019). This paper discusses some key observations that may explain this gap. Most of these observations come from our own experiments, because Marek et al. (2021) have not released their code. One of the most important observations is that stronger IND classifiers usually detect OOD samples more robustly. We hope these observations help other researchers, including Marek et al. (2021), to develop better OOD detectors in their applications.
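To make the role of the entropy regularization term concrete, the following is a minimal sketch rather than the implementation from Zheng et al. (2020a). It assumes the training objective is the cross-entropy on IND samples minus a weighted mean predictive entropy on pseudo OOD samples, so that the classifier is pushed towards uncertain predictions on OOD inputs. The function names, the weight `alpha`, and the maximum-softmax detection score are illustrative assumptions, not details taken from the abstract.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - log_z for z in logits]


def entropy(logits):
    """Shannon entropy (in nats) of the softmax distribution."""
    log_p = log_softmax(logits)
    return -sum(math.exp(lp) * lp for lp in log_p)


def entropy_regularized_loss(ind_logits, ind_labels, ood_logits, alpha=1.0):
    """Assumed objective: IND cross-entropy minus alpha * mean OOD entropy.

    ind_logits: list of logit vectors for IND samples
    ind_labels: list of gold class indices for those samples
    ood_logits: list of logit vectors for pseudo OOD samples
    alpha:      hypothetical weight of the entropy regularizer
    """
    ce = sum(-log_softmax(z)[y] for z, y in zip(ind_logits, ind_labels))
    ce /= len(ind_logits)
    ent = sum(entropy(z) for z in ood_logits) / len(ood_logits)
    # Minimizing this loss maximizes entropy on the pseudo OOD samples.
    return ce - alpha * ent


def max_softmax_score(logits):
    """Confidence score; a low value suggests the input may be OOD."""
    return math.exp(max(log_softmax(logits)))


# Usage: a uniform prediction on a pseudo OOD sample is rewarded with a
# lower loss than a confident prediction on the same sample.
ind = [[4.0, 0.0, 0.0]]
labels = [0]
loss_uniform = entropy_regularized_loss(ind, labels, [[0.0, 0.0, 0.0]])
loss_confident = entropy_regularized_loss(ind, labels, [[9.0, 0.0, 0.0]])
```

Under this assumed objective, a classifier that stays accurate on IND data while remaining uncertain on the pseudo OOD samples achieves the lowest loss, which is in line with the observation that IND accuracy matters for OOD detection.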


