Toward Auto-evaluation with Confidence-based Category Relation-aware Regression

04/17/2023
by   Jiexin Wang, et al.
0

Auto-evaluation aims to automatically evaluate a trained model on any test dataset without human annotations. Most existing methods utilize global statistics of features extracted by the model as the representation of a dataset. This ignores the influence of the classification head and loses category-wise confusion information of the model. However, ratios of instances assigned to different categories together with their confidence scores reflect how many instances in which categories are difficult for the model to classify, which contain significant indicators for both overall and category-wise performances. In this paper, we propose a Confidence-based Category Relation-aware Regression (C^2R^2) method. C^2R^2 divides all instances in a meta-set into different categories according to their confidence scores and extracts the global representation from them. For each category, C^2R^2 encodes its local confusion relations to other categories into a local representation. The overall and category-wise performances are regressed from global and local representations, respectively. Extensive experiments show the effectiveness of our method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/13/2023

Cartesian Differential Kleisli Categories

Cartesian differential categories come equipped with a differential comb...
research
11/16/2017

Zero-Shot Learning via Category-Specific Visual-Semantic Mapping

Zero-Shot Learning (ZSL) aims to classify a test instance from an unseen...
research
04/12/2023

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss

Filler words like “um" or “uh" are common in spontaneous speech. It is d...
research
05/14/2020

Deep Hierarchical Classification for Category Prediction in E-commerce System

In e-commerce system, category prediction is to automatically predict ca...
research
06/28/2019

Uncovering the Semantics of Wikipedia Categories

The Wikipedia category graph serves as the taxonomic backbone for large-...
research
03/15/2021

Generating CCG Categories

Previous CCG supertaggers usually predict categories using multi-class c...
research
02/13/2020

Summarizing the performances of a background subtraction algorithm measured on several videos

There exist many background subtraction algorithms to detect motion in v...

Please sign up or login with your details

Forgot password? Click here to reset