ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification

05/28/2020
by   Naoto Usuyama, et al.
0

Identifying prescription medications is a frequent task for patients and medical professionals; however, this is an error-prone task as many pills have similar appearances (e.g. white round pills), which increases the risk of medication errors. In this paper, we introduce ePillID, the largest public benchmark on pill image recognition, composed of 13k images representing 8184 appearance classes (two sides for 4092 pill types). For most of the appearance classes, there exists only one reference image, making it a challenging low-shot recognition setting. We present our experimental setup and evaluation results of various baseline models on the benchmark. The best baseline using a multi-head metric-learning approach with bilinear features performed remarkably well; however, our error analysis suggests that they still fail to distinguish particularly confusing classes. The code and data are available at <https://github.com/usuyama/ePillID-benchmark>.

READ FULL TEXT

page 1

page 4

research
04/01/2020

Revisiting Pose-Normalization for Fine-Grained Few-Shot Recognition

Few-shot, fine-grained classification requires a model to learn subtle, ...
research
04/07/2019

Compare More Nuanced:Pairwise Alignment Bilinear Network For Few-shot Fine-grained Learning

The recognition ability of human beings is developed in a progressive wa...
research
08/06/2018

Visual Question Generation for Class Acquisition of Unknown Objects

Traditional image recognition methods only consider objects belonging to...
research
07/10/2019

A New Benchmark and Approach for Fine-grained Cross-media Retrieval

Cross-media retrieval is to return the results of various media types co...
research
08/01/2022

Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-Boosting Attention Mechanism

The challenge of fine-grained visual recognition often lies in discoveri...
research
11/21/2022

Place Recognition under Occlusion and Changing Appearance via Disentangled Representations

Place recognition is a critical and challenging task for mobile robots, ...
research
12/02/2019

Patchy Image Structure Classification Using Multi-Orientation Region Transform

Exterior contour and interior structure are both vital features for clas...

Please sign up or login with your details

Forgot password? Click here to reset