Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks

02/01/2017
by   Žiga Emeršič, et al.
0

Object detection and segmentation represents the basis for many tasks in computer and machine vision. In biometric recognition systems the detection of the region-of-interest (ROI) is one of the most crucial steps in the overall processing pipeline, significantly impacting the performance of the entire recognition system. Existing approaches to ear detection, for example, are commonly susceptible to the presence of severe occlusions, ear accessories or variable illumination conditions and often deteriorate in their performance if applied on ear images captured in unconstrained settings. To address these shortcomings, we present in this paper a novel ear detection technique based on convolutional encoder-decoder networks (CEDs). For our technique, we formulate the problem of ear detection as a two-class segmentation problem and train a convolutional encoder-decoder network based on the SegNet architecture to distinguish between image-pixels belonging to either the ear or the non-ear class. The output of the network is then post-processed to further refine the segmentation result and return the final locations of the ears in the input image. Different from competing techniques from the literature, our approach does not simply return a bounding box around the detected ear, but provides detailed, pixel-wise information about the location of the ears in the image. Our experiments on a dataset gathered from the web (a.k.a. in the wild) show that the proposed technique ensures good detection results in the presence of various covariate factors and significantly outperforms the existing state-of-the-art.

READ FULL TEXT

page 4

page 6

page 7

page 8

page 11

research
06/28/2021

Fractal Pyramid Networks

We propose a new network architecture, the Fractal Pyramid Networks (PFN...
research
09/07/2023

Feature Enhancer Segmentation Network (FES-Net) for Vessel Segmentation

Diseases such as diabetic retinopathy and age-related macular degenerati...
research
06/27/2022

De-END: Decoder-driven Watermarking Network

With recent advances in machine learning, researchers are now able to so...
research
09/13/2021

CarNet: A Lightweight and Efficient Encoder-Decoder Architecture for High-quality Road Crack Detection

Pixel-wise crack detection is a challenging task because of poor continu...
research
10/09/2018

UOLO - automatic object detection and segmentation in biomedical images

We propose UOLO, a novel framework for the simultaneous detection and se...
research
01/27/2020

¶ILCRO: Making Importance Landscapes Flat Again

Convolutional neural networks have had a great success in numerous tasks...
research
07/18/2017

The Devil is in the Decoder

Many machine vision applications require predictions for every pixel of ...

Please sign up or login with your details

Forgot password? Click here to reset