Deep TEN: Texture Encoding Network

12/08/2016
by   Hang Zhang, et al.
0

We propose a Deep Texture Encoding Network (Deep-TEN) with a novel Encoding Layer integrated on top of convolutional layers, which ports the entire dictionary learning and encoding pipeline into a single model. Current methods build from distinct components, using standard encoders with separate off-the-shelf features such as SIFT descriptors or pre-trained CNN features for material recognition. Our new approach provides an end-to-end learning framework, where the inherent visual vocabularies are learned directly from the loss function. The features, dictionaries and the encoding representation for the classifier are all learned simultaneously. The representation is orderless and therefore is particularly useful for material and texture recognition. The Encoding Layer generalizes robust residual encoders such as VLAD and Fisher Vectors, and has the property of discarding domain specific information which makes the learned convolutional features easier to transfer. Additionally, joint training using multiple datasets of varied sizes and class labels is supported resulting in increased recognition performance. The experimental results show superior performance as compared to state-of-the-art methods using gold-standard databases such as MINC-2500, Flickr Material Database, KTH-TIPS-2b, and two recent databases 4D-Light-Field-Material and GTOS. The source code for the complete system are publicly available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/02/2018

A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification

A novel learnable dictionary encoding layer is proposed in this paper fo...
research
01/01/2020

Histogram Layers for Texture Analysis

We present a histogram layer for artificial neural networks (ANNs). An e...
research
08/22/2022

Multilayer deep feature extraction for visual texture recognition

Convolutional neural networks have shown successful results in image cla...
research
11/25/2014

Deep convolutional filter banks for texture recognition and segmentation

Research in texture recognition often concentrates on the problem of mat...
research
03/29/2018

Deep Texture Manifold for Ground Terrain Recognition

We present a texture network called Deep Encoding Pooling Network (DEP) ...
research
12/29/2018

A Deep Learning based Framework to Detect and Recognize Humans using Contactless Palmprints in the Wild

Contactless and online palmprint identfication offers improved user conv...
research
03/08/2023

UT-Net: Combining U-Net and Transformer for Joint Optic Disc and Cup Segmentation and Glaucoma Detection

Glaucoma is a chronic visual disease that may cause permanent irreversib...

Please sign up or login with your details

Forgot password? Click here to reset