Approaches of large-scale images recognition with more than 50,000 categories

07/26/2020
by Wanhong Huang, et al.

Although current computer vision (CV) models achieve high accuracy on small-scale image classification datasets with hundreds or thousands of categories, many become infeasible in computation or memory when applied to large-scale datasets with more than 50,000 categories. In this paper, we provide a viable solution for classifying large-scale species datasets using traditional CV techniques, such as feature extraction and processing and BOVW (Bag of Visual Words), together with statistical learning techniques such as Mini-Batch K-Means and SVM, and then combine them with a neural network model. In applying these techniques, we optimize time and memory consumption so that they remain feasible for a large-scale dataset. We also use several techniques to reduce the impact of mislabeled data. We use a dataset with more than 50,000 categories, and all operations are performed on a common computer with 16GB RAM and a 3.0GHz CPU. Our contributions are: 1) we analyze the problems that may arise during training and present several feasible ways to solve them; 2) we show that traditional CV models combined with neural network models provide feasible scenarios for training on large-scale classification datasets within the constraints of time and space.
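To make the memory argument concrete, the following is a minimal sketch of how a BOVW codebook can be learned with mini-batch k-means while holding only one batch of descriptors in memory, and how an image is then encoded as a histogram of visual words. It is an illustration under stated assumptions, not the authors' implementation: the function names, the toy 2-D "descriptors", and the initialisation from the first batch are all hypothetical, and real pipelines would typically use local image descriptors and a library implementation of mini-batch k-means. The resulting histograms are the kind of fixed-length feature vector that could be passed to an SVM.

```python
import random


def nearest(centers, x):
    """Return the index of the center closest to descriptor x."""
    best, best_d = 0, float("inf")
    for i, c in enumerate(centers):
        d = sum((a - b) ** 2 for a, b in zip(c, x))  # squared Euclidean distance
        if d < best_d:
            best, best_d = i, d
    return best


def minibatch_kmeans(batches, k, seed=0):
    """Learn k visual words from an iterable of descriptor batches.

    Only the current batch is in memory. Each center moves toward the
    descriptors assigned to it with a per-center learning rate 1/count.
    """
    rng = random.Random(seed)
    centers, counts = None, [0] * k
    for batch in batches:
        if centers is None:
            # Assumption: initialise from k descriptors of the first batch.
            centers = [list(x) for x in rng.sample(batch, k)]
        # Assign the whole batch against the cached centers, then update.
        assigned = [nearest(centers, x) for x in batch]
        for x, j in zip(batch, assigned):
            counts[j] += 1
            eta = 1.0 / counts[j]
            centers[j] = [(1 - eta) * c + eta * a for c, a in zip(centers[j], x)]
    return centers


def bovw_histogram(centers, descriptors):
    """Encode one image as an L1-normalised histogram of visual-word counts."""
    hist = [0.0] * len(centers)
    for x in descriptors:
        hist[nearest(centers, x)] += 1.0
    total = sum(hist) or 1.0
    return [h / total for h in hist]


def toy_batches(n_batches=20, size=50, seed=1):
    """Hypothetical stand-in for streaming descriptors from disk."""
    rng = random.Random(seed)
    for _ in range(n_batches):
        means = [rng.choice((0.0, 5.0)) for _ in range(size)]
        yield [[rng.gauss(m, 0.1), rng.gauss(m, 0.1)] for m in means]


codebook = minibatch_kmeans(toy_batches(), k=2)
image_hist = bovw_histogram(codebook, [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.0, 5.1]])
```

Because the batches are consumed from a generator, peak memory depends on the batch size and the codebook size rather than on the total number of descriptors, which is the property that matters on a machine with limited RAM.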

