GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing

07/09/2019
by Jian Guo, et al.

We present GluonCV and GluonNLP, deep learning toolkits for computer vision and natural language processing based on Apache MXNet (incubating). These toolkits provide state-of-the-art pre-trained models, training scripts, and training logs to facilitate rapid prototyping and promote reproducible research. We also provide modular APIs with flexible building blocks to enable efficient customization. Leveraging the MXNet ecosystem, the deep learning models in GluonCV and GluonNLP can be deployed onto a variety of platforms with different programming languages. Released as open source under the Apache 2.0 license, GluonCV and GluonNLP have attracted 100 contributors worldwide on GitHub. Models from GluonCV and GluonNLP have been downloaded more than 1.6 million times in fewer than 10 months.
