ScaleNAS: One-Shot Learning of Scale-Aware Representations for Visual Recognition

11/30/2020
by   Hsin-Pai Cheng, et al.
0

Scale variance among different sizes of body parts and objects is a challenging problem for visual recognition tasks. Existing works usually design dedicated backbone or apply Neural architecture Search(NAS) for each task to tackle this challenge. However, existing works impose significant limitations on the design or search space. To solve these problems, we present ScaleNAS, a one-shot learning method for exploring scale-aware representations. ScaleNAS solves multiple tasks at a time by searching multi-scale feature aggregation. ScaleNAS adopts a flexible search space that allows an arbitrary number of blocks and cross-scale feature fusions. To cope with the high search cost incurred by the flexible space, ScaleNAS employs one-shot learning for multi-scale supernet driven by grouped sampling and evolutionary search. Without further retraining, ScaleNet can be directly deployed for different visual recognition tasks with superior performance. We use ScaleNAS to create high-resolution models for two different tasks, ScaleNet-P for human pose estimation and ScaleNet-S for semantic segmentation. ScaleNet-P and ScaleNet-S outperform existing manually crafted and NAS-based methods in both tasks. When applying ScaleNet-P to bottom-up human pose estimation, it surpasses the state-of-the-art HigherHRNet. In particular, ScaleNet-P4 achieves 71.6 COCO test-dev, achieving new state-of-the-art result.

READ FULL TEXT
research
09/16/2019

Pose Neural Fabrics Search

Neural Architecture Search (NAS) technologies have been successfully per...
research
07/13/2020

MS-NAS: Multi-Scale Neural Architecture Search for Medical Image Segmentation

The recent breakthroughs of Neural Architecture Search (NAS) have motiva...
research
05/21/2020

Powering One-shot Topological NAS with Stabilized Share-parameter Proxy

One-shot NAS method has attracted much interest from the research commun...
research
11/21/2019

AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture

Resource is an important constraint when deploying Deep Neural Networks ...
research
12/13/2020

EfficientPose: Efficient Human Pose Estimation with Neural Architecture Search

Human pose estimation from image and video is a vital task in many multi...
research
12/07/2021

RSBNet: One-Shot Neural Architecture Search for A Backbone Network in Remote Sensing Image Recognition

Recently, a massive number of deep learning based approaches have been s...
research
12/13/2020

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations

In this paper, we propose an efficient human pose estimation network (DA...

Please sign up or login with your details

Forgot password? Click here to reset