Large Field and High Resolution: Detecting Needle in Haystack

04/10/2018
by Hadar Gorodissky, et al.

The growing use of convolutional neural networks (CNNs) for a broad range of visual tasks, including tasks that involve fine details, raises the problem of applying such networks to a large field of view, since the amount of computation increases significantly with the number of pixels. To deal with this difficulty, we develop and compare methods of using CNNs for small-target localization in natural images, given a limited "budget" of samples from which to form an image. Inspired in part by human vision, we develop and compare variable sampling schemes, with peak resolution at the center and decreasing resolution with eccentricity, applied iteratively by re-centering the image at the previously predicted target location. The results indicate that variable-resolution models significantly outperform constant-resolution models. Surprisingly, variable-resolution models, and in particular multi-channel models, outperform the optimal, "budget-free" full-resolution model while using only 5% of the samples.
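The sampling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the exponential spacing law, the parameter `k`, the grid size `n_side`, and the function names are all assumptions made for illustration, and a simple brightest-sample rule (`argmax_predictor`, hypothetical) stands in for the CNN that would predict the target location.

```python
import numpy as np

def foveated_offsets(n_side, max_radius, k=3.0):
    """1-D sample offsets: dense near 0, sparse toward +/- max_radius.

    The exponential warp is an assumed spacing law, chosen only for illustration.
    """
    u = np.linspace(-1.0, 1.0, n_side)  # odd n_side puts a sample at offset 0
    return np.sign(u) * np.expm1(k * np.abs(u)) / np.expm1(k) * max_radius

def foveated_sample(image, center, n_side=33, k=3.0):
    """Sample an n_side x n_side grid centred on `center` (row, col)."""
    h, w = image.shape[:2]
    cy, cx = center
    off = foveated_offsets(n_side, max(h, w) / 2.0, k)
    # Round to pixel indices and clip to the image bounds.
    ys = np.clip(np.rint(cy + off), 0, h - 1).astype(int)
    xs = np.clip(np.rint(cx + off), 0, w - 1).astype(int)
    return image[np.ix_(ys, xs)], ys, xs

def argmax_predictor(patch):
    """Hypothetical stand-in for a CNN: picks the brightest sample."""
    return np.unravel_index(np.argmax(patch), patch.shape)

def localize(image, predict=argmax_predictor, n_iters=4, n_side=33):
    """Iteratively re-centre the sampling grid at the last predicted location."""
    center = (image.shape[0] // 2, image.shape[1] // 2)
    for _ in range(n_iters):
        patch, ys, xs = foveated_sample(image, center, n_side)
        i, j = predict(patch)
        center = (int(ys[i]), int(xs[j]))  # map grid index back to pixels
    return center

def make_blob_image(shape, target, sigma=20.0):
    """Synthetic test image: a Gaussian blob centred on `target`."""
    yy, xx = np.mgrid[0:shape[0], 0:shape[1]]
    d2 = (yy - target[0]) ** 2 + (xx - target[1]) ** 2
    return np.exp(-d2 / (2.0 * sigma ** 2))

if __name__ == "__main__":
    img = make_blob_image((256, 256), target=(40, 200))
    print(localize(img))
```

Each iteration in this sketch reads 33 × 33 = 1,089 samples from a 256 × 256 image. Because the grid is densest around its centre, every re-centering step examines the neighbourhood of the previous estimate at finer resolution.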

research
08/10/2022

PatchDropout: Economizing Vision Transformers Using Patch Dropout

Vision transformers have demonstrated the potential to outperform CNNs i...
research
12/08/2017

Image Inpainting for High-Resolution Textures using CNN Texture Synthesis

Deep neural networks have been successfully applied to problems such as ...
research
02/02/2017

Pixel Recursive Super Resolution

We present a pixel recursive super resolution model that synthesizes rea...
research
11/20/2019

MetH: A family of high-resolution and variable-shape image challenges

High-resolution and variable-shape images have not yet been properly add...
research
02/12/2020

Analysis of Multi Field of View CNN and Attention CNN on H&E Stained Whole-slide Images on Hepatocellular Carcinoma

Hepatocellular carcinoma (HCC) is a leading cause of cancer-related deat...
research
10/22/2018

Field Of Interest Proposal for Augmented Mitotic Cell Count: Comparison of two Convolutional Networks

Most tumor grading systems for human as for veterinary histopathology ar...
research
07/24/2021

Efficient Dataflow Modeling of Peripheral Encoding in the Human Visual System

Computer graphics seeks to deliver compelling images, generated within a...
