Appearance Shock Grammar for Fast Medial Axis Extraction from Real Images

We combine ideas from shock graph theory with more recent appearance-based methods for medial axis extraction from complex natural scenes, improving upon the present best unsupervised method, in terms of efficiency and performance. We make the following specific contributions: i) we extend the shock graph representation to the domain of real images, by generalizing the shock type definitions using local, appearance-based criteria; ii) we then use the rules of a Shock Grammar to guide our search for medial points, drastically reducing run time when compared to other methods, which exhaustively consider all points in the input image;iii) we remove the need for typical post-processing steps including thinning, non-maximum suppression, and grouping, by adhering to the Shock Grammar rules while deriving the medial axis solution; iv) finally, we raise some fundamental concerns with the evaluation scheme used in previous work and propose a more appropriate alternative for assessing the performance of medial axis extraction from scenes. Our experiments on the BMAX500 and SK-LARGE datasets demonstrate the effectiveness of our approach. We outperform the present state-of-the-art, excelling particularly in the high-precision regime, while running an order of magnitude faster and requiring no post-processing.

READ FULL TEXT

page 2

page 4

page 6

page 8

page 11

page 12

page 13

page 14

research
03/24/2017

AMAT: Medial Axis Transform for Natural Images

We introduce Appearance-MAT (AMAT), a generalization of the medial axis ...
research
04/17/2022

The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation

The local reference frame (LRF), as an independent coordinate system gen...
research
07/19/2018

Hybrid scene Compression for Visual Localization

Localizing an image wrt. a large scale 3D scene represents a core task f...
research
09/20/2019

Deep 3D-Zoom Net: Unsupervised Learning of Photo-Realistic 3D-Zoom

The 3D-zoom operation is the positive translation of the camera in the Z...
research
12/07/2021

Natural Answer Generation: From Factoid Answer to Full-length Answer using Grammar Correction

Question Answering systems these days typically use template-based langu...
research
02/05/2015

A Framework for Symmetric Part Detection in Cluttered Scenes

The role of symmetry in computer vision has waxed and waned in importanc...
research
06/16/2015

End-to-end people detection in crowded scenes

Current people detectors operate either by scanning an image in a slidin...

Please sign up or login with your details

Forgot password? Click here to reset