Automatic Separation of Compound Figures in Scientific Articles

06/03/2016
by Mario Taschwer, et al.

Content-based analysis and retrieval of digital images found in scientific articles are often hindered by images consisting of multiple subfigures (compound figures). We address this problem by proposing a method to automatically classify and separate compound figures, which consists of two main steps: (i) a supervised compound figure classifier (CFC) discriminates between compound and non-compound figures using task-specific image features; and (ii) an image processing algorithm is applied to figures predicted to be compound to perform compound figure separation (CFS). Our CFC approach is shown to achieve state-of-the-art classification performance on a published dataset. Our CFS algorithm shows superior separation accuracy on two different datasets compared to other known automatic approaches. Finally, we propose a method to evaluate the effectiveness of the CFC-CFS process chain and use it to optimize the misclassification loss of CFC for maximal effectiveness in the process chain.
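
The two-step chain described above can be illustrated with a minimal sketch. This is not the authors' implementation: the names `process_figure`, `split_on_blank_columns`, `compound_score`, and `threshold` are hypothetical, the image is assumed to be a plain list of grayscale pixel rows, and the separator is a toy stand-in that cuts only at fully blank columns. It shows just the control flow: a classifier score gates whether the separation step runs at all.

```python
from typing import Callable, List, Tuple

Image = List[List[int]]          # assumed: grayscale rows of pixel values
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


def split_on_blank_columns(image: Image, background: int = 255) -> List[Box]:
    """Toy separator (an assumption, not the paper's CFS algorithm):
    cut the image at columns that contain only background pixels."""
    height, width = len(image), len(image[0])
    boxes: List[Box] = []
    start = None  # left edge of the subfigure currently being scanned
    for x in range(width):
        blank = all(row[x] == background for row in image)
        if not blank and start is None:
            start = x
        elif blank and start is not None:
            boxes.append((start, 0, x, height))
            start = None
    if start is not None:
        boxes.append((start, 0, width, height))
    return boxes


def process_figure(
    image: Image,
    compound_score: Callable[[Image], float],
    separate: Callable[[Image], List[Box]],
    threshold: float = 0.5,
) -> List[Box]:
    """Step (i): classify; step (ii): separate only predicted compound figures."""
    height, width = len(image), len(image[0])
    if compound_score(image) >= threshold:
        return separate(image)
    # Predicted non-compound: the whole image is passed through as one figure.
    return [(0, 0, width, height)]


# Two dark blocks separated by a blank two-pixel gap.
figure = [
    [0, 0, 255, 255, 0, 0, 0],
    [0, 0, 255, 255, 0, 0, 0],
]
as_compound = process_figure(figure, lambda img: 0.9, split_on_blank_columns)
as_single = process_figure(figure, lambda img: 0.1, split_on_blank_columns)
```

In this sketch, lowering `threshold` sends more figures to the separator, which is one simple way to picture the trade-off between the two kinds of classifier error when the chain is evaluated as a whole.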

research
02/03/2021

Learning to identify image manipulations in scientific publications

Adherence to scientific community standards ensures objectivity, clarity...
research
12/21/2012

Topic Extraction and Bundling of Related Scientific Articles

Automatic classification of scientific articles based on common characte...
research
01/23/2022

Mixed X-Ray Image Separation for Artworks with Concealed Designs

In this paper, we focus on X-ray images of paintings with concealed sub-...
research
03/07/2023

Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning

Transformer has shown advanced performance in speech separation, benefit...
research
08/30/2022

Compound Figure Separation of Biomedical Images: Mining Large Datasets for Self-supervised Learning

With the rapid development of self-supervised learning (e.g., contrastiv...
research
01/25/2021

A Two-stage Framework for Compound Figure Separation

Scientific literature contains large volumes of complex, unstructured fi...
research
03/28/2023

Make the Most Out of Your Net: Alternating Between Canonical and Hard Datasets for Improved Image Demosaicing

Image demosaicing is an important step in the image processing pipeline ...