CubiCasa5K: A Dataset and an Improved Multi-Task Model for Floorplan Image Analysis

04/03/2019
by Ahti Kalervo, et al.

A better understanding and modelling of building interiors, together with the emergence of more impressive AR/VR technology, have created a need for automatic parsing of floorplan images. However, there is a clear lack of representative datasets for investigating the problem further. To address this shortcoming, this paper presents CubiCasa5K, a novel large-scale floorplan image dataset containing 5000 samples annotated into over 80 floorplan object categories. The annotations are dense and versatile, using polygons to separate the different objects. Diverging from classical approaches based on strong heuristics and low-level pixel operations, we present a method that relies on an improved multi-task convolutional neural network. By releasing the novel dataset and our implementations, this study significantly boosts research on automatic floorplan image analysis, as it provides a richer set of tools for investigating the problem more comprehensively.


