MuraNet: Multi-task Floor Plan Recognition with Relation Attention

09/01/2023
by   Lingxiao Huang, et al.
0

The recognition of information in floor plan data requires the use of detection and segmentation models. However, relying on several single-task models can result in ineffective utilization of relevant information when there are multiple tasks present simultaneously. To address this challenge, we introduce MuraNet, an attention-based multi-task model for segmentation and detection tasks in floor plan data. In MuraNet, we adopt a unified encoder called MURA as the backbone with two separated branches: an enhanced segmentation decoder branch and a decoupled detection head branch based on YOLOX, for segmentation and detection tasks respectively. The architecture of MuraNet is designed to leverage the fact that walls, doors, and windows usually constitute the primary structure of a floor plan's architecture. By jointly training the model on both detection and segmentation tasks, we believe MuraNet can effectively extract and utilize relevant features for both tasks. Our experiments on the CubiCasa5k public dataset show that MuraNet improves convergence speed during training compared to single-task models like U-Net and YOLOv3. Moreover, we observe improvements in the average AP and IoU in detection and segmentation tasks, respectively.Our ablation experiments demonstrate that the attention-based unified backbone of MuraNet achieves better feature extraction in floor plan recognition tasks, and the use of decoupled multi-head branches for different tasks further improves model performance. We believe that our proposed MuraNet model can address the disadvantages of single-task models and improve the accuracy and efficiency of floor plan data recognition.

READ FULL TEXT
research
07/07/2023

TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning

The AllInOne training paradigm squeezes a wide range of tasks into a uni...
research
05/04/2023

MTLSegFormer: Multi-task Learning with Transformers for Semantic Segmentation in Precision Agriculture

Multi-task learning has proven to be effective in improving the performa...
research
02/02/2023

AOP-Net: All-in-One Perception Network for Joint LiDAR-based 3D Object Detection and Panoptic Segmentation

LiDAR-based 3D object detection and panoptic segmentation are two crucia...
research
11/28/2019

Fruit Detection, Segmentation and 3D Visualisation of Environments in Apple Orchards

Robotic harvesting of fruits in orchards is a challenging task, since hi...
research
08/29/2019

Deep Floor Plan Recognition Using a Multi-Task Network with Room-Boundary-Guided Attention

This paper presents a new approach to recognize elements in floor plan l...
research
03/23/2023

Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition

As one of the major branches of automatic speech recognition, attention-...
research
10/27/2016

Single- and Multi-Task Architectures for Tool Presence Detection Challenge at M2CAI 2016

The tool presence detection challenge at M2CAI 2016 consists of identify...

Please sign up or login with your details

Forgot password? Click here to reset