MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene Understanding

by   Xiaoman Qi, et al.

To better understand scene images in the field of remote sensing, multi-label annotation of scene images is necessary. Moreover, to enhance the performance of deep learning models for dealing with semantic scene understanding tasks, it is vital to train them on large-scale annotated data. However, most existing datasets are annotated by a single label, which cannot describe the complex remote sensing images well because scene images might have multiple land cover classes. Few multi-label high spatial resolution remote sensing datasets have been developed to train deep learning models for multi-label based tasks, such as scene classification and image retrieval. To address this issue, in this paper, we construct a multi-label high spatial resolution remote sensing dataset named MLRSNet for semantic scene understanding with deep learning from the overhead perspective. It is composed of high-resolution optical satellite or aerial images. MLRSNet contains a total of 109,161 samples within 46 scene categories, and each image has at least one of 60 predefined labels. We have designed visual recognition tasks, including multi-label based image classification and image retrieval, in which a wide variety of deep learning approaches are evaluated with MLRSNet. The experimental results demonstrate that MLRSNet is a significant benchmark for future research, and it complements the current widely used datasets such as ImageNet, which fills gaps in multi-label image research. Furthermore, we will continue to expand the MLRSNet. MLRSNet and all related materials have been made publicly available at and



There are no comments yet.


page 7

page 12

page 13

page 19

page 24


BigEarthNet Deep Learning Models with A New Class-Nomenclature for Remote Sensing Image Understanding

Success of deep neural networks in the framework of remote sensing (RS) ...

Region Convolutional Features for Multi-Label Remote Sensing Image Retrieval

Conventional remote sensing image retrieval (RSIR) systems usually perfo...

Evaluating Explainable Artificial Intelligence Methods for Multi-label Deep Learning Classification Tasks in Remote Sensing

Although deep neural networks hold the state-of-the-art in several remot...

BigEarthNet-MM: A Large Scale Multi-Modal Multi-Label Benchmark Archive for Remote Sensing Image Classification and Retrieval

This paper presents the multi-modal BigEarthNet (BigEarthNet-MM) benchma...

Unifying Remote Sensing Image Retrieval and Classification with Robust Fine-tuning

Advances in high resolution remote sensing image analysis are currently ...

RSBNet: One-Shot Neural Architecture Search for A Backbone Network in Remote Sensing Image Recognition

Recently, a massive number of deep learning based approaches have been s...

Signal Conditioning for Learning in the Wild

The mammalian olfactory system learns rapidly from very few examples, pr...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.