The GeoLifeCLEF 2023 Dataset to evaluate plant species distribution models at high spatial resolution across Europe

08/07/2023
by Christophe Botella, et al.

The difficulty of measuring or predicting species community composition at fine spatio-temporal resolution and over large spatial scales severely hampers our ability to understand species assemblages and take appropriate conservation measures. Despite the progress in species distribution modeling (SDM) over the past decades, SDMs have only begun to integrate high-resolution remote sensing data, and their predictions are still affected by many biases due to the heterogeneity of the available biodiversity observations, which are most often opportunistic presence-only data. We designed a European-scale dataset covering around ten thousand plant species to calibrate and evaluate SDM predictions of species composition in space and time at high spatial resolution (ten meters), as well as their spatial transferability. For model training, we extracted and harmonized five million heterogeneous presence-only records from selected GBIF datasets and six thousand exhaustive presence-absence surveys, both sampled during 2017-2021. We linked species observations to diverse environmental rasters classically used in SDMs, as well as to 10 m resolution RGB and near-infrared satellite images and 20-year time series of climatic variables and satellite point values. The evaluation dataset is based on 22 thousand standardized presence-absence surveys separated from the training set with a spatial block hold-out procedure. The GeoLifeCLEF 2023 dataset is open access and is the first benchmark for researchers aiming to improve the prediction of plant species composition at a very fine spatial grain and at continental scale. It offers a space to explore new ways of combining massive and diverse species observations with environmental information at various scales. Innovative AI-based approaches, in particular, should be among the most interesting methods to experiment with on the GeoLifeCLEF 2023 dataset.
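The abstract does not detail how the spatial block hold-out is carried out, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes a regular latitude/longitude grid; the function names, the block size `block_deg`, and the `test_fraction` value are all hypothetical choices made for illustration. Each survey is assigned to a grid block, and whole blocks (rather than individual surveys) are sent to the evaluation set, so that training and evaluation sites are never drawn from the same block.

```python
import math
import random

def block_id(lat, lon, block_deg=0.1):
    """Return the index of the grid block containing a coordinate.

    block_deg is a hypothetical block size in degrees (an assumption).
    """
    return (math.floor(lat / block_deg), math.floor(lon / block_deg))

def spatial_block_split(points, block_deg=0.1, test_fraction=0.2, seed=0):
    """Split (lat, lon) points so that no block contributes to both sets."""
    # Collect the distinct blocks, in a deterministic order before shuffling.
    blocks = sorted({block_id(lat, lon, block_deg) for lat, lon in points})
    rng = random.Random(seed)
    rng.shuffle(blocks)
    # Hold out a fraction of the blocks (at least one) for evaluation.
    n_test = max(1, round(len(blocks) * test_fraction))
    test_blocks = set(blocks[:n_test])
    train, test = [], []
    for lat, lon in points:
        target = test if block_id(lat, lon, block_deg) in test_blocks else train
        target.append((lat, lon))
    return train, test

# Illustrative coordinates only; these are not taken from the dataset.
points = [(43.61, 3.87), (43.62, 3.88), (48.85, 2.35),
          (52.52, 13.40), (41.90, 12.49), (40.42, -3.70)]
train, test = spatial_block_split(points, block_deg=0.5, test_fraction=0.3)
```

Holding out entire blocks is what makes such a split a test of spatial transferability: a random split over individual surveys would leave near-duplicate neighbouring sites on both sides and tend to overstate performance.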


research
04/08/2020

The GeoLifeCLEF 2020 Dataset

Understanding the geographic distribution of species is a key concern in...
research
06/07/2021

Digital Taxonomist: Identifying Plant Species in Citizen Scientists' Photographs

Automatic identification of plant specimens from amateur photographs cou...
research
09/19/2023

Predicting fine-scale taxonomic variation in landscape vegetation using large satellite imagery data sets

Accurate information on the distribution of vegetation species is used a...
research
07/17/2023

Improving Data Efficiency for Plant Cover Prediction with Label Interpolation and Monte-Carlo Cropping

The plant community composition is an essential indicator of environment...
research
01/08/2021

Extracting Pasture Phenotype and Biomass Percentages using Weakly Supervised Multi-target Deep Learning on a Small Dataset

The dairy industry uses clover and grass as fodder for cows. Accurate es...
research
03/02/2019

Spatio-Temporal Vegetation Pixel Classification By Using Convolutional Networks

Plant phenology studies rely on long-term monitoring of life cycles of p...
research
09/19/2019

Evaluation of Deep Species Distribution Models using Environment and Co-occurrences

This paper presents an evaluation of several approaches of plants specie...
