Reconstruction of Sentinel-2 Time Series Using Robust Gaussian Mixture Models – Application to the Detection of Anomalous Crop Development in wheat and rapeseed crops

10/22/2021
by   Florian Mouret, et al.

Missing data is a recurrent problem in remote sensing, mainly due to cloud coverage for multispectral images and acquisition problems. This can be a critical issue for crop monitoring, especially for applications relying on machine learning techniques, which generally assume that the feature matrix has no missing values. This paper proposes a Gaussian Mixture Model (GMM) for the reconstruction of parcel-level features extracted from multispectral images. A robust version of the GMM is also investigated, since datasets can be contaminated by inaccurate samples or features (e.g., wrong crop type reported, inaccurate boundaries, undetected clouds, etc.). Additional features extracted from Sentinel-1 Synthetic Aperture Radar (SAR) images are also used to provide complementary information and improve the imputations. The robust GMM investigated in this work assigns reduced weights to the outliers during the estimation of the GMM parameters, which improves the final reconstruction. These weights are computed at each step of an Expectation-Maximization (EM) algorithm by using outlier scores provided by the isolation forest algorithm. Experimental validation is conducted on rapeseed and wheat parcels located in the Beauce region (France). Overall, we show that the GMM imputation method outperforms other reconstruction strategies. A mean absolute error (MAE) of 0.013 (resp. 0.019) is obtained for the imputation of the median Normalized Difference Vegetation Index (NDVI) of the rapeseed (resp. wheat) parcels. Other indicators (e.g., Normalized Difference Water Index) and statistics (for instance the interquartile range, which captures the heterogeneity of an indicator within a parcel) are reconstructed at the same time with good accuracy. In a dataset contaminated by irrelevant samples, using the robust GMM is recommended since the standard GMM imputation can lead to inaccurate imputed values.
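
To make the idea of GMM-based reconstruction concrete, the following is a minimal sketch of conditional-mean imputation under a Gaussian mixture. It is not the authors' implementation. It makes several simplifying assumptions: the mixture is fitted with scikit-learn's `GaussianMixture` on complete rows only (rather than by an EM algorithm that handles missing values), the robust isolation-forest weighting is not included, and the data are synthetic stand-ins for parcel-level features. The helper name `gmm_impute` and all numerical settings are hypothetical.

```python
import numpy as np
from scipy.stats import multivariate_normal
from sklearn.mixture import GaussianMixture


def gmm_impute(X, gmm):
    """Replace NaNs in X by the GMM conditional mean given the observed features."""
    X_imputed = X.copy()
    n_comp = gmm.n_components
    for i, x in enumerate(X):
        miss = np.isnan(x)
        if not miss.any():
            continue
        obs = ~miss
        if not obs.any():
            # Nothing observed in this row: fall back to the overall mixture mean.
            X_imputed[i] = gmm.weights_ @ gmm.means_
            continue
        log_resp = np.empty(n_comp)
        cond_means = np.empty((n_comp, miss.sum()))
        for k in range(n_comp):
            mu, cov = gmm.means_[k], gmm.covariances_[k]
            cov_oo = cov[np.ix_(obs, obs)]
            cov_mo = cov[np.ix_(miss, obs)]
            # Unnormalized log posterior of component k given the observed part.
            log_resp[k] = np.log(gmm.weights_[k]) + multivariate_normal.logpdf(
                x[obs], mu[obs], cov_oo
            )
            # Gaussian conditional mean of the missing part given the observed part.
            cond_means[k] = mu[miss] + cov_mo @ np.linalg.solve(cov_oo, x[obs] - mu[obs])
        resp = np.exp(log_resp - log_resp.max())
        resp /= resp.sum()
        # Mixture of per-component conditional means, weighted by responsibilities.
        X_imputed[i, miss] = resp @ cond_means
    return X_imputed


# Synthetic, strongly correlated features in two clusters (NOT real Sentinel data).
rng = np.random.default_rng(0)
cov = 0.01 * np.array([[1.0, 0.9, 0.8], [0.9, 1.0, 0.9], [0.8, 0.9, 1.0]])
X_full = np.vstack([
    rng.multivariate_normal([0.3, 0.5, 0.7], cov, 300),
    rng.multivariate_normal([0.6, 0.6, 0.4], cov, 300),
])

# Hide roughly 20% of the entries, mimicking cloud-induced gaps.
mask = rng.random(X_full.shape) < 0.2
X_missing = X_full.copy()
X_missing[mask] = np.nan

# Simplification: fit the mixture on the rows that have no missing entries.
complete_rows = ~np.isnan(X_missing).any(axis=1)
gmm = GaussianMixture(n_components=2, covariance_type="full", random_state=0)
gmm.fit(X_missing[complete_rows])

X_gmm = gmm_impute(X_missing, gmm)

# Baseline for comparison: column-wise mean imputation.
X_mean = np.where(mask, np.nanmean(X_missing, axis=0), X_missing)

mae_gmm = np.abs(X_gmm[mask] - X_full[mask]).mean()
mae_mean = np.abs(X_mean[mask] - X_full[mask]).mean()
print(f"MAE (GMM conditional mean): {mae_gmm:.4f}")
print(f"MAE (column mean):          {mae_mean:.4f}")
```

The sketch relies on correlations between features: the more strongly the observed features predict the missing ones, the more the conditional mean improves over a simple mean fill. A robust variant along the lines described in the abstract would additionally down-weight outlying samples when estimating the mixture parameters; that step is deliberately left out here because the weighting scheme is not specified in this text.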


research
01/28/2022

A Robust and Flexible EM Algorithm for Mixtures of Elliptical Distributions with Missing Data

This paper tackles the problem of missing data imputation for noisy and ...
research
09/16/2018

Semiparametric fractional imputation using Gaussian mixture models for handling multivariate missing data

Item nonresponse is frequently encountered in practice. Ignoring missing...
research
09/14/2019

Semiparametric Imputation Using Conditional Gaussian Mixture Models under Item Nonresponse

Imputation is a popular technique for handling item nonresponse in surve...
research
07/14/2023

Combining multitemporal optical and SAR data for LAI imputation with BiLSTM network

The Leaf Area Index (LAI) is vital for predicting winter wheat yield. Ac...
research
04/12/2022

Spatiotemporal Estimation of TROPOMI NO2 Column with Depthwise Partial Convolutional Neural Network

Satellite-derived measurements are negatively impacted by cloud cover an...
research
11/12/2012

A Comparative Study of Gaussian Mixture Model and Radial Basis Function for Voice Recognition

A comparative study of the application of Gaussian Mixture Model (GMM) a...
