Bayesian Heatmaps: Probabilistic Classification with Multiple Unreliable Information Sources

04/05/2019
by Edwin Simpson, et al.

Unstructured data from diverse sources, such as social media and aerial imagery, can provide valuable up-to-date information for intelligent situation assessment. Mining these different information sources could bring major benefits to applications such as situation awareness in disaster zones and mapping the spread of diseases. Such applications depend on classifying the situation across a region of interest, which can be depicted as a spatial "heatmap". Annotating unstructured data using crowdsourcing or automated classifiers produces individual classifications at sparse locations that typically contain many errors. We propose a novel Bayesian approach that models the relevance, error rates and bias of each information source, enabling us to learn a spatial Gaussian Process classifier by aggregating data from multiple sources with varying reliability and relevance. Our method does not require gold-labelled data and can make predictions at any location in an area of interest given only sparse observations. We show empirically that our approach can handle noisy and biased data sources, and that simultaneously inferring reliability and transferring information between neighbouring reports leads to more accurate predictions. We demonstrate our method on two real-world problems from disaster response, showing how our approach reduces the amount of crowdsourced data required and can be used to generate valuable heatmap visualisations from SMS messages and satellite images.
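
To make the idea of modelling each source's error rates and bias concrete, here is a minimal sketch of fusing binary reports at one location with Bayes' rule. It is an illustration under simplifying assumptions, not the method proposed in the paper: the per-source confusion matrices are taken as known (the paper infers them without gold labels), reports are treated as conditionally independent given the true class, and there is no Gaussian Process sharing information between neighbouring locations. The function `posterior_positive`, the source names and all numeric values are hypothetical.

```python
import math


def posterior_positive(prior, reports, confusion):
    """Return P(true class = 1 | reports) for a single location.

    prior     -- P(true class = 1) before any reports are seen
    reports   -- list of (source_name, label) pairs, label in {0, 1}
    confusion -- source_name -> 2x2 nested list, where
                 confusion[s][t][c] = P(source s reports c | true class t)
    """
    # Work in log-odds so each report simply adds a log-likelihood ratio.
    log_odds = math.log(prior) - math.log(1.0 - prior)
    for source, label in reports:
        pi = confusion[source]
        log_odds += math.log(pi[1][label]) - math.log(pi[0][label])
    return 1.0 / (1.0 + math.exp(-log_odds))


# Hypothetical sources (assumed values, not taken from the paper).
confusion = {
    "reliable": [[0.9, 0.1], [0.2, 0.8]],
    "biased": [[0.4, 0.6], [0.1, 0.9]],  # reports class 1 most of the time
    "random": [[0.5, 0.5], [0.5, 0.5]],  # carries no information
}

if __name__ == "__main__":
    for name in confusion:
        p = posterior_positive(0.5, [(name, 1)], confusion)
        print(f"{name:8s} reports 1 -> P(class 1) = {p:.3f}")
```

In this toy setting a positive report from the biased source moves the posterior less than the same report from the reliable source, and a report from the random source leaves the prior unchanged. That is the sense in which modelling reliability lets noisy or biased sources be down-weighted rather than discarded.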

