A Topic Modeling Approach to Classifying Open Street Map Health Clinics and Schools in Sub-Saharan Africa

12/22/2022
by   Joshua W. Anderson, et al.
0

Data deprivation, or the lack of easily available and actionable information on the well-being of individuals, is a significant challenge for the developing world and an impediment to the design and operationalization of policies intended to alleviate poverty. In this paper we explore the suitability of data derived from OpenStreetMap to proxy for the location of two crucial public services: schools and health clinics. Thanks to the efforts of thousands of digital humanitarians, online mapping repositories such as OpenStreetMap contain millions of records on buildings and other structures, delineating both their location and often their use. Unfortunately much of this data is locked in complex, unstructured text rendering it seemingly unsuitable for classifying schools or clinics. We apply a scalable, unsupervised learning method to unlabeled OpenStreetMap building data to extract the location of schools and health clinics in ten countries in Africa. We find the topic modeling approach greatly improves performance versus reliance on structured keys alone. We validate our results by comparing schools and clinics identified by our OSM method versus those identified by the WHO, and describe OSM coverage gaps more broadly.

READ FULL TEXT

page 3

page 7

research
11/30/2017

Predicting Severe Sepsis Using Text from the Electronic Health Record

Employing a machine learning approach we predict, up to 24 hours prior, ...
research
09/17/2020

Deploying machine learning to assist digital humanitarians: making image annotation in OpenStreetMap more efficient

Locating populations in rural areas of developing countries has attracte...
research
08/22/2021

Reflections, Learnings and Proposed Interventions on Data Validation and Data Use for Action in Health: A Case of Mozambique

The ideal of a country's health information system (HIS) is to develop p...
research
04/26/2022

Using Machine Learning to Fuse Verbal Autopsy Narratives and Binary Features in the Analysis of Deaths from Hyperglycaemia

Lower-and-middle income countries are faced with challenges arising from...
research
07/11/2023

Research Protocol for the Google Health Digital Well-being Study

The impact of digital device use on health and well-being is a pressing ...
research
04/19/2012

Avian Influenza (H5N1) Warning System using Dempster-Shafer Theory and Web Mapping

Based on Cumulative Number of Confirmed Human Cases of Avian Influenza (...

Please sign up or login with your details

Forgot password? Click here to reset