Global forensic geolocation with deep neural networks

05/28/2019
by   Neal S. Grantham, et al.
0

An important problem in forensic analyses is identifying the provenance of materials at a crime scene, such as biological material on a piece of clothing. This procedure, known as geolocation, is conventionally guided by expert knowledge of the biological evidence and therefore tends to be application-specific, labor-intensive, and subjective. Purely data-driven methods have yet to be fully realized due in part to the lack of a sufficiently rich data source. However, high-throughput sequencing technologies are able to identify tens of thousands of microbial taxa using DNA recovered from a single swab collected from nearly any object or surface. We present a new algorithm for geolocation that aggregates over an ensemble of deep neural network classifiers trained on randomly-generated Voronoi partitions of a spatial domain. We apply the algorithm to fungi present in each of 1300 dust samples collected across the continental United States and then to a global dataset of dust samples from 28 countries. Our algorithm makes remarkably good point predictions with more than half of the geolocation errors under 100 kilometers for the continental analysis and nearly 90 sample's country of origin for the global analysis. We suggest that the effectiveness of this model sets the stage for a new, quantitative approach to forensic geolocation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/18/2019

Cluster Analysis of High-Dimensional scRNA Sequencing Data

With ongoing developments and innovations in single-cell RNA sequencing ...
research
01/14/2020

Methodologies for Successful Segmentation of HRTEM Images via Neural Network

High throughput analysis of samples has been a topic increasingly discus...
research
02/10/2019

Paradigm shift in electron-based crystallography via machine learning

Accurately determining the crystallographic structure of a material, org...
research
08/22/2023

The growth effects of tropical cyclones in the U.S.: new evidence from state to county level

Tropical cyclones have always been a concern for public authorities in t...
research
04/01/2022

A Physics-Guided Neural Operator Learning Approach to Model Biological Tissues from Digital Image Correlation Measurements

We present a data-driven workflow to biological tissue modeling, which a...
research
04/24/2018

SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach

The SimpleQuestions dataset is one of the most commonly used benchmarks ...
research
06/16/2018

A nonparametric spatial test to identify factors that shape a microbiome

The advent of high-throughput sequencing technologies has made data from...

Please sign up or login with your details

Forgot password? Click here to reset