Analysis of the first Genetic Engineering Attribution Challenge

10/14/2021
by   Oliver M. Crook, et al.
0

The ability to identify the designer of engineered biological sequences – termed genetic engineering attribution (GEA) – would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Engineering Attribution Challenge, a public data-science competition to advance GEA. Top-scoring teams dramatically outperformed previous models at identifying the true lab-of-origin of engineered sequences, including an increase in top-1 and top-10 accuracy of 10 percentage points. A simple ensemble of prizewinning models further increased performance. New metrics, designed to assess a model's ability to confidently exclude candidate labs, also showed major improvements, especially for the ensemble. Most winning teams adopted CNN-based machine-learning approaches; however, one team achieved very high accuracy with an extremely fast neural-network-free approach. Future work, including future competitions, should further explore a wide diversity of approaches for bringing GEA technology into practical use.

READ FULL TEXT

page 2

page 17

page 18

page 20

page 22

page 24

page 30

page 32

research
07/16/2021

Ranking labs-of-origin for genetically engineered DNA using Metric Learning

With the constant advancements of genetic engineering, a common concern ...
research
11/24/2021

Deep metric learning improves lab of origin prediction of genetically engineered plasmids

Genome engineering is undergoing unprecedented development and is now be...
research
01/30/2020

Authorship Attribution of Source Code: A Language-Agnostic Approach and Applicability in Software Engineering

Authorship attribution of source code has been an established research t...
research
02/13/2023

Machine Learning Model Attribution Challenge

We present the findings of the Machine Learning Model Attribution Challe...
research
11/22/2022

OpenFE: Automated Feature Generation beyond Expert-level Performance

The goal of automated feature generation is to liberate machine learning...
research
11/26/2019

CAWA: An Attention-Network for Credit Attribution

Credit attribution is the task of associating individual parts in a docu...
research
06/26/2018

The NIPS'17 Competition: A Multi-View Ensemble Classification Model for Clinically Actionable Genetic Mutations

This paper presents details of our winning solutions to the task IV of N...

Please sign up or login with your details

Forgot password? Click here to reset