Safe machine learning model release from Trusted Research Environments: The AI-SDC package

12/02/2022
by   Jim Smith, et al.
0

We present AI-SDC, an integrated suite of open source Python tools to facilitate Statistical Disclosure Control (SDC) of Machine Learning (ML) models trained on confidential data prior to public release. AI-SDC combines (i) a SafeModel package that extends commonly used ML models to provide ante-hoc SDC by assessing the vulnerability of disclosure posed by the training regime; and (ii) an Attacks package that provides post-hoc SDC by rigorously assessing the empirical disclosure risk of a model through a variety of simulated attacks after training. The AI-SDC code and documentation are available under an MIT license at https://github.com/AI-SDC/AI-SDC.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2020

Brain Predictability toolbox: a Python library for neuroimaging based machine learning

Summary Brain Predictability toolbox (BPt) represents a unified framewor...
research
12/06/2022

ACRO: A multi-language toolkit for supporting Automated Checking of Research Outputs

This paper discusses the development of an open source tool ACRO, (Autom...
research
02/13/2018

DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation

It is safe to assume that, for the foreseeable future, machine learning,...
research
09/14/2021

Tuna-AI: tuna biomass estimation with Machine Learning models trained on oceanography and echosounder FAD data

Echo-sounder data registered by buoys attached to drifting FADs provide ...
research
05/07/2023

PiML Toolbox for Interpretable Machine Learning Model Development and Validation

PiML (read π-ML, /`pai.`em.`el/) is an integrated and open-access Python...
research
07/01/2022

Shai-am: A Machine Learning Platform for Investment Strategies

The finance industry has adopted machine learning (ML) as a form of quan...
research
08/21/2023

Majorana Demonstrator Data Release for AI/ML Applications

The enclosed data release consists of a subset of the calibration data f...

Please sign up or login with your details

Forgot password? Click here to reset