Uncheatable Machine Learning Inference

08/08/2019
by Mustafa Canim, et al.

Classification-as-a-Service (CaaS) is widely deployed today in machine intelligence stacks for a diverse set of applications, ranging from medical prognosis and computer vision to natural language processing and identity fraud detection. Training complex models on large datasets to perform inference for these problems can be very resource-intensive. A CaaS provider may therefore cheat a customer by bypassing expensive training procedures in favor of weaker, less computationally intensive algorithms that yield results of reduced quality. Given a classification service supplier S, an intermediary CaaS provider P claiming to use S as a classification backend, and a customer C, our work addresses the following questions: (i) how can C verify P's claim to be using S? (ii) how might S make performance guarantees that C can verify? and (iii) how might one design a decentralized system that incentivizes service proofing and accountability? To this end, we propose a variety of methods for C to evaluate the service claims made by P using probabilistic performance metrics, instance seeding, and steganography. We also propose a method of measuring the robustness of a model using a black-box adversarial procedure; the measurement may then be used as a benchmark or compared against a claim made by S. Finally, we propose the design of a smart contract-based decentralized system that incentivizes service accountability and serves as a trusted Quality of Service (QoS) auditor.
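
To make the instance-seeding idea concrete, the following is a minimal sketch and not the procedure from the paper, whose details the abstract does not give. It assumes C already holds a set of seed inputs whose labels were obtained directly from S, hides them among ordinary queries sent to P, and then asks how likely the observed agreement would be if P were running a weaker model. The names `audit_provider`, `binomial_tail`, `null_agreement`, and `alpha` are hypothetical, and the exact binomial tail test is an illustrative choice of probabilistic metric.

```python
import math
import random
from typing import Callable, Hashable, List, Optional, Sequence, Tuple


def binomial_tail(n: int, k: int, p: float) -> float:
    """Return P[X >= k] for X ~ Binomial(n, p), computed exactly."""
    return sum(
        math.comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1)
    )


def audit_provider(
    query_provider: Callable[[List[Hashable]], List[Hashable]],
    seeds: Sequence[Tuple[Hashable, Hashable]],
    fillers: Sequence[Hashable],
    null_agreement: float = 0.8,  # assumed: best agreement a weaker model could reach
    alpha: float = 0.01,          # assumed significance level
    rng: Optional[random.Random] = None,
) -> Tuple[int, int, float, bool]:
    """Hide seed instances among filler queries and test P's agreement with S.

    seeds   -- (input, label_from_S) pairs that C obtained directly from S
    fillers -- ordinary inputs, so P cannot tell which queries are seeds
    Returns (matches, n_seeds, p_value, claim_supported).
    """
    rng = rng or random.Random(0)
    # Mix seeds and fillers; fillers carry no reference label.
    batch = [(x, y) for x, y in seeds] + [(x, None) for x in fillers]
    rng.shuffle(batch)

    answers = query_provider([x for x, _ in batch])

    # Count how many seeded instances P labelled the same way S did.
    matches = sum(
        1 for (_, y), a in zip(batch, answers) if y is not None and a == y
    )
    n = len(seeds)

    # Null hypothesis: P agrees with S on at most `null_agreement` of inputs.
    # A small p-value means the observed agreement is hard to explain
    # with a weaker model, which supports P's claim.
    p_value = binomial_tail(n, matches, null_agreement)
    return matches, n, p_value, p_value < alpha
```

Under these assumptions, a provider that forwards every query to S matches all seeds and yields a very small p-value, while a provider that substitutes a cheaper model matches fewer seeds and fails the test. The seed count and `null_agreement` jointly determine how confidently C can separate the two cases.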

