FrugalMCT: Efficient Online ML API Selection for Multi-Label Classification Tasks

02/18/2021
by   Lingjiao Chen, et al.

Multi-label classification tasks such as OCR and multi-object recognition are a major focus of the growing machine-learning-as-a-service industry. While many multi-label prediction APIs are available, it is challenging for users to decide which API to use for their own data and budget, due to the heterogeneity in those APIs' price and performance. Recent work shows how to select from single-label prediction APIs. However, the computational complexity of the previous approach is exponential in the number of labels and hence is not suitable for settings like OCR. In this work, we propose FrugalMCT, a principled framework that adaptively selects the APIs to use for different data in an online fashion while respecting the user's budget. The API selection problem is cast as an integer linear program, which we show has a special structure that we leverage to develop an efficient online API selector with strong performance guarantees. We conduct systematic experiments using ML APIs from Google, Microsoft, Amazon, IBM, Tencent and other providers for tasks including multi-label image classification, scene text recognition and named entity recognition. Across diverse tasks, FrugalMCT can achieve over 90% cost reduction while matching the accuracy of the best single API, or up to 8% better accuracy while matching the best API's cost.
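To make the integer-program view of budgeted API selection concrete, here is a minimal sketch in Python. It is not the FrugalMCT algorithm and does not reproduce the paper's exact formulation: it assumes a simplified setting in which exactly one API is chosen per data point, the estimated accuracies and prices are made-up numbers, and the tiny instance is solved by exhaustive enumeration rather than by the efficient online selector the abstract describes. The function name `select_apis` and all parameters are hypothetical.

```python
from itertools import product


def select_apis(est_accuracy, cost, budget):
    """Exhaustively solve a tiny budgeted API-selection integer program.

    est_accuracy[i][k]: assumed estimated accuracy of API k on data point i
    cost[k]:            assumed price of one call to API k
    budget:             maximum total spend across all data points

    Returns (assignment, total_accuracy, total_cost), where assignment[i]
    is the index of the API chosen for data point i, or None if no
    assignment fits within the budget.
    """
    n_items = len(est_accuracy)
    n_apis = len(cost)
    best = None
    # Enumerate every way of assigning one API to each data point.
    for assignment in product(range(n_apis), repeat=n_items):
        total_cost = sum(cost[k] for k in assignment)
        if total_cost > budget:
            continue  # violates the budget constraint
        total_acc = sum(est_accuracy[i][k] for i, k in enumerate(assignment))
        if best is None or total_acc > best[1]:
            best = (assignment, total_acc, total_cost)
    return best


# Illustrative, made-up numbers: API 0 is cheap, API 1 is expensive.
est_accuracy = [
    [0.90, 0.95],  # easy input: the cheap API is nearly as good
    [0.40, 0.90],  # hard input: the expensive API helps a lot
    [0.85, 0.88],  # easy input
]
cost = [1, 10]
budget = 12  # enough for at most one expensive call

assignment, total_acc, total_cost = select_apis(est_accuracy, cost, budget)
print(assignment, round(total_acc, 2), total_cost)
```

In this toy instance the budget covers only one expensive call, and the search spends it on the data point where the expensive API improves accuracy the most. Enumeration grows exponentially with the number of data points, so it only serves to illustrate the objective and the budget constraint.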


research
06/12/2020

FrugalML: How to Use ML Prediction APIs More Accurately and Cheaply

Prediction APIs offered for a fee are a fast-growing industry and an imp...
research
11/09/2018

Automated Multi-Label Classification based on ML-Plan

Automated machine learning (AutoML) has received increasing attention in...
research
09/18/2022

HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions

Commercial ML APIs offered by providers such as Google, Amazon and Micro...
research
05/09/2023

Semantic Embedded Deep Neural Network: A Generic Approach to Boost Multi-Label Image Classification Performance

Fine-grained multi-label classification models have broad applications i...
research
09/26/2018

A Novel Online Stacked Ensemble for Multi-Label Stream Classification

As data streams become more prevalent, the necessity for online algorith...
research
06/20/2022

DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations

Solving multi-label recognition (MLR) for images in the low-label regime...
research
12/03/2022

Harnessing label semantics to extract higher performance under noisy label for Company to Industry matching

Assigning appropriate industry tag(s) to a company is a critical task in...
