DataCI: A Platform for Data-Centric AI on Streaming Data

06/27/2023
by   Huaizheng Zhang, et al.
0

We introduce DataCI, a comprehensive open-source platform designed specifically for data-centric AI in dynamic streaming data settings. DataCI provides 1) an infrastructure with rich APIs for seamless streaming dataset management, data-centric pipeline development and evaluation on streaming scenarios, 2) an carefully designed versioning control function to track the pipeline lineage, and 3) an intuitive graphical interface for a better interactive user experience. Preliminary studies and demonstrations attest to the easy-to-use and effectiveness of DataCI, highlighting its potential to revolutionize the practice of data-centric AI in streaming data contexts.

READ FULL TEXT

page 1

page 2

page 3

research
03/02/2023

Alexa Arena: A User-Centric Interactive Platform for Embodied AI

We introduce Alexa Arena, a user-centric simulation platform for Embodie...
research
11/26/2022

The Principles of Data-Centric AI (DCAI)

Data is a crucial infrastructure to how artificial intelligence (AI) sys...
research
12/07/2021

Augment Valuate : A Data Enhancement Pipeline for Data-Centric AI

Data scarcity and noise are important issues in industrial applications ...
research
11/13/2017

COMBINE: a novel drug discovery platform designed to capture insight and experience of users

The insight and experience gained by a researcher are often lost because...
research
07/19/2022

Active-Learning-as-a-Service: An Efficient MLOps System for Data-Centric AI

The success of today's AI applications requires not only model training ...
research
11/23/2021

AutoDC: Automated data-centric processing

AutoML (automated machine learning) has been extensively developed in th...
research
09/28/2021

Restructuring Serverless Computing with Data-Centric Function Orchestration

Serverless applications are usually composed of multiple short-lived, si...

Please sign up or login with your details

Forgot password? Click here to reset