DetAIL : A Tool to Automatically Detect and Analyze Drift In Language

11/03/2022
by Nishtha Madaan, et al.

Machine learning and deep learning-based decision making has become part of today's software. The goal of this work is to ensure that machine learning and deep learning-based systems are as trusted as traditional software. Traditional software is made dependable by following rigorous practices such as static analysis, testing, debugging, verification, and repair throughout the development and maintenance life cycle. Similarly, machine learning models need to be kept up to date so that their performance is not compromised. Current systems do this by re-training models on a fixed schedule as new data arrives. In this work, we propose to measure the data drift that occurs when new data arrives, so that models can be re-trained adaptively, whenever re-training is actually required, irrespective of schedules. In addition, we generate explanations at the sentence level and the dataset level to capture why a given payload text has drifted.
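The abstract does not say which drift measure the tool uses. As an illustration only, the idea of re-training when drift is detected, rather than on a schedule, can be sketched with a simple stand-in: the Jensen-Shannon divergence between the token distributions of the training (reference) text and the incoming payload text. The function names, the whitespace tokenizer, and the 0.3 threshold are all assumptions made for this sketch, not details from the paper.

```python
import math
from collections import Counter


def token_distribution(texts):
    """Relative frequency of each lowercased, whitespace-separated token."""
    counts = Counter(tok for text in texts for tok in text.lower().split())
    total = sum(counts.values())
    return {tok: c / total for tok, c in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence with base-2 logs, so the result lies in [0, 1]."""
    divergence = 0.0
    for tok in set(p) | set(q):
        pi, qi = p.get(tok, 0.0), q.get(tok, 0.0)
        mi = 0.5 * (pi + qi)  # mixture distribution
        if pi > 0:
            divergence += 0.5 * pi * math.log2(pi / mi)
        if qi > 0:
            divergence += 0.5 * qi * math.log2(qi / mi)
    return divergence


def should_retrain(reference_texts, payload_texts, threshold=0.3):
    """Return (drift_score, retrain_flag); the threshold is a hypothetical value."""
    score = js_divergence(
        token_distribution(reference_texts),
        token_distribution(payload_texts),
    )
    return score, score > threshold


if __name__ == "__main__":
    reference = ["the movie was great", "the plot was weak"]
    payload = ["stock prices fell sharply", "the market was weak"]
    score, retrain = should_retrain(reference, payload)
    print(f"drift score: {score:.3f}, retrain: {retrain}")
```

A score near 0 means the payload vocabulary looks like the reference data, and a score near 1 means the two barely overlap. In practice the threshold would need to be tuned per model and dataset.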

research
04/07/2020

Adversarial Validation Approach to Concept Drift Problem in Automated Machine Learning Systems

In automated machine learning systems, concept drift in input data is on...
research
09/12/2019

Classifying Multilingual User Feedback using Traditional Machine Learning and Deep Learning

With the rise of social media like Twitter and of software distribution ...
research
07/28/2021

Clones in Deep Learning Code: What, Where, and Why?

Deep Learning applications are becoming increasingly popular. Developers...
research
03/03/2020

Data Invariants: On Trust in Data-Driven Systems

The reliability and proper function of data-driven applications hinge on...
research
02/20/2018

Bayesian Incremental Learning for Deep Neural Networks

In industrial machine learning pipelines, data often arrive in parts. Pa...
research
06/29/2023

A Query Language for Software Architecture Information (Extended version)

Software maintenance is an important part of a software system's life cy...
research
11/28/2020

Exoplanet Detection using Machine Learning

We introduce a new machine learning based technique to detect exoplanets...