Stream Efficient Learning

05/03/2023
by   Zhi-Hua Zhou, et al.
0

Data in many real-world applications are often accumulated over time, like a stream. In contrast to conventional machine learning studies that focus on learning from a given training data set, learning from data streams cannot ignore the fact that the incoming data stream can be potentially endless with overwhelming size and unknown changes, and it is impractical to assume to have sufficient computational/storage resource such that all received data can be handled in time. Thus, the generalization performance of learning from data streams depends not only on how many data have been received, but also on how many data can be well exploited timely, with resource and rapidity concerns, in addition to the ability of learning algorithm and complexity of the problem. For this purpose, in this article we introduce the notion of machine learning throughput, define Stream Efficient Learning and present a preliminary theoretical framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/17/2019

Rebalancing Learning on Evolving Data Streams

Nowadays, every device connected to the Internet generates an ever-growi...
research
08/29/2023

OEBench: Investigating Open Environment Challenges in Real-World Relational Data Streams

How to get insights from relational data streams in a timely manner is a...
research
06/30/2023

Distance Functions and Normalization Under Stream Scenarios

Data normalization is an essential task when modeling a classification s...
research
06/01/2022

Open Environment Machine Learning

Conventional machine learning studies generally assume close world scena...
research
08/15/2019

Double-Coupling Learning for Multi-Task Data Stream Classification

Data stream classification methods demonstrate promising performance on ...
research
08/20/2015

The ABACOC Algorithm: a Novel Approach for Nonparametric Classification of Data Streams

Stream mining poses unique challenges to machine learning: predictive mo...
research
07/16/2018

Time Series Deinterleaving of DNS Traffic

Stream deinterleaving is an important problem with various applications ...

Please sign up or login with your details

Forgot password? Click here to reset