GithubHelp home page GithubHelp logo

aleximb / automl-streams-research-paper Goto Github PK

View Code? Open in Web Editor NEW
1.0 3.0 0.0 9.46 MB

AutoML Techniques for Data Streams - Research Paper

Home Page: https://arxiv.org/abs/2106.07317

TeX 100.00%
automl data-streaming

automl-streams-research-paper's Introduction

AutoML Techniques for Data Streams

Abstract

Automated Machine Learning (AutoML) techniques benefitted from tremendous research progress recently. These developments and the continuous-growing demand for machine learning experts led to the development of numerous AutoML tools. Industry applications of machine learning on streaming data become more popular due to the increasing adoption of real-time streaming in IoT, microservices architectures, web analytics, and other fields. However, the AutoML tools assume that the entire training dataset is available upfront and that the underlying data distribution does not change over time. These assumptions do not hold in a data-stream-mining setting where an unbounded stream of data cannot be stored and is likely to manifest concept drift. This research surveys the state- of-the-art open-source AutoML tools, applies them to real and synthetic streamed data, and measures how their per- formance changes over time. For comparative purposes, batch, batch incremental and instance incremental estimators are applied and compared. Moreover, a meta-learning technique for online algorithm selection based on meta-feature extraction is proposed and compared, while model replacement and continual AutoML techniques are discussed. The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.

Paper

https://arxiv.org/abs/2106.07317

@article{DBLP:journals/corr/abs-2106-07317,
  author       = {Alexandru{-}Ionut Imbrea},
  title        = {Automated Machine Learning Techniques for Data Streams},
  journal      = {CoRR},
  volume       = {abs/2106.07317},
  year         = {2021},
  url          = {https://arxiv.org/abs/2106.07317},
  eprinttype    = {arXiv},
  eprint       = {2106.07317},
  timestamp    = {Wed, 16 Jun 2021 10:42:19 +0200},
  biburl       = {https://dblp.org/rec/journals/corr/abs-2106-07317.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

automl-streams-research-paper's People

Contributors

aleximb avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.