GithubHelp home page GithubHelp logo

murphy0409 / automl_alex Goto Github PK

View Code? Open in Web Editor NEW

This project forked from alex-lekov/automl_alex

0.0 0.0 0.0 21.94 MB

State-of-the art Automated Machine Learning python library for Tabular Data

License: MIT License

Python 99.71% Dockerfile 0.29%

automl_alex's Introduction

AutoML Alex

Downloads PyPI - Python Version PyPI CodeFactor Telegram License


State-of-the art Automated Machine Learning python library for Tabular Data

Works with Tasks:

  • Binary Classification

  • Regression

  • Multiclass Classification (in progress...)

Benchmark Results

bench

The bigger, the better
From AutoML-Benchmark

Scheme

scheme

Features

  • Automated Data Clean (Auto Clean)
  • Automated Feature Engineering (Auto FE)
  • Smart Hyperparameter Optimization (HPO)
  • Feature Generation
  • Feature Selection
  • Models Selection
  • Cross Validation
  • Optimization Timelimit and EarlyStoping
  • Save and Load (Predict new data)

Installation

pip install automl-alex

Docs

DocPage

๐Ÿš€ Examples

Classifier:

from automl_alex import AutoMLClassifier

model = AutoMLClassifier()
model.fit(X_train, y_train, timeout=600)
predicts = model.predict(X_test)

Regression:

from automl_alex import AutoMLRegressor

model = AutoMLRegressor()
model.fit(X_train, y_train, timeout=600)
predicts = model.predict(X_test)

DataPrepare:

from automl_alex import DataPrepare

de = DataPrepare()
X_train = de.fit_transform(X_train)
X_test = de.transform(X_test)

Simple Models Wrapper:

from automl_alex import LightGBMClassifier

model = LightGBMClassifier()
model.fit(X_train, y_train)
predicts = model.predict_proba(X_test)

model.opt(X_train, y_train,
    timeout=600, # optimization time in seconds,
    )
predicts = model.predict_proba(X_test)

More examples in the folder ./examples:

What's inside

It integrates many popular frameworks:

  • scikit-learn
  • XGBoost
  • LightGBM
  • CatBoost
  • Optuna
  • ...

Works with Features

  • Categorical Features

  • Numerical Features

  • Binary Features

  • Text

  • Datetime

  • Timeseries

  • Image

Note

  • With a large dataset, a lot of memory is required! Library creates many new features. If you have a large dataset with a large number of features (more than 100), you may need a lot of memory.

Realtime Dashboard

Works with optuna-dashboard

Dashboard

Run

$ optuna-dashboard sqlite:///db.sqlite3

Road Map

  • Feature Generation

  • Save/Load and Predict on New Samples

  • Advanced Logging

  • Add opt Pruners

  • Docs Site

  • DL Encoders

  • Add More libs (NNs)

  • Multiclass Classification

  • Build pipelines

Contact

Telegram Group

automl_alex's People

Contributors

alex-lekov avatar deepsource-autofix[bot] avatar deepsourcebot avatar itlek avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.