lukasmasuch / best-of-ml-python

🏆 A ranked list of awesome machine learning python libraries. Updated weekly.

License: Creative Commons Attribution Share Alike 4.0 International

best-of awesome python machine-learning data-science awesome-list nlp list data-visualization tensorflow pytorch

best-of-ml-python's Introduction

Best of Machine Learning with Python

This curated list contains 610 awesome open-source projects with a total of 2M stars, grouped into 31 categories. All projects are ranked by a project-quality score, which is calculated from various metrics automatically collected from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
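
As a rough illustration of the ranking described above, the following minimal sketch sorts a few entries by their project-quality score. The `Project` type and the `rank` function are hypothetical names introduced only for this example, and breaking ties by star count is an assumption inferred from the order of entries in this list, not a documented rule. How the score itself is computed is not covered here.

```python
from typing import NamedTuple


class Project(NamedTuple):
    name: str
    score: int  # combined project-quality score
    stars: int  # GitHub star count


def rank(projects: list[Project]) -> list[Project]:
    """Sort by score, highest first; ties are broken by stars (an assumption)."""
    return sorted(projects, key=lambda p: (p.score, p.stars), reverse=True)


# Scores and star counts copied from a few entries in this list.
sample = [
    Project("StatsModels", 36, 4_600),
    Project("Keras", 37, 47_000),
    Project("PySpark", 36, 25_000),
    Project("TensorFlow", 44, 140_000),
]

ranked_names = [p.name for p in rank(sample)]
print(ranked_names)
```

Sorting on a `(score, stars)` tuple keeps the tie-break explicit and easy to change if a different secondary metric is preferred.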

Explanation

  • 🥇🥈🥉 Combined project-quality score
  • ⭐️ Star count from GitHub
  • 🐣 New project (less than 6 months old)
  • 💤 Inactive project (no activity for 6 months)
  • 💀 Dead project (no activity for 12 months)
  • ❗️ Warning (e.g. missing/risky license)
  • 👨‍💻 Contributors count from GitHub
  • 🔀 Fork count from GitHub
  • 📋 Issue count from GitHub
  • ⏱️ Last update timestamp on package manager
  • 📥 Download count from package manager
  • 📦 Number of dependent projects
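
The age-based markers in the legend above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the 6-month and 12-month thresholds come from the legend, but the calendar-month arithmetic, the precedence between markers, and the names `months_between` and `status_marker` are hypothetical. The tool that generates this list may compute these markers differently.

```python
from datetime import date


def months_between(earlier: date, later: date) -> int:
    """Whole calendar months elapsed from `earlier` to `later`."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    if later.day < earlier.day:
        months -= 1  # the final month is not complete yet
    return months


def status_marker(created: date, last_activity: date, today: date) -> str:
    """Pick one legend marker; returns '' when none applies."""
    idle = months_between(last_activity, today)
    if idle >= 12:
        return "💀"  # dead: no activity for 12 months
    if idle >= 6:
        return "💤"  # inactive: no activity for 6 months
    if months_between(created, today) < 6:
        return "🐣"  # new: less than 6 months old
    return ""


today = date(2020, 1, 23)  # hypothetical snapshot date
print(status_marker(date(2019, 11, 1), date(2020, 1, 20), today))  # new project
print(status_marker(date(2017, 3, 1), date(2019, 5, 2), today))    # inactive project
print(status_marker(date(2016, 6, 1), date(2018, 12, 4), today))   # dead project
```

The dates in the usage lines are made up for illustration and do not correspond to any project in this list.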

Machine Learning Frameworks

General-purpose machine learning and deep learning frameworks.

TensorFlow (🥇44 · ⭐ 140K) - An Open Source Machine Learning Framework for Everyone. Apache-2
  • GitHub (👨‍💻 2.9K · 🔀 80K · 📦 63K · 📋 23K - 14% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/tensorflow
    
  • PyPi (📥 5.5M / month · 📦 23K · ⏱️ 23.12.2019):

     pip install tensorflow
    
  • Conda (⏱️ 22.10.2019):

     conda install -c anaconda tensorflow
    
scikit-learn (🥇41 · ⭐ 39K) - scikit-learn: machine learning in Python. BSD-3
  • GitHub (👨‍💻 1.8K · 🔀 19K · 📥 150 · 📦 89K · 📋 7.7K - 26% open · ⏱️ 22.01.2020):

     git clone https://github.com/scikit-learn/scikit-learn
    
  • PyPi (📥 13M / month · 📦 38K · ⏱️ 02.01.2020):

     pip install scikit-learn
    
  • Conda (⏱️ 09.01.2020):

     conda install -c anaconda scikit-learn
    
PyTorch (🥇39 · ⭐ 36K) - Tensors and Dynamic neural networks in Python with strong GPU acceleration. BSD-3
  • GitHub (👨‍💻 1.8K · 🔀 8.8K · 📦 22K · 📋 13K - 35% open · ⏱️ 23.01.2020):

     git clone https://github.com/pytorch/pytorch
    
  • PyPi (📥 920K / month · 📦 6.7K · ⏱️ 15.01.2020):

     pip install torch
    
Keras (🥇37 · ⭐ 47K) - Deep Learning for humans. MIT
  • GitHub (👨‍💻 890 · 🔀 17K · 📦 41K · 📋 9.8K - 28% open · ⏱️ 06.11.2019):

     git clone https://github.com/keras-team/keras
    
  • PyPi (📥 2M / month · 📦 15K · ⏱️ 17.09.2019):

     pip install keras
    
  • Conda (⏱️ 08.10.2018):

     conda install -c anaconda keras
    
PySpark (🥈36 · ⭐ 25K) - Apache Spark Python API. Apache-2
  • GitHub (👨‍💻 2.2K · 🔀 21K · 📦 280 · ⏱️ 23.01.2020):

     git clone https://github.com/apache/spark
    
  • PyPi (📥 2.6M / month · 📦 760 · ⏱️ 07.05.2019):

     pip install pyspark
    
StatsModels (🥈36 · ⭐ 4.6K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
  • GitHub (👨‍💻 250 · 🔀 1.8K · 📥 16 · 📦 18K · 📋 3.8K - 48% open · ⏱️ 23.01.2020):

     git clone https://github.com/statsmodels/statsmodels
    
  • PyPi (📥 1.7M / month · 📦 6.7K · ⏱️ 15.01.2020):

     pip install statsmodels
    
  • Conda (⏱️ 30.07.2019):

     conda install -c anaconda statsmodels
    
Theano (🥈35 · ⭐ 9.1K) - Theano is a Python library that allows you to define, optimize, and evaluate.. BSD-3
  • GitHub (👨‍💻 380 · 🔀 2.5K · 📦 7K · 📋 2.7K - 24% open · ⏱️ 15.01.2020):

     git clone https://github.com/Theano/Theano
    
  • PyPi (📥 320K / month · 📦 4.8K · ⏱️ 15.01.2019):

     pip install theano
    
  • Conda (⏱️ 12.09.2019):

     conda install -c anaconda theano
    
MXNet (🥈34 · ⭐ 18K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic,.. Apache-2
  • GitHub (👨‍💻 870 · 🔀 6.5K · 📥 62K · 📦 770 · 📋 8.7K - 18% open · ⏱️ 23.01.2020):

     git clone https://github.com/apache/incubator-mxnet
    
  • PyPi (📥 160K / month · 📦 440 · ⏱️ 22.01.2020):

     pip install mxnet
    
  • Conda (⏱️ 02.08.2019):

     conda install -c anaconda mxnet
    
XGBoost (🥈33 · ⭐ 18K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM).. Apache-2
  • GitHub (👨‍💻 420 · 🔀 7.2K · 📥 1.3K · 📦 5.2K · 📋 3.2K - 6% open · ⏱️ 23.01.2020):

     git clone https://github.com/dmlc/xgboost
    
  • PyPi (📥 2.4M / month · 📦 800 · ⏱️ 20.05.2019):

     pip install xgboost
    
LightGBM (🥈33 · ⭐ 10K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or.. MIT
  • GitHub (👨‍💻 150 · 🔀 2.7K · 📥 43K · 📦 2.1K · 📋 1.5K - 2% open · ⏱️ 20.01.2020):

     git clone https://github.com/microsoft/LightGBM
    
  • PyPi (📥 1.4M / month · 📦 560 · ⏱️ 29.11.2019):

     pip install lightgbm
    
  • Conda (⏱️ 17.11.2019):

     conda install -c anaconda lightgbm
    
Fastai (🥈32 · ⭐ 17K) - The fastai deep learning library, plus lessons and tutorials. Apache-2
  • GitHub (👨‍💻 540 · 🔀 5.9K · 📦 2.2K · 📋 970 - 0% open · ⏱️ 23.01.2020):

     git clone https://github.com/fastai/fastai
    
  • PyPi (📥 39K / month · 📦 180 · ⏱️ 28.12.2019):

     pip install fastai
    
Chainer (🥈32 · ⭐ 5.3K) - A flexible framework of neural networks for deep learning. MIT
  • GitHub (👨‍💻 320 · 🔀 1.3K · 📦 1.1K · 📋 2.2K - 12% open · ⏱️ 16.01.2020):

     git clone https://github.com/chainer/chainer
    
  • PyPi (📥 28K / month · 📦 740 · ⏱️ 16.01.2020):

     pip install chainer
    
Vowpal Wabbit (🥈31 · ⭐ 6.8K) - Vowpal Wabbit is a machine learning system which pushes the frontier of.. BSD-3
  • GitHub (👨‍💻 280 · 🔀 1.6K · 📦 27 · 📋 840 - 14% open · ⏱️ 23.01.2020):

     git clone https://github.com/VowpalWabbit/vowpal_wabbit
    
  • PyPi (📥 480K / month · 📦 7 · ⏱️ 07.12.2019):

     pip install vowpalwabbit
    
Catboost (🥈30 · ⭐ 4.8K) - A fast, scalable, high performance Gradient Boosting on Decision Trees.. Apache-2
  • GitHub (👨‍💻 510 · 🔀 690 · 📥 27K · 📋 1K - 18% open · ⏱️ 23.01.2020):

     git clone https://github.com/catboost/catboost
    
  • PyPi (📥 200K / month · 📦 81 · ⏱️ 25.12.2019):

     pip install catboost
    
TFlearn (🥈29 · ⭐ 9.4K · 💤) - Deep learning library featuring a higher-level API for TensorFlow. MIT
  • GitHub (👨‍💻 130 · 🔀 2.3K · 📦 1.9K · 📋 900 - 61% open · ⏱️ 10.01.2019):

     git clone https://github.com/tflearn/tflearn
    
  • PyPi (📥 46K / month · 📦 330 · ⏱️ 18.06.2017):

     pip install tflearn
    
PaddlePaddle (🥉28 · ⭐ 11K) - PArallel Distributed Deep LEarning: Machine Learning Framework from.. Apache-2
  • GitHub (👨‍💻 370 · 🔀 2.8K · 📥 15K · 📋 10K - 17% open · ⏱️ 23.01.2020):

     git clone https://github.com/PaddlePaddle/Paddle
    
  • PyPi (📥 3.4K / month · 📦 8 · ⏱️ 15.01.2020):

     pip install paddlepaddle
    
tensorpack (🥉28 · ⭐ 5.2K) - A Neural Net Training Interface on TensorFlow, with focus on speed +.. Apache-2
  • GitHub (👨‍💻 53 · 🔀 1.6K · 📥 67 · 📦 310 · 📋 1.2K - 0% open · ⏱️ 22.01.2020):

     git clone https://github.com/tensorpack/tensorpack
    
  • PyPi (📥 6.6K / month · 📦 48 · ⏱️ 01.09.2019):

     pip install tensorpack
    
CNTK (🥉27 · ⭐ 17K) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
  • GitHub (👨‍💻 260 · 🔀 4.4K · 📥 13K · 📋 3.3K - 24% open · ⏱️ 06.08.2019):

     git clone https://github.com/microsoft/CNTK
    
  • PyPi (📥 3.1K / month · 📦 14 · ⏱️ 19.04.2019):

     pip install cntk
    
Turi Create (🥉27 · ⭐ 9.3K) - Turi Create simplifies the development of custom machine learning models. BSD-3
  • GitHub (👨‍💻 74 · 🔀 920 · 📥 1.4K · 📦 130 · 📋 1.5K - 33% open · ⏱️ 23.01.2020):

     git clone https://github.com/apple/turicreate
    
  • PyPi (📥 44K / month · 📦 34 · ⏱️ 18.12.2019):

     pip install turicreate
    
Sonnet (🥉27 · ⭐ 8.1K) - TensorFlow-based neural network library. Apache-2
  • GitHub (👨‍💻 37 · 🔀 1.2K · 📦 200 · 📋 130 - 16% open · ⏱️ 22.01.2020):

     git clone https://github.com/deepmind/sonnet
    
  • PyPi (📥 14K / month · 📦 80 · ⏱️ 06.09.2019):

     pip install dm-sonnet
    
Ignite (🥉27 · ⭐ 2.5K) - High-level library to help with training neural networks in PyTorch. BSD-3
  • GitHub (👨‍💻 48 · 🔀 250 · 📦 230 · 📋 320 - 10% open · ⏱️ 22.01.2020):

     git clone https://github.com/pytorch/ignite
    
  • PyPi (📥 14K / month · 📦 62 · ⏱️ 22.01.2020):

     pip install pytorch-ignite
    
pytorch-lightning (🥉26 · ⭐ 3.4K) - The lightweight PyTorch wrapper for ML researchers. Scale your.. Apache-2
  • GitHub (👨‍💻 77 · 🔀 340 · 📦 76 · 📋 380 - 20% open · ⏱️ 23.01.2020):

     git clone https://github.com/PyTorchLightning/pytorch-lightning
    
  • PyPi (📥 7.7K / month · 📦 7 · ⏱️ 21.01.2020):

     pip install pytorch-lightning
    
mlpack (🥉25 · ⭐ 3.1K) - mlpack: a scalable C++ machine learning library --. BSD-3
  • GitHub (👨‍💻 220 · 🔀 1.1K · 📋 1.1K - 10% open · ⏱️ 18.01.2020):

     git clone https://github.com/mlpack/mlpack
    
  • PyPi (📥 160 / month · ⏱️ 27.11.2019):

     pip install mlpack
    
dyNET (🥉25 · ⭐ 3K) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
  • GitHub (👨‍💻 150 · 🔀 660 · 📦 110 · 📋 890 - 26% open · ⏱️ 10.01.2020):

     git clone https://github.com/clab/dynet
    
  • PyPi (📥 6K / month · 📦 28 · ⏱️ 18.09.2018):

     pip install dyNET
    
skorch (🥉25 · ⭐ 2.8K) - A scikit-learn compatible neural network library that wraps pytorch. BSD-3
  • GitHub (👨‍💻 26 · 🔀 210 · 📦 86 · 📋 270 - 17% open · ⏱️ 16.12.2019):

     git clone https://github.com/skorch-dev/skorch
    
  • PyPi (📥 4.9K / month · 📦 13 · ⏱️ 29.11.2019):

     pip install skorch
    
Neural Network Libraries (🥉25 · ⭐ 2.3K) - Neural Network Libraries. Apache-2
  • GitHub (👨‍💻 44 · 🔀 280 · 📥 270 · 📋 43 - 62% open · ⏱️ 21.01.2020):

     git clone https://github.com/sony/nnabla
    
  • PyPi (📥 1.7K / month · 📦 34 · ⏱️ 23.01.2020):

     pip install nnabla
    
Ludwig (🥉24 · ⭐ 6.3K) - Ludwig is a toolbox built on top of TensorFlow that allows to train and test.. Apache-2
  • GitHub (👨‍💻 59 · 🔀 690 · 📦 37 · 📋 370 - 22% open · ⏱️ 16.01.2020):

     git clone https://github.com/uber/ludwig
    
  • PyPi (📥 1.8K / month · 📦 2 · ⏱️ 24.07.2019):

     pip install ludwig
    
xLearn (🥉24 · ⭐ 2.6K) - High performance, easy-to-use, and scalable machine learning (ML) package,.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 450 · 📥 850 · 📦 18 · 📋 260 - 59% open · ⏱️ 12.11.2019):

     git clone https://github.com/aksnzhy/xlearn
    
  • PyPi (📥 2.8K / month · 📦 1 · ⏱️ 04.12.2018):

     pip install xlearn
    
NeuPy (🥉22 · ⭐ 600) - NeuPy is a Tensorflow based python library for prototyping and building neural networks. MIT
  • GitHub (👨‍💻 6 · 🔀 120 · 📦 74 · 📋 250 - 8% open · ⏱️ 02.09.2019):

     git clone https://github.com/itdxer/neupy
    
  • PyPi (📥 1.1K / month · 📦 10 · ⏱️ 04.04.2019):

     pip install neupy
    
SHOGUN (🥉20 · ⭐ 2.6K) - The Shogun Machine Learning Toolbox. BSD-3
  • GitHub (👨‍💻 220 · 🔀 940 · 📋 1.4K - 38% open · ⏱️ 05.01.2020):

     git clone https://github.com/shogun-toolbox/shogun
    
  • PyPi (⏱️ 23.10.2018):

     pip install shogun-ml
    
Torchbearer (🥉20 · ⭐ 520) - torchbearer: A model fitting library for PyTorch. MIT
  • GitHub (👨‍💻 13 · 🔀 53 · 📦 18 · 📋 240 - 2% open · ⏱️ 16.01.2020):

     git clone https://github.com/pytorchbearer/torchbearer
    
  • PyPi (📥 300 / month · 📦 3 · ⏱️ 17.09.2019):

     pip install torchbearer
    
ThunderSVM (🥉19 · ⭐ 1.1K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
  • GitHub (👨‍💻 28 · 🔀 160 · 📥 100 · 📋 160 - 24% open · ⏱️ 01.01.2020):

     git clone https://github.com/Xtra-Computing/thundersvm
    
  • PyPi (📥 140 / month · ⏱️ 06.05.2019):

     pip install thundersvm
    
ThunderGBM (🥉17 · ⭐ 480) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
  • GitHub (👨‍💻 10 · 🔀 62 · 📋 32 - 31% open · ⏱️ 24.12.2019):

     git clone https://github.com/Xtra-Computing/thundergbm
    
  • PyPi (📥 60 / month · ⏱️ 25.10.2019):

     pip install thundergbm
    
Neural Tangents (🥉17 · ⭐ 390) - Fast and Easy Infinite Neural Networks in Python. Apache-2
  • GitHub (👨‍💻 8 · 🔀 44 · 📦 2 · 📋 15 - 40% open · ⏱️ 20.01.2020):

     git clone https://github.com/google/neural-tangents
    
  • PyPi (📥 200 / month · ⏱️ 16.01.2020):

     pip install neural-tangents
    
StarSpace (🥉14 · ⭐ 3.2K) - Learning embeddings for classification, retrieval and ranking. MIT
  • GitHub (👨‍💻 17 · 🔀 420 · 📋 180 - 18% open · ⏱️ 13.12.2019):

     git clone https://github.com/facebookresearch/StarSpace
    
Hidden projects (3):
NuPIC (🥉24 · ⭐ 6.1K) - Numenta Platform for Intelligent Computing is an implementation of Hierarchical.. ❗️AGPL-3.0
  • GitHub (👨‍💻 120 · 🔀 1.6K · 📦 64 · 📋 1.8K - 24% open · ⏱️ 23.10.2019):

     git clone https://github.com/numenta/nupic
    
  • PyPi (📥 850 / month · 📦 37 · ⏱️ 01.06.2018):

     pip install nupic
    
MindsDB (🥉21 · ⭐ 2.1K) - Machine Learning in one line of code. ❗️GPL-3.0
  • GitHub (👨‍💻 30 · 🔀 270 · 📋 160 - 11% open · ⏱️ 22.01.2020):

     git clone https://github.com/mindsdb/mindsdb
    
  • PyPi (📥 1.5K / month · 📦 3 · ⏱️ 19.03.2019):

     pip install mindsdb
    
ktrain (🥉20 · ⭐ 120) - ktrain is a Python library that makes deep learning and AI more accessible and.. MIT
  • GitHub (👨‍💻 4 · 🔀 33 · 📦 13 · 📋 45 - 11% open · ⏱️ 22.01.2020):

     git clone https://github.com/amaiya/ktrain
    
  • PyPi (📥 3K / month · 📦 1 · ⏱️ 22.01.2020):

     pip install ktrain
    

Data Visualization

General-purpose and task-specific data visualization libraries.

Matplotlib (🥇38 · ⭐ 11K) - matplotlib: plotting with Python. ❗️Python-2.0
  • GitHub (👨‍💻 1K · 🔀 4.6K · 📦 170K · 📋 6.5K - 23% open · ⏱️ 23.01.2020):

     git clone https://github.com/matplotlib/matplotlib
    
  • PyPi (📥 7M / month · 📦 76K · ⏱️ 21.11.2019):

     pip install matplotlib
    
Bokeh (🥇37 · ⭐ 13K) - Interactive Data Visualization in the browser, from Python. BSD-3
  • GitHub (👨‍💻 460 · 🔀 3.2K · 📦 17K · 📋 5.7K - 8% open · ⏱️ 23.01.2020):

     git clone https://github.com/bokeh/bokeh
    
  • PyPi (📥 770K / month · 📦 5.9K · ⏱️ 07.08.2019):

     pip install bokeh
    
Seaborn (🥇33 · ⭐ 6.8K) - Statistical data visualization using matplotlib. BSD-3
  • GitHub (👨‍💻 120 · 🔀 1.1K · 📦 36K · 📋 1.5K - 6% open · ⏱️ 22.01.2020):

     git clone https://github.com/mwaskom/seaborn
    
  • PyPi (📥 1.2M / month · 📦 13K · ⏱️ 22.01.2020):

     pip install seaborn
    
Plotly (🥇33 · ⭐ 6K) - An open-source, interactive graphing library for Python (includes Plotly Express). MIT
  • GitHub (👨‍💻 110 · 🔀 1.3K · 📦 2 · 📋 1.2K - 32% open · ⏱️ 22.01.2020):

     git clone https://github.com/plotly/plotly.py
    
  • PyPi (📥 1.8M / month · 📦 5K · ⏱️ 22.01.2020):

     pip install plotly
    
  • NPM (📥 5.8K / month · 📦 4 · ⏱️ 22.01.2020):

     npm install plotlywidget
    
wordcloud (🥈31 · ⭐ 6.5K) - A little word cloud generator in Python. MIT
  • GitHub (👨‍💻 44 · 🔀 1.7K · 📦 3.6K · 📋 390 - 17% open · ⏱️ 08.01.2020):

     git clone https://github.com/amueller/word_cloud
    
  • PyPi (📥 200K / month · 📦 1.1K · ⏱️ 22.11.2019):

     pip install wordcloud
    
pyecharts (🥈30 · ⭐ 7.9K) - Python Echarts Plotting Library. MIT
  • GitHub (👨‍💻 25 · 🔀 1.8K · 📦 800 · 📋 1.1K - 3% open · ⏱️ 10.01.2020):

     git clone https://github.com/pyecharts/pyecharts
    
  • PyPi (📥 19K / month · 📦 280 · ⏱️ 03.01.2020):

     pip install pyecharts
    
pandas-profiling (🥈29 · ⭐ 4.2K) - Create HTML profiling reports from pandas DataFrame objects. MIT
  • GitHub (👨‍💻 42 · 🔀 570 · 📦 820 · 📋 230 - 26% open · ⏱️ 18.01.2020):

     git clone https://github.com/pandas-profiling/pandas-profiling
    
  • PyPi (📥 64K / month · 📦 160 · ⏱️ 08.01.2020):

     pip install pandas-profiling
    
  • Conda:

     conda install -c conda-forge pandas-profiling
    
Cufflinks (🥈29 · ⭐ 1.5K) - Productivity Tools for Plotly + Pandas. MIT
  • GitHub (👨‍💻 34 · 🔀 370 · 📦 870 · 📋 170 - 35% open · ⏱️ 07.01.2020):

     git clone https://github.com/santosjorge/cufflinks
    
  • PyPi (📥 120K / month · 📦 120 · ⏱️ 23.10.2019):

     pip install cufflinks
    
VisPy (🥈28 · ⭐ 2.2K) - High-performance interactive 2D/3D data visualization library. BSD-3
  • GitHub (👨‍💻 130 · 🔀 480 · 📦 230 · 📋 1K - 31% open · ⏱️ 20.01.2020):

     git clone https://github.com/vispy/vispy
    
  • PyPi (📥 19K / month · 📦 120 · ⏱️ 27.11.2019):

     pip install vispy
    
  • NPM (📥 61 / month · ⏱️ 27.07.2019):

     npm install vispy
    
datashader (🥈28 · ⭐ 2.1K) - Reveal everything even in your largest datasets, by turning them into images. BSD-3
  • GitHub (👨‍💻 30 · 🔀 280 · 📦 280 · 📋 420 - 34% open · ⏱️ 21.01.2020):

     git clone https://github.com/holoviz/datashader
    
  • PyPi (📥 9K / month · 📦 35 · ⏱️ 21.01.2020):

     pip install datashader
    
  • Conda:

     conda install -c conda-forge datashader
    
HoloViews (🥈28 · ⭐ 1.5K) - With Holoviews, your data visualizes itself. BSD-3
  • GitHub (👨‍💻 81 · 🔀 230 · 📋 2.2K - 28% open · ⏱️ 17.01.2020):

     git clone https://github.com/holoviz/holoviews
    
  • PyPi (📥 34K / month · 📦 170 · ⏱️ 22.11.2019):

     pip install holoviews
    
  • NPM (📥 2K / month · ⏱️ 01.07.2019):

     npm install @pyviz/jupyterlab_pyviz
    
UMAP (🥈27 · ⭐ 3.6K) - Uniform Manifold Approximation and Projection. BSD-3
  • GitHub (👨‍💻 50 · 🔀 350 · 📋 250 - 40% open · ⏱️ 13.01.2020):

     git clone https://github.com/lmcinnes/umap
    
  • PyPi (📥 32K / month · 📦 230 · ⏱️ 14.08.2019):

     pip install umap-learn
    
bqplot (🥈27 · ⭐ 2.6K) - Plotting library for IPython/Jupyter notebooks. Apache-2
  • GitHub (👨‍💻 45 · 🔀 380 · 📦 17 · 📋 430 - 43% open · ⏱️ 20.01.2020):

     git clone https://github.com/bloomberg/bqplot
    
  • PyPi (📥 14K / month · 📦 110 · ⏱️ 25.12.2019):

     pip install bqplot
    
  • NPM (📥 4.2K / month · 📦 10 · ⏱️ 25.12.2019):

     npm install bqplot
    
missingno (🥉26 · ⭐ 2K) - Missing data visualization module for Python. MIT
  • GitHub (👨‍💻 9 · 🔀 240 · 📦 710 · 📋 82 - 9% open · ⏱️ 19.11.2019):

     git clone https://github.com/ResidentMario/missingno
    
  • PyPi (📥 66K / month · 📦 29 · ⏱️ 29.06.2018):

     pip install missingno
    
Perspective (🥉24 · ⭐ 2.2K) - Streaming pivot visualization via WebAssembly. Apache-2
  • GitHub (👨‍💻 42 · 🔀 180 · 📦 29 · 📋 180 - 11% open · ⏱️ 16.01.2020):

     git clone https://github.com/finos/perspective
    
  • PyPi (📥 470 / month · 📦 4 · ⏱️ 19.12.2019):

     pip install perspective-python
    
  • NPM (📥 340 / month · ⏱️ 08.01.2020):

     npm install @finos/perspective-jupyterlab
    
PyVista (🥉24 · ⭐ 310) - 3D plotting and mesh analysis through a streamlined interface for the.. MIT
  • GitHub (👨‍💻 22 · 🔀 62 · 📦 49 · 📋 220 - 30% open · ⏱️ 23.01.2020):

     git clone https://github.com/pyvista/pyvista
    
  • PyPi (📥 2.5K / month · 📦 13 · ⏱️ 17.12.2019):

     pip install pyvista
    
facets-overview (🥉23 · ⭐ 6K) - Visualizations for machine learning datasets. Apache-2
  • GitHub (👨‍💻 24 · 🔀 740 · 📦 12 · 📋 120 - 45% open · ⏱️ 09.01.2020):

     git clone https://github.com/pair-code/facets
    
  • PyPi (📥 2.7K / month · ⏱️ 24.07.2019):

     pip install facets-overview
    
HyperTools (🥉23 · ⭐ 1.5K) - A Python toolbox for gaining geometric insights into high-dimensional data. MIT
  • GitHub (👨‍💻 19 · 🔀 140 · 📦 45 · 📋 180 - 35% open · ⏱️ 20.12.2019):

     git clone https://github.com/ContextLab/hypertools
    
  • PyPi (📥 380 / month · 📦 9 · ⏱️ 18.12.2019):

     pip install hypertools
    
pythreejs (🥉23 · ⭐ 580) - A Jupyter - Three.js bridge. BSD-3
  • GitHub (👨‍💻 23 · 🔀 120 · 📦 10 · 📋 170 - 25% open · ⏱️ 17.01.2020):

     git clone https://github.com/jupyter-widgets/pythreejs
    
  • PyPi (📥 9.7K / month · 📦 13 · ⏱️ 04.07.2019):

     pip install pythreejs
    
  • NPM (📥 2.2K / month · 📦 8 · ⏱️ 04.07.2019):

     npm install jupyter-threejs
    
vega (🥉23 · ⭐ 250) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
  • GitHub (👨‍💻 9 · 🔀 35 · 📦 460 · 📋 81 - 12% open · ⏱️ 20.01.2020):

     git clone https://github.com/vega/ipyvega
    
  • PyPi (📥 12K / month · 📦 100 · ⏱️ 29.07.2019):

     pip install vega
    
  • Conda:

     conda install -c conda-forge vega
    
hvPlot (🥉23 · ⭐ 240) - A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews. BSD-3
  • GitHub (👨‍💻 16 · 🔀 37 · 📦 140 · 📋 240 - 35% open · ⏱️ 23.12.2019):

     git clone https://github.com/holoviz/hvplot
    
  • PyPi (📥 8.3K / month · 📦 13 · ⏱️ 10.10.2019):

     pip install hvplot
    
Chartify (🥉22 · ⭐ 2.4K) - Python library that makes it easy for data scientists to create charts. Apache-2
  • GitHub (👨‍💻 12 · 🔀 190 · 📦 38 · 📋 67 - 59% open · ⏱️ 27.11.2019):

     git clone https://github.com/spotify/chartify
    
  • PyPi (📥 7.7K / month · 📦 4 · ⏱️ 27.11.2019):

     pip install chartify
    
openTSNE (🥉21 · ⭐ 570) - Extensible, parallel implementations of t-SNE. BSD-3
  • GitHub (👨‍💻 7 · 🔀 59 · 📦 31 · 📋 44 - 13% open · ⏱️ 06.01.2020):

     git clone https://github.com/pavlin-policar/openTSNE
    
  • PyPi (📥 2.9K / month · 📦 2 · ⏱️ 04.09.2019):

     pip install opentsne
    
  • Conda:

     conda install -c conda-forge opentsne
    
Pandas-Bokeh (🥉21 · ⭐ 320) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
  • GitHub (👨‍💻 11 · 🔀 32 · 📦 44 · 📋 37 - 29% open · ⏱️ 19.11.2019):

     git clone https://github.com/PatrikHlobil/Pandas-Bokeh
    
  • PyPi (📥 7.7K / month · 📦 4 · ⏱️ 19.11.2019):

     pip install pandas-bokeh
    
PyWaffle (🥉20 · ⭐ 240) - Make Waffle Charts in Python. MIT
  • GitHub (👨‍💻 6 · 🔀 35 · 📦 10 · 📋 11 - 27% open · ⏱️ 02.01.2020):

     git clone https://github.com/gyli/PyWaffle
    
  • PyPi (📥 560 / month · ⏱️ 06.10.2019):

     pip install pywaffle
    
pdvega (🥉16 · ⭐ 330 · 💤) - Interactive plotting for Pandas using Vega-Lite. MIT
  • GitHub (👨‍💻 9 · 🔀 30 · 📦 29 · 📋 25 - 64% open · ⏱️ 29.03.2019):

     git clone https://github.com/altair-viz/pdvega
    
  • PyPi (📥 240 / month · 📦 2 · ⏱️ 01.02.2018):

     pip install pdvega
    
Hidden projects (4):
Altair (🥉26 · ⭐ 240 · 💀) - Declarative statistical visualization library for Python. BSD-3
  • GitHub (👨‍💻 50 · 🔀 19 · ⏱️ 18.05.2018):

     git clone https://github.com/ellisonbg/altair
    
  • PyPi (📥 120K / month · 📦 370 · ⏱️ 14.01.2020):

     pip install altair
    
pivottablejs (🥉19 · ⭐ 350 · 💀) - Dragndrop Pivot Tables and Charts for Jupyter/IPython Notebook, care of.. MIT
  • GitHub (👨‍💻 3 · 🔀 39 · 📦 70 · 📋 48 - 25% open · ⏱️ 04.12.2018):

     git clone https://github.com/nicolaskruchten/jupyter_pivottablejs
    
  • PyPi (📥 1.7K / month · 📦 14 · ⏱️ 15.01.2018):

     pip install pivottablejs
    
nx-altair (🥉13 · ⭐ 110) - Draw NetworkX graphs with Altair. MIT
  • GitHub (👨‍💻 3 · 🔀 13 · 📋 6 - 66% open · ⏱️ 16.09.2019):

     git clone https://github.com/Zsailer/nx_altair
    
  • PyPi (📥 230 / month · ⏱️ 29.11.2018):

     pip install nx-altair
    
nptsne (🥉12 · ⭐ 19) - nptsne is a numpy compatible python binary package that offers a number of APIs.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 1 · ⏱️ 07.11.2019):

     git clone https://github.com/biovault/nptsne
    
  • PyPi (📥 43 / month · ⏱️ 07.11.2019):

     pip install nptsne
    

Text Data & NLP

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.

spaCy (🥇36 · ⭐ 16K) - Industrial-strength Natural Language Processing (NLP) with Python and Cython. MIT
  • GitHub (👨‍💻 450 · 🔀 2.7K · 📥 2.6K · 📦 9K · 📋 3.6K - 4% open · ⏱️ 20.01.2020):

     git clone https://github.com/explosion/spaCy
    
  • PyPi (📥 790K / month · 📦 2.9K · ⏱️ 21.11.2019):

     pip install spacy
    
nltk (🥇33 · ⭐ 8.6K) - Suite of libraries and programs for symbolic and statistical natural language.. Apache-2
  • GitHub (👨‍💻 350 · 🔀 2.3K · 📦 47K · 📋 1.4K - 18% open · ⏱️ 21.01.2020):

     git clone https://github.com/nltk/nltk
    
  • PyPi (📥 2.5M / month · 📦 18K · ⏱️ 20.08.2019):

     pip install nltk
    
transformers (🥇32 · ⭐ 21K) - Transformers: State-of-the-art Natural Language Processing for.. Apache-2
  • GitHub (👨‍💻 250 · 🔀 4.7K · 📥 920 · 📦 530 · 📋 2K - 22% open · ⏱️ 23.01.2020):

     git clone https://github.com/huggingface/transformers
    
  • PyPi (📥 120K / month · 📦 64 · ⏱️ 13.12.2019):

     pip install transformers
    
Rasa (🥇31 · ⭐ 7.6K) - Open source machine learning framework to automate text- and voice-based.. Apache-2
  • GitHub (👨‍💻 330 · 🔀 2.4K · 📦 280 · 📋 3.7K - 7% open · ⏱️ 23.01.2020):

     git clone https://github.com/RasaHQ/rasa
    
  • PyPi (📥 18K / month · 📦 32 · ⏱️ 18.12.2019):

     pip install rasa
    
AllenNLP (🥇30 · ⭐ 7.9K) - An open-source NLP research library, built on PyTorch. Apache-2
  • GitHub (👨‍💻 190 · 🔀 1.7K · 📦 620 · 📋 1.7K - 9% open · ⏱️ 22.01.2020):

     git clone https://github.com/allenai/allennlp
    
  • PyPi (📥 14K / month · 📦 120 · ⏱️ 25.09.2019):

     pip install allennlp
    
fastText (🥈29 · ⭐ 20K) - Library for fast text representation and classification. MIT
  • GitHub (👨‍💻 52 · 🔀 3.9K · 📦 590 · 📋 860 - 35% open · ⏱️ 03.01.2020):

     git clone https://github.com/facebookresearch/fastText
    
  • PyPi (📥 98K / month · 📦 190 · ⏱️ 27.06.2019):

     pip install fasttext
    
TextBlob (🥈29 · ⭐ 6.8K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun.. MIT
  • GitHub (👨‍💻 31 · 🔀 870 · 📥 72 · 📦 5.2K · 📋 200 - 27% open · ⏱️ 15.01.2020):

     git clone https://github.com/sloria/TextBlob
    
  • PyPi (📥 130K / month · 📦 1.9K · ⏱️ 24.02.2019):

     pip install textblob
    
sentencepiece (🥈29 · ⭐ 3.5K) - Unsupervised text tokenizer for Neural Network-based text generation. Apache-2
  • GitHub (👨‍💻 33 · 🔀 420 · 📥 3.5K · 📦 980 · 📋 260 - 16% open · ⏱️ 15.12.2019):

     git clone https://github.com/google/sentencepiece
    
  • PyPi (📥 260K / month · 📦 120 · ⏱️ 15.12.2019):

     pip install sentencepiece
    
flair (🥈28 · ⭐ 8K) - A very simple framework for state-of-the-art Natural Language Processing (NLP). MIT
  • GitHub (👨‍💻 98 · 🔀 940 · 📦 230 · 📋 940 - 40% open · ⏱️ 23.01.2020):

     git clone https://github.com/flairNLP/flair
    
  • PyPi (📥 10K / month · 📦 24 · ⏱️ 30.05.2019):

     pip install flair
    
fairseq (🥈28 · ⭐ 6.8K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
  • GitHub (👨‍💻 180 · 🔀 1.6K · 📦 49 · 📋 1.1K - 9% open · ⏱️ 23.01.2020):

     git clone https://github.com/pytorch/fairseq
    
  • PyPi (📥 5K / month · 📦 20 · ⏱️ 04.12.2019):

     pip install fairseq
    
torchtext (🥈28 · ⭐ 2.1K) - Data loaders and abstractions for text and NLP. BSD-3
  • GitHub (👨‍💻 81 · 🔀 490 · 📦 1.5K · 📋 400 - 39% open · ⏱️ 21.01.2020):

     git clone https://github.com/pytorch/text
    
  • PyPi (📥 31K / month · 📦 680 · ⏱️ 15.01.2020):

     pip install torchtext
    
textacy (🥈28 · ⭐ 1.4K) - NLP, before and after spaCy. Apache-2
  • GitHub (👨‍💻 26 · 🔀 190 · 📦 330 · 📋 210 - 16% open · ⏱️ 03.09.2019):

     git clone https://github.com/chartbeat-labs/textacy
    
  • PyPi (📥 11K / month · 📦 56 · ⏱️ 03.09.2019):

     pip install textacy
    
inflect (🥈28 · ⭐ 370) - Correctly generate plurals, ordinals, indefinite articles; convert numbers to words. MIT
  • GitHub (👨‍💻 22 · 🔀 54 · 📋 52 - 28% open · ⏱️ 23.01.2020):

     git clone https://github.com/jazzband/inflect
    
  • PyPi (📥 530K / month · 📦 1.4K · ⏱️ 21.12.2019):

     pip install inflect
    
OpenNMT (🥈27 · ⭐ 3.8K) - Open Source Neural Machine Translation in PyTorch. MIT
  • GitHub (👨‍💻 150 · 🔀 1.4K · 📦 15 · 📋 1K - 5% open · ⏱️ 16.01.2020):

     git clone https://github.com/OpenNMT/OpenNMT-py
    
  • PyPi (📥 1.4K / month · 📦 2 · ⏱️ 13.12.2019):

     pip install OpenNMT-py
    
Dedupe (🥈27 · ⭐ 2.5K) - A python library for accurate and scalable fuzzy matching, record deduplication and.. MIT
  • GitHub (👨‍💻 52 · 🔀 360 · 📥 20 · 📦 130 · 📋 560 - 8% open · ⏱️ 12.01.2020):

     git clone https://github.com/dedupeio/dedupe
    
  • PyPi (📥 30K / month · 📦 78 · ⏱️ 27.07.2019):

     pip install dedupe
    
phonenumbers (🥈27 · ⭐ 2.4K) - Python port of Google's libphonenumber. Apache-2
  • GitHub (👨‍💻 20 · 🔀 290 · 📋 100 - 3% open · ⏱️ 16.01.2020):

     git clone https://github.com/daviddrysdale/python-phonenumbers
    
  • PyPi (📥 690K / month · 📦 2.3K · ⏱️ 16.01.2020):

     pip install phonenumbers
    
DeepPavlov (🥈26 · ⭐ 3.9K) - An open source library for deep learning end-to-end dialog systems and.. Apache-2
  • GitHub (👨‍💻 44 · 🔀 700 · 📦 72 · 📋 380 - 23% open · ⏱️ 29.11.2019):

     git clone https://github.com/deepmipt/DeepPavlov
    
  • PyPi (📥 3K / month · 📦 4 · ⏱️ 28.11.2019):

     pip install deeppavlov
    
ftfy (🥈26 · ⭐ 2.6K) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
  • GitHub (👨‍💻 16 · 🔀 83 · 📦 1.5K · 📋 78 - 7% open · ⏱️ 18.12.2019):

     git clone https://github.com/LuminosoInsight/python-ftfy
    
  • PyPi (📥 180K / month · 📦 630 · ⏱️ 07.08.2019):

     pip install ftfy
    
GluonNLP (🥈26 · ⭐ 1.9K) - Toolkit that enables easy text preprocessing, datasets loading and neural.. Apache-2
  • GitHub (👨‍💻 60 · 🔀 440 · 📦 77 · 📋 420 - 48% open · ⏱️ 14.01.2020):

     git clone https://github.com/dmlc/gluon-nlp
    
  • PyPi (📥 20K / month · 📦 11 · ⏱️ 15.01.2020):

     pip install gluonnlp
    
jellyfish (🥈26 · ⭐ 1.2K) - a python library for doing approximate and phonetic matching of strings. BSD-2
  • GitHub (👨‍💻 18 · 🔀 110 · 📦 1.1K · 📋 96 - 22% open · ⏱️ 09.01.2020):

     git clone https://github.com/jamesturk/jellyfish
    
  • PyPi (📥 290K / month · 📦 530 · ⏱️ 10.01.2019):

     pip install jellyfish
    
Sumy (🥈25 · ⭐ 2.2K) - Module for automatic summarization of text documents and HTML pages. Apache-2
  • GitHub (👨‍💻 19 · 🔀 380 · 📦 350 · 📋 81 - 14% open · ⏱️ 28.12.2019):

     git clone https://github.com/miso-belica/sumy
    
  • PyPi (📥 9.4K / month · 📦 180 · ⏱️ 19.05.2019):

     pip install sumy
    
CLTK (🥈25 · ⭐ 570) - The Classical Language Toolkit. MIT
  • GitHub (👨‍💻 98 · 🔀 280 · 📥 5 · 📦 96 · 📋 440 - 23% open · ⏱️ 17.11.2019):

     git clone https://github.com/cltk/cltk
    
  • PyPi (📥 1.8K / month · 📦 68 · ⏱️ 16.11.2019):

     pip install cltk
    
stanfordnlp (🥉24 · ⭐ 2.7K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
  • GitHub (👨‍💻 18 · 🔀 360 · 📦 120 · 📋 160 - 19% open · ⏱️ 23.10.2019):

     git clone https://github.com/stanfordnlp/stanfordnlp
    
  • PyPi (📥 13K / month · 📦 10 · ⏱️ 16.05.2019):

     pip install stanfordnlp
    
pytorch-nlp (🥉24 · ⭐ 1.6K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3
  • GitHub (👨‍💻 14 · 🔀 180 · 📦 45 · 📋 52 - 19% open · ⏱️ 31.12.2019):

     git clone https://github.com/PetrochukM/PyTorch-NLP
    
  • PyPi (📥 3.3K / month · 📦 5 · ⏱️ 04.11.2019):

     pip install pytorch-nlp
    
TextDistance (🥉24 · ⭐ 1.6K) - Compute distance between sequences. 30+ algorithms, pure python implementation,.. MIT
  • GitHub (👨‍💻 6 · 🔀 130 · 📥 33 · 📦 92 · 📋 36 - 33% open · ⏱️ 14.11.2019):

     git clone https://github.com/life4/textdistance
    
  • PyPi (📥 66K / month · 📦 14 · ⏱️ 06.08.2019):

     pip install textdistance
    
pyahocorasick (🥉24 · ⭐ 430) - Python module (C extension and plain python) implementing Aho-Corasick.. BSD-3
  • GitHub (👨‍💻 17 · 🔀 63 · 📦 180 · 📋 84 - 25% open · ⏱️ 13.12.2019):

     git clone https://github.com/WojciechMula/pyahocorasick
    
  • PyPi (📥 84K / month · 📦 64 · ⏱️ 14.01.2019):

     pip install pyahocorasick
    
PyText (🥉23 · ⭐ 5.7K) - A natural language modeling framework based on PyTorch. BSD-3
  • GitHub (👨‍💻 94 · 🔀 670 · 📥 82 · 📦 15 · 📋 170 - 59% open · ⏱️ 21.01.2020):

     git clone https://github.com/facebookresearch/pytext
    
  • PyPi (📥 780 / month · 📦 1 · ⏱️ 05.12.2019):

     pip install pytext-nlp
    
Snips NLU (🥉23 · ⭐ 3.1K) - Snips Python library to extract meaning from text. Apache-2
  • GitHub (👨‍💻 17 · 🔀 360 · 📋 230 - 17% open · ⏱️ 15.01.2020):

     git clone https://github.com/snipsco/snips-nlu
    
  • PyPi (📥 4.5K / month · 📦 8 · ⏱️ 15.01.2020):

     pip install snips-nlu
    
MatchZoo (🥉23 · ⭐ 2.9K) - Facilitating the design, comparison and sharing of deep text matching models. Apache-2
  • GitHub (👨‍💻 35 · 🔀 760 · 📦 4 · 📋 430 - 6% open · ⏱️ 24.10.2019):

     git clone https://github.com/NTMC-Community/MatchZoo
    
  • PyPi (📥 520 / month · ⏱️ 24.10.2019):

     pip install matchzoo
    
Texar (🥉23 · ⭐ 1.8K) - Toolkit for Machine Learning, Natural Language Processing, and Text.. Apache-2
  • GitHub (👨‍💻 38 · 🔀 280 · 📦 4 · 📋 140 - 13% open · ⏱️ 02.01.2020):

     git clone https://github.com/asyml/texar
    
  • PyPi (📥 280 / month · 📦 2 · ⏱️ 19.11.2019):

     pip install texar
    
TensorFlow Text (🥉23 · ⭐ 530) - Making text a first-class citizen in TensorFlow. Apache-2
  • GitHub (👨‍💻 19 · 🔀 51 · 📦 32 · 📋 48 - 37% open · ⏱️ 21.01.2020):

     git clone https://github.com/tensorflow/text
    
  • PyPi (📥 64K / month · 📦 2 · ⏱️ 17.12.2019):

     pip install tensorflow-text
    
NLP Architect (🥉22 · ⭐ 2.3K) - A model library for exploring state-of-the-art deep learning topologies.. Apache-2
  • GitHub (👨‍💻 31 · 🔀 350 · 📦 1 · 📋 79 - 37% open · ⏱️ 22.01.2020):

     git clone https://github.com/NervanaSystems/nlp-architect
    
  • PyPi (📥 1.9K / month · ⏱️ 09.01.2020):

     pip install nlp-architect
    
Tokenizers (🥉22 · ⭐ 1.8K · 🐣) - Fast State-of-the-Art Tokenizers optimized for Research and Production. Apache-2
  • GitHub (👨‍💻 12 · 🔀 87 · 📦 15 · 📋 49 - 53% open · ⏱️ 22.01.2020):

     git clone https://github.com/huggingface/tokenizers
    
  • PyPi (📥 18K / month · ⏱️ 22.01.2020):

     pip install tokenizers
    
anaGo (🥉22 · ⭐ 1.3K) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech.. MIT
  • GitHub (👨‍💻 11 · 🔀 330 · 📦 19 · 📋 100 - 31% open · ⏱️ 23.10.2019):

     git clone https://github.com/Hironsan/anago
    
  • PyPi (📥 3.7K / month · ⏱️ 17.07.2018):

     pip install anago
    
PyTextRank (🥉22 · ⭐ 1.1K) - Python implementation of TextRank for phrase extraction and summarization of text.. MIT
  • GitHub (👨‍💻 7 · 🔀 240 · 📦 29 · 📋 33 - 21% open · ⏱️ 20.12.2019):

     git clone https://github.com/DerwenAI/pytextrank
    
  • PyPi (📥 2.5K / month · 📦 6 · ⏱️ 05.11.2019):

     pip install pytextrank
    
Kashgari (🥉21 · ⭐ 1.4K) - Kashgari is a Production-ready NLP Transfer learning framework for text-.. Apache-2
  • GitHub (👨‍💻 16 · 🔀 280 · 📦 5 · 📋 240 - 8% open · ⏱️ 27.12.2019):

     git clone https://github.com/BrikerMan/Kashgari
    
  • PyPi (📥 440 / month · 📦 1 · ⏱️ 26.06.2019):

     pip install kashgari-tf
    
scattertext (🥉21 · ⭐ 1.2K) - Beautiful visualizations of how language differs among document types. Apache-2
  • GitHub (👨‍💻 7 · 🔀 140 · 📦 58 · 📋 44 - 31% open · ⏱️ 05.01.2020):

     git clone https://github.com/JasonKessler/scattertext
    
  • PyPi (📥 1.4K / month · 📦 8 · ⏱️ 04.12.2019):

     pip install scattertext
    
spacy-transformers (🥉21 · ⭐ 580 · 🐣) - spaCy pipelines for pre-trained BERT, XLNet and GPT-2. MIT
  • GitHub (👨‍💻 13 · 🔀 60 · 📦 10 · 📋 100 - 35% open · ⏱️ 24.12.2019):

     git clone https://github.com/explosion/spacy-transformers
    
  • PyPi (📥 3.7K / month · 📦 1 · ⏱️ 28.10.2019):

     pip install spacy-transformers
    
DeepMatcher (🥉20 · ⭐ 2.7K) - Python package for performing Entity and Text Matching using Deep Learning. BSD-3
  • GitHub (👨‍💻 4 · 🔀 1K · 📦 2 · 📋 46 - 82% open · ⏱️ 30.12.2019):

     git clone https://github.com/anhaidgroup/deepmatcher
    
  • PyPi (📥 230 / month · ⏱️ 30.12.2019):

     pip install deepmatcher
    
DELTA (🥉20 · ⭐ 1.1K) - DELTA is a deep learning based natural language and speech processing platform. Apache-2
  • GitHub (👨‍💻 33 · 🔀 210 · 📋 57 - 12% open · ⏱️ 23.12.2019):

     git clone https://github.com/didi/delta
    
  • Dockerhub (📥 8.7K · ⏱️ 22.01.2020):

     docker pull zh794390558/delta
    
Sockeye (🥉20 · ⭐ 900) - Sequence-to-sequence framework with a focus on Neural Machine Translation.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 250 · 📋 190 - 8% open · ⏱️ 16.01.2020):

     git clone https://github.com/awslabs/sockeye
    
  • PyPi (📥 520 / month · 📦 2 · ⏱️ 07.05.2019):

     pip install sockeye
    
finetune (🥉20 · ⭐ 590) - Scikit-learn style model finetuning for NLP. MPL-2.0
  • GitHub (👨‍💻 15 · 🔀 61 · 📦 4 · 📋 120 - 6% open · ⏱️ 22.01.2020):

     git clone https://github.com/IndicoDataSolutions/finetune
    
  • PyPi (📥 350 / month · 📦 2 · ⏱️ 10.01.2020):

     pip install finetune
    
YouTokenToMe (🥉20 · ⭐ 490) - Unsupervised text tokenizer focused on computational efficiency. MIT
  • GitHub (👨‍💻 6 · 🔀 29 · 📦 11 · 📋 24 - 41% open · ⏱️ 20.01.2020):

     git clone https://github.com/vkcom/youtokentome
    
  • PyPi (📥 2.3K / month · 📦 5 · ⏱️ 20.11.2019):

     pip install youtokentome
    
fast-bert (🥉19 · ⭐ 930) - Super easy library for BERT-based NLP models. Apache-2
  • GitHub (👨‍💻 19 · 🔀 170 · 📋 140 - 53% open · ⏱️ 16.01.2020):

     git clone https://github.com/kaushaltrivedi/fast-bert
    
  • PyPi (📥 1.4K / month · 📦 2 · ⏱️ 14.01.2020):

     pip install fast-bert
    
textpipe (🥉19 · ⭐ 180) - Textpipe: clean and extract metadata from text. MIT
  • GitHub (👨‍💻 24 · 🔀 13 · 📦 3 · 📋 45 - 51% open · ⏱️ 03.01.2020):

     git clone https://github.com/textpipe/textpipe
    
  • PyPi (📥 340 / month · 📦 1 · ⏱️ 12.12.2019):

     pip install textpipe
    
NeuroNER (🥉18 · ⭐ 1.4K) - Named-entity recognition using neural networks. Easy-to-use and state-of-the-art.. MIT
  • GitHub (👨‍💻 7 · 🔀 410 · 📦 3 · 📋 140 - 57% open · ⏱️ 02.10.2019):

     git clone https://github.com/Franck-Dernoncourt/NeuroNER
    
  • PyPi (📥 110 / month · ⏱️ 02.10.2019):

     pip install pyneuroner
    
NeMo (🥉18 · ⭐ 1K · 🐣) - Neural Modules: a toolkit for conversational AI. Apache-2
  • GitHub (👨‍💻 33 · 🔀 110 · 📋 65 - 41% open · ⏱️ 23.01.2020):

     git clone https://github.com/NVIDIA/NeMo
    
  • PyPi (📥 440 / month · 📦 2 · ⏱️ 15.11.2019):

     pip install nemo-toolkit
    
FARM (🥉18 · ⭐ 510 · 🐣) - Fast & easy transfer learning for NLP. Harvesting language models for the.. Apache-2
  • GitHub (👨‍💻 13 · 🔀 55 · 📋 77 - 15% open · ⏱️ 23.01.2020):

     git clone https://github.com/deepset-ai/FARM
    
  • PyPi (📥 200 / month · 📦 1 · ⏱️ 28.11.2019):

     pip install farm
    
ParlAI (🥉17 · ⭐ 5.2K) - A framework for training and evaluating AI models on a variety of openly available.. MIT
  • GitHub (👨‍💻 100 · 🔀 980 · 📋 580 - 19% open · ⏱️ 23.01.2020):

     git clone https://github.com/facebookresearch/ParlAI
    
Headliner (🥉17 · ⭐ 160 · 🐣) - Easy training and deployment of seq2seq models. MIT
  • GitHub (👨‍💻 2 · 🔀 26 · 📦 2 · 📋 12 - 8% open · ⏱️ 23.01.2020):

     git clone https://github.com/as-ideas/headliner
    
  • PyPi (📥 550 / month · 📦 1 · ⏱️ 23.01.2020):

     pip install headliner
    
Translate (🥉15 · ⭐ 570) - Translate - a PyTorch Language Library. BSD-3
  • GitHub (👨‍💻 70 · 🔀 120 · 📋 66 - 72% open · ⏱️ 22.01.2020):

     git clone https://github.com/pytorch/translate
    
  • PyPi (📥 21 / month · ⏱️ 01.05.2018):

     pip install pytorch-translate
    
TransferNLP (🥉15 · ⭐ 280) - NLP library designed for reproducible experimentation management. MIT
  • GitHub (👨‍💻 6 · 🔀 16 · 📋 22 - 13% open · ⏱️ 27.11.2019):

     git clone https://github.com/feedly/transfer-nlp
    
  • PyPi (📥 38 / month · ⏱️ 25.06.2019):

     pip install transfer-nlp
    
Show 8 hidden projects...
gensim (🥇34 · ⭐ 10K) - Topic Modelling for Humans. ❗️LGPL-2.1
  • GitHub (👨‍💻 360 · 🔀 3.6K · 📥 1.7K · 📦 11K · 📋 1.4K - 18% open · ⏱️ 23.01.2020):

     git clone https://github.com/RaRe-Technologies/gensim
    
  • PyPi (📥 1.4M / month · 📦 3.7K · ⏱️ 09.07.2019):

     pip install gensim
    
fuzzywuzzy (🥈29 · ⭐ 6.4K) - Fuzzy String Matching in Python. ❗️GPL-2.0
  • GitHub (👨‍💻 66 · 🔀 680 · 📦 4.5K · 📋 150 - 42% open · ⏱️ 12.11.2019):

     git clone https://github.com/seatgeek/fuzzywuzzy
    
  • PyPi (📥 1.1M / month · 📦 3.1K · ⏱️ 20.08.2018):

     pip install fuzzywuzzy
    
langid (🥈26 · ⭐ 1.5K · 💀) - Stand-alone language identification system. BSD-3
  • GitHub (👨‍💻 9 · 🔀 250 · 📦 340 · 📋 65 - 35% open · ⏱️ 15.07.2017):

     git clone https://github.com/saffsd/langid.py
    
  • PyPi (📥 61K / month · 📦 250 · ⏱️ 05.04.2016):

     pip install langid
    
polyglot (🥈25 · ⭐ 1.5K) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
  • GitHub (👨‍💻 26 · 🔀 260 · 📦 250 · 📋 170 - 64% open · ⏱️ 04.10.2019):

     git clone https://github.com/aboSamoor/polyglot
    
  • PyPi (📥 22K / month · 📦 57 · ⏱️ 03.07.2016):

     pip install polyglot
    
flashtext (🥉22 · ⭐ 4.1K · 💀) - Extract keywords from sentences or replace keywords in sentences. MIT
  • GitHub (👨‍💻 6 · 🔀 470 · 📦 130 · 📋 85 - 60% open · ⏱️ 09.11.2018):

     git clone https://github.com/vi3k6i5/flashtext
    
  • PyPi (📥 37K / month · 📦 32 · ⏱️ 16.02.2018):

     pip install flashtext
    
stop-words (🥉20 · ⭐ 100 · 💀) - Get list of common stop words in various languages in Python. BSD-3
  • GitHub (👨‍💻 8 · 🔀 18 · 📦 660 · 📋 12 - 25% open · ⏱️ 23.07.2018):

     git clone https://github.com/Alir3z4/python-stop-words
    
  • PyPi (📥 57K / month · 📦 200 · ⏱️ 23.07.2018):

     pip install stop-words
    
pyfasttext (🥉19 · ⭐ 230 · 💀) - Yet another Python binding for fastText. ❗️GPL-3.0
  • GitHub (👨‍💻 4 · 🔀 28 · 📥 210 · 📦 74 · 📋 47 - 40% open · ⏱️ 08.12.2018):

     git clone https://github.com/vrasneur/pyfasttext
    
  • PyPi (📥 1.8K / month · 📦 5 · ⏱️ 08.12.2018):

     pip install pyfasttext
    
pySBD (🥉18 · ⭐ 94) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary.. MIT
  • GitHub (👨‍💻 3 · 🔀 10 · 📦 7 · 📋 35 - 20% open · ⏱️ 13.11.2019):

     git clone https://github.com/nipunsadvilkar/pySBD
    
  • PyPi (📥 3.4K / month · 📦 2 · ⏱️ 13.11.2019):

     pip install pysbd
    

Image Data

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, character recognition, and classification.

scikit-image (🥇36 · ⭐ 3.5K) - Image processing in Python. BSD-2
  • GitHub (👨‍💻 410 · 🔀 1.4K · 📦 32K · 📋 1.8K - 32% open · ⏱️ 23.01.2020):

     git clone https://github.com/scikit-image/scikit-image
    
  • PyPi (📥 1.3M / month · 📦 14K · ⏱️ 22.10.2019):

     pip install scikit-image
    
torchvision (🥇33 · ⭐ 5.4K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
  • GitHub (👨‍💻 240 · 🔀 2.6K · 📦 15K · 📋 860 - 30% open · ⏱️ 22.01.2020):

     git clone https://github.com/pytorch/vision
    
  • PyPi (📥 370K / month · 📦 4.6K · ⏱️ 15.01.2020):

     pip install torchvision
    
imgaug (🥇31 · ⭐ 8K) - Image augmentation for machine learning experiments. MIT
  • GitHub (👨‍💻 35 · 🔀 1.6K · 📦 2.3K · 📋 330 - 51% open · ⏱️ 18.01.2020):

     git clone https://github.com/aleju/imgaug
    
  • PyPi (📥 74K / month · 📦 280 · ⏱️ 24.09.2019):

     pip install imgaug
    
Tesseract (🥈30 · ⭐ 2.6K) - Python-tesseract is an optical character recognition (OCR) tool for Python. Apache-2
  • GitHub (👨‍💻 31 · 🔀 390 · 📦 3.9K · 📋 180 - 3% open · ⏱️ 07.01.2020):

     git clone https://github.com/madmaze/pytesseract
    
  • PyPi (📥 230K / month · 📦 1.4K · ⏱️ 20.12.2019):

     pip install pytesseract
    
  • Conda:

     conda install -c conda-forge pytesseract
    
opencv-python (🥈30 · ⭐ 1.1K) - Automated CI toolchain to produce precompiled opencv-python, opencv-python-.. MIT
  • GitHub (👨‍💻 21 · 🔀 200 · 📋 240 - 7% open · ⏱️ 09.01.2020):

     git clone https://github.com/skvark/opencv-python
    
  • PyPi (📥 1.7M / month · 📦 13K · ⏱️ 21.11.2019):

     pip install opencv-python
    
imageio (🥈30 · ⭐ 680) - Python library for reading and writing image data. BSD-2
  • GitHub (👨‍💻 62 · 🔀 130 · 📦 16K · 📋 280 - 22% open · ⏱️ 13.01.2020):

     git clone https://github.com/imageio/imageio
    
  • PyPi (📥 1.7M / month · 📦 2.2K · ⏱️ 07.10.2019):

     pip install imageio
    
Face Recognition (🥈29 · ⭐ 31K) - The world's simplest facial recognition API for Python and the command.. MIT
  • GitHub (👨‍💻 32 · 🔀 8.7K · 📥 410 · 📋 910 - 46% open · ⏱️ 18.12.2019):

     git clone https://github.com/ageitgey/face_recognition
    
  • PyPi (📥 38K / month · 📦 410 · ⏱️ 21.08.2018):

     pip install face_recognition
    
MoviePy (🥈29 · ⭐ 6.2K) - Video editing with Python. MIT
  • GitHub (👨‍💻 100 · 🔀 820 · 📦 2.6K · 📋 750 - 49% open · ⏱️ 22.11.2019):

     git clone https://github.com/Zulko/moviepy
    
  • PyPi (📥 98K / month · 📦 900 · ⏱️ 01.10.2019):

     pip install moviepy
    
imutils (🥈28 · ⭐ 2.7K) - A series of convenience functions to make basic image processing operations such as.. MIT
  • GitHub (👨‍💻 19 · 🔀 650 · 📦 4.7K · 📋 140 - 52% open · ⏱️ 18.08.2019):

     git clone https://github.com/jrosebr1/imutils
    
  • PyPi (📥 130K / month · 📦 1.1K · ⏱️ 18.08.2019):

     pip install imutils
    
imageai (🥈27 · ⭐ 4.4K) - A Python library built to empower developers to build applications and systems with.. MIT
  • GitHub (👨‍💻 13 · 🔀 1.2K · 📥 360K · 📦 200 · 📋 420 - 22% open · ⏱️ 14.01.2020):

     git clone https://github.com/OlafenwaMoses/ImageAI
    
  • PyPi (📥 9K / month · 📦 12 · ⏱️ 13.09.2019):

     pip install imageai
    
Albumentations (🥈27 · ⭐ 4.3K) - Fast image augmentation library and easy-to-use wrapper around other.. MIT
  • GitHub (👨‍💻 51 · 🔀 560 · 📦 480 · 📋 200 - 32% open · ⏱️ 14.01.2020):

     git clone https://github.com/albumentations-team/albumentations
    
  • PyPi (📥 24K / month · 📦 55 · ⏱️ 15.11.2019):

     pip install albumentations
    
Augmentor (🥈26 · ⭐ 3.8K) - Image augmentation library in Python for machine learning. MIT
  • GitHub (👨‍💻 22 · 🔀 710 · 📦 150 · 📋 170 - 62% open · ⏱️ 17.01.2020):

     git clone https://github.com/mdbloice/Augmentor
    
  • PyPi (📥 3.8K / month · 📦 42 · ⏱️ 17.01.2020):

     pip install Augmentor
    
GluonCV (🥈26 · ⭐ 3.4K) - Gluon CV Toolkit. Apache-2
  • GitHub (👨‍💻 83 · 🔀 790 · 📦 95 · 📋 540 - 50% open · ⏱️ 20.01.2020):

     git clone https://github.com/dmlc/gluon-cv
    
  • PyPi (📥 21K / month · 📦 15 · ⏱️ 23.01.2020):

     pip install gluoncv
    
tesserocr (🥈26 · ⭐ 1.1K) - A Python wrapper for the tesseract-ocr API. MIT
  • GitHub (👨‍💻 21 · 🔀 160 · 📦 270 · 📋 180 - 29% open · ⏱️ 13.01.2020):

     git clone https://github.com/sirfz/tesserocr
    
  • PyPi (📥 26K / month · 📦 46 · ⏱️ 08.11.2019):

     pip install tesserocr
    
  • Conda:

     conda install -c conda-forge tesserocr
    
Kornia (🥉24 · ⭐ 1.8K) - Open Source Differentiable Computer Vision Library for PyTorch. Apache-2
  • GitHub (👨‍💻 42 · 🔀 180 · 📋 140 - 35% open · ⏱️ 23.01.2020):

     git clone https://github.com/kornia/kornia
    
  • PyPi (📥 15K / month · ⏱️ 05.10.2019):

     pip install kornia
    
ImageHash (🥉24 · ⭐ 1.4K) - A Python Perceptual Image Hashing Module. BSD-2
  • GitHub (👨‍💻 14 · 🔀 210 · 📦 700 · 📋 65 - 27% open · ⏱️ 04.10.2019):

     git clone https://github.com/JohannesBuchner/imagehash
    
  • PyPi (📥 98K / month · 📦 440 · ⏱️ 07.12.2017):

     pip install ImageHash
    
InsightFace (🥉22 · ⭐ 6K) - Face Analysis Project on MXNet. MIT
  • GitHub (👨‍💻 9 · 🔀 2.1K · 📦 7 · 📋 1K - 44% open · ⏱️ 17.12.2019):

     git clone https://github.com/deepinsight/insightface
    
  • PyPi (📥 730 / month · 📦 1 · ⏱️ 07.09.2019):

     pip install insightface
    
Image Deduplicator (🥉22 · ⭐ 2.9K) - Finding duplicate images made easy! Apache-2
  • GitHub (👨‍💻 8 · 🔀 220 · 📦 3 · 📋 39 - 17% open · ⏱️ 13.01.2020):

     git clone https://github.com/idealo/imagededup
    
  • PyPi (📥 2.1K / month · 📦 1 · ⏱️ 11.12.2019):

     pip install imagededup
    
tensorflow-graphics (🥉22 · ⭐ 2K) - TensorFlow Graphics: Differentiable Graphics Layers for TensorFlow. Apache-2
  • GitHub (👨‍💻 13 · 🔀 190 · 📦 14 · 📋 53 - 20% open · ⏱️ 20.01.2020):

     git clone https://github.com/tensorflow/graphics
    
  • PyPi (📥 25K / month · 📦 1 · ⏱️ 09.05.2019):

     pip install tensorflow-graphics
    
Image Super-Resolution (🥉22 · ⭐ 1.5K) - Super-scale your images and run experiments with Residual.. Apache-2
  • GitHub (👨‍💻 5 · 🔀 270 · 📦 9 · 📋 87 - 27% open · ⏱️ 08.01.2020):

     git clone https://github.com/idealo/image-super-resolution
    
  • PyPi (📥 1.2K / month · 📦 4 · ⏱️ 08.01.2020):

     pip install ISR
    
  • Dockerhub (📥 61 · ⏱️ 01.04.2019):

     docker pull idealo/image-super-resolution-gpu
    
Pythia (🥉20 · ⭐ 3K) - A modular framework for vision & language multimodal research from Facebook AI.. BSD-3
  • GitHub (👨‍💻 21 · 🔀 380 · 📦 3 · 📋 170 - 22% open · ⏱️ 11.12.2019):

     git clone https://github.com/facebookresearch/pythia
    
MMDetection (🥉19 · ⭐ 8.2K) - Open MMLab Detection Toolbox and Benchmark. Apache-2
  • GitHub (👨‍💻 89 · 🔀 2.5K · 📦 3 · 📋 1.6K - 22% open · ⏱️ 22.01.2020):

     git clone https://github.com/open-mmlab/mmdetection
    
Face Alignment (🥉19 · ⭐ 3.5K) - 2D and 3D face alignment library built using PyTorch. BSD-3
  • GitHub (👨‍💻 15 · 🔀 770 · 📋 150 - 19% open · ⏱️ 30.08.2019):

     git clone https://github.com/1adrianb/face-alignment
    
  • PyPi (📥 1.6K / month · 📦 4 · ⏱️ 19.12.2018):

     pip install face-alignment
    
Luminoth (🥉19 · ⭐ 2.3K) - UNMAINTAINED. Deep Learning toolkit for Computer Vision. BSD-3
  • GitHub (👨‍💻 15 · 🔀 380 · 📥 8.7K · 📋 180 - 28% open · ⏱️ 07.01.2020):

     git clone https://github.com/tryolabs/luminoth
    
  • PyPi (📥 710 / month · ⏱️ 09.11.2018):

     pip install luminoth
    
detectron2 (🥉16 · ⭐ 7.5K · 🐣) - Detectron2 is FAIR's next-generation research platform for object.. Apache-2
  • GitHub (👨‍💻 61 · 🔀 1.3K · 📋 640 - 7% open · ⏱️ 22.01.2020):

     git clone https://github.com/facebookresearch/detectron2
    
Classy Vision (🥉16 · ⭐ 570 · 🐣) - An end-to-end PyTorch framework for image and video classification. MIT
  • GitHub (👨‍💻 12 · 🔀 63 · 📋 24 - 70% open · ⏱️ 23.01.2020):

     git clone https://github.com/facebookresearch/ClassyVision
    
  • PyPi (📥 630 / month · ⏱️ 22.01.2020):

     pip install classy_vision
    
  • Conda:

     conda install -c conda-forge classy_vision
    
keras-ocr (🥉16 · ⭐ 360 · 🐣) - A packaged and flexible version of the CRAFT text detector and Keras CRNN.. MIT
  • GitHub (👨‍💻 2 · 🔀 48 · 📋 17 - 17% open · ⏱️ 20.01.2020):

     git clone https://github.com/faustomorales/keras-ocr
    
  • PyPi (📥 1.2K / month · 📦 1 · ⏱️ 12.01.2020):

     pip install keras-ocr
    
pycls (🥉11 · ⭐ 540) - Codebase for Image Classification Research, written in PyTorch. MIT
  • GitHub (👨‍💻 5 · 🔀 32 · 📋 4 - 25% open · ⏱️ 13.01.2020):

     git clone https://github.com/facebookresearch/pycls
    
PySlowFast (🥉10 · ⭐ 1.9K · 🐣) - PySlowFast: video understanding codebase from FAIR for reproducing.. Apache-2
  • GitHub (👨‍💻 1 · 🔀 280 · 📋 97 - 29% open · ⏱️ 31.12.2019):

     git clone https://github.com/facebookresearch/SlowFast
    
Show 2 hidden projects...
Pillow (🥇37 · ⭐ 7K) - The friendly PIL fork (Python Imaging Library). ❗️PIL
  • GitHub (👨‍💻 310 · 🔀 1.4K · 📦 200K · 📋 1.8K - 11% open · ⏱️ 21.01.2020):

     git clone https://github.com/python-pillow/Pillow
    
  • PyPi (📥 8.5M / month · 📦 110K · ⏱️ 02.01.2020):

     pip install Pillow
    
Pillow-SIMD (🥉25 · ⭐ 1.3K) - The friendly PIL fork. ❗️PIL
  • GitHub (👨‍💻 310 · 🔀 56 · 📦 190 · 📋 53 - 22% open · ⏱️ 17.01.2020):

     git clone https://github.com/uploadcare/pillow-simd
    
  • PyPi (📥 68K / month · 📦 50 · ⏱️ 17.01.2020):

     pip install pillow-simd
    

Graph Data

Libraries for graph processing, clustering, embedding, and machine learning tasks.

networkx (🥇35 · ⭐ 6.8K) - Official NetworkX source code repository. BSD-3
  • GitHub (👨‍💻 390 · 🔀 1.7K · 📥 35 · 📦 37K · 📋 2.2K - 12% open · ⏱️ 21.01.2020):

     git clone https://github.com/networkx/networkx
    
  • PyPi (📥 3.7M / month · 📦 20K · ⏱️ 17.10.2019):

     pip install networkx
    
PyTorch Geometric (🥇25 · ⭐ 6.4K) - Geometric Deep Learning Extension Library for PyTorch. MIT
  • GitHub (👨‍💻 69 · 🔀 990 · 📋 790 - 30% open · ⏱️ 17.01.2020):

     git clone https://github.com/rusty1s/pytorch_geometric
    
  • PyPi (📥 4.4K / month · 📦 12 · ⏱️ 29.08.2019):

     pip install torch-geometric
    
dgl (🥇24 · ⭐ 3.9K) - Python package built to ease deep learning on graphs, on top of existing DL.. Apache-2
  • GitHub (👨‍💻 48 · 🔀 650 · 📋 440 - 29% open · ⏱️ 20.01.2020):

     git clone https://github.com/dmlc/dgl
    
  • PyPi (📥 3.6K / month · 📦 15 · ⏱️ 22.01.2020):

     pip install dgl
    
StellarGraph (🥈23 · ⭐ 910) - StellarGraph - Machine Learning on Graphs. Apache-2
  • GitHub (👨‍💻 22 · 🔀 120 · 📦 9 · 📋 380 - 29% open · ⏱️ 23.01.2020):

     git clone https://github.com/stellargraph/stellargraph
    
  • PyPi (📥 1.6K / month · 📦 2 · ⏱️ 20.01.2020):

     pip install stellargraph
    
Node2Vec (🥈21 · ⭐ 360) - Implementation of the node2vec algorithm. MIT
  • GitHub (👨‍💻 3 · 🔀 110 · 📦 41 · 📋 29 - 10% open · ⏱️ 13.09.2019):

     git clone https://github.com/eliorc/node2vec
    
  • PyPi (📥 9K / month · 📦 4 · ⏱️ 13.09.2019):

     pip install node2vec
    
graph-nets (🥈20 · ⭐ 4.1K) - Build Graph Nets in TensorFlow. Apache-2
  • GitHub (👨‍💻 8 · 🔀 600 · 📋 85 - 10% open · ⏱️ 12.12.2019):

     git clone https://github.com/deepmind/graph_nets
    
  • PyPi (📥 820 / month · 📦 7 · ⏱️ 04.11.2019):

     pip install graph-nets
    
torch-cluster (🥈20 · ⭐ 190) - PyTorch Extension Library of Optimized Graph Cluster Algorithms. MIT
  • GitHub (👨‍💻 12 · 🔀 35 · 📋 35 - 45% open · ⏱️ 06.01.2020):

     git clone https://github.com/rusty1s/pytorch_cluster
    
  • PyPi (📥 5.6K / month · 📦 4 · ⏱️ 11.08.2019):

     pip install torch-cluster
    
PyTorch-BigGraph (🥈19 · ⭐ 2K) - Generate embeddings from large-scale graph-structured data. BSD-3
  • GitHub (👨‍💻 10 · 🔀 270 · 📥 55 · 📋 82 - 17% open · ⏱️ 09.12.2019):

     git clone https://github.com/facebookresearch/PyTorch-BigGraph
    
  • PyPi (📥 500 / month · ⏱️ 01.05.2019):

     pip install torchbiggraph
    
AmpliGraph (🥉18 · ⭐ 1K) - Python library for Representation Learning on Knowledge Graphs.. Apache-2
  • GitHub (👨‍💻 12 · 🔀 89 · 📦 2 · 📋 140 - 14% open · ⏱️ 22.10.2019):

     git clone https://github.com/Accenture/AmpliGraph
    
  • PyPi (📥 200 / month · ⏱️ 22.10.2019):

     pip install ampligraph
    
Spektral (🥉18 · ⭐ 540) - Graph Neural Networks with Keras and TensorFlow. MIT
  • GitHub (👨‍💻 5 · 🔀 65 · 📦 2 · 📋 24 - 29% open · ⏱️ 13.01.2020):

     git clone https://github.com/danielegrattarola/spektral
    
  • PyPi (📥 290 / month · ⏱️ 06.12.2019):

     pip install spektral
    
Euler (🥉17 · ⭐ 2K) - A distributed graph deep learning framework. Apache-2
  • GitHub (👨‍💻 10 · 🔀 430 · 📋 200 - 64% open · ⏱️ 06.01.2020):

     git clone https://github.com/alibaba/euler
    
  • PyPi (📥 85 / month · ⏱️ 10.04.2019):

     pip install euler-gl
    
DeepGraph (🥉15 · ⭐ 210 · 💤) - Analyze Data with Pandas-based Networks. BSD-3
  • GitHub (👨‍💻 2 · 🔀 28 · 📦 1 · 📋 10 - 70% open · ⏱️ 08.04.2019):

     git clone https://github.com/deepgraph/deepgraph
    
  • PyPi (📥 220 / month · ⏱️ 08.04.2019):

     pip install deepgraph
    
  • Conda:

     conda install -c conda-forge deepgraph
    
OpenKE (🥉14 · ⭐ 1.6K) - An Open-Source Package for Knowledge Embedding (KE). MIT
  • GitHub (👨‍💻 10 · 🔀 510 · 📋 210 - 11% open · ⏱️ 04.01.2020):

     git clone https://github.com/thunlp/OpenKE
    
OpenNE (🥉14 · ⭐ 1.2K) - An Open-Source Package for Network Embedding (NE). MIT
  • GitHub (👨‍💻 10 · 🔀 390 · 📋 79 - 56% open · ⏱️ 12.08.2019):

     git clone https://github.com/thunlp/OpenNE
    
GraphEmbedding (🥉13 · ⭐ 800) - Implementation and experiments of graph embedding algorithms.deep.. MIT
  • GitHub (👨‍💻 4 · 🔀 260 · 📦 6 · 📋 23 - 65% open · ⏱️ 03.11.2019):

     git clone https://github.com/shenweichen/GraphEmbedding
    
GraphVite (🥉10 · ⭐ 590 · 🐣) - GraphVite: A General and High-performance Graph Embedding System. Apache-2
  • GitHub (👨‍💻 1 · 🔀 79 · 📋 38 - 26% open · ⏱️ 17.11.2019):

     git clone https://github.com/DeepGraphLearning/graphvite
    
  • Conda:

     conda install -c milagraph graphvite
    
Show 5 hidden projects...
igraph (🥇24 · ⭐ 570) - Python interface for igraph. ❗️GPL-2.0
  • GitHub (👨‍💻 38 · 🔀 140 · 📥 240 · 📦 780 · 📋 230 - 27% open · ⏱️ 22.01.2020):

     git clone https://github.com/igraph/python-igraph
    
  • PyPi (📥 14K / month · 📦 360 · ⏱️ 05.06.2015):

     pip install python-igraph
    
DeepWalk (🥈19 · ⭐ 1.7K) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
  • GitHub (👨‍💻 8 · 🔀 650 · 📦 20 · 📋 90 - 17% open · ⏱️ 12.11.2019):

     git clone https://github.com/phanein/deepwalk
    
  • PyPi (📥 290 / month · 📦 4 · ⏱️ 29.04.2018):

     pip install deepwalk
    
PyKEEN (🥉15 · ⭐ 74) - A Python library for learning and evaluating knowledge graph embeddings. MIT
  • GitHub (👨‍💻 9 · 🔀 13 · 📋 26 - 42% open · ⏱️ 05.09.2019):

     git clone https://github.com/SmartDataAnalytics/PyKEEN
    
  • PyPi (📥 170 / month · ⏱️ 13.08.2019):

     pip install pykeen
    
GraphSAGE (🥉14 · ⭐ 1.4K · 💀) - Representation learning on large graphs using stochastic graph convolutions. MIT
  • GitHub (👨‍💻 9 · 🔀 440 · 📋 100 - 54% open · ⏱️ 19.09.2018):

     git clone https://github.com/williamleif/GraphSAGE
    
Karate Club (🥉14 · ⭐ 260 · 🐣) - A general purpose community detection and network embedding library for.. ❗️GPL-3.0
  • GitHub (👨‍💻 2 · 🔀 22 · ⏱️ 23.01.2020):

     git clone https://github.com/benedekrozemberczki/karateclub
    
  • PyPi (📥 980 / month · ⏱️ 23.01.2020):

     pip install karateclub
    

Audio Data

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

librosa (🥇31 · ⭐ 3.3K) - Python library for audio and music analysis. ISC
  • GitHub (👨‍💻 68 · 🔀 570 · 📦 4.2K · 📋 680 - 7% open · ⏱️ 22.01.2020):

     git clone https://github.com/librosa/librosa
    
  • PyPi (📥 160K / month · 📦 1.8K · ⏱️ 13.01.2020):

     pip install librosa
    
DeepSpeech (🥇30 · ⭐ 13K) - A TensorFlow implementation of Baidu's DeepSpeech architecture. MPL-2.0
  • GitHub (👨‍💻 110 · 🔀 2.4K · 📥 180K · 📦 160 · 📋 1.6K - 4% open · ⏱️ 22.01.2020):

     git clone https://github.com/mozilla/DeepSpeech
    
  • PyPi (📥 8.5K / month · 📦 29 · ⏱️ 10.01.2020):

     pip install deepspeech
    
Pydub (🥈29 · ⭐ 4.1K · 💤) - Manipulate audio with a simple and easy high level interface. MIT
  • GitHub (👨‍💻 69 · 🔀 560 · 📦 2.8K · 📋 320 - 36% open · ⏱️ 18.06.2019):

     git clone https://github.com/jiaaro/pydub
    
  • PyPi (📥 97K / month · 📦 990 · ⏱️ 17.09.2018):

     pip install pydub
    
Magenta (🥈28 · ⭐ 15K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
  • GitHub (👨‍💻 120 · 🔀 2.9K · 📦 140 · 📋 660 - 34% open · ⏱️ 22.01.2020):

     git clone https://github.com/tensorflow/magenta
    
  • PyPi (📥 3K / month · 📦 35 · ⏱️ 11.12.2019):

     pip install magenta
    
SpeechRecognition (🥈28 · ⭐ 4.5K) - Speech recognition module for Python, supporting several engines and.. BSD-3
  • GitHub (👨‍💻 41 · 🔀 1.5K · 📦 2.7K · 📋 390 - 34% open · ⏱️ 02.07.2019):

     git clone https://github.com/Uberi/speech_recognition
    
  • PyPi (📥 57K / month · 📦 1.1K · ⏱️ 05.12.2017):

     pip install SpeechRecognition
    
spleeter (🥈25 · ⭐ 9.7K · 🐣) - Deezer source separation library including pretrained models. MIT
  • GitHub (👨‍💻 8 · 🔀 840 · 📥 220K · 📦 30 · 📋 230 - 19% open · ⏱️ 27.12.2019):

     git clone https://github.com/deezer/spleeter
    
  • PyPi (📥 4.6K / month · ⏱️ 27.12.2019):

     pip install spleeter
    
pyAudioAnalysis (🥉24 · ⭐ 2.8K) - Python Audio Analysis Library: Feature Extraction, Classification,.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 790 · 📦 73 · 📋 220 - 66% open · ⏱️ 19.01.2020):

     git clone https://github.com/tyiannak/pyAudioAnalysis
    
  • PyPi (📥 16K / month · 📦 10 · ⏱️ 03.09.2018):

     pip install pyAudioAnalysis
    
torchaudio (🥉24 · ⭐ 790) - Data manipulation and transformation for audio signal processing, powered by.. BSD-2
  • GitHub (👨‍💻 39 · 🔀 160 · 📦 84 · 📋 160 - 38% open · ⏱️ 22.01.2020):

     git clone https://github.com/pytorch/audio
    
  • PyPi (📥 4.2K / month · 📦 9 · ⏱️ 16.01.2020):

     pip install torchaudio
    
Porcupine (🥉22 · ⭐ 2K) - On-device wake word detection powered by deep learning. Apache-2
  • GitHub (👨‍💻 22 · 🔀 240 · 📦 7 · 📋 210 - 0% open · ⏱️ 19.11.2019):

     git clone https://github.com/Picovoice/Porcupine
    
  • PyPi (📥 360 / month · ⏱️ 01.11.2019):

     pip install pvporcupine
    
Madmom (🥉21 · ⭐ 540) - Python audio and music signal processing library. BSD-3
  • GitHub (👨‍💻 18 · 🔀 97 · 📦 72 · 📋 210 - 19% open · ⏱️ 19.12.2019):

     git clone https://github.com/CPJKU/madmom
    
  • PyPi (📥 940 / month · 📦 8 · ⏱️ 14.11.2018):

     pip install madmom
    
Dejavu (🥉20 · ⭐ 4.8K · 💤) - Audio fingerprinting and recognition in Python. MIT
  • GitHub (👨‍💻 20 · 🔀 1K · 📦 13 · 📋 170 - 51% open · ⏱️ 27.05.2019):

     git clone https://github.com/worldveil/dejavu
    
  • PyPi (📥 150 / month · 📦 1 · ⏱️ 19.04.2015):

     pip install PyDejavu
    
Show 3 hidden projects...
Essentia (🥉23 · ⭐ 1.4K) - C++ library for audio and music analysis, description and synthesis,.. ❗️AGPL-3.0
  • GitHub (👨‍💻 63 · 🔀 340 · 📦 54 · 📋 750 - 31% open · ⏱️ 23.01.2020):

     git clone https://github.com/MTG/essentia
    
  • PyPi (📥 6.6K / month · 📦 5 · ⏱️ 23.01.2020):

     pip install essentia
    
aubio (🥉20 · ⭐ 1.6K) - A library for audio and music analysis. ❗️GPL-3.0
  • GitHub (👨‍💻 19 · 🔀 240 · 📦 120 · 📋 230 - 28% open · ⏱️ 17.11.2019):

     git clone https://github.com/aubio/aubio
    
Muda (🥉17 · ⭐ 140) - A library for augmenting annotated audio data. ISC
  • GitHub (👨‍💻 6 · 🔀 24 · 📦 4 · 📋 46 - 4% open · ⏱️ 15.11.2019):

     git clone https://github.com/bmcfee/muda
    
  • PyPi (📥 120 / month · 📦 1 · ⏱️ 15.11.2019):

     pip install muda
    

Geospatial Data

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

folium (🥇32 · ⭐ 4.4K) - Python Data. Leaflet.js Maps. MIT
  • GitHub (👨‍💻 110 · 🔀 1.5K · 📦 3.7K · 📋 690 - 12% open · ⏱️ 05.01.2020):

     git clone https://github.com/python-visualization/folium
    
  • PyPi (📥 220K / month · 📦 960 · ⏱️ 02.12.2019):

     pip install folium
    
GeoPandas (🥇32 · ⭐ 1.9K) - Python tools for geographic data. BSD-3
  • GitHub (👨‍💻 96 · 🔀 430 · 📥 580 · 📦 2.9K · 📋 670 - 36% open · ⏱️ 20.01.2020):

     git clone https://github.com/geopandas/geopandas
    
  • PyPi (📥 500K / month · 📦 1.1K · ⏱️ 18.11.2019):

     pip install geopandas
    
geopy (🥈30 · ⭐ 2.7K) - Geocoding library for Python. MIT
  • GitHub (👨‍💻 110 · 🔀 450 · 📦 7.9K · 📋 190 - 12% open · ⏱️ 22.01.2020):

     git clone https://github.com/geopy/geopy
    
  • PyPi (📥 1.1M / month · 📦 6.6K · ⏱️ 26.03.2019):

     pip install geopy
    
Rasterio (🥈30 · ⭐ 1.2K) - Rasterio reads and writes geospatial raster datasets. BSD-3
  • GitHub (👨‍💻 95 · 🔀 330 · 📥 610 · 📦 1.3K · 📋 1.2K - 11% open · ⏱️ 19.12.2019):

     git clone https://github.com/mapbox/rasterio
    
  • PyPi (📥 60K / month · 📦 850 · ⏱️ 19.12.2019):

     pip install rasterio
    
pyproj (🥈29 · ⭐ 430) - Python interface to PROJ (cartographic projections and coordinate transformations.. MIT
  • GitHub (👨‍💻 30 · 🔀 120 · 📦 4.1K · 📋 270 - 3% open · ⏱️ 23.01.2020):

     git clone https://github.com/pyproj4/pyproj
    
  • PyPi (📥 910K / month · 📦 2.4K · ⏱️ 01.12.2019):

     pip install pyproj
    
Fiona (🥈28 · ⭐ 650) - Fiona reads and writes geographic data files. BSD-3
  • GitHub (👨‍💻 60 · 🔀 140 · 📦 2.3K · 📋 550 - 9% open · ⏱️ 20.01.2020):

     git clone https://github.com/Toblerity/Fiona
    
  • PyPi (📥 570K / month · 📦 1.2K · ⏱️ 05.12.2019):

     pip install fiona
    
geojson (🥈28 · ⭐ 500) - Python bindings and utilities for GeoJSON. BSD-3
  • GitHub (👨‍💻 39 · 🔀 69 · 📦 3K · 📋 61 - 24% open · ⏱️ 06.09.2019):

     git clone https://github.com/jazzband/geojson
    
  • PyPi (📥 520K / month · 📦 1.6K · ⏱️ 09.08.2019):

     pip install geojson
    
ipyleaflet (🥉25 · ⭐ 840) - A Jupyter - Leaflet.js bridge. MIT
  • GitHub (👨‍💻 52 · 🔀 220 · 📦 9 · 📋 220 - 37% open · ⏱️ 17.01.2020):

     git clone https://github.com/jupyter-widgets/ipyleaflet
    
  • PyPi (📥 11K / month · 📦 98 · ⏱️ 17.01.2020):

     pip install ipyleaflet
    
  • NPM (📥 9.4K / month · 📦 1 · ⏱️ 17.01.2020):

     npm install jupyter-leaflet
    
PySAL (🥉24 · ⭐ 680) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
  • GitHub (👨‍💻 67 · 🔀 230 · 📋 570 - 9% open · ⏱️ 03.08.2019):

     git clone https://github.com/pysal/pysal
    
  • PyPi (📥 19K / month · 📦 11 · ⏱️ 30.01.2019):

     pip install pysal
    
ArcGIS API (🥉23 · ⭐ 650) - Documentation and samples for ArcGIS API for Python. Apache-2
  • GitHub (👨‍💻 37 · 🔀 470 · 📋 200 - 34% open · ⏱️ 16.01.2020):

     git clone https://github.com/Esri/arcgis-python-api
    
  • PyPi (📥 4.3K / month · 📦 8 · ⏱️ 31.10.2019):

     pip install arcgis
    
  • Dockerhub (📥 2.9K · ⭐ 25 · ⏱️ 02.07.2019):

     docker pull esridocker/arcgis-api-python-notebook
    
gmaps (🥉22 · ⭐ 620) - Google maps for Jupyter notebooks. BSD-3
  • GitHub (👨‍💻 16 · 🔀 120 · 📦 1 · 📋 180 - 25% open · ⏱️ 22.07.2019):

     git clone https://github.com/pbugnion/gmaps
    
  • PyPi (📥 12K / month · 📦 18 · ⏱️ 20.07.2019):

     pip install gmaps
    
  • NPM (📥 1.2K / month · ⏱️ 21.07.2019):

     npm install jupyter-gmaps
    
Mapbox GL (🥉22 · ⭐ 500) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
  • GitHub (👨‍💻 20 · 🔀 94 · 📦 47 · 📋 95 - 30% open · ⏱️ 03.12.2019):

     git clone https://github.com/mapbox/mapboxgl-jupyter
    
  • PyPi (📥 9.2K / month · 📦 16 · ⏱️ 02.06.2019):

     pip install mapboxgl
    
GeoViews (🥉22 · ⭐ 260) - Simple, concise geographical visualization in Python. BSD-3
  • GitHub (👨‍💻 20 · 🔀 53 · 📦 80 · 📋 210 - 30% open · ⏱️ 16.01.2020):

     git clone https://github.com/holoviz/geoviews
    
  • PyPi (📥 1.2K / month · 📦 10 · ⏱️ 26.11.2019):

     pip install geoviews
    
  • Conda:

     conda install -c conda-forge geoviews
    
geoplotlib (🥉20 · ⭐ 810 · 💤) - python toolbox for visualizing geographical data and making maps. MIT
  • GitHub (👨‍💻 8 · 🔀 120 · 📦 50 · 📋 40 - 57% open · ⏱️ 06.05.2019):

     git clone https://github.com/andrea-cuttone/geoplotlib
    
  • PyPi (📥 720 / month · 📦 9 · ⏱️ 27.07.2016):

     pip install geoplotlib
    
Show 6 hidden projects...
Geocoder (🥈29 · ⭐ 1.2K · 💀) - Python Geocoder. MIT
  • GitHub (👨‍💻 74 · 🔀 230 · 📦 1.9K · 📋 260 - 18% open · ⏱️ 12.10.2018):

     git clone https://github.com/DenisCarriere/geocoder
    
  • PyPi (📥 92K / month · 📦 580 · ⏱️ 04.04.2018):

     pip install geocoder
    
Cartopy (🥉27 · ⭐ 1.2K) - A cartographic Python library with Matplotlib support. ❗️LGPL-3.0
  • GitHub (👨‍💻 95 · 🔀 330 · 📥 610 · 📦 1.3K · 📋 1.2K - 11% open · ⏱️ 19.12.2019):

     git clone https://github.com/SciTools/cartopy
    
  • PyPi (📥 10K / month · 📦 96 · ⏱️ 17.11.2018):

     pip install Cartopy
    
Satpy (🥉21 · ⭐ 570) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
  • GitHub (👨‍💻 90 · 🔀 150 · 📦 20 · 📋 360 - 44% open · ⏱️ 21.01.2020):

     git clone https://github.com/pytroll/satpy
    
  • PyPi (📥 500 / month · 📦 3 · ⏱️ 10.01.2020):

     pip install satpy
    
Sentinelsat (🥉20 · ⭐ 390) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
  • GitHub (👨‍💻 29 · 🔀 140 · 📥 74 · 📦 72 · 📋 190 - 13% open · ⏱️ 11.12.2019):

     git clone https://github.com/sentinelsat/sentinelsat
    
  • PyPi (📥 2.9K / month · 📦 22 · ⏱️ 05.04.2019):

     pip install sentinelsat
    
pymap3d (🥉20 · ⭐ 110) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef enu eci. BSD-2
  • GitHub (👨‍💻 7 · 🔀 46 · 📋 19 - 5% open · ⏱️ 29.12.2019):

     git clone https://github.com/scivision/pymap3d
    
  • PyPi (📥 11K / month · 📦 3 · ⏱️ 29.12.2019):

     pip install pymap3d
    
EarthPy (🥉19 · ⭐ 130) - A package built to support working with spatial data using open source python. BSD-3
  • GitHub (👨‍💻 35 · 🔀 61 · 📦 20 · 📋 180 - 5% open · ⏱️ 16.01.2020):

     git clone https://github.com/earthlab/earthpy
    
  • PyPi (📥 870 / month · 📦 2 · ⏱️ 13.11.2019):

     pip install earthpy
    
  • Conda:

     conda install -c conda-forge earthpy
    

Financial Data

Back to top

Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.

zipline (🥇28 · ⭐ 10K) - Zipline, a Pythonic Algorithmic Trading Library. Apache-2
  • GitHub (👨‍💻 140 · 🔀 3.1K · 📦 420 · 📋 940 - 37% open · ⏱️ 21.01.2020):

     git clone https://github.com/quantopian/zipline
    
  • PyPi (📥 4.2K / month · 📦 79 · ⏱️ 17.07.2018):

     pip install zipline
    
Alpha Vantage (🥇28 · ⭐ 1.9K) - A python wrapper for Alpha Vantage API for financial data. MIT
  • GitHub (👨‍💻 27 · 🔀 330 · 📋 130 - 2% open · ⏱️ 16.01.2020):

     git clone https://github.com/RomelTorres/alpha_vantage
    
  • PyPi (📥 83K / month · 📦 200 · ⏱️ 26.08.2018):

     pip install alpha_vantage
    
pyfolio (🥈25 · ⭐ 2.6K) - Portfolio and risk analytics in Python. Apache-2
  • GitHub (👨‍💻 52 · 🔀 720 · 📦 96 · 📋 350 - 26% open · ⏱️ 15.01.2020):

     git clone https://github.com/quantopian/pyfolio
    
  • PyPi (📥 3.1K / month · 📦 26 · ⏱️ 15.04.2019):

     pip install pyfolio
    
yfinance (🥈25 · ⭐ 860) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
  • GitHub (👨‍💻 12 · 🔀 210 · 📦 230 · 📋 150 - 21% open · ⏱️ 29.12.2019):

     git clone https://github.com/ranaroussi/yfinance
    
  • PyPi (📥 20K / month · 📦 26 · ⏱️ 27.12.2019):

     pip install yfinance
    
  • Conda:

     conda install -c ranaroussi yfinance
    
empyrical (🥈24 · ⭐ 470) - Common financial risk and performance metrics. Used by zipline and pyfolio. Apache-2
  • GitHub (👨‍💻 21 · 🔀 120 · 📦 290 · 📋 49 - 55% open · ⏱️ 30.09.2019):

     git clone https://github.com/quantopian/empyrical
    
  • PyPi (📥 9.2K / month · 📦 190 · ⏱️ 15.08.2019):

     pip install empyrical
    
Alphalens (🥈23 · ⭐ 1.2K) - Performance analysis of predictive (alpha) stock factors. Apache-2
  • GitHub (👨‍💻 20 · 🔀 400 · 📦 170 · 📋 150 - 11% open · ⏱️ 15.01.2020):

     git clone https://github.com/quantopian/alphalens
    
  • PyPi (📥 1.5K / month · 📦 14 · ⏱️ 07.01.2019):

     pip install alphalens
    
IB-insync (🥈23 · ⭐ 740) - Python sync/async framework for Interactive Brokers API. BSD-2
  • GitHub (👨‍💻 21 · 🔀 250 · 📋 180 - 1% open · ⏱️ 16.01.2020):

     git clone https://github.com/erdewit/ib_insync
    
  • PyPi (📥 2.2K / month · 📦 12 · ⏱️ 02.01.2020):

     pip install ib_insync
    
bt (🥉21 · ⭐ 700) - bt - flexible backtesting for Python. MIT
  • GitHub (👨‍💻 13 · 🔀 200 · 📦 28 · 📋 170 - 31% open · ⏱️ 12.10.2019):

     git clone https://github.com/pmorissette/bt
    
  • PyPi (📥 810 / month · 📦 20 · ⏱️ 28.05.2018):

     pip install bt
    
ffn (🥉21 · ⭐ 530) - ffn - a financial function library for Python. MIT
  • GitHub (👨‍💻 21 · 🔀 140 · 📦 56 · 📋 53 - 41% open · ⏱️ 07.10.2019):

     git clone https://github.com/pmorissette/ffn
    
  • PyPi (📥 2.4K / month · 📦 16 · ⏱️ 28.05.2018):

     pip install ffn
    
finmarketpy (🥉19 · ⭐ 2.1K) - Python library for backtesting trading strategies & analyzing financial.. Apache-2
  • GitHub (👨‍💻 12 · 🔀 360 · 📥 31 · 📋 19 - 89% open · ⏱️ 17.12.2019):

     git clone https://github.com/cuemacro/finmarketpy
    
  • PyPi (📥 100 / month · ⏱️ 04.12.2019):

     pip install finmarketpy
    
Crypto Signals (🥉19 · ⭐ 1.9K) - Automated Crypto Trading & Technical Analysis (TA) Bot for Bittrex, Binance,.. MIT
  • GitHub (👨‍💻 25 · 🔀 520 · 📋 200 - 14% open · ⏱️ 29.10.2019):

     git clone https://github.com/CryptoSignal/crypto-signal
    
  • Dockerhub (📥 32K · ⭐ 4 · ⏱️ 29.10.2019):

     docker pull shadowreaver/crypto-signal
    
Show 6 hidden projects...
backtrader (🥈24 · ⭐ 3.2K) - Python Backtesting library for trading strategies. ❗️GPL-3.0
  • GitHub (👨‍💻 39 · 🔀 1K · 📦 170 · ⏱️ 07.01.2020):

     git clone https://github.com/mementum/backtrader
    
  • PyPi (📥 3.8K / month · 📦 35 · ⏱️ 01.05.2019):

     pip install backtrader
    
Enigma Catalyst (🥈23 · ⭐ 1.7K · 💀) - An Algorithmic Trading Library for Crypto-Assets in Python. Apache-2
  • GitHub (👨‍💻 140 · 🔀 490 · 📦 14 · 📋 490 - 26% open · ⏱️ 11.11.2018):

     git clone https://github.com/enigmampc/catalyst
    
  • PyPi (📥 470 / month · ⏱️ 11.11.2018):

     pip install enigma-catalyst
    
PyAlgoTrade (🥉22 · ⭐ 2.7K · 💀) - Python Algorithmic Trading Library. Apache-2
  • GitHub (👨‍💻 11 · 🔀 1.1K · 📦 47 · 📋 110 - 29% open · ⏱️ 21.08.2018):

     git clone https://github.com/gbeced/pyalgotrade
    
  • PyPi (📥 740 / month · 📦 20 · ⏱️ 21.08.2018):

     pip install pyalgotrade
    
arch (🥉21 · ⭐ 440) - ARCH models in Python. ❗️NCSA
  • GitHub (👨‍💻 20 · 🔀 120 · 📦 94 · 📋 110 - 11% open · ⏱️ 06.01.2020):

     git clone https://github.com/bashtage/arch
    
  • PyPi (📥 8.8K / month · 📦 21 · ⏱️ 22.11.2019):

     pip install arch
    
FinTA (🥉20 · ⭐ 370) - Common financial technical indicators implemented in Pandas. ❗️LGPL-3.0
  • GitHub (👨‍💻 11 · 🔀 120 · 📦 12 · 📋 36 - 22% open · ⏱️ 19.01.2020):

     git clone https://github.com/peerchemist/finta
    
  • PyPi (📥 2.8K / month · 📦 2 · ⏱️ 13.01.2020):

     pip install finta
    
Backtesting.py (🥉15 · ⭐ 260) - Backtest trading strategies in Python. ❗️AGPL-3.0
  • GitHub (👨‍💻 3 · 🔀 80 · 📋 32 - 37% open · ⏱️ 03.01.2020):

     git clone https://github.com/kernc/backtesting.py
    
  • PyPi (📥 1.1K / month · ⏱️ 22.07.2017):

     pip install backtesting
    

Time Series Data

Back to top

Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.

Prophet (🥇28 · ⭐ 10K) - Tool for producing high quality forecasts for time series data that has multiple.. MIT
  • GitHub (👨‍💻 78 · 🔀 2.6K · 📥 310 · 📋 1.1K - 13% open · ⏱️ 06.11.2019):

     git clone https://github.com/facebook/prophet
    
  • PyPi (📥 440K / month · 📦 75 · ⏱️ 15.05.2019):

     pip install fbprophet
    
tsfresh (🥇26 · ⭐ 4.5K) - Automatic extraction of relevant features from time series. MIT
  • GitHub (👨‍💻 56 · 🔀 660 · 📋 340 - 12% open · ⏱️ 16.01.2020):

     git clone https://github.com/blue-yonder/tsfresh
    
  • PyPi (📥 120K / month · 📦 31 · ⏱️ 23.11.2019):

     pip install tsfresh
    
tslearn (🥈23 · ⭐ 840) - A machine learning toolkit dedicated to time-series data. BSD-2
  • GitHub (👨‍💻 15 · 🔀 150 · 📦 76 · 📋 130 - 22% open · ⏱️ 11.10.2019):

     git clone https://github.com/rtavenar/tslearn
    
  • PyPi (📥 24K / month · 📦 6 · ⏱️ 02.09.2019):

     pip install tslearn
    
Streamz (🥈23 · ⭐ 530) - Real-time stream processing for python. BSD-3
  • GitHub (👨‍💻 29 · 🔀 84 · 📦 86 · 📋 150 - 44% open · ⏱️ 18.01.2020):

     git clone https://github.com/python-streamz/streamz
    
  • PyPi (📥 3.3K / month · 📦 7 · ⏱️ 14.08.2019):

     pip install streamz
    
GluonTS (🥈21 · ⭐ 900) - Probabilistic time series modeling in Python. Apache-2
  • GitHub (👨‍💻 39 · 🔀 180 · 📋 210 - 32% open · ⏱️ 22.01.2020):

     git clone https://github.com/awslabs/gluon-ts
    
  • PyPi (📥 4.2K / month · ⏱️ 26.11.2019):

     pip install gluonts
    
sktime (🥉19 · ⭐ 470) - A scikit-learn compatible Python toolbox for machine learning with time series. BSD-3
  • GitHub (👨‍💻 18 · 🔀 86 · 📦 9 · 📋 110 - 40% open · ⏱️ 21.01.2020):

     git clone https://github.com/alan-turing-institute/sktime
    
  • PyPi (📥 500 / month · ⏱️ 24.10.2019):

     pip install sktime
    
seglearn (🥉19 · ⭐ 290) - Python module for machine learning time series. BSD-3
  • GitHub (👨‍💻 13 · 🔀 40 · 📦 3 · 📋 12 - 41% open · ⏱️ 31.12.2019):

     git clone https://github.com/dmbee/seglearn
    
  • PyPi (📥 320 / month · 📦 1 · ⏱️ 10.11.2019):

     pip install seglearn
    
tick (🥉19 · ⭐ 260) - Module for statistical learning, with a particular emphasis on time-dependent modelling. BSD-3
  • GitHub (👨‍💻 14 · 🔀 56 · 📥 59 · 📦 15 · 📋 180 - 19% open · ⏱️ 26.11.2019):

     git clone https://github.com/X-DataInitiative/tick
    
  • PyPi (📥 210 / month · ⏱️ 11.09.2019):

     pip install tick
    
STUMPY (🥉18 · ⭐ 1K) - STUMPY is a powerful and scalable Python library that can be used for a variety of.. BSD-3
  • GitHub (👨‍💻 11 · 🔀 84 · 📋 90 - 25% open · ⏱️ 22.01.2020):

     git clone https://github.com/TDAmeritrade/stumpy
    
  • PyPi (📥 990 / month · ⏱️ 30.12.2019):

     pip install stumpy
    
ADTK (🥉17 · ⭐ 360 · 🐣) - A Python toolkit for rule-based/unsupervised anomaly detection in time series. MPL-2.0
  • GitHub (👨‍💻 7 · 🔀 25 · 📋 21 - 47% open · ⏱️ 18.01.2020):

     git clone https://github.com/arundo/adtk
    
  • PyPi (📥 1.9K / month · ⏱️ 14.01.2020):

     pip install adtk
    
Show 2 hidden projects...
PyFlux (🥈21 · ⭐ 1.7K · 💀) - Open source time series library for Python. BSD-3
  • GitHub (👨‍💻 6 · 🔀 180 · 📦 99 · 📋 140 - 53% open · ⏱️ 16.12.2018):

     git clone https://github.com/RJT1990/pyflux
    
  • PyPi (📥 8K / month · 📦 15 · ⏱️ 21.11.2017):

     pip install pyflux
    
luminol (🥈21 · ⭐ 700 · 💀) - Anomaly Detection and Correlation library. Apache-2
  • GitHub (👨‍💻 8 · 🔀 150 · 📦 24 · 📋 36 - 69% open · ⏱️ 09.01.2018):

     git clone https://github.com/linkedin/luminol
    
  • PyPi (📥 310K / month · 📦 12 · ⏱️ 11.12.2017):

     pip install luminol
    

Medical Data

Back to top

Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.

MNE (🥇30 · ⭐ 1.2K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
  • GitHub (👨‍💻 210 · 🔀 680 · 📦 390 · 📋 2.8K - 9% open · ⏱️ 23.01.2020):

     git clone https://github.com/mne-tools/mne-python
    
  • PyPi (📥 12K / month · 📦 190 · ⏱️ 18.11.2019):

     pip install mne
    
Lifelines (🥇29 · ⭐ 1.2K) - Survival analysis in Python. MIT
  • GitHub (👨‍💻 72 · 🔀 330 · 📦 220 · 📋 600 - 22% open · ⏱️ 21.01.2020):

     git clone https://github.com/CamDavidsonPilon/lifelines
    
  • PyPi (📥 110K / month · 📦 130 · ⏱️ 21.01.2020):

     pip install lifelines
    
NIPYPE (🥇29 · ⭐ 480) - Workflows and interfaces for neuroimaging packages. Apache-2
  • GitHub (👨‍💻 210 · 🔀 390 · 📦 280 · 📋 1.1K - 25% open · ⏱️ 16.01.2020):

     git clone https://github.com/nipy/nipype
    
  • PyPi (📥 8.1K / month · 📦 190 · ⏱️ 20.12.2019):

     pip install nipype
    
DIPY (🥈28 · ⭐ 300) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic methods for.. BSD-3
  • GitHub (👨‍💻 100 · 🔀 270 · 📦 180 · 📋 640 - 21% open · ⏱️ 21.01.2020):

     git clone https://github.com/dipy/dipy
    
  • PyPi (📥 9.3K / month · 📦 94 · ⏱️ 11.01.2020):

     pip install dipy
    
  • Conda:

     conda install -c conda-forge dipy
    
NiBabel (🥈28 · ⭐ 300) - Python package to access a cacophony of neuro-imaging file formats. MIT
  • GitHub (👨‍💻 81 · 🔀 180 · 📦 1.8K · 📋 340 - 28% open · ⏱️ 15.01.2020):

     git clone https://github.com/nipy/nibabel
    
  • PyPi (📥 50K / month · 📦 1.3K · ⏱️ 18.12.2019):

     pip install nibabel
    
Nilearn (🥈27 · ⭐ 580) - Machine learning for NeuroImaging in Python. BSD-3
  • GitHub (👨‍💻 120 · 🔀 320 · 📦 480 · 📋 1.2K - 28% open · ⏱️ 22.01.2020):

     git clone https://github.com/nilearn/nilearn
    
  • PyPi (📥 16K / month · 📦 300 · ⏱️ 22.01.2020):

     pip install nilearn
    
NiftyNet (🥈24 · ⭐ 1.1K) - An open-source convolutional neural networks platform for research in.. Apache-2
  • GitHub (👨‍💻 56 · 🔀 360 · 📦 16 · 📋 310 - 29% open · ⏱️ 17.01.2020):

     git clone https://github.com/NifTK/NiftyNet
    
  • PyPi (📥 310 / month · ⏱️ 10.10.2019):

     pip install niftynet
    
Hail (🥉22 · ⭐ 590) - Scalable genomic data analysis. MIT
  • GitHub (👨‍💻 62 · 🔀 150 · 📋 1.8K - 15% open · ⏱️ 23.01.2020):

     git clone https://github.com/hail-is/hail
    
  • PyPi (📥 1.7K / month · ⏱️ 22.01.2020):

     pip install hail
    
DeepVariant (🥉20 · ⭐ 1.9K) - DeepVariant is an analysis pipeline that uses a deep neural network to call.. BSD-3
  • GitHub (👨‍💻 19 · 🔀 450 · 📥 2.1K · 📋 240 - 2% open · ⏱️ 21.11.2019):

     git clone https://github.com/google/deepvariant
    
  • Conda:

     conda install -c bioconda deepvariant
    
Brainiak (🥉19 · ⭐ 160) - Brain Imaging Analysis Kit. Apache-2
  • GitHub (👨‍💻 27 · 🔀 87 · 📦 10 · 📋 170 - 36% open · ⏱️ 11.12.2019):

     git clone https://github.com/brainiak/brainiak
    
  • PyPi (📥 130 / month · 📦 1 · ⏱️ 27.08.2019):

     pip install brainiak
    
  • Dockerhub (📥 280 · ⭐ 1 · ⏱️ 27.08.2019):

     docker pull brainiak/brainiak
    
DLTK (🥉18 · ⭐ 1.1K · 💤) - Deep Learning Toolkit for Medical Image Analysis. Apache-2
  • GitHub (👨‍💻 9 · 🔀 320 · 📦 7 · 📋 30 - 23% open · ⏱️ 21.01.2019):

     git clone https://github.com/DLTK/DLTK
    
  • PyPi (📥 210 / month · ⏱️ 26.02.2018):

     pip install dltk
    
MedicalTorch (🥉15 · ⭐ 580) - A medical imaging framework for Pytorch. Apache-2
  • GitHub (👨‍💻 7 · 🔀 83 · 📦 6 · 📋 20 - 55% open · ⏱️ 15.08.2019):

     git clone https://github.com/perone/medicaltorch
    
  • PyPi (📥 130 / month · ⏱️ 24.11.2018):

     pip install medicaltorch
    
Medical Detection Toolkit (🥉13 · ⭐ 650) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 170 · 📋 90 - 18% open · ⏱️ 19.12.2019):

     git clone https://github.com/pfjaeger/medicaldetectiontoolkit
    
MedicalNet (🥉12 · ⭐ 760 · 🐣) - Many studies have shown that the performance on deep learning is significantly.. MIT
  • GitHub (👨‍💻 1 · 🔀 200 · 📋 41 - 68% open · ⏱️ 07.01.2020):

     git clone https://github.com/Tencent/MedicalNet
    
Show 4 hidden projects...
NIPY (🥉20 · ⭐ 260 · 💤) - Neuroimaging in Python FMRI analysis package. ❗️DSDP
  • GitHub (👨‍💻 63 · 🔀 100 · 📋 130 - 23% open · ⏱️ 10.01.2019):

     git clone https://github.com/nipy/nipy
    
  • PyPi (📥 1.8K / month · 📦 29 · ⏱️ 19.02.2018):

     pip install nipy
    
MedPy (🥉19 · ⭐ 230 · 💤) - Medical image processing in Python. ❗️GPL-3.0
  • GitHub (👨‍💻 13 · 🔀 89 · 📦 96 · 📋 69 - 10% open · ⏱️ 26.02.2019):

     git clone https://github.com/loli/medpy
    
  • PyPi (📥 2.1K / month · 📦 7 · ⏱️ 14.02.2019):

     pip install MedPy
    
DeepNeuro (🥉15 · ⭐ 83) - A deep learning python package for neuroimaging data. MIT
  • GitHub (👨‍💻 6 · 🔀 27 · 📦 1 · 📋 40 - 60% open · ⏱️ 30.11.2019):

     git clone https://github.com/QTIM-Lab/DeepNeuro
    
  • PyPi (📥 66 / month · ⏱️ 11.04.2019):

     pip install deepneuro
    
Glow (🥉15 · ⭐ 69 · 🐣) - An open-source toolkit for large-scale genomic analysis. Apache-2
  • GitHub (👨‍💻 7 · 🔀 28 · 📋 26 - 38% open · ⏱️ 22.01.2020):

     git clone https://github.com/projectglow/glow
    
  • PyPi (📥 130 / month · ⏱️ 09.12.2019):

     pip install glow.py
    

Data Containers & Structures

Back to top

General-purpose data containers & structures as well as utilities & extensions for pandas.

pandas (🥇41 · ⭐ 23K) - Flexible and powerful data analysis / manipulation library for Python, providing.. BSD-3
  • GitHub (👨‍💻 2.1K · 🔀 9.1K · 📥 26K · 📦 180K · 📋 17K - 19% open · ⏱️ 23.01.2020):

     git clone https://github.com/pandas-dev/pandas
    
  • PyPi (📥 16M / month · 📦 77K · ⏱️ 01.11.2019):

     pip install pandas
    
  • Conda (⏱️ 16.11.2019):

     conda install -c anaconda pandas
    
numpy (🥇41 · ⭐ 13K) - The fundamental package for scientific computing with Python. BSD-3
  • GitHub (👨‍💻 1K · 🔀 4.2K · 📥 160K · 📦 320K · 📋 8.3K - 24% open · ⏱️ 23.01.2020):

     git clone https://github.com/numpy/numpy
    
  • PyPi (📥 46M / month · 📦 170K · ⏱️ 06.01.2020):

     pip install numpy
    
  • Conda (⏱️ 16.01.2020):

     conda install -c anaconda numpy
    
h5py (🥇34 · ⭐ 1.3K) - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data.. BSD-3
  • GitHub (👨‍💻 140 · 🔀 340 · 📥 180 · 📦 54K · 📋 960 - 18% open · ⏱️ 20.01.2020):

     git clone https://github.com/h5py/h5py
    
  • PyPi (📥 4.9M / month · 📦 18K · ⏱️ 06.09.2019):

     pip install h5py
    
Arrow (🥈32 · ⭐ 5K) - Apache Arrow is a cross-language development platform for in-memory data. It.. Apache-2
  • GitHub (👨‍💻 420 · 🔀 1.3K · 📦 5 · 📋 510 - 15% open · ⏱️ 23.01.2020):

     git clone https://github.com/apache/arrow
    
  • PyPi (📥 4.9M / month · 📦 800 · ⏱️ 01.11.2019):

     pip install pyarrow
    
xarray (🥈31 · ⭐ 1.5K) - N-D labeled arrays and datasets in Python. Apache-2
  • GitHub (👨‍💻 220 · 🔀 500 · 📋 2.1K - 30% open · ⏱️ 22.01.2020):

     git clone https://github.com/pydata/xarray
    
  • PyPi (📥 140K / month · 📦 1.1K · ⏱️ 19.11.2019):

     pip install xarray
    
numexpr (🥈30 · ⭐ 1.2K) - Fast numerical array expression evaluator for Python, NumPy, PyTables, pandas, bcolz.. MIT
  • GitHub (👨‍💻 51 · 🔀 140 · 📋 290 - 17% open · ⏱️ 15.01.2020):

     git clone https://github.com/pydata/numexpr
    
  • PyPi (📥 630K / month · 📦 5.5K · ⏱️ 14.08.2019):

     pip install numexpr
    
TinyDB (🥈29 · ⭐ 3.3K) - TinyDB is a lightweight document oriented database optimized for your happiness :). MIT
  • GitHub (👨‍💻 55 · 🔀 300 · 📋 190 - 2% open · ⏱️ 02.01.2020):

     git clone https://github.com/msiemens/tinydb
    
  • PyPi (📥 90K / month · 📦 1.1K · ⏱️ 18.12.2019):

     pip install tinydb
    
Blaze (🥈29 · ⭐ 2.8K) - NumPy and Pandas interface to Big Data. BSD-3
  • GitHub (👨‍💻 63 · 🔀 360 · 📦 6.1K · 📋 760 - 33% open · ⏱️ 15.08.2019):

     git clone https://github.com/blaze/blaze
    
  • PyPi (📥 51K / month · 📦 490 · ⏱️ 06.05.2016):

     pip install blaze
    
PyTables (🥈29 · ⭐ 910) - A Python package to manage extremely large amounts of data. BSD-2
  • GitHub (👨‍💻 87 · 🔀 180 · 📥 110 · 📋 570 - 25% open · ⏱️ 21.01.2020):

     git clone https://github.com/PyTables/PyTables
    
  • PyPi (📥 290K / month · 📦 3.2K · ⏱️ 16.10.2019):

     pip install tables
    
Bottleneck (🥈28 · ⭐ 440) - Fast NumPy array functions written in C. BSD-2
  • GitHub (👨‍💻 19 · 🔀 57 · 📦 9K · 📋 180 - 6% open · ⏱️ 20.01.2020):

     git clone https://github.com/pydata/bottleneck
    
  • PyPi (📥 150K / month · 📦 2.8K · ⏱️ 19.11.2019):

     pip install Bottleneck
    
datasketch (🥈27 · ⭐ 1.1K) - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH.. MIT
  • GitHub (👨‍💻 13 · 🔀 160 · 📥 5 · 📦 150 · 📋 80 - 8% open · ⏱️ 26.11.2019):

     git clone https://github.com/ekzhu/datasketch
    
  • PyPi (📥 85K / month · 📦 84 · ⏱️ 26.11.2019):

     pip install datasketch
    
Modin (🥉26 · ⭐ 4.1K) - Modin: Speed up your Pandas workflows by changing a single line of code. Apache-2
  • GitHub (👨‍💻 33 · 🔀 270 · 📥 190K · 📦 80 · 📋 450 - 27% open · ⏱️ 22.01.2020):

     git clone https://github.com/modin-project/modin
    
  • PyPi (📥 13K / month · 📦 9 · ⏱️ 22.01.2020):

     pip install modin
    
Koalas (🥉26 · ⭐ 1.7K) - Koalas: pandas API on Apache Spark. Apache-2
  • GitHub (👨‍💻 37 · 🔀 200 · 📥 980 · 📦 18 · 📋 350 - 27% open · ⏱️ 23.01.2020):

     git clone https://github.com/databricks/koalas
    
  • PyPi (📥 400K / month · 📦 1 · ⏱️ 23.01.2020):

     pip install koalas
    
  • Conda:

     conda install -c conda-forge koalas
    
zarr (🥉26 · ⭐ 420) - An implementation of chunked, compressed, N-dimensional arrays for Python. MIT
  • GitHub (👨‍💻 25 · 🔀 63 · 📦 180 · 📋 320 - 37% open · ⏱️ 11.01.2020):

     git clone https://github.com/zarr-developers/zarr-python
    
  • PyPi (📥 15K / month · 📦 72 · ⏱️ 11.01.2020):

     pip install zarr
    
  • Conda (⏱️ 07.08.2019):

     conda install -c anaconda zarr
    
Vaex (🥉23 · ⭐ 2.7K) - Out-of-Core DataFrames for Python, visualize and explore big tabular data at a billion.. MIT
  • GitHub (👨‍💻 23 · 🔀 190 · 📥 130 · 📋 290 - 42% open · ⏱️ 22.01.2020):

     git clone https://github.com/vaexio/vaex
    
  • PyPi (📥 1.5K / month · 📦 2 · ⏱️ 21.01.2020):

     pip install vaex
    
  • Conda:

     conda install -c conda-forge vaex
    
datatable (🥉23 · ⭐ 750) - data.table for Python. MPL-2.0
  • GitHub (👨‍💻 23 · 🔀 65 · 📥 160 · 📦 27 · 📋 1K - 10% open · ⏱️ 23.01.2020):

     git clone https://github.com/h2oai/datatable
    
  • PyPi (📥 5.1K / month · 📦 3 · ⏱️ 24.12.2019):

     pip install datatable
    
swifter (🥉21 · ⭐ 980) - A package which efficiently applies any function to a pandas dataframe or series in.. MIT
  • GitHub (👨‍💻 11 · 🔀 44 · 📦 98 · 📋 64 - 9% open · ⏱️ 15.01.2020):

     git clone https://github.com/jmcarpenter2/swifter
    
  • PyPi (📥 24K / month · 📦 16 · ⏱️ 15.01.2020):

     pip install swifter
    
pickleDB (🥉21 · ⭐ 440) - pickleDB is an open source key-value store using Python's json module. BSD-3
  • GitHub (👨‍💻 11 · 🔀 85 · 📦 250 · 📋 53 - 30% open · ⏱️ 15.11.2019):

     git clone https://github.com/patx/pickledb
    
  • PyPi (📥 2.8K / month · 📦 100 · ⏱️ 13.01.2019):

     pip install pickledb
    
Pandas Summary (🥉21 · ⭐ 310) - An extension to pandas dataframes describe function. MIT
  • GitHub (👨‍💻 6 · 🔀 26 · 📦 370 · 📋 13 - 61% open · ⏱️ 24.08.2019):

     git clone https://github.com/mouradmourafiq/pandas-summary
    
  • PyPi (📥 13K / month · 📦 19 · ⏱️ 30.08.2016):

     pip install pandas-summary
    
Pandaral·lel (🥉20 · ⭐ 600) - A simple and efficient tool to parallelize Pandas operations on all.. BSD-3
  • GitHub (👨‍💻 9 · 🔀 54 · 📦 14 · 📋 62 - 37% open · ⏱️ 20.01.2020):

     git clone https://github.com/nalepae/pandarallel
    
  • PyPi (📥 6.4K / month · ⏱️ 20.01.2020):

     pip install pandarallel
    
Bounter (🥉17 · ⭐ 870) - Efficient Counter that uses a limited (bounded) amount of memory regardless of data.. MIT
  • GitHub (👨‍💻 8 · 🔀 44 · 📦 14 · 📋 17 - 52% open · ⏱️ 07.10.2019):

     git clone https://github.com/RaRe-Technologies/bounter
    
  • PyPi (📥 420 / month · 📦 1 · ⏱️ 17.01.2019):

     pip install bounter
    
fletcher (🥉16 · ⭐ 160) - Pandas ExtensionDType/Array backed by Apache Arrow. MIT
  • GitHub (👨‍💻 9 · 🔀 20 · 📦 1 · 📋 38 - 36% open · ⏱️ 23.01.2020):

     git clone https://github.com/xhochy/fletcher
    
  • PyPi (📥 74 / month · ⏱️ 01.09.2019):

     pip install fletcher
    
Show 4 hidden projects...
sklearn-pandas (🥉25 · ⭐ 2.1K · 💤) - Pandas integration with sklearn. ❗️Zlib
  • GitHub (👨‍💻 30 · 🔀 320 · 📦 710 · 📋 120 - 27% open · ⏱️ 31.01.2019):

     git clone https://github.com/scikit-learn-contrib/sklearn-pandas
    
  • PyPi (📥 120K / month · 📦 52 · ⏱️ 01.12.2018):

     pip install sklearn-pandas
    
Arctic (🥉24 · ⭐ 1.8K) - Arctic is a high performance datastore for numeric data. ❗️LGPL-2.1
  • GitHub (👨‍💻 68 · 🔀 380 · 📥 56 · 📦 66 · 📋 470 - 12% open · ⏱️ 28.12.2019):

     git clone https://github.com/man-group/arctic
    
  • PyPi (📥 2.7K / month · 📦 42 · ⏱️ 06.12.2019):

     pip install arctic
    
pandasql (🥉22 · ⭐ 810 · 💀) - sqldf for pandas. MIT
  • GitHub (👨‍💻 15 · 🔀 120 · 📦 270 · 📋 58 - 62% open · ⏱️ 01.02.2017):

     git clone https://github.com/yhat/pandasql
    
  • PyPi (📥 82K / month · 📦 35 · ⏱️ 20.04.2016):

     pip install pandasql
    
StaticFrame (🥉16 · ⭐ 140) - The StaticFrame library consists of the Series and Frame, immutable data.. MIT
  • GitHub (👨‍💻 8 · 🔀 17 · 📦 1 · 📋 140 - 15% open · ⏱️ 23.01.2020):

     git clone https://github.com/InvestmentSystems/static-frame
    
  • PyPi (📥 480 / month · ⏱️ 13.01.2020):

     pip install static-frame
    
  • Conda:

     conda install -c conda-forge static-frame
    

Data Loading & Extraction

Back to top

Libraries for loading, collecting, and extracting data from a variety of data sources and formats.

xmltodict (🥇31 · ⭐ 3.8K) - Python module that makes working with XML feel like you are working with JSON. MIT
  • GitHub (👨‍💻 40 · 🔀 350 · 📦 10K · 📋 170 - 25% open · ⏱️ 26.12.2019):

     git clone https://github.com/martinblech/xmltodict
    
  • PyPi (📥 3.2M / month · 📦 7.9K · ⏱️ 11.02.2019):

     pip install xmltodict
    
xlrd (🥇31 · ⭐ 1.7K) - Please use openpyxl where you can... BSD-3
  • GitHub (👨‍💻 49 · 🔀 360 · 📦 33K · ⏱️ 10.09.2019):

     git clone https://github.com/python-excel/xlrd
    
  • PyPi (📥 3.2M / month · 📦 9.7K · ⏱️ 15.12.2018):

     pip install xlrd
    
Tablib (🥈30 · ⭐ 3.6K) - Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c. MIT
  • GitHub (👨‍💻 110 · 🔀 540 · 📦 5.5K · 📋 220 - 16% open · ⏱️ 14.01.2020):

     git clone https://github.com/jazzband/tablib
    
  • PyPi (📥 310K / month · 📦 2.4K · ⏱️ 13.01.2020):

     pip install tablib
    
tensorflow-datasets (🥈30 · ⭐ 1.8K) - A collection of datasets ready to use with TensorFlow. Apache-2
  • GitHub (👨‍💻 96 · 🔀 440 · 📦 650 · 📋 480 - 54% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/datasets
    
  • PyPi (📥 470K / month · 📦 100 · ⏱️ 06.12.2019):

     pip install tensorflow-datasets
    
pandas-datareader (🥈30 · ⭐ 1.3K) - Extract data from a wide range of Internet sources into a pandas.. BSD-3
  • GitHub (👨‍💻 63 · 🔀 410 · 📦 3.6K · 📋 380 - 11% open · ⏱️ 08.01.2020):

     git clone https://github.com/pydata/pandas-datareader
    
  • PyPi (📥 70K / month · 📦 1.3K · ⏱️ 25.09.2019):

     pip install pandas-datareader
    
smart-open (🥈29 · ⭐ 1.5K) - Utils for streaming large files (S3, HDFS, gzip, bz2...). MIT
  • GitHub (👨‍💻 50 · 🔀 220 · 📋 200 - 20% open · ⏱️ 09.01.2020):

     git clone https://github.com/RaRe-Technologies/smart_open
    
  • PyPi (📥 1.7M / month · 📦 540 · ⏱️ 03.11.2019):

     pip install smart-open
    
PDFMiner (🥉28 · ⭐ 4.1K) - Python PDF Parser (Not actively maintained). Check out pdfminer.six. MIT
  • GitHub (👨‍💻 28 · 🔀 1.3K · 📦 1.2K · 📋 230 - 84% open · ⏱️ 18.01.2020):

     git clone https://github.com/euske/pdfminer
    
  • PyPi (📥 330K / month · 📦 1.5K · ⏱️ 25.11.2019):

     pip install pdfminer
    
csvkit (🥉28 · ⭐ 4.1K) - A suite of utilities for converting to and working with CSV, the king of tabular file.. MIT
  • GitHub (👨‍💻 90 · 🔀 500 · 📦 590 · 📋 760 - 6% open · ⏱️ 08.12.2019):

     git clone https://github.com/wireservice/csvkit
    
  • PyPi (📥 37K / month · 📦 640 · ⏱️ 16.03.2019):

     pip install csvkit
    
textract (🥉26 · ⭐ 2.7K) - extract text from any document. no muss. no fuss. MIT
  • GitHub (👨‍💻 29 · 🔀 370 · 📋 210 - 43% open · ⏱️ 14.11.2019):

     git clone https://github.com/deanmalmgren/textract
    
  • PyPi (📥 11K / month · 📦 150 · ⏱️ 26.08.2019):

     pip install textract
    
snorkel (🥉25 · ⭐ 3.6K) - A system for quickly generating training data with weak supervision. Apache-2
  • GitHub (👨‍💻 57 · 🔀 630 · 📥 320 · 📦 23 · 📋 880 - 3% open · ⏱️ 20.01.2020):

     git clone https://github.com/snorkel-team/snorkel
    
  • PyPi (📥 5.3K / month · 📦 3 · ⏱️ 12.11.2019):

     pip install snorkel
    
Intake (🥉25 · ⭐ 370) - Intake is a lightweight package for finding, investigating, loading and.. BSD-2
  • GitHub (👨‍💻 37 · 🔀 59 · 📦 120 · 📋 200 - 24% open · ⏱️ 30.12.2019):

     git clone https://github.com/intake/intake
    
  • PyPi (📥 2.2K / month · 📦 74 · ⏱️ 18.12.2019):

     pip install intake
    
Camelot (🥉22 · ⭐ 2.6K) - Camelot: PDF Table Extraction for Humans. MIT
  • GitHub (👨‍💻 23 · 🔀 240 · 📋 290 - 11% open · ⏱️ 15.10.2019):

     git clone https://github.com/atlanhq/camelot
    
  • PyPi (📥 15K / month · 📦 1 · ⏱️ 07.07.2019):

     pip install camelot-py
    
Show 2 hidden projects...
Singer (🥉23 · ⭐ 460 · 💤) - Standard for moving data between databases, web APIs, files, queues, and.. ❗️AGPL-3.0
  • GitHub (👨‍💻 23 · 🔀 50 · 📋 24 - 41% open · ⏱️ 27.06.2019):

     git clone https://github.com/singer-io/getting-started
    
  • PyPi (📥 24K / month · 📦 270 · ⏱️ 04.11.2019):

     pip install singer-python
    
OpenPyXL (🥉19) - A Python library to read/write Excel 2010 xlsx/xlsm files. MIT
  • PyPi (📥 1.9M / month · 📦 9.8K · ⏱️ 10.01.2020):

     pip install openpyxl
    

Web Scraping & Crawling

Back to top

Libraries for web scraping, crawling, downloading, and mining.

youtube-dl (🥇37 · ⭐ 61K) - Command-line program to download videos from YouTube.com and other video.. Unlicense
  • GitHub (👨‍💻 850 · 🔀 11K · 📥 45M · 📋 20K - 16% open · ⏱️ 19.01.2020):

     git clone https://github.com/ytdl-org/youtube-dl
    
  • PyPi (📥 930K / month · 📦 2.9K · ⏱️ 14.01.2020):

     pip install youtube_dl
    
Scrapy (🥇36 · ⭐ 36K) - Scrapy, a fast high-level web crawling & scraping framework for Python. BSD-3
  • GitHub (👨‍💻 400 · 🔀 8.2K · 📦 15K · 📋 2.4K - 30% open · ⏱️ 22.01.2020):

     git clone https://github.com/scrapy/scrapy
    
  • PyPi (📥 340K / month · 📦 7.9K · ⏱️ 29.10.2019):

     pip install scrapy
    
You-Get (🥈29 · ⭐ 30K) - Dumb downloader that scrapes the web. MIT
  • GitHub (👨‍💻 190 · 🔀 5.9K · 📥 300K · ⏱️ 21.01.2020):

     git clone https://github.com/soimort/you-get
    
  • PyPi (📥 34K / month · 📦 18 · ⏱️ 28.12.2019):

     pip install you-get
    
newspaper3k (🥈29 · ⭐ 9K · 💤) - News, full-text, and article metadata extraction in Python 3. MIT
  • GitHub (👨‍💻 100 · 🔀 1.5K · 📦 1.1K · 📋 560 - 59% open · ⏱️ 12.04.2019):

     git clone https://github.com/codelucas/newspaper
    
  • PyPi (📥 23K / month · 📦 440 · ⏱️ 28.09.2018):

     pip install newspaper3k
    
pyspider (🥈28 · ⭐ 14K) - A Powerful Spider (Web Crawler) System in Python. Apache-2
  • GitHub (👨‍💻 62 · 🔀 3.4K · 📦 220 · 📋 790 - 32% open · ⏱️ 14.11.2019):

     git clone https://github.com/binux/pyspider
    
  • PyPi (📥 3K / month · 📦 30 · ⏱️ 18.04.2018):

     pip install pyspider
    
Requests-HTML (🥈28 · ⭐ 11K) - Pythonic HTML Parsing for Humans. MIT
  • GitHub (👨‍💻 58 · 🔀 660 · 📦 1.7K · 📋 260 - 36% open · ⏱️ 25.07.2019):

     git clone https://github.com/psf/requests-html
    
  • PyPi (📥 70K / month · 📦 360 · ⏱️ 17.02.2019):

     pip install requests-html
    
Google Images Download (🥉27 · ⭐ 6.3K · 💤) - Python Script to download hundreds of images from 'Google.. MIT
  • GitHub (👨‍💻 38 · 🔀 1.4K · 📋 220 - 30% open · ⏱️ 21.05.2019):

     git clone https://github.com/hardikvasa/google-images-download
    
  • PyPi (📥 34K / month · 📦 210 · ⏱️ 14.05.2019):

     pip install google_images_download
    
MechanicalSoup (🥉27 · ⭐ 3.4K) - A Python library for automating interaction with websites. MIT
  • GitHub (👨‍💻 37 · 🔀 290 · 📥 31 · 📋 130 - 15% open · ⏱️ 10.01.2020):

     git clone https://github.com/MechanicalSoup/MechanicalSoup
    
  • PyPi (📥 25K / month · 📦 390 · ⏱️ 27.08.2019):

     pip install MechanicalSoup
    
Grab (🥉26 · ⭐ 1.9K) - Web Scraping Framework. MIT
  • GitHub (👨‍💻 62 · 🔀 220 · 📦 260 · 📋 210 - 11% open · ⏱️ 07.12.2019):

     git clone https://github.com/lorien/grab
    
  • PyPi (📥 3.7K / month · 📦 100 · ⏱️ 24.06.2018):

     pip install grab
    
micawber (🥉22 · ⭐ 470) - a small library for extracting rich content from urls. MIT
  • GitHub (👨‍💻 24 · 🔀 69 · 📦 370 · 📋 50 - 2% open · ⏱️ 12.10.2019):

     git clone https://github.com/coleifer/micawber
    
  • PyPi (📥 5.4K / month · 📦 360 · ⏱️ 30.06.2019):

     pip install micawber
    
beautifulsoup4 (🥉21) - Screen-scraping library. MIT
  • PyPi (📥 7.8M / month · 📦 62K · ⏱️ 24.12.2019):

     pip install beautifulsoup4
    
gazpacho (🥉15 · ⭐ 220 · 🐣) - gazpacho is a web scraping library. MIT
  • GitHub (👨‍💻 2 · 🔀 15 · 📦 8 · ⏱️ 25.11.2019):

     git clone https://github.com/maxhumber/gazpacho
    
  • PyPi (📥 440 / month · 📦 2 · ⏱️ 25.11.2019):

     pip install gazpacho
    
Transistor (🥉14 · ⭐ 200) - Transistor, a Python web scraping framework for intelligent use cases. MIT
  • GitHub (👨‍💻 2 · 🔀 15 · 📥 8 · 📋 3 - 33% open · ⏱️ 18.09.2019):

     git clone https://github.com/bomquote/transistor
    
  • PyPi (📥 38 / month · ⏱️ 03.12.2018):

     pip install transistor
    
lazynlp (🥉13 · ⭐ 1.7K) - Library to scrape and clean web pages to create massive datasets. MIT
  • GitHub (👨‍💻 4 · 🔀 250 · 📋 8 - 62% open · ⏱️ 07.10.2019):

     git clone https://github.com/chiphuyen/lazynlp
    
Show 3 hidden projects...
Pattern (🥈29 · ⭐ 7.3K · 💀) - Web mining module for Python, with tools for scraping, natural language.. BSD-3
  • GitHub (👨‍💻 29 · 🔀 1.4K · 📦 740 · 📋 180 - 60% open · ⏱️ 18.08.2018):

     git clone https://github.com/clips/pattern
    
  • PyPi (📥 330K / month · 📦 270 · ⏱️ 18.08.2018):

     pip install pattern
    
soupsieve (🥈28 · ⭐ 60) - A modern CSS selector implementation for BeautifulSoup. MIT
  • GitHub (👨‍💻 5 · 🔀 11 · 📦 23K · 📋 73 - 17% open · ⏱️ 20.01.2020):

     git clone https://github.com/facelessuser/soupsieve
    
  • PyPi (📥 6.4M / month · 📦 3.9K · ⏱️ 02.11.2019):

     pip install soupsieve
    
Photon (🥉20 · ⭐ 6.2K) - Incredibly fast crawler designed for OSINT. ❗️GPL-3.0
  • GitHub (👨‍💻 21 · 🔀 760 · 📋 78 - 17% open · ⏱️ 06.12.2019):

     git clone https://github.com/s0md3v/Photon
    
  • PyPi (📥 190 / month · 📦 5 · ⏱️ 25.01.2019):

     pip install photon
    

Data Pipelines & Streaming

Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.

Celery (🥇37 · ⭐ 14K) - Asynchronous task queue/job queue based on distributed message passing. BSD-3
  • GitHub (👨‍💻 950 · 🔀 3.6K · 📦 33K · 📋 4.1K - 9% open · ⏱️ 20.01.2020):

     git clone https://github.com/celery/celery
    
  • PyPi (📥 1.9M / month · 📦 28K · ⏱️ 16.12.2019):

     pip install celery
    
Airflow (🥇35 · ⭐ 15K) - Platform to programmatically author, schedule, and monitor workflows. Apache-2
  • GitHub (👨‍💻 1.3K · 🔀 5.8K · 📥 400 · 📦 720 · ⏱️ 23.01.2020):

     git clone https://github.com/apache/airflow
    
  • PyPi (📥 460K / month · 📦 290 · ⏱️ 24.12.2019):

     pip install apache-airflow
    
  • Conda:

     conda install -c conda-forge airflow
    
  • Dockerhub (📥 330K · ⭐ 79 · ⏱️ 23.01.2020):

     docker pull apache/airflow
    
luigi (🥇33 · ⭐ 13K) - Luigi is a Python module that helps you build complex pipelines of batch jobs. It.. Apache-2
  • GitHub (👨‍💻 530 · 🔀 2.1K · 📦 960 · 📋 830 - 6% open · ⏱️ 16.01.2020):

     git clone https://github.com/spotify/luigi
    
  • PyPi (📥 160K / month · 📦 680 · ⏱️ 02.01.2020):

     pip install luigi
    
  • Conda (⏱️ 17.12.2019):

     conda install -c anaconda luigi
    
Beam (🥈32 · ⭐ 3.7K) - Unified programming model to define and execute data processing pipelines,.. Apache-2
  • GitHub (👨‍💻 740 · 🔀 2.2K · ⏱️ 23.01.2020):

     git clone https://github.com/apache/beam
    
  • PyPi (📥 2.6M / month · 📦 190 · ⏱️ 06.01.2020):

     pip install apache-beam
    
mrjob (🥈30 · ⭐ 2.5K) - Run MapReduce jobs on Hadoop or Amazon Web Services. Apache-2
  • GitHub (👨‍💻 140 · 🔀 580 · 📦 400 · 📋 1.3K - 14% open · ⏱️ 27.12.2019):

     git clone https://github.com/Yelp/mrjob
    
  • PyPi (📥 170K / month · 📦 110 · ⏱️ 28.12.2019):

     pip install mrjob
    
faust (🥈28 · ⭐ 4.2K) - Python Stream Processing. BSD-3
  • GitHub (👨‍💻 76 · 🔀 300 · 📦 190 · 📋 290 - 34% open · ⏱️ 22.01.2020):

     git clone https://github.com/robinhood/faust
    
  • PyPi (📥 17K / month · 📦 28 · ⏱️ 22.01.2020):

     pip install faust
    
Kedro (🥈26 · ⭐ 1.9K) - A Python library that implements software engineering best-practice for data and.. Apache-2
  • GitHub (👨‍💻 42 · 🔀 180 · 📦 88 · 📋 140 - 34% open · ⏱️ 23.01.2020):

     git clone https://github.com/quantumblacklabs/kedro
    
  • PyPi (📥 22K / month · 📦 8 · ⏱️ 12.12.2019):

     pip install kedro
    
PyFunctional (🥈26 · ⭐ 1.6K) - Python library for creating data pipelines with chain functional programming. MIT
  • GitHub (👨‍💻 21 · 🔀 84 · 📦 170 · 📋 110 - 11% open · ⏱️ 11.11.2019):

     git clone https://github.com/EntilZha/PyFunctional
    
  • PyPi (📥 11K / month · 📦 66 · ⏱️ 26.10.2019):

     pip install pyfunctional
    
dbt (🥈26 · ⭐ 1.3K) - dbt (data build tool) enables data analysts and engineers to transform their data.. Apache-2
  • GitHub (👨‍💻 55 · 🔀 240 · 📦 33 · 📋 1.1K - 17% open · ⏱️ 20.01.2020):

     git clone https://github.com/fishtown-analytics/dbt
    
  • PyPi (📥 33K / month · 📦 9 · ⏱️ 16.01.2020):

     pip install dbt
    
TFX (🥈26 · ⭐ 860) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
  • GitHub (👨‍💻 47 · 🔀 220 · 📦 41 · 📋 360 - 59% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/tfx
    
  • PyPi (📥 490K / month · ⏱️ 24.10.2019):

     pip install tfx
    
Prefect (🥈25 · ⭐ 1.6K) - The Prefect Core workflow engine. Apache-2
  • GitHub (👨‍💻 42 · 🔀 120 · 📦 41 · 📋 700 - 12% open · ⏱️ 22.01.2020):

     git clone https://github.com/PrefectHQ/prefect
    
  • PyPi (📥 6K / month · 📦 2 · ⏱️ 15.01.2020):

     pip install prefect
    
  • Conda:

     conda install -c conda-forge prefect
    
Dagster (🥈25 · ⭐ 1.2K) - A Python library for building data applications: ETL, ML, Data Pipelines, and.. Apache-2
  • GitHub (👨‍💻 49 · 🔀 110 · 📦 64 · 📋 1.1K - 26% open · ⏱️ 23.01.2020):

     git clone https://github.com/dagster-io/dagster
    
  • PyPi (📥 6.4K / month · 📦 4 · ⏱️ 14.01.2020):

     pip install dagster
    
streamparse (🥉24 · ⭐ 1.4K) - Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 210 · 📦 41 · 📋 320 - 18% open · ⏱️ 16.12.2019):

     git clone https://github.com/Parsely/streamparse
    
  • PyPi (📥 1.5K / month · 📦 24 · ⏱️ 08.04.2019):

     pip install streamparse
    
Great Expectations (🥉24 · ⭐ 1.2K) - Always know what to expect from your data. Apache-2
  • GitHub (👨‍💻 51 · 🔀 120 · 📋 380 - 27% open · ⏱️ 21.01.2020):

     git clone https://github.com/great-expectations/great_expectations
    
  • PyPi (📥 53K / month · 📦 6 · ⏱️ 15.01.2020):

     pip install great_expectations
    
petl (🥉24 · ⭐ 700) - Python Extract Transform and Load Tables of Data. MIT
  • GitHub (👨‍💻 38 · 🔀 130 · 📦 170 · 📋 390 - 17% open · ⏱️ 06.08.2019):

     git clone https://github.com/petl-developers/petl
    
  • PyPi (📥 20K / month · 📦 100 · ⏱️ 25.07.2018):

     pip install petl
    
bonobo (🥉22 · ⭐ 1.2K) - Extract Transform Load for Python 3.5+. Apache-2
  • GitHub (👨‍💻 36 · 🔀 100 · 📦 80 · 📋 160 - 37% open · ⏱️ 20.07.2019):

     git clone https://github.com/python-bonobo/bonobo
    
  • PyPi (📥 4.5K / month · 📦 24 · ⏱️ 20.07.2019):

     pip install bonobo
    
Optimus (🥉22 · ⭐ 840) - Agile Data Science Workflows made easy with PySpark. Apache-2
  • GitHub (👨‍💻 20 · 🔀 170 · 📦 8 · 📋 200 - 28% open · ⏱️ 20.12.2019):

     git clone https://github.com/ironmussa/Optimus
    
  • PyPi (📥 3.7K / month · ⏱️ 30.05.2019):

     pip install optimuspyspark
    
Pypeline (🥉20 · ⭐ 1K) - Concurrent data pipelines made easy. MIT
  • GitHub (👨‍💻 4 · 🔀 53 · 📦 23 · 📋 22 - 59% open · ⏱️ 21.11.2019):

     git clone https://github.com/cgarciae/pypeln
    
  • PyPi (📥 4.8K / month · 📦 5 · ⏱️ 21.11.2019):

     pip install pypeln
    
TaskTiger (🥉20 · ⭐ 980) - Python task queue using Redis. MIT
  • GitHub (👨‍💻 18 · 🔀 55 · 📦 16 · 📋 60 - 53% open · ⏱️ 18.12.2019):

     git clone https://github.com/closeio/tasktiger
    
  • PyPi (📥 320 / month · 📦 7 · ⏱️ 29.10.2019):

     pip install tasktiger
    
pysparkling (🥉20 · ⭐ 210) - A pure Python implementation of Apache Spark's RDD and DStream interfaces. MIT
  • GitHub (👨‍💻 8 · 🔀 37 · 📦 31 · 📋 21 - 4% open · ⏱️ 23.01.2020):

     git clone https://github.com/svenkreiss/pysparkling
    
  • PyPi (📥 4.8K / month · 📦 5 · ⏱️ 13.07.2019):

     pip install pysparkling
    
Data Integration (🥉18 · ⭐ 1.4K) - A lightweight opinionated ETL framework, halfway between plain scripts and.. MIT
  • GitHub (👨‍💻 10 · 🔀 58 · 📦 4 · 📋 8 - 12% open · ⏱️ 28.12.2019):

     git clone https://github.com/mara/data-integration
    
  • PyPi (📥 51 / month · ⏱️ 07.07.2019):

     pip install data-integration
    
Databolt Flow (🥉18 · ⭐ 830) - Python library for building highly effective data science workflows. MIT
  • GitHub (👨‍💻 7 · 🔀 49 · 📦 5 · 📋 15 - 40% open · ⏱️ 02.01.2020):

     git clone https://github.com/d6t/d6tflow
    
  • PyPi (📥 210 / month · ⏱️ 02.01.2020):

     pip install d6tflow
    
pdpipe (🥉18 · ⭐ 460) - Easy pipelines for pandas DataFrames. MIT
  • GitHub (👨‍💻 5 · 🔀 16 · 📦 3 · 📋 10 - 50% open · ⏱️ 20.01.2020):

     git clone https://github.com/pdpipe/pdpipe
    
  • PyPi (📥 810 / month · ⏱️ 20.01.2020):

     pip install pdpipe
    
riko (🥉17 · ⭐ 1.5K · 💤) - A Python stream processing engine modeled after Yahoo! Pipes. MIT
  • GitHub (👨‍💻 17 · 🔀 59 · 📋 27 - 77% open · ⏱️ 05.02.2019):

     git clone https://github.com/nerevu/riko
    
  • PyPi (📥 93 / month · ⏱️ 13.09.2018):

     pip install riko
    
Botflow (🥉15 · ⭐ 1.1K · 💤) - Python Fast Dataflow programming framework for Data pipeline work (Web.. BSD-3
  • GitHub (👨‍💻 11 · 🔀 82 · 📦 1 · 📋 4 - 50% open · ⏱️ 23.05.2019):

     git clone https://github.com/kkyon/botflow
    
  • PyPi (📥 110 / month · ⏱️ 14.09.2018):

     pip install botflow
    
Show 1 hidden project...
BatchFlow (🥉18 · ⭐ 120) - BatchFlow helps you conveniently work with random or sequential batches of.. Apache-2
  • GitHub (👨‍💻 26 · 🔀 27 · 📋 72 - 38% open · ⏱️ 23.01.2020):

     git clone https://github.com/analysiscenter/batchflow
    
  • PyPi (📥 42 / month · ⏱️ 23.08.2019):

     pip install batchflow
    

Distributed Machine Learning

Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.

dask (🥇34 · ⭐ 6.2K) - Parallel computing with task scheduling. BSD-3
  • GitHub (👨‍💻 340 · 🔀 980 · 📦 13K · 📋 2.8K - 17% open · ⏱️ 22.01.2020):

     git clone https://github.com/dask/dask
    
  • PyPi (📥 1.8M / month · 📦 3.9K · ⏱️ 16.01.2020):

     pip install dask
    
Ray (🥇32 · ⭐ 10K) - A fast and simple framework for building and running distributed applications. Ray.. Apache-2
  • GitHub (👨‍💻 250 · 🔀 1.4K · 📦 450 · 📋 2.9K - 31% open · ⏱️ 23.01.2020):

     git clone https://github.com/ray-project/ray
    
  • PyPi (📥 1.9M / month · 📦 80 · ⏱️ 17.12.2019):

     pip install ray
    
horovod (🥈29 · ⭐ 8.5K) - Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. Apache-2
  • GitHub (👨‍💻 71 · 🔀 1.3K · 📦 130 · 📋 1.2K - 43% open · ⏱️ 21.01.2020):

     git clone https://github.com/horovod/horovod
    
  • PyPi (📥 80K / month · 📦 20 · ⏱️ 14.01.2020):

     pip install horovod
    
ipyparallel (🥈28 · ⭐ 1.6K) - Interactive Parallel Computing in Python. BSD-3
  • GitHub (👨‍💻 89 · 🔀 620 · 📦 880 · 📋 220 - 55% open · ⏱️ 10.10.2019):

     git clone https://github.com/ipython/ipyparallel
    
  • PyPi (📥 150K / month · 📦 430 · ⏱️ 13.05.2019):

     pip install ipyparallel
    
TensorFlowOnSpark (🥈25 · ⭐ 3.4K) - TensorFlowOnSpark brings TensorFlow programs to Apache Spark.. Apache-2
  • GitHub (👨‍💻 28 · 🔀 880 · 📦 21 · 📋 320 - 1% open · ⏱️ 22.01.2020):

     git clone https://github.com/yahoo/TensorFlowOnSpark
    
  • PyPi (📥 59K / month · 📦 5 · ⏱️ 21.01.2020):

     pip install tensorflowonspark
    
Mesh (🥈25 · ⭐ 390) - Mesh TensorFlow: Model Parallelism Made Easier. Apache-2
  • GitHub (👨‍💻 27 · 🔀 60 · 📦 210 · 📋 34 - 76% open · ⏱️ 17.01.2020):

     git clone https://github.com/tensorflow/mesh
    
  • PyPi (📥 22K / month · 📦 28 · ⏱️ 08.01.2020):

     pip install mesh-tensorflow
    
BigDL (🥉24 · ⭐ 3.2K) - BigDL: Distributed Deep Learning Library for Apache Spark. Apache-2
  • GitHub (👨‍💻 69 · 🔀 820 · 📦 9 · 📋 860 - 18% open · ⏱️ 08.01.2020):

     git clone https://github.com/intel-analytics/BigDL
    
  • PyPi (📥 820 / month · 📦 2 · ⏱️ 04.11.2019):

     pip install bigdl
    
  • Maven:

     <dependency>
         <groupId>com.intel.analytics.bigdl</groupId>
         <artifactId>bigdl-SPARK_2.4</artifactId>
         <version>[VERSION]</version>
     </dependency>
    
Elephas (🥉24 · ⭐ 1.3K) - Distributed Deep learning with Keras & Spark. MIT
  • GitHub (👨‍💻 23 · 🔀 260 · 📦 19 · 📋 120 - 36% open · ⏱️ 03.01.2020):

     git clone https://github.com/maxpumperla/elephas
    
  • PyPi (📥 15K / month · 📦 2 · ⏱️ 20.03.2019):

     pip install elephas
    
dask-ml (🥉24 · ⭐ 580) - Scalable Machine Learning with Dask. BSD-3
  • GitHub (👨‍💻 46 · 🔀 140 · 📦 130 · 📋 290 - 48% open · ⏱️ 17.01.2020):

     git clone https://github.com/dask/dask-ml
    
  • PyPi (📥 27K / month · 📦 42 · ⏱️ 06.01.2020):

     pip install dask-ml
    
mpi4py (🥉24 · ⭐ 290) - Python bindings for MPI. BSD-3
  • GitHub (👨‍💻 11 · 🔀 45 · ⏱️ 21.01.2020):

     git clone https://github.com/mpi4py/mpi4py
    
  • PyPi (📥 58K / month · 📦 700 · ⏱️ 04.11.2019):

     pip install mpi4py
    
MMLSpark (🥉19 · ⭐ 1.8K) - Microsoft Machine Learning for Apache Spark. MIT
  • GitHub (👨‍💻 53 · 🔀 400 · 📋 300 - 42% open · ⏱️ 23.01.2020):

     git clone https://github.com/Azure/mmlspark
    
TensorFrames (🥉19 · ⭐ 770) - [DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark. Apache-2
  • GitHub (👨‍💻 16 · 🔀 160 · 📋 92 - 55% open · ⏱️ 15.11.2019):

     git clone https://github.com/databricks/tensorframes
    
  • PyPi (📥 3.2K / month · ⏱️ 16.05.2018):

     pip install tensorframes
    
sk-dist (🥉18 · ⭐ 200 · 🐣) - Distributed scikit-learn meta-estimators in PySpark. Apache-2
  • GitHub (👨‍💻 5 · 🔀 31 · 📦 1 · 📋 12 - 25% open · ⏱️ 15.01.2020):

     git clone https://github.com/Ibotta/sk-dist
    
  • PyPi (📥 4.1K / month · 📦 2 · ⏱️ 15.01.2020):

     pip install sk-dist
    
Apache Singa (🥉17 · ⭐ 1.9K) - Mirror of Apache Singa. Apache-2
  • GitHub (👨‍💻 62 · 🔀 480 · ⏱️ 18.01.2020):

     git clone https://github.com/apache/singa
    
  • Conda:

     conda install -c nusdbsystem singa
    
  • Dockerhub (📥 120 · ⭐ 2 · ⏱️ 04.06.2019):

     docker pull apache/singa
    
BytePS (🥉16 · ⭐ 2K) - A high performance and generic framework for distributed DNN training. Apache-2
  • GitHub (👨‍💻 12 · 🔀 280 · 📋 120 - 37% open · ⏱️ 21.01.2020):

     git clone https://github.com/bytedance/byteps
    
  • Dockerhub (📥 550 · ⏱️ 04.11.2019):

     docker pull bytepsimage/byteps_server
    
Show 2 hidden projects...
DEAP (🥈28 · ⭐ 3.4K) - Distributed Evolutionary Algorithms in Python. ❗️LGPL-3.0
  • GitHub (👨‍💻 60 · 🔀 710 · 📦 850 · 📋 330 - 42% open · ⏱️ 21.01.2020):

     git clone https://github.com/deap/deap
    
  • PyPi (📥 52K / month · 📦 480 · ⏱️ 21.01.2020):

     pip install deap
    
LazyCluster (🥉7 · ⭐ 16 · 🐣) - Distributed machine learning made simple. Apache-2
  • GitHub (👨‍💻 2 · 🔀 2 · ⏱️ 23.01.2020):

     git clone https://github.com/ml-tooling/lazycluster
    
  • PyPi (📥 79 / month · ⏱️ 09.08.2019):

     pip install lazycluster
    

Hyperparameter Optimization & AutoML

Libraries for hyperparameter optimization, AutoML, and neural architecture search.

Hyperopt (🥇29 · ⭐ 4.2K) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
  • GitHub (👨‍💻 63 · 🔀 730 · 📋 440 - 59% open · ⏱️ 12.01.2020):

     git clone https://github.com/hyperopt/hyperopt
    
  • PyPi (📥 610K / month · 📦 500 · ⏱️ 12.01.2020):

     pip install hyperopt
    
Optuna (🥇28 · ⭐ 1.9K) - A hyperparameter optimization framework. MIT
  • GitHub (👨‍💻 50 · 🔀 180 · 📦 200 · 📋 210 - 25% open · ⏱️ 22.01.2020):

     git clone https://github.com/optuna/optuna
    
  • PyPi (📥 22K / month · 📦 52 · ⏱️ 14.01.2020):

     pip install optuna
    
featuretools (🥇27 · ⭐ 4.5K) - An open source python library for automated feature engineering. BSD-3
  • GitHub (👨‍💻 42 · 🔀 580 · 📦 250 · 📋 320 - 35% open · ⏱️ 07.01.2020):

     git clone https://github.com/FeatureLabs/featuretools
    
  • PyPi (📥 37K / month · 📦 70 · ⏱️ 29.12.2019):

     pip install featuretools
    
  • Conda:

     conda install -c conda-forge featuretools
    
scikit-optimize (🥇27 · ⭐ 1.6K) - Sequential model-based optimization with a `scipy.optimize` interface. BSD-3
  • GitHub (👨‍💻 55 · 🔀 310 · 📦 650 · 📋 440 - 35% open · ⏱️ 22.01.2020):

     git clone https://github.com/scikit-optimize/scikit-optimize
    
  • PyPi (📥 250K / month · 📦 47 · ⏱️ 25.03.2018):

     pip install scikit-optimize
    
Bayesian Optimization (🥈26 · ⭐ 3.8K) - A Python implementation of global optimization with gaussian processes. MIT
  • GitHub (👨‍💻 21 · 🔀 840 · 📦 270 · 📋 140 - 15% open · ⏱️ 13.11.2019):

     git clone https://github.com/fmfn/BayesianOptimization
    
  • PyPi (📥 40K / month · 📦 31 · ⏱️ 12.02.2019):

     pip install bayesian-optimization
    
AutoKeras (🥈25 · ⭐ 6.5K) - An AutoML system based on Keras. MIT
  • GitHub (👨‍💻 68 · 🔀 1K · 📦 62 · 📋 470 - 10% open · ⏱️ 21.01.2020):

     git clone https://github.com/keras-team/autokeras
    
  • PyPi (📥 3.3K / month · 📦 7 · ⏱️ 16.01.2020):

     pip install autokeras
    
auto-sklearn (🥈25 · ⭐ 4.2K) - Automated Machine Learning with scikit-learn. BSD-3
  • GitHub (👨‍💻 49 · 🔀 800 · 📦 75 · 📋 580 - 27% open · ⏱️ 22.01.2020):

     git clone https://github.com/automl/auto-sklearn
    
  • PyPi (📥 4K / month · 📦 15 · ⏱️ 03.01.2020):

     pip install auto-sklearn
    
Hyperas (🥈25 · ⭐ 1.9K) - Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization. MIT
  • GitHub (👨‍💻 21 · 🔀 280 · 📦 120 · 📋 230 - 34% open · ⏱️ 13.12.2019):

     git clone https://github.com/maxpumperla/hyperas
    
  • PyPi (📥 26K / month · 📦 16 · ⏱️ 28.02.2019):

     pip install hyperas
    
NNI (🥈24 · ⭐ 5.2K) - An open source AutoML toolkit for neural architecture search, model compression and.. MIT
  • GitHub (👨‍💻 67 · 🔀 650 · 📦 25 · 📋 600 - 18% open · ⏱️ 23.01.2020):

     git clone https://github.com/microsoft/nni
    
  • PyPi (📥 2.1K / month · 📦 8 · ⏱️ 31.12.2019):

     pip install nni
    
AdaNet (🥈24 · ⭐ 3K) - Fast and flexible AutoML with learning guarantees. Apache-2
  • GitHub (👨‍💻 25 · 🔀 440 · 📦 16 · 📋 95 - 56% open · ⏱️ 14.01.2020):

     git clone https://github.com/tensorflow/adanet
    
  • PyPi (📥 2.5K / month · ⏱️ 26.06.2019):

     pip install adanet
    
nevergrad (🥈24 · ⭐ 2.3K) - A Python toolbox for performing gradient-free optimization. MIT
  • GitHub (👨‍💻 25 · 🔀 200 · 📦 24 · 📋 100 - 42% open · ⏱️ 23.01.2020):

     git clone https://github.com/facebookresearch/nevergrad
    
  • PyPi (📥 2.6K / month · 📦 7 · ⏱️ 08.01.2020):

     pip install nevergrad
    
Talos (🥈24 · ⭐ 1.1K) - Hyperparameter Optimization for Keras Models. MIT
  • GitHub (👨‍💻 17 · 🔀 180 · 📦 66 · 📋 310 - 11% open · ⏱️ 06.11.2019):

     git clone https://github.com/autonomio/talos
    
  • PyPi (📥 1.8K / month · 📦 2 · ⏱️ 25.08.2019):

     pip install talos
    
BoTorch (🥈23 · ⭐ 1.4K) - Bayesian optimization in PyTorch. MIT
  • GitHub (👨‍💻 28 · 🔀 100 · 📦 21 · 📋 56 - 28% open · ⏱️ 21.01.2020):

     git clone https://github.com/pytorch/botorch
    
  • PyPi (📥 2.7K / month · ⏱️ 16.01.2020):

     pip install botorch
    
GPyOpt (🥈23 · ⭐ 560) - Gaussian Process Optimization using GPy. BSD-3
  • GitHub (👨‍💻 43 · 🔀 160 · 📦 97 · 📋 230 - 29% open · ⏱️ 02.12.2019):

     git clone https://github.com/SheffieldML/GPyOpt
    
  • PyPi (📥 5.9K / month · 📦 9 · ⏱️ 22.05.2018):

     pip install gpyopt
    
SMAC3 (🥈23 · ⭐ 460) - Sequential Model-based Algorithm Configuration. BSD-3
  • GitHub (👨‍💻 24 · 🔀 120 · 📋 280 - 18% open · ⏱️ 20.09.2019):

     git clone https://github.com/automl/SMAC3
    
  • PyPi (📥 4.7K / month · 📦 27 · ⏱️ 20.09.2019):

     pip install smac
    
Keras Tuner (🥉22 · ⭐ 1.7K) - Hyperparameter tuning for humans. Apache-2
  • GitHub (👨‍💻 20 · 🔀 150 · 📦 20 · 📋 100 - 19% open · ⏱️ 08.01.2020):

     git clone https://github.com/keras-team/keras-tuner
    
  • PyPi (📥 5.3K / month · 📦 2 · ⏱️ 07.01.2020):

     pip install keras-tuner
    
Ax (🥉22 · ⭐ 990) - Adaptive Experimentation Platform. MIT
  • GitHub (👨‍💻 53 · 🔀 86 · 📦 17 · 📋 84 - 15% open · ⏱️ 22.01.2020):

     git clone https://github.com/facebook/Ax
    
  • PyPi (📥 2.6K / month · 📦 2 · ⏱️ 18.01.2020):

     pip install ax-platform
    
Test Tube (🥉21 · ⭐ 530) - Python library to easily log experiments and parallelize hyperparameter search for.. MIT
  • GitHub (👨‍💻 15 · 🔀 46 · 📥 1 · 📋 36 - 47% open · ⏱️ 01.12.2019):

     git clone https://github.com/williamFalcon/test-tube
    
  • PyPi (📥 8.7K / month · 📦 4 · ⏱️ 12.12.2018):

     pip install test_tube
    
AutoGluon (🥉20 · ⭐ 1.8K · 🐣) - AutoGluon: AutoML Toolkit for Deep Learning. Apache-2
  • GitHub (👨‍💻 15 · 🔀 200 · 📋 120 - 38% open · ⏱️ 22.01.2020):

     git clone https://github.com/awslabs/autogluon
    
  • PyPi (📥 2.9K / month · ⏱️ 15.01.2020):

     pip install autogluon
    
Orion (🥉20 · ⭐ 160) - Asynchronous Distributed Hyperparameter Optimization. BSD-3
  • GitHub (👨‍💻 13 · 🔀 23 · 📦 11 · 📋 68 - 16% open · ⏱️ 09.10.2019):

     git clone https://github.com/Epistimio/orion
    
  • PyPi (📥 450 / month · 📦 1 · ⏱️ 09.10.2019):

     pip install orion
    
HpBandSter (🥉19 · ⭐ 320 · 💤) - a distributed Hyperband implementation on Steroids. BSD-3
  • GitHub (👨‍💻 11 · 🔀 70 · 📦 37 · 📋 64 - 50% open · ⏱️ 26.03.2019):

     git clone https://github.com/automl/HpBandSter
    
  • PyPi (📥 2.7K / month · ⏱️ 24.10.2018):

     pip install hpbandster
    
Advisor (🥉18 · ⭐ 1.1K) - Open-source implementation of Google Vizier for hyperparameter tuning. Apache-2
  • GitHub (👨‍💻 11 · 🔀 200 · 📋 32 - 59% open · ⏱️ 11.11.2019):

     git clone https://github.com/tobegit3hub/advisor
    
  • PyPi (📥 93 / month · ⏱️ 18.10.2018):

     pip install advisor
    
  • Dockerhub (📥 1.2K · ⏱️ 11.11.2019):

     docker pull tobegit3hub/advisor
    
HyperparameterHunter (🥉18 · ⭐ 570) - Easy hyperparameter optimization and automatic result saving across.. MIT
  • GitHub (👨‍💻 3 · 🔀 69 · 📥 89 · 📋 110 - 26% open · ⏱️ 08.10.2019):

     git clone https://github.com/HunterMcGushion/hyperparameter_hunter
    
  • PyPi (📥 330 / month · 📦 1 · ⏱️ 06.08.2019):

     pip install hyperparameter-hunter
    
Dragonfly (🥉18 · ⭐ 460) - An open source python library for scalable Bayesian optimisation. MIT
  • GitHub (👨‍💻 10 · 🔀 45 · 📋 26 - 42% open · ⏱️ 30.12.2019):

     git clone https://github.com/dragonfly/dragonfly
    
  • PyPi (📥 200 / month · 📦 1 · ⏱️ 29.08.2019):

     pip install dragonfly-opt
    
Auto Tune Models (🥉17 · ⭐ 470) - Auto Tune Models - A multi-tenant, multi-data system for automated machine.. MIT
  • GitHub (👨‍💻 16 · 🔀 110 · 📦 5 · 📋 86 - 18% open · ⏱️ 08.08.2019):

     git clone https://github.com/HDI-Project/ATM
    
  • PyPi (📥 150 / month · ⏱️ 24.06.2019):

     pip install atm
    
AlphaPy (🥉17 · ⭐ 360) - Automated Machine Learning [AutoML] with Python, scikit-learn, and Keras. Apache-2
  • GitHub (👨‍💻 2 · 🔀 86 · 📋 24 - 16% open · ⏱️ 02.01.2020):

     git clone https://github.com/ScottfreeLLC/AlphaPy
    
  • PyPi (📥 250 / month · ⏱️ 17.11.2019):

     pip install alphapy
    
automl-gs (🥉15 · ⭐ 1.6K · 💤) - Provide an input CSV and a target field to predict, generate a model + code to.. MIT
  • GitHub (👨‍💻 7 · 🔀 140 · 📥 8 · 📋 25 - 80% open · ⏱️ 05.04.2019):

     git clone https://github.com/minimaxir/automl-gs
    
  • PyPi (📥 88 / month · ⏱️ 05.04.2019):

     pip install automl_gs
    
ENAS (🥉13 · ⭐ 2.1K · 💤) - PyTorch implementation of Efficient Neural Architecture Search via Parameters.. Apache-2
  • GitHub (👨‍💻 5 · 🔀 390 · 📋 41 - 82% open · ⏱️ 18.06.2019):

     git clone https://github.com/carpedm20/ENAS-pytorch
    
Devol (🥉11 · ⭐ 890) - Genetic neural architecture search with Keras. MIT
  • GitHub (👨‍💻 17 · 🔀 100 · 📋 26 - 23% open · ⏱️ 19.08.2019):

     git clone https://github.com/joeddav/devol
    
Show 9 hidden projects...
TPOT (🥇28 · ⭐ 6.7K) - A Python Automated Machine Learning tool that optimizes machine learning.. ❗️LGPL-3.0
  • GitHub (👨‍💻 87 · 🔀 1.2K · 📦 240 · 📋 660 - 21% open · ⏱️ 03.01.2020):

     git clone https://github.com/EpistasisLab/tpot
    
  • PyPi (📥 21K / month · 📦 40 · ⏱️ 03.01.2020):

     pip install tpot
    
auto_ml (🥉20 · ⭐ 1.4K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production. MIT
  • GitHub (👨‍💻 13 · 🔀 270 · 📥 12 · 📋 390 - 44% open · ⏱️ 25.03.2018):

     git clone https://github.com/ClimbsRocks/auto_ml
    
  • PyPi (📥 4.3K / month · 📦 1 · ⏱️ 22.02.2018):

     pip install auto_ml
    
MLBox (🥉19 · ⭐ 990) - MLBox is a powerful Automated Machine Learning python library. ❗️BSD-1-Clause
  • GitHub (👨‍💻 8 · 🔀 210 · 📦 5 · 📋 60 - 6% open · ⏱️ 09.10.2019):

     git clone https://github.com/AxeldeRomblay/MLBox
    
  • PyPi (📥 560 / month · ⏱️ 27.06.2019):

     pip install mlbox
    
Sherpa (🥉19 · ⭐ 180) - Hyperparameter optimization that enables researchers to experiment, visualize,.. ❗️GPL-3.0
  • GitHub (👨‍💻 37 · 🔀 21 · 📦 3 · 📋 39 - 20% open · ⏱️ 05.12.2019):

     git clone https://github.com/sherpa-ai/sherpa
    
  • PyPi (📥 740 / month · 📦 1 · ⏱️ 23.11.2019):

     pip install parameter-sherpa
    
Hypermax (🥉17 · ⭐ 77) - Better, faster hyper-parameter optimization. BSD-3
  • GitHub (👨‍💻 8 · 🔀 10 · 📦 4 · ⏱️ 28.12.2019):

     git clone https://github.com/electricbrainio/hypermax
    
  • PyPi (📥 240 / month · ⏱️ 23.10.2019):

     pip install hypermax
    
Xcessiv (🥉16 · ⭐ 1.2K · 💀) - A web-based application for quick, scalable, and automated hyperparameter.. Apache-2
  • GitHub (👨‍💻 6 · 🔀 98 · 📦 1 · 📋 34 - 61% open · ⏱️ 21.08.2017):

     git clone https://github.com/reiinakano/xcessiv
    
  • PyPi (📥 59 / month · 📦 1 · ⏱️ 21.08.2017):

     pip install xcessiv
    
Parfit (🥉16 · ⭐ 200 · 💀) - A package for parallelizing the fit and flexibly scoring of sklearn machine.. MIT
  • GitHub (👨‍💻 2 · 🔀 25 · 📦 4 · 📋 9 - 44% open · ⏱️ 11.10.2018):

     git clone https://github.com/jmcarpenter2/parfit
    
  • PyPi (📥 6.7K / month · ⏱️ 11.10.2018):

     pip install parfit
    
Hypertunity (🥉13 · ⭐ 110) - A toolset for black-box hyperparameter optimisation. Apache-2
  • GitHub (👨‍💻 2 · 🔀 8 · ⏱️ 18.12.2019):

     git clone https://github.com/gdikov/hypertunity
    
  • PyPi (📥 74 / month · ⏱️ 10.11.2019):

     pip install hypertunity
    
Auptimizer (🥉12 · ⭐ 120 · 🐣) - An automatic ML model optimization tool. ❗️GPL-3.0
  • GitHub (👨‍💻 7 · 🔀 10 · ⏱️ 27.11.2019):

     git clone https://github.com/LGE-ARC-AdvancedAI/auptimizer
    
  • PyPi (📥 130 / month · ⏱️ 21.11.2019):

     pip install auptimizer
    

Reinforcement Learning

Libraries for building and evaluating reinforcement learning & agent-based systems.

OpenAI Gym (🥇34 · ⭐ 19K) - A toolkit for developing and comparing reinforcement learning algorithms. MIT
  • GitHub (👨‍💻 220 · 🔀 5.4K · 📦 9K · 📋 1K - 7% open · ⏱️ 14.12.2019):

     git clone https://github.com/openai/gym
    
  • PyPi (📥 170K / month · 📦 3.6K · ⏱️ 08.11.2019):

     pip install gym
    
TensorLayer (🥇27 · ⭐ 5.9K) - Deep Learning and Reinforcement Learning Library for Scientists. Apache-2
  • GitHub (👨‍💻 120 · 🔀 1.3K · 📥 740 · 📋 430 - 6% open · ⏱️ 19.01.2020):

     git clone https://github.com/tensorlayer/tensorlayer
    
  • PyPi (📥 3.8K / month · 📦 68 · ⏱️ 14.01.2020):

     pip install tensorlayer
    
keras-rl (🥈26 · ⭐ 4.4K) - Deep Reinforcement Learning for Keras. MIT
  • GitHub (👨‍💻 40 · 🔀 1.1K · 📦 280 · 📋 240 - 17% open · ⏱️ 11.11.2019):

     git clone https://github.com/keras-rl/keras-rl
    
  • PyPi (📥 3.2K / month · 📦 51 · ⏱️ 01.06.2018):

     pip install keras-rl
    
Dopamine (🥈25 · ⭐ 8.6K) - Dopamine is a research framework for fast prototyping of reinforcement.. Apache-2
  • GitHub (👨‍💻 13 · 🔀 1.1K · 📋 110 - 47% open · ⏱️ 19.12.2019):

     git clone https://github.com/google/dopamine
    
  • PyPi (📥 19K / month · 📦 31 · ⏱️ 19.12.2019):

     pip install dopamine-rl
    
TensorForce (🥈25 · ⭐ 2.6K) - Tensorforce: a TensorFlow library for applied reinforcement learning. Apache-2
  • GitHub (👨‍💻 66 · 🔀 440 · 📋 440 - 1% open · ⏱️ 19.01.2020):

     git clone https://github.com/tensorforce/tensorforce
    
  • PyPi (📥 1.8K / month · 📦 23 · ⏱️ 07.09.2019):

     pip install tensorforce
    
Stable Baselines (🥈25 · ⭐ 1.6K) - A fork of OpenAI Baselines, implementations of reinforcement learning.. MIT
  • GitHub (👨‍💻 91 · 🔀 320 · 📋 480 - 19% open · ⏱️ 22.01.2020):

     git clone https://github.com/hill-a/stable-baselines
    
  • PyPi (📥 8.5K / month · 📦 25 · ⏱️ 19.12.2019):

     pip install stable-baselines
    
TF-Agents (🥈25 · ⭐ 1.1K) - TF-Agents is a library for Reinforcement Learning in TensorFlow. Apache-2
  • GitHub (👨‍💻 47 · 🔀 260 · 📦 45 · 📋 240 - 37% open · ⏱️ 22.01.2020):

     git clone https://github.com/tensorflow/agents
    
  • PyPi (📥 1.8K / month · 📦 6 · ⏱️ 16.12.2019):

     pip install tf-agents
    
ViZDoom (🥉23 · ⭐ 990) - Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. MIT
  • GitHub (👨‍💻 34 · 🔀 250 · 📥 7.9K · 📦 69 · 📋 380 - 21% open · ⏱️ 06.11.2019):

     git clone https://github.com/mwydmuch/ViZDoom
    
  • PyPi (📥 890 / month · 📦 7 · ⏱️ 15.01.2019):

     pip install vizdoom
    
ChainerRL (🥉23 · ⭐ 790) - ChainerRL is a deep reinforcement learning library built on top of Chainer. MIT
  • GitHub (👨‍💻 29 · 🔀 180 · 📦 57 · 📋 200 - 36% open · ⏱️ 18.12.2019):

     git clone https://github.com/chainer/chainerrl
    
  • PyPi (📥 1K / month · 📦 3 · ⏱️ 28.02.2019):

     pip install chainerrl
    
TRFL (🥉21 · ⭐ 2.8K) - TensorFlow Reinforcement Learning. Apache-2
  • GitHub (👨‍💻 11 · 🔀 340 · 📦 11 · 📋 16 - 18% open · ⏱️ 15.01.2020):

     git clone https://github.com/deepmind/trfl
    
  • PyPi (📥 350 / month · 📦 2 · ⏱️ 16.01.2020):

     pip install trfl
    
Coach (🥉21 · ⭐ 1.6K) - Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with.. Apache-2
  • GitHub (👨‍💻 32 · 🔀 320 · 📋 240 - 27% open · ⏱️ 14.01.2020):

     git clone https://github.com/NervanaSystems/coach
    
  • PyPi (📥 160 / month · ⏱️ 10.10.2019):

     pip install rl_coach
    
garage (🥉20 · ⭐ 590) - A toolkit for reproducible reinforcement learning research. MIT
  • GitHub (👨‍💻 57 · 🔀 100 · 📦 7 · 📋 510 - 18% open · ⏱️ 23.01.2020):

     git clone https://github.com/rlworkgroup/garage
    
  • PyPi (📥 450 / month · ⏱️ 09.12.2019):

     pip install garage
    
TensorTrade (🥉19 · ⭐ 1.1K · 🐣) - An open source reinforcement learning framework for training,.. Apache-2
  • GitHub (👨‍💻 27 · 🔀 240 · 📦 3 · 📋 92 - 38% open · ⏱️ 22.01.2020):

     git clone https://github.com/tensortrade-org/tensortrade
    
  • PyPi (📥 520 / month · 📦 1 · ⏱️ 11.11.2019):

     pip install tensortrade
    
ReAgent (🥉16 · ⭐ 2.4K) - A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits,.. BSD-3
  • GitHub (👨‍💻 46 · 🔀 300 · 📋 69 - 23% open · ⏱️ 13.01.2020):

     git clone https://github.com/facebookresearch/ReAgent
    
PARL (🥉15 · ⭐ 570) - PARL: A high-performance distributed training framework for Reinforcement Learning. Apache-2
  • GitHub (👨‍💻 11 · 🔀 150 · 📦 4 · 📋 41 - 21% open · ⏱️ 15.01.2020):

     git clone https://github.com/PaddlePaddle/PARL
    
  • PyPi (📥 120 / month · ⏱️ 16.12.2019):

     pip install parl
    
Show 1 hidden project...
DeepMind Lab (🥉17 · ⭐ 6K) - A customisable 3D platform for agent-based AI research. ❗️GPL-2.0
  • GitHub (👨‍💻 7 · 🔀 1.2K · 📋 170 - 18% open · ⏱️ 07.01.2020):

     git clone https://github.com/deepmind/lab
    

Recommender Systems

Back to top

Libraries for building and evaluating recommendation systems.

scikit-surprise (🥇28 · ⭐ 3.7K) - A Python scikit for building and analyzing recommender systems. BSD-3
  • GitHub (👨‍💻 30 · 🔀 710 · 📦 360 · 📋 290 - 10% open · ⏱️ 17.01.2020):

     git clone https://github.com/NicolasHug/Surprise
    
  • PyPi (📥 76K / month · 📦 16 · ⏱️ 13.09.2019):

     pip install scikit-surprise
    
lightfm (🥈25 · ⭐ 2.9K) - A Python implementation of LightFM, a hybrid recommendation algorithm. Apache-2
  • GitHub (👨‍💻 34 · 🔀 480 · 📦 180 · 📋 340 - 26% open · ⏱️ 19.11.2019):

     git clone https://github.com/lyst/lightfm
    
  • PyPi (📥 35K / month · 📦 27 · ⏱️ 26.05.2018):

     pip install lightfm
    
implicit (🥈25 · ⭐ 1.8K) - Fast Python Collaborative Filtering for Implicit Feedback Datasets. MIT
  • GitHub (👨‍💻 19 · 🔀 380 · 📦 150 · 📋 240 - 22% open · ⏱️ 02.01.2020):

     git clone https://github.com/benfred/implicit
    
  • PyPi (📥 52K / month · 📦 16 · ⏱️ 23.10.2019):

     pip install implicit
    
TensorFlow Ranking (🥈21 · ⭐ 1.5K) - Learning to Rank in TensorFlow. Apache-2
  • GitHub (👨‍💻 16 · 🔀 260 · 📋 140 - 8% open · ⏱️ 17.01.2020):

     git clone https://github.com/tensorflow/ranking
    
  • PyPi (📥 14K / month · 📦 1 · ⏱️ 17.01.2020):

     pip install tensorflow_ranking
    
Recommenders (🥉19 · ⭐ 6.9K) - Best Practices on Recommendation Systems. MIT
  • GitHub (👨‍💻 56 · 🔀 940 · 📦 1 · 📋 430 - 19% open · ⏱️ 17.01.2020):

     git clone https://github.com/microsoft/recommenders
    
tensorrec (🥉19 · ⭐ 1K) - A TensorFlow recommendation algorithm and framework in Python. Apache-2
  • GitHub (👨‍💻 7 · 🔀 180 · 📦 14 · 📋 120 - 23% open · ⏱️ 30.09.2019):

     git clone https://github.com/jfkirk/tensorrec
    
  • PyPi (📥 610 / month · 📦 1 · ⏱️ 02.04.2019):

     pip install tensorrec
    
Spotlight (🥉17 · ⭐ 2K) - Deep recommender models using PyTorch. MIT
  • GitHub (👨‍💻 9 · 🔀 300 · 📋 100 - 56% open · ⏱️ 08.09.2019):

     git clone https://github.com/maciejkula/spotlight
    
  • Conda:

     conda install -c maciejkula spotlight
    
Case Recommender (🥉17 · ⭐ 220) - Case Recommender: A Flexible and Extensible Python Framework for.. MIT
  • GitHub (👨‍💻 9 · 🔀 47 · 📦 1 · 📋 22 - 31% open · ⏱️ 20.08.2019):

     git clone https://github.com/caserec/CaseRecommender
    
  • PyPi (📥 390 / month · ⏱️ 20.08.2019):

     pip install caserecommender
    
OpenRec (🥉16 · ⭐ 290) - OpenRec is an open-source and modular library for neural network-inspired.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 70 · 📦 1 · 📋 12 - 25% open · ⏱️ 11.09.2019):

     git clone https://github.com/ylongqi/openrec
    
  • PyPi (📥 71 / month · 📦 1 · ⏱️ 13.11.2018):

     pip install openrec
    
Show 1 hidden project...
recmetrics (🥉17 · ⭐ 120) - A library of metrics for evaluating recommender systems. MIT
  • GitHub (👨‍💻 7 · 🔀 25 · 📦 6 · 📋 8 - 75% open · ⏱️ 19.10.2019):

     git clone https://github.com/statisticianinstilettos/recmetrics
    
  • PyPi (📥 890 / month · ⏱️ 10.02.2019):

     pip install recmetrics
    

Privacy Machine Learning

Back to top

Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
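
To make the differential privacy idea concrete, the following is a minimal sketch of the Laplace mechanism applied to a counting query, written in plain Python. The function names, the `ages` data, and the chosen `epsilon` are hypothetical; the libraries listed below have their own APIs, which are not shown here.

```python
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponential draws."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(values, predicate, epsilon, rng):
    """Release a noisy count under epsilon-differential privacy.

    A counting query has sensitivity 1: adding or removing one record
    changes the true count by at most 1, so the noise scale is 1 / epsilon.
    """
    true_count = sum(1 for v in values if predicate(v))
    return true_count + laplace_noise(1.0 / epsilon, rng)


rng = random.Random(0)
ages = [23, 35, 41, 29, 52, 67, 38, 45]   # made-up example data
noisy = private_count(ages, lambda age: age >= 40, epsilon=0.5, rng=rng)
print(round(noisy, 2))
```

A smaller `epsilon` gives stronger privacy at the cost of a noisier answer. This sketch uses ordinary floating-point sampling and is meant for illustration only, not for production use.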

PySyft (🥇25 · ⭐ 4.6K) - A library for encrypted, privacy preserving machine learning. Apache-2
  • GitHub (👨‍💻 250 · 🔀 1K · 📦 26 · 📋 1.5K - 14% open · ⏱️ 22.01.2020):

     git clone https://github.com/OpenMined/PySyft
    
  • PyPi (📥 2.1K / month · 📦 3 · ⏱️ 21.01.2020):

     pip install syft
    
TensorFlow Privacy (🥈21 · ⭐ 990) - Library for training machine learning models with privacy for.. Apache-2
  • GitHub (👨‍💻 22 · 🔀 170 · 📥 7 · 📋 70 - 17% open · ⏱️ 15.01.2020):

     git clone https://github.com/tensorflow/privacy
    
  • PyPi (📥 2.6K / month · ⏱️ 31.10.2019):

     pip install tensorflow-privacy
    
TFEncrypted (🥈21 · ⭐ 590) - A Framework for Machine Learning on Encrypted Data. Apache-2
  • GitHub (👨‍💻 26 · 🔀 87 · 📦 20 · 📋 340 - 42% open · ⏱️ 21.01.2020):

     git clone https://github.com/tf-encrypted/tf-encrypted
    
  • PyPi (📥 590 / month · 📦 3 · ⏱️ 21.10.2019):

     pip install tf-encrypted
    
CrypTen (🥉11 · ⭐ 420 · 🐣) - A framework for Privacy Preserving Machine Learning. MIT
  • GitHub (👨‍💻 11 · 🔀 56 · 📋 9 - 33% open · ⏱️ 16.01.2020):

     git clone https://github.com/facebookresearch/CrypTen
    

Workflow & Experiment Tracking

Back to top

Libraries to organize, track, and visualize machine learning experiments.

TensorBoard (🥇36 · ⭐ 4.4K) - TensorFlow's Visualization Toolkit. Apache-2
  • GitHub (👨‍💻 210 · 🔀 1.1K · 📦 20K · 📋 1.2K - 39% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/tensorboard
    
  • PyPi (📥 5.4M / month · 📦 3.6K · ⏱️ 05.12.2019):

     pip install tensorboard
    
mlflow (🥇32 · ⭐ 5.8K) - Open source platform for the machine learning lifecycle. Apache-2
  • GitHub (👨‍💻 170 · 🔀 1.2K · 📦 450 · 📋 980 - 36% open · ⏱️ 22.01.2020):

     git clone https://github.com/mlflow/mlflow
    
  • PyPi (📥 1M / month · 📦 150 · ⏱️ 20.12.2019):

     pip install mlflow
    
tensorboardX (🥇29 · ⭐ 6K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
  • GitHub (👨‍💻 58 · 🔀 650 · 📥 190 · 📦 3.9K · 📋 370 - 16% open · ⏱️ 31.12.2019):

     git clone https://github.com/lanpa/tensorboardX
    
  • PyPi (📥 170K / month · 📦 1.3K · ⏱️ 31.12.2019):

     pip install tensorboardX
    
sacred (🥈28 · ⭐ 2.6K) - Sacred is a tool to help you configure, organize, log and reproduce experiments.. MIT
  • GitHub (👨‍💻 79 · 🔀 270 · 📦 360 · 📋 440 - 22% open · ⏱️ 16.12.2019):

     git clone https://github.com/IDSIA/sacred
    
  • PyPi (📥 14K / month · 📦 150 · ⏱️ 27.11.2019):

     pip install sacred
    
DVC (🥈27 · ⭐ 4.4K) - Data Version Control | Git for Data & Models. Apache-2
  • GitHub (👨‍💻 120 · 🔀 390 · 📥 15K · 📦 190 · 📋 1.7K - 20% open · ⏱️ 23.01.2020):

     git clone https://github.com/iterative/dvc
    
  • Conda:

     conda install -c conda-forge dvc
    
kaggle (🥈27 · ⭐ 3K) - Official Kaggle API. Apache-2
  • GitHub (👨‍💻 30 · 🔀 610 · 📦 2K · 📋 190 - 50% open · ⏱️ 19.09.2019):

     git clone https://github.com/Kaggle/kaggle-api
    
  • PyPi (📥 30K / month · 📦 400 · ⏱️ 19.09.2019):

     pip install kaggle
    
SageMaker SDK (🥈27 · ⭐ 960) - A library for training and deploying machine learning models on.. Apache-2
  • GitHub (👨‍💻 110 · 🔀 390 · 📦 190 · 📋 440 - 24% open · ⏱️ 20.01.2020):

     git clone https://github.com/aws/sagemaker-python-sdk
    
  • PyPi (📥 110K / month · 📦 28 · ⏱️ 20.01.2020):

     pip install sagemaker
    
AzureML SDK (🥈26 · ⭐ 1.3K) - Python notebooks with ML and deep learning examples with Azure Machine Learning.. MIT
  • GitHub (👨‍💻 43 · 🔀 830 · 📥 190 · 📋 530 - 41% open · ⏱️ 22.01.2020):

     git clone https://github.com/Azure/MachineLearningNotebooks
    
  • PyPi (📥 180K / month · 📦 30 · ⏱️ 21.01.2020):

     pip install azureml-sdk
    
Catalyst (🥈25 · ⭐ 1.4K) - Accelerated DL & RL. Apache-2
  • GitHub (👨‍💻 50 · 🔀 170 · 📦 76 · 📋 160 - 26% open · ⏱️ 22.01.2020):

     git clone https://github.com/catalyst-team/catalyst
    
  • PyPi (📥 6.8K / month · 📦 19 · ⏱️ 18.01.2020):

     pip install catalyst
    
TensorWatch (🥉22 · ⭐ 2.6K) - Debugging, monitoring and visualization for Python Machine Learning and Data.. MIT
  • GitHub (👨‍💻 10 · 🔀 250 · 📦 18 · 📋 47 - 72% open · ⏱️ 10.01.2020):

     git clone https://github.com/microsoft/tensorwatch
    
  • PyPi (📥 1.2K / month · 📦 4 · ⏱️ 14.11.2019):

     pip install tensorwatch
    
TRAINS (🥉22 · ⭐ 1.1K) - TRAINS - Auto-Magical Experiment Manager & Version Control for AI - NOW WITH.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 140 · 📥 30 · 📦 6 · 📋 76 - 35% open · ⏱️ 22.01.2020):

     git clone https://github.com/allegroai/trains
    
  • PyPi (📥 94K / month · ⏱️ 22.01.2020):

     pip install trains
    
  • Dockerhub (📥 26K · ⏱️ 07.01.2020):

     docker pull allegroai/trains
    
Metaflow (🥉21 · ⭐ 2.8K · 🐣) - Build and manage real-life data science projects with ease. Apache-2
  • GitHub (👨‍💻 8 · 🔀 180 · 📦 17 · 📋 79 - 55% open · ⏱️ 19.12.2019):

     git clone https://github.com/Netflix/metaflow
    
  • PyPi (📥 2.3K / month · ⏱️ 03.12.2019):

     pip install metaflow
    
TNT (🥉21 · ⭐ 1.1K) - Simple tools for logging and visualizing, loading and training. BSD-3
  • GitHub (👨‍💻 34 · 🔀 150 · 📦 230 · 📋 58 - 43% open · ⏱️ 15.01.2020):

     git clone https://github.com/pytorch/tnt
    
  • PyPi (📥 75 / month · 📦 1 · ⏱️ 01.12.2016):

     pip install tnt
    
knockknock (🥉20 · ⭐ 1.2K) - Knock Knock: Get notified when your training ends with only two additional lines.. MIT
  • GitHub (👨‍💻 13 · 🔀 92 · 📦 7 · 📋 19 - 26% open · ⏱️ 06.01.2020):

     git clone https://github.com/huggingface/knockknock
    
  • PyPi (📥 810 / month · 📦 3 · ⏱️ 03.12.2019):

     pip install knockknock
    
MXBoard (🥉20 · ⭐ 310) - Logging MXNet data for visualization in TensorBoard. Apache-2
  • GitHub (👨‍💻 8 · 🔀 46 · 📦 37 · 📋 29 - 51% open · ⏱️ 25.11.2019):

     git clone https://github.com/awslabs/mxboard
    
  • PyPi (📥 9.1K / month · 📦 1 · ⏱️ 22.05.2018):

     pip install mxboard
    
Guild AI (🥉20 · ⭐ 240) - Open source experiment tracking and optimization for machine learning. Apache-2
  • GitHub (👨‍💻 9 · 🔀 19 · 📦 6 · 📋 100 - 49% open · ⏱️ 20.01.2020):

     git clone https://github.com/guildai/guildai
    
  • PyPi (📥 1.3K / month · ⏱️ 31.12.2019):

     pip install guildai
    
VisualDL (🥉18 · ⭐ 2.4K) - A platform to visualize the deep learning process and result. Apache-2
  • GitHub (👨‍💻 17 · 🔀 360 · 📦 20 · 📋 250 - 25% open · ⏱️ 02.07.2019):

     git clone https://github.com/PaddlePaddle/VisualDL
    
  • PyPi (📥 350 / month · 📦 1 · ⏱️ 23.02.2019):

     pip install visualdl
    
TensorBoard Logger (🥉18 · ⭐ 600) - Log TensorBoard events without touching TensorFlow. MIT
  • GitHub (👨‍💻 5 · 🔀 48 · 📋 23 - 39% open · ⏱️ 21.10.2019):

     git clone https://github.com/TeamHG-Memex/tensorboard_logger
    
  • PyPi (📥 3.1K / month · 📦 42 · ⏱️ 08.02.2018):

     pip install tensorboard_logger
    
datmo (🥉18 · ⭐ 310) - Open source production model management tool for data scientists. MIT
  • GitHub (👨‍💻 5 · 🔀 25 · 📦 5 · 📋 190 - 21% open · ⏱️ 29.11.2019):

     git clone https://github.com/datmo/datmo
    
  • PyPi (📥 190 / month · ⏱️ 07.12.2018):

     pip install datmo
    
Studio.ml (🥉17 · ⭐ 340) - Studio: Simplify and expedite model building process. Apache-2
  • GitHub (👨‍💻 19 · 🔀 44 · 📦 2 · 📋 210 - 25% open · ⏱️ 02.08.2019):

     git clone https://github.com/studioml/studio
    
  • PyPi (📥 500 / month · ⏱️ 29.06.2019):

     pip install studioml
    
Show 3 hidden projects...
SKLL (🥉20 · ⭐ 500) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning.. ❗️BSD-1-Clause
  • GitHub (👨‍💻 32 · 🔀 61 · 📥 4 · 📦 27 · 📋 340 - 11% open · ⏱️ 30.10.2019):

     git clone https://github.com/EducationalTestingService/skll
    
  • PyPi (📥 280 / month · 📦 17 · ⏱️ 24.10.2019):

     pip install skll
    
steppy (🥉15 · ⭐ 110 · 💀) - Lightweight Python library for fast and reproducible experimentation. MIT
  • GitHub (👨‍💻 5 · 🔀 29 · 📦 30 · 📋 66 - 24% open · ⏱️ 23.11.2018):

     git clone https://github.com/neptune-ml/steppy
    
  • PyPi (📥 140 / month · 📦 3 · ⏱️ 23.11.2018):

     pip install steppy
    
ModelChimp (🥉14 · ⭐ 120) - Experiment tracking for machine and deep learning projects. BSD-2
  • GitHub (👨‍💻 2 · 🔀 11 · 📋 14 - 28% open · ⏱️ 06.12.2019):

     git clone https://github.com/ModelChimp/modelchimp
    
  • PyPi (📥 190 / month · ⏱️ 09.04.2019):

     pip install modelchimp
    
  • Dockerhub (📥 620 · ⏱️ 09.04.2019):

     docker pull modelchimp/modelchimp-server
    

Model Serialization & Conversion

Back to top

Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.

onnx (🥇32 · ⭐ 7.8K) - Open Neural Network Exchange. MIT
  • GitHub (👨‍💻 160 · 🔀 1.2K · 📥 1.3K · 📦 800 · 📋 1.1K - 39% open · ⏱️ 17.01.2020):

     git clone https://github.com/onnx/onnx
    
  • PyPi (📥 170K / month · 📦 150 · ⏱️ 28.09.2019):

     pip install onnx
    
Core ML Tools (🥈27 · ⭐ 1.6K) - Core ML Community Tools. BSD-3
  • GitHub (👨‍💻 65 · 🔀 230 · 📥 420 · 📦 260 · 📋 330 - 42% open · ⏱️ 23.01.2020):

     git clone https://github.com/apple/coremltools
    
  • PyPi (📥 64K / month · 📦 130 · ⏱️ 16.01.2020):

     pip install coremltools
    
mmdnn (🥈23 · ⭐ 4.4K) - MMdnn is a set of tools to help users inter-operate among different deep learning.. MIT
  • GitHub (👨‍💻 81 · 🔀 820 · 📥 2.9K · 📦 22 · 📋 500 - 50% open · ⏱️ 06.01.2020):

     git clone https://github.com/Microsoft/MMdnn
    
  • PyPi (📥 1.1K / month · 📦 2 · ⏱️ 16.04.2019):

     pip install mmdnn
    
model-optimization (🥈23 · ⭐ 630) - A suite of tools that users, both novice and advanced, can use to.. Apache-2
  • GitHub (👨‍💻 20 · 🔀 120 · 📦 54 · 📋 69 - 53% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/model-optimization
    
  • PyPi (📥 13K / month · 📦 3 · ⏱️ 18.12.2019):

     pip install tensorflow-model-optimization
    
pytorch2keras (🥉18 · ⭐ 460) - PyTorch to Keras model converter. MIT
  • GitHub (👨‍💻 11 · 🔀 75 · 📦 8 · 📋 83 - 26% open · ⏱️ 14.01.2020):

     git clone https://github.com/nerox8664/pytorch2keras
    
  • PyPi (📥 350 / month · 📦 1 · ⏱️ 27.06.2019):

     pip install pytorch2keras
    
sklearn-porter (🥉17 · ⭐ 770) - Transpile trained scikit-learn estimators to C, Java, JavaScript and others. MIT
  • GitHub (👨‍💻 10 · 🔀 99 · 📋 52 - 50% open · ⏱️ 18.12.2019):

     git clone https://github.com/nok/sklearn-porter
    
  • PyPi (📥 400 / month · 📦 1 · ⏱️ 18.12.2019):

     pip install sklearn-porter
    

Model Interpretability

Back to top

Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.

shap (🥇29 · ⭐ 7.7K) - A game theoretic approach to explain the output of any machine learning model. MIT
  • GitHub (👨‍💻 100 · 🔀 1.1K · 📦 540 · 📋 830 - 55% open · ⏱️ 22.01.2020):

     git clone https://github.com/slundberg/shap
    
  • PyPi (📥 430K / month · 📦 140 · ⏱️ 27.12.2019):

     pip install shap
    
eli5 (🥇28 · ⭐ 1.8K) - A library for debugging/inspecting machine learning classifiers and explaining their.. MIT
  • GitHub (👨‍💻 14 · 🔀 250 · 📦 340 · 📋 210 - 51% open · ⏱️ 22.01.2020):

     git clone https://github.com/TeamHG-Memex/eli5
    
  • PyPi (📥 190K / month · 📦 96 · ⏱️ 29.08.2019):

     pip install eli5
    
Lime (🥇27 · ⭐ 7K) - Lime: Explaining the predictions of any machine learning classifier. BSD-2
  • GitHub (👨‍💻 47 · 🔀 1.1K · 📦 440 · 📋 340 - 7% open · ⏱️ 21.12.2019):

     git clone https://github.com/marcotcr/lime
    
  • PyPi (📥 130K / month · 📦 130 · ⏱️ 21.12.2019):

     pip install lime
    
Model Analysis (🥈26 · ⭐ 920) - Model analysis tools for TensorFlow. Apache-2
  • GitHub (👨‍💻 24 · 🔀 160 · 📋 23 - 34% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/model-analysis
    
  • PyPi (📥 1.1M / month · 📦 10 · ⏱️ 21.01.2020):

     pip install tensorflow-model-analysis
    
yellowbrick (🥈25 · ⭐ 2.6K) - Visual analysis and diagnostic tools to facilitate machine learning.. Apache-2
  • GitHub (👨‍💻 89 · 🔀 400 · 📋 530 - 17% open · ⏱️ 14.01.2020):

     git clone https://github.com/DistrictDataLabs/yellowbrick
    
  • PyPi (📥 18K / month · 📦 14 · ⏱️ 06.10.2019):

     pip install yellowbrick
    
pyLDAvis (🥈25 · ⭐ 1.1K · 💤) - Python library for interactive topic model visualization. Port of the R.. BSD-3
  • GitHub (👨‍💻 25 · 🔀 250 · 📦 700 · 📋 130 - 58% open · ⏱️ 30.03.2019):

     git clone https://github.com/bmabey/pyLDAvis
    
  • PyPi (📥 33K / month · 📦 75 · ⏱️ 05.06.2018):

     pip install pyldavis
    
Lucid (🥈23 · ⭐ 3.4K) - A collection of infrastructure and tools for research in neural network.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 450 · 📦 70 · 📋 140 - 40% open · ⏱️ 16.01.2020):

     git clone https://github.com/tensorflow/lucid
    
  • PyPi (📥 780 / month · 📦 6 · ⏱️ 20.12.2018):

     pip install lucid
    
keras-vis (🥈23 · ⭐ 2.5K · 💤) - Neural network visualization toolkit for keras. MIT
  • GitHub (👨‍💻 9 · 🔀 480 · 📦 200 · 📋 180 - 47% open · ⏱️ 26.06.2019):

     git clone https://github.com/raghakot/keras-vis
    
  • PyPi (📥 1.7K / month · 📦 11 · ⏱️ 06.07.2017):

     pip install keras-vis
    
InterpretML (🥈23 · ⭐ 2.3K) - Fit interpretable machine learning models. Explain blackbox machine learning. MIT
  • GitHub (👨‍💻 15 · 🔀 280 · 📦 25 · 📋 85 - 36% open · ⏱️ 14.01.2020):

     git clone https://github.com/interpretml/interpret
    
  • PyPi (📥 4.7K / month · 📦 2 · ⏱️ 12.12.2019):

     pip install interpret
    
Fairness 360 (🥉22 · ⭐ 820) - A comprehensive set of fairness metrics for datasets and machine learning.. Apache-2
  • GitHub (👨‍💻 27 · 🔀 230 · 📦 21 · 📋 45 - 35% open · ⏱️ 13.01.2020):

     git clone https://github.com/IBM/AIF360
    
  • PyPi (📥 860 / month · 📦 1 · ⏱️ 13.08.2019):

     pip install aif360
    
keract (🥉21 · ⭐ 680) - Activation Maps (Layers Outputs) and Gradients in Keras. MIT
  • GitHub (👨‍💻 9 · 🔀 140 · 📦 25 · 📋 44 - 4% open · ⏱️ 04.01.2020):

     git clone https://github.com/philipperemy/keract
    
  • PyPi (📥 1.1K / month · 📦 5 · ⏱️ 03.01.2020):

     pip install keract
    
TreeInterpreter (🥉20 · ⭐ 550 · 💤) - Package for interpreting scikit-learn's decision tree and random.. BSD-3
  • GitHub (👨‍💻 9 · 🔀 110 · 📦 62 · 📋 15 - 93% open · ⏱️ 31.05.2019):

     git clone https://github.com/andosa/treeinterpreter
    
  • PyPi (📥 42K / month · 📦 4 · ⏱️ 16.12.2018):

     pip install treeinterpreter
    
tf-explain (🥉20 · ⭐ 520 · 🐣) - Interpretability Methods for tf.keras models with TensorFlow 2.0. MIT
  • GitHub (👨‍💻 8 · 🔀 33 · 📦 10 · 📋 44 - 36% open · ⏱️ 09.01.2020):

     git clone https://github.com/sicara/tf-explain
    
  • PyPi (📥 1K / month · 📦 1 · ⏱️ 27.12.2019):

     pip install tf-explain
    
Captum (🥉19 · ⭐ 980 · 🐣) - Model interpretability and understanding for PyTorch. BSD-3
  • GitHub (👨‍💻 21 · 🔀 68 · 📦 6 · 📋 39 - 35% open · ⏱️ 16.01.2020):

     git clone https://github.com/pytorch/captum
    
  • PyPi (📥 790 / month · ⏱️ 09.10.2019):

     pip install captum
    
Alibi (🥉19 · ⭐ 420) - Algorithms for monitoring and explaining machine learning models. Apache-2
  • GitHub (👨‍💻 6 · 🔀 43 · 📦 17 · 📋 70 - 45% open · ⏱️ 05.12.2019):

     git clone https://github.com/SeldonIO/alibi
    
  • PyPi (📥 480 / month · 📦 5 · ⏱️ 01.10.2019):

     pip install alibi
    
iNNvestigate (🥉18 · ⭐ 500 · 💤) - A toolbox to iNNvestigate neural networks' predictions!. BSD-2
  • GitHub (👨‍💻 14 · 🔀 120 · 📦 13 · 📋 150 - 24% open · ⏱️ 02.06.2019):

     git clone https://github.com/albermax/innvestigate
    
  • PyPi (📥 720 / month · ⏱️ 19.03.2019):

     pip install innvestigate
    
Explainability 360 (🥉18 · ⭐ 450 · 🐣) - Interpretability and explainability of data and machine learning.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 91 · 📦 1 · 📋 16 - 56% open · ⏱️ 22.01.2020):

     git clone https://github.com/IBM/AIX360
    
  • PyPi (📥 190 / month · 📦 1 · ⏱️ 09.12.2019):

     pip install aix360
    
sklearn-evaluation (🥉17 · ⭐ 280 · 💤) - scikit-learn model evaluation made easy: plots, tables and markdown.. MIT
  • GitHub (👨‍💻 3 · 🔀 23 · 📦 14 · 📋 30 - 16% open · ⏱️ 03.05.2019):

     git clone https://github.com/edublancas/sklearn-evaluation
    
  • PyPi (📥 280 / month · 📦 2 · ⏱️ 03.05.2019):

     pip install sklearn-evaluation
    
LOFO (🥉16 · ⭐ 160) - Leave One Feature Out Importance. MIT
  • GitHub (👨‍💻 3 · 🔀 16 · 📦 2 · 📋 8 - 12% open · ⏱️ 27.12.2019):

     git clone https://github.com/aerdem4/lofo-importance
    
  • PyPi (📥 280 / month · ⏱️ 27.12.2019):

     pip install lofo-importance
    
Anchor (🥉15 · ⭐ 510) - Code for High-Precision Model-Agnostic Explanations paper. BSD-2
  • GitHub (👨‍💻 7 · 🔀 67 · 📋 35 - 25% open · ⏱️ 21.12.2019):

     git clone https://github.com/marcotcr/anchor
    
  • PyPi (📥 340 / month · ⏱️ 30.04.2018):

     pip install anchor_exp
    
XAI (🥉13 · ⭐ 370) - XAI - An eXplainability toolbox for machine learning. MIT
  • GitHub (👨‍💻 2 · 🔀 45 · 📦 2 · 📋 3 - 66% open · ⏱️ 05.10.2019):

     git clone https://github.com/EthicalML/xai
    
  • PyPi (📥 110 / month · ⏱️ 13.01.2019):

     pip install xai
    
Show 3 hidden projects...
scikit-plot (🥈23 · ⭐ 1.8K · 💀) - An intuitive library to add plotting functionality to scikit-learn objects. MIT
  • GitHub (👨‍💻 13 · 🔀 210 · 📦 270 · 📋 52 - 26% open · ⏱️ 19.08.2018):

     git clone https://github.com/reiinakano/scikit-plot
    
  • PyPi (📥 18K / month · 📦 5 · ⏱️ 19.08.2018):

     pip install scikit-plot
    
Skater (🥉17 · ⭐ 880) - Python Library for Model Interpretation/Explanations. ❗️UPL-1.0
  • GitHub (👨‍💻 30 · 🔀 140 · 📋 150 - 37% open · ⏱️ 30.07.2019):

     git clone https://github.com/oracle/Skater
    
  • PyPi (📥 730 / month · ⏱️ 21.09.2018):

     pip install skater
    
Attribution Priors (🥉12 · ⭐ 51) - Tools for training explainable models using attribution priors. MIT
  • GitHub (👨‍💻 5 · 🔀 2 · 📋 2 - 50% open · ⏱️ 02.11.2019):

     git clone https://github.com/suinleelab/attributionpriors
    
  • PyPi (📥 97 / month · ⏱️ 31.10.2019):

     pip install attributionpriors
    

Vector Similarity Search (ANN)

Back to top

Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search. For an awesome comparison, refer to this benchmark.
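
For context, approximate methods are usually compared against exact brute-force search, which scans every vector. The sketch below shows that exact baseline with cosine similarity in plain Python; the function names and the toy vectors are assumptions for illustration and are not part of any listed library.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def exact_top_k(query, vectors, k):
    """Return the indices of the k most similar vectors (exact, O(n * d) per query)."""
    ranked = sorted(
        range(len(vectors)),
        key=lambda i: cosine_similarity(query, vectors[i]),
        reverse=True,
    )
    return ranked[:k]


vectors = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
print(exact_top_k([1.0, 0.05, 0.0], vectors, k=2))
```

The libraries below trade a small amount of recall for much faster queries by building an index (for example trees or graphs) instead of scanning all vectors.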

Annoy (🥇29 · ⭐ 6.5K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage and.. Apache-2
  • GitHub (👨‍💻 59 · 🔀 710 · 📦 900 · 📋 230 - 7% open · ⏱️ 26.12.2019):

     git clone https://github.com/spotify/annoy
    
  • PyPi (📥 75K / month · 📦 360 · ⏱️ 26.12.2019):

     pip install annoy
    
Faiss (🥈27 · ⭐ 8.8K) - A library for efficient similarity search and clustering of dense vectors. MIT
  • GitHub (👨‍💻 52 · 🔀 1.6K · 📦 94 · 📋 950 - 7% open · ⏱️ 20.01.2020):

     git clone https://github.com/facebookresearch/faiss
    
  • PyPi (📥 8.1K / month · 📦 14 · ⏱️ 16.04.2019):

     pip install faiss
    
NMSLIB (🥈27 · ⭐ 1.8K) - Non-Metric Space Library (NMSLIB): An efficient similarity search library and a.. Apache-2
  • GitHub (👨‍💻 33 · 🔀 280 · 📦 110 · 📋 330 - 22% open · ⏱️ 10.01.2020):

     git clone https://github.com/nmslib/nmslib
    
  • PyPi (📥 21K / month · 📦 42 · ⏱️ 07.11.2019):

     pip install nmslib
    
hnswlib (🥉20 · ⭐ 730) - Header-only C++/python library for fast approximate nearest neighbors. Apache-2
  • GitHub (👨‍💻 25 · 🔀 140 · 📋 130 - 36% open · ⏱️ 09.01.2020):

     git clone https://github.com/nmslib/hnswlib
    
  • PyPi (📥 2K / month · 📦 2 · ⏱️ 16.12.2019):

     pip install hnswlib
    
PyNNDescent (🥉20 · ⭐ 200) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
  • GitHub (👨‍💻 9 · 🔀 30 · 📋 39 - 51% open · ⏱️ 21.01.2020):

     git clone https://github.com/lmcinnes/pynndescent
    
  • PyPi (📥 4.2K / month · 📦 3 · ⏱️ 12.01.2020):

     pip install pynndescent
    
Show 3 hidden projects...
Magnitude (🥉21 · ⭐ 1.2K · 💀) - A fast, efficient universal vector embedding utility package. MIT
  • GitHub (👨‍💻 4 · 🔀 81 · 📦 79 · 📋 63 - 28% open · ⏱️ 28.11.2018):

     git clone https://github.com/plasticityai/magnitude
    
  • PyPi (📥 2.4K / month · 📦 2 · ⏱️ 28.11.2018):

     pip install pymagnitude
    
NearPy (🥉21 · ⭐ 610 · 💀) - Python framework for fast (approximated) nearest neighbour search in large, high-.. MIT
  • GitHub (👨‍💻 18 · 🔀 120 · 📦 37 · 📋 56 - 33% open · ⏱️ 21.10.2018):

     git clone https://github.com/pixelogik/NearPy
    
  • PyPi (📥 1.1K / month · 📦 15 · ⏱️ 27.09.2016):

     pip install NearPy
    
PySparNN (🥉11 · ⭐ 790 · 💀) - Approximate Nearest Neighbor Search for Sparse Data in Python!. BSD-3
  • GitHub (👨‍💻 5 · 🔀 130 · 📋 22 - 40% open · ⏱️ 31.01.2018):

     git clone https://github.com/facebookresearch/pysparnn
    

Probabilistic Methods

Back to top

Libraries providing capabilities for probabilistic programming/reasoning, Bayesian inference, and Gaussian processes.
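
As a small worked example of Bayesian inference, the sketch below performs a conjugate Beta-Binomial update in plain Python. The function names and the observed counts are made up for illustration; the libraries listed below handle far more general models, typically with sampling or variational methods.

```python
def beta_binomial_update(alpha, beta, successes, failures):
    """Posterior Beta parameters after observing Bernoulli outcomes.

    With a Beta(alpha, beta) prior on the success probability, the
    posterior is Beta(alpha + successes, beta + failures).
    """
    return alpha + successes, beta + failures


def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution."""
    return alpha / (alpha + beta)


prior = (1.0, 1.0)   # uniform prior over the success probability
posterior = beta_binomial_update(*prior, successes=7, failures=3)
print(posterior, round(beta_mean(*posterior), 3))
```

Here the posterior mean moves from the prior mean of 0.5 toward the observed success rate of 0.7, weighted by how much data was seen.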

PyMC3 (🥇30 · ⭐ 4.8K) - Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine.. Apache-2
  • GitHub (👨‍💻 270 · 🔀 1.1K · 📥 51 · 📦 860 · 📋 1.9K - 6% open · ⏱️ 23.01.2020):

     git clone https://github.com/pymc-devs/pymc3
    
  • PyPi (📥 54K / month · 📦 140 · ⏱️ 30.11.2019):

     pip install pymc3
    
tensorflow-probability (🥇30 · ⭐ 2.6K) - Probabilistic reasoning and statistical analysis in TensorFlow. Apache-2
  • GitHub (👨‍💻 110 · 🔀 660 · 📦 1K · 📋 600 - 48% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/probability
    
  • PyPi (📥 97K / month · 📦 250 · ⏱️ 10.01.2020):

     pip install tensorflow-probability
    
Pyro (🥈28 · ⭐ 5.9K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
  • GitHub (👨‍💻 82 · 🔀 690 · 📦 160 · 📋 700 - 12% open · ⏱️ 23.01.2020):

     git clone https://github.com/pyro-ppl/pyro
    
  • PyPi (📥 9.1K / month · 📦 36 · ⏱️ 23.01.2020):

     pip install pyro-ppl
    
pomegranate (🥈27 · ⭐ 2.2K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
  • GitHub (👨‍💻 46 · 🔀 390 · 📦 170 · 📋 450 - 8% open · ⏱️ 12.01.2020):

     git clone https://github.com/jmschrei/pomegranate
    
  • PyPi (📥 37K / month · 📦 27 · ⏱️ 08.12.2019):

     pip install pomegranate
    
  • Conda (⏱️ 25.09.2019):

     conda install -c anaconda pomegranate
    
GPyTorch (🥈26 · ⭐ 1.8K) - A highly efficient and modular implementation of Gaussian Processes in PyTorch. MIT
  • GitHub (👨‍💻 46 · 🔀 230 · 📦 42 · 📋 540 - 17% open · ⏱️ 14.01.2020):

     git clone https://github.com/cornellius-gp/gpytorch
    
  • PyPi (📥 4.7K / month · 📦 4 · ⏱️ 14.01.2020):

     pip install gpytorch
    
GPflow (🥈26 · ⭐ 1.1K) - Gaussian processes in TensorFlow. Apache-2
  • GitHub (👨‍💻 47 · 🔀 320 · 📦 110 · 📋 550 - 13% open · ⏱️ 22.01.2020):

     git clone https://github.com/GPflow/GPflow
    
  • PyPi (📥 4.1K / month · 📦 13 · ⏱️ 14.11.2019):

     pip install gpflow
    
pgmpy (🥉23 · ⭐ 1.3K) - Python Library for Probabilistic Graphical Models. MIT
  • GitHub (👨‍💻 73 · 🔀 470 · 📦 81 · 📋 640 - 37% open · ⏱️ 22.01.2020):

     git clone https://github.com/pgmpy/pgmpy
    
  • PyPi (📥 7.9K / month · 📦 6 · ⏱️ 22.01.2020):

     pip install pgmpy
    
pyhsmm (🥉19 · ⭐ 440) - Bayesian inference in HSMMs and HMMs. MIT
  • GitHub (👨‍💻 13 · 🔀 140 · 📦 13 · 📋 85 - 30% open · ⏱️ 31.07.2019):

     git clone https://github.com/mattjj/pyhsmm
    
  • PyPi (📥 130 / month · 📦 4 · ⏱️ 10.05.2017):

     pip install pyhsmm
    
ZhuSuan (🥉15 · ⭐ 1.8K) - A library for Bayesian deep learning and generative models, based on TensorFlow. MIT
  • GitHub (👨‍💻 20 · 🔀 330 · 📋 60 - 13% open · ⏱️ 05.08.2019):

     git clone https://github.com/thu-ml/zhusuan
    
Show 3 hidden projects...
PyStan (🥈26 · ⭐ 760) - PyStan, the Python interface to Stan. ❗️GPL-3.0
  • GitHub (👨‍💻 48 · 🔀 170 · 📥 67 · 📦 1K · 📋 450 - 9% open · ⏱️ 17.12.2019):

     git clone https://github.com/stan-dev/pystan
    
  • PyPi (📥 570K / month · 📦 270 · ⏱️ 27.10.2019):

     pip install pystan
    
Edward (🥉24 · ⭐ 4.4K · 💀) - A probabilistic programming language in TensorFlow. Deep generative.. Apache-2
  • GitHub (👨‍💻 87 · 🔀 770 · 📥 8 · 📦 140 · 📋 540 - 39% open · ⏱️ 25.07.2018):

     git clone https://github.com/blei-lab/edward
    
  • PyPi (📥 2.4K / month · 📦 37 · ⏱️ 22.01.2018):

     pip install edward
    
Funsor (🥉15 · ⭐ 110) - Functional tensors for probabilistic programming. Apache-2
  • GitHub (👨‍💻 6 · 🔀 8 · 📋 77 - 45% open · ⏱️ 12.01.2020):

     git clone https://github.com/pyro-ppl/funsor
    
  • PyPi (📥 50 / month · ⏱️ 23.10.2019):

     pip install funsor
    

Adversarial Robustness

Back to top

Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.
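
To show what an adversarial example is, here is a minimal sketch of the fast gradient sign method (FGSM) against a tiny logistic-regression classifier, in plain Python. The weights, the input, and `epsilon` are hypothetical values chosen for the example; the libraries below implement this and many other attacks for real neural networks through their own APIs.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Probability of class 1 under a logistic-regression model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def fgsm(weights, bias, x, label, epsilon):
    """Perturb x by epsilon in the direction that increases the loss.

    For cross-entropy loss on a logistic model, the gradient with
    respect to the input is (p - label) * weights.
    """
    p = predict(weights, bias, x)
    grad = [(p - label) * w for w in weights]
    return [xi + epsilon * ((g > 0) - (g < 0)) for xi, g in zip(x, grad)]


weights, bias = [2.0, -1.0], 0.0          # made-up model parameters
x, label = [0.3, 0.1], 1                  # input correctly classified as class 1
x_adv = fgsm(weights, bias, x, label, epsilon=0.4)
print(predict(weights, bias, x) > 0.5, predict(weights, bias, x_adv) > 0.5)
```

Each input coordinate changes by at most `epsilon`, yet the predicted class flips, which is the kind of failure these toolboxes are designed to find and measure.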

Foolbox (🥇25 · ⭐ 1.3K) - Python toolbox to create adversarial examples that fool neural networks in PyTorch,.. MIT
  • GitHub (👨‍💻 24 · 🔀 240 · 📦 84 · 📋 240 - 9% open · ⏱️ 10.01.2020):

     git clone https://github.com/bethgelab/foolbox
    
  • PyPi (📥 1.3K / month · 📦 16 · ⏱️ 04.11.2019):

     pip install foolbox
    
ART (🥇25 · ⭐ 1.2K) - Python library for adversarial machine learning (evasion, extraction, poisoning,.. MIT
  • GitHub (👨‍💻 35 · 🔀 310 · 📦 19 · 📋 140 - 16% open · ⏱️ 17.01.2020):

     git clone https://github.com/IBM/adversarial-robustness-toolbox
    
  • PyPi (📥 1.3K / month · ⏱️ 08.01.2020):

     pip install adversarial-robustness-toolbox
    
CleverHans (🥈24 · ⭐ 4.4K) - An adversarial example library for constructing attacks, building defenses,.. MIT
  • GitHub (👨‍💻 100 · 🔀 1K · 📦 91 · 📋 400 - 9% open · ⏱️ 04.12.2019):

     git clone https://github.com/tensorflow/cleverhans
    
  • PyPi (📥 1K / month · 📦 3 · ⏱️ 27.11.2018):

     pip install cleverhans
    
AdvBox (🥉17 · ⭐ 790) - Advbox is a toolbox to generate adversarial examples that fool neural networks in.. Apache-2
  • GitHub (👨‍💻 15 · 🔀 170 · ⏱️ 17.01.2020):

     git clone https://github.com/advboxes/AdvBox
    
  • PyPi (📥 54 / month · ⏱️ 05.12.2018):

     pip install advbox
    
Show 2 hidden projects...
advertorch (🥉19 · ⭐ 490) - A Toolbox for Adversarial Robustness Research. ❗️GPL-3.0
  • GitHub (👨‍💻 13 · 🔀 63 · 📦 7 · 📋 24 - 12% open · ⏱️ 10.12.2019):

     git clone https://github.com/BorealisAI/advertorch
    
  • PyPi (📥 380 / month · 📦 3 · ⏱️ 05.12.2019):

     pip install advertorch
    
Adversary (🥉14 · ⭐ 310 · 💀) - Tool to generate adversarial text examples and test machine learning models.. MIT
  • GitHub (👨‍💻 5 · 🔀 43 · 📦 4 · ⏱️ 29.08.2018):

     git clone https://github.com/airbnb/artificial-adversary
    
  • PyPi (📥 53 / month · ⏱️ 29.08.2018):

     pip install Adversary
    

GPU Utilities

Libraries that require and make use of CUDA/GPU system capabilities to optimize data handling and machine learning tasks.

CuPy (🥇30 · ⭐ 3.9K) - NumPy-like API accelerated with CUDA. MIT
  • GitHub (👨‍💻 220 · 🔀 330 · 📋 800 - 41% open · ⏱️ 23.01.2020):

     git clone https://github.com/cupy/cupy
    
  • PyPi (📥 9.2K / month · 📦 190 · ⏱️ 22.01.2020):

     pip install cupy
    
  • Dockerhub (📥 44K · ⭐ 5 · ⏱️ 23.01.2020):

     docker pull cupy/cupy
    
jax (🥇27 · ⭐ 6.3K) - Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT.. Apache-2
  • GitHub (👨‍💻 110 · 🔀 450 · 📦 130 · 📋 880 - 36% open · ⏱️ 23.01.2020):

     git clone https://github.com/google/jax
    
  • PyPi (📥 11K / month · 📦 46 · ⏱️ 04.01.2020):

     pip install jax
    
PyCUDA (🥇27 · ⭐ 880) - CUDA integration for Python, plus shiny features. MIT
  • GitHub (👨‍💻 63 · 🔀 190 · 📦 380 · 📋 140 - 27% open · ⏱️ 30.11.2019):

     git clone https://github.com/inducer/pycuda
    
  • PyPi (📥 15K / month · 📦 260 · ⏱️ 12.08.2019):

     pip install pycuda
    
cuDF (🥈24 · ⭐ 2.4K) - cuDF - GPU DataFrame Library. Apache-2
  • GitHub (👨‍💻 150 · 🔀 330 · 📦 26 · 📋 1.9K - 25% open · ⏱️ 23.01.2020):

     git clone https://github.com/rapidsai/cudf
    
  • PyPi (📥 380 / month · ⏱️ 05.02.2019):

     pip install cudf
    
cuML (🥈22 · ⭐ 1K) - cuML - RAPIDS Machine Learning Library. Apache-2
  • GitHub (👨‍💻 76 · 🔀 160 · 📦 10 · 📋 660 - 53% open · ⏱️ 22.01.2020):

     git clone https://github.com/rapidsai/cuml
    
  • PyPi (📥 170 / month · ⏱️ 11.04.2019):

     pip install cuml
    
scikit-cuda (🥈22 · ⭐ 730) - Python interface to GPU-powered libraries. BSD-3
  • GitHub (👨‍💻 43 · 🔀 140 · 📦 75 · 📋 180 - 14% open · ⏱️ 23.01.2020):

     git clone https://github.com/lebedov/scikit-cuda
    
  • PyPi (📥 1.1K / month · 📦 31 · ⏱️ 27.05.2019):

     pip install scikit-cuda
    
ArrayFire (🥈21 · ⭐ 2.9K) - ArrayFire: a general purpose GPU library. BSD-3
  • GitHub (👨‍💻 71 · 🔀 410 · 📋 1.3K - 19% open · ⏱️ 22.01.2020):

     git clone https://github.com/arrayfire/arrayfire
    
  • PyPi (📥 790 / month · 📦 2 · ⏱️ 18.10.2018):

     pip install arrayfire
    
Apex (🥉19 · ⭐ 3.3K) - A PyTorch Extension: Tools for easy mixed precision and distributed training in.. BSD-3
  • GitHub (👨‍💻 52 · 🔀 390 · 📦 110 · 📋 520 - 46% open · ⏱️ 21.01.2020):

     git clone https://github.com/NVIDIA/apex
    
  • Conda:

     conda install -c conda-forge nvidia-apex
    
DALI (🥉18 · ⭐ 2.3K) - A library containing both highly optimized building blocks and an execution engine.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 270 · 📋 490 - 29% open · ⏱️ 23.01.2020):

     git clone https://github.com/NVIDIA/DALI
    
SpeedTorch (🥉18 · ⭐ 530 · 🐣) - Library for faster pinned CPU - GPU transfer in Pytorch. MIT
  • GitHub (👨‍💻 3 · 🔀 32 · 📦 2 · 📋 4 - 75% open · ⏱️ 06.01.2020):

     git clone https://github.com/Santosh-Gupta/SpeedTorch
    
  • PyPi (📥 280 / month · 📦 2 · ⏱️ 06.01.2020):

     pip install SpeedTorch
    
cuGraph (🥉18 · ⭐ 320) - cuGraph - RAPIDS Graph Analytics Library. Apache-2
  • GitHub (👨‍💻 37 · 🔀 59 · 📦 6 · 📋 320 - 28% open · ⏱️ 22.01.2020):

     git clone https://github.com/rapidsai/cugraph
    
  • PyPi (📥 30 / month · ⏱️ 04.04.2019):

     pip install cugraph
    
BlazingSQL (🥉16 · ⭐ 1.1K) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built.. Apache-2
  • GitHub (👨‍💻 16 · 🔀 96 · 📋 91 - 46% open · ⏱️ 22.01.2020):

     git clone https://github.com/BlazingDB/blazingsql
    
  • Conda:

     conda install -c blazingsql blazingsql-protocol
    
cuSignal (🥉11 · ⭐ 210 · 🐣) - GPU accelerated signal processing. Apache-2
  • GitHub (👨‍💻 6 · 🔀 19 · 📋 9 - 77% open · ⏱️ 10.01.2020):

     git clone https://github.com/rapidsai/cusignal
    
Show 3 hidden projects...
py3nvml (🥉20 · ⭐ 130) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside your program. BSD-3
  • GitHub (👨‍💻 5 · 🔀 17 · 📦 71 · 📋 9 - 11% open · ⏱️ 11.10.2019):

     git clone https://github.com/fbcotter/py3nvml
    
  • PyPi (📥 99K / month · 📦 5 · ⏱️ 04.03.2019):

     pip install py3nvml
    
nvidia-ml-py3 (🥉16 · ⭐ 40 · 💤) - Python 3 Bindings for the NVIDIA Management Library. BSD-3
  • GitHub (👨‍💻 2 · 🔀 9 · 📦 800 · ⏱️ 06.03.2019):

     git clone https://github.com/nicolargo/nvidia-ml-py3
    
  • PyPi (📥 150K / month · 📦 12 · ⏱️ 03.06.2017):

     pip install nvidia-ml-py3
    
ipyexperiments (🥉13 · ⭐ 97 · 💤) - jupyter/ipython experiment containers for GPU and general RAM re-use. Apache-2
  • GitHub (👨‍💻 3 · 🔀 7 · 📦 1 · ⏱️ 12.03.2019):

     git clone https://github.com/stas00/ipyexperiments
    
  • PyPi (📥 430 / month · ⏱️ 12.03.2019):

     pip install ipyexperiments
    

Tensorflow Utilities

Libraries that extend TensorFlow with general-purpose capabilities.

tensor2tensor (🥇31 · ⭐ 9.5K) - Library of deep learning models and datasets designed to make deep.. Apache-2
  • GitHub (👨‍💻 220 · 🔀 2.4K · 📦 440 · 📋 1.1K - 43% open · ⏱️ 20.01.2020):

     git clone https://github.com/tensorflow/tensor2tensor
    
  • PyPi (📥 23K / month · 📦 82 · ⏱️ 11.01.2020):

     pip install tensor2tensor
    
tensorflow-hub (🥈29 · ⭐ 2.1K) - A library for transfer learning by reusing parts of TensorFlow models. Apache-2
  • GitHub (👨‍💻 37 · 🔀 1.1K · 📦 1.5K · 📋 390 - 10% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/hub
    
  • PyPi (📥 390K / month · 📦 310 · ⏱️ 31.10.2019):

     pip install tensorflow-hub
    
TensorFlow Transform (🥉28 · ⭐ 790) - Input pipeline framework. Apache-2
  • GitHub (👨‍💻 26 · 🔀 150 · 📦 220 · 📋 100 - 16% open · ⏱️ 23.01.2020):

     git clone https://github.com/tensorflow/transform
    
  • PyPi (📥 2.2M / month · 📦 42 · ⏱️ 17.01.2020):

     pip install tensorflow-transform
    
TensorNets (🥉21 · ⭐ 890) - High level network definitions with pre-trained weights in TensorFlow. MIT
  • GitHub (👨‍💻 4 · 🔀 170 · 📦 18 · 📋 49 - 22% open · ⏱️ 23.01.2020):

     git clone https://github.com/taehoonlee/tensornets
    
  • PyPi (📥 440 / month · 📦 2 · ⏱️ 23.01.2020):

     pip install tensornets
    
Neural Structured Learning (🥉18 · ⭐ 560 · 🐣) - Training neural models with structured signals. Apache-2
  • GitHub (👨‍💻 12 · 🔀 75 · 📦 5 · 📋 27 - 18% open · ⏱️ 21.01.2020):

     git clone https://github.com/tensorflow/neural-structured-learning
    
  • PyPi (📥 380 / month · ⏱️ 18.09.2019):

     pip install neural-structured-learning
    

Pytorch Utilities

Libraries that extend PyTorch with general-purpose capabilities.

SRU (🥇22 · ⭐ 1.8K) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755). MIT
  • GitHub (👨‍💻 19 · 🔀 270 · 📦 8 · 📋 81 - 50% open · ⏱️ 23.10.2019):

     git clone https://github.com/asappresearch/sru
    
  • PyPi (📥 4.2K / month · ⏱️ 23.10.2019):

     pip install sru
    
torch-scatter (🥇22 · ⭐ 290) - PyTorch Extension Library of Optimized Scatter Operations. MIT
  • GitHub (👨‍💻 8 · 🔀 32 · 📋 91 - 24% open · ⏱️ 21.01.2020):

     git clone https://github.com/rusty1s/pytorch_scatter
    
  • PyPi (📥 8.7K / month · 📦 14 · ⏱️ 08.11.2019):

     pip install torch-scatter
    
AdaBound (🥈20 · ⭐ 2.7K · 💤) - An optimizer that trains as fast as Adam and as good as SGD. Apache-2
  • GitHub (👨‍💻 2 · 🔀 300 · 📦 45 · 📋 20 - 65% open · ⏱️ 06.03.2019):

     git clone https://github.com/Luolc/AdaBound
    
  • PyPi (📥 1.7K / month · 📦 1 · ⏱️ 26.02.2019):

     pip install adabound
    
EfficientNets (🥉18 · ⭐ 750) - Pretrained EfficientNet, MixNet, MobileNetV3, MNASNet A1 and B1, FBNet,.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 92 · 📦 3 · ⏱️ 23.01.2020):

     git clone https://github.com/rwightman/gen-efficientnet-pytorch
    
  • PyPi (📥 170 / month · ⏱️ 23.01.2020):

     pip install geffnet
    
Torchmeta (🥉18 · ⭐ 520) - A collection of extensions and data-loaders for few-shot learning & meta-.. MIT
  • GitHub (👨‍💻 4 · 🔀 52 · 📦 2 · 📋 15 - 33% open · ⏱️ 11.01.2020):

     git clone https://github.com/tristandeleu/pytorch-meta
    
  • PyPi (📥 370 / month · ⏱️ 11.01.2020):

     pip install torchmeta
    
PyTorch Sparse (🥉18 · ⭐ 170) - PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations. MIT
  • GitHub (👨‍💻 5 · 🔀 15 · 📋 26 - 23% open · ⏱️ 22.11.2019):

     git clone https://github.com/rusty1s/pytorch_sparse
    
  • PyPi (📥 6.4K / month · 📦 5 · ⏱️ 01.05.2019):

     pip install torch-sparse
    
Higher (🥉10 · ⭐ 540 · 🐣) - higher is a pytorch library allowing users to obtain higher order.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 35 · 📋 20 - 55% open · ⏱️ 16.01.2020):

     git clone https://github.com/facebookresearch/higher
    

Database Clients

Libraries for connecting to, operating, and querying databases.

redis (🥇38 · ⭐ 8.2K) - Redis Python Client. MIT
  • GitHub (👨‍💻 230 · 🔀 1.8K · 📦 66K · 📋 800 - 23% open · ⏱️ 30.12.2019):

     git clone https://github.com/andymccurdy/redis-py
    
  • PyPi (📥 7.3M / month · 📦 49K · ⏱️ 13.10.2019):

     pip install redis
    
google-cloud-storage (🥇36 · ⭐ 3.1K) - Google Cloud Client Library for Python. Apache-2
  • GitHub (👨‍💻 310 · 🔀 1.3K · 📦 5.8K · 📋 3.5K - 7% open · ⏱️ 23.01.2020):

     git clone https://github.com/googleapis/google-cloud-python
    
  • PyPi (📥 8.9M / month · 📦 2.1K · ⏱️ 16.01.2020):

     pip install google-cloud-storage
    
pymongo (🥇36 · ⭐ 2.9K) - PyMongo - the Python driver for MongoDB. Apache-2
  • GitHub (👨‍💻 170 · 🔀 920 · 📦 51K · ⏱️ 15.01.2020):

     git clone https://github.com/mongodb/mongo-python-driver
    
  • PyPi (📥 5.3M / month · 📦 36K · ⏱️ 08.01.2020):

     pip install pymongo
    
SQLAlchemy (🥇36 · ⭐ 1.8K) - The Database Toolkit for Python. MIT
  • GitHub (👨‍💻 410 · 🔀 280 · 📦 140K · 📋 5K - 3% open · ⏱️ 23.01.2020):

     git clone https://github.com/sqlalchemy/sqlalchemy
    
  • PyPi (📥 8.1M / month · 📦 84K · ⏱️ 22.01.2020):

     pip install SQLAlchemy
    
boto3 (🥈35 · ⭐ 5.1K) - AWS SDK for Python. Apache-2
  • GitHub (👨‍💻 86 · 🔀 1.1K · 📦 47K · 📋 1.9K - 17% open · ⏱️ 21.01.2020):

     git clone https://github.com/boto/boto3
    
  • PyPi (📥 33M / month · 📦 23K · ⏱️ 21.01.2020):

     pip install boto3
    
elasticsearch (🥈34 · ⭐ 2.6K) - Official Python low-level client for Elasticsearch. Apache-2
  • GitHub (👨‍💻 150 · 🔀 860 · 📦 12K · 📋 750 - 11% open · ⏱️ 19.01.2020):

     git clone https://github.com/elastic/elasticsearch-py
    
  • PyPi (📥 3.1M / month · 📦 11K · ⏱️ 19.01.2020):

     pip install elasticsearch
    
PyMySQL (🥈33 · ⭐ 5.7K) - Pure Python MySQL Client. MIT
  • GitHub (👨‍💻 100 · 🔀 1.2K · 📦 42K · 📋 480 - 3% open · ⏱️ 11.12.2019):

     git clone https://github.com/PyMySQL/PyMySQL
    
  • PyPi (📥 3.9M / month · 📦 18K · ⏱️ 18.12.2018):

     pip install PyMySQL
    
kafka-python (🥈33 · ⭐ 3.6K) - Python client for Apache Kafka. Apache-2
  • GitHub (👨‍💻 180 · 🔀 1K · 📥 1.6K · 📦 3K · 📋 1.2K - 10% open · ⏱️ 30.12.2019):

     git clone https://github.com/dpkp/kafka-python
    
  • PyPi (📥 1.1M / month · 📦 2K · ⏱️ 30.09.2019):

     pip install kafka-python
    
Cassandra Driver (🥈31 · ⭐ 1.1K) - DataStax Python Driver for Apache Cassandra. Apache-2
  • GitHub (👨‍💻 170 · 🔀 420 · 📦 1.5K · ⏱️ 22.01.2020):

     git clone https://github.com/datastax/python-driver
    
  • PyPi (📥 210K / month · 📦 1.2K · ⏱️ 15.01.2020):

     pip install cassandra-driver
    
  • Conda:

     conda install -c conda-forge cassandra-driver
    
confluent-kafka-python (🥈30 · ⭐ 1.6K) - Confluent's Kafka Python Client. Apache-2
  • GitHub (👨‍💻 53 · 🔀 420 · 📦 980 · 📋 540 - 26% open · ⏱️ 08.01.2020):

     git clone https://github.com/confluentinc/confluent-kafka-python
    
  • PyPi (📥 520K / month · 📦 290 · ⏱️ 13.12.2019):

     pip install confluent-kafka
    
pyodbc (🥈30 · ⭐ 1.6K) - Python ODBC bridge. MIT
  • GitHub (👨‍💻 34 · 🔀 350 · 📋 570 - 22% open · ⏱️ 21.12.2019):

     git clone https://github.com/mkleehammer/pyodbc
    
  • PyPi (📥 1.6M / month · 📦 3.3K · ⏱️ 21.12.2019):

     pip install pyodbc
    
Prometheus Client (🥈30 · ⭐ 1.4K) - Prometheus instrumentation library for Python applications. Apache-2
  • GitHub (👨‍💻 84 · 🔀 380 · 📋 270 - 15% open · ⏱️ 23.12.2019):

     git clone https://github.com/prometheus/client_python
    
  • PyPi (📥 4.5M / month · 📦 1.2K · ⏱️ 20.06.2019):

     pip install prometheus_client
    
  • Conda:

     conda install -c conda-forge prometheus_client
    
influxdb (🥈30 · ⭐ 1.3K) - Python client for InfluxDB. MIT
  • GitHub (👨‍💻 120 · 🔀 420 · 📦 2.4K · 📋 460 - 30% open · ⏱️ 05.12.2019):

     git clone https://github.com/influxdata/influxdb-python
    
  • PyPi (📥 740K / month · 📦 1.9K · ⏱️ 26.08.2019):

     pip install influxdb
    
Motor (🥉29 · ⭐ 1.4K) - Motor - the async Python driver for MongoDB and Tornado or asyncio. Apache-2
  • GitHub (👨‍💻 26 · 🔀 120 · 📦 1.9K · ⏱️ 12.12.2019):

     git clone https://github.com/mongodb/motor
    
  • PyPi (📥 100K / month · 📦 1.8K · ⏱️ 12.12.2019):

     pip install motor
    
dataset (🥉28 · ⭐ 3.7K) - Easy-to-use data handling for SQL data stores with support for implicit table.. MIT
  • GitHub (👨‍💻 63 · 🔀 240 · 📦 1.2K · 📋 210 - 2% open · ⏱️ 11.01.2020):

     git clone https://github.com/pudo/dataset
    
  • PyPi (📥 54K / month · 📦 1K · ⏱️ 20.01.2020):

     pip install dataset
    
libcloud (🥉28 · ⭐ 1.6K) - Apache Libcloud is a Python library which hides differences between different.. Apache-2
  • GitHub (👨‍💻 390 · 🔀 840 · 📦 1.1K · 📋 61 - 42% open · ⏱️ 22.01.2020):

     git clone https://github.com/apache/libcloud
    
  • PyPi (📥 110K / month · 📦 1.5K · ⏱️ 02.01.2020):

     pip install apache-libcloud
    
pandas-gbq (🥉28 · ⭐ 180) - Pandas Google BigQuery. BSD-3
  • GitHub (👨‍💻 26 · 🔀 72 · 📥 73 · 📦 680 · 📋 160 - 16% open · ⏱️ 12.12.2019):

     git clone https://github.com/pydata/pandas-gbq
    
  • PyPi (📥 880K / month · 📦 420 · ⏱️ 12.12.2019):

     pip install pandas-gbq
    
  • Conda:

     conda install -c conda-forge pandas-gbq
    
Records (🥉27 · ⭐ 6.1K) - SQL for Humans. ISC
  • GitHub (👨‍💻 37 · 🔀 480 · 📦 500 · 📋 120 - 33% open · ⏱️ 20.08.2019):

     git clone https://github.com/kennethreitz/records
    
  • PyPi (📥 68K / month · 📦 300 · ⏱️ 21.02.2019):

     pip install records
    
  • Conda:

     conda install -c conda-forge records
    
minio (🥉27 · ⭐ 280) - MinIO Client SDK for Python. Apache-2
  • GitHub (👨‍💻 70 · 🔀 130 · 📦 570 · 📋 310 - 3% open · ⏱️ 23.01.2020):

     git clone https://github.com/minio/minio-py
    
  • PyPi (📥 120K / month · 📦 340 · ⏱️ 30.12.2019):

     pip install minio
    
cx-Oracle (🥉26 · ⭐ 570) - Python interface to Oracle Database conforming to the Python DB API 2.0.. BSD-3
  • GitHub (👨‍💻 15 · 🔀 210 · 📋 380 - 8% open · ⏱️ 21.01.2020):

     git clone https://github.com/oracle/python-cx_Oracle
    
  • PyPi (📥 660K / month · 📦 140 · ⏱️ 03.12.2019):

     pip install cx-Oracle
    
Pony (🥉25 · ⭐ 2K) - Pony Object Relational Mapper. Apache-2
  • GitHub (👨‍💻 19 · 🔀 160 · 📦 1.1K · 📋 450 - 36% open · ⏱️ 30.10.2019):

     git clone https://github.com/ponyorm/pony
    
  • PyPi (📥 14K / month · 📦 490 · ⏱️ 23.10.2019):

     pip install pony
    
  • Conda:

     conda install -c conda-forge pony
    
Ibis (🥉24 · ⭐ 1.3K) - A pandas-like deferred expression system, with first-class SQL support. Apache-2
  • GitHub (👨‍💻 44 · 🔀 180 · 📦 49 · 📋 1.2K - 32% open · ⏱️ 21.01.2020):

     git clone https://github.com/ibis-project/ibis
    
  • PyPi (📥 2.5K / month · 📦 19 · ⏱️ 24.06.2019):

     pip install ibis-framework
    
  • Conda:

     conda install -c conda-forge ibis-framework
    
HappyBase (🥉24 · ⭐ 510) - A developer-friendly Python library to interact with Apache HBase. MIT
  • GitHub (👨‍💻 17 · 🔀 140 · 📦 350 · 📋 190 - 11% open · ⏱️ 07.12.2019):

     git clone https://github.com/wbolster/happybase
    
  • PyPi (📥 180K / month · 📦 410 · ⏱️ 03.04.2017):

     pip install happybase
    
  • Conda:

     conda install -c conda-forge happybase
    
neo4j-driver (🥉23 · ⭐ 450) - Neo4j Bolt driver for Python. Apache-2
  • GitHub (👨‍💻 23 · 🔀 110 · 📋 130 - 8% open · ⏱️ 20.01.2020):

     git clone https://github.com/neo4j/neo4j-python-driver
    
  • PyPi (📥 40K / month · 📦 370 · ⏱️ 11.11.2019):

     pip install neo4j-driver
    
AWS Data Wrangler (🥉20 · ⭐ 450) - DataFrames on AWS. Apache-2
  • GitHub (👨‍💻 8 · 🔀 37 · 📥 1.1K · 📋 57 - 24% open · ⏱️ 22.01.2020):

     git clone https://github.com/awslabs/aws-data-wrangler
    
  • PyPi (📥 8.3K / month · ⏱️ 15.01.2020):

     pip install awswrangler
    
SuperSQLite (🥉15 · ⭐ 610) - A supercharged SQLite library for Python. MIT
  • GitHub (👨‍💻 2 · 🔀 13 · 📦 1 · ⏱️ 27.08.2019):

     git clone https://github.com/plasticityai/supersqlite
    
  • PyPi (📥 450 / month · ⏱️ 19.11.2018):

     pip install supersqlite
    
Show 8 hidden projects...
psycopg2 (🥈34 · ⭐ 1.6K) - PostgreSQL database adapter for the Python programming.. ❗️BSD-3-Clause-Attribution
  • GitHub (👨‍💻 91 · 🔀 300 · 📦 180K · 📋 820 - 4% open · ⏱️ 30.12.2019):

     git clone https://github.com/psycopg/psycopg2
    
  • PyPi (📥 5.8M / month · 📦 120K · ⏱️ 14.04.2019):

     pip install psycopg2
    
  • Conda (⏱️ 08.11.2019):

     conda install -c anaconda psycopg2
    
mysqlclient (🥈30 · ⭐ 1.6K) - MySQL database connector for Python (with Python 3 support). ❗️GPL-2.0
  • GitHub (👨‍💻 62 · 🔀 270 · 📥 4.9K · 📋 230 - 4% open · ⏱️ 20.01.2020):

     git clone https://github.com/PyMySQL/mysqlclient-python
    
  • PyPi (📥 2.1M / month · 📦 6.6K · ⏱️ 21.11.2019):

     pip install mysqlclient
    
  • Conda:

     conda install -c conda-forge mysqlclient
    
mongo-connector (🥉25 · ⭐ 1.6K · 💀) - MongoDB data stream pipeline tools by YouGov (adopted from MongoDB). Apache-2
  • GitHub (👨‍💻 61 · 🔀 440 · 📦 65 · 📋 710 - 30% open · ⏱️ 05.12.2018):

     git clone https://github.com/yougov/mongo-connector
    
  • PyPi (📥 9K / month · 📦 29 · ⏱️ 05.12.2018):

     pip install mongo-connector
    
cloudant (🥉25 · ⭐ 130) - A Python library for Cloudant and CouchDB. Apache-2
  • GitHub (👨‍💻 24 · 🔀 37 · 📦 760 · 📋 230 - 14% open · ⏱️ 15.10.2019):

     git clone https://github.com/cloudant/python-cloudant
    
  • PyPi (📥 250K / month · 📦 490 · ⏱️ 28.03.2019):

     pip install cloudant
    
db.py (🥉21 · ⭐ 1.2K · 💀) - db.py is an easier way to interact with your databases. BSD-2
  • GitHub (👨‍💻 24 · 🔀 110 · 📋 53 - 60% open · ⏱️ 31.03.2017):

     git clone https://github.com/yhat/db.py
    
  • PyPi (📥 1K / month · 📦 19 · ⏱️ 31.03.2017):

     pip install db.py
    
pyhdb (🥉21 · ⭐ 260 · 💀) - SAP HANA Connector in pure Python. Apache-2
  • GitHub (👨‍💻 10 · 🔀 110 · 📥 34 · 📦 75 · 📋 81 - 38% open · ⏱️ 24.05.2018):

     git clone https://github.com/SAP/pyhdb
    
  • PyPi (📥 30K / month · 📦 9 · ⏱️ 16.02.2018):

     pip install pyhdb
    
filedepot (🥉21 · ⭐ 110) - Toolkit for storing files and attachments in web applications. MIT
  • GitHub (👨‍💻 18 · 🔀 30 · 📦 95 · 📋 32 - 6% open · ⏱️ 26.11.2019):

     git clone https://github.com/amol-/depot
    
  • PyPi (📥 5.3K / month · 📦 86 · ⏱️ 13.08.2019):

     pip install filedepot
    
lazydata (🥉17 · ⭐ 610 · 💀) - Lazydata: Scalable data dependencies for Python projects. Apache-2
  • GitHub (👨‍💻 6 · 🔀 23 · 📦 6 · 📋 12 - 50% open · ⏱️ 12.10.2018):

     git clone https://github.com/rstojnic/lazydata
    
  • PyPi (📥 170 / month · ⏱️ 18.09.2018):

     pip install lazydata
    

System Monitoring & Profiling

Libraries to monitor, collect, and visualize system metrics (e.g., memory, CPU, GPU).

psutil (🥇34 · ⭐ 5.8K) - Cross-platform lib for process and system monitoring in Python. BSD-3
  • GitHub (👨‍💻 130 · 🔀 910 · 📦 37K · 📋 1.3K - 13% open · ⏱️ 19.01.2020):

     git clone https://github.com/giampaolo/psutil
    
  • PyPi (📥 9.8M / month · 📦 25K · ⏱️ 26.11.2019):

     pip install psutil
    
  • Conda:

     conda install -c conda-forge psutil
    
memory-profiler (🥈27 · ⭐ 2.1K) - Monitor Memory usage of Python code. BSD-3
  • GitHub (👨‍💻 74 · 🔀 240 · 📋 160 - 41% open · ⏱️ 11.01.2020):

     git clone https://github.com/pythonprofilers/memory_profiler
    
  • PyPi (📥 160K / month · 📦 530 · ⏱️ 16.08.2018):

     pip install memory_profiler
    
py-spy (🥈25 · ⭐ 5K) - Sampling profiler for Python programs. MIT
  • GitHub (👨‍💻 15 · 🔀 180 · 📥 960 · 📦 100 · 📋 140 - 22% open · ⏱️ 02.01.2020):

     git clone https://github.com/benfred/py-spy
    
  • PyPi (📥 52K / month · 📦 25 · ⏱️ 02.01.2020):

     pip install py-spy
    
pyinstrument (🥈25 · ⭐ 1.5K) - Call stack profiler for Python. Shows you why your code is slow! BSD-3
  • GitHub (👨‍💻 19 · 🔀 78 · 📦 90 · 📋 50 - 22% open · ⏱️ 22.12.2019):

     git clone https://github.com/joerick/pyinstrument
    
  • PyPi (📥 27K / month · 📦 130 · ⏱️ 07.12.2019):

     pip install pyinstrument
    
line_profiler (🥉24 · ⭐ 3.3K · 💤) - Line-by-line profiling for Python. BSD-3
  • GitHub (👨‍💻 14 · 🔀 220 · 📋 140 - 43% open · ⏱️ 23.04.2019):

     git clone https://github.com/rkern/line_profiler
    
  • PyPi (📥 49K / month · 📦 300 · ⏱️ 20.12.2017):

     pip install line_profiler
    
Yappi (🥉24 · ⭐ 220) - Yet Another Python Profiler, but this time thread&coroutine aware. MIT
  • GitHub (👨‍💻 16 · 🔀 24 · 📦 200 · 📋 24 - 45% open · ⏱️ 08.01.2020):

     git clone https://github.com/sumerc/yappi
    
  • PyPi (📥 56K / month · 📦 640 · ⏱️ 19.12.2019):

     pip install yappi
    
gpustat (🥉23 · ⭐ 1.6K) - A simple command-line utility for querying and monitoring GPU status. MIT
  • GitHub (👨‍💻 8 · 🔀 140 · 📦 160 · 📋 53 - 33% open · ⏱️ 21.10.2019):

     git clone https://github.com/wookayin/gpustat
    
  • PyPi (📥 22K / month · 📦 40 · ⏱️ 22.07.2019):

     pip install gpustat
    
GPUtil (🥉22 · ⭐ 460) - A Python module for getting the GPU status from NVIDIA GPUs using nvidia-smi.. MIT
  • GitHub (👨‍💻 13 · 🔀 59 · 📦 320 · 📋 18 - 27% open · ⏱️ 16.08.2019):

     git clone https://github.com/anderskm/gputil
    
  • PyPi (📥 42K / month · 📦 24 · ⏱️ 18.12.2018):

     pip install gputil
    
Profiling (🥉21 · ⭐ 3K) - An interactive continuous Python profiler. BSD-3
  • GitHub (👨‍💻 18 · 🔀 110 · 📦 22 · 📋 34 - 41% open · ⏱️ 09.12.2019):

     git clone https://github.com/what-studio/profiling
    
  • PyPi (📥 1.1K / month · 📦 5 · ⏱️ 28.06.2017):

     pip install profiling
    
Scalene (🥉19 · ⭐ 1.5K · 🐣) - a high-performance, high-precision CPU and memory profiler for Python. Apache-2
  • GitHub (👨‍💻 6 · 🔀 39 · 📦 1 · 📋 22 - 50% open · ⏱️ 23.01.2020):

     git clone https://github.com/emeryberger/scalene
    
  • PyPi (📥 1.5K / month · ⏱️ 18.01.2020):

     pip install scalene
    
StackImpact (🥉14 · ⭐ 660 · 💤) - StackImpact Python Profiler - Production-Grade Performance Profiler: CPU,.. BSD-3
  • GitHub (👨‍💻 1 · 🔀 17 · 📦 3 · ⏱️ 14.05.2019):

     git clone https://github.com/stackimpact/stackimpact-python
    
  • PyPi (📥 660 / month · 📦 1 · ⏱️ 14.05.2019):

     pip install stackimpact
    
Show 2 hidden projects...
Diamond (🥇28 · ⭐ 1.5K · 💀) - Diamond is a python daemon that collects system metrics and publishes them to.. MIT
  • GitHub (👨‍💻 340 · 🔀 560 · 📦 44 · 📋 340 - 63% open · ⏱️ 15.03.2018):

     git clone https://github.com/python-diamond/Diamond
    
  • PyPi (📥 69K / month · 📦 31 · ⏱️ 25.11.2016):

     pip install diamond
    
Glances (🥈27 · ⭐ 15K) - Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac.. ❗️LGPL-3.0
  • GitHub (👨‍💻 120 · 🔀 990 · 📥 350 · 📦 160 · 📋 1.1K - 10% open · ⏱️ 22.01.2020):

     git clone https://github.com/nicolargo/glances
    
  • PyPi (📥 41K / month · 📦 44 · ⏱️ 27.08.2019):

     pip install glances
    

Others

SymPy (🥇35 · ⭐ 6.6K) - A computer algebra system written in pure Python. BSD-3
  • GitHub (👨‍💻 900 · 🔀 2.9K · 📥 370K · 📦 16K · 📋 10K - 36% open · ⏱️ 23.01.2020):

     git clone https://github.com/sympy/sympy
    
  • PyPi (📥 410K / month · 📦 6.4K · ⏱️ 05.01.2020):

     pip install sympy
    
  • Conda (⏱️ 07.01.2020):

     conda install -c anaconda sympy
    
Autograd (🥇29 · ⭐ 4.5K) - Efficiently computes derivatives of numpy code. MIT
  • GitHub (👨‍💻 51 · 🔀 650 · 📦 580 · 📋 360 - 40% open · ⏱️ 18.11.2019):

     git clone https://github.com/HIPS/autograd
    
  • PyPi (📥 180K / month · 📦 270 · ⏱️ 25.07.2019):

     pip install autograd
    
imbalanced-learn (🥇29 · ⭐ 4.1K) - A Python Package to Tackle the Curse of Imbalanced Datasets in Machine.. MIT
  • GitHub (👨‍💻 45 · 🔀 890 · 📦 1.3K · 📋 360 - 6% open · ⏱️ 07.12.2019):

     git clone https://github.com/scikit-learn-contrib/imbalanced-learn
    
  • PyPi (📥 320K / month · 📦 270 · ⏱️ 07.12.2019):

     pip install imbalanced-learn
    
Mlxtend (🥇27 · ⭐ 2.7K) - A library of extension and helper modules for Python's data analysis and.. BSD-3
  • GitHub (👨‍💻 67 · 🔀 590 · 📦 750 · 📋 290 - 26% open · ⏱️ 07.01.2020):

     git clone https://github.com/rasbt/mlxtend
    
  • PyPi (📥 68K / month · 📦 81 · ⏱️ 19.07.2019):

     pip install mlxtend
    
PyOD (🥈26 · ⭐ 2.8K) - A Python Toolbox for Scalable Outlier Detection (Anomaly Detection). BSD-2
  • GitHub (👨‍💻 14 · 🔀 550 · 📦 75 · 📋 100 - 42% open · ⏱️ 30.12.2019):

     git clone https://github.com/yzhao062/pyod
    
  • PyPi (📥 130K / month · 📦 6 · ⏱️ 30.12.2019):

     pip install pyod
    
Streamlit (🥈25 · ⭐ 6.5K · 🐣) - Streamlit: The fastest way to build custom ML tools. Apache-2
  • GitHub (👨‍💻 42 · 🔀 550 · 📋 650 - 56% open · ⏱️ 22.01.2020):

     git clone https://github.com/streamlit/streamlit
    
  • PyPi (📥 18K / month · 📦 17 · ⏱️ 15.01.2020):

     pip install streamlit
    
category_encoders (🥈25 · ⭐ 1.2K) - A library of sklearn compatible categorical variable encoders. BSD-3
  • GitHub (👨‍💻 28 · 🔀 240 · 📋 160 - 21% open · ⏱️ 08.01.2020):

     git clone https://github.com/scikit-learn-contrib/categorical-encoding
    
  • PyPi (📥 96K / month · 📦 11 · ⏱️ 14.10.2018):

     pip install category_encoders
    
DeepChem (🥈23 · ⭐ 1.9K) - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials.. MIT
  • GitHub (👨‍💻 110 · 🔀 600 · 📋 900 - 31% open · ⏱️ 18.01.2020):

     git clone https://github.com/deepchem/deepchem
    
  • PyPi (📥 630 / month · 📦 3 · ⏱️ 03.05.2019):

     pip install deepchem
    
kmodes (🥈23 · ⭐ 660) - Python implementations of the k-modes and k-prototypes clustering algorithms, for.. MIT
  • GitHub (👨‍💻 15 · 🔀 300 · 📦 92 · 📋 120 - 21% open · ⏱️ 07.01.2020):

     git clone https://github.com/nicodv/kmodes
    
  • PyPi (📥 25K / month · 📦 12 · ⏱️ 23.04.2019):

     pip install kmodes
    
metric-learn (🥈22 · ⭐ 930) - Metric learning algorithms in Python. MIT
  • GitHub (👨‍💻 19 · 🔀 190 · 📦 53 · 📋 140 - 33% open · ⏱️ 20.01.2020):

     git clone https://github.com/scikit-learn-contrib/metric-learn
    
  • PyPi (📥 1.1K / month · 📦 9 · ⏱️ 18.07.2019):

     pip install metric-learn
    
tensorly (🥈22 · ⭐ 800) - TensorLy: Tensor Learning in Python. BSD-2
  • GitHub (👨‍💻 29 · 🔀 160 · 📋 84 - 22% open · ⏱️ 06.01.2020):

     git clone https://github.com/tensorly/tensorly
    
  • PyPi (📥 1.8K / month · 📦 9 · ⏱️ 16.12.2019):

     pip install tensorly
    
AstroML (🥈22 · ⭐ 590) - Machine learning, statistics, and data mining for astronomy and astrophysics. BSD-2
  • GitHub (👨‍💻 27 · 🔀 220 · 📦 110 · 📋 120 - 39% open · ⏱️ 31.12.2019):

     git clone https://github.com/astroML/astroML
    
  • PyPi (📥 910 / month · 📦 27 · ⏱️ 02.10.2019):

     pip install astroML
    
pyjanitor (🥈22 · ⭐ 450) - Clean APIs for data cleaning. Python implementation of R package Janitor. MIT
  • GitHub (👨‍💻 75 · 🔀 96 · 📦 20 · 📋 300 - 24% open · ⏱️ 16.01.2020):

     git clone https://github.com/ericmjl/pyjanitor
    
  • PyPi (📥 1.1K / month · 📦 2 · ⏱️ 17.11.2019):

     pip install pyjanitor
    
PySwarms (🥉21 · ⭐ 520) - A research toolkit for particle swarm optimization in Python. MIT
  • GitHub (👨‍💻 30 · 🔀 170 · 📦 23 · 📋 140 - 11% open · ⏱️ 17.11.2019):

     git clone https://github.com/ljvmiranda921/pyswarms
    
  • PyPi (📥 1.9K / month · ⏱️ 18.02.2019):

     pip install pyswarms
    
scikit-multilearn (🥉21 · ⭐ 470 · 💤) - A scikit-learn based module for multi-label et al. classification. BSD-2
  • GitHub (👨‍💻 15 · 🔀 94 · 📦 120 · 📋 140 - 36% open · ⏱️ 21.05.2019):

     git clone https://github.com/scikit-multilearn/scikit-multilearn
    
  • PyPi (📥 7.8K / month · 📦 3 · ⏱️ 10.12.2018):

     pip install scikit-multilearn
    
Prince (🥉21 · ⭐ 350) - Python factor analysis library (PCA, CA, MCA, MFA, FAMD). MIT
  • GitHub (👨‍💻 8 · 🔀 75 · 📦 29 · 📋 70 - 17% open · ⏱️ 07.01.2020):

     git clone https://github.com/MaxHalford/prince
    
  • PyPi (📥 3.3K / month · 📦 3 · ⏱️ 08.01.2020):

     pip install prince
    
StreamAlert (🥉20 · ⭐ 2.1K) - StreamAlert is a serverless, realtime data analysis framework which.. Apache-2
  • GitHub (👨‍💻 30 · 🔀 240 · 📋 280 - 25% open · ⏱️ 30.08.2019):

     git clone https://github.com/airbnb/streamalert
    
Mars (🥉20 · ⭐ 1.7K) - Mars is a tensor-based unified framework for large-scale data computation which.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 190 · 📦 2 · 📋 400 - 19% open · ⏱️ 22.01.2020):

     git clone https://github.com/mars-project/mars
    
  • PyPi (📥 990 / month · ⏱️ 19.01.2020):

     pip install pymars
    
sklearn-contrib-lightning (🥉20 · ⭐ 1.3K · 💤) - Large-scale linear classification, regression and ranking.. BSD-3
  • GitHub (👨‍💻 13 · 🔀 160 · 📦 36 · 📋 75 - 54% open · ⏱️ 17.06.2019):

     git clone https://github.com/scikit-learn-contrib/lightning
    
  • PyPi (📥 360 / month · 📦 2 · ⏱️ 04.11.2017):

     pip install sklearn-contrib-lightning
    
modAL (🥉20 · ⭐ 620) - A modular active learning framework for Python. MIT
  • GitHub (👨‍💻 7 · 🔀 100 · 📦 20 · 📋 49 - 32% open · ⏱️ 11.11.2019):

     git clone https://github.com/modAL-python/modAL
    
  • PyPi (📥 670 / month · 📦 3 · ⏱️ 11.11.2019):

     pip install modAL
    
Trax (🥉19 · ⭐ 1.5K · 🐣) - Trax your path to advanced deep learning. Apache-2
  • GitHub (👨‍💻 16 · 🔀 110 · 📦 3 · 📋 14 - 71% open · ⏱️ 21.01.2020):

     git clone https://github.com/google/trax
    
  • PyPi (📥 840 / month · ⏱️ 17.01.2020):

     pip install trax
    
gplearn (🥉19 · ⭐ 750 · 💤) - Genetic Programming in Python, with a scikit-learn inspired API. BSD-3
  • GitHub (👨‍💻 10 · 🔀 120 · 📦 49 · 📋 120 - 17% open · ⏱️ 12.06.2019):

     git clone https://github.com/trevorstephens/gplearn
    
  • PyPi (📥 1.3K / month · 📦 7 · ⏱️ 01.06.2019):

     pip install gplearn
    
creme (🥉19 · ⭐ 500) - Online machine learning in Python. BSD-3
  • GitHub (👨‍💻 16 · 🔀 58 · 📦 5 · 📋 160 - 35% open · ⏱️ 22.01.2020):

     git clone https://github.com/creme-ml/creme
    
  • PyPi (📥 660 / month · ⏱️ 11.11.2019):

     pip install creme
    
vecstack (🥉19 · ⭐ 450) - Python package for stacking (machine learning technique). MIT
  • GitHub (👨‍💻 1 · 🔀 60 · 📦 33 · ⏱️ 30.10.2019):

     git clone https://github.com/vecxoz/vecstack
    
  • PyPi (📥 4.6K / month · 📦 4 · ⏱️ 12.08.2019):

     pip install vecstack
    
cleanlab (🥉18 · ⭐ 690) - Finding label errors in datasets and learning with noisy labels. MIT
  • GitHub (👨‍💻 3 · 🔀 65 · 📋 19 - 10% open · ⏱️ 30.12.2019):

     git clone https://github.com/cgnorthcutt/cleanlab
    
  • PyPi (📥 570 / month · ⏱️ 08.11.2019):

     pip install cleanlab
    
dabl (🥉18 · ⭐ 300) - Data Analysis Baseline Library. BSD-3
  • GitHub (👨‍💻 9 · 🔀 26 · 📋 69 - 66% open · ⏱️ 22.01.2020):

     git clone https://github.com/amueller/dabl
    
  • PyPi (📥 240 / month · ⏱️ 05.12.2019):

     pip install dabl
    
BioPandas (🥉18 · ⭐ 230) - Working with molecular structures in pandas DataFrames. BSD-3
  • GitHub (👨‍💻 5 · 🔀 52 · 📦 28 · 📋 29 - 44% open · ⏱️ 10.12.2019):

     git clone https://github.com/rasbt/biopandas
    
  • PyPi (📥 360 / month · 📦 1 · ⏱️ 09.07.2019):

     pip install biopandas
    
  • Conda:

     conda install -c conda-forge biopandas
    
apricot (🥉16 · ⭐ 240) - apricot implements submodular optimization for the purpose of selecting subsets of.. MIT
  • GitHub (👨‍💻 2 · 🔀 23 · 📦 2 · 📋 3 - 33% open · ⏱️ 08.01.2020):

     git clone https://github.com/jmschrei/apricot
    
  • PyPi (📥 160 / month · ⏱️ 03.11.2019):

     pip install apricot-select
    
baikal (🥉15 · ⭐ 380) - A graph-based functional API for building complex scikit-learn pipelines. BSD-3
  • GitHub (👨‍💻 1 · 🔀 17 · 📦 1 · 📋 8 - 50% open · ⏱️ 12.01.2020):

     git clone https://github.com/alegonz/baikal
    
  • PyPi (📥 85 / month · ⏱️ 16.11.2019):

     pip install baikal
    
Show 2 hidden projects...
Cython BLIS (🥈25 · ⭐ 140) - Fast matrix-multiplication as a self-contained Python library no system.. BSD-3
  • GitHub (👨‍💻 5 · 🔀 14 · 📦 2K · 📋 10 - 80% open · ⏱️ 14.11.2019):

     git clone https://github.com/explosion/cython-blis
    
  • PyPi (📥 680K / month · 📦 50 · ⏱️ 23.08.2019):

     pip install blis
    
  • Conda:

     conda install -c conda-forge cython-blis
    
mlens (🥉19 · ⭐ 580 · 💀) - ML-Ensemble high performance ensemble learning. MIT
  • GitHub (👨‍💻 7 · 🔀 72 · 📦 35 · 📋 73 - 13% open · ⏱️ 30.10.2018):

     git clone https://github.com/flennerhag/mlens
    
  • PyPi (📥 5.2K / month · ⏱️ 30.10.2018):

     pip install mlens
    

Related Resources

Contributors

TBD

Contribution

Contributions are encouraged and always welcome! If you would like to add or update projects, choose one of the following ways:

  • Open an issue by selecting one of the provided categories on the issue page and filling in the requested information.
  • Edit the projects.yaml file directly on GitHub and create a pull request with your additions or changes.
  • Fork this repository, clone it to your computer, modify the projects.yaml file, and submit a pull request.
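
The fork-based route in the last bullet can be sketched with plain git commands. This is only a minimal sketch: the fork URL, the branch name, and the commit message are placeholders you supply, and the two helper functions (`prepare_fork`, `publish_changes`) are hypothetical names introduced for illustration, not part of this repository.

```shell
# Sketch only: fork URL, branch name, and commit message are placeholders.

# Clone your fork and start a topic branch for the change.
prepare_fork() {
  fork_url="$1"   # URL of your own fork (hypothetical value)
  branch="$2"     # topic branch name, e.g. add-my-project (hypothetical value)
  git clone "$fork_url" best-of-ml-python &&
    cd best-of-ml-python &&
    git checkout -b "$branch"
}

# After editing projects.yaml, commit the change and push the branch to your fork.
publish_changes() {
  branch="$1"
  message="$2"
  git add projects.yaml &&
    git commit -m "$message" &&
    git push origin "$branch"
}
```

A typical session would call `prepare_fork <your-fork-url> add-my-project`, edit projects.yaml in an editor, and then call `publish_changes add-my-project "Add my project"`. The pull request itself is then opened from the pushed branch in the GitHub web interface.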

If you would like to contribute to the project metadata collection or markdown generation, or share suggestions about them, please refer to the best-of-generator repository.

For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.

License

CC0



Created and maintained with ❤️ by developers from SAP in Berlin.
