GithubHelp home page GithubHelp logo

fullstack-data-science-old's Introduction

favorite-python-resources

A list of some favorite libraries in Python as well as specific areas of Python e.g. Data Science, Machine Learning, etc. that I've found helpful.

Popular Python APIs (with code examples)

Popular Python ML APIs

Python Videos/Tutorials

Python Libraries


Data Science Workflows

Linear Algebra

Python Data Science Books


My Favorite Tools

Ones to Check out


Machine Learning Libraries

PyTorch

Hyperparameter Tuners


Notebook Tools

Jupyter Notebook Hubs and Extentions


AutoML

Statistics

Bayesian Statistics


Data Visualization -- Inspiration and Answers

Data Visualization

Data


Monte Carlo Simulations

Data Pipeline Tools

  • Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
  • Modin - Modin: Speed up your Pandas workflows by changing a single line of code
  • Pandas-Profiling - Create HTML profiling reports from pandas DataFrame objects
  • Dask - Parallel computing with task scheduling
  • ray - A system for parallel and distributed Python that unifies the ML ecosystem.
  • Spark - PySpark API
  • Optimus - Agile Data Science Workflows made easy with Pyspark https://hioptimus.com
  • pypeln- Concurrent data pipelines made easy
  • smart_open - Utils for streaming large files (S3, HDFS, gzip, bz2...)
  • Blaze - NumPy and Pandas interface to Big Data
  • Faker - Faker is a Python package that generates fake data for you.
  • Kedro - A Python library for building robust production-ready data and analytics pipelines

Time Series

  • Prophet - A Facebook Time Series Analysis library
  • PyFlux - Time series analysis library with flexible range of modelling and inference options.

NLP/Text Manipulation


Deep Learning Tools

  • Pyro - Pyro is a flexible, scalable deep probabilistic programming library built on PyTorch
  • allennlp - An open-source NLP research library, built on PyTorch.
  • PyTorch-NLP - Supporting Rapid Prototyping with a Toolkit (incl. Datasets and Neural Network Layers)
  • OpenCV-Python - Open Source Computer Vision Library for Python
  • Screenshot-to-code - A neural network that transforms a design mock-up into a static website
  • TensorFlow Models Examples
  • fastai - The fastai deep learning library, plus lessons and and tutorials
  • Keras
  • TensorFlow
  • keras-tuner - Hyperparameter tuning for humans
  • Visdom - A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
  • talos - Hyperparameter Optimization for Keras Models
  • keras-contrib

Recommender Systems

Web Scraping Tools

SDLC Tools

  • yapf - yet another python formatter
  • black - The uncompromising Python code formatter
  • precommit (article) - A framework for managing and maintaining multi-language pre-commit hooks. (Black + Flake8)

Others

  • wget - free utility for non-interactive downloading files from the web
  • pendulum - Python datetime manipulation made easy
  • python-dotenv - Get and set values in your .env file in local and production servers.
  • sqlparse - sql parsing tool
  • credstash - A little utility for managing credentials in the cloud

Database Connectors

  • graphene - GraphQL framework for Python

Pub-Sub, Message Queues, Streaming

Web Frameworks Some Benchmarks

  • Pyramid - A Python web framework https://trypyramid.com/
  • sanic - Async Python 3.5+ web server that's written to go fast
  • Tornado - Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
  • Falcon - Falcon is a bare-metal Python web API framework for building high-performance microservices, app backends, and higher-level frameworks. Docs
  • Vibora - Fast, asynchronous and elegant Python web framework.
  • japronto - Screaming-fast Python 3.5+ HTTP toolkit integrated with pipelining HTTP server based on uvloop and picohttpparser.
  • aiohttp - Asynchronous HTTP client/server framework for asyncio and Python https://docs.aiohttp.org
  • fastapi, Docs - FastAPI is a modern, fast (high-performance), web framework for building APIs with Python 3.6+ based on standard Python type hints.

Networking Tools

  • uvloop (article) - Ultra fast asyncio event loop (faster than NodeJS, close to Go speed)

Flask Tools

  • Flask-AppBuilder - Simple and rapid application development framework, built on top of Flask. includes detailed security, auto CRUD generation for your models, google charts and much more.
  • Awesome-Flask - A curated list of awesome Flask resources and plugins

Static File Generators

  • Nikola - A static website and blog generator
  • Pelican - Static site generator that supports Markdown and reST syntax. Powered by Python.

PDF Converters

  • python-pdfkit - Wkhtmltopdf python wrapper to convert html to pdf
  • pdftabextract - A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Documentation Libraries

  • Sphinx
  • pweave - Pweave is a scientific report generator and a literate programming tool for Python. It can capture the results and plots from data analysis and works well with numpy, scipy and matplotlib.

D3

APIs

Hacking, PCAPs, and Network Analysis

  • Sublist3r
  • scapy
  • knock - subdomain scanner
  • scapy-http
  • dpkt
  • kamene
  • pcapy - Pcapy is a Python extension module that interfaces with the libpcap packet capture library.
  • pyshark - Python wrapper for tshark, allowing python packet parsing using wireshark dissectors
  • PyPCAPKit - Python multi-engine PCAP analyse kit.
  • fsociety - fsociety Hacking Tools Pack – A Penetration Testing Framework
  • PayloadsAllTheThings - A list of useful payloads and bypass for Web Application Security and Pentest/CTF

GUIs

Repeatable Python Workflows in Notebooks

Prototyping Projects

Machine Learning - Interesting Repos

ML Articles

Docker Articles

Git Commands

Codelabs

Awesome Links

NBA Shot Charts


Courses

Deep Learning Courses


More Random Packages

Notebook tools -- Older, less maintained

Old...

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.