GithubHelp home page GithubHelp logo

datatrekkers's Projects

adtk icon adtk

A Python toolkit for rule-based/unsupervised anomaly detection in time series

amundsen icon amundsen

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

avro-compose icon avro-compose

Utility framework for composing Avro Schemas. From smaller components (types) specified in separate files into large schemas ready to be deployed to schema registry or your application

awesome-etl icon awesome-etl

A curated list of awesome ETL frameworks, libraries, and software.

awesome-selfhosted icon awesome-selfhosted

A list of Free Software network services and web applications which can be hosted on your own servers

bian icon bian

The Banking Industry Architecture Network e.V. (BIAN) model in Archimate 3

blazon icon blazon

A python library for assuring data structure and format via schemas like JSON Schema

blo icon blo

Static blog generator

buildah icon buildah

A tool that facilitates building OCI images.

cherrypy icon cherrypy

CherryPy is a pythonic, object-oriented HTTP framework. https://docs.cherrypy.org/

cli icon cli

GitHub’s official command line tool

cobrix icon cobrix

A COBOL parser and Mainframe/EBCDIC data source for Apache Spark

compose icon compose

A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.

connector-x icon connector-x

Fastest library to load data from DB to DataFrames in Rust and Python

data-accelerator icon data-accelerator

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

data-models icon data-models

A joint collaboration program to support the adoption of a reference architecture and compatible common data models underpinning a digital market of interoperable and replicable smart solutions.

data-pipelines-with-apache-airflow icon data-pipelines-with-apache-airflow

Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3

dataclasses-avroschema icon dataclasses-avroschema

Generate Avro Schemas from a Python class. Serialize and Deserialize python instances with avro schemas

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.