Topic: delta-lake Goto Github
Some thing interesting about delta-lake
Some thing interesting about delta-lake
delta-lake,The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Organization: adidas
Home Page: https://adidas.github.io/lakehouse-engine-docs/
delta-lake,Spark structured streaming examples with using of version 3.5.1
User: andrewkuzmin
delta-lake,This repository exemplifies a simple ELT process using delta to perform upsert and remove data files that aren't in the latest state of the transaction log for the table.
User: anneglienke
delta-lake,Stream Loader for Apache Doris
Organization: apache
Home Page: https://doris.apache.org
delta-lake,Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Organization: apache
Home Page: https://xtable.apache.org/
delta-lake,Delta Lake Examples
User: aravinthsci
delta-lake,Amazon SageMaker Local Mode Examples
Organization: aws-samples
delta-lake,Hive helper functions for apache spark users
User: brayanjuls
delta-lake,Free High-Quality Financial Data in Azure
User: cheukhin1024
delta-lake,Native Delta Lake Implementation in Go
User: csimplestring
delta-lake,Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
User: dacort
Home Page: https://dacort.dev/posts/modern-data-lake-storage-layers/
delta-lake,A Delta Lake reader for Dask
Organization: dask-contrib
delta-lake,Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations should be performed.
Organization: databeans
delta-lake,This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Organization: databricks
Home Page: https://learning.oreilly.com/library/view/learning-spark-2nd/9781492050032/
delta-lake,DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-rc1 release.Documentation here - https://databrickslabs.github.io/delta-oms/v0.7.0-rc1/
Organization: databrickslabs
Home Page: https://databrickslabs.github.io/delta-oms/
delta-lake,A quick example for Delta Lake running on AWS EMR Serverless Spark
User: davidshtian
delta-lake,A Minimalistic Rust Implementation of Delta Sharing Server.
Organization: delta-incubator
delta-lake,An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Organization: delta-io
Home Page: https://delta.io
delta-lake,A native Rust library for Delta Lake, with bindings into Python
Organization: delta-io
Home Page: https://delta-io.github.io/delta-rs/
delta-lake,An open protocol for secure data sharing
Organization: delta-io
Home Page: https://delta.io/sharing
delta-lake,UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions
User: ebonnal
delta-lake, Build Your First End-to-End Lakehouse Solution (aka.ms/fabconlake)
User: ekote
Home Page: https://azuredataconf.com/#!/workshop/Build%20Your%20First%20End-to-End%20Lakehouse%20Solution/6194
delta-lake,Introducing Delta-Buddy: Your ultimate Delta Lake companion! 🚀 Streamline your data journey with an AI-powered chatbot. Ask Delta-Buddy anything about your Delta Lake.
User: fvaleye
delta-lake,Spark data pipeline that processes movie ratings data.
User: guidok91
delta-lake,Spark Structured Streaming data pipeline that processes movie ratings data in real-time.
User: guidok91
delta-lake,Template to spin up delta lake locally using docker
User: handreassa
delta-lake,Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.
User: harrydevforlife
delta-lake,Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
User: izhangzhihao
delta-lake,dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats
User: jaehyeon-kim
delta-lake,The Internals of Delta Lake
Organization: japila-books
Home Page: https://books.japila.pl/delta-lake-internals
delta-lake,Read Delta tables without any Spark
User: jeppe742
delta-lake,Spark in Action, 2nd edition - chapter 16 - exporting data, using delta lake
User: jgperrin
Home Page: http://jgp.net/sia
delta-lake,Sample project to demonstrate data engineering best practices
User: josephmachado
Home Page: https://www.startdataengineering.com/post/de_best_practices/
delta-lake,Lakehouse storage system benchmark
Organization: lhbench
Home Page: https://lhbench.cs.berkeley.edu
delta-lake,This repository contains my projects and assignments developed during the Learn SQL Basics for Data Science Specialization available on Coursera.
User: marcoshsq
Home Page: https://www.coursera.org/account/accomplishments/specialization/certificate/T4CU2PVZ7QEF
delta-lake,Exercícios do módulo 1 - Bootcamp EDC - IGTI 2021
User: neylsoncrepalde
delta-lake,Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
Organization: nike-inc
Home Page: https://engineering.nike.com/koheesio/
delta-lake,Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Organization: roapi
Home Page: https://roapi.github.io/docs
delta-lake,Analytical database for data-driven Web applications 🪶
Organization: splitgraph
Home Page: https://seafowl.io
delta-lake,StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
Organization: starrocks
Home Page: https://starrocks.io
delta-lake,Books and Papers in Mathematics, Econometrics, Machine Learning, Finance etc for different levels that can be useful for Data Scientists, Developers and everyone whoo is interesting in STEM.
User: tatevkaren
delta-lake,Awesome content all about Azure Databricks
User: tfayyaz
delta-lake,Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Organization: tikal-fuseday
delta-lake,Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture
User: victorskl
delta-lake,Repositório dedicado a Workshop de Data Lakehouse com Delta Lake
User: vvalcristina
delta-lake,Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta Lake
User: wazzabeee
delta-lake,PawMark is a platform for developers to build, schedule and monitor data pipelines.
User: xuwenyihust
delta-lake,Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
User: ysfesr
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.