GithubHelp home page GithubHelp logo

Databricks Labs's Projects

arcuate icon arcuate

Delta Sharing + MLflow for ML model & experiment exchange (arcuate delta - a fan shaped river delta)

automl-toolkit icon automl-toolkit

Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Model selection and training, Hyper parameter optimization and selection, Model interprability.

blueprint icon blueprint

Baseline for Databricks Labs projects written in Python

databricks-sync icon databricks-sync

An experimental tool to synchronize source Databricks deployment with a target Databricks deployment.

dbldatagen icon dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

dbx icon dbx

🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.

delta-oms icon delta-oms

DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-rc1 release.Documentation here - https://databrickslabs.github.io/delta-oms/v0.7.0-rc1/

discoverx icon discoverx

A Swiss-Army-knife for your Data Intelligence platform administration.

dlt-meta icon dlt-meta

This is metadata driven DLT based framework for bronze/silver pipelines

dolly icon dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

feature-factory icon feature-factory

Accelerator to rapidly deploy customized features for your business

geoscan icon geoscan

Geospatial clustering at massive scale

lsql icon lsql

Lightweight SQL execution wrapper only on top of Databricks SDK

migrate icon migrate

Old scripts for one-off ST-to-E2 migrations. Use "terraform exporter" linked in the readme.

mosaic icon mosaic

An extension to the Apache Spark framework that allows easy and fast processing of very large geospatial datasets.

overwatch icon overwatch

Capture deep metrics on one or all assets within a Databricks workspace

remorph icon remorph

Cross-compiler into Databricks Lakehouse

sandbox icon sandbox

Experimental or low-maturity things

tempo icon tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.