nuthanreddy Goto Github PK
Type: User
Type: User
Code to accompany Advanced Analytics with Spark from O'Reilly Media
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Scripts to benchmark distributed Alternative Least Squares (ALS)
Code to build a simple analytics data pipeline with Python
A skeleton for creating App Engine applications using the Django framework.
Cut and paste your surroundings using AR
Library for configuration management API
:sunglasses: Curated list of awesome lists
A curated list of awesome big data frameworks, ressources and other awesomeness.
A curated list of awesome Go frameworks, libraries and software
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
A curated list of awesome Apache Spark packages and resources.
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
BigDL: Distributed Deep Learning Library for Apache Spark
Step-by-step Deep Leaning Tutorials on Apache Spark using BigDL
Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.
Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm
ZooKeeper client wrapper and rich ZooKeeper framework
Check out Dash, the Documentation Browser for 200+ APIs https://kapeli.com/dash
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
The resources of the preparation course for Databricks Data Engineer Associate certification exam
Source-agnostic distributed change data capture system
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Portably control DNS clouds using java or bash
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.