sagarlakshmipathy Goto Github PK
Name: Sagar Lakshmipathy
Type: User
Company: Onehouse
Bio: Blah
Location: Redmond, WA
Name: Sagar Lakshmipathy
Type: User
Company: Onehouse
Bio: Blah
Location: Redmond, WA
This repo contains a data pipeline code written in python and scheduled in Apache Airflow. It monitors, fetches and cleans tweets and stores in HDFS. And finally, loads it to a hive database.
Safe blue/green deployment of Amazon SageMaker endpoints using AWS CodePipeline, CodeBuild and CodeDeploy.
This project lets users merge two analyses together and create a new analysis or even update the target analysis in place.
Course Files for Complete Python 3 Bootcamp Course on Udemy
Adrian's AWS CSA Assoc Course Refresh 2019
This is a repo with links to everything you'd ever want to learn about data engineering
Free Data Engineering course!
wip
This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.
Upserts, Deletes And Incremental Processing on Big Data.
This repository serves as a guide to work with Hudi tables on Databricks environment
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Performed Regression Analysis using PySpark to predict Life Expectancy based on World Population data.
Guide on creating an API for serving your ML model
The code was written for Big Data Infrastructure final capstone project to predict the customer churn for a telecom company. Data was sourced from Kaggle but can run on databricks independent of supporting documents/datasets. It includes techniques like hyperparameter tuning for feature engineering and model evaluation. Random Forest Classifier model served us with the best accuracy at 71%.
Dimensionality reduction and image recognition on MNIST data set using PCA & T-SNE and SVM.
The code below was written in Pig Latin which finds the best movies sorted by time and worst of the movies sorted by the times it was rated. Repository includes, code for the analysis and datasets 1. "u.data" file (contains rating info) and 2. "u.item" file (metadata).
This repository is to help getting people started with OneTable
A Simple Py4J implementation
Files for Udemy Course on Algorithms and Data Structures
A Python refresher section for all our courses
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.