GithubHelp home page GithubHelp logo

Hi there, I'm Koushik! 👋

Welcome to my GitHub profile! I'm a seasoned Data Engineer specialized in designing, developing, and optimizing scalable batch & streaming data pipelines. With a knack for Data Warehousing and ETL frameworks, I'm here to turn data into actionable insights.

🔗 Let's Connect:
LinkedIn

🛠️ Skills and Expertise

I bring a diverse set of skills to the table, ranging from cloud computing to real-time data streaming.

  • Cloud Platforms: AWS, Azure, GCP
  • Data Warehousing & BI: AWS Redshift, Azure Synapse, Snowflake, Google BigQuery
  • Data Integration: ETL Batch, Real Time Streaming
  • Programming: Python, Bash Shell, Scala
  • Big Data Tech: Apache Spark, Hadoop, Hive
  • Data Streaming: Confluent Kafka, Kinesis, Event Hub, Pub/Sub
  • Serverless: AWS Lambda, Azure Functions, Google Cloud Functions
  • Data Lakes: AWS S3, ADLS, HDFS, GCS
  • Databases: Oracle, PostgreSQL, SQL Server, MongoDB, DynamoDB
  • Scheduling & Automation: Apache Airflow
  • DevOps: Jenkins, Terraform, Cloud Formation
  • Search & Analytics: Elasticsearch, Kibana

⭐️ From koushikt

Skittle's Projects

auto-gpt icon auto-gpt

An experimental open-source attempt to make GPT-4 fully autonomous.

awesome-python icon awesome-python

A curated list of awesome Python frameworks, libraries, software and resources

event-fetcher icon event-fetcher

This repository hosts a cloud-based data pipeline built on AWS. The pipeline is designed to scrape web data using a Python script, process the data, and store the results in a CSV file in an S3 bucket. The pipeline is triggered every day at midnight. It leverages several AWS services including EventBridge, Step Functions, Lambda, EC2, SSM

gq-great-expectations icon gq-great-expectations

Great Expectations Data Quality Checks is a specialized repository designed to harness the capabilities of the great_expectations Python library. With a focus on ensuring data quality, this project provides robust tools and methodologies to validate and check data across various sources.

great-expectations-dq icon great-expectations-dq

This script performs a series of data quality checks and generates a data profiling report. It utilizes Great Expectations for data validation and ydata_profiling for data profiling. The script reads data from a PostgreSQL database, applies various quality checks, and outputs validation results and a data profile HTML report.

kachow icon kachow

Config files for my GitHub profile.

kinesis-lambda icon kinesis-lambda

This repo consists of python lambdas for reading and writing an AWS kinesis stream.

lambda_layer_maker icon lambda_layer_maker

This is a script to make create, update and ready AWS Lambda layers you have in your account.

postgres-mongo-migrator icon postgres-mongo-migrator

This repository contains a Python script for migrating data from a PostgreSQL database to a MongoDB database. The script is designed to be robust and fault-tolerant, capable of handling large datasets.

python-oneliners icon python-oneliners

Some of the Python One-Liners which I regularly use and feel saves a lot of time.

teams-pipeline icon teams-pipeline

Set of scripts and instructions for setting up a Microsoft Teams webhook pipeline with potential Azure functions integration. Includes automation scripts, database structures, and a comprehensive setup guide.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.