Joshua Omolewa's Projects
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
Youtube Apache NiFi 2022 Series resources
š Papers & tech blogs by companies sharing their work on data science & machine learning in production.
:octocat: A curated awesome list of lists of interview questions. Feel free to contribute! :mortar_board:
A list of useful Apache NiFi resources, processor bundles and tools
Practicing CI/CD using github actions
Docker images for Debezium. Please log issues in our JIRA at https://issues.redhat.com/projects/DBZ/summary
Covid 19 Canada data analysis
This is a repo with links to everything you'd ever want to learn about data engineering
Great resources for data engineering
My data engineering practice
Data Engineering Practice Problems
Examples for running Debezium (Configuration, Docker Compose files etc.)
Workshop on optimizing PySpark pipelines.
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
DevOps resources - Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP
ETL project that uses docker container containing a python script to extract the csv data, transform the csv data by combining files into a single file and then load data into an output folder and also ensure the output csv file file is still available even if the container is shutdown.
This is an AWS data engineering serverless project to track Edmonton weather in near real time using services like Kinesis Data Firehose, S3, AWS lambda, AWS Glue, Athena, IAM,
Apache Flink
This repository is a getting started guide to Singer.
I use this repo to practice my git skills
Building an ETL pipeline using AWS services that extract data from a Job API and then transforms data to meet business requirements and load data to S3 bucket
Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.
Includes notes on Apache Spark, Spark for Physics, Jupyter notebook examples for Spark, Oracle and other DB systems.
This is a GitHub for all of my NiFi Templates
Google IT Automation with Python Professional Certificate - Practice files