GithubHelp home page GithubHelp logo

saboye / real-time-data-streaming-pipeline Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 114 KB

A real-time streaming pipeline streams live tweets data from Twitter and ingests the data to the Apache Kafka clusters as a topic, and consumers consume the hashtag tweets as a message.

License: MIT License

Jupyter Notebook 100.00%
api consumer data json kafka producer python realtime topic tweets twitter zookeeper

real-time-data-streaming-pipeline's Introduction




I am an experienced software QA engineer with over six years of experience in the software industry. I hold a Masters's degree in Computer Science from The University of Iowa and have extensive experience in manual and automated testing.

My expertise in automation testing tools such as Selenium, Cypress, Playwright, and JMeter has helped me develop and maintain a robust suite of automated test cases that ensure the quality and reliability of our products. Additionally, I have experience with performance testing, load testing, and stress testing using JMeter and other tools.

I am well-versed in agile software development methodologies and have worked closely with development teams to ensure our testing efforts align with project timelines and goals.

Moreover, my knowledge of SQL and database technologies enable me to validate the integrity of data stored in our products effectively. I am also familiar with data extraction, web scraping, data wrangling, and data acquisition techniques that enable me to collect, transform, and publish data accurately and efficiently.

Beyond my technical skills, I am an avid learner who constantly seeks new opportunities to improve my knowledge and skills. I have attended several training sessions, workshops, and industry conferences that have helped me stay current with the latest developments in the software industry.

Outside of work, I am passionate about staying up-to-date with the latest trends and developments, and I enjoy participating in online communities and events to learn from and connect with other professionals.

  • Programming and Script Language: Javascript, Python, R, SQL, Bash, Java, HTML, CSS
  • Automation Testing: Selenium, SeleniumBase, Cypress, JMeter, and Playwright
  • Working with Different Data Types: JSON, CSV, EXCEL, Text, XML, SQL, Parquet, Avro, ORC
  • Version Controlling, Container Virtualization: Docker
  • Databases: Postgres, SQL Server, MySQL, SQLite, and MongoDB
  • ETL Database Model Development: Carry out new procedures and create various data warehouses
  • Data Warehouse, Data Lakes, Data Pipelines, Automation
  • Gather Requirements from Business Analysts
  • Develop Physical Data Models Using Erwin
  • Create DDL Scripts to Design Database Schema and Database Objects
  • Cloud Computing: AWS, Microsoft Azure AZ-900, Microsoft Azure DP-900, Microsoft Azure AI-900
  • Perform Database Defragmentation and Optimize SQL Queries
  • Improving Database Performance and Loading Speed
  • Framework: PySpark, and Hadoop
  • Data Visualization: Tableau
  • Operating Systems: Windows, Linux (macOS, Ubuntu, Redhat)


๐Ÿ”นFun fact ๐Ÿ‘‰ 01000011 01101111 01100100 01101001 01101110 01100111 00100000

real-time-data-streaming-pipeline's People

Contributors

saboye avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.