GithubHelp home page GithubHelp logo

Hi there 👋

I'm Pooja Yadav a Data Scientist, Data Engineer, and a Master of Science graduate from San Diego State University.
Thank you for visiting my GitHub profile.

About Me ✍

🔬 Currently a Scientist at IFF
🔭 Previously a Data Engineer at Getinge
💬 Ask me about Data Engineering, Machine Learning, and Deep Learning.

💻 Technical Proficiency

Programming Languages: Python, SQL, Java, T-SQL
Databases: PostgreSQL, MySQL, MongoDB, BigQuery, SSIS, DBeaver
Data Analysis: Pandas, NumPy, Tableau, Streamlit, Plotly, Seaborn, Power BI
Big Data: PySpark, PySparkSQL, Spark

☁️ Cloud Services:

  • AWS: S3, EC2, ECS, ECR, Glue, Lambda, Athena, QuickSight
  • GCP: Cloud Compute, BigQuery, Looker Studio
  • Azure: ADLS gen2, Databricks, Synapse, DevOps, OpenAI

Machine Learning and Deep Learning: Scikit-learn, PCA, XGBoost, Linear Regression, Logistic Regression, Clustering Analysis, VGG16, TensorFlow, Keras, Image Segmentation (Computer Vision)

Tools and Frameworks: Jupyter Notebook, VSCode, Docker, Git, Selenium, AirFlow, React, Kafka

🔥 My Stats :

GitHub Streak

Top Langs

Profile Views

Pooja 's Projects

airbnb icon airbnb

Worked with New York city Airbnb historical data. Performed Linear Regression and XGBoost to predict the New York city Airbnb rent with an RMSE value of 63.4.

body_posture_recognition icon body_posture_recognition

Processed video data using OpenCV, extracted body keypoints using Mediapipe and performed k-means clustering analysis.

ethio_hydro icon ethio_hydro

Worked with Ethiopia's rain and temperature time series data. Processed and analyzed 67 million Data and created a dashboard to monitor it.

kafka-airflow-mongodb icon kafka-airflow-mongodb

Implemented data pipeline for validating email and sending OTP using kafka, mongodb, webhooks and airflow

product_app icon product_app

A website for placing an order and taking all its sales data for predictive analysis using Machine Learning Algorithms such as Linear regression and support vector regressor.

titanic icon titanic

Titanic Disaster kaggle challenge. Performed Exploratory data analysis and applied Logistic regression for predicting the survival of the Titanic passengers.

twitter_classification_project icon twitter_classification_project

Applied KNN to predict whether a tweet will be viral or not based on different features. Applied Naive Bayes classifier to classify the tweets based on different locations.

uber_data_analytics icon uber_data_analytics

Data modeling and ETL pipeline for data analytics on Uber dataset using Google cloud storage, BigQuery, and Looker Studio

us-covid-19 icon us-covid-19

Created a dashboard using streamlit, plotly, and plotly-express for analyzing Covid-19 data of the United States.

virltor icon virltor

AI based real estate service for rating 1.2 million housing community of San Diego

word_processor icon word_processor

Flyweight pattern for word processor in python3. Implemented a Flyweight factory that given a unicode code point returns the Flyweight character object for the character and font. Created RunArray for tracking character sequence in the word document.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.