GithubHelp home page GithubHelp logo

Hello there,

My name is Pablo Gomes de Miranda.

I am actively seeking professional opportunities as a Data Scientist, with a particular interest in roles where I can utilize data to help companies make informed decisions that drive positive outcomes.

While my educational background includes a bachelor's, master's, and PhD in different fields of Humanities, I am currently focusing on expanding my knowledge and skills in various tools used in Data Science. I am actively working on projects to build a portfolio that showcases my abilities.

I believe that my extensive experience in Education and History has equipped me with strong communication skills and the ability to offer unique solutions in the field of Data Science.

Data Science Projects:

This is a clustering project where we worked on segmenting customers for DataSmart, a fictitious e-commerce, with the purpose of creating a loyalty program called Insiders. The planned segmentation followed an RFM logic, where Recency can be considered as the time since the last purchase and the responsibility of our customers, Frequency as the time between transactions and their engagement on the platform, and Monetary as the total revenue and which high-value purchases were made. Using the data available on Kaggle, we carried out an end-to-end project with deployment on AWS, where we elected a cluster of 86 customers with an average gross revenue of US$4179.93.

Tools used:

  1. Python 3.10.10;
  2. VS Code;
  3. Jupyter Notebook;
  4. YData-Profiling;
  5. Metabase;
  6. SQL: SQLite and PostgreSQL;
  7. Git and Github;
  8. Amazon Web Services: S3, RDS and EC2.

This is a Learning to Rank (LTR) project in which the objective is to classify and rank clients interested in purchasing vehicle insurance. The company SafeHarbor Insurance is a fictitious insurance company made up by us, in order to provide a business context for our problem. The data have been acquired in the challenge Health Insurance Cross Sell Prediction from Kaggle, We perform an exploratory data analysis, train different classification Machine Learning models, evaluate the metrics, and test their results.

Tools used:

  1. Python 3.10.10;
  2. VS Code;
  3. Jupyter Notebook;
  4. PostgreSQL;
  5. Git and Github;
  6. Render Cloud;
  7. Flask;
  8. Google Sheets Apps Script.

This is a Classification project where we were hired to develop a model that could help a medical company detect the onset of cardiovascular diseases among patients. Medical data was collected from Kaggle, and in the end, we achieved a classification model that, in the worst-case scenario, with 72% precision, could bring a return of US$ 175,000,000.00, and in the best-case scenario, with 78% precision, a profit of US$ 210,000,000.00 could be expected.

Tools used:

  1. Python 3.10.8;
  2. VS Code;
  3. Jupyter Notebook;
  4. Git and Github;

This is a Regression problem for a sales Forecasting in which we propose the sales prediction of a European pharmaceutical company, Dirk Rossmann GmbH. The data was collected via Kaggle from the Rossmann Store Sales competition. After an exploratory data analysis and the use of an algorithm called boruta to select the best features for a prediction using a XGBoost Regressor Machine Learning model, we have achieved an average prediction of sales of €285,338,016.00 for the next six weeks and implemented the solution in a way that is easily accessible for the company's business team.

Tools used:

  1. Python 3.9.13
  2. VS Code
  3. Jupyter Notebook
  4. Heroku: Cloud Application Platform
  5. Telegram Messenger

Data Analysis:

This is an exploratory data analysis (EDA) project whose objectives are to generate insights to answer two simple questions asked by a fictitious real estate company: given a list of properties:

  1. which ones should be acquired and
  2. what are the sales conditions to obtain the highest profit.

Tools used:

  1. Python 3.9.13
  2. VS Code
  3. Jupyter Notebook
  4. Streamlit
  5. Streamlit Community Cloud

We answered both questions by delivering two csv files containing a list of 157 properties that can be acquired at a reasonable price by the company and sold in different seasons making a good profit. If House Rocket acquire and sell all the suggested properties, it can be expected a total profit of US$24222890.20

This is an exercise to understand the basics of Python, practice data manipulation, and also have a grip on the libraries and packages of this programming language. We also exercised code versioning, both in local and remote repositories. The goal was to produce a list of motorcycles, according to a series of specifications, that could be purchased by a company with the purpose of obtaining profit from their resale.

Tools Used:

  1. Python 3.10.8;
  2. VS Code;
  3. Jupyter Notebook;
  4. Git and Github;
  5. Streamlit Cloud.

Simple dashboard using Microsoft Power BI to demonstrate my data manipulation skills and ability to prepare dashboards with the appropriate tools. The data used was collected from a real survey conducted by a YouTube channel.

Tools Used:

  1. Microsoft Power BI;
  2. Microsoft Excel;
  3. Github.

You can reach me through my e-Mail or LinkedIn

Pablo Miranda's Projects

dio-lab-open-source icon dio-lab-open-source

Repositório do lab "Contribuindo em um Projeto Open Source no GitHub" da Digital Innovation One.

html_dio_1 icon html_dio_1

Projeto pessoal simples de um site estático em HTML

html_dio_2 icon html_dio_2

Projeto pessoal simples de um site estático em HTML

html_dio_3 icon html_dio_3

Projeto pessoal simples de um site estático em HTML

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.