Leakey Mokaya N.'s Projects
Big O notation cheatsheets. algorithms and data structures explanations and implementations
A topic-centric list of HQ open datasets.
a collection of Dataset from various sources
Blockchain.com Data Scientist TakeHome (February 2022)
:link: Some useful websites for programmers.
You are a Data Scientist with a housing agency in Boston MA, you have been given access to a previous dataset on housing prices derived from the U.S. Census Service to present insights to higher management. Based on your experience in Statistics, what information can you provide them to help with making an informed decision?
Predicting shares of Mashable articles based on article metadata
What would you like to do in the next few years? Climb a mountain? Learn to ride a bike? :) Itβs important to keep track of what you have already done and what you are yet to achieve. In the next steps, we will work towards building a bucket list application that helps us record activities we wish to undertake, tick off what we have done and even invite our friends to have fun with us
π€ Build your own (insert technology here)
Intro to time series analysis in R
CIFAR10 multiclass classification
:mortar_board: Path to a free self-taught education in Computer Science!
Analysis of Credit Card Default Dataset of Taiwan for Machine Learning
The increase in the usage of the credit card by the people, the transactions done by credit card increases dramatically in the world. With this drastic increase in the usage of credit cards, the number of fraudulent also increases enormously & it is very difficult to identify the difference between a fraudulent transaction and normal transaction. American Express-issued credit card to 53.7 Million users, however, recorded Rs. 73380 fraud in a year on average. Credit card fraudulent causes serious losses to the individual and the organization. The credit card issuing companies offer credit card fraud detection applications to the users and individuals for their safety. This paper focuses on the different algorithms used for credit card fraud detection and find the optimal algorithm for classification of credit card fraud detection. It uses Logistic Regression, Linear Discriminant Analysis, K-Nearest Neighbors, Support Vector Machine, eXtreme Gradient Boosting, Random Forest and computes the accuracy, AUC-ROC values for all the classifiers.
This is an Artificial Neural Network that can predict, based on 24 attributes of a customer, if an individual customer will default on their payment next month for their credit card (consumer credit risk).
For Data science Projects
For Data Analysis Projects
Materials for a week-long instance of DataCarpentry.
Data Preprocessing Project - Imbalanced classes problem
Collection of useful data science topics along with code and articles
code for Data Science From Scratch book
Data science project code and presentation materials
This contains data analysis projects
This directory is for my Datascience portfolio.
A collection of datasets of ML problem solving
My implementation of Decision Tree ID3 algorithm for all categorical attributes.
Capstone Project: Using machine learning to predict the probability of default of credit card clients.