GithubHelp home page GithubHelp logo

tahsinjahinkhalid / credit_card_approval_classification Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 519 KB

This is a machine learning project that analyzes the Kaggle Dataset and does a classification task of determining whether a credit card is approved or not.

License: MIT License

Python 100.00%
classification data-science machine-learning orange-data-mining tahsinjahinkhalid

credit_card_approval_classification's Introduction

Credit Card Approval Classification

By: Tahsin Jahin Khalid

This is a machine learning project that analyzes the Kaggle Dataset and does a classification task of determining whether a credit card is approved or not.

Technology Used:

  • Python (for Data Preprocessing)
  • Orange Data Mining

Dataset

About Dataset

Commercial banks receive a lot of applications for credit cards. Many of them get rejected for many reasons, like high loan balances, low income levels, or too many inquiries on an individual's credit report, for example. Manually analyzing these applications is mundane, error-prone, and time-consuming.

About Project

Orange Data Mining Workflow

Project Summary

  1. The preprocessed/cleaned data CSV file is loaded visa the CSV File Import widget.

  2. A Select Columns widget is used to designate the "ApprovalStatus" variable as the target variable. The "Unnamed: 0" column is added to the ignore columns field.

  3. We use a Distributions widget to visualise the target variable to check if there is any imbalance.

  4. The stage is training and validating various models on the datasert for classification. For this project we have used logistic regression, decision tree, random forest, neural network and a stacked model of these for comparison.

  5. The metrics of the model (Test and Score widget) are shown below:

    • From these metrics, it can be seen that:
      • Out of the five models used, the stacked model and the neural network models have performed better than the baseline logistic regression model.
      • Comparing between the stacked model and the neural network, the neural nerwork (NN) model has performed marginally better in classification
      • The confusion Matrix of the NN model on training/validation data is shown below:
  6. GPT4 is used to generate synthetic testing data to test the performance of the model on unseen data.

  7. The NN model is used to make predictions on the unseen test data.

    • The AUC on testing data is 0.566, and the Classification Accuracy is 52%.
    • Inspecting the confusion matrix, the model's performance on test data is not optimal.

credit_card_approval_classification's People

Contributors

tahsinjahinkhalid avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.