GithubHelp home page GithubHelp logo

lendingclubcasestudy's Introduction

Lending Club Case Study

Lending Club is US based peer-to-peer consumer finance company founded in 2007.

It specialises in lending various types of loans to urban customers.

It matches borrowers with investors willing to fund their loans.

When the company receives a loan application, the company has to make a decision for loan approval based on the applicant’s profile. Two types of risks are associated with the bank’s decision:

  • If the applicant is likely to repay the loan, then not approving the loan results in a loss of business to the company.
  • If the applicant is not likely to repay the loan, i.e. he/she is likely to default, then approving the loan may lead to a financial loss for the company.

Loan Dataset

Loan Dataset

Table of Contents

Problem Statement

Business Understanding

When a person applies for a loan, there are two types of decisions that could be taken by the company:

Loan accepted: If the company approves the loan, there are 3 possible scenarios described below:

  • Fully paid: Applicant has fully paid the loan (the principal and the interest rate)

  • Current: Applicant is in the process of paying the instalments, i.e. the tenure of the loan is not yet completed. These candidates are not labelled as 'defaulted'.

  • Charged-off: Applicant has not paid the instalments in due time for a long period of time, i.e. he/she has defaulted on the loan

Loan rejected: The company had rejected the loan (because the candidate does not meet their requirements etc.). Since the loan was rejected, there is no transactional history of those applicants with the company and so this data is not available with the company (and thus in this dataset)

Business Objectives

This company is the largest online loan marketplace, facilitating personal loans, business loans, and financing of medical procedures. Borrowers can easily access lower interest rate loans through a fast online interface.

Like most other lending companies, lending loans to ‘risky’ applicants is the largest source of financial loss (called credit loss). Credit loss is the amount of money lost by the lender when the borrower refuses to pay or runs away with the money owed.

In other words, borrowers who default cause the largest amount of loss to the lenders. In this case, the customers labelled as 'charged-off' are the 'defaulters'.

If one is able to identify these risky loan applicants, then such loans can be reduced thereby cutting down the amount of credit loss. Identification of such applicants using EDA is the aim of this case study.

In other words, the company wants to understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default. The company can utilise this knowledge for its portfolio and risk assessment.

To develop your understanding of the domain, you are advised to independently research a little about risk analytics (understanding the types of variables and their significance should be enough).

General Information

Here we will be using EDA to understand how consumer attributes and loan attributes influence the tendency of default.

  • Steps for EDA :
  1. Data Understanding
  2. Data Cleaning
  3. Univariate Analysis
  4. Bivariate Analysis
  5. Multivariate Analysis
  6. Conclusion
- Data Set : Loan Lending Club loans

Download the dataset from below. It contains the complete loan data for all loans issued through the time period 2007 t0 2011.

Loan Zip file

Loan CSV file

You can access the data dictionary which describes the meaning of these variables from the provided link below:

Data dictionary for the fields used

Jupyter Notebook

Presentation PDF

Conclusion:

Continuous Variable:

  • Loans exceeding 15,000 exhibit a higher likelihood of borrower default.
  • Funding amounts surpassing 15,000 correlate with elevated default rates.
  • Investments exceeding 15,000 demonstrate a higher tendency for default.

Interest Rate Influence:

  • A notable increase in default rates accompanies rising interest rates.
Income and Debt-to-Income Ratio (DTI):
  • Elevated annual incomes correspond to reduced default rates.
  • Conversely, increasing DTI correlates with heightened default rates.

Influence of Academic Achievement and Income:

  • Higher incomes coupled with favorable academic performance typically result in fewer defaults.

Creditworthiness Assessment:

  • Grades and subgrades predominantly reflect the creditworthiness of borrowers.

Categorical Variable:

  • Loan terms of 60 months exhibit a higher default propensity compared to 36-month terms.
  • Default rates escalate as the grade (A, B, C, D, E, F, G) decreases.
  • Similarly, default rates rise with decreasing subgrade (A1, A2, B1, B2, etc.).

Verification and Borrower Type:

  • Verified borrowers tend to display a higher percentage of defaulted loans.
  • Small business borrowers are associated with a heightened default risk.

Geographical Influence:

  • The percentage of defaulted loans is notably higher in states like NE and relatively elevated in NV and SD.

Technologies Used

  • matplotlib - 3.7.1
  • Python 3.10.11
  • seaborn - 0.12.2
  • pandas - 2.0.2
  • numpy - 1.24.3

Acknowledgements

Contact

Created by [@aravindvcyber] feel free to contact me!

lendingclubcasestudy's People

Contributors

aravindvcyber avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.