GithubHelp home page GithubHelp logo

customer_clustering_analysis's Introduction

Customer Clustering Analysis

Developed a model for one of the retail store in Boulder to get sense of customers and classify them into different cohorts/groups so that suitable marketing strategies can be used for individual cohort.

image image

Problem Statement

Walmart has asked us to develop a model for one of the retail store in Boulder. If they like your work, they will ask you to replicate the model for other Walmart retail stores as well You would like to develop a model to get sense of customers and classify them into different cohorts/groups You are given the following attributes for 2000 customers (based on the Loyalty card they use at checkout)

  • Customer ID
  • Sex/Gender
  • Martial Status
  • Education
  • Income
  • Occupation (Unskilled/Skilled/Highly skilled)
  • Settlement Size (The size of the city that the customer lives in)

The model should be able to classify customers into different cohorts so that suitable marketing strategies can be used for individual cohort

About Boulder:

  • Boulder is a small town located in Colorado. Many high school and university students come to Boulder for attaining education
  • After attaining education, students get placed in top-notch software companies in Boulder.
  • However, due to limited jobs in Boulder, some students look for job opportunity in their home-town cities or nearby big cities such as Louisville, Westminster, etc. These big cities are a few miles away from Boulder

Dataset

Data Source : https://github.com/keshavsood80/Customer-Clustering-Analysis/blob/main/segmentation%20data.csv

Understand the Variables

Variable Data type Range Description
ID numerical Integer Shows a unique identificator of a customer
Sex categorical {0,1} 0 - male , 1 - female
Marital status categorical {0,1} 0 - single , 1 - non-single (divorced / separated / married / widowed)
Age numerical Integer The age of the customer in years
Education categorical {0,1,2,3} Level of education of the customer -- 0 - other / unknown , 1 - high school, 2 - university , 3 - graduate school
Income numerical Real Self-reported annual income in US dollars of the customer
Occupation categorical {0,1,2} Category of occupation of the customer -- 0 - unemployed / unskilled , 1 - skilled employee / official , 2 - management / self-employed / highly qualified employee / officer
Settlement size categorical {0,1,2} The size of the city that the customer lives in -- 0 - small city , 1 - mid-sized city , 2 - big city

Agenda

  • Step 1: Import Libraries
  • Step 2: Import Dataset
  • Step 3: Exploratory Data Analysis
  • Step 4: Univariate Data Analysis
  • Step 5: Bi-variate Analysis
  • Step 6: Data Preparation
  • Step 7: Modelling
  • Step 8: Evaluate Performance

Models used

  • Hierarchical Clustering
  • K-Means Clustering

Solution

Analysis of Clusters

image

Cluster Name Analysis
(0) Fewer opportunity High school passed people employed mostly in low skilled jobs (living in small cities)
(1) Career Focused Single men focusing on only careers earning 2nd highest salary
(2) Well Settled Married elder people with University+ education and High Income
(3) Average High school passed people employed in skilled job (living in small and medium cities)

customer_clustering_analysis's People

Contributors

keshavsood80 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.