Light

ironhack-labs / lab-customer-analysis-round-1 Goto Github PK

View Code? Open in Web Editor NEW

0.0 2.0 505.0 782 KB

lab-customer-analysis-round-1's Introduction

Lab | Customer Analysis Round 1

Remember the process:

Case Study
Get data
Cleaning/Wrangling/EDA
Processing Data
Modeling
Validation
Reporting

Abstract

The objective of this data is to understand customer demographics and buying behavior. Later during the week, we will use predictive analytics to analyze the most profitable customers and how they interact. After that, we will take targeted actions to increase profitable customer response, retention, and growth.

For this lab, we will gather the data from 3 csv files that are provided in the files_for_lab folder. Use that data and complete the data cleaning tasks as mentioned later in the instructions.

Instructions

Read the three files into python as dataframes
Show the DataFrame's shape.
Standardize header names.
Rearrange the columns in the dataframe as needed
Concatenate the three dataframes
Which columns are numerical?
Which columns are categorical?
Understand the meaning of all columns
Perform the data cleaning operations mentioned so far in class
- Delete the column education and the number of open complaints from the dataframe.
- Correct the values in the column customer lifetime value. They are given as a percent, so multiply them by 100 and change dtype to numerical type.
- Check for duplicate rows in the data and remove if any.
- Filter out the data for customers who have an income of 0 or less.

lab-customer-analysis-round-1's People

Contributors

Watchers

Forkers

ernesto-yelamos drhech bcarrero cornettomlette marcellocanetto catarinamcbatista ahmed-abdelmoteleb louiscostes macarenagortazar vikasrathi1992 danny88o leamartinez1411 ferrazzolidicreddo sogunmola jast92 caitlinsanderson oteri200 vicmatthews hatafred marcpouvi ironhack-course-work ironhackriya haggarw3 gabrielescarrer beto-amaral caaarov saranme jw15667 sdbrugman robonejan kjakubowska88 mubarikibrahim schloss-simms arabellacm urgpan josefin-b vladtarverdov felixley kokeshi777 frmele kymy91 mparjcus mazim-co siloik gius88 alliesegre gg030 rvergnani frcloers alenpavlicravser n4d1n3 thebille johalruiz lupocordero agathantavrazou loretolf lucaciceu horacioe14 polhervella oriolsauleda bunmi-haastrup samcana tonyhathuc n-1983 anastasia-mintz npferrari joangali1997 isabeljabs deepdesh jennipher0716 davis-pudans commanderpoe neilcorteen andreafought flacolaco nayopr annbeele novi0106 wcondevidal ritasilv sumampouw sergioaisolutions amandasilvap callmeishmaelh inesrondeau gonzalo-zurita elizabeth-sames joriencaron hamzapektas lydiavanderputten enricocesaro fariajanela fredericodr renevans felagund93 sanjamil 1francisco1 gonszalo juliea1001 ingridlembo

lab-customer-analysis-round-1's Issues

missing fields in the csv files create problems in the next rounds

There are missing fields in the csv files ( e.g. Response , Sales Channel) that are needed for the next rounds in the customer analysis case study . What happens is that the students work on the data cleaning tasks using these csv files and afterwards they have to redo all their work because they receive new files with new fields in the next rounds, which creates a frustration to them because they have to redo and recheck all their cleaning code on the new file.

The solution: is to make sure that the same files that students work on from the first round can be used(including all fields) in later rounds so they can apply all the data cleaning and exploratory data analysis in one consistent pipeline.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs