GithubHelp home page GithubHelp logo

mart-sales-predtiction's Introduction

JOB-A-THON

The solving the project in a simple way to approch the required solution, following steps are applied.

  1. read the problem statement and data
  2. import the required libraries
  3. data cleaning
  4. EDA
  5. module approch
  6. comparision of trained and test data
  7. creating the sumbmission file

Reading the problem statement : the problem statement is to predict the Sales prize based on store types, location types, holidays and discounts

Importing the data : Asusale the data reading libraries are imported like pandas, and some maths libraries like numpy, maths for scaling the purpose the MinMax scalar, and at last machine learning libraries from sklearn

Data cleaning : the data is checked weather the null values are present are not. but in this WOmart data set their is no null values present. and check the independent and dependent columns.

EDA : the 1st approch in this step is to visualize the all the columns with each other, how they are corelated with each other. the date column is an object data type so we do it in either int or float by spliting it in three other columns like day,month, year. after this we do label encoder to all objective/categorical data type columns. then we split the data in two data frames one in independent and another one in dependent. in independent columns the ID and #Order column is removed beacuse one have all uniques vales with objective data type and its not effect the remaining module, and another one for matching the test and train data shape,(either remove it or compress the data features) the independent column is scaled with the help of Minmax theorm. then split it into training and testing data

Module approch : to check which module performs well for this data we imported 6 machine learning modules like RandomForest, XGB Regressor, Lasso and Ridge etc. created a for loop were to check the all modules and listed out in RMS values comparing all the RMS vlaue the XGB Regressor gives less errors.

comparision of trained and test data : comparing the splited X_train and Y_train with visualization and r2 score

creating the sumbmission file : comparing all the RMS vlaue the XGB Regressor gives less error. so created a separate module to predict the test data set and created submissinon file with column names of ID and Sales, saved into '.csv' file.

Thank you

mart-sales-predtiction's People

Contributors

sandeepbelamagi avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.