rafayk330 / house-price-prediction

Predicts house prices in King County, USA, from features such as living area and waterfront view. Involves extensive exploratory data analysis and predictive modeling using data pipelines.

house-price-prediction's Introduction

HOUSE PRICE PREDICTION MODEL

PROJECT SYNOPSIS

In this project, I play the role of a Data Analyst for a Real Estate Investment Trust. The Trust would like to start investing in residential real estate. The task at hand is to determine the market price of a house given a set of features. The project predicts housing prices using attributes or features such as square footage, number of bedrooms, number of floors, and so on. A Jupyter notebook has been provided in this repository.


TOOLS USED

This project uses Python for both the analysis and the visualization. A range of Python libraries is used in the notebook; the core tools are:

  • Python 3.8 (visualization + analysis)
  • Jupyter Notebook (IDE)

DATASET DESCRIPTION

This dataset contains house sale prices for King County, which includes Seattle. It includes houses sold between May 2014 and May 2015. It was taken from a Kaggle upload (https://www.kaggle.com/harlfoxem/housesalesprediction).

Here is the description of the data:

  • id: A notation for a house
  • date: Date house was sold
  • price: Sale price (the prediction target)
  • bedrooms: Number of bedrooms
  • bathrooms: Number of bathrooms
  • sqft_living: Square footage of the home
  • sqft_lot: Square footage of the lot
  • floors: Total floors (levels) in house
  • waterfront: Whether the house has a view of a waterfront
  • view: Has been viewed
  • condition: How good the condition is overall
  • grade: Overall grade given to the housing unit, based on the King County grading system
  • sqft_above: Square footage of house apart from basement
  • sqft_basement: Square footage of the basement
  • yr_built: Year the house was built
  • yr_renovated: Year when house was renovated
  • zipcode: Zip code
  • lat: Latitude coordinate
  • long: Longitude coordinate
  • sqft_living15: Living room area in 2015 (implies some renovations); this may or may not have affected the lot size
  • sqft_lot15: Lot size in 2015 (implies some renovations)
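
As a rough illustration of how a few of these columns might be read into typed Python records, here is a minimal sketch using only the standard library. The sample rows are made up for illustration and are not taken from the dataset. The column types, including treating waterfront as a 0/1 flag, are assumptions based on the descriptions above.

```python
import csv
import io

# Made-up sample rows for illustration only; NOT real records from the dataset.
SAMPLE_CSV = """id,price,bedrooms,bathrooms,sqft_living,floors,waterfront
1,300000,3,1.0,1200,1.0,0
2,550000,4,2.5,2400,2.0,0
3,1200000,4,3.0,3100,2.0,1
"""

# Assumed converter for each column used in this sketch.
COLUMN_TYPES = {
    "id": str,
    "price": float,
    "bedrooms": int,
    "bathrooms": float,
    "sqft_living": int,
    "floors": float,
    "waterfront": lambda value: value == "1",  # assumes a 0/1 flag
}


def load_rows(handle):
    """Read CSV rows and convert each known column to its Python type."""
    reader = csv.DictReader(handle)
    return [
        {
            name: COLUMN_TYPES[name](value)
            for name, value in row.items()
            if name in COLUMN_TYPES
        }
        for row in reader
    ]


rows = load_rows(io.StringIO(SAMPLE_CSV))
print(rows[0]["price"], rows[2]["waterfront"])
```

To read the real file, an open file handle can be passed to load_rows in place of the in-memory sample; columns not listed in COLUMN_TYPES are skipped.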

METHODOLOGY

  1. Imports of Libraries and Packages
  2. Import of Dataset
  3. Data Wrangling/Preprocessing
  4. Exploratory Data Analysis
  5. Feature Selection
  6. Model Development
  7. Creation of Data Pipeline
  8. Model Evaluation and Refinement
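
The last three steps (model development, data pipeline, evaluation) can be sketched roughly as follows. This is a minimal standard-library illustration on synthetic data, not the code from the notebook. The class names, the gradient-descent fit, the 150/50 split, and the synthetic price formula are all assumptions chosen for the example.

```python
import random


class Standardizer:
    """Scale each feature column to zero mean and unit variance."""

    def fit(self, X):
        n = len(X)
        cols = list(zip(*X))
        self.means = [sum(c) / n for c in cols]
        # Fall back to 1.0 for constant columns to avoid division by zero.
        self.stds = [
            (sum((v - m) ** 2 for v in c) / n) ** 0.5 or 1.0
            for c, m in zip(cols, self.means)
        ]
        return self

    def transform(self, X):
        return [
            [(v - m) / s for v, m, s in zip(row, self.means, self.stds)]
            for row in X
        ]


class LinearRegressionGD:
    """Linear regression fitted by batch gradient descent on squared error."""

    def __init__(self, lr=0.1, epochs=500):
        self.lr, self.epochs = lr, epochs

    def fit(self, X, y):
        n, d = len(X), len(X[0])
        self.w, self.b = [0.0] * d, 0.0
        for _ in range(self.epochs):
            errors = [p - t for p, t in zip(self.predict(X), y)]
            self.b -= self.lr * sum(errors) / n
            for j in range(d):
                grad = sum(e * row[j] for e, row in zip(errors, X)) / n
                self.w[j] -= self.lr * grad
        return self

    def predict(self, X):
        return [self.b + sum(w * v for w, v in zip(self.w, row)) for row in X]


class Pipeline:
    """Apply transformers in order, then fit or predict with the model."""

    def __init__(self, transformers, model):
        self.transformers, self.model = transformers, model

    def fit(self, X, y):
        for t in self.transformers:
            X = t.fit(X).transform(X)
        self.model.fit(X, y)
        return self

    def predict(self, X):
        for t in self.transformers:
            X = t.transform(X)
        return self.model.predict(X)


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Synthetic data (an assumption): price depends linearly on sqft and bedrooms.
rng = random.Random(0)
X = [[rng.uniform(500, 4000), rng.randint(1, 5)] for _ in range(200)]
y = [150 * s + 10000 * b + 50000 + rng.gauss(0, 20000) for s, b in X]

# Hold out the last 50 rows so the model is scored on unseen data.
X_train, X_test, y_train, y_test = X[:150], X[150:], y[:150], y[150:]

pipe = Pipeline([Standardizer()], LinearRegressionGD()).fit(X_train, y_train)
r2 = r2_score(y_test, pipe.predict(X_test))
print(f"R^2 on held-out data: {r2:.3f}")
```

The scaler is fitted only on the training rows inside Pipeline.fit, so nothing from the held-out rows leaks into the model. Libraries such as scikit-learn provide ready-made equivalents (Pipeline, StandardScaler, LinearRegression); whether the notebook uses them is not stated here.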

ANALYSIS

All the steps in the analysis have been explained in the Jupyter Notebook for this project. Some examples of visualizations used are as follows:

[Figure: sqft_corr]
[Figure: waterfront_boxplot]
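
The numbers behind plots of this kind can be computed without any plotting library. The sketch below assumes the two visualizations are a correlation between square footage and price and a box plot of price by waterfront. It works out a Pearson correlation and the five-number summary per waterfront group. The records are made up for illustration and are not taken from the dataset.

```python
import statistics

# Made-up records (sqft_living, waterfront, price); NOT real dataset rows.
records = [
    (1000, 0, 250000), (1500, 0, 340000), (2000, 0, 450000),
    (2500, 0, 520000), (1800, 1, 900000), (2600, 1, 1300000),
    (3200, 1, 1700000), (3000, 0, 640000),
]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def box_stats(values):
    """Five-number summary of the kind a box plot draws."""
    q1, median, q3 = statistics.quantiles(values, n=4)
    return {
        "min": min(values),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(values),
    }


sqft = [r[0] for r in records]
price = [r[2] for r in records]
corr = pearson(sqft, price)

# Group prices by the waterfront flag, then summarize each group.
by_waterfront = {}
for _, flag, p in records:
    by_waterfront.setdefault(flag, []).append(p)
summary = {flag: box_stats(prices) for flag, prices in by_waterfront.items()}

print(f"corr(sqft_living, price) = {corr:.2f}")
for flag, stats in sorted(summary.items()):
    print(f"waterfront={flag}: {stats}")
```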


SUMMARY AND REFLECTION

This is an intermediate-level project which involves some advanced concepts of Machine Learning and Predictive Modeling in Python using an IDE.

All rights related to the published dataset are reserved by its issuing authority (Kaggle).

The project may be used only as a learning resource; no part of it may be copied for any other use whatsoever.

house-price-prediction's Issues

Incorrect Language Under "Packages"

GitHub automatically recognizes the languages used to code based on the size of the files. The issue here is that it is being displayed as a "Jupyter Notebook" project whereas the code snippet provided (code.py) makes it clear that it is a Python project.
