GithubHelp home page GithubHelp logo

phase-2-project's Introduction

King County Home Flipping

Business Understanding

Our client, Seattle Home Flippers, wants to optimize their business by studying trends in the Seattle real estate business. They specifically want to know where they should investment and what to focus on in renovations. To identify answers to these questions, first we want to find the area with lowest average sale price, because they will have the most potential profit return on investment. Then, we wanted to find the key feature within a house that increases its value in the buyers eyes. By implementing the findings to both of these problems, Seattle Home Flippers will be able to increase their sale prices on renovated homes. By building multiple-linear regression models, we are able to make predictions of home sale prices and identify the variables that influence the price most (have the most value).

Data Understanding

We used a dataframe containing King County Housing information, which describes properties in the city of Seattle, Washington. Our client is a home flipper, therefore only concerned with how the interior of the house effects price, so we removed properties from the dataframe that had a view. We reasoned that views might unequally effect sale price and create unnecessary outliers that would hurt the model. To make the data more concise, we dropped information we deemed irrelevant or unimpactful to the business problem (year renovated, id, date, floors, latitude, longitude). We then grouped grade into 3 categories to provide more samples per section and organized living space area to provide information on houses with basements and the size of the basements. We decided to OneHot encode the zipcode category in order for the model to reflect the area each property was in. Finally, after viewing the correlation heatmap, we fixed colineraity and were ready to model.

image

Methods

First, we established a baseline model isolating sale price as the depend variable. Our first model was inaccurate, as shown by the low R squared score of .457. We then used log to improve the model and rid ourselves of outliers, which actually decreased R^2. Scrap it. To create a better model, we used the kitchen sink approach. After setting a standard scaler and transformer, the model improved, returning a R^2 of .827. After logging it, our final model returned R^2 of .86. This means that our model, which includes bedrooms, living area (sq. feet), lot area, condition, year built, zipcode, size of neighbors lots, size of basement, has a basement, and grade group (1-3), can explain about 86% of variance for price.

Results

Housing regression model

The model showed us that the variable with most correlation to sale price is the square footage of living space. To display this relationship, we created a scatterplot with a line of best fit set to show the positive linear relationship between the two variables. We found that, acording to the findings, every increase in area by 100 square feet will result in a 3.25% increase in sales price.

image

Additionally, we found that the zipcode with the lowest average sale price ($234,103) was 98002: Auburn, Washington.

cheap zipcode seatlle housing

Recommendations

The data shows that Seattle Home Flippers should concentrate on size primarily; the more area they can provide, the more the house will sell for. Also, they should look into investing in homes in the Auburn community of Seattle.

References

Presentation: https://www.canva.com/design/DAFFF3RLExM/Qh0b7_hVQsitJx8OZRjb7A/edit?utm_content=DAFFF3RLExM&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton Data: https://info.kingcounty.gov/assessor/esales/Residential.aspx

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.