
benjaminmgross / api-scrapin


Project 01 for General Assembly involving API scraping and data visualization



#README.md

Introduction

Zillow aggregates some very interesting data, especially if you're interested in aspects of home prices and demographics such as income, education, etc., not to mention that all of this information is provided with latitude and longitude coordinates to boot. This light-weight module takes advantage of that and gives you the ability to:

  1. Take the 'n' largest cities in the US (data scraped from Wikipedia)
  2. Go to Zillow's API, & extract regionid's (of which there are several hundred for any metropolitan city) along with some interesting Zillow Index Data
  3. Join that data with demographic data, such as median house prices, cost per square foot, median income, etc. all provided at a "neighborhood", "city", & "state" level.
  4. Put it all into Jesus' favorite data structure... pandas, to do some more interesting data analysis
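As a rough illustration of step 1, here is a minimal sketch of picking the 'n' largest cities from an already-scraped table. The rows, field names, and the `largest_cities` helper are all hypothetical placeholders (not real population figures and not this package's API); the Wikipedia scraping itself is left out.

```python
import heapq

# Placeholder rows standing in for the scraped Wikipedia table.
# City names and populations are made up for illustration only.
CITIES = [
    {"city": "Alphaville", "state": "AA", "population": 900_000},
    {"city": "Betatown", "state": "BB", "population": 2_500_000},
    {"city": "Gammaburg", "state": "CC", "population": 1_200_000},
    {"city": "Delta Falls", "state": "DD", "population": 300_000},
]


def largest_cities(rows, n):
    """Return the n rows with the highest population, largest first."""
    return heapq.nlargest(n, rows, key=lambda row: row["population"])


if __name__ == "__main__":
    for row in largest_cities(CITIES, 2):
        print(row["city"], row["state"], row["population"])
```

Each of the returned cities would then be handed to the Zillow lookup in step 2.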

Dependencies:

  • Requests: Leveraged heavily to hit the Zillow API, as well as pass the API arguments
  • BeautifulSoup: Specifically bs4, for parsing the horrific, sadder-than-baby-tears XML that the Zillow API spits out
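To give a feel for the XML-wrangling part, here is a minimal sketch of turning a region listing into plain Python records. Two loud assumptions: the XML layout below is a made-up sample rather than a captured Zillow response, and the sketch uses the standard library's `xml.etree.ElementTree` instead of bs4 so it runs with no extra installs. The `parse_regions` name is hypothetical and not part of this package.

```python
import xml.etree.ElementTree as ET

# Made-up sample payload; the real API's element names may differ.
SAMPLE_XML = """
<response>
  <list>
    <region>
      <id>1001</id>
      <name>Example Heights</name>
      <latitude>40.70</latitude>
      <longitude>-73.90</longitude>
    </region>
    <region>
      <id>1002</id>
      <name>Sample Park</name>
      <latitude>40.80</latitude>
      <longitude>-73.95</longitude>
    </region>
  </list>
</response>
"""


def parse_regions(xml_text):
    """Extract one dict per <region> element found anywhere in the document."""
    root = ET.fromstring(xml_text.strip())
    regions = []
    for node in root.iter("region"):
        regions.append(
            {
                "region_id": int(node.findtext("id")),
                "name": node.findtext("name"),
                "latitude": float(node.findtext("latitude")),
                "longitude": float(node.findtext("longitude")),
            }
        )
    return regions


if __name__ == "__main__":
    for region in parse_regions(SAMPLE_XML):
        print(region)
```

A list of dicts like this drops straight into `pandas.DataFrame(...)` later on.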

Installation:

$ git clone git@github.com:benjaminmgross/api-scrapin.git #assuming ssh install
$ cd api-scrapin
$ python setup.py install

I know what you're thinking, "why can't I pip install it?" Stop whining! ... fine, I haven't figured out how to do that yet with packages, but I'm working on it...

##Up and Running in 5 Steps

###Step 1: Get Yourself a Zillow API Token

  1. Go to Zillow's Registration Page where you will be prompted to create a login.
  2. After you create a login, go to the Zillow API Overview Page
  3. Click on the link to get a ZWSID
  4. Fill out the information, check the boxes for all of the different APIs you might want, and then get ready to receive your Zillow API key in your inbox!
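Once the key lands in your inbox, one reasonable way to keep it out of your source code is an environment variable. The sketch below is an assumption about how you might handle it, not something this package does for you; the `ZWSID` variable name and the `load_zwsid` helper are hypothetical.

```python
import os


def load_zwsid(env_var="ZWSID"):
    """Return the Zillow API key stored in the given environment variable."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your Zillow API key"
        )
    return key


if __name__ == "__main__":
    # Example: run `export ZWSID=your-key-here` in the shell first.
    try:
        print("Key loaded, length:", len(load_zwsid()))
    except RuntimeError as err:
        print(err)
```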

###Step 2: Install the Package

See installation instructions

###Step 3: Let 'er Rip

The crux of what makes this package special is the ability to merge what are called "region-id" and cities.

For instance, there are 267 region-id's around the New York City area, and for each one of those region-id's, there's extensive demographic information (such as income, commute times, etc.), but this information is never provided "together" -- as in, here's the city, all of its region-id's, and extensive demographic data about those region-id's / cities.

You can try to figure out how to join all that data from disparate Zillow APIs... or you can just use this package.
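For the curious, the kind of join described above can be sketched in pandas as follows. This is only an illustration of the idea, not this package's actual code: the column names, the `join_regions` helper, and every value in the two frames are made up.

```python
import pandas as pd

# Made-up region listing: which region-id belongs to which city.
regions = pd.DataFrame(
    {
        "region_id": [1001, 1002, 2001],
        "city": ["New York", "New York", "Chicago"],
        "neighborhood": ["Example Heights", "Sample Park", "Placeholder Square"],
    }
)

# Made-up demographic data keyed by the same region-id.
demographics = pd.DataFrame(
    {
        "region_id": [1001, 1002, 2001],
        "median_income": [50000, 60000, 55000],
        "avg_commute_min": [40, 35, 30],
    }
)


def join_regions(regions, demographics):
    """Attach demographic columns to each region, keeping every region row."""
    return regions.merge(
        demographics, on="region_id", how="left", validate="one_to_one"
    )


merged = join_regions(regions, demographics)

# With everything in one frame, city-level rollups are a one-liner.
city_summary = merged.groupby("city")["median_income"].mean()

if __name__ == "__main__":
    print(merged)
    print(city_summary)
```

The `how="left"` choice keeps regions that have no demographic data (their columns come back as NaN), and `validate="one_to_one"` raises if a region-id shows up twice on either side.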

###Step 4: Do some cool analysis

You got this one covered...

###Step 5: Write me an email and tell me you love me

##To Do:

  • Complete package installation so package can be installed
  • Finish README.md
  • Generate documentation with Sphinx
