GithubHelp home page GithubHelp logo

Comments (12)

NealHumphrey avatar NealHumphrey commented on August 23, 2024

@hansak11 you were working on this with Jeanne, right? Where does this stand? Do you guys have a recommendation yet for which dataset from zillow we should actually use?

from housing-insights.

hansak11 avatar hansak11 commented on August 23, 2024

@NealHumphrey I did look into the zillow data earlier but haven't had a chance to look at it again since we were trying to work with Docker. Do you need this information soon? I can try to work on it this weekend and see if I find a good dataset to use.

from housing-insights.

NealHumphrey avatar NealHumphrey commented on August 23, 2024

No specific timeline, just want to get this integrated in the next few weeks. I was cleaning up all our tickets and this didn't have you as an assignee so I wanted to make sure you were still on it and it hadn't dropped off the radar. Thanks!

from housing-insights.

hansak11 avatar hansak11 commented on August 23, 2024

Apologies, Neal! I ended up staying late at work and the red line was a mess today and hence could not join the work session. I still haven't had a chance to figure out which Zillow dataset will be useful. I should have some time this week to look into it but if someone else wants to take over this task, it might work better (if this is an emergency) I do not have the admin restrictions on my laptop anymore so I should be able to work with Docker.

from housing-insights.

terrysky18 avatar terrysky18 commented on August 23, 2024

I will take over this issue.

from housing-insights.

domaley avatar domaley commented on August 23, 2024

I'm really interested in this data @terrysky18, please let me know once this is pulled together. Also, if there's anything I can do to help, let me know.

from housing-insights.

NealHumphrey avatar NealHumphrey commented on August 23, 2024

@terrysky18 - We briefly talked about this, but the best first step on this is the bullet point on the 'geographical summary':

  • Get the latest copy of all the Zillow Rent Index datasets (there is a separate dataset for each combination of apartment/building types).
  • Calculate roughly how much of the city has data for each of the data sets. Some data sets only report a handful of neighborhoods, while others report all of them.
  • Calculate how much that differs for past years (all the data sets have lower coverage in earlier years)

After you prepare that, maybe it makes sense to set down with @domaley and discuss any other considerations on which rent data to use / exclude or which caveats to display alongside the data?

from housing-insights.

terrysky18 avatar terrysky18 commented on August 23, 2024

@NealHumphrey, @domaley - acknowledged.

from housing-insights.

terrysky18 avatar terrysky18 commented on August 23, 2024

The meta.json file contains 'zillow_zrisqft_neighbor'. The Zillow rent index per square feet for all home file contains the most neighbourhoods in DC; other files only provide either a few neighbourhoods or none for Washington.

The Neighborhood_MedianRentalPrice_1Bedroom.csv file contains data for 10 DC neighbourhoods: Columbia Heights, Capitol Hill, Adams Morgan, Logan Circle, Dupont Circle, Foggy Bottom, Mount Vernon Square, Forest Hills, Woodley Park, Navy Yard. Other median rental price files contain no data for DC at all.

The files for median rent price by postcodes contain similar amount of data. Zip for 1-bedroom file contains data for 10 postcodes. Zip for 2-bedroom file contains 6 postcodes. Files for more bedrooms do not have data for DC.

From data coverage stand point, zri-sqft files by neighbourhoods and postcodes provide the most amount of data; the file names are Neighborhood_ZriPerSqft_AllHomes.csv and Zip_ZriPerSqft_AllHomes.csv respectively. I will look into how a rent price can be derived from a rent index. Comments and suggestions please.

from housing-insights.

terrysky18 avatar terrysky18 commented on August 23, 2024

Other files that may be useful are Neighborhood_Zri_AllHomes.csv and Zip_Zri_AllHomes.csv.

Neighborhood_Zri_AllHomes.csv contains the same number of neighbourhoods as in zri-sqft, which is 92. It gives a monthly rent price for the whole neighbourhood.

Zip_Zri_AllHomes.csv gives the monthly rent price by postcode. However, the zri by postcode only contains 21 entries for DC.

We could use both Neighborhood_Zri_AllHomes.csv and Neighborhood_ZriPerSqft_AllHomes.csv. Rent index for all homes can provide a quick overview of a neighbourhood. Rent index per square feet can be used to calculate more precise rent estimate when room size in a specific neighbourhood is available.

from housing-insights.

domaley avatar domaley commented on August 23, 2024

from housing-insights.

NealHumphrey avatar NealHumphrey commented on August 23, 2024

@domaley That is something I've looked into some. This is the ticket related to calculating this, which is on hold: #148

We have the boundaries of the Zillow neighborhoods, which are different from any other boundaries we have. To roll them up we need some way of splitting a zillow neighborhood between two different neighborhood clusters, for example, when it crosses a border. I recommended using the Master Address Repository which could calculate the number of residential units within the overlapping zones and use that to make weighting factors.

We will probably delay this, because we have weighting factors to use for all our other data sources already...

from housing-insights.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.