GithubHelp home page GithubHelp logo

quattroshapes's Introduction

#quattroshapes

The Global Polygon Gazetteer

Foursquare needs quality place data to power its geocoding engine to ensure the best recommendations. When someone searches for the best coffee in Brooklyn, a simple venue to place point or venue to place bounding box search can result in venues in Manhattan and Jersey City overwhelming the results for Brooklyn.

To improve recommendations, we have created an authoritative source of polygons around a curated list of places. This gazetteer of non-overlapping polygons provides more relevant results than simple point geometries.

View preview images »

This work is based on foursquare checkins, geo tagged photos from Flickr, an extended version of Natural Earth, and open government data. Concordance is provided between quattroshapes, geonames.org, and Yahoo! GeoPlanet unique IDs in the gazetteer.

The quattroshapes technique calculates the dominant place ID for a given area based on heterogeneous inputs. This work is an extension of alphashapes and betashapes (thanks Aaron and Schuyler!) and is used to backfill countries without complete open data.

Matching mapping agency polygons to the geonames.org dataset was performed using shape-gn-matchr. Concording geoplanet to geonames lives at geoplanet-concordance.

Geocoding can be the hardest part about going open source - and reverse geocoding is even harder. Reverse geocoding reports the gazetteer place for a latitude and longitude map location or address string and is useful when source data needs to be normalized. This new polygon gazetteer data is used in Twofishes (demo), the coarse splitting geocoder (and reverse geocoder) written in scala from David Blackman at foursquare.

The quattroshapes code and resulting 30 gb of data are licensed under CC-BY, but includes data licensed from many governments around the world. Check the License for full details and limitations.

Enjoy!


##Downloads

Shapefiles are in WGS84 (geographic) projection and UTF-8 character encoding.

##Goodies

quatroshapes gazetteer (gzipped geojson):

Other:

##Preview

Administrative level 1: (below) States and provinces in orange; regions shown in red. Mix of national mapping agency and Natural Earth.

qs_adm1

Administrative level 2: (below) Counties in bright blue; regions shown in dark blue. National mapping agency data.

qs_adm2

Local administrative level: (below) In green. This level of government assumes municipal type control over the central town and surrounding countryside. National mapping agency data.

qs_localadmin

Localities: (below) In yellow. In the USA this is the smallest unit of government with legal boundaries. For most other countries the localities here are informal parts of local administrative areas. Mix of national mapping agency, quattroshapes enumeration using foursquare checkins & custom data.

qs_localities

Administrative level 0: (below) In gray. Mix of national mapping agency and US State Department data.

qs_adm0

Neighborhoods: (below) In purple. Quattroshape enumeration from geo tagged photos in Flickr using GeoPlanet hierarchy.

qs_neighborhoods

Gazetteer: (below) In light purple. Over 1 million administrative and populated places with around 800,000 having concordance between GeoNames.org and Yahoo! GeoPlanet WOE unique IDs.

qs_gazetteer

quattroshapes's People

Contributors

blackmad avatar nvkelso avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.