Comments (8)
Yep, that would indeed be great to tackle, especially in view of loading all records onto QA. I might look into it on Monday, unless anyone volunteers to implement a script that takes a list of addresses and feeds them (slowly) to Google to obtain geocoding.
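A rough sketch of what such a script could look like, in case it helps whoever picks this up. Everything here is an assumption for illustration: `geocode_slowly`, the `geocode` callable (any function that maps an address to a `(lat, lon)` pair or `None`) and the one-second default delay are hypothetical, not an existing INSPIRE tool or a documented provider limit.

```python
import time
from typing import Callable, Dict, Iterable, Optional, Tuple

Coordinates = Tuple[float, float]


def geocode_slowly(
    addresses: Iterable[str],
    geocode: Callable[[str], Optional[Coordinates]],
    delay_seconds: float = 1.0,  # assumed pause; tune to the provider's policy
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Optional[Coordinates]]:
    """Geocode each address in turn, pausing between consecutive requests."""
    results: Dict[str, Optional[Coordinates]] = {}
    for index, address in enumerate(addresses):
        if index > 0:
            sleep(delay_seconds)  # throttle so the provider is not flooded
        try:
            results[address] = geocode(address)
        except Exception as error:
            # Record the failure as None and keep going with the next address.
            print(f"Failed to geocode {address!r}: {error}")
            results[address] = None
    return results
```

The `geocode` argument is deliberately a plain callable, so the same loop could be pointed at Google or at any other service. If geopy ends up being used, its `geopy.extra.rate_limiter.RateLimiter` helper may cover the throttling part already.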
from inspire.
@inspirehep/inspire-dir how were these numbers captured in the first place?
We were thinking of writing a simple tool to correct or fill in all these numbers based on the Google API. Google results can also return the rectangle that contains an institute, and this could be used to double-check whether a proposed modification is consistent or points to a totally different place.
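The rectangle check could be sketched roughly as follows. This assumes the rectangle arrives in the shape of the `geometry.viewport` object from a Google Geocoding result, i.e. a dict with `northeast` and `southwest` corners that each carry `lat` and `lng`; the function name `inside_viewport` is made up for this example.

```python
from typing import Dict


def inside_viewport(lat: float, lng: float, viewport: Dict[str, Dict[str, float]]) -> bool:
    """Return True if (lat, lng) falls inside the given rectangle."""
    south = viewport["southwest"]["lat"]
    north = viewport["northeast"]["lat"]
    west = viewport["southwest"]["lng"]
    east = viewport["northeast"]["lng"]

    if not south <= lat <= north:
        return False
    if west <= east:
        return west <= lng <= east
    # The rectangle crosses the antimeridian (e.g. west=179, east=-179).
    return lng >= west or lng <= east
```

A stored coordinate that falls outside the rectangle would then be flagged for manual review rather than overwritten automatically.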
from inspire.
You don't need to use Google. As shown before, this API does it using whatever endpoint you want, and some have no access limits at all - https://pypi.python.org/pypi/geopy
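For reference, a minimal sketch of how a geopy lookup could be wrapped. It relies only on the fact that geopy geocoders expose `geocode(query)` and return a location with `latitude` and `longitude` (or `None` when nothing is found). The helper names `lookup` and `make_nominatim` and the user-agent string are placeholders invented for this example. Note that the public Nominatim service asks for an identifying user agent and has its own usage policy.

```python
from typing import Any, Optional, Tuple


def lookup(geocoder: Any, address: str) -> Optional[Tuple[float, float]]:
    """Return (latitude, longitude) for an address, or None if nothing matched."""
    location = geocoder.geocode(address)
    if location is None:
        return None
    return (location.latitude, location.longitude)


def make_nominatim(user_agent: str = "inspire-coordinate-cleanup") -> Any:
    """Build an OpenStreetMap (Nominatim) geocoder; the user agent is a placeholder."""
    # Imported lazily so the rest of the sketch works without geopy installed.
    from geopy.geocoders import Nominatim

    return Nominatim(user_agent=user_agent)
```

Because `lookup` only depends on the `geocode` method, switching providers should mean swapping the object passed in, for example `lookup(make_nominatim(), "some address")`.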
from inspire.
Thanks, I had missed this information. Nice API. Now this opens up the question of which provider is best. I assume OpenStreetMap will be great in certain countries but not all.
Also, to cross-check the data we have so far, I guess we can use the available distance function to verify that the coordinates we have are within a radius of e.g. 10 km.
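The radius check could look something like the sketch below. It uses a plain haversine great-circle distance on a spherical Earth so that it runs without extra dependencies; geopy's own `geopy.distance.geodesic` would be the more accurate choice. The names `haversine_km` and `within_radius` are invented here, and the 10 km default just mirrors the figure mentioned above.

```python
import math
from typing import Tuple

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, spherical approximation


def haversine_km(a: Tuple[float, float], b: Tuple[float, float]) -> float:
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    h = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def within_radius(
    stored: Tuple[float, float],
    geocoded: Tuple[float, float],
    radius_km: float = 10.0,
) -> bool:
    """True if the stored coordinates lie within radius_km of the geocoded ones."""
    return haversine_km(stored, geocoded) <= radius_km
```

Records for which `within_radius` returns False would be the candidates for correction.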
from inspire.
OpenStreetMap is pretty cool. Take some of the more obscure addresses alongside some common ones and see how it does. It doesn't need to be exact, just good enough :)
from inspire.
In my limited experience, OpenStreetMap beats Google hands down in the parts of Europe I have travelled (even in the more obscure backyards), while the US still seems to be Google country. Perhaps using whatever works best is an option?
from inspire.
Mmh. I have just tried a simple Italian address from INSPIRE, but it failed miserably on OpenStreetMap. The benefit of Google is that it does a lot of heuristic processing and is thus able to interpret the weirdest situations.
We're trying a bit more. Anyway, it's for a one-off mini project to clean up coordinates, so whichever service we end up with will be good enough.
But indeed the above library also offers wrappers for e.g. specific French maps, so it might be beneficial to use dedicated services.
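One way to combine several services, sketched under assumptions: try a list of geocoders in a preferred order and keep the first hit. The function name `first_match` and the idea of labelling each provider with a string are invented for this example; the only thing taken from geopy is the `geocode(query)` interface that returns a location or `None`.

```python
from typing import Any, Iterable, Optional, Tuple


def first_match(
    address: str,
    geocoders: Iterable[Tuple[str, Any]],
) -> Optional[Tuple[str, Tuple[float, float]]]:
    """Try each (name, geocoder) pair in order and return the first result found.

    Returns (provider_name, (latitude, longitude)), or None if every provider
    came back empty.
    """
    for name, geocoder in geocoders:
        location = geocoder.geocode(address)
        if location is not None:
            return name, (location.latitude, location.longitude)
    return None
```

Keeping the provider name next to the coordinates makes it easier to see afterwards which service answered for which address, which would also help in judging which one works best where.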
from inspire.
Here it is on Sentry: https://sentry.cern.ch/inspire-sentry/inspire-labs-qa/group/601659/
from inspire.