Comments (9)
Where's the official government site? Who is maintaining the spreadsheet?
from li.
The government website is https://koronavirus.gov.hu/
The country numbers are 100% mirrored in JHU, so I belive it makes no sense to mirror it only for the country data.
What is not in JHU are the county case numbers, but they are published on the website as an image, so it's not possible for this project to scrape it, until they start publishing it in a human readable form. That spreadsheet could be a workaround for the image numbers, but it's only reliable as the owner of the spreadsheet keeps entering the numbers.
from li.
out of curiosity, anyone try to run something like this through tesseract?
from li.
This might be quite doable with tessarct, I believe, but the project's policy so far is not to do OCR.
from li.
Where are the policies published?
from li.
Just informally on the Slack channel. There was a scraper which was trying to use tesseract and I think it's still under review.
If you feel like doing a tessarcts sample script for that image and submit a PR, it might be a good reference for other scrapers. But I cannot promise it gets merged, it depends on the team.
from li.
The "megyei" sheet doesn't have all of the states (e.g. it's missing Sopron (iso2:HU-SN)), but it has many of them. Opening a PR :-)
from li.
Opened #560, @hyperknot can you take a peek? It gets the data ok, but doesn't seem to cover all the states.
from li.
Note from the spreadsheet maintainer:
Thank you for your request. Yes, you can reuse and scrape the data. Because we have now relatively few cases we update the dataset on a weekly base, every Monday. If the new cases would grow again, we put back the daily update. I see your comment on Github regarding Sopron. Sopron is not county and not county capital, what you need for that county is Győr-Moson-Sopron column. All the Hungarian counties we have (19), + the capital Budapest. Unfortunately we don't have detailed, more accurate data for smaller administrative units since the government does not disclose them.
from li.
Related Issues (20)
- Fix failing windows ci build HOT 1
- 0 new cases for Kentucky/Indiana from timeseries.csv on 08/23 HOT 3
- Migrate Marin County, California, from ArcGIS to DataWrapper HOT 1
- Data Stopped Updating HOT 4
- UPDATED LINK TO TIMESERIES CSV? HOT 1
- Did the dataset stop updating HOT 1
- The sum of cases for some states at county level does not match the value at state level
- tested data for California counties ends 8/31 HOT 1
- Source for US-ND
- Request: "latest" file that includes reporting dates
- No updates beyond 23 September
- timeseries.csv url haven't update since 10/06
- SHORT_ISSUE_NAME for SOURCE on MM/DD
- timeseries.csv haven't update since 10/06 HOT 8
- No data for several European countries since October 29
- Date range for "timeseries.csv" and "timeseries-byLocation" goes until "2020-11-01". HOT 2
- Covid cases in Sweden are completely off on 2021-02-18 HOT 1
- Updating data
- new
- AWS bucket associated with timeseries.json file is not working properly (Nosuchbucket error)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from li.