Comments (4)
I see it too. Here's what's happening: Spain has stopped reporting historical data, so from May 24-25 onward the data for Spain comes from a different source (ECDC), which has different baseline numbers.
It's tricky because the ECDC data source does not have regional data, test counts, or hospitalization information, so we can't just remove the old data source. Further, ECDC has clearly erroneous data for Spain in March. I'll try to set up the data pipeline to ignore national deceased counts from the official source and take them only from the ECDC; maybe those are a little more consistent.
from data.
Unfortunately, even the ECDC data appears to have the drop of ~2K in deaths. It must be an adjustment in the way deaths are counted (this would be the second big adjustment made by Spain; the first was in April, to the way positive tests were counted).
I don't think there's a good way to deal with this problem. You either take the drop as the true value, or you adjust the data to keep it monotonically increasing, at the cost of deviating from the official counts. Adjustments in the way metrics are computed are not uncommon, but they typically occur in the count of confirmed cases, and the change is smaller than 7%.
This is why we now provide both the "new_deceased" and "total_deceased" columns. If you want to ignore bogus values, use "new_deceased" and skip negative values. If you want to keep the graphs as close to the official reporting as possible, use "total_deceased", but keep in mind that adjustments can happen.
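For anyone consuming the CSV, here is a rough sketch of the two options described above. Only the "new_deceased" and "total_deceased" column names come from this thread; the "date" column, the helper function names, and the sample rows are assumptions made up for illustration (the numbers are not real data), and the running maximum is just one possible way to keep totals monotonic.

```python
import csv
import io


def daily_deaths_skipping_negatives(rows):
    """Return (date, new_deceased) pairs, dropping missing or negative values."""
    result = []
    for row in rows:
        raw = row.get("new_deceased") or ""
        if raw == "":
            continue  # no value reported for this day
        value = int(float(raw))
        if value < 0:
            continue  # negative value: a retroactive correction, not a real daily count
        result.append((row["date"], value))
    return result


def monotonic_totals(rows):
    """Return (date, total) pairs where the total never decreases.

    Uses a running maximum, so after a downward correction the series
    deviates from the official "total_deceased" values.
    """
    result = []
    running_max = 0
    for row in rows:
        raw = row.get("total_deceased") or ""
        if raw != "":
            running_max = max(running_max, int(float(raw)))
        result.append((row["date"], running_max))
    return result


# Invented sample rows (NOT real figures), including one downward correction.
SAMPLE_CSV = """date,new_deceased,total_deceased
2020-01-01,50,1000
2020-01-02,50,1050
2020-01-03,-150,900
2020-01-04,50,950
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
daily = daily_deaths_skipping_negatives(rows)
totals = monotonic_totals(rows)
```

With the first helper the corrected day simply disappears from the daily series; with the second, the cumulative curve stays flat until the official total catches up again.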
from data.
Ok, thanks.
from data.
Closing for now; feel free to reopen if there's something else we need to do on our end.
from data.