GithubHelp home page GithubHelp logo

moh-malaysia / covid19-public Goto Github PK

View Code? Open in Web Editor NEW
959.0 94.0 649.0 6.78 GB

Official data on the COVID-19 epidemic in Malaysia. Powered by CPRC, CPRC Hospital System, MKAK, and MySejahtera.

License: Other

Jupyter Notebook 100.00%
covid-19 coronavirus-tracking healthcare

covid19-public's People

Contributors

adibzter avatar agnes-lyy avatar aidilsfwn avatar amirmazmi avatar atlas-github avatar danialsim95 avatar khoohaoyit avatar leeliwei930 avatar moh-malaysia avatar patrickxchong avatar sameu-cloudtech avatar seowwj avatar syafix19 avatar thomassiew avatar timriffe avatar weareblahs avatar wnarifin avatar zukelah avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

covid19-public's Issues

Seeking clarification for unique_ind on MySejahtera data

Hi there, unique_ind is defined as the "number of unique accounts which checked in", may I know is it based on the number of check-in on location?

If one person check-in in three separate locations today (check-in twice in one of the locations), what is the unique individual count?

For example, if I check in once at supermarket A, it is counted as one. And I go to a bank later, I check in another time. But if I go back to supermarket A and checking in again, does it counted as another new check-in?

ICU data read through

Hi, quick check on the ICU data, the aggregate daily icu_covid is far more higher than the daily announced ICU cases by MKN. Not sure how to read through these inconsistencies, anyone can help? Much appreciated.

checkin_malaysia_time.csv outliers, potentially aggregation issue

Hi,

For (checkin_malaysia_time.csv), is there any issue with the data count on the following time-density date

Day 31st May 2021, time density 15 until 21. The precendent and subsequent counts from this time are also strangely low.
I was suspecting that the aggregation wasn't correct around.

Thanks

image

image

Test Cases not being updated for the past 2 days

Hi, just wondering is it intentional that the test cases isn't being updated for the past 2 days? I'm doing a graph about it and just realized it's not being updated recently. Is there any change of test cases being planned or the update being halted?

Actual daily positive test numbers

Request for actual daily positive test numbers from tests in tests_malaysia.csv.
Ideally split according to type of tests.

This data will most likely be delayed by a few days, which is fine. Definitely better than making the false assumption that positive cases reflect the positive rate.

Intention here is to link mobility data to positive test cases.

checkin_malaysia_time.csv column unit

Dear author,

I have checked the file for checkin_malaysia_time.csv. Please state the unit for the column, it had 47 columns, does it mean it recorded the number of check ins for every 30 minutes?

Cheers,
Boo

Clarification on the definition for population.csv dataset

Need clarification on the column pop_18, the documentation mention 18+. Does this mean all aged ≥ 18 y.o. including ≥ 60 y.o. (60+)? Or only those aged between 18+ to 60+? Maybe can change the notation to use ≥ symbol be more descriptive.

Cluster End Date

Can we get data cluster end date.Or if possible consider last updated timestamp case total equal with recovered?

In cases_state.csv, there're two -ve values

Thanks for the data! would love to know if the negative values in cases_state.csv is real?

<style> </style>
date state cases_new
1/8/2021 W.P. Kuala Lumpur -101
5/19/2020 W.P. Putrajaya -1

image

Clarification on total vaccination registration & total population dataset

Hello MoH, I would like to clarify the numbers of total registered vaccination (vaxreg_state.csv) & numbers of total population (population.csv). The reason being is that in some dates & states, the former exceeds the latter

For example, on 23rd July 2021, total number of registered vaccination for W.P. Kuala Lumpur is 1,830,956. While the total population of W.P. Kuala Lumpur is 1,773,700

Can I know the reason of total number of registered vaccination being higher than total number of population? Based on README.md provided, it seems like total is calculated based on number of unique registrants

Thank you!

All-Cause Mortality data

Great initiative!

Could you also provide deaths from all causes (not just COVID) by day/week/month?

Latest Data

covid19-public/epidemic/tests_malaysia.csv - Data is not up to date.

Incorrect data on 16/3/2020

For state data on 16/3/2020, "covid19-public/epidemic/cases_state.csv", line number 2:16, the number marked as new cases are not new cases. These are cumulative cases, and by right left blank/NA.

The same goes to national data on 16/3/2020, "covid19-public/epidemic/cases_malaysia.csv", line 2, the reported new cases on the date is supposed to be 125, not 553, which is actually the cumulative cases by 16/3/2020.

pkrc.csv - Column name typo

File: pkrc.csv

Problem: Column names are mistyped
Affected: discharge_pui, discharge_covid, discharge_total

Comments:
Based on epidemic/README.md, the columns should be labeled discharged_x. This would also be more consistent with the hospital.csv data columns.

Request daily positive by age

Dear author,
Great data. May we request data for daily positive broken down by age or age group?
This will be crucial or important to check the age composition and find how to protect each age group based on activites.

Best

Corrupted/missing data in Mysejahtera 27-31 May?

For checkin_malaysia_time.csv date range 2021-05-27 to 2021-05-31 the numbers are illogically low (bucket 20 or 10:00am onwards). Doesn't seem to be app/server issue, as the numbers are back to normal again at 00:00 each next day.

mysj-bucket-20210527

Daily testing at state level

As of now, the daily testing data is given at national level "covid19-public/epidemic/tests_malaysia.csv". If possible, the data is also given at state level. This allows analysis of positivity rate by state.

Request MySejahtera Checkins by State

We would like to request a granular data from MySejahtera on Checkins by State and District. Currently only the National Level of the MySejahtera and HIDE data points are available.

These data will also show the pattern of clusters vs movements.
Or the use of MySejahtera vs Non MySejahtera recorded clusters.

Thank you

Number of test

Could you include number of test made per day, instead of just number of positive, etc

Request for Additional Data Points - Daily Cases Breakdown by Category (1 to 5)

With the vaccination population going up and that we all know that vaccination will lessen the severity of the covid-19 patient... it will be very helpful to now
(1) have the breakdown by Cat 1 - asymptomatic to Cat 5 - classifications provided.
(2) within each Category, a further breakdown by Not Vaccinated, 1st Dose, and Fully Vaccinated

This will serve as another powerful communication message to general public who still NOT register for vaccination

Total registered users for Mysejahtera per day

Requesting a timeline of mysejahtera uptake in the form of total registered users per day.
Please confirm if this data is available.

Will list this in CONTRIB.MD and submit pull request once confirmed that the data is available.

Thanks.

tests_malaysia.csv - Column label has hidden TAB chars

File: tests_malaysia.csv

Problem: Label rtk-ag and pcr has a hidden TAB character before it

Raw file snippet:

date, rtk-ag, pcr
2020-01-24,0,2
2020-01-25,0,5

ASCII Decode:

rtk-ag -> 009 114 116 107 045 097 103 (TAB R T K - A G)

Discrepancy in count icu.csv and press release

There are discrepancies between the total number of covid patient in ICU and under ventilation support calculated from the icu.csv and the ones reported in the press release (https://kpkesihatan.com).

For example, on 24/7/21, the sum for icu_covid column and vent_covid are 1397 and 799 respectively, while the number reported (https://kpkesihatan.com/2021/07/24/kenyataan-akhbar-kpk-24-julai-2021-situasi-semasa-jangkitan-penyakit-coronavirus-2019-covid-19-di-malaysia/) were 950 and 468 respectively.

Demographic breakdown?

Excellent initiative, many thanks for posting this data. I wonder if it would be possible to offer tabulations of cases, deaths, tests, and vaccinations by age groups and sex breakdowns? This would be valuable information for making international comparisons.
Many thanks,
Tim Riffe

hosp_x in hospital.csv

What is the description for these columns in hospital.csv? I'm sorry if this has been clarified but I can't seem to find it anywhere. Thanks.

Recovered Cases by State

How to get the daily numbers of recovered cases for each state? The discharged numbers do not match the total recovered cases for all of Malaysia combined.

Missing data in the pkrc.csv file

Hi there,
Noticed in the pkrc.csv file - there are no data included at all for "states" WP KL and WP Putrajaya?
Why is this so? I believe there are quarantine centres in both these territories.
Hope KKM is aware and will rectify this gap/ issue real soon.

Regards
Foo

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.