ucsd-ai4h / covid-ct Goto Github PK

View Code? Open in Web Editor NEW

1.1K 45.0 426.0 1.1 GB

COVID-CT-Dataset: A CT Scan Dataset about COVID-19

Python 47.61% Jupyter Notebook 52.39%

covid-19 ct computed-tomography dataset deep-learning computer-vision

covid-ct's People

Contributors

Stargazers

Watchers

Forkers

cjxnew xiaojun-chang xuzhikangnba clyhh lsptb ziweiniexiaoer jkooy crazychickendev lyn2018lyn liq07lzucn slayzzzzz primus-ai watchai philyueyue zhangjiahui56 yangyin2016 binliu2015 samjcheng shengyuking luckydog5 beyond-zw mingzhao2017 ahwxz123 sunflower21 cswin pursu hnhbcc zzzzlalala zhmz1326 gg-yuki leitaoxman duoergun0729 dagongzhizheng happog shiwenqi97 topologyapplied mhelkaddoury ameenbadri7 liuzhongyu superchong1987 xhufdd kingpopen lovedoubledan genhao3 veyseltrk cmendozab qixiuai ksyao2002 naveen584 jacksky64 wxwoods 752772173 ahmadrmusa mamak426 shengrenhou manik-hossain shi3z qt20191125 nosratullah yushanlu wsheffel zengsn andrewpalumbo xiaoye77 cv-ip wowotou1022 carvalhoamc ncovgt2020 mohamedelkaddoury jwjohnson314 hongweilibran masterhazel2 mmmmmmiracle ryanleave birajaghoshal keplaxo danielaplusplus leriomaggio zhaolei-momo cyber-addict monjoybme ylch gpdas jasonbian97 kirtiswagat kiat qqsea nickhub919 regfish7 xiaoxuegao499 fengxpkingwolf lingzi201314 ndinhtuan shaikhzhas gustavocac shuaiw24 cosimofang yf817 alexgzhou piqiuking233

covid-ct's Issues

How the non covid images are collected.

How the non-covid images are collected. ? I did not find any information in Data creation step of the paper

Missing some files and variable 'alpha'

Hello, when i run the code 'CT_predict', a error occurred: alpha doesn't exist and i found this txt also doesn't exist.
f = open('COVID-CT/model_result/DenseNet_{}.txt'.format(alpha), 'a+')

Thanks for providing images again!

Visual results

Dears

Based on your implementation is it possible to show any visual results? Something like a heatmap...

packages issue

pls list all the requirement(packages)..

DenseNet Questions

Thank you for your effort. Please I have two questions.

What is the required total number of epochs to get the reported results? Because in DenseNet_predict.py file I noticed that there are 3000 epochs which is extremely high!!
In the DenseNet_predict.py, did you tuned the networks pretrained weights or you are using them as is?
Thanks

i cloned, then i install everything, but when i try to run "python CT_predict.py" i am getting following error: No such file or directory: 'COVID-CT/train_set'" can you please rell, what chnges i need to made after clone.

yes thanks for providing ct images.
Thanks

About the format of the submission file for COVID-19 CT competition

Thank you very much for providing the dataset and baseline model! I am participating in the COVID-19 CT competition (https://covid-ct.grand-challenge.org/), but I cannot find the format of the submission file on the competition page. Could you provide a sample submission file or describe the format of the submission file in detail?

COVID-19 patient data for research

There's a new dataset that HM Hospitales in Spain is sharing with qualified researchers, consisting of anonymized medical records from COVID-19 patients:

https://www.hmhospitales.com/prensa/notas-de-prensa/comunicado-covid-data-save-lives

From April, 25th onwards, the 'COVID DATA SAVE LIVES' contents will be distributed online, free of cost and other access barriers to worldwide health care institutions, universities, and scientific organizations...

To access this initiative, it is mandatory to submit an application via e-mail to [email protected] which will be assessed by the HM Hospitales Data Science Commission and, where appropriate, reviewed by the HM Hospitales Clinical Research Ethics Committee.

Can such a dataset train?

The resolution of photos is not the same, but also contains a large number of annotations

Updata Code

Hello,

Thank you for your data set .It is very valuable.

I am a graduate student, I clone your zip .But there is bug in code.
So I want a new updated code. My email [email protected]

Thank you again.

A error of the baseline method densenet169

Excuse me~ when I run the baseline methods, I met the following question:
RuntimeError: Given groups=1, weight of size 32 3 3 3, expected input[10, 1, 225, 225] to have 3 channels, but got 1 channels instead.

I have no idea, can you help me?

No such file or directory: 'new_data/newtxt/train.txt'

When I run DenseNet_predict.py, the error comes "No such file or directory: 'new_data/newtxt/train.txt'."

It seem that this code is too old to run. Can anybody update recently?

Failed to decompress on MacOsX

$ unzip NonCOVID-CT-Images.zip

Archive:  NonCOVID-CT-Images.zip
  End-of-central-directory signature not found.  Either this file is not
  a zipfile, or it constitutes one disk of a multi-part archive.  In the
  latter case the central directory and zipfile comment will be found on
  the last disk(s) of this archive.
unzip:  cannot find zipfile directory in one of NonCOVID-CT-Images.zip or
        NonCOVID-CT-Images.zip.zip, and cannot find NonCOVID-CT-Images.zip.ZIP, period.

FIilenotfound error in the File: DenseNet_predict.py

Guys we are trying to run your code. The problem is this error below. I think your github is missing files. Can you please give us a workaround?:

FileNotFoundError Traceback (most recent call last) <ipython-input-18-b71253aaae34> in <module>() 156 txt_COVID='new_data/newtxt/train.txt', 157 txt_NonCOVID='/content/drive/My Drive/Covid_CT/Content2/CovidCTNew/Data-split/COVID/trainCT_NonCOVID.txt', --> 158 transform= train_transformer) 159 valset = CovidCTDataset(root_dir='new_data/4.4_image', 160 txt_COVID='new_data/newtxt/val.txt',

Metadata

Hi,

Thank you for the for the effort of putting this dataset together.

Probably you have checked the metadata that is being collected for the covid x-ray dataset project. I would suggest on collecting a more specific and detailed metadata on each scan, as in the mentioned project, so that better insights can be obtained from the dataset.

Thanks!

How to Run your code? Please give a workflow.

Added to Open Source COVID-19

Thanks for your work to help the people in need! Your site has been added to the Open-Source-COVID-19 page, which collects open source projects related to COVID-19, including maps, data, news, api, analysis, medical and supply information, etc. Please share to anyone who might need the information in the list, or will possibly contribute to some of those projects. You are also welcome to recommend more projects.

http://open-source-covid-19.weileizeng.com/

Cheers!

Crystallization in non-covid images

Hello, thank you very much for making this dataset available. I was looking at the non-covid images and I realized that even though they are classified as non-covid, a "crystallization" similar to covid is observed, do you know what other type of disease it is, if it is not covid?

Non-Covid:

Covid:

How to download dataset?

please delete this, wrong issue

please delete this, I am sorry

Do we prediction code or the gradcam code.

Masks

Can you provide masks (ground glass opacity, consolidation) to evaluate a segmentation algorithm?

I didn't get the same result by your best Self-Trans.pt model

according to your readme file, your result was F1: 0.85，Accuracy: 0.86，AUC: 0.94，but mine was F1:0.74, Accuracy:0.77, AUC:0.88. Should I use the Self-Trans.pt as a pretrained model and fine tune on it?

Data split issue: should be patient level

It seems that the data split is with image-level, which means slices from the same patient could come to tran/test/val. This will bring some data-leakage problem. It is more reasonable to split in the patient level so that images from same patient cannot be assigned to different set(train/test/val)

Learning rate

Hello,

When I run the code 'DenseNet_predict.py', I think there is overfitting that the validation loss could not decrease properly without changing anything.

I got this trend when I run your code.

Could you please explain how to deal with this overfitting?

Thanks in advance.

Incorrect file names in the val datasplit file valCT_COVID.txt

For example, in the file Data-Splits/COVID/valCT_COVID.txt, there is a file called 2020.03.16.20036145-p19-128-4.png.

However, in the image processed folder, Image-Processed/COVID, the file is saved using a JPEG extension as 2020.03.16.20036145-p19-128-4.jpeg .

There are some other files like this too such as Images-processed/CT_COVID/2020.03.18.20038125-p15-54-2.png. I got over this by making sure all files had the same extension but please fix this.

DICOM files

Would Dicom files be part of future updates?

Covid is labeled 0 and NonCovid is labeled 1?

according to your code

self.classes = ['CT_COVID', 'CT_NonCOVID']
self.num_cls = len(self.classes)
self.img_list = []
for c in range(self.num_cls):
    cls_list = [[os.path.join(self.root_dir,self.classes[c],item), c] for item in read_txt(self.txt_path[c])]
    self.img_list += cls_list
self.transform = transform

the label of 'CT_COVID' is 0 and 'CT_NonCOVID' is 1, but when you calculate the metrics，like TP, your code is

TP = ((predlist == 1) & (targetlist == 1)).sum()

I think this is wrong since 'CT_COVID' is positive and its label is 0, we hope the model detect CT_COVID more accurate than CT_NonCOVID, so in this circumstance TP should be?

TP = ((predlist == 0) & (targetlist == 0)).sum()

How do you select the model to use for the test data?

Hi,

I am not sure how do you select the model to use for the test data. It seems to me in your code you did not use the information from the validation dataset to choose the model to use on the test data. I also did not observe any early stopping criteria being used. Is that true? If it is, how do you select the model and what is the purpose of the validation dataset?

Thanks,
Yangze

change learning rate

In your setting in 'DenseNet_predict.py', it seems you miss schedule.step() to change the learning rate. Could you please check this out?

What is a non-COVID class?

It is a bit vague: does it include all non-COVID, i.e. Normal + Common Pneumonia, or only Common Pneumonia/only Normal?

Covid

Does the training dataset have overlaps with SIRM?

Hi,
Thanks for the effort!
In the report, you only mentioned that the images of training set are collected from papers.
Can you guarantee that none of those are originally from SIRM?

How to contribute?

Hi,

I have been spending some hours every other day to check for papers containing CT or Xray images of COVID-19 patients. Are you accepting contributions in any way? In the case of the covid-19 x-ray collection project, they keep a list of already collected papers, and one can inform of not-yet-collected papers in the issues section. Please advice us on how to contribute with papers containing CT images that you might have not yet included in your dataset.

I am not a physician, but upon consulting with collaborators it appears that the following issues should be addressed.

Soft tissue scans appear to be unpopular for this kind of application.
It appears that some cases of viral pneumonia are present in the control set. These will have similar (if not identical) manifestations to COVID-19.

Thank you again.

ucsd-ai4h / covid-ct Goto Github PK

covid-ct's People

Contributors

Stargazers

Watchers

Forkers

covid-ct's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs