Chemotherapy Adverse Events Detection
ML pipeline for data processing and model training for the prediction of adverse events in cancer patients. Adverse events include
- cytopenia (a condition in which blood count levels are dangerously low) prior to their upcoming treatment sessions
- acute care use (emergency department visits and hospitalizations) within a month after their treatment session
- nephrotoxicity (acute kidney injury and chronic kidney disease) within 42 days or prior to their upcoming treatment sessions
- death within weeks, months, and a year after their treatment session
The dataset, provided by ICES [1], is a population based administrative data consisting of data on demographics, cancer diagnosis and treatment, symptom questionnaires, and lab tests for approximately 120000 patients in Ontario, Canada between 2005 and 2020. By predicting adverse events in chemotherapy patients before their next scheduled chemotherapy administration, these models can aid in early intervention and facilitate tailored treatment.
This dataset is not open-source. The code can only be run in the ICES DSH (Data Safe Haven) environment.
Instructions
python scripts/data_pipeline.py [OPTIONAL args]
python scripts/train_pipeline.py [OPTIONAL args]
or convert the python files in the adverse event folders into jupyter notebooks, and run them on a jupyter browser.
Prerequisites
See env.yaml, or run
conda env create -f env.yaml
To use that conda environment on jupyter notebook, run
python -m ipykernel install --user --name=myenv
References
[1] ICES. Data Discovery Better Health. https://www.ices.on.ca/. Accessed Aug 16, 2021.