chboey / pathopredict

Predictive Modelling of Pathological Complete Response Classification and Relapse-Free Survival Regression in Cancer Patients

Python 100.00%
breast-cancer-prediction healthcare-application machine-learning medical-imaging predictive-modelling

Predictive Modelling of Pathological Complete Response and Relapse-Free Survival in Cancer Patients

Overview

This repository contains the implementation of a machine learning project aimed at enhancing the accuracy of predicting Pathological Complete Response (PCR) and Relapse-Free Survival (RFS) in chemotherapy-treated breast cancer patients. The project analyzes a dataset with clinical and MRI features across 400 patients to provide additional prognosis information for personalized treatment recommendations.

Introduction

This project aims to improve the predictions of PCR and RFS in breast cancer patients using various machine learning techniques. Accurate predictions can guide treatment decisions and provide better prognostic information.

Dataset

The dataset used in this project is publicly available from the American College of Radiology Imaging Network (ACRIN).

Feature Type     Number of Features
Clinical         10
MRI-based        107

Total records: 400

Methods

Data Preprocessing

  • Handling Missing Values: Missing values are encoded as '999'; rows containing them were removed. Median imputation was used for any remaining missing values.
  • Feature Selection: An ANOVA F-test was used to score the importance of each feature. Features with an F-score above 3.5 were selected.
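
The two preprocessing steps above can be sketched in plain Python. This is a minimal, library-free illustration rather than the repository's own code: the toy rows, the helper names, and the use of `None` for the "remaining" missing values are all assumptions. Only the '999' sentinel and the 3.5 threshold come from the text.

```python
from statistics import mean, median

MISSING_SENTINEL = 999  # sentinel named in the text
F_THRESHOLD = 3.5       # selection threshold named in the text


def drop_sentinel_rows(rows, labels):
    """Remove rows (and their labels) that contain the 999 sentinel."""
    kept = [(r, y) for r, y in zip(rows, labels) if MISSING_SENTINEL not in r]
    return [r for r, _ in kept], [y for _, y in kept]


def median_impute(rows):
    """Replace remaining missing values (None here) with the column median."""
    n_cols = len(rows[0])
    medians = [median(r[j] for r in rows if r[j] is not None) for j in range(n_cols)]
    return [[medians[j] if r[j] is None else r[j] for j in range(n_cols)] for r in rows]


def anova_f_score(values, labels):
    """One-way ANOVA F statistic of a single feature across the label groups."""
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(y, []).append(v)
    grand_mean = mean(values)
    n, k = len(values), len(groups)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values())
    ss_within = sum((v - mean(g)) ** 2 for g in groups.values() for v in g)
    if ss_within == 0:
        # No variation inside the groups: treat any separation as infinite.
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def select_features(rows, labels, threshold=F_THRESHOLD):
    """Return the indices of columns whose F-score exceeds the threshold."""
    n_cols = len(rows[0])
    scores = [anova_f_score([r[j] for r in rows], labels) for j in range(n_cols)]
    return [j for j, s in enumerate(scores) if s > threshold], scores


# Toy data (hypothetical): two features, binary PCR label.
rows = [
    [1.0, 5.0],
    [2.0, None],   # missing value to be imputed
    [999, 4.0],    # sentinel row to be dropped
    [8.0, 5.5],
    [9.0, 4.5],
    [1.5, 5.2],
    [8.5, 4.8],
]
labels = [0, 0, 1, 1, 1, 0, 1]

clean_rows, clean_labels = drop_sentinel_rows(rows, labels)
imputed_rows = median_impute(clean_rows)
selected, scores = select_features(imputed_rows, clean_labels)
print(selected, scores)
```

In this toy example only the first feature separates the two classes, so it is the only one that passes the threshold.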

Model Selection

Model Type        Model
Classification    Logistic Regression
                  AdaBoost Classifier
                  Support Vector Machine (SVM)
                  Voting Classifier
Regression        Random Forest Regressor
                  Support Vector Regressor (SVR)
                  LASSO Regressor
                  Ridge Regressor
                  AdaBoost Regressor
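
A voting classifier combines the predictions of several base classifiers into one. The sketch below shows hard (majority) voting in plain Python to illustrate the idea. Whether the project uses hard or soft voting, and which base models feed the ensemble, is not stated here, so the three prediction lists are hypothetical stand-ins.

```python
from collections import Counter


def hard_vote(predictions_per_model):
    """Combine per-model label lists by majority vote, sample by sample."""
    combined = []
    for sample_preds in zip(*predictions_per_model):
        # most_common(1) yields the most frequent label; using an odd number
        # of voters avoids ties in a binary problem.
        combined.append(Counter(sample_preds).most_common(1)[0][0])
    return combined


# Hypothetical predictions from three base classifiers on five patients
# (1 = PCR, 0 = no PCR).
logistic_preds = [1, 0, 0, 1, 0]
adaboost_preds = [1, 1, 0, 0, 0]
svm_preds = [0, 1, 0, 1, 1]

ensemble_preds = hard_vote([logistic_preds, adaboost_preds, svm_preds])
print(ensemble_preds)  # [1, 1, 0, 1, 0]
```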

Model Evaluation

Evaluation Type    Metric
Classification     Balanced Accuracy
                   Precision
Regression         Mean Absolute Error (MAE)
Tuning             Grid Search
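
Grid search evaluates every combination of candidate hyperparameter values and keeps the best-scoring one. The following is a minimal sketch of that loop. The parameter names (`n_estimators`, `learning_rate`), their candidate values, and `toy_score` are hypothetical; in practice the scoring function would fit a model and return a validation metric such as balanced accuracy, or negated MAE for regression.

```python
from itertools import product


def grid_search(param_grid, score_fn):
    """Evaluate every parameter combination and return the best one."""
    names = list(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[n] for n in names)):
        params = dict(zip(names, values))
        score = score_fn(params)
        if score > best_score:  # higher scores are better
            best_params, best_score = params, score
    return best_params, best_score


# Hypothetical grid, for illustration only.
param_grid = {
    "n_estimators": [50, 100, 200],
    "learning_rate": [0.1, 0.5, 1.0],
}


def toy_score(params):
    # Stand-in for "fit the model and return a validation score";
    # it peaks at n_estimators=100, learning_rate=0.5.
    return -abs(params["n_estimators"] - 100) / 100 - abs(params["learning_rate"] - 0.5)


best_params, best_score = grid_search(param_grid, toy_score)
print(best_params, best_score)
```

The cost grows with the product of the list lengths (nine evaluations here), which is why grids are usually kept coarse.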

Results

The two best-performing models from each of the classification and regression tasks are reported below.

Classification Results

Model                  Balanced Accuracy    Precision
AdaBoost Classifier    73.43%               70.59%
Voting Classifier      70.68%               83.33%
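
For reference, balanced accuracy is the mean of sensitivity and specificity, while precision is the share of positive predictions that are correct, so the two metrics can rank models differently. The sketch below computes both for a binary problem. The labels are made up for illustration and are not taken from the dataset.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels where 1 is the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def balanced_accuracy(y_true, y_pred):
    """Mean of the recall on the positive class and on the negative class."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


def precision(y_true, y_pred):
    """Fraction of positive predictions that are truly positive."""
    tp, _, fp, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fp)


# Hypothetical imbalanced example: 3 positives, 7 negatives.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]

ba = balanced_accuracy(y_true, y_pred)
prec = precision(y_true, y_pred)
print(f"balanced accuracy = {ba:.4f}, precision = {prec:.4f}")
```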

Regression Results

Model                      Mean Absolute Error (MAE)
Random Forest Regressor    20.43
LASSO Regressor            20.84
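
MAE is the average absolute difference between predicted and observed values, expressed in the same unit as the target. A minimal sketch follows; the numbers are invented, and since the unit of RFS is not stated here, none is assumed.

```python
def mean_absolute_error(y_true, y_pred):
    """Average of |observed - predicted| over all samples."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical observed and predicted RFS values.
observed = [30.0, 45.0, 60.0, 12.0]
predicted = [25.0, 50.0, 40.0, 22.0]

mae = mean_absolute_error(observed, predicted)
print(mae)  # 10.0
```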

Conclusion

The AdaBoost Classifier was considered the best model for predicting PCR on the strength of its balanced accuracy (73.43%, the highest reported), although the Voting Classifier achieved higher precision (83.33% versus 70.59%). The Random Forest Regressor was the most effective for predicting RFS, with the lowest MAE (20.43).
