GithubHelp home page GithubHelp logo

srimallipudi / capstone-predicting-healthcare-service-type-from-patient-characteristics Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 2.51 MB

This project aims to develop a predictive model determining the type of healthcare service (Inpatient/Outpatient/Emergency) needed based on patient characteristics.

Python 100.00%

capstone-predicting-healthcare-service-type-from-patient-characteristics's Introduction

Predictive Healthcare Service Utilization

Project Overview:

This project focuses on predicting the type of healthcare service a patient might require whether it be Inpatient / Outpatient / Emergency / Residential care based on their demographics, socio-economic factors, and medical conditions. By analyzing a comprehensive dataset comprising patient records, including demographic information, medical history, and healthcare service utilization patterns, we aim to develop predictive models that can assist healthcare providers in efficiently allocating resources and enhance timely & appropriate care for patients.

Data Exploration:

We begin by exploring the Patient Characteristics Survey dataset, which includes information on 196,102 patients. Outpatient care emerges as the most utilized service, with approximately 68% of patients preferring this option. We observe demographic trends, regional disparities, and ethnic influences on healthcare utilization. Insurance type significantly impacts healthcare choices, and certain medical conditions are more prevalent among specific patient groups.

Data Pre-Processing:

The dataset undergoes several pre-processing steps to ensure data quality and compatibility for modeling. We integrate additional datasets by zip code to incorporate other relevant social and environmental factors such as Median house income, number of hospitals, number of parks in that particular zip code. Irrelevant columns are removed, and missing values are handled using various strategies such as imputation and dropping rows. We perform normalization and feature selection techniques to prepare the dataset for modeling.

Modeling and Evaluation:

We train various machine learning models, including Random Forest, Decision Tree, Gradient Boosting, Neural Networks, Logistic Regression, and Naive Bayesian, on three distinct datasets: Original, Oversampled, and Oversampled with PCA. Models are evaluated based on accuracy, precision, recall, and F1 score. Random Forest, Decision Tree, and Gradient Boosting demonstrate strong predictive capabilities for healthcare service utilization.

Table:

Classification Model Precision Recall Accuracy Computation Time (in sec)
Random Forest 0.98 0.97 0.97 215.25
Decision Tree 0.96 0.96 0.96 22.84
Gradient Boosting 0.78 0.79 0.79 2322.39

Methodological Contributions:

Throughout the project, we implement rigorous methodologies to ensure reliable results. We employ Train-Validate-Test splits and stratified sampling for model evaluation. Additionally, we experiment with oversampling techniques and Principal Component Analysis (PCA) for dimensionality reduction to enhance model performance. We also explored k-means clustering for pattern identification in the dataset.

Conclusions and Recommendations:

Our findings suggest that predictive models can effectively forecast healthcare service utilization based on patient characteristics. However, challenges remain in accurately predicting emergency services and incorporating temporal changes in patient demographics and healthcare trends. We recommend updating the dataset to capture the latest trends, conducting external validation for broader applicability, and collaborating with healthcare institutions across diverse regions to enhance model robustness.

Future Projects:

Future research endeavors could focus on developing specialized models for different care needs to further refine predictive capabilities in diverse healthcare settings. Collaboration with healthcare institutions across diverse regions to collect region-specific data would also contribute to enhancing the model's generalizability and predictive accuracy. Additionally, incorporating real-time data streams and advanced machine learning techniques could further improve the accuracy and reliability of predictive models for healthcare service utilization.

Attachments:

Capstone Project Paper.pdf

Capstone Project - Presentation.pdf

capstone-predicting-healthcare-service-type-from-patient-characteristics's People

Contributors

srimallipudi avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.