GithubHelp home page GithubHelp logo

nehaar / lung_cancer_biomarker Goto Github PK

View Code? Open in Web Editor NEW

This project forked from utshabkg/lung_cancer_biomarker

0.0 0.0 0.0 23.4 MB

Bioinformatics Project. Trying to find out the most influential metabolomic biomarkers(from 158) for predicting Lung Cancer. Also, trying to make some predictions about the disease.

Io 2.21% TeX 0.35% Jupyter Notebook 97.44%

lung_cancer_biomarker's Introduction

Author Co Author Co Author MIT Contributions welcome Stars

Metabolomic biomarkers play a vital role in the early identification and prediction of cancer. It is possible to save numerous lives if biomarkers are used to assist medical providers in diagnosing their patients faster. Many researchers have been trying to identify the crucial biomarkers in the early diagnosis of diseases. This paper presents several steps divided into two phases for determining the most important metabolomic biomarkers in the blood for lung cancer prediction using Plasma and Serum samples. We used the Shapiro–Wilk Test, Bartlett’s Test, Levene’s Test, Student’s t-Test, and Kruskal–Wallis Test in the first phase to determine the potential biomarkers. Recursive Feature Elimination with Random Forest was used to identify the final most dominant metabolomic biomarker at the second phase. Lastly, we ended with Ridge Classifier and XGBoost Classifier to assess the consistency of our approaches. Despite the declining number of metabolites up to a greater level, our prediction accuracy was 100% and 90.91% for Plasma and Serum samples, respectively which is higher than the state-of-the-art method. Finally, we made some analysis using the most dominant metabolites that can serve as a source of inspiration for our work.

Setup the Project after git clone.

  1. Open the directory: ~/final and best approach/. You will find mainly 2 approaches I have applied.
  2. First, we go with Approach 1.
  3. Read and run these notebooks one after another by this sequence:
    • plasma_test_final.ipynb
    • serum_test_final.ipynb
    • specific_metabolics_accuracy_final.ipynb
  4. Now it's time for Approach 2.
  5. Read and run this notebook: exploratory_analysis.ipynb.
  6. For the mixed up approach, which have been added as a merge(apporaches 1 and 2), simply run the notebook: mixed_up.ipynb.

Thank you. Please let us know, if you find any mistake or way of development in this repo. Cheers!

Read our Published Journal Research Paper based on this repository. Cite if this helps your work:

    @article{ghosh2022most,
    title={Most dominant metabolomic biomarkers identification for lung cancer},
    author={Ghosh, Utshab Kumar and Al Abir, Fuad and Rifaat, Nahian and Shovan, SM and Sayeed, Abu and Hasan, Md Al Mehedi},
    journal={Informatics in Medicine Unlocked},
    volume={28},
    pages={100824},
    year={2022},
    publisher={Elsevier}
    }

lung_cancer_biomarker's People

Contributors

utshabkg avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.