GithubHelp home page GithubHelp logo

rajarshimaity3235 / breast-cancer-diagnosis Goto Github PK

View Code? Open in Web Editor NEW
3.0 1.0 0.0 1.12 MB

Prediction of Breast Cancer using Logistic Regression/Decision Trees/Boosted Decision Trees

License: MIT License

Jupyter Notebook 100.00% HTML 0.01%
machine-learning data-science data-visualization logistic-regression decision-trees boosted-decision-trees breast-cancer-wisconsin

breast-cancer-diagnosis's Introduction

Breast-Cancer-Diagnosis

For any person suffering from Breast Cancer , the diagnosis whether the cancer is benign or malignant is imperative.We are going to predict whether the tumor is malignant or benign using Machine Learning Models We have developed Logistic Regression/ Decision Tree/ Boosted Tree Classifier Models to classify the diagnosis of Breast Cancer cases into malignant and benign.

Invasive Diagnosis Procedures commonly used for diagnosis of Breast Cancer

Biopsy: For this test, the doctor removes tissue or fluid from your breast. They look at it under a microscope to check for if cancer cells and, if they’re there, learn which type they are. Common procedures include:

  • Fine-needle aspiration:This is for easy-to-reach lumps or those that might be filled with fluid.
  • Core-needle biopsy :This type uses a bigger needle to remove a piece of tissue.
  • Surgical (open) biopsy: A surgeon removes the entire lump along with nearby breast tissue.
  • Lymph node biopsy: The doctor removes a part of the lymph nodes under your arm to see if the cancer has spread.
  • Image-guided biopsy: The doctor uses imaging to guide the needle.

Data collection We are using the following Kaggle Dataset https://www.kaggle.com/uciml/breast-cancer-wisconsin-data.Contains Fine-needle aspiration results.It Contains 32 columns and 569 rows.

About the Dataset Attribute Information:

  1. ID number
  2. Diagnosis (+1 = malignant, -1 = benign)
  3. Ten real-valued features are computed for each cell nucleus:
  • radius
  • texture
  • perimeter
  • area
  • smoothness
  • compactness (perimeter^2 / area - 1.0)
  • concavity (severity of concave portions of the contour)
  • concave points (number of concave portions of the contour)

List of important features :

  • Standard error of fractal dimension
  • Standard error of concave points
  • Worst concave points
  • Mean of concave points

Logistic Regression Model --

Accuracy on Training Data : 0.98
Accuracy on Test Data : 1.00

Logistic Regression with L2 regularization

Boosted Decision Tree Model

Accuracy on Training Data : 0.97
Accuracy on Test Data : 0.95

Comparing Boosted Tree Models with different number of iterations

Decision Tree Model

Accuracy on Training Data : 0.94
Accuracy on Test Data : 0.96

Author

Rajarshi Maity [email protected]

breast-cancer-diagnosis's People

Contributors

rajarshimaity3235 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.