sunagparasu / ml_for_drug_discovery

Using machine learning to identify potential drug candidates for acetylcholinesterase

Jupyter Notebook 100.00%

ml_for_drug_discovery's Introduction

ML_for_drug_discovery

  • In Part 1, I collected bioactivity data from the ChEMBL database, pre-processed it by removing missing data points and duplicates, and labelled each compound according to a bioactivity threshold.
  • In Part 2, I performed exploratory data analysis (EDA): cleaning the SMILES notation, removing salts from the structures, calculating Lipinski descriptors, and plotting the data with box plots, scatter plots, and similar charts.
  • In Part 3, I calculated additional descriptors with PaDEL: I prepared the PaDEL input file and used it to generate fingerprints for the compounds.
  • In Part 4, I removed low-variance features, split the data into training and test sets, and then built a random forest model to predict the pIC50 values of the compounds.
  • In Part 5, I compared several ML models to determine which is best suited to the task.
  • In the last part, I wrote a script that takes an input molecule from the user and returns its predicted bioactivity score, which can be used to judge whether the molecule is a potential drug candidate.
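
The labelling step in Part 1 and the pIC50 target used in Part 4 can be sketched roughly as follows. This is a minimal, standard-library illustration and not the notebook's actual code: the record layout (`smiles`, `ic50_nm`), the function names, and the cut-offs (1,000 nM and 10,000 nM, a common convention) are all assumptions, since the README does not state the thresholds it used.

```python
import math

# ASSUMPTION: these cut-offs are a common convention, not values from the README.
ACTIVE_MAX_NM = 1_000.0      # IC50 <= 1,000 nM  -> "active"
INACTIVE_MIN_NM = 10_000.0   # IC50 >= 10,000 nM -> "inactive"


def label_bioactivity(ic50_nm):
    """Return 'active', 'inactive' or 'intermediate' for an IC50 given in nM."""
    if ic50_nm <= ACTIVE_MAX_NM:
        return "active"
    if ic50_nm >= INACTIVE_MIN_NM:
        return "inactive"
    return "intermediate"


def ic50_nm_to_pic50(ic50_nm):
    """pIC50 = -log10(IC50 in mol/L). With IC50 in nM this is 9 - log10(IC50)."""
    if ic50_nm <= 0:
        raise ValueError("IC50 must be positive")
    return 9.0 - math.log10(ic50_nm)


def preprocess(records):
    """Drop rows with missing values or a repeated SMILES, then label the rest.

    `records` is a hypothetical list of dicts with 'smiles' and 'ic50_nm' keys.
    """
    seen = set()
    cleaned = []
    for rec in records:
        smiles = rec.get("smiles")
        ic50 = rec.get("ic50_nm")
        if not smiles or ic50 is None or smiles in seen:
            continue  # missing data point or duplicate compound
        seen.add(smiles)
        cleaned.append({
            "smiles": smiles,
            "ic50_nm": ic50,
            "class": label_bioactivity(ic50),
            "pIC50": ic50_nm_to_pic50(ic50),
        })
    return cleaned


if __name__ == "__main__":
    demo = [
        {"smiles": "CCO", "ic50_nm": 500.0},
        {"smiles": "CCO", "ic50_nm": 750.0},     # duplicate SMILES, dropped
        {"smiles": "CCN", "ic50_nm": None},      # missing value, dropped
        {"smiles": "CCC", "ic50_nm": 20_000.0},
    ]
    for row in preprocess(demo):
        print(row)
```

Working on the pIC50 scale rather than raw IC50 compresses values that span several orders of magnitude, which tends to suit regression models better.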
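
The feature filtering and data split described in Part 4 might look something like the sketch below. It uses only the standard library so that it runs anywhere; the notebook itself may well rely on a library such as scikit-learn for this, which the README does not confirm. The function names, the 0.16 variance threshold, and the 80/20 split are illustrative assumptions.

```python
import random
from statistics import pvariance


def remove_low_variance(rows, threshold):
    """Keep only the columns whose population variance exceeds `threshold`.

    `rows` is a list of equal-length feature vectors (e.g. 0/1 fingerprint bits).
    Returns the filtered rows and the indices of the columns that were kept.
    """
    columns = list(zip(*rows))
    keep = [i for i, col in enumerate(columns) if pvariance(col) > threshold]
    filtered = [[row[i] for i in keep] for row in rows]
    return filtered, keep


def split_train_test(rows, targets, test_fraction=0.2, seed=42):
    """Shuffle indices reproducibly and split features and targets together."""
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)
    n_test = max(1, round(len(indices) * test_fraction))
    test_idx, train_idx = indices[:n_test], indices[n_test:]

    def pick(seq, idx):
        return [seq[i] for i in idx]

    return (pick(rows, train_idx), pick(rows, test_idx),
            pick(targets, train_idx), pick(targets, test_idx))


if __name__ == "__main__":
    # Toy fingerprints: column 0 is always 0 and column 2 is always 1,
    # so neither carries information and both are removed.
    fingerprints = [[0, 1, 1], [0, 0, 1], [0, 1, 1], [0, 0, 1]]
    pic50 = [6.2, 4.8, 7.1, 5.0]  # made-up target values for illustration

    # ASSUMPTION: 0.16 = 0.8 * (1 - 0.8), a threshold often used for binary features.
    features, kept = remove_low_variance(fingerprints, threshold=0.16)
    x_train, x_test, y_train, y_test = split_train_test(features, pic50, 0.25)
    print(kept, len(x_train), len(x_test))
```

Dropping near-constant fingerprint bits before training shrinks the feature matrix without discarding much signal, and fixing the shuffle seed keeps the split reproducible when several models are compared on the same data.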
