GithubHelp home page GithubHelp logo

folding_tools's Introduction

πŸ“– Table of contents

πŸ’‘ Notes

  • The following lists are curated by humans, as such may be incomplete
  • We only include software targeting the folding problem combining learnings from AlphaFold 2 and protein language models. You may find other ML on protein tools at Kevin's incredible ML for proteins list.
  • We do not wish to advertize one tool over any other, but simply list the tools we are aware of in either random or alphabetical order
  • Any suggestions for improvements and additions are welcome as issues or pull requests
  • Projects we identify as discontinued are marked with πŸ’€ and in a section at the end

⚑️ Brought to you by:


Predictors

[in alphabetical order]

  • MSA-based (uses Multiple Sequence Alignments (MSAs) as input)

    • AlphaFold2
      • The original AlphaFold 2 method
      • Features: monomer, multimer
      • Other: Colab Notebook
    • ColabFold
      • Faster AF2 compiling and MSA generations
      • Features: monomer, multimer
      • Other: localcolabfold
    • FastFold
      • Runtime improvements to OpenFold (see below)
      • Features: monomer
    • HelixFold
      • Reimplementation of AF2 in PaddlePaddle
      • Features: monomer
    • MEGA-Fold
      • Reimplementation of AF2 in MindSpore; provides training code, training dataset and new model params.
      • Features: monomer
    • OpenFold
      • Reimplementation of AF2 in PyTorch; provides training code, training dataset and new model params.
      • Features: monomer
      • Other: Colab Notebook
    • RoseTTAFold
      • Reproduced AF2 in PyTorch before details of AF2 were available; new model parameters.
      • Features: monomer
      • Other: Unofficial Colab Notebook
    • Uni-Fold
      • Reimplementation of AF2 in PyTorch; provides training code and new (monomer/multimer) model parameters.
      • Features: monomer, multimer
      • Resources: Colab Notebook
    • Uni-Fold-jax
      • Implementation of AF2's training code.
  • pLM-based (using embeddings from protein Language Models (pLMs) as input)

  • Other

    • EquiFold
      • Diffusion model to predict protein structures (specifically antibodies)
      • Features: monomer

Tools and Extensions

  • gget (AF2)
  • alphafold_finetune
  • AlphaPulldown
    • protein-protein interaction screens using AlphaFold-Multimer
  • ColabDesign
    • Backprop through AlphaFold for protein design
  • AF2Rank
    • Rank Decoy Structures/Sequences using AlphaFold
    • Resource: Colab Notebook
  • protein_structure_module_of_AF2
    • IPA implementation in pytorch

Databases of predictions

Datasets for training


Webservers


Discontinued

folding_tools's People

Contributors

agitter avatar avilella avatar duerrsimon avatar gelnesr avatar ieremie avatar lfelipesv avatar rcrehuet avatar sacdallago avatar sokrypton avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

folding_tools's Issues

Add a "Features" bullet-point

Hi all,

Thanks for starting this repo, I think it's a great way to have everything going on in one place.

Could I suggest that each entry has a "Feature" bullet-point, which includes key:value pairs like:

Method XYZ
- Paper: URL
- Code repo: URL
- Features = Model_input:MSA; Multimers:no
Method ABC
- Paper: URL
- Code repo: URL
- Features = Model_input:pLM; Multimers:yes

Something like that. My idea is to contribute to the repo but also have a table format based on features, a bit like the ColabFold tables, e.g. see below:

image

See a manually-typed table I created in Google Sheets:

image

Thanks for considering

TmAlphaFold database

We created a database, called Transmembrane AlphaFold database (TmAlphaFold database), where membrane localization was calculated and evaluation of AlphaFold2 predicted alpha-helical transmembrane protein structures was given. We applied TMDET, a simple geometry-based method to visualize the likeliest position of the membrane plane. In addition, we calculated several parameters to evaluate the location of the protein into the membrane. This also allows TmAlphaFold database to show whether the predicted 3D structure is realistic or not. The TmAlphaFold database is available at https://tmalphafold.ttk.hu/ and has been published in NAR (https://pubmed.ncbi.nlm.nih.gov/36318239/)
Please consider to include TmAlphaFold database in your database list.

Submission of folds and tangentials

I could not spot a way to submit (=shameless advertise one's own) repos, so I am opening an issue for this β€”please close it if it is not welcome!

Long term strategy

Since this repo is now addressing multiple topics (folding, design, pLMs), it may make sense to:

  1. Have a general README that redirects to the various sub-lists
  2. Transfer this repo to an agnostic organization (for this purpose I prospectively created https://github.com/biolists) so that it's not "sacdallago" who owns it -- but the community
  3. Fork into that org other lists, past or future, e.g. @yangkky 's Machine learning for proteins: https://github.com/yangkky/Machine-learning-for-proteins

CC @sokrypton @duerrsimon @noeliaferruz

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.