This project is forked from matteoferla/display-of-preset-rosetta-ncaas

License: MIT License

Display of the preset Rosetta NCAAs

What exactly are the non-canonical amino acids in the Rosetta database folder?

Names like V02 really do not help, so there is only one way to find out: manually generate a figure for each one by recycling a large amount of existing code.

Script

The file generate.py does the work. To do this, it uses both pyrosetta and RDKit. In the case of RDKit, the bond order is guessed using a function from a blog post of mine.

In the case of pyrosetta, the mover pyrosetta.rosetta.protocols.simple_moves.MutateResidue is used because its argument new_res can be a custom parameterised residue. Shamefully, I do not know how to pass the option extra_res_fa to pyrosetta once it has been initialised: setting it as an option afterwards simply failed.

A cool detail is that for once I am not using PyMOL. In fact, the line pyrosetta.rosetta.core.pose.make_pose_from_sequence(pose, 'A', 'fa_standard') generates the pose from scratch.
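The pyrosetta steps described above can be sketched roughly as follows. This is a minimal sketch, not the code in generate.py: the helper names build_init_options and make_single_ncaa_pose are hypothetical, and it assumes the params files are handed to pyrosetta.init up front through the -extra_res_fa flag, which sidesteps rather than solves the problem of setting the option after initialisation.

```python
from typing import Iterable


def build_init_options(params_files: Iterable[str]) -> str:
    """Return an options string that loads extra residue params at start-up.

    Hypothetical helper: it only assembles the -extra_res_fa flag.
    """
    files = list(params_files)
    if not files:
        return ""
    return "-extra_res_fa " + " ".join(files)


def make_single_ncaa_pose(params_file: str, residue_name: str):
    """Build a one-residue pose and mutate it to the custom residue.

    Requires PyRosetta, which is imported lazily so that the helper
    above can be used without it.
    """
    import pyrosetta

    # Assumption: the params file is supplied once, at initialisation.
    pyrosetta.init(extra_options=build_init_options([params_file]))
    pose = pyrosetta.Pose()
    # Same call as quoted in the text: a lone alanine built from scratch.
    pyrosetta.rosetta.core.pose.make_pose_from_sequence(pose, "A", "fa_standard")
    # new_res is the name of the custom parameterised residue.
    mutate = pyrosetta.rosetta.protocols.simple_moves.MutateResidue(
        target=1, new_res=residue_name
    )
    mutate.apply(pose)
    return pose


options = build_init_options(["A34.params", "NVL.params"])
```

With PyRosetta installed, make_single_ncaa_pose("NVL.params", "NVL") would be the call for a single residue. Since pyrosetta is normally initialised only once per process, listing every params file in one call to build_init_options is probably the simpler route when many residues are needed.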

The fix_bond_order method has a problem with tryptophans because a carbon ends up with a valence of 5. Makes sense though.
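To make the valence problem concrete, here is a small check that sums the guessed bond orders around each carbon and reports any that exceed four. It is an illustration only and not part of fix_bond_order: the function overvalent_carbons and the bond list are hypothetical, and the example assumes that a naive guess places two double bonds on the ring-fusion carbon CD2 of the indole.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (atom name, atom name, guessed bond order)
Bond = Tuple[str, str, int]


def overvalent_carbons(elements: Dict[str, str], bonds: List[Bond]) -> Dict[str, int]:
    """Return the carbons whose summed bond orders exceed four."""
    totals: Dict[str, int] = defaultdict(int)
    for first, second, order in bonds:
        # Each bond contributes its order to both of its atoms.
        totals[first] += order
        totals[second] += order
    return {
        name: valence
        for name, valence in totals.items()
        if elements[name] == "C" and valence > 4
    }


# Hypothetical fragment around the fused carbon of a tryptophan side chain.
elements = {"CG": "C", "CD2": "C", "CE2": "C", "CE3": "C"}
bonds = [("CG", "CD2", 1), ("CD2", "CE2", 2), ("CD2", "CE3", 2)]
problems = overvalent_carbons(elements, bonds)
```

In this made-up fragment CD2 has three neighbours and two of its bonds were guessed as double, so it is reported with a valence of 5.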

Gallery

| 3-Letter | File | SMILES |
| --- | --- | --- |
| A34 | 2-aminomethyl-phenylalanine.params | N[C@@H](Cc1ccccc1C[NH3+])C(=O)[O-] |
| A33 | 2-amino-heptanoic_acid.params | CCCCC[C@H](N)C(=O)[O-] |
| NVL | NVL.params | CCC[C@H](N)C(=O)[O-] |
| C94 | trifluoro-leucine_ent2.params | CC(C[C@H](N)C(=O)[O-])C(F)(F)F |
| B27 | 4-methyl-phenylalanine.params | Cc1ccc(C[C@H](N)C(=O)[O-])cc1 |
| A20 | 2-allyl-glycine.params | C=CC[C@H](N)C(=O)[O-] |
| B57 | alpha-methyl-leucine.params | CC(C)CC(C)(N)C(=O)[O-] |
| C26 | homocysteine.params | N[C@@H](CCS)C(=O)[O-] |
| A31 | 2-amino-5-phenyl-pentanoic_acid.params | N[C@@H](CCCc1ccccc1)C(=O)[O-] |
| B97 | beta-chloro-alanine.params | N[C@@H](CCl)C(=O)[O-] |
| B58 | alpha-methyl-phenylalanine.params | CC(N)(Cc1ccccc1)C(=O)[O-] |
| B19 | 4-fluoro-proline.params_rot | CC(F)C[C@H](N)C(=O)[O-] |
| C00 | beta-cyclohexyl-alanine.params | N[C@@H](CC1CCCCC1)C(=O)[O-] |
| A80 | 3-hydroxy-tyrosine.params | N[C@@H](Cc1ccc(O)c(O)c1)C(=O)[O-] |
| NLU | NLU.params | CCCC[C@H](N)C(=O)[O-] |
| B96 | beta.beta-diphenyl-alanine.params | N[C@H](C(=O)[O-])C(c1ccccc1)c1ccccc1 |
| C27 | homophenylalanine.params | N[C@@H](CCc1ccccc1)C(=O)[O-] |
| B54 | alpha-methyl-3-hydroxy-tyrosine.params | CC(N)(Cc1ccc(O)c(O)c1)C(=O)[O-] |
| C30 | homoserine.params | N[C@@H](CCO)C(=O)[O-] |
| B61 | alpha-methyl-tyrosine.params | CC(N)(Cc1ccc(O)cc1)C(=O)[O-] |
| BMAA | 2-amino-3-methylamino-propanoic_acid.params | C[NH2+]C[C@H](N)C(=O)[O-] |
| C04 | beta-hydroxy-norvaline.params | CCC(O)[C@H](N)C(=O)[O-] |
| A30 | 2-amino-4-bromo-4-pentenoic_acid.params | C=C(Br)C[C@H](N)C(=O)[O-] |
| V04 | V04.params | N[C@H](C(=O)[O-])C(O)c1ccc(O)c(Cl)c1 |
| DAB | 2.4-diaminobutyric_acid.params | N[C@@H](CC[NH3+])C(=O)[O-] |
| C89 | 4-fluoro-proline_puck.params_rot | CC(F)C[C@H](N)C(=O)[O-] |
| HLU | HLU.params | CC(C)CC[C@H](N)C(=O)[O-] |
| A32 | 2-amino-octanoic_acid.params | CCCCCC[C@H](N)C(=O)[O-] |
| B95 | beta-beta-dicyclohexyl-alanine.params | N[C@H](C(=O)[O-])C(C1CCCCC1)C1CCCCC1 |
| A43 | 2-hydroxy-phenylalanine.params | N[C@@H](Cc1ccccc1O)C(=O)[O-] |
| C93 | hexafluoro-leucine.params | N[C@@H](CC(C(F)(F)F)C(F)(F)F)C(=O)[O-] |
| C20 | ethionine.params | CCSCC[C@H](N)C(=O)[O-] |
| B74 | beta-(2-naphthyl)-alanine.params | N[C@@H](Cc1ccc2ccccc2c1)C(=O)[O-] |
| C55 | tert-butyl-glycine.params | CC(C)(C)[C@H](N)C(=O)[O-] |
| B44 | 9-anthryl-alanine.params | N[C@@H](Cc1c2ccccc2cc2ccccc12)C(=O)[O-] |
| C92 | fluoro-leucine_ent2.params | CC(CF)C[C@H](N)C(=O)[O-] |
| B62 | alpha-methyl-valine.params | CC(C)C(C)(N)C(=O)[O-] |
| C41 | penicillamine.params | CC(C)(S)[C@H](N)C(=O)[O-] |
| ABA | ABA.params | CC[C@H](N)C(=O)[O-] |
| A98 | 4-amino-piperidine-4-carboxylic-acid.params | C[NH2+]CCC(C)(N)C(=O)[O-] |
| A83 | 3-methyl-histidine_prot.params | CN1CNC=C1C[C@H](N)C(=O)[O-] |
| B59 | alpha-methyl-proline.params | CCCC(C)(N)C(=O)[O-] |
| B56 | alpha-methyl-histidine.params | CC(N)(Cc1c[nH]cn1)C(=O)[O-] |
| A45 | 2-indanyl-glycine_puck2.params | N[C@H](C(=O)[O-])C1Cc2ccccc2C1 |
| BB8 | phenyl-serine.params | N[C@H](C(=O)[O-])C(O)c1ccccc1 |
| C54 | tert-butyl-cysteine.params | CC(C)(C)SC[C@H](N)C(=O)[O-] |
| MPA | MPA.params | Cc1ccc(C[C@H](N)C(=O)[O-])cc1 |
| TES | TES.params | CC(=O)NC1C(NC(=O)C[C@H](N)C(=O)[O-])OC(CO)C(OC2OC(CO)C(OC3OC(COC4OC(COC5OC(CO)C(O)C(O)C5O)C(O)C(OC5OC(CO)C(O)C(O)C5O)C4O)C(O)C(OC4OC(CO)C(O)C(O)C4O)C3O)C(O)C2NC(C)=O)C1O |
| C89 | 4-fluoro-proline_puck.params | CC(F)C[C@H](N)C(=O)[O-] |
| C53 | tert-butyl-alanine.params | CC(C)(C)C[C@H](N)C(=O)[O-] |
| A06 | 1-methyl-histidine.params | Cn1cnc(C[C@H](N)C(=O)[O-])c1 |
| A69 | 3-amino-tyrosine.params | Nc1cc(C[C@H](N)C(=O)[O-])ccc1O |
| HTY | HTY.params | N[C@@H](Cc1ccc(O)cc1O)C(=O)[O-] |
| B02 | 4-amino-tetrahydropyran-4-carboxylic_acid.params | COCCC(C)(N)C(=O)[O-] |
| HPR | HPR.params | CC(O)C[C@H](N)C(=O)[O-] |
| C60 | trifluoro-alanine.params | N[C@H](C(=O)[O-])C(F)(F)F |
| C01 | beta-cyclopentyl-alanine.params | N[C@@H](CC1CCCC1)C(=O)[O-] |
| A12 | 2.4-dimethyl-phenylalanine.params | Cc1ccc(C[C@H](N)C(=O)[O-])c(C)c1 |
| A07 | 1-methyl-histidine_prot.params | CN1C=C(C[C@H](N)C(=O)[O-])NC1 |
| A68 | 3-aminomethyl-phenylalanine.params | N[C@@H](Cc1cccc(C[NH3+])c1)C(=O)[O-] |
| V01 | V01.params | N[C@H](C(=O)[O-])C(O)c1ccc(O)c(Cl)c1 |
| C12 | cyclohexyl-glycine.params | N[C@H](C(=O)[O-])C1CCCCC1 |
| A84 | 3-methyl-phenylalanine.params | Cc1cccc(C[C@H](N)C(=O)[O-])c1 |
| C61 | trifluoro-leucine.params | CC(C[C@H](N)C(=O)[O-])C(F)(F)F |
| BCS | BCS.params | N[C@@H](CSCc1ccccc1)C(=O)[O-] |
| A78 | 3-hydroxy-phenylalanine.params | N[C@@H](Cc1cccc(O)c1)C(=O)[O-] |
| C95 | 3-chloro-phenylalanine.params | N[C@@H](Cc1cccc(Cl)c1)C(=O)[O-] |
| A91 | 4.5-dehydro-leucine.params | C=C(C)C[C@H](N)C(=O)[O-] |
| B63 | amino-ethyl-cysteine.params | N[C@@H](CSCC[NH3+])C(=O)[O-] |
| B19 | 4-fluoro-proline.params | CC(F)C[C@H](N)C(=O)[O-] |
| C03 | beta-fluoro-alanine.params | N[C@@H](CF)C(=O)[O-] |
| C91 | fluoro-leucine_ent1.params | CC(CF)C[C@H](N)C(=O)[O-] |
| A94 | 4-aminomethyl-phenylalanine.params | N[C@@H](Cc1ccc(C[NH3+])cc1)C(=O)[O-] |
| ORN | ornithine.params | N[C@@H](CCC[NH3+])C(=O)[O-] |
| DPP | 2.3-diaminopropionic_acid.params | N[C@@H](C[NH3+])C(=O)[O-] |
| IGL | 2-indanyl-glycine_puck1.params | N[C@H](C(=O)[O-])C1Cc2ccccc2C1 |
| C05 | beta-iodo-alanine.params | N[C@@H](CI)C(=O)[O-] |
| A04 | 1-amino-cyclopentane-carboxylic_acid.params | CCCC(C)(N)C(=O)[O-] |
| A92 | 4.5-dehydro-lysine.params | N[C@@H](CC=CC[NH3+])C(=O)[O-] |
| A48 | 2-methyl-phenylalanine.params | Cc1ccccc1C[C@H](N)C(=O)[O-] |
| B67 | beta-(1-naphthyl)-alanine.params | N[C@@H](Cc1cccc2ccccc12)C(=O)[O-] |
| B31 | 4-tert-butyl-phenylalanine.params | CC(C)(C)c1ccc(C[C@H](N)C(=O)[O-])cc1 |
| APA | APA.params | Nc1ccc(C[C@H](N)C(=O)[O-])cc1 |
| C16 | dipropyl-glycine.params | CC.CCCC(C)(N)C(=O)[O-] |
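Each entry in the gallery pairs a short code with a params file. In Rosetta params files the residue name is normally declared on a NAME line and the three-letter and one-letter codes on an IO_STRING line, so a lookup from file to code can be sketched as below. The function read_params_header is hypothetical and the example lines are a made-up excerpt, not the contents of any file in the database; whether generate.py obtains its codes this way is an assumption.

```python
from typing import Dict, Iterable


def read_params_header(lines: Iterable[str]) -> Dict[str, str]:
    """Collect the NAME and IO_STRING fields from the lines of a params file."""
    header: Dict[str, str] = {}
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if fields[0] == "NAME" and len(fields) >= 2:
            header["name"] = fields[1]
        elif fields[0] == "IO_STRING" and len(fields) >= 3:
            header["three_letter"] = fields[1]
            header["one_letter"] = fields[2]
    return header


# Made-up excerpt in the usual params layout (illustrative only).
example_lines = """NAME A34
IO_STRING A34 X
TYPE POLYMER
""".splitlines()
info = read_params_header(example_lines)
```

For a real file, passing an open file handle works the same way, since the function only iterates over lines.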

Contributors

matteoferla