breimanntools / aaanalysis Goto Github PK
View Code? Open in Web Editor NEWPython framework for interpretable protein prediction
Home Page: https://aaanalysis.readthedocs.io
License: BSD 3-Clause "New" or "Revised" License
Python framework for interpretable protein prediction
Home Page: https://aaanalysis.readthedocs.io
License: BSD 3-Clause "New" or "Revised" License
In the "README.rst" file, there is an attempt to showcase two images, but unfortunately, they are currently unreachable.
Links:
Build status
Python-check
Conda - Package Version
The provided demo code crashes, because the loaded scales contains int64 dtype columns, which your code can't handle.
The code crashes during CPP, the int64s are created in load_scales. If you convert the int64 to python ints (which should be able to fit the int64) it works. I'd recommend to either include this fix in your codebase, or handle int64s. Thanks
aa.options["verbose"] = False
df_scales = aa.load_scales()
df_scales = df_scales.astype({col: int for col in df_scales.select_dtypes(include='int64').columns})
non_numeric_columns = df_scales.select_dtypes(exclude=[np.number]).columns.tolist()
dict_dtype = dict(df_scales.dtypes)
non_numeric_columns2 = [(col, dict_dtype[col]) for col in dict_dtype if dict_dtype[col] not in [np.number, int, float]]
df_seq = aa.load_dataset(name="DOM_GSEC", n=50)
Documentation regarding dataset.
Hello, I was wondering whether you could provide some information regarding the DOM_GSEC dataset.
Your .tsv file contains the following entry:
Q14802 MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYYDWHSLQVGGLICAGVLCAMGIIIVMSAKCKCKFGQKSGHHPGETPPLITPGSAQS 0 37 59 NSPFYYDWHS LQVGGLICAGVLCAMGIIIVMSA KCKCKFGQKS
or as an object:
entry Q14802
sequence MQKVTLGLLVFLAGFPVLDANDLEDKNSPFYYDWHSLQVGGLICAG...
label 0
tmd_start 37
tmd_stop 59
jmd_n NSPFYYDWHS
tmd LQVGGLICAGVLCAMGIIIVMSA
jmd_c KCKCKFGQKS
the problem is that if you look into uniprot,
extracelluar domain (I believe this is jmd_n) is: NDLEDKNSPFYYDWHSLQ
transmembrane domain is: VGGLICAGVLCAMGIIIVMSA
cytoplasmic domain is (I believe this to be jmd_c): KCKCKFGQKSGHHPGETPPLITPGSAQS
and I can't resolve the difference.
I mean, the transmembrane domain seems to correspond to the entry in uniprot, the jmd_n terminal seems to be a random substring of the extracelluar domain, and the jmd_c seems to be only the first third of the cytoplasmic domain. Could you elaborate why? Thanks!
Check all references and try to get same error locally as in readthedocs page.
Error message
"KeyError: 'refid'
Exception occurred:
File "/home/docs/checkouts/readthedocs.org/user_builds/aaanalysis/envs/latest/lib/python3.9/site-packages/docutils/nodes.py", line 652, in getitem
return self.attributes[key]
KeyError: 'refid'"
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.