Comments (6)
I tried to feed the program with 1000 structures from the query database and it ran to completion. With 2000 structures it gets the same error. Could it be some missing flags or a lack of RAM on my end?
from foldseek.
Thank you for reporting it. Could you please share your 2,000 structures that causes the error?
from foldseek.
Hi, Thank you for the quick reply!
By running the data in chunks I managed to figure out which ones were the problematic PDB files. It boils down to 14 AlphaFold models for 2 proteins that are huge (M0QZD8 and P04114) with 3,266 and 4,563 residues.
I'm attaching two links to OneDrive with the structures and the database
Problematic structures https://liveuclac-my.sharepoint.com/:u:/g/personal/ucbtnb4_ucl_ac_uk/ETEYfojc9DRKhPyqRVOpAXEBfw6xkHOpTVYS46zbGctD5A?e=cRqNve
Database: https://liveuclac-my.sharepoint.com/:u:/g/personal/ucbtnb4_ucl_ac_uk/EezQQjKSFppJsK3da9a6pCQB15lR6-4WSF_AZX_dSwaPug?e=RaAXsY
Is it possible to output in the log which structure in a set is breaking the run?
Thanks again!
from foldseek.
@bordin89 thank you so much for sharing the data. I ran foldseek with your example but it did not crash. Could you please try the newest version. I recently fixed a createdb bug, which might have cause the issue.
from foldseek.
Does this issue still occur?
from foldseek.
No, the latest release of Foldseek seems to have fixed the issue. Thanks for the follow-up!
from foldseek.
Related Issues (20)
- Download pre-generated afdb.tar.gz HOT 4
- How is the PDB100 database prepared? HOT 1
- Can I search for similar protein structures to those in my query database and pick out representative based on clustering?
- Unable to cluster large set of similar proteins HOT 7
- TM-score reported in the HTML is different from the one in the tabular output
- What exactly does each class in the foldseek_cluster result represent? HOT 1
- foldseek failed to search/process Pymol generated files HOT 2
- What lndb is doing ? HOT 2
- Foldseek TMAlign option alntmscore inconsistent with tmalign server results.
- Possibility to foldseek against only a PDB-chain HOT 1
- Closest neighbour to virtual centre
- createdb extracts (0,0,0) coordinates for residues that don't have C-alphas
- Questions on effects and usage of --num-iterations options HOT 2
- --cluster-search 1 throws error HOT 2
- How is the PDB100 database prepared? HOT 1
- foldseek easy-cluster get error by conda install foldseek
- Get the Latent representations only HOT 1
- Inquiry on Establishing Correspondence between 3Di Structure and Amino Acid Sequences in the Database HOT 2
- How does the process to calculate alnTMscore
- python3 API HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from foldseek.