Comments (5)
Hi Chris,
Ideally yes, the genetic search tools are very IO intensive, hence having an SSD helps.
For more details see e.g. the HH Suite wiki that discusses HHBlits performance: https://github.com/soedinglab/hh-suite/wiki#running-hhblits-efficiently-on-a-computer-cluster
from alphafold.
Thanks. Is a persistent SSD sufficient, or did you all use a set of local SSD? Table of comparison:
https://cloud.google.com/compute/docs/disks/performance#type_comparison
If local SSD, was NVME or SCSI used? (I'm not sure which drivers are even available on nvidia-gpu-cloud-image
)
from alphafold.
I recommend using the fastest possible option, but the sequence search should work in either case – it is just a matter of speed.
from alphafold.
From an engineering perspective, speed needs to be weighed against other factors. The fastest SSDs on GCP are the local SSDs, but they are harder to use since data is not persisted and multiple volumes need to be managed (they have a max size of 375GB).
If the local SSDs give only a small bump in performance, they would not be worth it compared to persistent SSDs. If there is a relative benchmark, that will be useful to people. We may test it at some point and report back, unless someone beats us to it.
from alphafold.
I am considering which storage device option to buy for my own workstation to store the databases.
I understand that the genetic search tool will be very I/O intensive, leaving me undecided: I have the possibility of taking a 4TB NVMe PCI Express 4.0 x4 in my setup, but those are considerably more expensive than a 4TB or 8TB SSD SATA 6Gb/s.
Did anyone see justifiable performance difference between those two types of devices?
from alphafold.
Related Issues (20)
- Linux ARM64 an officially supported platform ?
- SymbolAlreadyExposedError HOT 16
- Error On Section 4, Symbol Zeros is already exposed as () HOT 5
- HOW to change MSA HOT 3
- Notebook - Cell 4 "Making Prediction" - SymbolAlreadyExposedError HOT 4
- SymbolAlreadyExposedError: Symbol Ones is already exposed as () HOT 1
- Resource not found: pdb70_from_mmcif_200401.tar.gz HOT 1
- Install third-party software stalls at 17% HOT 1
- AlphaFold Stuck at hhblits Step on Cluster Compute Node
- AlphaFold Colab crash at 'Search genetic databases' stage: Keras Zeros initializer fault... HOT 2
- alphafold outputs only have msas HOT 4
- ImportError: cannot import name 'SCOPData' from 'Bio.Data' HOT 5
- docker.errors.APIError: 500 Server Error for http+docker://localhost/v1.43/containers/ee2565cd6294bdf23537e9fb81814d89cb240f8c02b23f24a0926e3c66b44aea/start: Internal Server Error ("could not select device driver "nvidia" with capabilities: [[gpu]]") HOT 2
- Can I debug the code in the pycharm? HOT 1
- HHblits query is running time too long! HOT 3
- Missing function in docker build
- RMSD95 Definition
- Calculation of ipTM score for multimers greater than a dimer
- Attention docstring missing head dimension for arguments mask and nonbatched_bias
- HMMER MSAs aren't saved and repeated when running with --use_precomputed_msas=True HOT 10
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from alphafold.