Comments (3)
Sorry for the confusion @slowkow . The partition has to be in the set N choose 2.
I will fix this limitation in the upcoming release
from tomahawk.
Thank you!
I don't think this is a limitation, but it would be nice to add to the documentation so users can split work into jobs small enough for their computing environments.
Running on the Broad servers with the Sun Grid Engine job scheduler, I needed to split into 40186 jobs to get the jobs queued up quickly. Each job takes about 40 minutes or so, and most often less than 1G memory.
I brute forced my way to two numbers that work for me and split the work into a good number of jobs:
tomahawk calc -c 10012 <...>
tomahawk calc -c 40186 <...>
I guess I should be able to use R to find other numbers now (hooray!):
> choose(142, 2) + 1
10012
> choose(284, 2)
40186
I'm not sure why 10012 worked, because it should be 10011... maybe that was my mistake.
from tomahawk.
That's cool. Just keep an eye on the amount of output data you generate. I've calculated genome-wide LD for 1000GP, HRC, and HGDP and the compressed and sorted outputs are ~800 GB, ~100 GB, and ~300 GB, respectively. These data will be available together with the upcoming publication.
I will look into why 142!2 + 1
worked. That is unexpected.
from tomahawk.
Related Issues (16)
- segmentation fault HOT 4
- how to extract TWO binary for region HOT 2
- Clumping and pairwise LD for specified SNP lists HOT 2
- tomahawk import does not generate a twk file HOT 3
- Tutorial for converting two data into HiGlass-compatible formats
- core dump error on import HOT 20
- LTO and gcc version <= 4.9.2
- Unphased Math Reference
- Tomahawk crash with "std::regex_error" error HOT 2
- compilation issue HOT 3
- Why import parameter filtered all my SNPs
- tomahawk fails to import from bcf: Assertion failed: (ref == 0 || ref == 1 || ref == 4 || ref == 5)
- Error while loading shared libraries libhts.so.2
- Another core dump error on import
- Legacy support for < SSE4.2 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tomahawk.