Comments (4)
In lastz, if the --segments option is used, and the input sequences are NOT accessible in random order, the items in the segments file must be consistent with the order in which they 'will be needed'. For most users, "accessible in random order" means YES for twoBit, NO for fasta.
For fasta this should just mean segment names are in the same order as in the fasta file, and all + segments for a given sequence appear before all - segments.
There's some detail here (starting with the paragraph that begins with "Query sequence names must appear in the same order as ..."):
http://www.bx.psu.edu/~rsharris/lastz/README.lastz-1.04.03.html#fmt_segments
That implementation choice seemed more important 15 yrs ago that it does now. Now it would make more sense to read all the segments into memory, sort them by sequence name (to facilitate binary search), and use the appropriate segments as each new fasta sequence arrives.
from segalign.
@glennhickey the error is definitely in one of the lastz jobs that Bob has pointed out. Ideally, SegAlign divides and sorts the segments in a way to take care of this requirement. I can look into it. Could you share the input files?
from segalign.
I can confirm that this error doesn't happen with this older commit ComparativeGenomicsToolkit@1d2d38c
To reproduce:
- install cactus binaries
- make a
config.xml
as described here (I've been meaning to add a command line but haven't gotten around to it yet) - copy and unzip the input files from
/public/home/hickey/dev/work/gpu-lastz
on courtyard - make sure segalign's in your path
- run
cactus-blast ./jobstore 10mammalsplus.txt Anc05.cigar --root Anc05 --pathOverrides panTro6.fa.pp equCab3.fa.pp hg38.fa.pp canFam3.fa.pp felCat8.fa.pp --pathOverrideNames Chimp Horse Human Dog Cat --realTimeLogging --logInfo --retryCount 0 --maxCores 64 --cleanWorkDir never --configFile ./config.xml
from segalign.
@glennhickey I have fixed the error in the latest commit. Thanks!
from segalign.
Related Issues (20)
- couldn't find boost HOT 7
- Running on multi-fasta HOT 2
- "grep: *.err" and "m: cannot remove '*.segments'" errors HOT 2
- SegAlign/progressivecactus errors on LSF HOT 1
- run_segalign_repeat_masker file HOT 2
- cudaErrorIllegalAddress: an illegal memory access was encountered HOT 1
- stdbuf: failed to run command ‘segalign’: No such file or directory HOT 2
- error during cmake HOT 1
- segalign_repeat_masker crashes HOT 3
- segaling_repeat_masker still crashes HOT 7
- segalign crashes while aligning final against final reference block HOT 1
- run_segalign crashes on human-chimp (and exits 0!) HOT 2
- SegAlign crashes while running cactus on Terra HOT 3
- thrust::system::system_error | CUDA free failed: cudaErrorCudartUnloading
- Error: cudaMalloc of 256 bytes for sub_mat failed with error " the provided PTX was compiled with an unsupported toolchain. " HOT 1
- /usr/local/bin/run_segalign: line 60: segalign: command not found HOT 1
- AMD GPU support
- Output file size not correct HOT 1
- Support for linux-ppc64le?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from segalign.