Comments (9)
from edta.
Hi, some steps are not paralized
I see, therefore the divide and conquer. In fact looking at the progress now, with TIRs it looks exactly the same. Perhaps it would be good to mention the single threaded bottlenecks in the readme, can I make a pull request?
Besides that, I do have ~1Mbp of LTRs annotated now :-).
PiRATE
If I will get to bench-marking of TE annotation pipeline, I will certainly take a look.
from edta.
from edta.
I am running the whole EDTA pipeline, and once per while I peek at the CPU usage on the server and everytime time I take a look it's running a single core. As on the following screen shot
It seems that the script does spawn the threads, but they do not seem to run in parallel. I know that now the TIR annotation is using different programs, but the behavior seems to be the same for both - multiple threads spawned, but only single core used.
I mean this as a report of suspicious behaviour. If you think it's not a problem, feel free to ignore this issue.
I appreciate the effort of putting the pipeline together. Thank you for that.
from edta.
from edta.
Alright, then it's probably not a problem, I will close the issue for now/
I think rather simple test to verify the multicore utility would be to run exactly the same job with 1/2/4/8 cores a measure the speedup. I will explore and get back to you if I find anything.
from edta.
from edta.
Is it important to have a full genome? Would one drosophilla chromosome do the trick for efficiency benchmarking?
(that's how I intended to test it, just on a genome subsample)
from edta.
You need a big enough sample to collect enough instances of different TEs, otherwise the outcome will not be representing the whole genome. Subsample of >200 Mb should be enough, but that's bigger than the Ath genome already.
from edta.
Related Issues (20)
- make_masked.pl Permission denied HOT 1
- ERROR: Intact TE annotation not found in Aros.genome.fa.mod.EDTA.intact.gff3 at EDTA.pl line 638. HOT 1
- The inconsistency between the classification labels of the EDTA library and RepeatMasker HOT 1
- Combining RM rows HOT 5
- fa file HOT 1
- ERROR: TE annotation stats results not found in mod.EDTA.TE.fa.stat! HOT 1
- dependencies not found in singularity install HOT 4
- Would you consider adding -w $genome_file_real_path.EDTA.raw/TIR to line 576 of EDTA_raw.pl HOT 1
- Can't locate SearchResult.pm in @INC (you may need to install the SearchResult module) HOT 3
- Hey!Have you solved your problem? I had the same problem. HOT 1
- I just found that this script in the (../../share/RepeatMasker/) folder will not have this error, maybe can copy the input file, I think it can try?If you tried, can you tell me the result? HOT 3
- TIR not found? HOT 1
- 文件缺失 HOT 1
- Stuck by BLAST in LTR finding HOT 2
- PanEDTA test output
- [No LINE, EDTA 2.2.0] Empty LINE file after RM2
- LINE and SINE results files has 0 bp!
- ERROR: TE annotation stats results not found in B.purpurea.fasta.mod.EDTA.TE.fa.stat! HOT 1
- '调用失败' HOT 8
- Statistical genome size
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from edta.