Comments (3)
Hi Alex,
thanks for your feedback. It seems you are running get_homologues with the default two cores, right? How many CPU cores can you take? If your Linux box or laptop has more cores, say 8, you can allocate them with -n 8. This would speed up the analysis, as you are doing 62x62 BLAST searches. You can speed it up further with -X to call DIAMOND instead of BLASTP. Of course, you would have to update the software to the current version, which I strongly recomend as you are missing a few bug fixes.
Now, coming back to your job, it looks like BLAST was completed correctly from your output, but there are issues with homology calling with OMCL. So I would remove your _homologues/tmp folder an re-run again your command with the update software. Keep an eye on the size of the BLAST outfiles, when they are being sorted, as that will tell you whether some of the jobs failed. If you have further issues let us know, cheers, Bruno
from get_homologues.
Hi Bruno,
Thank you for the response. I have previously successfully run get_homologues on this dataset to completion with a typical analysis, however the problem occurs when I want to do a genome composition analysis (add option "-c"). I deleted the tmp folder as you suggested and am re-running it now.
As an aside, is there any way to do the genome composition analysis using a previous run? I have a get_homologues folder of MCL clustered sequences. Is it possible to parse the output from the previous run and do the genome composition analysis without having to redo the blast searches and clustering?
from get_homologues.
Hi Alex,
I just went back to test that release (get_homologues-x86_64-20160413) and found out that indeed that version printed all those "ARRAY(...)" when it shouldn't. So please update your software and re-run on the same folders, so that you don't have to run BLAST again, ok?
So, getting to your second question. Yes you can perform a genome composition analysis (-c) using a previous run provided that you kept the *_homologues folder. That folder should contain old BLAST results which can be recycled provided that you are using version > v.2.0.1.
Cheers, Bruno
from get_homologues.
Related Issues (20)
- Invalid protein cluster files generated by get_homologues-est when .faa and .fna have genes in different order. HOT 4
- Error with FNA and FAA sequence order HOT 6
- File Output? HOT 3
- Pangenome question HOT 1
- Error: find_COGs HOT 6
- COGreadblast produce empty results HOT 2
- Question: Is there a way to identify which genomes correspond to core-genome sample numbers? HOT 10
- Unable to create venn intersection BDBH, COGS, OMCL HOT 14
- Removing input files: do I need re-run all the analysis? Or there's another way? HOT 2
- Error in Get_homologues analysis HOT 1
- Error in input genbank files HOT 1
- Reporting lineage specific genes? HOT 1
- ANI svg HOT 3
- Issue with Input files not being detected by Get_homologues HOT 6
- maybe a bug in annotate_cluster.pl HOT 2
- Phylogenetic tree not showing strain name HOT 1
- Phylogeny Question HOT 1
- core genes list annotations HOT 3
- Generating ANI svg is broken HOT 3
- Problem generating alignment file while using annotate_cluster.pl HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from get_homologues.