GithubHelp home page GithubHelp logo

Comments (3)

eead-csic-compbio avatar eead-csic-compbio commented on May 29, 2024

Hi Alex,
thanks for your feedback. It seems you are running get_homologues with the default two cores, right? How many CPU cores can you take? If your Linux box or laptop has more cores, say 8, you can allocate them with -n 8. This would speed up the analysis, as you are doing 62x62 BLAST searches. You can speed it up further with -X to call DIAMOND instead of BLASTP. Of course, you would have to update the software to the current version, which I strongly recomend as you are missing a few bug fixes.

Now, coming back to your job, it looks like BLAST was completed correctly from your output, but there are issues with homology calling with OMCL. So I would remove your _homologues/tmp folder an re-run again your command with the update software. Keep an eye on the size of the BLAST outfiles, when they are being sorted, as that will tell you whether some of the jobs failed. If you have further issues let us know, cheers, Bruno

from get_homologues.

alexweisberg avatar alexweisberg commented on May 29, 2024

Hi Bruno,
Thank you for the response. I have previously successfully run get_homologues on this dataset to completion with a typical analysis, however the problem occurs when I want to do a genome composition analysis (add option "-c"). I deleted the tmp folder as you suggested and am re-running it now.

As an aside, is there any way to do the genome composition analysis using a previous run? I have a get_homologues folder of MCL clustered sequences. Is it possible to parse the output from the previous run and do the genome composition analysis without having to redo the blast searches and clustering?

from get_homologues.

eead-csic-compbio avatar eead-csic-compbio commented on May 29, 2024

Hi Alex,
I just went back to test that release (get_homologues-x86_64-20160413) and found out that indeed that version printed all those "ARRAY(...)" when it shouldn't. So please update your software and re-run on the same folders, so that you don't have to run BLAST again, ok?
So, getting to your second question. Yes you can perform a genome composition analysis (-c) using a previous run provided that you kept the *_homologues folder. That folder should contain old BLAST results which can be recycled provided that you are using version > v.2.0.1.
Cheers, Bruno

from get_homologues.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.