GithubHelp home page GithubHelp logo

biocore / burrito-fillings Goto Github PK

View Code? Open in Web Editor NEW
7.0 7.0 11.0 516 KB

Application controllers for command line bioinformatics applications

License: BSD 3-Clause "New" or "Revised" License

Python 100.00%

burrito-fillings's People

Contributors

antgonza avatar ekopylova avatar eldeveloper avatar gregcaporaso avatar jairideout avatar josenavas avatar kylebittinger avatar rnaer avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

burrito-fillings's Issues

improve uclust application controller error reporting

ported from biocore/qiime#1340

When running pick_otus.py with uclust, users often report the following error message on the forum:

Error running uclust. Possible causes are unsupported version (current supported version is v1.2.22) is installed or improperly formatted input file was provided

This error can occur even when uclust is correctly installed and a properly formatted input file is provided (e.g., this has occurred with the Illumina tutorial when running [qiime's] pick_open_reference_otus.py). The uclust application controller needs to be updated to report the original exit code and error message / output of running uclust to aid in debugging issues.

brokit.get_clusters_from_fasta_filepath failing to remove output files

@wasade @gregcaporaso

I need to make uclust calls a large number of times in my script, and I get 'OSError: Too many open files' when the dataset size exceeds a certain limit, when I am using brokit.get_clusters_from_fasta_filepath function. However, when I use qiime_system_call instead to call uclust, I don't get an error. I think get_clusters_from_fasta_filepath may not be removing some output files, and I am not sure whether this is related to scikit-bio or brokit. I tried changing suppress_stdout of Uclust class to true, but that did not solve the problem.

Thanks!

Which application controllers do we want to maintain?

Currently, the applications controllers used in QIIME are:

  • blast
  • blat
  • bwa
  • cd_hit
  • clearcut
  • clustalw
  • fastq_join
  • fasttree
  • fasttree_v1
  • formatdb
  • infernal
  • mafft
  • mothur
  • muscle_v38
  • parsinsert
  • pplacer
  • raxml_v730
  • rdp_classifier
  • rtax
  • seqprep
  • uclust
  • usearch

Feel free to add any application controller that I missed

update to swarm 2

The new version of swarm is out and it's pretty different from 1.2.7. This algorithm will be relevant to qiime 2 and EMP.

I'm interested in taking this one, although I'll probably need support.

clean-up usage of "/tmp" for fillings most frequently used by QIIME

Ultimately we'll do this for everything, but focusing on what is most needed for QIIME right now. Application controllers corresponding to the the boxes that are checked off here have been updated (if updates were necessary) and the unit tests have passed locally on @gregcaporaso's OS X machine.

  • vsearch
  • usearch
  • uclust
  • rdp
  • mothur (#61)
  • sortmerna
  • sumaclust (haven't tested locally, but didn't need any changes)
  • swarm
  • fasttree
  • muscle
  • blast

Related to biocore/burrito#8. Many of the base classes depend on biocore/burrito#9 to not use /tmp.

change name of brokit and do a 0.1.0 release

We're going to need to do a release of brokit before QIIME 1.9.0 so QIIME has a release version to depend on.

I think we should change the name at that time to to be something more descriptive. One idea for a new name: burrito-fillings (since it contains burrito derived classes).

Thoughts on this? Ultimately this repo could be the place where we store our application controller derived classes, though we still do need to develop our testing strategy for these.

brokit-wrapped applications that are not currently installed in the QIIME VM

related to #19 (see comment from @josenavas)

  • usearch (but we can't install that)
  • clustalw (1.83, which is what cogent wraps, is available from here)
  • mafft (see qiime-deploy-conf's #36) - I can't find the wrapped version (6.602), and a slightly later version (6.815b, installed from .deb) fails the brokit unit tests; it turns out that I do have the PyCogent-wrapped version installed on compy though.
  • FastTree v1 (do we need this?) available here

brokit changes to help remove cogent dependency from qiime

See QIIME's #1502. if we could remove the requirement that moltype be passed into these functions, we'll be able to remove QIIME's dependency on the cogent DNA moltype. Even if we just had wrappers for these that we could call from QIIME that passed the DNA moltype, that would be useful (though ugly on the brokit side).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.