
Comments (2)

mengzhou commented on July 18, 2024

Hi Pierre-Emmanuel, thank you for trying out this pipeline! For your study, I recommend not running scan on the whole genome, as it was not implemented to handle very large chunks of sequence. By "large chunks" I mean regions at the Mb scale.

My suggestion is to run nhmmer on the whole genome first, then use construct to build a profile-HMM on the candidate regions that nhmmer identifies, which should be at the Kb scale. You can then run scan on those regions to find monomers.

If you have some prior knowledge of the monomer of interest, such as a consensus sequence, you can use hmmbuild from the HMMER suite with the --singlemx option to construct a profile-HMM. nhmmer can then use this profile-HMM to quickly identify candidate monomer locations in the whole genome. After that, you can use the scripts in pipeline for monomer identification in these candidate regions.
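For readers who want to script this step, here is a minimal sketch of the two HMMER commands described above, assembled as argument lists in Python. The file names (`consensus.fa`, `genome.fa`, `monomer.hmm`, `hits.tbl`), the CPU count, and the helper name `build_commands` are hypothetical placeholders and not part of the pipeline. `--dna` is added on the assumption that the consensus is a DNA sequence.

```python
import shlex


def build_commands(consensus_fasta, genome_fasta,
                   hmm_out="monomer.hmm", tbl_out="hits.tbl", cpus=4):
    """Return (hmmbuild_cmd, nhmmer_cmd) as argument lists.

    All file names are hypothetical placeholders; adjust them to your data.
    """
    # Build a profile-HMM from a single consensus sequence.
    # --singlemx uses a substitution score matrix for single-sequence input.
    hmmbuild_cmd = ["hmmbuild", "--dna", "--singlemx", hmm_out, consensus_fasta]

    # Search the whole genome with the profile-HMM and write a tabular
    # per-hit summary to tbl_out; the verbose report is discarded.
    nhmmer_cmd = ["nhmmer", "--cpu", str(cpus), "--tblout", tbl_out,
                  "-o", "/dev/null", hmm_out, genome_fasta]
    return hmmbuild_cmd, nhmmer_cmd


if __name__ == "__main__":
    for cmd in build_commands("consensus.fa", "genome.fa"):
        # Print the shell-quoted command line for inspection.
        print(shlex.join(cmd))
```

Each list can be passed to `subprocess.run(cmd, check=True)` once HMMER is installed and on the `PATH`. Printing the commands first makes it easy to check them before starting a whole-genome search.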

The instructions in this Readme file are written for this scenario. Once you have the candidate regions produced by nhmmer, you can follow Steps 3 and 4 to generate a refined profile-HMM, which can be used for sequence classification.
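One way to turn nhmmer hits into Kb-scale candidate regions is to merge hits that lie close together on the same sequence. The sketch below is an illustration and not part of the pipeline. It assumes the standard `--tblout` column layout of HMMER 3.x (target name first, envelope coordinates in columns 9 and 10), a hypothetical merge distance `max_gap`, and that hits on both strands should fall into the same region. Check the header of your own table before relying on the column positions.

```python
from collections import defaultdict


def parse_tblout(lines):
    """Yield (target, start, end) per nhmmer --tblout hit, with start <= end."""
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment/header lines
        fields = line.split()
        target = fields[0]
        env_from, env_to = int(fields[8]), int(fields[9])
        # Minus-strand hits are reported with from > to; normalize the order.
        yield target, min(env_from, env_to), max(env_from, env_to)


def merge_hits(hits, max_gap=1000):
    """Merge hits on the same target that are at most max_gap bases apart.

    max_gap is an assumed tuning parameter, not a value from the pipeline.
    """
    by_target = defaultdict(list)
    for target, start, end in hits:
        by_target[target].append((start, end))

    regions = []
    for target in sorted(by_target):
        intervals = sorted(by_target[target])
        cur_start, cur_end = intervals[0]
        for start, end in intervals[1:]:
            if start - cur_end <= max_gap:
                cur_end = max(cur_end, end)  # extend the current region
            else:
                regions.append((target, cur_start, cur_end))
                cur_start, cur_end = start, end
        regions.append((target, cur_start, cur_end))
    return regions
```

With a table file on disk, `merge_hits(parse_tblout(open("hits.tbl")))` returns a list of `(sequence, start, end)` tuples, which can then be extracted from the genome and passed to construct and scan.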

Hope this helps and please let us know if you have any questions!

from monomerannotation.

pebonte commented on July 18, 2024

Thank you very much for the fast reply. The process is much clearer to me now.

I'm currently running nhmmer. As soon as it's finished, I will use construct and scan on the candidate regions.

Thanks again and also thanks to Andrew.

Have a nice day.

