Comments (2)
Hi Pierre-Emmanuel, thank you for trying out this pipeline! For your study, I recommend not using scan on the whole genome, as it was not implemented to handle very large chunks of sequence. By "large chunk" I mean regions at the Mb scale.
My suggestion is to run nhmmer on the whole genome first, then use construct to build a profile-HMM on the candidate regions identified by nhmmer, which should be at the Kb scale. Then you can use scan on those regions to find monomers.
If you have some prior knowledge of the monomer of interest, such as a consensus sequence, you can use hmmbuild in the HMMER suite with the option --singlemx to construct a profile-HMM. This profile-HMM can be used by nhmmer to quickly identify candidate monomer locations in the whole genome. Then you can use the scripts in pipeline for monomer identification in these candidate regions.
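As a rough illustration of the two HMMER calls described above, here is a minimal Python sketch that only assembles the command lines. The file names (monomer_consensus.fa, monomer.hmm, genome.fa, hits.tbl) are placeholders, and the --dna, --tblout and --cpu options are my own additions rather than something the pipeline requires; only --singlemx comes from the advice above.

```python
import shlex


def hmmbuild_command(consensus_fasta, hmm_out):
    """Command line to build a profile-HMM from a single consensus sequence."""
    # --singlemx is the option named in the comment above.
    # --dna is an assumption: it forces the nucleotide alphabet instead of
    # relying on autodetection for a short consensus sequence.
    return ["hmmbuild", "--dna", "--singlemx", str(hmm_out), str(consensus_fasta)]


def nhmmer_command(hmm_file, genome_fasta, tblout, cpus=4):
    """Command line to search the whole genome with the profile-HMM."""
    # --tblout writes one hit per line in a whitespace-delimited table,
    # which is convenient for extracting candidate regions afterwards.
    return [
        "nhmmer",
        "--tblout", str(tblout),
        "--cpu", str(cpus),
        str(hmm_file),
        str(genome_fasta),
    ]


if __name__ == "__main__":
    # Placeholder file names; print the commands instead of running them.
    print(shlex.join(hmmbuild_command("monomer_consensus.fa", "monomer.hmm")))
    print(shlex.join(nhmmer_command("monomer.hmm", "genome.fa", "hits.tbl")))
```

To actually run them, each list can be passed to subprocess.run(cmd, check=True), provided HMMER is installed and on the PATH.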
The instructions in this Readme file are written for this scenario. Once you have the candidate regions produced by nhmmer, you can follow Steps 3 and 4 to generate a refined profile-HMM, which can be used for sequence classification.
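One possible way to turn nhmmer hits into Kb-scale candidate regions is to merge hits that lie close together on the same sequence. The sketch below is not part of the pipeline; it assumes the hits were written with nhmmer's --tblout option, and the max_gap and max_evalue thresholds are arbitrary illustrative values that would need tuning for a real genome.

```python
from collections import namedtuple

Hit = namedtuple("Hit", "target start end strand evalue")


def parse_tblout(lines):
    """Parse nhmmer --tblout lines into Hit records."""
    hits = []
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and header/footer comments
        f = line.split()
        # Columns used: 0 = target name, 6 = alifrom, 7 = ali to,
        # 11 = strand, 12 = E-value.
        a, b = int(f[6]), int(f[7])
        # On the minus strand alifrom > ali to, so normalise to start <= end.
        hits.append(Hit(f[0], min(a, b), max(a, b), f[11], float(f[12])))
    return hits


def merge_hits(hits, max_gap=500, max_evalue=1e-5):
    """Merge significant hits on the same target that lie within max_gap bp.

    Both thresholds are illustrative assumptions. Hits on both strands are
    merged together, since the goal is only to delimit candidate regions.
    """
    kept = sorted(
        (h for h in hits if h.evalue <= max_evalue),
        key=lambda h: (h.target, h.start),
    )
    regions = []
    for h in kept:
        last = regions[-1] if regions else None
        if last and last[0] == h.target and h.start - last[2] <= max_gap:
            last[2] = max(last[2], h.end)  # extend the current region
        else:
            regions.append([h.target, h.start, h.end])
    return [tuple(r) for r in regions]
```

Each resulting (target, start, end) tuple can then be used to extract the corresponding subsequence from the genome before running construct and scan on it.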
Hope this helps and please let us know if you have any questions!
from monomerannotation.
Thank you very much for the fast reply. The process is much clearer to me now.
I'm currently running nhmmer. As soon as it's finished I will use construct and scan on the candidate regions.
Thanks again and also thanks to Andrew.
Have a nice day.