Comments (6)
Just to clarify, is this an issue with the ending of the d1d1?
This is what I understand is happening:
There are two possible endings of d1d1; the correct one is the second, but the script is picking up the first.
Should it ALWAYS pick the last one?
@callmcgovern @phycologia
from cims-cyanobacterial-its-motif-slicer.
@nlabrad I'm not sure, because I guess it depends on where else the supposed ending motif comes up. I encountered the same sort of thing (a possible D1 ending and then another possible ending about 10 bp further along) when I was first annotating the GARN sequences. Supposedly the base right after the last D1 base will be a G; that's how I decided to go with the second possible ending in my case.
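For illustration, the "next base should be a G" check described above could be sketched roughly like this. This is not the script's actual code; the function name and the idea of passing in a list of candidate end positions are assumptions made for the example.

```python
def ends_followed_by_g(sequence, candidate_ends):
    """Keep only candidate D1 end positions whose following base is a G.

    Hypothetical helper: candidate_ends holds 0-based indices of the
    last base of each possible D1 ending found by some earlier search.
    """
    seq = sequence.upper()
    return [
        end
        for end in candidate_ends
        # Skip candidates at the very end of the sequence (no next base).
        if end + 1 < len(seq) and seq[end + 1] == "G"
    ]
```

A rule like this would only narrow the candidates; as noted above, it depends on where else the ending motif comes up, so it may still leave more than one option or none.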
Yeah, when two possible endings were close together I usually had to go with the second one. I don't want to say this is always the case, because I'm sure there are exceptions, but it seemed that when the D1D1 strand was a little shorter, the second D1D1 ending lengthened the sequence and gave the right structure.
I think we have a proposed solution in the works.
Now it finds all the possible d1d1 endings and presents them to the user to do with as they wish. There is no interactivity yet, like "hey, we found 2, which one do you want?"; I don't know if that's something we want. @callmcgovern? I figure you don't really know which one you need until you fold them, so the user wouldn't know anyway.
The new version is under the "new algorithm" branch.
@aimeee05 I think this has been fixed. It now searches for all the possible d1d1 endings within the first 150 bases. This limit is hardcoded; we chose it because that's what I was told to use lol, but if it needs to be different we can change it.
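As a rough sketch of the behaviour described here (collect every possible ending inside a fixed window instead of stopping at the first), something like the following would work. It is not the code on the branch: the function name, the `end_motif` parameter, and the assumption that the input sequence already starts at the beginning of d1d1 are all made up for the example. Only the 150-base limit comes from this thread.

```python
import re

SEARCH_LIMIT = 150  # hardcoded window size mentioned in this thread


def find_d1d1_candidates(sequence, end_motif, limit=SEARCH_LIMIT):
    """Return every candidate d1d1 slice ending at an occurrence of end_motif.

    Assumptions (hypothetical, for illustration only):
    - sequence starts at the first base of d1d1
    - end_motif is whatever ending motif the caller is looking for
    """
    window = sequence.upper()[:limit]
    # A zero-width lookahead finds overlapping occurrences as well.
    pattern = re.compile(f"(?={re.escape(end_motif.upper())})")
    candidates = []
    for match in pattern.finditer(window):
        end = match.start() + len(end_motif)
        candidates.append(window[:end])
    return candidates
```

Each returned slice is one possible d1d1, shortest first, so the user can fold them all and pick the one that gives the right structure. Making `limit` a parameter would let the 150-base window be changed without editing the code.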