Comments (4)
Hi,
Please try the latest Cupcake version that i just pushed in. It should recognize the Iso-Seq3 ID formats now. If not, let me know and I can help make sure the formats work.
--Liz
from cdna_cupcake.
Hello Liz,
Sorry for the late reply but thank you so much for updating the scripts. The formats worked, and the abundance.txt and group.stat.txt outputs are as expected :)
However, during the run, I received loads of warnings "WARNING: isoseq3 format detected. Output length
column will be NA
." (specifically, 179,600 of these lines, with no other information).
I don't recall this happening with Isoseq2(beta) on smrtlink. So just wanted to check with you if this is something that I should be expecting with Isoseq3, and what are reads that are giving NA length.
Thank you,
Szi Kay
from cdna_cupcake.
Hi @SziKayLeung ,
That warning was intentionally added in the last Cupcake update. This is because older versions of the FL CCS id format allowed extraction of sequence lengths. The ID format in Iso-Seq1 and 2 used to be <movie>/<zmw>/<start>_<end>_CCS
but now in Iso-Seq 3 it is <movie>/<zmw>/ccs
and the length information is not contained in the ID. It's a minor matter but to kept compatibility I'm still outputting the length
field, but you'll find them NA
when running on Iso-Seq 3 runs.
--Liz
from cdna_cupcake.
Thank you @Magdoll for clarifying!
from cdna_cupcake.
Related Issues (20)
- Question: where does the count.txt file come from?
- cluster id mismatch issue in get_abundance_post_collapse.py HOT 3
- chain_samples.py does not work!
- cupcake2 directory missing, compilation fails HOT 4
- Create sam format for collapse_isoforms_by_sam.py
- lima for skera.bam
- fusion_collate_info.py script has bugs and its not working.
- TypeError: iter_gmap_sam() got an unexpected keyword argument 'type' using collapse_isoforms_by_sam.py
- run_preCluster.py HOT 1
- setup.py requires sklearn instead of scikit-learn HOT 1
- fusion_collate_info.py KeyError: 'count_fl' HOT 3
- Cython compiler error when installing/building cDNA_cupcake HOT 2
- solved
- Saturation analysis bug fixes; compatibility with newer isoseq cluster [make_file_for_subsampling_from_collapsed.py]
- AttributeError: module 'numpy' has no attribute 'int'. HOT 2
- ModuleNotFoundError: No module named 'cupcake.tofu' after installing cdna_cupcake in conda subenv HOT 3
- fusion_finder.py HOT 1
- Demultiplex after clustering / genome alignment
- cdna_cupcake与SQANTI3
- collapse_isoforms_by_sam.py out of memory
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cdna_cupcake.