Comments (3)
Hi @yuqiu77 ,
Please add the --dun_use_group_count
option and this error should go away.
THanks,
--Liz
from cdna_cupcake.
Could I use 'filter_by_count.py' for post-collapsing reads of FLNC reads (ISO-seq3)?
According to the Wiki, I need to an output of 'get_abundance_post_collapse.py' to run 'filter_by_count.py'. However, since I use FLNC reads instead of clustered reads, I can't run 'get_abundance_post_collapse.py' due to the absence of 'cluster_report.csv', which is the output of the clustering step.
I want to filter out singleton FLNC reads as ISO-seq3 clustering does
from cdna_cupcake.
Hi @juheon ,
You can make a "mock" abundance.txt file for the FLNC collapsed result using the Linux commands below.
Suppose your collapsed output prefix is out.collapsed.xxxx. You need to generate the file output.collapsed.abundance.txt
.
echo -e "pbid\tcount_fl" > output.collapsed.abundance.txt
paste <(awk '{print $1}' output.collapsed.group.txt) <(awk -F , '{print NF}' output.collapsed.group.txt ) >> output.collapsed.abundance.txt
This will generate an output that has the format:
pbid count_fl
PB.1.1 1
PB.1.2 2
PB.1.3 2
PB.2.1 26
PB.2.2 1
PB.2.3 37
PB.2.4 2
which then allows you to run:
filter_by_count.py output.collapsed --dun_use_group_count
from cdna_cupcake.
Related Issues (20)
- Question: where does the count.txt file come from?
- cluster id mismatch issue in get_abundance_post_collapse.py HOT 3
- chain_samples.py does not work!
- cupcake2 directory missing, compilation fails HOT 4
- Create sam format for collapse_isoforms_by_sam.py
- lima for skera.bam
- fusion_collate_info.py script has bugs and its not working.
- TypeError: iter_gmap_sam() got an unexpected keyword argument 'type' using collapse_isoforms_by_sam.py
- run_preCluster.py HOT 1
- setup.py requires sklearn instead of scikit-learn HOT 1
- fusion_collate_info.py KeyError: 'count_fl' HOT 3
- Cython compiler error when installing/building cDNA_cupcake HOT 2
- solved
- Saturation analysis bug fixes; compatibility with newer isoseq cluster [make_file_for_subsampling_from_collapsed.py]
- AttributeError: module 'numpy' has no attribute 'int'. HOT 2
- ModuleNotFoundError: No module named 'cupcake.tofu' after installing cdna_cupcake in conda subenv HOT 3
- fusion_finder.py HOT 1
- Demultiplex after clustering / genome alignment
- cdna_cupcake与SQANTI3
- collapse_isoforms_by_sam.py out of memory
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cdna_cupcake.