Comments (7)
Think I figured it out. It looks like the basename extracted from the diamond output file will include the original extension, while the gene counts are stored in the filename without the extension. Running GUNC using gene calls without an extension (e.g. proteins
) works fine.
from gunc.
Yeah i just saw that too.. i can only apologise for that. I will release a new version with a fix as soon as i can, at least for now you could run without an extension. Thanks for pointing it out though.. i dont know how that got through testing..
from gunc.
No worries, happens to all of us.
from gunc.
I just released v1.0.3 which should fix this issue.. it is taking a while to appear on conda but once its there this issue should be solved!
if you notice any other issues do let me know.. thanks again.
from gunc.
Hi @fullama,
I am reviving this issue because I could reproduce the same error as @jdwinkler-lanzatech in GUNC v.1.0.5.
For externally generated gene calls to work, one must still indicate the (.faa
) FASTA file without the extension. This can be problematic for workflow management system based on the filenames to infer dependencies (e.g. Snakemake).
As far as I have understood, there are two possibilities of fix:
A. keeping the prefix (.faa
) in the genes_counts json file (and modify here)
B. Adding a specific condition (if gene_calls) to the extraction of the basename here
The A solution could be complemented by a last read of the json, trimming of the suffixes and rewrite, for compatibility purpose.
Let me know your thoughts and keep up the good work!
Best,
from gunc.
hi, im not sure i understand whats happening here.. did you run with --file_suffix .faa
?
from gunc.
My bad, it worked with --file_suffix .faa
Thanks @fullama for the quick reply.
from gunc.
Related Issues (20)
- installation failure using conda HOT 5
- **conda install** no longer finds **diamond=2.0.4** HOT 9
- Diamond parameters - max-target-seqs HOT 4
- file_suffix flag HOT 24
- panda KeyError: 'contig' HOT 5
- false positives HOT 8
- NAN in the output result HOT 13
- apply GUNC to a large dataset of MAGs HOT 1
- Failed to run Diamond HOT 4
- Incompatibility with pandas 2.0 HOT 2
- diamond error HOT 4
- Command 'diamond --version' returned non-zero exit status 132 HOT 3
- --use_species_level is not a better choice? HOT 1
- --use_species_level is not a better choice? HOT 5
- Suppress deprecation warning from pandas HOT 1
- Threshold CSS values when dealing with genomes in reference database HOT 4
- New GTDB database release HOT 1
- GUNC gene_counts.json file not found! HOT 5
- GUNC diamond output filename HOT 1
- Database update for gunc
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gunc.