Comments (4)
This may either be a case of unexpected behavior on Mac, or your hg38.fa is really missing the Y chromosome (probably due to a corrupt download). Can you check whether the sequence is present:
grep chrY /Volumes/bam/DRG/annotations/hg38.fa
This should give you:
>chrY
>chrY_KI270740v1_random
>chrY_KZ208923v1_fix
>chrY_KZ208924v1_fix
>chrY_KN196487v1_fix
I just tested the FastA and GTF files you linked. Arriba does not throw the error on my system (Linux). I will test it on a Mac next. Hopefully, I will find some time tomorrow.
from arriba.
It's pretty strange. I do have the Y chromosome:
| => grep chrY /Volumes/bam/DRG/annotations/hg38.fa
>chrY
>chrY_KI270740v1_random
>chrY_KZ208923v1_fix
>chrY_KZ208924v1_fix
>chrY_KN196487v1_fix
And yet it gives the error I mentioned! This is very mysterious.
from arriba.
I was able to reproduce your problem on a Mac! Just as suspected, the root cause is a Mac peculiarity. The function I use to check if a file is empty behaves differently on Mac than on Linux. On Mac, it always reports the file as empty, so nothing gets read at all. Arriba only complains about chromosome Y because that's the first one it checks, but really all of the chromosomes are missing.
Incidentally, I have already fixed the issue in my local code repository, because I recently made some optimizations to the routine that loads the assembly. This optimization happens to circumvent the Mac issue. I will push the changes to GitHub sooner or later. Until then, there is an easy workaround: You should only pass gzipped files to Arriba (i.e., whenever you use -a, -g, -b, -d, or -k). Loading of gzipped files uses different code, so there the issue is not triggered. BAM and SAM files (arguments -c and -x) need not be zipped, because they also use different code for loading.
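The workaround could look roughly like the sketch below. It assumes gzip is installed; gzip_keep is a hypothetical helper name, and the file names in the commented example are placeholders rather than the paths from this thread. The helper writes a compressed copy next to the original and leaves the original untouched.

```shell
#!/bin/sh
# Hypothetical helper: compress a plain-text input file while keeping the
# original. Writes <file>.gz next to it and prints the path of the new file.
gzip_keep() {
    gzip -c "$1" > "$1.gz" && printf '%s\n' "$1.gz"
}

# Example usage (placeholder file names; "..." stands for your other options):
#   assembly=$(gzip_keep hg38.fa)
#   annotation=$(gzip_keep annotation.gtf)
#   arriba -x Aligned.out.bam -a "$assembly" -g "$annotation" ...
```

Only the files passed via -a, -g, -b, -d, or -k need to be compressed this way; the BAM/SAM inputs can stay as they are.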
from arriba.
Hi Sebastian,
Your suggestion to use the gzipped file worked! Thanks a lot for the prompt and detailed response.
from arriba.