Comments (8)
Hi,
Thank you for posting. BayesTyper can easily run on non-human species, however there are some details regarding the handling of the reference sequence (sex chromosomes in particular) that you should be aware of. We have written a wiki with the details.
Please do not hesitate to post again if you have further questions.
Br,
Lasse
from bayestyper.
Dear Lasse,
Thank you for the kindly reply, and the detailed wiki helps.
However, when trying the bayesTyper genotyping procedure, I met the error message below and exit when running KMC and bayesTyperTools makeBloom:
$ kmc -fbam test.bam test ./
**
Stage 1: 100%
Stage 2: 100%
1st stage: 0.826148s
2nd stage: 0.417656s
Total : 1.2438s
Tmp size : 1MB
Stats:
No. of k-mers below min. threshold : 29499
No. of k-mers above max. threshold : 0
No. of unique k-mers : 53938
No. of unique counted k-mers : 24439
Total no. of k-mers : 1082156
Total no. of reads : 9549
Total no. of super-k-mers : 138224
$ bayesTyperTools makeBloom -k test -p 1
[19/06/2018 21:32:48] You are using BayesTyperTools (v1.3.1)
[19/06/2018 21:32:48] Running BayesTyperTools (v1.3.1) makeBloom ...
bayesTyperTools: /isdata/kroghgrp/jasi/bayesTyper/code/releases/v1.3.1_static/BayesTyper/src/bayesTyperTools/MakeBloom.cpp:213: kmer_bloom_t* MakeBloom::kmc2bloomThreaded(const string&, float, uint): Assertion `kmc_table_info.kmer_length == kmer_size' failed.
Aborted (core dumped)
Do you have any idea what caused this failure ?
Please find attached the bam files and the KMC3 out files .
Thank you!
from bayestyper.
Hi again,
You need to inform KMC to count k-mers with k=55 (the default BayesTyper k-mer size). Try rerunning KMC with -k55
added to the commandline.
I just realised that this information wasn't in the readme, Ive updated it now. Im sorry for the bother.
br,
from bayestyper.
Thank you again. It finally works.
And I have another question irrelevant with this issue. I have learned that in your paper, the performance of BayesTyper and SVTyper were compared using both deletion and insertion classes of SVs. I have tried SVTyper (version: v0.6.0) and the default settings didn't support insertion calls. So could you please tell me that how did you call insertion genotype using SVTyper?
best wishes!
songtao gui
from bayestyper.
Dear Songtao,
Glad to hear that you got it to work.
Yes, SVTyper does not support insertions (<INS>), however it supports duplications (<DUP>), which can also be classified as insertions. All insertions called by SVTyper in the paper are thus duplications.
Please let me know if you have any other questions.
Best wishes,
Jonas
from bayestyper.
Dear Jonas,
Thank you for the kindly reply.
And I do have another question.
Does BayesTyper use split reads information during genotyping? Because I already have a batch of bam files aligned by bowtie2 (end-to-end), I was wondering is it OK if I use these bowtie2 bam during the BayesTyper Genotyping procedure, or do I have to realign with aligner such as BWA MEM to generate split reads information ?
Thank you for your help.
Best wishes,
Songtao Gui
from bayestyper.
Hi again,
BayesTyper does not use the read alignments for genotyping, but only the kmers counted from the read sequences. Therefore it is not important whether you have split read information or not. However, what is important, is that your bam files also contains the unmapped reads, since BayesTyper uses kmers from all reads including the unmapped. I have added a comment about the latter to the readme.
Having said that, if you are running the variant candidate generation step before genotyping, split reads are needed, since they are important for predicting larger structural variants.
Let me know if you have any other questions.
Thank you for your interest in our method.
Best wishes,
Jonas
from bayestyper.
Dear Jonas & Lasse
Thank you for all your help. Glad to know that split read information is not important for genotyping.
So far I have no more questions and I will close this issue.
Thank you again for developing this wonderful genotyper!
Best wishes,
Songtaogui
from bayestyper.
Related Issues (20)
- [feature request] Genotyping more than 500 samples HOT 3
- Question of the format of VCF file for BayesTyper. HOT 2
- A question about convertAllele.
- Questions about bayesTyperTools combine HOT 4
- Genotyping 500 samples
- Genotyping for the inversions HOT 2
- A question about VCF format of results from the 'genotype' function. HOT 1
- Identifying SV type from output HOT 1
- when cmake, Could NOT find Boost (missing: Boost_INCLUDE_DIR iostreams program_options system filesystem serialization) HOT 1
- Where are the test cases HOT 1
- how to run bayestyper HOT 2
- Classifying kmers error with bayesTyper 1.4.1 and 1.5 HOT 1
- Questions on data bundle and SV genotyping HOT 1
- how to get test case
- downloading variation priors fails HOT 3
- how to generate test cases
- genotype failed: Assertion `parameters.first > 0' failed. HOT 3
- What is the open source protocol for bayesTyper? HOT 1
- If executed multiple times, will the result of the data generated be the same each time? HOT 5
- With bayestyper genotyping SV, the genotype rate decreases as the depth of sample sequencing increases.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bayestyper.