Comments (7)
That depends on what you are interested in. If you want to create haplotype-specific transcripts using the haplotypes from those individuals then you would need to add them to the pangenome graph. However, the genotypes need to be phased for that.
from rpvg.
When we print the code, it showed that "(standard_in) 1: syntax error" and we do not how to solve the problem.
grep -P "\ttranscript\t" gencode.v29.primary_assembly.annotation_renamed_full.gtf | cut -f9 | cut -d ';' -f2 | cut -d '"' -f2 | uniq | shuf | head -n
from rpvg.
Hi,
Are you interested in replicating the benchmark from the manuscript or do you have your own data that you want to run the pipeline on?
The scripts you are referring to are not really part of the standard pipeline. They were used specifically for the data and benchmarking presented in the manuscript:
preprocess.sh
renames the contigs in the annotation (column 1) to match the genome used (removes chr prefix) and filters transcripts that are not full length.exons.sh
creates a BED file of exon coordinates from the annotation.gene_transcripts.sh
creates a table of gene and transcript names.subsample_transcripts.sh
creates a subset of the annotation by removing transcripts. Takes as input the percentage that should be kept.
Running these scripts are only needed if you want to exactly replicate the benchmark det was done in the manuscript. If you have your own data and just want to run the pipeline to get expression estimates these scripts are not really needed.
from rpvg.
Hello developers!
Thank you for your patient and timely reply!
I was building a 15 genome pan-genome using minigraph-cactus and got the pan-genome well. However, I also want to add short sequenced data from 500 individuals to our pan-genome to build a more complete pan-genome in order to obtain more complete transcriptome information.
Is it necessary to add these individuals to our pan-genome?
Best yours,
from rpvg.
Thank you for your reply!
Now we have one more little question.
Our genome is about the same size as the human genome. It has been several days since we used vg auto index, but it is still not finished. Would you please tell me how to solve this problem? If we split up the chromosomes and analyzed them, we couldn't match them in the transcriptome, and that confused us.
Best yours,
from rpvg.
I am unfortunately not able to answer that. You should ask the question on the vg GitHub.
from rpvg.
Thank you for your reply
Best wishes
from rpvg.
Related Issues (20)
- Model for rpvg's output HOT 40
- build rpvg failed HOT 1
- make static version? HOT 2
- Assertion `hasNodeId(node_id)' failed HOT 20
- Could not install the rpvg in Linux HOT 2
- Unable to obtain GWBT HOT 13
- Failed to read BGZF block data at offset 4088217654 expected 4336 bytes; hread returned 1914 HOT 2
- Problems on pan-transcriptome index HOT 3
- Too much time consumed in quantification HOT 8
- 'std::bad_alloc' error in rpvg HOT 3
- Unable to run rpvg analyses without error HOT 2
- Can't run rpvg successfully HOT 5
- Meaning of input and output files of pantranscriptome HOT 5
- Haplotype transcript length inconsistent with GTF HOT 3
- Haplotype probability and read count HOT 2
- Error : void NestedPathAbundanceEstimator::inferPathSubsetAbundance HOT 2
- HST assignment : rpvg assigned two different homo HST instead of expected heterozygous transcripts.
- mapping reads to pan-transcriptom too slow HOT 2
- rpvg stuck for two days
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rpvg.