iobio / bam.iobio.io Goto Github PK

View Code? Open in Web Editor NEW

47.0 47.0 19.0 6.31 MB

http://bam.iobio.io

License: MIT License

CSS 1.02% HTML 0.39% JavaScript 47.51% Shell 0.46% Vue 50.35% Dockerfile 0.27%

bam.iobio.io's People

Contributors

Stargazers

Watchers

Forkers

boydgreenfield jsa-aerial zachcp raonyguimaraes akmorrow13 limbus-medtec color bmpvieira-forks frameshiftgenomics muhamma1 santboi anderspitman bbyun28 adityaekawade lananhle victorfica xquek ciaranschutte demariadaniel

bam.iobio.io's Issues

No fragment length showing for BAM with low coverage

Amazon S3 bam.test.files.private/SRR893046_STP1N240A.bam

Defined range for insertion length

Is it worth allowing a defined range to be input by the user? Right now, if I look at my genome, the default range is 0-600, with the mean ~360. If I click outliers, it jumps to 0-200M, so even with zooming, I have very little control. I want to see if there is a spike from Alus which would be ~700 for this data, but can't expand the range of the default view to get out to ~800 which is what I would like. Would be nice if I could just specify that I want to look at the range 0-800.

Allow sampling of multiple references

data with many small references (e.g. microbiome) need to sample multiple references at the same time to be useful. Need to figure out how to display multiple read coverages at once to be able to implement this feature.

Add region to bam.iobio

Discussion with Omicia led to the suggestion that we should be able to define chromosome and region and include in URL.

The chromosomes are not sorted, so depending on BAM file, chromosomes in Read Coverage line chart panel appear in non-ordered fashion.

It looks the the 1000Genomes BAM files are ordered sequentially (e.g. ch1, ch2, ch4...), but this is not always the case with BAM files. For example, load this BAM (zebrafish)

https://s3.amazonaws.com/bam.test.files.private/zebrafish/8789X1_111118_SN141_0417_BC04PLABXX_6_STP1N60A.bam

Notice how the first chromosome in the panel is ch20.

Incomplete coverage track

This is not always replicatable, which is obviously a pain, but I have seen this happening a number of times recently, including at BoG.

[CRITICAL] Wonky read depth line chart produced when local BAM file loaded

I downloaded the HG04141bam and bai file that is the same as the default URL. However, the line chart draws a bit strangely when I load from the local file vs. the URL. See screen prints.

Line chart from bam served from URL bam http://s3.amazonaws.com/1000genomes/data/HG04141/alignment/HG04141.mapped.ILLUMINA.bwa.BEB.low_coverage.20130415.bam

Line chart from same bam file loaded from my local drive

Need better error reporting when .bai file is not present

Need better error reporting when bai file is missing. Right now, when loading the bam file from a URL, if the corresponding .bai file is not present, the failure is silent. We should send an error message back to the web page that indicates that the corresponding .bai file is not found. Down the road, it might me nice to generate a .bai file on the fly if it is not present.

read coverage y axis

give the read coverage a y axis to make it more useful

Inconsistent results for Read Coverage Distribution with each re-selection of a chromosome.

This issue only occurs on specific BAM files. To reproduce this bug, please load the following BAM file:
https://s3.amazonaws.com/bam.test.files.private/SRR893047_STP1N240A.bam.

Here are the screen prints of the Read Coverage Histogram for ch1, after each time ch1 is selected. Each time, it produces an entirely different distribution (or so it appears....)

improve reference (chromosome) selection

currently the references are just listed, which is impractical for files with a large number of references. Turn into a auto-complete drop down list.

Subselecting a region in depth chart (brush) too quickly eventually causes non-responsive page, 0 reads sampled

Coverage average is off for sparse data

For a BAM with sparse data, zoom in on a section of data where there are reads and the read coverage distribution looks good, but the average is off the chart to the left. I assume that this is showing the average of all the data, which since it is sparse, is close to zero. Would it make more sense to show the average of the data in the selected data?

Please support sftp protocol for remote files

From Howard Sun:

I do not think to first fix it,but more good solution is to load bam via sftp(ssh), because a large bam data is not open for anyone.
There should be many public codes available. I hope you will consider first sftp. That would be great.

Consistency with vcf.iobio

Set colours for the reference sequences and font sign on the URL section to be the same as vcf.

Loading bam files is slow!

Hi All,

Good day and hope all is well.

We have recently installed bam.iobio.io locally with the help of the installation guide at this link: https://github.com/chmille4/iobio/wiki/Local-iobio-setup

However, when loading the bam files by selecting the 'choose bam file' button it takes very long to load and visualize the data. Our bam file is indexed according to http://bam.iobio.io/help.html.

I am wondering if the bam file needs to be in the same virtual server for some reason where the webservice is running . Currently the bam files are in a separate node connected through a high speed data network.

Also is there any recommended server specification (cpu, memory etc.) for installing bam.iobio.locally?

Best Regards,

Anwar

Add chromosome wheel

for chromosome selection.

Missing file isn't handled

If I supply a file that doesn't exist, bam just sits. Console logs show the file doesn't exist, but the user has no idea.

add support for ga4gh data

Doing so would also let authorized users access data on Google Genomics, which will include the MSSNG autism database, TCGA, 1000 Genomes and more

average looks off

Average looks off for some datasets

Example:
http://bam.iobio.io/?bam=http://s3.amazonaws.com/bam.test.files.private/TCGA-Benchmark-v4/G15511.HCC1143.1.bam

Add range selector for fragment length

Add a range selector to fragment length so that a user can look at the outlier fragments without the chart being overwhelmed by the data in the correct fragment range

Non truncated number for percent on Read Coverage Distribution histogram

upgrade to samtools v1

Current version of samtools fails on remote files that have indices named xxx.bai instead of xxx.bam.bai. This is fixed in samtools v1.

Custom BEDs will only load when reference names (e.g. ch9) match. Need an alias lookup so that different naming conventions won't throw off app.

When I tried to load the Illumina TruSeq Sure Select Capture bed, it wouldn't load giving me the message of no matching reference. The bed file has the reference name 1, where the bam file has the reference name chr1.

Handle ftp data urls

Ftp doesn't support byte range requests so we can't support ftp urls.

Detect ftp data urls and do

Check if url is also being served over http and if so use that url
If above doesn't work, give error to use.

Test bam files

Add a set of test BAM files and a doc that provides bam.iobio links to all the test files. This can be used to check through a list of files and ensure that bam.iobio acts as expected on some constructed use cases.

When I click too quickly between references (chromosomes), the sampling doesn't catch up until all selections have been processed

It seems like the current sampling should be interrupted if the user clicks on another reference when sampling is still in progress.

Unable to load custom bed files - 'Bed file doesn't have coordinates for reference: 1. Sampling normally'

Custom BEDs will only load when reference names (e.g. ch9) match. Need an alias lookup so that different naming conventions won't throw off app. Here are the 2 custom bed files I atttempted to load.

Illumina TruSeq exome capture:
truseq_exome_targeted_regions.hg19.bed

NimbleGen exome capture:
SeqCap_EZ_Exome_v3_capture.bed

Tooltip for Read Coverage Distribution should round the % to an integer instead

When I hover over the Read Coverage Distribution, it shows the x,y which is great. However, the y (%) is not rounding, showing many digits to the right of the decimal.

Orientation of names in chromosome wheel

Do we want these to be rotated?

When I increase the sample size (push the up arrow), the screen doesn't refresh.

I loaded the following bam file and it worked the first time and the second time. However, when I tried to increase the sample size again, the page never refreshed.

HG04141.mapped.ILLUMINA.bwa.BEB.low_coverage.20130415.bam

Improve sampling algorithm

sampling currently is solely based on regions. This means we can either over or under sample based on the read depth of the data. Need to improve sampling so that we keep sampling if we get too few reads or stop sampling once we have enough reads.

Mapping rate

Start bam.iobio for whole genome, rather than chromosome 1 and estimate the number of unmapped reads to get proper mapping rate estimate.

Unmapped reads for BoG poster

Can we figure out a way to generate a bam.iobio screenshot showing the real stats for my genome (specifically, mapping rate of ~70%) for BoG poster. If the iobio suite workflow is also going to be demo'd at BoG, we need to be able to have bam.iobio run and generate these stats. Real stats are:

Total reads: 671644470
Mapped reads: 479405211 (71.3778%)
Forward strand: 431035875 (64.1762%)
Reverse strand: 240608595 (35.8238%)
Failed QC: 0 (0%)
Duplicates: 40566531 (6.03988%)
Paired-end reads: 671644470 (100%)
'Proper-pairs': 472073744 (70.2862%)
Both pairs mapped: 477749930 (71.1314%)
Read 1: 335671024
Read 2: 335973446
Singletons: 1655281 (0.246452%)

Gaps (0 depth) in regions of BAM loaded from file

When I compare the Read Depth line charts for the same BAM file, one loaded from a client-side file, the other from a URL, the line charts closely resemble each other; however, on some chromosomes, there are empty regions on the file-loaded BAM where the URL-loaded BAM has point (depth) data. See screen prints below on the default 1000Genomes dataset (HG04141).

Chromosome 13

Chromosome 9

Include link to IOBIO support page

Allowing letters of support etc.

Loading in Read Coverage information

When the index is parsed, the reference are built up, but the read coverage across the chromosomes appears all at once after the sequences are loaded. Should these be drawn in along with the chromosome titles?

Hide or disable Fragment Length chart for BAM with single-end reads

Since fragment length is only calculated with paired-end reads, let's grey out the Fragment Length Distribution chart when the BAM file has single-end reads only.

IE 11 - Cannot load local bam file. Fails silently, no console messages

After choosing a .bam and .bai file, I click on the 'Open' button and the File chooser dialog closes, but the web page stays on the home page.

I will try to troubleshoot this a bit more tomorrow, stepping through the code to see where the error occurs.

Files from samba mounted drive do not load in bam.iobio

Andrew is having trouble with bam.iobio for local files from a mounted drive. I am seeing the same problem from my local computer.

The density chart loads, but then the browser (Chrome in this case) crashes with an ‘out of memory’ error.

Multiple resolutions

We should test on multiple resolutions, in particular when presenting, we want to be sure that the apps still look good. In the viz-dev branch, the scaling of the chromosome bars and the read coverage (top image) are not the same, so the coverage starts partway through the bars (no coverage above chr1) etc. in a presentation.

Add report problem button

a link titled ‘Report Problem’; clicking it would trigger a screen capture and a popup that allowed the user to send a quick message which was sent to a group email account (e.g. - [email protected] or [email protected])

Different number of Reads Sampled on same file

I am running into one behavior that seems odd…. I get a different number of Reads Sampled each time I load the same bam file. Is this expected behavior?

At first, I thought that the stats were different when the .bai was produced from samtools vs bamtools. However, it may just be that the number of reads sampled varies considerably for each load.

On a related note, the line charges for read depth look slightly different for the .bai generated from the same bam but with samtools vs bamtools. Specifically, I don’t see the line going from the high point back down to zero on the line chart for the samtools generated .bai file.

I’ve attached 2 screen prints showing the read depth line charts from same bam, but from .bai files generated from samtools vs. bamtools.

Percent charts Perfect Pairs and Singletons both 0% on dataset, why?

I pulled down a public dataset for a Zebrafish BAM. Here is the URL to the file:
http://s3.amazonaws.com/bam.test.files.private/zebrafish/8789X1_111118_SN141_0417_BC04PLABXX_6_STP1N60A.bam

Can't load BAM from url with https

No errors, warning in console and the server appears to return data for get estimated read depth; however, the all widgets stay in 'sampling...' state.

Read coverage distribution histogram view gets strange (random?) brush selection each time Chromosome is selected from read depth panel.

Good news - This seems to be dependent on the file loaded. For example, I do not see this problem with our HG04141.mapped.ILLUMINA.bwa.BEB.low_coverage.20130415.bam file.

I am seeing it with a public bam file I downloaded from a Univ of Utah data sharing site.
(A1754/Bam/Hg19/SRR893054_STP1N240A.bam)

Here are screen prints for each time the Read Coverage Distribution histogram is refreshed for the times I clicked on ch20:

2nd iteration

3rd iteration

Scaling issues and reference drop down

The drop down for reference sits on top of the wheel and the read coverage doesn't rescale with the window. The reference sequence markers (the coloured blocks) resize as the screen is resized, but the data stays put. This is an issue, because sometimes the scaling is wrong when you load and you

have to resize the window to get them to line up.

iobio / bam.iobio.io Goto Github PK

bam.iobio.io's People

Contributors

Stargazers

Watchers

Forkers

bam.iobio.io's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs