artic-network / rampart
Read Assignment, Mapping, and Phylogenetic Analysis in Real Time
License: GNU General Public License v3.0
We will often have many more reference genomes than there is space on the reference heatmap but only a few of them will have any significant number of reads mapping to them. This chart should dynamically adjust to only show top references. Perhaps specify a maximum number plus some cutoff for the minimum number of reads for it to be shown.
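A minimal sketch of such a filter, assuming per-reference read totals are available; the function and parameter names (`topReferences`, `maxShown`, `minReads`) are illustrative, not RAMPART's actual options:

```javascript
// Pick which references to display on the heatmap: drop references below
// a minimum read count, then keep only the top N by mapped reads.
function topReferences(readCounts, maxShown = 10, minReads = 50) {
  // readCounts: { referenceName: totalMappedReads, ... }
  return Object.entries(readCounts)
    .filter(([, n]) => n >= minReads)   // cutoff for near-empty references
    .sort((a, b) => b[1] - a[1])        // most reads first
    .slice(0, maxShown)                 // cap the number shown
    .map(([name]) => name);
}
```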
This would probably be a separate app from RAMPART as it would be run at sample receipt before the lab work begins. Would allow you to provide sample IDs, select intended protocols, assign barcodes, specify which RAMPART instance to use (i.e., what virus it is) etc.
Ultimately it should provide a customized protocol ready for lamination (you did bring the laminator didn't you @igoodfel?).
Implement a full page stats page with more detailed plots, tables, QC, and stats for any channel. This would be reached by a button on each channel panel (also same page is available for the whole run at the top). This could also have various 'action' buttons such as to assemble the consensus genome and push it to the analysis package.
Grey out switch or don't show switch? Or possibly just show solid block of colour with vertical gaps to show 0 coverage?
Need 25 (nice) colours for all the native barcodes + one for 'none'
If you click on a sample name or sample bar in the top panel then scroll to view that sample.
Currently, at line 77 in processServerData.js, state.sampleColours = createSampleColours(25);
creates an array of 25 colours on a spectrum. This allows for the 24 native barcodes + 1 extra (i.e., 'none'). This needs to be dynamic for the number of samples.
If the number of samples is greater than this, an array-out-of-bounds error is thrown in some of the components that use it. These should probably 'wrap' around to avoid the exception.
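A wrap-around lookup along these lines would avoid the exception; `getSampleColour` is a hypothetical helper, not the project's actual fix:

```javascript
// Wrap the sample index with modulo so a sample count larger than the
// colour array reuses colours instead of indexing out of bounds.
function getSampleColour(colours, sampleIdx) {
  return colours[sampleIdx % colours.length];
}
```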
Double click an icon on the desktop to start everything up with minimum fuss/command line gubbins.
At the moment the primer locations are in a JSON file. It could be possible to point towards a .bed file to get these instead.
The call to MiniMap2 in map_single_fastq.py returns the %similarity to the mapped reference. It would be good to be able to include this as a chart in each sample panel (either for the majority mapping or selecting a specific reference).
Channels are indexed from 1 in the CSV files, but when looking up the names in run_info.json they are indexed from 0, causing the names to be off by one and an 'undefined' for the last one.
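The off-by-one could be fixed with a lookup like this sketch (the fallback label is illustrative):

```javascript
// CSV rows use 1-based channel numbers; the names array from
// run_info.json is 0-based, so subtract 1 before indexing.
function channelName(names, csvChannel) {
  const name = names[csvChannel - 1];
  return name === undefined ? `channel ${csvChannel}` : name;
}
```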
Coverage plots should have option to switch to log y-axis.
This would involve mapping any reads that have appeared in the inbox since it was last run. Not entirely clear how to do this? Look at the timestamps of the actual reads?
The spline curves don't give the detail needed to see the individual amplicons. Use a stepped line chart. For the overview, perhaps a stacked chart to show the channels.
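One way to get sharp amplicon edges is to expand the binned coverage into explicit step points before drawing; the bin fields below are illustrative (with D3, a `curveStep` variant on the line generator achieves the same effect):

```javascript
// Expand (x0, x1, depth) coverage bins into the point pairs a stepped
// line needs, so each bin renders as a flat segment with sharp edges
// rather than being smoothed by a spline.
function stepPoints(bins) {
  // bins: [{x0, x1, depth}, ...] sorted by x0
  const pts = [];
  for (const { x0, x1, depth } of bins) {
    pts.push([x0, depth], [x1, depth]); // horizontal segment per bin
  }
  return pts;
}
```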
Add a button to each sample panel to bin the reads currently displayed into a file or folder (also some way of doing this as a batch across all samples). Appropriate labelling of file.
Read counts (and potentially coverage) axes are in log space. Perhaps log and linear can be toggled by clicking on the axis?
If you resize a page or zoom the browser using the cmd + or - options in Safari or Chrome, the components don't resize to fit together until a reload of the page.
A sample panel only appears if at least one read has demuxed and mapped. If you specify barcodeNames then you are expecting those samples so they should be shown even if empty.
The bottom row of the reference heatmap could be an unmapped count. It would be useful to see if your references are too divergent for a particular sample.
Reads over time loses history.
More of a stretch goal but this should go modular - Being able to plug in a module that does additional read annotation in the server process and also adds additional views/components to visualize this in the browser app.
This could include modules for metagenomics with Kraken or kraken-style classification.
I.e., add brushes on the length distribution chart, click reference matches etc. Doing it on the top panel will filter across all samples but can also filter in each individual sample panel.
Would make it possible to restart the server and show history. Would need to record timestamps in the rows of the CSV
Make the design responsive: adapt to smaller screen sizes and add touchscreen abilities. Target iPads and Mk1C screens initially?
The Reference Matches heat map currently shows percent of reads mapped per reference for each sample. It might be useful to show the absolute number of reads mapped per reference to compare sample to sample better. This could also be shown in log scale.
Time units for reads over time should switch to minutes and then hours as appropriate, to allow for comparison.
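A possible formatting helper, assuming elapsed time is tracked in seconds (the thresholds and labels are illustrative):

```javascript
// Pick a time unit for the reads-over-time axis based on elapsed time:
// seconds early in the run, then minutes, then hours.
function formatElapsed(seconds) {
  if (seconds < 60) return `${seconds}s`;
  if (seconds < 3600) return `${Math.floor(seconds / 60)}m`;
  return `${(seconds / 3600).toFixed(1)}h`;
}
```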
Roll over graphs to show actual numbers in popup box.
Currently the maximum number of references shown is hard coded at 10 (and minimum read fraction at 5%). These should be user configurable through a command line option and in the protocol config files.
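A sketch of reading the two thresholds from the command line; the flag names `--maxReferences` and `--minReadFraction` are hypothetical, and the defaults mirror the currently hard-coded values:

```javascript
// Parse the display thresholds from argv, falling back to the values
// that are currently hard coded (10 references, 5% minimum fraction).
function parseDisplayOptions(argv) {
  const opts = { maxReferences: 10, minReadFraction: 0.05 };
  for (let i = 0; i < argv.length; i++) {
    if (argv[i] === "--maxReferences") opts.maxReferences = parseInt(argv[++i], 10);
    if (argv[i] === "--minReadFraction") opts.minReadFraction = parseFloat(argv[++i]);
  }
  return opts;
}
```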
Currently the paths in the configuration json are specified as relative to the current working directory. They should be specified relative to the location of the configuration file.
If barcode names are specified on the command line, i.e.:
--barcodeNames BC01=sample1 BC02=sample2 ...
...then only those samples should be shown in RAMPART. At the moment, other barcodes occasionally come up, which just adds noise to the display. The simplest thing may be to pass these to porechop as the search set.
The order in the RAMPART display should probably be the order specified (currently it is the order they are first found).
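Parsing the pairs into an ordered list would preserve the command-line order for the display; this is a sketch and the field names are illustrative:

```javascript
// Turn ["BC01=sample1", "BC02=sample2", ...] into an ordered list of
// {barcode, name} pairs; array order matches the command-line order.
function parseBarcodeNames(args) {
  return args.map((pair) => {
    const [barcode, name] = pair.split("=");
    return { barcode, name };
  });
}
```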
Currently the heatmap shows the highest values as dark red and the lowest values as a pale yellow. This isn't the natural scale for a heatmap (hotter is more white). Particularly with the dark background, the brightest colour should represent the highest value. Suggest reverse the colour scale.
Generate reports as well formatted documents. Export figures as SVG. Export data as CSV.
Probably more attractive to have an area plot for this rather than the dots.
Set the x-axis range for all the channel read length plots to match.
As well as total reads per channel, we could have a plot of the rate of reads per unit time (also in the title of each channel panel).
Provide reference genome names in configuration JSON. Use index in CSV file to save space.
The annotation script should read the BED file (or similar) and use the MiniMap2 coordinates to infer the amplicon for each read. Some QC could also be done here - I.e. filter multi amplicon chimeras etc. The amplicon number would be added as a column in the annotation CSV file for use by the RAMPART UI.
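One way to infer the amplicon, assuming the BED intervals are parsed into {name, start, end} objects; the best-overlap rule here is an assumption for illustration, not the project's decided approach:

```javascript
// Assign a read to the amplicon its mapped coordinates overlap most.
// Returns null when the read overlaps no amplicon; a low best overlap
// relative to read length could flag multi-amplicon chimeras for QC.
function assignAmplicon(readStart, readEnd, amplicons) {
  let best = null;
  let bestOverlap = 0;
  for (const a of amplicons) {
    const overlap = Math.min(readEnd, a.end) - Math.max(readStart, a.start);
    if (overlap > bestOverlap) {
      bestOverlap = overlap;
      best = a.name;
    }
  }
  return best;
}
```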
Plan to switch from modified version of porechop to using off-the-shelf qcat (https://github.com/nanoporetech/qcat). Need to assess the best way of doing this (i.e., do we still put barcode calls into the read headers).
A new pop-out panel which allows you to add filters on the data being displayed. Filters will include band-pass lengths, reference sets.
Log scales might work better for coverage where there is high variability (would need to be a pseudo count).
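A sketch of the pseudocount transform, since log(0) is undefined at zero-coverage positions (the default pseudocount of 1 is illustrative):

```javascript
// Log-transform a coverage depth with a pseudocount so positions with
// zero coverage remain plottable on a log-scale axis.
function logDepth(depth, pseudo = 1) {
  return Math.log10(depth + pseudo);
}
```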