The scamp from christopherbarrington

Ambiguous method overloading for method java.util.LinkedHashMap#plus.
Cannot resolve which method to invoke for [null] due to overlapping prototypes between:
        [interface java.util.Collection]
        [interface java.util.Map]

add cell ranger multi module and workflow

build index
build sample sheet
count

write documentation yaml files

each main.nf file should have an accompanying meta.yaml file to include easy to access information on the modules inputs/outputs etc

try to avoid using conflicting keys

for example: https://github.com/nf-core/modules/blob/master/modules/nf-core/bwa/mem/meta.yml

download genome files

method to download and format as granges, if not provided in an index

update cell ranger arc workflows and modules with metadata channel

make shell scripts more readable

remove the \ in scripts so they are properly formatted in work directories

make reports for tasks/workflows

write a module (?) that will present the information produced/data analysed/processes/steps for a (sub)workflow.

these should be collectable into a single document/webpage

strip.suffix optional

may be better to have this option in the module so that it can be set as false for multimodal assays where the barcodes are checked

decide what to do with task.yaml

formats differ between R and shell scripts. and versions files need deciding

ability to push to github

completed workbook/webpage reports could be pushed to a (or multiple) GitHub repositories

maybe specified with:

    github:
        remotes:
          - ChristopherBarrington/scamp-push-test:babs/docs

is it worth having multiple remotes? maybe would have a public/private version of reports to different remotes?

move feature types parameters

these should be in the analysis stanza groups either _default or in an analysis

reorganise config files

should be tidier!

make parameters file a parameter for flexibility

use --scamp-params param.yaml for example so that --params.stuff can also be used

provide output parameters with new fields

an output could be a parameters yaml file with additional fields added - eg seurat path so that the pipeline can be restarted from here rather than the cache in the longer-term

cell ranger report titles

reports just titled 'output'

move cr arc seurat to new subworkflow

make scrnaseq and snrna+atac workflows

once a seurat object is created the workflow should be the same independent of technology, so different quantification methods can use the same subworkflow

use fair rather than combine and filter

https://www.nextflow.io/docs/edge/process.html#fair

move shared parameters into pipeline stanzas

new assays feature names

make an assay for ensembl and another for gene names

move genome functions out of subworkflows

the make granges and biomart (for example) may be duplicated, can they be moved to a genome processing subworkflow?

gtf to granges (may be index-dependent?)
identification of mitochondrial features (from biomart?)
identification of cell cycle genes (from cc.genes?)

linkout shortcode and partial duplicated

should only be needed in one place

hive missing from guess parameters

include hive datasets for the guess parameters script, need indexes etc

no asf label for it, sample sheet lists 'None'

wit #54

assay_names = objects_to_create.map{['RNA', 'RNA_alt', 'ATAC']}

scamp/workflows/seurat/prepare/cell_ranger_arc/main.nf

Line 183 in 00d1aa2

misc_names = objects_to_create.map{['gene_models', 'features']}

these could be value channels instead of mapped?

library type may be at start of library name

scamp/bin/guess_scamp_file.py

Line 199 in d59ad25

'Gene Expression': ['_GEX$', '_mxGEX$'],

need to include:

^GEX_
^ATAC_

helper script to take a guess at the parameters file

script
documentations

can nf-validator be used?

validate parameters before running pipeline

https://github.com/nextflow-io/nf-validation

shepherd tours could be tidier

Using js.build instead to pass parameters:

https://discourse.gohugo.io/t/variables-inside-javascript/39086/3

can use markdown in json strings

add documentation for utility functions

move utilities out of modules
make a utility archetype
add documentation yaml for each utility
add creation command to populate documentation script

can the process output + metadata bit be simplified?

currently uses two function calls but seems it could be streamlined

merge_process_emissions(make_object, ['opt', 'seurat'])
  .map{merge_metadata_and_process_output(it)}
  .dump(tag: 'seurat:prepare:cell_ranger:objects', pretty: true)
  .set{objects}

christopherbarrington / scamp Goto Github PK

scamp's People

Contributors

Watchers

scamp's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs