GithubHelp home page GithubHelp logo

christopherbarrington / scamp Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 845 KB

single cell analysis and more pipeline

Home Page: https://christopherbarrington.github.io/scamp/

Nextflow 58.99% R 13.19% Shell 15.54% Python 12.29%

scamp's People

Contributors

christopherbarrington avatar

Watchers

 avatar

scamp's Issues

document parameters for tasks

like software versions yaml file it may be useful to explicitly record these parameters so they can be more-easily reported.

parameter stanzas always have to include something

could

SEURAT:
    8 weeks replicate 1:

be used to indicate that this should be run with all default parameters

Ambiguous method overloading for method java.util.LinkedHashMap#plus.
Cannot resolve which method to invoke for [null] due to overlapping prototypes between:
        [interface java.util.Collection]
        [interface java.util.Map]

download genome files

method to download and format as granges, if not provided in an index

  • blacklist
  • genes
  • promoters
  • transcripts/exons/introns/centromeres
  • repeats?

make reports for tasks/workflows

write a module (?) that will present the information produced/data analysed/processes/steps for a (sub)workflow.

these should be collectable into a single document/webpage

strip.suffix optional

may be better to have this option in the module so that it can be set as false for multimodal assays where the barcodes are checked

ability to push to github

completed workbook/webpage reports could be pushed to a (or multiple) GitHub repositories

maybe specified with:

    github:
        remotes:
          - ChristopherBarrington/scamp-push-test:babs/docs

is it worth having multiple remotes? maybe would have a public/private version of reports to different remotes?

reorganise config files

should be tidier!

  • publishing config
  • resources config
  • ext arguments config
  • profiles
  • crick config ... nfcore configs?

provide output parameters with new fields

an output could be a parameters yaml file with additional fields added - eg seurat path so that the pipeline can be restarted from here rather than the cache in the longer-term

make scrnaseq and snrna+atac workflows

once a seurat object is created the workflow should be the same independent of technology, so different quantification methods can use the same subworkflow

move genome functions out of subworkflows

the make granges and biomart (for example) may be duplicated, can they be moved to a genome processing subworkflow?

  • gtf to granges (may be index-dependent?)
  • identification of mitochondrial features (from biomart?)
  • identification of cell cycle genes (from cc.genes?)

use template scripts

move script blocks to templates/main.sh to keep better organised and easier to read scripts

can the process output + metadata bit be simplified?

currently uses two function calls but seems it could be streamlined

merge_process_emissions(make_object, ['opt', 'seurat'])
  .map{merge_metadata_and_process_output(it)}
  .dump(tag: 'seurat:prepare:cell_ranger:objects', pretty: true)
  .set{objects}

containerisation

processes should have their own environments, not expect software to be available for them

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.