fabianackle / nestlink-pipeline

Pipeline to generate accurate consensus sequences of NestLink libraries.

License: MIT License

Nextflow 27.66% Python 67.97% Shell 4.37%

NestLink-pipeline

NestLink-pipeline processes NestLink libraries sequenced by nanopore sequencing. Reads are binned according to their flycodes (UMIs), and an accurate consensus sequence is calculated for each bin using Medaka. The pipeline then calls variants and produces a flycode assignment table that links each protein variant to its set of flycodes.
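
As a rough illustration of the binning step, the sketch below groups reads by the flycode found between two flanking sequences. The flank strings, the function names, and the minimum bin size are hypothetical placeholders chosen for the example; they are not the values or the code used in main.nf.

```python
import re
from collections import defaultdict

# Hypothetical flanking sequences; the real ones are library-specific.
LEFT_FLANK = "GGTAGT"
RIGHT_FLANK = "TGGCGG"
FLYCODE_PATTERN = re.compile(f"{LEFT_FLANK}([ACGT]+?){RIGHT_FLANK}")


def extract_flycode(read):
    """Return the sequence between the two flanks, or None if absent."""
    match = FLYCODE_PATTERN.search(read)
    return match.group(1) if match else None


def bin_reads_by_flycode(reads, min_reads=2):
    """Group reads by flycode and drop bins with fewer than min_reads reads."""
    bins = defaultdict(list)
    for read in reads:
        flycode = extract_flycode(read)
        if flycode is not None:
            bins[flycode].append(read)
    return {fc: rs for fc, rs in bins.items() if len(rs) >= min_reads}


# Toy reads: two share one flycode, one has a unique flycode, one has none.
reads = [
    "AAAGGTAGTACGTACTGGCGGTTT",
    "CCCGGTAGTACGTACTGGCGGAAA",
    "AAAGGTAGTTTTGGGTGGCGGTTT",
    "AAAACCCC",
]
bins = bin_reads_by_flycode(reads)
for flycode, members in bins.items():
    print(flycode, len(members))
```

Each surviving bin would then be the input for one consensus calculation. Discarding small bins is an assumption made here because a consensus built from a single read cannot correct sequencing errors.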

Warning

NestLink-pipeline is still in development. Certain library-specific strings are still hard-coded in main.nf and have to be edited before running the pipeline.

Requirements

Running the pipeline

  1. Clone the repository.
  2. Place the basecalled sequencing data and the reference sequence into projectDir/data/.
  3. Run the first workflow "prepare_data" of the pipeline: nextflow run main.nf -entry prepare_data
  4. Generate the consensus sequences using Medaka with the data from projectDir/medaka_input/, and place the Medaka output assembly.fasta into the folder projectDir/medaka_input/.
  5. Run the second workflow "nestlink" of the pipeline: nextflow run main.nf -entry nestlink
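
Because the second workflow depends on a file produced by hand in step 4, a small pre-flight check can catch a missing input before Nextflow starts. The sketch below is one way to do this, using the folder names from the steps above. The helper name missing_inputs, the file name reads.bam, and the assumption that both workflows need a non-empty data/ folder are illustrative and are not part of the pipeline.

```python
import tempfile
from pathlib import Path


def missing_inputs(project_dir, entry):
    """List the paths that workflow `entry` expects but that are absent."""
    project_dir = Path(project_dir)
    missing = []
    # Assumption: both workflows read from a non-empty data/ folder.
    data_dir = project_dir / "data"
    if not data_dir.is_dir() or not any(data_dir.iterdir()):
        missing.append(data_dir)
    # The "nestlink" workflow additionally needs the Medaka output (step 4).
    if entry == "nestlink":
        assembly = project_dir / "medaka_input" / "assembly.fasta"
        if not assembly.is_file():
            missing.append(assembly)
    return missing


# Demonstration on a throwaway directory tree.
with tempfile.TemporaryDirectory() as tmp:
    project = Path(tmp)
    (project / "data").mkdir()
    (project / "data" / "reads.bam").touch()  # hypothetical file name
    before = missing_inputs(project, "nestlink")
    (project / "medaka_input").mkdir()
    (project / "medaka_input" / "assembly.fasta").touch()
    after = missing_inputs(project, "nestlink")

print([p.name for p in before])
print(after)
```

An empty list means the workflow's expected inputs are in place; otherwise the returned paths show which step still has to be completed.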

Generating consensus sequences using Medaka

Example with CUDA and Singularity installed on Ubuntu 20.04.

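# Step 1: compute consensus probabilities from the aligned reads (written to an HDF file).
# --nv enables NVIDIA GPU support; --bind mounts the host working directory at /data.
singularity run --nv \
    --bind /home/ubuntu/calculation/consensus:/data --pwd /data \
    docker://ontresearch/medaka:latest medaka consensus \
    --batch 200 --threads 2 --model r1041_e82_400bps_sup_v5.0.0 \
    merged.sorted.bam results.contigs.hdf

# Step 2: stitch the consensus results into the final FASTA file (assembly.fasta).
singularity run --nv \
    --bind /home/ubuntu/calculation/consensus:/data --pwd /data \
    docker://ontresearch/medaka:latest medaka stitch \
    results.contigs.hdf reference_all.fasta assembly.fasta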
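
Before moving assembly.fasta into projectDir/medaka_input/, it can help to confirm that the file contains the expected number of consensus sequences. The following is a minimal sketch of such a check using only the Python standard library. The function name read_fasta and the record names in the example are hypothetical, and the actual headers written by Medaka may look different.

```python
def read_fasta(lines):
    """Parse FASTA lines into a dict mapping record name to sequence."""
    records = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first word of the header as the record name.
            header = line[1:].split()
            name = header[0] if header else ""
            records[name] = []
        elif name is not None:
            # Sequence lines may be wrapped, so collect and join them.
            records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


# Toy input standing in for the contents of assembly.fasta.
example = """>flycode_1
ACGTAC
GTTA
>flycode_2
TTGACA
""".splitlines()

records = read_fasta(example)
lengths = {name: len(seq) for name, seq in records.items()}
print(len(records), lengths)
```

For a real file, the same function can be called on an open file handle, for example `with open("assembly.fasta") as handle: records = read_fasta(handle)`.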
