GithubHelp home page GithubHelp logo

cnr-ibba / nf-chip-alignment Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 286 KB

Align chip probes to a reference genome

License: MIT License

Python 75.53% Nextflow 24.47%
manifest chip genome alignment blat snpchimp smarter

nf-chip-alignment's Introduction

Nextflow chip alignment

This nextflow pipeline is an attempt to align probes from a chip-manifest file on a reference genomes using blast. The reason of doing this is to define the strand orientation of the probe on the reference sequence, in order to be able to convert a genotype from illumina top to forward (or ALT/REF) coordinates. This pipeline was developed as a component of the SMARTER project (WP4 in particular) and will be the first step towards a new SNPchiMp release.

Requisites

To run this nextflow pipeline, you require both a genome reference sequence (better to download it from NCBI ftp) and a manifest csv file from an illumina/affymetrix chip. Next, you you will need nextflow installed and at least one of this different executors: conda, singularity and docker. You can choose to clone this repository if you plan to change this pipeline according your needs:

git clone https://github.com/cnr-ibba/nf-chip-alignment.git

The other way to running this pipeline is described in pipeline sharing nextflow manual.

Testing pipeline

You can test your local configuration by calling the test profile with one of the three supported executors. For example, to test this pipeline with singularity:

nextflow run cnr-ibba/nf-chip-alignment -resume -profile test,singularity

Running pipeline

You can call this pipeline with you local data using the --genome and --manifest options, for example:

nextflow run cnr-ibba/nf-chip-alignment -resume -profile singularity --manifest <manifest.csv> --genome <reference genome>

gzipped input files are supported.

About alignments

Alignments are made using megablast task, which is more stringent and faster then the default blastn. Alignments are filtered relying percentage aligned and percentage identity. A probe is considered unmapped if all alignments are filtered, if there are more than one alignment after filtering or if the SNP doesn't match to the reference genome. The last condition however is not correlated in a issue in alignment, however it could be difficult to join this particular SNP with other data or refer it to a reference SNP. For such reason SNP is discarded even if there's a perfect match of the probe

nf-chip-alignment's People

Contributors

bunop avatar

Watchers

 avatar

nf-chip-alignment's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.