GithubHelp home page GithubHelp logo

dpnii-seq's Introduction

DpnII-seq

snakemake-run

DpnII-seq is an experimental assay to measure the genome wide cutting frequency of the restriction enzyme DpnII. Originally DpnII-seq was developed alongside Liquid Chromatin Hi-C to control for biases in cutting frequency when estimating chromatin contact stability in K562 cells. However, the DpnII-seq assay and analysis pipeline could be modified for analyzing cutting frequency for variable restriction enzymes in variable cell types. The DpnII-seq workflow performs mapping, filtering of artifacts, multiple resolution binning, copy number correction and plotting of quality metrics.

This workflow has options for digesting with the following restriction enzymes:

As K562 cells have a primarily triploid karyotype with regions of variable copy number, the analysis workflow corrects coverage tracks to a diploid state genome wide. If the user is applying DpnII-seq to cells with variable copy number states we provide scripts to correct for this bias using a Gaussian mixture model approach.

Experimental Protocol

dpn

Computational Workflow

dag

Requirements

Install Snakemake via Miniconda here

Set up channels
conda config --add channels defaults
conda config --add channels conda-forge
conda config --add channels bioconda
conda config --add channels r
Set up environment
conda env create --file envs/DpnII-seq_env.yaml
source activate DpnII-seq
Set up data directory
.
├── data
│   ├── binning
│   ├── copy_number
│   ├── dpnII_sites
│   ├── fastq
│   ├── fatI_sites
│   ├── hindIII_sites
│   └── indexes
├── DpnII-seq

DpnII-seq workflow requires a data directory with the following files:

  • binning/
    • bedfiles denoting binned genome (ex. hg19_40kb.bed)
  • copy_number/
    • bedfiles denoting binned genome with copy number state (4th column)
  • [X]_sites/
    • bedfiles denoting restriction sites of restriction enzyme X for each chromosome (ex. dpnII_sites_chr1.bed)
  • fastq/
    • paired end fastq files (*_R1.fastq, *_R2.fastq)
  • indexes/
    • bowtie-build index files for reference genome (*.ebwt)

Usage

Currently the DpnII-seq workflow has only been tested in an LSF cluster environment

Snake Command
snakemake -j 10 --latency-wait 60 --cluster-config cluster.json --cluster "bsub -q {cluster.queue} -W {cluster.time} -R {cluster.memory} -n {cluster.cores} -o {cluster.output} -e {cluster.error}" -p

Contributing to DpnII-seq

All contributions, bug reports, bug fixes, documentation improvements, enhancements, and ideas are welcome!

References

Houda Belaghzal*, Tyler Borrman*, Andrew D. Stephens, Denis L. Lafontaine, Sergey V. Venev, Zhiping Weng, John F. Marko, Job Dekker. (2019). Compartment-dependent chromatin interaction dynamics revealed by liquid chromatin Hi-C. bioRxiv

License

GNU General Public License v3.0

dpnii-seq's People

Contributors

tborrman avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.