GithubHelp home page GithubHelp logo

avilella / wgbs_tools Goto Github PK

View Code? Open in Web Editor NEW

This project forked from nloyfer/wgbs_tools

0.0 0.0 0.0 32 MB

tools for working with Bisulfite Sequencing data while preserving reads intrinsic dependencies

License: Other

Shell 0.44% C++ 37.73% Python 61.41% Perl 0.29% C 0.03% Makefile 0.11%

wgbs_tools's Introduction

wgbstools - suite for DNA methylation sequencing data representation, visualization, and analysis

wgbstools is an extensive computational suite tailored for bisulfite sequencing data. It allows fast access and ultra-compact representation of high-throughput data, as well as machine learning and statistical analysis, and informative visualizations, from fragment-level to locus-specific representations.

It converts data from standard formats (e.g., bam, bed) into tailored compact yet useful and intuitive formats (pat, beta). These can be visualized in terminal, or analyzed in different ways - subsample, merge, slice, mix, segment and more.

alt text This project is developed by Netanel Loyfer and Jonathan Rosenski in Prof. Tommy Kaplan's lab at the Hebrew University, Jerusalem, Israel.

Quick start

Installation

# Clone
git clone https://github.com/nloyfer/wgbs_tools.git
cd wgbs_tools

# compile
python setup.py

Genome configuration

At least one reference genome must be configured (takes a few minutes).

wgbstools init_genome GENOME_NAME
# E.g, 
wgbstools init_genome hg19
wgbstools init_genome mm9

wgbstools downloads the requested reference FASTA file from the UCSC website. If you prefer using your own reference FASTA, specify the path to the FASTA as follows.

wgbstools init_genome GENOME_NAME --fasta_path /path/to/genome.fa

Dependencies

  • python 3+, with libraries:
    • pandas version 1.0+
    • numpy
    • scipy
  • samtools
  • tabix / bgzip

Dependencies for some features:

  • bedtools

Usage examples

Now you can generate pat.gz and beta files out of bam files:

wgbstools bam2pat Sigmoid_Colon_STL003.bam
# output:
# Sigmoid_Colon_STL003.pat.gz
# Sigmoid_Colon_STL003.beta

Once you have pat and beta files, you can use wgbstools to visualize them. For example:

wgbstools vis Sigmoid_Colon_STL003.pat.gz -r chr3:119528843-119529245

wgbstools vis *.beta -r chr3:119528843-119529245 --heatmap

Deconvolution

To deconvolve tissues or blood samples, see our UXM software

wgbs_tools's People

Contributors

nloyfer avatar yonniejon avatar semenko avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.