GithubHelp home page GithubHelp logo

biren's Introduction

============================================================== DEPENDENCIES:

OS: Linux

  1. Installing torch and basic package dependencies following instructions from http://torch.ch/docs/getting-started.html You may need to install cmake if you do not have it already. It is highly recommended to link against OpenBLAS or other optimized BLAS library when building torch.

  2. npy4th A package to load/save numpy files for troch7 Requirements: torch7 xlua Installation: git clone https://github.com/htwaijry/npy4th.git cd npy4th luarocks make

  3. Python 2.7.x, numpy, pandas, scipy, scikit-learn

  4. Installing Theano and basic package dependencies following instructions from http://deeplearning.net/software/theano/install.html

  5. Installing Lasagne and basic package dependencies following instructions from https://github.com/Lasagne/Lasagne or http://lasagne.readthedocs.org/en/latest/user/installation.html

  6. bedtools (>= 2.25.0) make sure the bedtools has already exported to PATH

  7. bigWigAverageOverBed

=============================================================== USAGE:

STEP 1: Data preprocessing

Example run:

python dataEncode.py example genome.fa EvolutionaryConservation.bw

step1 must contain the following files:

dataEncode.py
transformTo1Kformat.py
genome.fa
OneHotTransform.py
DeepSEA.lua
deepsea.cpu
ExtractCoreRegion.py
bigWigAverageOverBed
EvolutionaryConservation.bw
SortConsBed.py
AddConsToDeepSeaRes.py
RNNDataFormat.py

In this step, the predict target should be .BED format (example.bed), each region should be large than 200bp.

genome.fa is the Human(hg19) whole genome seuqnece which can be obtained from UCSC Genome Browser,please merge all chromosome and rename "genome.fa",or you can download from:

ftp://123.56.134.57/data/genome.fa

EvolutionaryConservation.bw was taken from the vertebrate phastCons44way track from USCS Genome Browser, or you can download from:

ftp://123.56.134.57/data/EvolutionaryConservation.bw

deepsea.cpu is DeepSEA trained model, you can download from:

ftp://123.56.134.57/data/deepsea.cpu

output files will be under "example" folder:

example_data.npy
example_mask.npy  

STEP 2: Prediction procedure

step2 must contain the following files:

BiRen.py
BiRen_trained_model.npy

Example run:

python BiRen.py example

output files will be under ./example : example.out

Here is a minnimal example:

chr20	2719208	2719789	0.220106445764

chr5	2112055	2113430	0.859481014407

The four columns are chromosome, start position, end position and enhancer probabiliy score.

biren's People

Contributors

btpeter avatar wenjiegroup avatar

Watchers

James Cloos avatar Talha Irfan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.