GithubHelp home page GithubHelp logo

billzt / pyoverchain Goto Github PK

View Code? Open in Web Editor NEW
4.0 2.0 1.0 40 KB

a python pipeline to generate chain files between different genome assemblies for LiftOver.

License: GNU General Public License v3.0

Python 100.00%
liftover genome blat

pyoverchain's Introduction

pyChain

a python pipeline to generate chain files between different genome assemblies for LiftOver.

External Dependencies

This pipeline can only be used under Linux. These softwares must be in your system PATH

Installation

Via pip (release only)

pip3 install pyoverchain

Via Github

git clone https://github.com/billzt/pyOverChain.git
cd pyOverChain
python3 setup.py install

Usage

pyoverchain [-h] [-v] [-n NUM_CHROMOSOME_TASKS] [-p NUM_THREADS_PBLAT]
               [--disable-progress]
               old_genome new_genome chr_map

positional arguments:
  old_genome            Old genome file in FASTA format
  new_genome            New genome file in FASTA format
  chr_map               A chromosome-mapping-file in TSV format

optional arguments:
  -h, --help            show this help message and exit
  -v, --version         show program's version number and exit
  -n NUM_CHROMOSOME_TASKS, --num-chromosome-tasks NUM_CHROMOSOME_TASKS
                        number of parallel tasks to run different chromosomes
                        (default: 2)
  -p NUM_THREADS_PBLAT, --num-threads-pblat NUM_THREADS_PBLAT
                        number of threads for pblat. Only useful if pblat were
                        available (default: 2)
  --disable-progress    Disable progress displaying. Useful if you want to run
                        it in background (default: False)

chromosome-mapping-file

It is a plain text file in TSV format showing the relationship between genome assemblies. Corresponding chromosomes usually have the same or similar names. However, this is not always the case.

#old_genome_chr new_genome_chr
chr1    chr1
chr2    chr2
chr3    chr3

Some useful documents

Warning

This pipeline can only liftOver different assemblies of the same species.

pyoverchain's People

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

avdanurag

pyoverchain's Issues

Working Directory/Mapping File

Hello,
First, thanks for this tool. It's the only one that's been working for me!

However, we've noticed that this tool only runs when both genomes are in the same working directory. Could you clarify this in the documentation?

Error:
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome.axtChain./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome.blat./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome.chain./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome.lift./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome.liftUp./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome.net./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.NewGenome.over./path/to/genome’: No such file or directory
mkdir: cannot create directory ‘pyOverChainTmp.OldGenome./path/to/genome’: No such file or directory
[pyoverchain:2022-10-26 16:40:12.374990] Begin separating chr1 from /path/to/genome...
sh: pyOverChainTmp.OldGenome./path/to/genome/chr1.fa: No such file or directory
[pyoverchain:2022-10-26 16:40:12.382318] Begin separating chr1 from /path/to/genome...
sh: pyOverChainTmp.NewGenome./path/to/genome/chr1.fa: No such file or directory
[pyoverchain:2022-10-26 16:40:12.390505] Begin separating chr2 from /path/to/genome...
sh: pyOverChainTmp.OldGenome./path/to/genome/chr2.fa: No such file or directory
[pyoverchain:2022-10-26 16:40:12.397635] Begin separating chr2 from /path/to/genome...
sh: pyOverChainTmp.NewGenome./path/to/genome/chr2.fa: No such file or directory

Additionally, I would suggest to integrate creating a mapping file for the two genomes as part of the tool.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.