GithubHelp home page GithubHelp logo

chaoxiansen / mgv Goto Github PK

View Code? Open in Web Editor NEW

This project forked from snayfach/mgv

0.0 0.0 0.0 7.62 MB

Supporting code for uncultivated gut virus manuscript

Shell 12.27% Python 87.53% R 0.20%

mgv's Introduction

Supporting materials for manuscript: "Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome"

Data availability

See: https://portal.nersc.gov/MGV/

Code availability

  1. Viral detection pipeline: Identify viral contigs >=1Kb using the pipeline described in the manuscript

  2. Quality control: Identify and remove putative host regions flanking viral contigs. Quantify genome completeness and apply genome quality standards.

  3. Cluster genomes based on ANI: Average nucleotide identity (ANI) code and centroid based clustering. Used to identify species-level viral clusters

  4. Cluster genomes based on AAI. Average amino acid identity (AAI) code and MCL based clustering. Used to identify genus-level and family-level viral clusters

  5. Create SNP phylogenetic trees. Identify SNPs in core-genome regions based on whole-genome alignments. Build phylogenetic tree based on SNPs. Used in manuscript to create strain-level phylogenies for species-level viral clusters

  6. Create marker-gene phylogenetic trees. Identify prevalent single-copy genes in a viral clade. Use concatenated gene alignments to build phylogenetic tree.

  7. Identify CRISPR spacers. Identify CRISPR spacers using CRT and PILERCR, merge redundant CRISPR arrays, and format output.

For any other code/analysis inquiries, please open a github issue. Note: most of these scripts were written for Python 2. If you get an error using Python 3, try re-running with Python 2.

If this code is useful, please cite: Nayfach et al. Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome. 2021. https://www.nature.com/articles/s41564-021-00928-6.

Updated version of the database

See: https://github.com/snayfach/UHGV

mgv's People

Contributors

snayfach avatar rocknhu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.