
Sum-based method for GWAS Signal Enrichment analysis (SumGSE)

1. Introduction

SumGSE is a command-line tool that integrates genomic information about biological mechanisms with GWAS summary statistics for complex traits.

The sum-based method uses signals of all markers within a pre-defined candidate feature. Briefly, we calculated the following summary statistics for candidate regions:

$$T_{sum} = \sum_{i=1}^{m_g} \beta_i^2$$

where $T_{sum}$ is the summary statistic for a tested feature group, $m_g$ is the number of SNPs located in the candidate feature, and $\beta_i$ is the estimated effect of the $i$-th marker, obtained from the GWAS summary statistics. Using this formula, we calculated $T_{sum}$ for the candidate regions.
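As a rough illustration of the statistic described above, the following Python sketch sums marker effects over the SNPs that fall inside a set of candidate regions. It assumes that the statistic is the sum of squared effects, which is the usual form of a sum-based test, and that a SNP belongs to a region when its position lies between start and end inclusive. The function name `sum_statistic` and the toy data are hypothetical and are not part of SumGSE.

```python
from typing import Iterable, List, Tuple

Region = Tuple[str, int, int]   # (chromosome, start, end)
Snp = Tuple[str, int, float]    # (chromosome, position, effect)


def sum_statistic(regions: Iterable[Region], snps: Iterable[Snp]) -> Tuple[float, int]:
    """Return (sum of squared effects, number of SNPs) inside the regions.

    Assumption: a SNP is inside a region when start <= position <= end.
    Each SNP is counted once, even if several regions overlap it.
    """
    region_list: List[Region] = list(regions)
    total = 0.0
    count = 0
    for chrom, pos, effect in snps:
        if any(c == chrom and s <= pos <= e for c, s, e in region_list):
            total += effect ** 2
            count += 1
    return total, count


# Toy data, invented for illustration only.
regions = [("1", 100, 200), ("2", 50, 80)]
snps = [
    ("1", 150, 0.5),   # inside the first region
    ("1", 300, 2.0),   # outside every region
    ("2", 60, -1.0),   # inside the second region
]
t_sum, m = sum_statistic(regions, snps)
print(t_sum, m)
```

The linear scan over regions keeps the sketch short; for genome-scale data an interval index or a sorted sweep would likely be preferable.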

2. Getting Started

To download SumGSE, clone this repository with the following commands:

git clone https://github.com/WentaoCai/GWAS_enrichment.git 
cd GWAS_enrichment

Once the above has completed, you can run: perl SumGSE.pl -h

Usage 1: Enrichment of GWAS signals in your chosen regions.

To test whether your chosen regions are enriched for GWAS signals, use:

perl SumGSE.pl -i [genome_region.bed] -g [GWAS_summaries.txt]

Usage 1 is useful for feature regions such as lncRNAs, ChIP/ATAC peaks, etc.

Example:

perl SumGSE.pl -i lncRNA.test.bed -g GWAS_statistics.txt -e 50 -n 1000
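This README does not spell out how the permutation test behind `-n` is carried out. One common approach, sketched here purely as an assumption, compares the observed sum of squared effects with the sums obtained from the same number of SNPs drawn at random from the whole GWAS. The function `permutation_p_value` and the simulated effects are hypothetical and may differ from what SumGSE.pl actually does.

```python
import random
from typing import Sequence


def permutation_p_value(
    effects: Sequence[float],
    in_feature: Sequence[bool],
    n_perm: int = 1000,
    seed: int = 0,
) -> float:
    """Empirical p-value for the sum of squared effects inside a feature.

    effects:    one effect estimate per SNP (e.g. beta or t value)
    in_feature: True where the SNP lies inside the candidate regions
    """
    rng = random.Random(seed)
    squared = [e * e for e in effects]
    observed = sum(s for s, flag in zip(squared, in_feature) if flag)
    m = sum(in_feature)  # number of SNPs inside the feature
    # Count random SNP sets of size m whose statistic reaches the observed one.
    hits = sum(
        1 for _ in range(n_perm) if sum(rng.sample(squared, m)) >= observed
    )
    # Adding one to both counts keeps the p-value strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Simulated data: 95 small background effects and 5 large effects in the feature.
effects = [0.1] * 95 + [3.0] * 5
in_feature = [False] * 95 + [True] * 5
p = permutation_p_value(effects, in_feature, n_perm=1000)
print(p)
```

Drawing SNPs independently ignores linkage disequilibrium between neighbouring markers, so this sketch should be read as an explanation of the idea rather than a replacement for the tool.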

Usage 2: Enrichment of GWAS signals in your chosen regions, restricted to specific regions.

To test whether your chosen regions are enriched for GWAS signals within specific regions, use:

perl SumGSE.pl -i [genome_region.bed] -g [GWAS_summaries.txt] -s [specific_regions.bed]

Usage 2 may be useful for comparing the enrichment of differentially expressed genes (or differentially methylated sites, etc.) with that of all genes (or methylation sites, etc.) in the genome.

Example:

perl SumGSE.pl -i gene.test.bed -g GWAS_statistics.txt -s swine.gene.bed -e 50 -n 1000
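The effect of restricting the analysis to specific regions can be pictured as a filtering step: only SNPs inside the specific regions form the background, and the tested regions are then evaluated against that background. The Python sketch below shows this filtering under the assumption that membership means start <= position <= end; the helper `snps_in_regions` and the toy coordinates are hypothetical and are not taken from SumGSE.pl.

```python
from typing import List, Tuple

Region = Tuple[str, int, int]   # (chromosome, start, end)
Snp = Tuple[str, int, float]    # (chromosome, position, effect)


def snps_in_regions(snps: List[Snp], regions: List[Region]) -> List[Snp]:
    """Keep only the SNPs that fall inside at least one region."""
    return [
        (chrom, pos, effect)
        for chrom, pos, effect in snps
        if any(c == chrom and s <= pos <= e for c, s, e in regions)
    ]


# Toy stand-ins for the -s file (all genes) and the -i file (tested genes).
all_genes = [("1", 100, 500), ("2", 100, 500)]
test_genes = [("1", 100, 200)]
snps = [("1", 150, 0.9), ("1", 400, 0.1), ("2", 300, 0.2), ("3", 50, 1.5)]

# Background: SNPs that a restricted permutation would be allowed to draw from.
background = snps_in_regions(snps, all_genes)
# Foreground: SNPs in the tested regions, taken from within the background.
foreground = snps_in_regions(background, test_genes)
print(len(background), len(foreground))
```

In this toy case the SNP on chromosome 3 lies outside every gene, so it never enters the background and cannot dilute the comparison between tested genes and all genes.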

Options:

    -i    Input file in BED format (required). The input file should contain genome feature regions (such as DEGs, miRNA targets, or peaks from ChIP-seq or ATAC-seq). The first three columns should be chromosome, start, and end. Example: 1   567821 573421  EEF1D

    -g    GWAS summary statistics (required). The first two columns should be chromosome and position; the last column should be the effect value, such as a t value or beta value. Example: 1  123089 rs0011345  0.00045  -1.4625

    -e    Extension range (kb) for genome feature regions (optional). For example, -e 100 means that each genome feature region also includes its 100 kb upstream and downstream flanks. Default: 0.

    -n    Number of repeats for the permutation test (optional). Default: 1000.

    -o    Output file (optional). Default: "SumGSE_permutation.out".

    -s    Specific regions (optional). If -s is given, the permuted SNPs are restricted to these specific regions.
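To make the expected column layout concrete, the following Python sketch parses the two example lines given for `-i` and `-g` and applies an extension like the one described for `-e`. It assumes whitespace-separated columns, 1 kb = 1000 bp, and that an extended start is clamped at 0; the function names `parse_bed_line` and `parse_gwas_line` are hypothetical and do not come from SumGSE.pl.

```python
from typing import Tuple


def parse_bed_line(line: str, extend_kb: int = 0) -> Tuple[str, int, int]:
    """Read chromosome, start, end from a BED-like line and widen the region."""
    fields = line.split()
    chrom, start, end = fields[0], int(fields[1]), int(fields[2])
    pad = extend_kb * 1000  # assumption: -e is given in kb, 1 kb = 1000 bp
    return chrom, max(0, start - pad), end + pad


def parse_gwas_line(line: str) -> Tuple[str, int, float]:
    """Read chromosome, position, and the effect value from the last column."""
    fields = line.split()
    return fields[0], int(fields[1]), float(fields[-1])


region = parse_bed_line("1   567821 573421  EEF1D", extend_kb=100)
snp = parse_gwas_line("1  123089 rs0011345  0.00045  -1.4625")
print(region)
print(snp)
```

Reading the effect from the last column, rather than from a fixed index, mirrors the description of `-g`, where only the first two columns and the last one have a prescribed meaning.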

3. Citation

If you use the software, please cite:

Integrated Small RNA Sequencing, Transcriptome and GWAS Data Reveal microRNA Regulation in Response to Milk Protein Traits in Chinese Holstein Cattle. Frontiers in Genetics, 2021.

Author

Wentao Cai, Institute of Animal Science of CAAS

Issues with SumGSE? Email: [email protected]
