GithubHelp home page GithubHelp logo

rna-seq's Introduction

RNA-seq analysis pipeline, STAR input, edgeR, functional enrichment, visualization

  • RNAseq-workflow - A repository for setting up a RNAseq workflow. Detailed instructions and code for each analysis and visualization step.

  • Analysis_STAR.Rmd - RNA-seq analysis pipeline for STAR counts. Prerequisites:

    • A path to data folder. This folder should have 3 subfolders:
      • 02_STAR-align - gzipped count files with .tab extension outputted by STAR aligner
      • results - folder where the results will be stored
      • data - Must have sample_annotation.csv file, example below
  • GSEA.Rmd - EnrichR (non-directional) and GSEA (directional) analysis using KEGG, GO, MSigDb.

  • GSEA_figures.Rmd - Visualization of GSEA enrichment results as horizontal barplots.

  • Figure_heatmap.Rmd - make heatmap of top 50 differentially expressed genes. Uses TMP.xlsx produced by Analysis*.Rmd. May use a custom signature of genes. Includes EnhancedVolcano and boxplots of selected genes.

  • oncoEnrichR.Rmd - Cancer-dedicated gene set interpretation using the oncoEnrichR R package

  • Pathview.Rmd - visualization of top KEGG pathways. Uses DEGs.xlsx produced by Analysis*.Rmd. Example

  • calcTPM.R - a function to calculate TPMs from gene counts

  • utils.R - helper functions

misc - Outdated scripts

  • Analysis_featurecounts.Rmd - RNA-seq analysis pipeline for featureCount counts. Prerequisites:
    • A path to data folder. This folder should have 3 subfolders:
      • 03_featureCount - gzipped count files outputted by featureCount
      • results - folder where the results will be stored
      • data - Must have sample_annotation.csv file. Annotation file should have "Sample" column with sample names, and any other annotation columns. Include "Group" column containing covariate of interest. Example:
# Sample,Group
VLI10_AA_S61_L006_R1_001.txt.gz,AA
VLI10_AA_S61_L007_R1_001.txt.gz,AA
VLI10_AA_S61_L008_R1_001.txt.gz,AA
VLI11_C_S62_L006_R1_001.txt.gz,C
VLI11_C_S62_L007_R1_001.txt.gz,C
VLI11_C_S62_L008_R1_001.txt.gz,C
  • Figure_clusterProfiler_nes.Rmd - Takes the results of edgeR analysis from an Excel file, performs GO and KEGG GSEA and plots the results as horizontal barplots, sorted by normalized enrichment score (NES). Example

  • Figure_clusterProfiler_asis.Rmd - Takes the results of edgeR analysis from an Excel file, performs GO and KEGG GSEA and plots the results as horizontal barplots, sorted by p-value, as they come out of the enrichment analysis.

  • enrichR_analysis.Rmd - Analyze gene lists using enrichR. Analyze all genes, and up- and downregulated genes separately. Uses DEGs.xlsx produced by Analysis*.Rmd.

  • enrichR_plot.Rmd - barplot of selected enrichment results, similar to Example. WIP

Scripts for running RNA-seq preprocessing steps on a cluster using PBS job submission system. subread-featurecounts scripts are in the dcaf/ngs.rna-seq repository

multiqc --filename multiqc_01_trimmed.html --outdir multiqc_01_trimmed 01_trimmed/
multiqc --filename multiqc_02_STAR-align.html --outdir multiqc_02_STAR-align 02_STAR-align/

CaSpER pipeline detecting CNVs from RNA-seq data

Dedicated repository with detailed instructions: mdozmorov/CaSpER_pipeline

Misc

rna-seq's People

Contributors

mdozmorov avatar azzoam avatar

Stargazers

bio-for-ever avatar  avatar qianche avatar Ipek avatar zm avatar Zhixin Li, PhD avatar Mónica Cabrera avatar Konstantinos Kyriakidis avatar Sigve Nakken avatar Hasim Hakanoglu avatar Won Cheol Yim avatar Audrys G. Pauža avatar  avatar  avatar

Watchers

 avatar  avatar Konstantinos Kyriakidis avatar Audrys G. Pauža avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.