Identifying the interaction of core and accessory genes in P. aeruginosa

Alexandra J Lee, Georgia Doing, Samuel L. Neff, Deborah A Hogan and Casey S Greene

April 2020

University of Pennsylvania

Clinical and environmental strains of Pseudomonas aeruginosa (P. aeruginosa), an opportunistic pathogen that causes difficult-to-treat infections, have significant genomic heterogeneity, including diverse accessory genes that are present in only some strains or clades. Both core genes, which are conserved across strains, and accessory genes have been associated with traits such as biofilm formation and virulence. Much of what we know about core and accessory gene content comes from genome analyses. Here, we use a newly assembled transcriptome compendium to analyze the transcriptional patterns of core and accessory gene expression in PAO1 and PA14 strains across thousands of samples from hundreds of distinct experiments. We found that a subset of core genes was transcriptionally stable across the PAO1 and PA14 strain types and that these genes had fewer accessory genes with correlated expression patterns than did less stable core genes.

Directory Structure

Folder	Description
0_explore_data	This folder contains analysis notebooks to visualize the expression data and get a sense of the variation it contains.
1_processing	This folder contains analysis notebooks to determine what threshold to use to partition the gene expression data into PAO1 and PA14 compendia.
2_correlation_analysis	This folder contains analysis notebooks to detect gene co-expression modules starting with gene expression data, applying Pearson correlation and then clustering on this correlation matrix to obtain gene modules.
3_core_core_analysis	This folder contains analysis notebooks to examine the stability of core genes across strains.
4_acc_acc_analysis	This folder contains analysis notebooks to examine accessory-accessory gene modules.
5_core_acc_analysis	This folder contains analysis notebooks to examine the relationship between core genes and accessory genes.
6_common_genes_analysis	This folder contains analysis notebooks to compare common DEGs found in prior work to core and accessory genes.
scripts	This folder contains supporting functions that other notebooks in this repository will use.
data	This folder contains metadata used for different analyses.
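
The description of 2_correlation_analysis (Pearson correlation followed by clustering on the correlation matrix) can be illustrated with a minimal sketch. It is not the code from the notebooks: the README does not say which clustering method is used, so this example assumes a simple stand-in (genes are linked when their correlation is at or above a threshold, and each connected group is treated as a module). The function names, the toy expression values, and the 0.9 threshold are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles.

    Assumes neither profile is constant (a constant profile has zero
    variance and would cause a division by zero).
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def correlation_matrix(expression):
    """Build a gene-by-gene correlation matrix as a nested dict."""
    genes = list(expression)
    return {
        g: {h: pearson(expression[g], expression[h]) for h in genes}
        for g in genes
    }


def modules_from_correlation(corr, threshold=0.9):
    """Group genes into modules (assumed stand-in for the clustering step).

    Two genes are linked when their correlation is >= threshold; each
    connected component of the resulting graph is returned as one module.
    """
    unvisited = set(corr)
    modules = []
    for gene in corr:  # iterate in insertion order for reproducible output
        if gene not in unvisited:
            continue
        unvisited.discard(gene)
        module, stack = [], [gene]
        while stack:
            current = stack.pop()
            module.append(current)
            for other, r in corr[current].items():
                if other in unvisited and r >= threshold:
                    unvisited.discard(other)
                    stack.append(other)
        modules.append(sorted(module))
    return modules


# Hypothetical toy data: four genes measured across four samples.
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.1, 6.0, 8.2],
    "geneC": [4.0, 1.0, 3.0, 2.0],
    "geneD": [8.1, 2.0, 6.2, 4.0],
}

corr = correlation_matrix(expression)
modules = modules_from_correlation(corr, threshold=0.9)
print(modules)
```

On real compendium-sized data, a vectorized routine such as pandas `DataFrame.corr` would be a more practical way to build the correlation matrix. The pure-Python version here only keeps the sketch dependency-free.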

Usage

Operating Systems: macOS, Linux (Note: bioconda libraries are not available on Windows)

To run this analysis on your own gene expression data, perform the following steps:

First you need to set up your local repository:

1. Download and install GitHub's large file tracker (Git LFS). Once it is installed, set up Git LFS by running: git lfs install
2. Install miniconda
3. Navigate to the location where you'd like the code to live and clone the core-accessory-interactome repository by running the following command in the terminal:

git clone https://github.com/greenelab/core-accessory-interactome.git

Note: Git automatically detects the LFS-tracked files and clones them via HTTP.

4. Navigate into the cloned repo by running the following command in the terminal:

cd core-accessory-interactome

5. Set up your conda environment by running the following command in the terminal:

bash install.sh

6. Navigate to any of the analysis directories listed in the table above to see the code for how the analyses were performed. To reproduce the results and figures of the paper, run the analysis directories in order.

Acknowledgements

We would like to thank Jake Crawford for very insightful discussions about methods and interpretation of gene correlation analyses. We would also like to thank all other members of Greene lab (Natalie Davidson, Ben Heil, Ariel Hippen, David Nicholson, Milton Pividori, Halie Rando, Taylor Reiter) for helpful comments and code review.
