"TIGAR" standing for Transcriptome-Intergrated Genetic Association Resource, which is developed using Python and BASH scripts. TIGAR can fit both Elastic-Net and nonparametric Beyesian model (Dirichlet Process Regression, i.e. DPR) for gene expression imputation, impute genetically regulated gene expression (GReX) from individual-level genotype data, and conduct transcriptome-wide association studies (TWAS) using both individual-level and summary-level GWAS data for univariate and multivariate phenotypes.
Software
Please add the executable file ./Model_Train_Pred/DPR to your linux ${PATH} directory. Assuming ~/bin/ is a directory added to your ${PATH} environmental variable, you can accomodate the following example command
cp ./Model_Train_Pred/DPR ~/bin/
BGZIP, TABIX, Python 3.5 and the following python libraries are required for running TIGAR
Two columns with the first column specifying the Phenotype (P) and Covariate variables (C) from the PED file, and the second column specifying the corresponding variable names in the PED file. The variables specified in the Asso_Info file will be used in TWAS.
The block annotation file is a tab delimited text file with head row of CHROM Start End File, denoting the chromosome number, starting position, ending position, and corresponding reference VCF file name under specified --geno_path. Reference VCF files shall be of one per chromosome, or one for the whole genome-wide variants. Example block annotation file for European samples is provided ./TIGAR/example_data/block_annotation_EUR.txt.
The same format as the first five columns of the Gene Expression File.
CHROM
GeneStart
GeneEnd
TargetID
GeneName
1
100
200
ENSG0000
X
8. Weight File used for TWAS with GWAS summary statistics (./example_data/weight.txt)
First 5 columns have to be of the following format, specifying chromosome number, base pair position, reference allele, alternative allele, and target gene ID.
The column ES (Effect Size) denotes the weights for this given SNP/TargetGene
Using summary-level GWAS data. Take the output *_training_param.txt from imputation model training as the input Weight file here. The first five columns of the gene expression file will be taken as gene annotation file here for --Gene_anno. The same gene expression file can be used as input for --Gene_anno.