GithubHelp home page GithubHelp logo

eupathdb's Introduction

EuPathDB AnnotationHub Recipes and access

This package contains code for generating Bioconductor objects (OrgDb, OrganismDbi, TxDb, GRanges, and BSGenome) for pathogens found on one of the EuPathDB databases:

The primary purposes for this package are:

  1. Generate organism-specific Bioconductor resources to be made available through AnnotationHub.
  2. Generate installable packages for the various resources.
  3. Provide shortcuts for accessing these resources.

To access EuPathDB resources via AnnotationHub, simply load the AnnotationHub package and use the query function like you would for any other AnnotationHub resource:

library(AnnotationHub)

# create an AnnotationHub connection
ah <- AnnotationHub()

# use AnnotationHub to get an OrgDb object for T. gondii ME49
res <- query(ah, c('Toxoplasma gondii ME49', 'OrgDb', 'EuPathDB'))
orgdb <- res[[1]]

To create a local package of a EuPathDB resource, and get information from it:

library(EuPathDB)
## I pretty much always use Leishmania major strain friedlin as my example.
lm_entry <- get_eupath_entry(species="Friedlin")
lm_name <- sc_entry[["Species"]]
lm_name
orgdb_pkgname <- make_eupath_orgdb(lm_entry)
txdb_pkgname <- make_eupath_txdb(lm_entry)
bsgenome_pkgname <- make_eupath_bsgenome(lm_entry)
organismdbi_pkgname <- make_eupath_organismdbi(lm_entry)

## Get a big monster data table of annotations
major_annotations <- load_orgdb_annotations(orgdb_pkgname)
## Or GO categories
major_go <- load_orgdb_go(orgdb_pkgname)

avail_columns <- AnnotationDbi::columns(orgdb_pkgname)

## Or interpro categories
chosen_columns_idx <- grepl(x=avail_columns, pattern="^INTERPRO")
chosen_columns <- avail_columns[chosen_columns_idx]
lm_interpro <- load_orgdb_go(lm_orgdb, columns=chosen_columns)

## Or Orthologs
chosen_columns_idx <- grepl(x=avail_columns, pattern="^ORTHOLOGS")
chosen_columns <- avail_columns[chosen_columns_idx]
lm_ortho <- load_orgdb_go(lm_orgdb, columns=chosen_columns)

## Or Pathway data
chosen_columns_idx <- grepl(x=avail_columns, pattern="^PATHWAY")
chosen_columns <- avail_columns[chosen_columns_idx]
lm_path <- load_orgdb_go(lm_orgdb, columns=chosen_columns)

## Or KEGG
chosen_columns_idx <- grepl(x=avail_columns, pattern="KEGG")
chosen_columns <- avail_columns[chosen_columns_idx]
lm_kegg <- load_orgdb_go(lm_orgdb, columns=chosen_columns)

Check the vignettes for more examples!

eupathdb's People

Contributors

abelew avatar djperalta avatar khughitt avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

eupathdb's Issues

Revise Vignette

Hi Developers.

The current vignette of EuPathDB is quite old. Most of the commands in it does not work now. Kindly update them.

OrgDb datasets missing from recent AH versions?

E.g.:

AnnotationHub with 895 records
# snapshotDate(): 2022-04-25
# $dataprovider: FungiDB, TriTrypDB, PlasmoDB, MicrosporidiaDB, ToxoDB, CryptoDB, AmoebaDB, PiroplasmaDB, GiardiaDB, TrichDB
# $species: Plasmodium falciparum, Trypanosoma cruzi, Toxoplasma gondii, Cryptosporidium hominis, Trypanosoma brucei, Plasmodium yoelii,...
# $rdataclass: GRanges, TxDb
...

Either the process needs to be updated to make those datasets available, or the README should be updated to not point to these files.

How to download .sqlite file for all OrgDB in one go ?

Hi,

I got the list of species and OrgDB objects using code below. How can I download all .sqlite files in one go ?

library(magrittr)
#> Warning: package 'magrittr' was built under R version 4.1.2
metadata <- EuPathDB::download_eupath_metadata(webservice = "fungidb")
#> Unable to find species names for 1 species.
#> Pneumocystis sp. 'macacae' P2C
#> Warning in xref_species(valid = taxa_xref[["matched_metadata"]], invalid =
#> taxa_xref[["unmatched_metadata"]], : How in the flying hell are these still
#> here: Melampsora larici-populina 98AG31.

metadata_tbl <- tibble::enframe(metadata) %>% 
        dplyr::filter(name != "invalid") %>% tidyr::unnest()
#> Warning: `cols` is now required when using unnest().
#> Please use `cols = c(value)`

metadata_tbl_sub <- metadata_tbl %>% 
        dplyr::select(GenusSpecies, TaxonUnmodified, TaxonXref,DataProvider, Genome,OrgdbPkg, OrgdbFile)

metadata_tbl_sub$OrgdbFile %>% head()
#> [1] "EuPathDB/OrgDb/3.14/org.Aluchuensis.IFO.4308.v59.eg.sqlite"               
#> [2] "EuPathDB/OrgDb/3.14/org.Eglyceriae.E277.v59.eg.sqlite"                    
#> [3] "EuPathDB/OrgDb/3.14/org.Aversicolor.CBS.583.65.v59.eg.sqlite"             
#> [4] "EuPathDB/OrgDb/3.14/org.Asydowii.CBS.593.65.v59.eg.sqlite"                
#> [5] "EuPathDB/OrgDb/3.14/org.Pcinnamomi.var.cinnamomi.CBS.144.22.v59.eg.sqlite"
#> [6] "EuPathDB/OrgDb/3.14/org.Awentii.DTO.134E9.v59.eg.sqlite"

Created on 2022-10-18 with reprex v2.0.2

Cannot find OrgDB objects from EuPathDb in AnnotationHub

Hi,

The latest AnnotationnHub data only shows GRanges and TxDB objects from EuPathDB. I am specifically interested to get OrgDB objects from FungiDB. Is there any way to get that? I did that in the past with Bioc version 3.9. However, with the current version cannot see any OrgDB objects from FungiDB or EuPathDB.

image

Create README.md

  • Point users to vignette / AnnotationHub
  • Describe purpose of this package and how it might be used outside of it's use for AH package generation.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.