GithubHelp home page GithubHelp logo

adeslatt / bioitworld2019fairapplications Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ncbi-hackathons/fair_apps

0.0 0.0 0.0 37.23 MB

License: MIT License

R 4.63% Dockerfile 5.61% Makefile 1.17% Perl 7.57% Shell 24.92% Python 50.41% HTML 0.15% Nextflow 5.54%

bioitworld2019fairapplications's Introduction

FAIR_apps

FAIR Data in -> FAIR Application run -> FAIR Data out.

Today is a Learning exercise – and many are coming from many different angles – if we achieve even a portion of this ambitious objective we will have done a lot!

The learning objectives for this hackathon I have gathered from folks is as follows:

  1. Learning about FAIR Applications – what does it mean and how does it happen? FAIR Data In, FAIR Applications, FAIR data out and moving forward
  2. Learning about containers
  3. Learning about workflows

The set up:

The concept is we want to connect the mouse models to human disease.

This is an idealized scenario, and for pedagogical principles, we are going through the exercise of the round robin of data from mouse to application to data from human. We may or may not succeed in actually connecting the dots, but that may not matter for this exercise.

The exact details of this question are to be suggestive and not necessarily conclusive as the best way to go about this problem, but the goal is to be as useful and pragmatic as possible as we wrap our heads around what are the most important things about FAIR that we need to concern ourselves with.

Example Phenotype Data set:

Please find attached the data (and I will be depositing these in github as our starting point).

Beginning with phenotype measures collected from a study, Chesler3, details were downloaded as json files from the phenome.jax.org using the RESTFul API. The measures for that study are detailed in the table that was downloaded as well, measures.csv and details for the study are found in studies.csv.

The attached R script reads this data in and merges the Chesler3 data with the measurements details.

The idea:

Beginning with mouse data, data from a specific mouse strain, subject to specific conditions, what are the measurements, the phenotypes that tell us about that mouse in this particular experiment. With this gene set use Geneweaver to find the homology to human gene sets.

The idea is to identify a mouse strain that has contrasting phenotypes for a syndrome of interest.

  • Making, keeping the data in FAIR:

Mouse gene list

  • Making, keeping the application FAIR:

Homology service to make a homologous gene list given mouse gene lists go to human gene lists and vice versa

  • Making, keeping the data out FAIR:

Human gene list

Beginning with human data, a new exome is obtained for an individual with a disease, and the exomiser is used to obtain a prioritized list of pathogenic regulatory variants in Mendialian disease. From these, a gene list could be derived.

Discussion:

Testing the data that are going into an application regarding whether that data adheres to the principles of FAIR, as determined by the FAIRShake application we will be introduced to today – Testing the application for its FAIRness

The path from mouse to human is the path we want to walk.

The path takes us from the Mouse Phenotype Database (MPD =phenome.jax.org) data with the mouse strains data for specific experiments.

From the mouse strain, we see the genes that are differentiated, affected with the strain. With the genes, we can look at the pathway and we get

Please find attached the studies that are available from the Mouse Phenotype Database, studies.csv And we have the measurements that we can get from the

Data Sources:

https://phenome.jax.org/downloads

https://phenome.jax.org/about/api

https://phenome.jax.org/about/termsofuse https://osp.od.nih.gov/scientific-sharing/policies/ https://phenome.jax.org/about/citing

Background:

  • Ontoforce with their tool DisQover
  • Lifebit.ai with their tool deploit
  • Databiology with their integrated platform.

Here are some details about containers:

https://biocontainers.pro/#/

http://biocontainers-edu.biocontainers.pro/en/latest/best_practices.html

http://biocontainers-edu.biocontainers.pro/en/latest/best_practices.html https://www.sylabs.io/2018/10/whitepaper-containerized-genomic-workflows-with-singularity-part-1/ https://www.sylabs.io/2018/11/whitepaper-containerized-genomic-workflows-with-singularity-part-2/ https://www.sylabs.io/2018/11/whitepaper-containerized-genomic-workflows-with-singularity-part-3/

https://github.com/nf-core

All applications and data will be given a FAIRShake

https://fairshake.cloud/

Herding cats

Hearding cats

DisQover.com

http://54.211.41.249/#start

Lifebit.ai

https://deploit.lifebit.ai/public/jobs/5cb5f8d8b91c3200b25075dc

databiology.com

https://www.demo.databiology.net/dbe/userlab/show-project.html?tab=workunits&projectId=7

Final presentation

Slide1 Slide2 Slide3 Slide4 Slide5 Slide6 Slide7 Slide8 Slide9 Slide10 Slide11 Slide12 Slide13 Slide14 Slide15 Slide16 Slide17 Slide18 Slide19 Slide20 Slide21 Slide22 Slide23 Slide24 Slide25 Slide26 Slide27 Slide28

bioitworld2019fairapplications's People

Contributors

allissadillman avatar johnthomaswright avatar nicholsn avatar ozenaslwork avatar pprieto avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.