
sujaypatil96 / example-data


This project forked from cancerdhc/example-data


This repository is intended to act as a store of example data files from across the NCI Cancer Research Data Commons (CRDC) nodes in a number of formats.

License: MIT License

Jupyter Notebook 12.94% Python 87.06%

example-data's Introduction

Example data for the CCDH project


This repository is intended to act as a store of example data files from across the NCI Cancer Research Data Commons nodes in a number of formats. Each directory represents a single dataset downloaded from a node, and contains a Jupyter Notebook documenting how it was downloaded. CCDH will use this example data to build and test the CRDC-H data model.

GDC Head and Mouth Dataset and conversion to CRDC-H

Our first example is based on a dataset of 560 cases that we downloaded from the GDC Public API. In a Jupyter Notebook, we describe how we can load this data into Python Data Classes and then export it as YAML, JSON-LD or Turtle. This is not yet intended to be a comprehensive transform of all the retrieved GDC cases, but rather to showcase the features made available through the Python Data Classes that are part of the artifacts generated from the CRDC-H model. The JSON-LD and Turtle exports of the data are also available.
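The load-then-export pattern described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `ExampleCase`, `ExampleDiagnosis`, `case_from_gdc` and `to_json` are hypothetical names, not the classes generated from the CRDC-H model, the sample record is made up, and plain JSON from the standard library stands in for the YAML, JSON-LD and Turtle exports shown in the notebook.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List, Optional


@dataclass
class ExampleDiagnosis:
    """Hypothetical stand-in for a generated diagnosis class."""
    id: str
    primary_diagnosis: Optional[str] = None


@dataclass
class ExampleCase:
    """Hypothetical stand-in for a generated case class."""
    id: str
    primary_site: Optional[str] = None
    diagnoses: List[ExampleDiagnosis] = field(default_factory=list)


def case_from_gdc(record: dict) -> ExampleCase:
    """Map one GDC-style case record (a plain dict) onto the dataclasses."""
    return ExampleCase(
        id=record["case_id"],
        primary_site=record.get("primary_site"),
        diagnoses=[
            ExampleDiagnosis(
                id=d["diagnosis_id"],
                primary_diagnosis=d.get("primary_diagnosis"),
            )
            for d in record.get("diagnoses", [])
        ],
    )


def to_json(case: ExampleCase) -> str:
    """Serialize a case, including nested diagnoses, to a JSON string."""
    return json.dumps(asdict(case), indent=2, sort_keys=True)


# Made-up record for illustration only; it is not taken from the dataset.
sample_record = {
    "case_id": "example-case-0001",
    "primary_site": "Example site",
    "diagnoses": [
        {
            "diagnosis_id": "example-diagnosis-0001",
            "primary_diagnosis": "Example diagnosis",
        }
    ],
}

example_case = case_from_gdc(sample_record)
print(to_json(example_case))
```

Keeping the mapping in one small function makes it easy to see which source fields are carried over and which are dropped, which matters when the transform is deliberately partial.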

This example is based on v1.0-pre1 of the CRDC-H model, which is included in this repository. We will continue to update the example as the model develops, but it may be out of sync with the latest version of the model until we have time to update it.

Using Jupyter Notebooks

Many of the processes in this repository are documented in Jupyter Notebook format files, which have an .ipynb extension. These files can be viewed directly in GitHub (see CDA example for subject 09CO022). You can also view them in the Jupyter Notebook viewer (see CDA example for subject 09CO022).

If you would like to execute a notebook, you will need to install Jupyter Notebook (also available through Homebrew on macOS). You can then download the .ipynb file and open it in Jupyter Notebook on your computer by running:

$ jupyter notebook cptac2-subject-09CO022/CDA\ example\ for\ subject\ 09CO022.ipynb
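If you only want to read the code in a notebook without installing Jupyter, note that an .ipynb file is a JSON document whose `cells` list holds the markdown and code cells. The sketch below assumes the nbformat 4 layout; `code_cells` and `load_notebook` are hypothetical helper names, and the tiny in-memory notebook is made up so the example runs without any file on disk.

```python
import json
from pathlib import Path
from typing import List


def code_cells(notebook: dict) -> List[str]:
    """Return the source of every code cell in a parsed nbformat-4 notebook."""
    sources = []
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") == "code":
            src = cell.get("source", "")
            # "source" may be stored as one string or as a list of lines.
            sources.append(src if isinstance(src, str) else "".join(src))
    return sources


def load_notebook(path: str) -> dict:
    """Parse an .ipynb file from disk into a plain dict."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


# Made-up, in-memory notebook used only to demonstrate the helper.
demo_notebook = {
    "nbformat": 4,
    "cells": [
        {"cell_type": "markdown", "source": ["# Title\n"]},
        {"cell_type": "code", "source": ["import json\n", "print('hi')"]},
    ],
}

demo_sources = code_cells(demo_notebook)
print(demo_sources)
```

For a downloaded file, `code_cells(load_notebook("path/to/notebook.ipynb"))` would return the same kind of list, with the path replaced by wherever you saved the notebook.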

example-data's People

Contributors

gaurav, balhoff
