
cameronraysmith / single-cell-curation


This project forked from chanzuckerberg/single-cell-curation


License: MIT License

Python 58.81% Makefile 0.59% Jupyter Notebook 39.82% Jinja 0.78%


cellxgene curation tools


This repository contains documents and code used by cellxgene's curation team. Issues/suggestions pertaining to datasets and how they interact with cellxgene should be created here.

For information/issues about cellxgene and its portal please refer to:

Installation

The primary curation tool is the cellxgene-schema CLI. It enables curators to perform schema validation for datasets to be hosted on the cellxgene Data Portal.

It requires Python >= 3.8. It is available through pip:

pip install cellxgene-schema

It can also be installed from the source by cloning this repository and running:

make install 

And you can run the tests with:

make unit-test

Usage

The CLI validates an AnnData file (*.h5ad) to ensure that it meets the schema requirements.

Datasets can be validated using the following command line:

cellxgene-schema validate input.h5ad

If the validation succeeds, the command returns a zero exit code; otherwise, it returns a non-zero exit code and prints validation failure messages.
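
The exit-code behavior makes the command easy to call from a script. The following is a minimal sketch, not part of this repository: the helper name run_validation and its command parameter are hypothetical, and it assumes only the invocation shown above. Because this README does not say which output stream carries the failure messages, the sketch collects both.

```python
import subprocess
from typing import List, Optional, Tuple


def run_validation(h5ad_path: str, command: Optional[List[str]] = None) -> Tuple[bool, str]:
    """Run the validator on one file and return (passed, messages).

    Hypothetical helper: `command` defaults to the invocation shown in
    this README and can be overridden, for example in tests.
    """
    cmd = (command or ["cellxgene-schema", "validate"]) + [h5ad_path]
    result = subprocess.run(cmd, capture_output=True, text=True)
    # A zero exit code means the dataset passed validation.
    passed = result.returncode == 0
    # Combine stdout and stderr; which stream the CLI writes to is not assumed.
    messages = (result.stdout + result.stderr).strip()
    return passed, messages
```

Calling run_validation("input.h5ad") returns a boolean that mirrors the exit code, together with any text the command printed, so a pipeline can stop early or log the failure messages.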


The data portal runs the following in the backend:

cellxgene-schema validate --add-labels output.h5ad input.h5ad

This execution validates the dataset as above AND adds the human-readable labels for the ontology and gene IDs as defined in the schema. If the validation is successful, a new AnnData file (output.h5ad) is written to disk with the labels appended.

This option SHOULD NOT be used by data contributors.

Contributing

Please read our contributing guidelines and make sure to adhere to the Contributor Covenant code of conduct.

Reporting Security Issues

Please read our security reporting policy.
