GithubHelp home page GithubHelp logo

dar's Introduction

Dar

Dar stands for (Reproducible) Document Archive and specifies a virtual file format that holds multiple digital documents, complete with images and other assets. A Dar consists of a manifest file (manifest.xml) that describes the contents.

<!DOCTYPE manifest PUBLIC "DarManifest 0.1.0" "http://darformat.org/DarManifest-0.1.0.dtd">
<dar>
  <documents>
    <document id="manuscript" name="Reproducible Document Stack" type="article" path="manuscript.xml" />
    <document id="sheet" name="Sheet 1" type="sheet" path="sheet.xml" />
  </documents>
  <assets>
    <asset id="234o23489237498234798" mime-type="image/png" name="Picture 1" path="234o23489237498234798.png"/>
  </assets>
</dar>

There are two types of contents:

  • Documents: Those are meant to be manipulated by a visual editor, and typically stored as XML/HTML or JSON.
  • Assets: Regular files which can be used from any document. For instance, two documents could embed the same image.

Designed for research and scientific publishing

Dar is being designed for storing reproducible research publications, but the underlying concepts are suitable for any kind of digital publications that can be bundled together with their assets.

Goals

  • Establish standardised research publications
  • Self-contained archive (includes manuscript, images, source code and data)
  • Machine-friendly format to ease development of tools
  • Long-term preservation
  • Stand-alone, offline execution of reproducible elements
  • Language agnostic (e.g. run Python, R, Jupyter, Kernels etc.)
  • Tool agnostic (use Jupyter, RMarkdown or Stencila for authoring)

Specifications

The following specifications define a markup language (XML) for research articles and spreadsheets:

  • Texture Article: An XML format, based on JATS, the de facto standard for archiving and interchange of scientific open-access contents with XML

Editors

The following editors are developed to edit document archives of research projects:

  • Stencila: an office suite for reproducible research
  • Texture: an open source manuscript editor designed for publishers and authors

Examples

These two examples are continuously updated, to reflect the latest versions of the related specifications.

Status

This is an early stage proposal (alpha) that will be continuously advanced. We are using existing standards when possible (such as JATS-XML for representing articles) and seek for consensus in the research community to offer the most flexible and concise tagging guidelines.

License

The JATS Standard is copyrighted by NISO, but all of the non-normative information found in this repository is in the CC BY-SA 4.0.

More info at https://creativecommons.org/licenses/by-sa/4.0/

Credits

Dar is developed by the Substance Consortium, an open community formed by the Public Knowledge Project (PKP), the Collaborative Knowledge Foundation (CoKo), SciELO, Érudit, eLife and Stencila.

dar's People

Contributors

integral avatar obuchtala avatar oliver7654 avatar zuphilip avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.