GithubHelp home page GithubHelp logo

kryndex / open-context-data Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ekansa/open-context-data

0.0 1.0 0.0 503.01 MB

Experiment in using GitHub to host Open Context datasets

Home Page: http://opencontext.org

open-context-data's Introduction

Open Context Data [Deprecated, see below]

Rationale:
Why not? GitHub seems like a good way to share these data in a way that allows for easier transformation, forking, duplication, etc. It will probably be years before someone does something interesting with this dataset, but it's still worth sharing via GitHub.

NOTE ABOUT DEPRECATION:
--------------------------------------------
We're still experimenting and trying to work out the best way to use GitHub for data version controll.
After July 2012, we started experiencing problems updating this GitHub repository. The repository contains
about 300K XML documents and as growing above 3 GB. We had trouble adding more data and making commits. GitHub
returned HTTP 500 range errors when we tried to push these changes. 

After consulting colleagues and online help, we decided to share data via GitHub using multiple smaller repositories.
This may not be an ideal solution, but at least it is functional. Please check my list of repositories to get access
Open Context data in GitHub (https://github.com/ekansa?tab=repositories). Alternativly, you can find links directly
to datasets in GitHub directly from project descriptions in Open Context (http://opencontext.org/projects/).

--------------------------------------------

About these XML Files:
In the "data" directory, different directories are named by the project
ID. Each project directory has an XML file with metadata about the project.
Each project directory will have a "subjects" director (with subject XML
documents belonging to that project). They may also have media and document directories. All the XML documents where checked to be valid/well-formed when exported.They should all be in UTF-8 encoding.

Licence Information: 
Each XML document has a Creative Commons license. Most XML documents a Creative Commons Attribution License, but some have the (dreaded) Non-Commercial restriction. Please respect these licenses for linked media files (mainly pictures). These images are not stored in GitHub but can be discovered through their links.

Source Code:
Open Context source code is available here:
https://github.com/ekansa/open-context-code




open-context-data's People

Contributors

ekansa avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.